ARTICLE
TITLE

Feasibility Analysis and Application of Reinforcement Learning Algorithm Based on Dynamic Parameter Adjustment

Menglin Li, Xueqiang Gu, Chengyi Zeng and Yuan Feng

Abstract

Reinforcement learning, a branch of machine learning, is increasingly applied in the control field. In practice, however, the hyperparameters of deep reinforcement learning networks are still set by the trial-and-error approach inherited from traditional machine learning (supervised and unsupervised learning). This approach ignores part of the information that agents generate while exploring the environment, information that is contained in the updates of the reinforcement learning value function, and this omission affects both the convergence and the cumulative return of reinforcement learning. The reinforcement learning algorithm based on dynamic parameter adjustment is a new method for setting the learning rate of deep reinforcement learning. Building on the traditional way of setting reinforcement learning parameters, the method analyzes the advantages of different learning rates at different stages of learning and dynamically adjusts the learning rate according to the temporal-difference (TD) error, so that each stage benefits from a suitable learning rate and the algorithm becomes more reasonable in practical application. In addition, by combining the Robbins-Monro approximation algorithm with a deep reinforcement learning algorithm, it is proved that the dynamically adjusted learning rate can theoretically meet the convergence requirements of the intelligent control algorithm. In the experiments, the method is evaluated on a continuous control scenario in the standard reinforcement learning benchmark environment "Car-on-the-Hill", and the results verify that the new method achieves better results than traditional reinforcement learning in practical application.
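For reference, convergence arguments of the Robbins-Monro type usually rest on the classical step-size conditions below. The abstract does not say which exact form the authors use, so the notation (a learning rate \alpha_t at update step t) is an assumption made here for illustration.

```latex
\[
  \alpha_t > 0, \qquad
  \sum_{t=0}^{\infty} \alpha_t = \infty, \qquad
  \sum_{t=0}^{\infty} \alpha_t^{2} < \infty .
\]
```

A schedule such as \alpha_t = c / (1 + t)^p with c > 0 and 1/2 < p \le 1 satisfies both sums, which is why decaying learning rates of this shape are a common choice.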
Based on the model characteristics of deep reinforcement learning, a more suitable method for setting the learning rate of the deep reinforcement learning network is proposed. The feasibility of the method has been demonstrated both in theory and in application. The method of setting the learning rate parameter is therefore worthy of further development and research.
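The idea of tying the learning rate to the TD error can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' algorithm: the abstract does not give the adjustment rule, so the function `dynamic_learning_rate`, its parameters (`lr_min`, `lr_max`, `sensitivity`, `decay_power`), and the tanh-based scaling are all hypothetical. Tabular Q-learning on a small chain world stands in for the deep network and the Car-on-the-Hill environment.

```python
import math
import random


def dynamic_learning_rate(td_error, visits, lr_min=0.05, lr_max=1.0,
                          sensitivity=1.0, decay_power=0.6):
    """Hypothetical rule: larger |TD error| gives a larger step, decayed over visits."""
    # Map |td_error| into a bounded gain in [lr_min, lr_max).
    gain = lr_min + (lr_max - lr_min) * math.tanh(sensitivity * abs(td_error))
    # Because the gain is bounded away from zero and above, a polynomial decay
    # with 0.5 < decay_power <= 1 keeps the schedule compatible with the
    # Robbins-Monro step-size conditions.
    return gain / (1 + visits) ** decay_power


def q_learning_chain(n_states=5, episodes=200, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning on a chain: action 0 moves left, action 1 moves right."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    visits = [[0, 0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy action selection; ties are broken towards "right".
            if rng.random() < epsilon:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(state - 1, 0) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # The goal state is terminal, so it contributes no bootstrap value.
            bootstrap = 0.0 if next_state == goal else gamma * max(q[next_state])
            td_error = reward + bootstrap - q[state][action]
            # The step size depends on the current TD error and the visit count.
            alpha = dynamic_learning_rate(td_error, visits[state][action])
            q[state][action] += alpha * td_error
            visits[state][action] += 1
            state = next_state
    return q


if __name__ == "__main__":
    for s, (left, right) in enumerate(q_learning_chain()):
        print(f"state {s}: Q(left)={left:.3f}  Q(right)={right:.3f}")
```

In this sketch a large TD error early in training produces a large step, while small errors and repeated visits shrink the step, which is one simple way to get different learning rates at different stages of learning.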
