Inicio  /  Algorithms  /  Vol: 15 Par: 3 (2022)  /  Artículo
ARTÍCULO
TITULO

Reinforcement Learning for Mean-Field Game

Mridul Agarwal    
Vaneet Aggarwal    
Arnob Ghosh and Nilay Tiwari    

Resumen

Stochastic games provide a framework for interactions among multiple agents and enable a myriad of applications. In these games, agents decide on actions simultaneously. After taking an action, the state of every agent updates to the next state, and each agent receives a reward. However, finding an equilibrium (if exists) in this game is often difficult when the number of agents becomes large. This paper focuses on finding a mean-field equilibrium (MFE) in an action-coupled stochastic game setting in an episodic framework. It is assumed that an agent can approximate the impact of the other agents? by the empirical distribution of the mean of the actions. All agents know the action distribution and employ lower-myopic best response dynamics to choose the optimal oblivious strategy. This paper proposes a posterior sampling-based approach for reinforcement learning in the mean-field game, where each agent samples a transition probability from the previous transitions. We show that the policy and action distributions converge to the optimal oblivious strategy and the limiting distribution, respectively, which constitute an MFE.

 Artículos similares

       
 
Jin Wang, Peng Zhao, Zhe Zhang, Ting Yue, Hailiang Liu and Lixin Wang    
The upset state is an unexpected flight state, which is characterized by an unintentional deviation from normal operating parameters. It is difficult for the pilot to recover the aircraft from the upset state accurately and quickly. In this paper, an ups... ver más
Revista: Aerospace

 
Bocheng Zhao, Mingying Huo, Ze Yu, Naiming Qi and Jianfeng Wang    
In this study, we propose an aerial rendezvous method to facilitate the recovery of unmanned aerial vehicles (UAVs) using carrier aircrafts, which is an important capability for the future use of UAVs. The main contribution of this study is the developme... ver más
Revista: Aerospace

 
Yuqi Yuan and Di Zhou    
Revista: Aerospace

 
Siyao Lu, Rui Xu, Zhaoyu Li, Bang Wang and Zhijun Zhao    
The International Lunar Research Station, to be established around 2030, will equip lunar rovers with robotic arms as constructors. Construction requires lunar soil and lunar rovers, for which rovers must go toward different waypoints without encounterin... ver más
Revista: Aerospace

 
Panagiotis D. Paraschos, Georgios K. Koulinas and Dimitrios E. Koulouriotis    
The manufacturing industry often faces challenges related to customer satisfaction, system degradation, product sustainability, inventory, and operation management. If not addressed, these challenges can be substantially harmful and costly for the sustai... ver más
Revista: Algorithms