|
|
|
Shiva Raj Pokhrel, Jonathan Kua, Deol Satish, Sebnem Ozer, Jeff Howe and Anwar Walid
We introduce a novel multipath data transport approach at the transport layer referred to as ?Deep Deterministic Policy Gradient for Multipath Performance-oriented Congestion Control? (DDPG-MPCC), which leverages deep reinforcement learning to enhance co...
ver más
|
|
|
|
|
|
|
Minseok Kong and Jungmin So
There are several automated stock trading programs using reinforcement learning, one of which is an ensemble strategy. The main idea of the ensemble strategy is to train DRL agents and make an ensemble with three different actor?critic algorithms: Advant...
ver más
|
|
|
|
|
|
|
Sheng Yu, Wei Zhu and Yong Wang
Wargames are essential simulators for various war scenarios. However, the increasing pace of warfare has rendered traditional wargame decision-making methods inadequate. To address this challenge, wargame-assisted decision-making methods that leverage ar...
ver más
|
|
|
|
|
|
|
Jianya Yuan, Mengxue Han, Hongjian Wang, Bo Zhong, Wei Gao and Dan Yu
Collision avoidance planning has always been a hot and important issue in the field of unmanned aircraft research. In this article, we describe an online collision avoidance planning algorithm for autonomous underwater vehicle (AUV) autonomous navigation...
ver más
|
|
|
|
|
|
|
Xi Lyu, Yushan Sun, Lifeng Wang, Jiehui Tan and Liwen Zhang
This study aims to solve the problems of sparse reward, single policy, and poor environmental adaptability in the local motion planning task of autonomous underwater vehicles (AUVs). We propose a two-layer deep deterministic policy gradient algorithm-bas...
ver más
|
|
|
|
|
|
|
Jiachi Zhao, Jun Li and Lifang Zeng
Birds and experienced glider pilots frequently use atmospheric updrafts for long-distance flight and energy conservation, with harvested energy from updrafts serving as the foundation. Inspired by their common characteristics in autonomous soaring, a rei...
ver más
|
|
|
|
|
|
|
Wenting Li, Xiuhui Zhang, Yunfeng Dong, Yan Lin and Hongjue Li
Multi-stage launch vehicles are currently the primary tool for humans to reach extraterrestrial space. The technology of recovering and reusing rockets can effectively shorten rocket launch cycles and reduce space launch costs. With the development of de...
ver más
|
|
|
|
|
|
|
Wanli Li, Jiong Li, Ningbo Li, Lei Shao and Mingjie Li
Concerned with the problem of interceptor midcourse guidance trajectory online planning satisfying multiple constraints, an online midcourse guidance trajectory planning method based on deep reinforcement learning (DRL) is proposed. The Markov decision p...
ver más
|
|
|
|
|
|
|
Hao Chen, Chuanqiang Gao, Jifei Wu, Kai Ren and Weiwei Zhang
Transonic buffet is a phenomenon of large self-excited shock oscillations caused by shock wave-boundary layer interaction, which is one of the common flow instability problems in aeronautical engineering. This phenomenon involves unsteady flow, which mak...
ver más
|
|
|
|
|
|
|
Zhibo Zhang, Bowen Zhou, Guangdi Li, Peng Gu, Jing Huang and Boyu Liu
Island microgrids play a crucial role in developing and utilizing offshore renewable energy sources. However, high operation costs and limited operational flexibility are significant challenges. To address these problems, this paper proposes a novel dual...
ver más
|
|
|
|