Asynchronous multi-agent deep reinforcement learning under partial observability

The International Journal of Robotics Research Study, Ahead of Publish.
The advanced multi-agent support discovering (MARL) techniques offer encouraging options to a range of facility troubles. Yet, these techniques all think that representatives execute primitive activities in an integrated way, making them not practical for long-horizon real-world multi-robot jobs that naturally need robotics to asynchronously factor concerning activity choice at differing time periods. To fix this trouble, we initially suggest a team of value-based participating MARL techniques for asynchronous implementation making use of temporally prolonged macro-actions. Right here, representatives execute asynchronous discovering and decision-making with macro-action-value features in 3 standards: decentralized discovering and control, central discovering and control, and systematized training for decentralized implementation (CTDE). Structure on the above job, we develop a collection of macro-action-based plan slope formulas under the 3 training standards, where representatives straight enhance their parameterized plans in an asynchronous way. We review our techniques both in simulation and on genuine robotics over a range of sensible domain names. Empirical outcomes show the performance of our formulas for discovering top notch and asynchronous options with macro-actions in big multi-agent troubles that were formerly unresolvable by means of primitive-action-based techniques. The suggested techniques stand for the very first basic MARL techniques for temporally prolonged activities and function as the structure for future techniques in the location.

发布者:Yuchen Xiao,转转请注明出处:https://robotalks.cn/asynchronous-multi-agent-deep-reinforcement-learning-under-partial-observability/

(0)
上一篇 7 2 月, 2025 6:17 下午
下一篇 7 2 月, 2025 6:18 下午

相关推荐

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注

联系我们

400-800-8888

在线咨询: QQ交谈

邮件:admin@example.com

工作时间:周一至周五,9:30-18:30,节假日休息

关注微信
社群的价值在于通过分享与互动,让想法产生更多想法,创新激发更多创新。