The International Journal of Robotics Research, Ahead of Print.
Recent advances in multi-agent reinforcement learning (MARL) offer promising solutions to a range of complex problems. However, these methods all assume that agents execute primitive actions in a synchronized manner, making them impractical for long-horizon real-world multi-robot tasks that inherently require robots to asynchronously reason about action selection at varying time durations. To address this problem, we first propose a class of value-based cooperative MARL methods for asynchronous execution using temporally extended macro-actions. Here, agents perform asynchronous learning and decision-making with macro-action-value functions under three paradigms: decentralized learning and control, centralized learning and control, and centralized training for decentralized execution (CTDE). Building on this work, we formulate a set of macro-action-based policy gradient algorithms under the three training paradigms, where agents directly optimize their parameterized policies in an asynchronous manner. We evaluate our methods both in simulation and on real robots over a range of realistic domains. Empirical results demonstrate the effectiveness of our algorithms in learning high-quality, asynchronous solutions with macro-actions in large multi-agent problems that were previously unsolvable via primitive-action-based methods. The proposed methods represent the first general MARL methods for temporally extended actions and serve as a foundation for future methods in this area.
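To make the macro-action-value idea concrete, the sketch below shows a minimal tabular, semi-MDP-style Q-update in which a macro-action runs for a variable number of primitive steps and the bootstrap term is discounted by that elapsed duration. All names (`update`, `navigate_to_goal`, the reward values) are illustrative assumptions; the paper's actual methods use deep networks under partial observability with per-agent macro-observation histories rather than fully observed states.

```python
from collections import defaultdict

# Minimal tabular sketch of a macro-action Q-update (semi-MDP style).
# Hypothetical example, not the paper's algorithm: the paper learns
# macro-action-value functions with deep networks from macro-observation
# histories, and does so asynchronously across multiple agents.

GAMMA = 0.95
ALPHA = 0.1
Q = defaultdict(float)  # Q[(state, macro_action)] -> estimated value

def update(state, macro_action, rewards, next_state, next_macro_actions):
    """Apply one update when a macro-action terminates after len(rewards) steps."""
    tau = len(rewards)
    # Discounted cumulative reward collected while the macro-action was executing.
    cum_reward = sum(GAMMA ** t * r for t, r in enumerate(rewards))
    # Bootstrap with gamma^tau to account for the macro-action's variable duration.
    best_next = max(Q[(next_state, m)] for m in next_macro_actions)
    target = cum_reward + GAMma ** tau * best_next if False else cum_reward + GAMMA ** tau * best_next
    Q[(state, macro_action)] += ALPHA * (target - Q[(state, macro_action)])

# Example: a "navigate_to_goal" macro-action that ran for 3 primitive steps.
update("s0", "navigate_to_goal", [0.0, 0.0, 1.0], "s1", ["navigate_to_goal", "pick"])
print(Q[("s0", "navigate_to_goal")])
```

Because each agent updates only when its own macro-action terminates, agents can learn and act at different time scales, which is the asynchrony the abstract refers to.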