Skill Matters: Dynamic Skill Learning for Multi-Agent Cooperative Reinforcement Learning

27 Pages Posted: 10 Apr 2024

See all articles by Tong Li

Tong Li

affiliation not provided to SSRN

Chenjia Bai

Shanghai Artificial Intelligence Laboratory

Kang Xu

Fudan University

Chen Chu

Yunnan University of Finance and Economics

Peican Zhu

affiliation not provided to SSRN

Zhen Wang

Northwestern Polytechnical University, China

Abstract

In Multi-Agent Reinforcement Learning (MARL), acquiring suitable behaviors for distinct agents in different scenarios is crucial to enhance the collaborative efficacy and adaptability of multi-agent systems. Existing methods address this challenge through role-based and hierarchical-based paradigms, while they can excessively depend on extrinsic rewards and obtain unsatisfactory results, or lead to homogeneous behaviors with shared agent parameterization. In this paper, we propose a novel Dynamic Skill Learning (DSL) framework to enable more effective adaptation and collaboration in complex tasks. Specifically, DSL learns diverse skills without external rewards and then assigns skills to agents dynamically. DSL has two components: (\romannumeral 1) dynamic skill discovery, which fosters distinguishable and far-reaching skill learning by using Lipschitz constraints, and (\romannumeral 2) dynamic skill assignment, which leverages a policy controller to dynamically allocate the optimal skill combination for each agent based on their local observations. Empirical results demonstrate that DSL leads to better collaborative ability and significantly improves the performance on challenging benchmarks including StarCraft II and Google Research Football.

Keywords: Multi-agent Reinforcement Learning, Diverse Behaviors, Skill Discovery, Skill Assignment

Suggested Citation

Li, Tong and Bai, Chenjia and Xu, Kang and Chu, Chen and Zhu, Peican and Wang, Zhen, Skill Matters: Dynamic Skill Learning for Multi-Agent Cooperative Reinforcement Learning. Available at SSRN: https://ssrn.com/abstract=4790564 or http://dx.doi.org/10.2139/ssrn.4790564

Tong Li (Contact Author)

affiliation not provided to SSRN ( email )

No Address Available

Chenjia Bai

Shanghai Artificial Intelligence Laboratory ( email )

China

Kang Xu

Fudan University ( email )

Beijing West District Baiyun Load 10th
Shanghai, 100045
China

Chen Chu

Yunnan University of Finance and Economics ( email )

Longquan Road 237, Wuhua District
Kunming, 650221
China

Peican Zhu

affiliation not provided to SSRN ( email )

No Address Available

Zhen Wang

Northwestern Polytechnical University, China ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
32
Abstract Views
124
PlumX Metrics