default author photo

Peng Li

National University of Defense Technology

Changsha Hunan, 410073

China

SCHOLARLY PAPERS

5

DOWNLOADS

148

TOTAL CITATIONS

1

Scholarly Papers (5)

1.

Finding and Using Stable Features: An Effective Way for Opponent Hidden Information Inference in Imperfect Information Games

Number of pages: 32 Posted: 01 Oct 2022
National University of Defense Technology, National University of Defense Technology, National University of Defense Technology, National University of Defense Technology and National University of Defense Technology
Downloads 52 (1,041,696)

Abstract:

Loading...

opponent hidden information inference, state estimation, stable feature, action model, Texas Hold'em

2.

An Enhanced Structural Developmental Neural Network with Information Saturation for Continual Unsupervised Learning

Number of pages: 12 Posted: 05 Oct 2022
National University of Defense Technology, National University of Defense Technology, National University of Defense Technology and National University of Defense Technology
Downloads 36 (1,249,711)

Abstract:

Loading...

structural developmental neural network, information saturation, continual learning, unsupervised learning

3.

Vaos: Enhancing the Stability of Cooperative Multi-Agent Policy Learning

Number of pages: 35 Posted: 11 Jun 2024
National University of Defense Technology, National University of Defense Technology, National University of Defense Technology, National University of Defense Technology and National University of Defense Technology
Downloads 34 (1,263,436)
Citation 1

Abstract:

Loading...

Overestimation reductionMulti-agent Operator switchingValue averagingReinforcement Learning.

4.

Risca: Enhancing the Interpretability of Cooperative Multi-Agent Policy Learning

Number of pages: 24 Posted: 26 May 2025
National University of Defense Technology, National University of Defense Technology, National University of Defense Technology and National University of Defense Technology
Downloads 14 (1,509,872)

Abstract:

Loading...

Multi-agent InterpretabilityRisk-sensitive Reinforcement Learning Cooperative Policy.

5.

Distributional weighted multi-agent policy learning method with adaptive risk attitudes

Number of pages: 39 Posted: 27 Mar 2026
Peng Li, Zhenzhen Hu and Jing Chen
National University of Defense Technology, National University of Defense Technology and National University of Defense Technology
Downloads 12 (1,526,714)

Abstract:

Loading...

Multi-agent \sep Adaptive risk attitudes \sep Distributional \sep Reinforcement learning \sep Risk-sensitive.