Changsha, Hunan 410073
China
National University of Defense Technology
opponent hidden information inference, state estimation, stable feature, action model, Texas Hold'em
Overestimation reduction, Multi-agent Operator switching, Value averaging, Reinforcement Learning.