China
South China University of Technology
5G, reinforcement learning, evolutionary game theory, Markov decision process, Q-learning