Xin Wang

Tsinghua University

Beijing, 100084

China

Scholarly papers: 1

Downloads: 116

Total citations: 0

Scholarly Papers (1)

1.

Rethinking Exploration-Exploitation Trade-Off in Reinforcement Learning Via Cognitive Consistency

Number of pages: 44 Posted: 11 Nov 2024
Da Wang, Wei Wei, Lin Li, Xin Wang and Jiye Liang
Shanxi University, Shanxi University, Shanxi University, Tsinghua University and Shanxi University
Downloads 116 (613,149)

Keywords: Off-policy reinforcement learning, Sample efficiency, Exploration-exploitation trade-off, Cognitive consistency