Lin Li

Shanxi University

No.92 Wucheng Rd

Taiyuan, 030006

China

Scholarly papers: 1

Downloads: 116

Total citations: 0

Scholarly Papers (1)

1. Rethinking Exploration-Exploitation Trade-Off in Reinforcement Learning Via Cognitive Consistency

Number of pages: 44 Posted: 11 Nov 2024
Shanxi University, Shanxi University, Shanxi University, Tsinghua University and Shanxi University
Downloads 116 (613,149)

Keywords: Off-policy reinforcement learning, Sample efficiency, Exploration-exploitation trade-off, Cognitive consistency