Adaptive Multi-Model Fusion Learning for Sparse-Reward Reinforcement Learning

13 Pages Posted: 1 Jun 2024

See all articles by Giseung Park

Giseung Park

affiliation not provided to SSRN

Whiyoung Jung

affiliation not provided to SSRN

Seungyul Han

affiliation not provided to SSRN

Sungho Choi

affiliation not provided to SSRN

Youngchul Sung

affiliation not provided to SSRN

Abstract

In this paper, we consider intrinsic reward generation for sparse-reward reinforcement learning, wherein the agent receives sparse extrinsic rewards from the environment. Conventionally, intrinsic reward generation relies on model prediction errors, where the agent's learning model estimates target values or distributions. The intrinsic reward is crafted as the disparity between the model's prediction output and the actual target, leveraging the tendency that less-visited state-action pairs yield larger prediction errors. We extend this approach to accommodate multiple prediction models, proposing an adaptive fusion technique tailored to the multi-model setting. To streamline the search for the optimal fusion rule, we impose axiomatic conditions that any viable fusion method should meet, and justify these conditions mathematically. Subsequently, we introduce adaptive fusion, which dynamically learns the optimal prediction-error fusion strategy throughout the learning process, thereby enhancing overall learning performance. Our numerical experiments demonstrate the superiority of the proposed intrinsic reward generation method over existing approaches, with performance gains observed across various tasks.

Keywords: Deep reinforcement learning, Neural network, Spare-reward reinforcement learning, Intrinsic reward, Multiple prediction models, Adaptive fusion

Suggested Citation

Park, Giseung and Jung, Whiyoung and Han, Seungyul and Choi, Sungho and Sung, Youngchul, Adaptive Multi-Model Fusion Learning for Sparse-Reward Reinforcement Learning. Available at SSRN: https://ssrn.com/abstract=4850588 or http://dx.doi.org/10.2139/ssrn.4850588

Giseung Park

affiliation not provided to SSRN ( email )

No Address Available

Whiyoung Jung

affiliation not provided to SSRN ( email )

No Address Available

Seungyul Han

affiliation not provided to SSRN ( email )

No Address Available

Sungho Choi

affiliation not provided to SSRN ( email )

No Address Available

Youngchul Sung (Contact Author)

affiliation not provided to SSRN ( email )

No Address Available

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
10
Abstract Views
47
PlumX Metrics