Rehearmixup: Improving Rehearsal-Based Continual Learning

22 Pages Posted: 24 Sep 2024

See all articles by Yan Zhang

Yan Zhang

Shandong University - School of Software

Kaiyun Qi

affiliation not provided to SSRN

Dong Wu

affiliation not provided to SSRN

Guoqiang Wu

Shandong University

Yilong Yin

Shandong University

Multiple version iconThere are 3 versions of this paper

Abstract

Neural networks often suffer from catastrophic forgetting when learning new tasks, leading to the loss of previously acquired knowledge. To address this issue, rehearsal-based methods have emerged, which involve storing a subset of data from previous tasks and accessing it during the learning of new tasks. Current rehearsal-based methods focus on selecting representative samples to store in memory. However, there is a considerable lack of exploration of how to exploit the data at hand and consider the correlation between tasks or between past and new knowledge to improve performance. Therefore, we propose a simple yet effective approach named RehearMixup that adapts the Mixup technique into rehearsal-based methods, which synthesizes new samples for learning by interpolating data from past or current tasks. Specifically, we introduce three strategies, namely Cross-Mixup, Intra-Memory-Mixup, and Intra-Current-Mixup, based on the inherent characteristics of rehearsal-based methods - involving the memory and new tasks. Through empirical evaluations under various benchmark scenarios, we compare our approach against different rehearsal-based baselines. The results demonstrate that ours, particularly Intra-Current-Mixup, improve accuracy, backward transfer, forward transfer, and enhance the model's robustness.

Keywords: Continual learning, Rehearsal-based method, Mixup technique, RehearMixup, Memory and new tasks

Suggested Citation

Zhang, Yan and Qi, Kaiyun and Wu, Dong and Wu, Guoqiang and Yin, Yilong, Rehearmixup: Improving Rehearsal-Based Continual Learning. Available at SSRN: https://ssrn.com/abstract=4965584 or http://dx.doi.org/10.2139/ssrn.4965584

Yan Zhang (Contact Author)

Shandong University - School of Software ( email )

Kaiyun Qi

affiliation not provided to SSRN ( email )

No Address Available

Dong Wu

affiliation not provided to SSRN ( email )

No Address Available

Guoqiang Wu

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Yilong Yin

Shandong University ( email )

27 Shanda Nanlu
South Rd.
Jinan, SD 250100
China

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
19
Abstract Views
105
PlumX Metrics