Memory-Adaptive Vision-and-Language Navigation

35 Pages Posted: 14 Aug 2023

See all articles by Keji He

Keji He

affiliation not provided to SSRN

Ya Jing

affiliation not provided to SSRN

Huang Yan

Chinese Academy of Sciences (CAS)

Zhihe Lu

National University of Singapore (NUS)

Dong An

affiliation not provided to SSRN

Liang Wang

affiliation not provided to SSRN

Abstract

Vision-and-Language Navigation (VLN) requests an agent to navigate in 3D environments following given instructions, where history is critical for decisionmaking in dynamic navigation process. Particularly, a memory bank storing histories is widely used in existing methods to incorporate with multimodel representations in current scenes for better decision-making. However, by weighting each history with a simple scalar, those methods cannot purely utilize the informative cues that co-exist with detrimental contents in each history, thereby inevitably introducing noises into decision-making. To that end, we propose a novel Memory-Adaptive Model (MAM) that can dynamically restrain the detrimental contents in histories for retaining contents that benefit navigation only. Specifically, two key modules, Visual and Textual Adaptive Modules, are designed to restrain history noises based on scene-related vision and text, respectively. A Reliability Estimator Module is further introduced to refine above adaptation operations. Our experiments on the widely used RxR and R2R datasets show that MAM outperforms its baseline method by 4.0% / 2.5% and 2% / 1% on the validation unseen/test split, respectively, wrt the SR metric.

Keywords: Vision-and-Language Navigation, Memory Bank, History Noises, Memory-Adaptive Model

Suggested Citation

He, Keji and Jing, Ya and Yan, Huang and Lu, Zhihe and An, Dong and Wang, Liang, Memory-Adaptive Vision-and-Language Navigation. Available at SSRN: https://ssrn.com/abstract=4539118 or http://dx.doi.org/10.2139/ssrn.4539118

Keji He (Contact Author)

affiliation not provided to SSRN ( email )

No Address Available

Ya Jing

affiliation not provided to SSRN ( email )

No Address Available

Huang Yan

Chinese Academy of Sciences (CAS) ( email )

Zhihe Lu

National University of Singapore (NUS) ( email )

Singapore
Singapore

Dong An

affiliation not provided to SSRN ( email )

No Address Available

Liang Wang

affiliation not provided to SSRN ( email )

No Address Available

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
215
Abstract Views
351
Rank
307,170
PlumX Metrics