affiliation not provided to SSRN
Keywords: Remote Sensing Image-Text Retrieval, Vision-and-Language Pre-training, Parameter-Efficient Fine-Tuning, Cross-Modal Asymmetric Adapter, Multi-Task Learning