affiliation not provided to SSRN
Clustering-guided negative sampling, Contrastive Learning, Masked image reconstruction, Multimodal self-supervised learning