2 Lushan South Rd
Changsha, CA 410082
China
Hunan University
Spatial-temporal Video Grounding, Cross-modal Learning, Transformer, Contrastive learning