China
Southwest University of Finance and Economics
Cross-modal retrieval, text-video retrieval, video semantic compression, granularity alignment