default author photo

Yubing Shen

Southwest University of Finance and Economics

China

SCHOLARLY PAPERS

1

DOWNLOADS

89

TOTAL CITATIONS

0

Scholarly Papers (1)

1.

V-Sparse: Temporal-Spatial Visual Compression and Coarse-to-Fine Alignment for Text-Video Retrieval

Number of pages: 48 Posted: 10 Feb 2025
affiliation not provided to SSRN, affiliation not provided to SSRN, Southwestern University of Finance and Economics (SWUFE), Southwestern University of Finance and Economics (SWUFE), Southwestern University of Finance and Economics (SWUFE), Southwest University of Finance and Economics and University of Alberta
Downloads 89 (747,755)

Abstract:

Loading...

Cross-modal retrieval, text-video retrieval, video semantic compression, granularity alignment