Chengdu
China
Southwestern University of Finance and Economics (SWUFE)
Multimodal LLM, Fish detection, Fish classification, ChatGPT
Cross-modal retrieval, text-video retrieval, video semantic compression, granularity alignment