affiliation not provided to SSRN
Video Entity Linking, Multimodal Entity Linking, Multimodal fusion, Knowledge graph, Transformer