affiliation not provided to SSRN
Machine Learning, Deep Learning, VQA, Natural Language Processing, Reasoning, Computer vision
In-context Learning, Scene Graph Generation, Large Multimodal Models
Multimodal Intent Recognition, Modality Bias, Self-critical Learning
Incomplete Multimodal Learning, Transformer, Prompt Learning, Teacher-student Learning
Incomplete Multimodal Learning, transformer, prompt learning, Knowledge distillation, Teacher-student learning