Tao He

affiliation not provided to SSRN

SCHOLARLY PAPERS

5

DOWNLOADS

367

TOTAL CITATIONS

5

Scholarly Papers (5)

1.

Vqa and Visual Reasoning: An Overview of Approaches, Datasets, and Future Direction

Number of pages: 27 Posted: 22 May 2023
affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, Southwestern University of Finance and Economics (SWUFE), University of Brunei Darussalam and affiliation not provided to SSRN
Downloads 124 (498,178)
Citation 5

Abstract:

Loading...

Machine Learning, Deep Learning, VQA, Natural Language Processing, Reasoning, Computer vision

2.

Lifelong Scene Graph Generation

Number of pages: 34 Posted: 14 Jan 2025
affiliation not provided to SSRN, affiliation not provided to SSRN, Monash University, University of Electronic Science and Technology of China (UESTC), affiliation not provided to SSRN, Monash University and Carleton University
Downloads 72 (714,031)

Abstract:

Loading...

In-context Learning, Scene Graph Generation, Large Multimodal Models

3.

Unbiased Multimodal Intent Recognition with Auxiliary Rationale Generation

Number of pages: 31 Posted: 21 Jan 2025
affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN and affiliation not provided to SSRN
Downloads 69 (730,893)

Abstract:

Loading...

Multimodal Intent Recognition, Modality Bias, Self-critical Learning

4.

Progdiff: Progressive Incomplete Multimodal Learning with Diffusion Models

Number of pages: 31 Posted: 07 Feb 2025
affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN and affiliation not provided to SSRN
Downloads 63 (774,136)

Abstract:

Loading...

Incomplete Multimodal Learning, Transformer, Prompt Learning, Teacher-student Learning

5.

Towards Incomplete Multimodal Learning with Prompt-Based Hierarchical Knowledge Distillation

Number of pages: 30 Posted: 07 Mar 2025
affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN, affiliation not provided to SSRN and Fudan University
Downloads 39 (970,336)

Abstract:

Loading...

Incomplete Multimodal Learning, transformer, prompt learning, Knowledge distillation, Teacher-student learning