Oxford Road
Manchester, M13 9PL
United Kingdom
The University of Manchester
referring image segmentation, Segment Anything Model, Vision--Language Interaction, Multi-scale fusion, multimodality