University College Dublin
Crowd Counting, Wireless Signal, Image, Transformer, Linear Attention
Video Action Recognition, Motion Vectors, CLIP Features, Multi-modal Fusion, Compressed Domain