University College Dublin
Video Action Recognition, Motion Vectors, CLIP Features, Multi-modal Fusion, Compressed Domain