Chongqing University of Posts and Telecommunications
Nan’an District
Chongqing, 400065
China
Keywords: Aerial video classification, video transformer, local semantic enhancement, video class attention, video feature representation