R2-Trans: Fine-Grained Visual Categorization with Redundancy Reduction

23 Pages Posted: 17 Oct 2023

See all articles by shuo ye

shuo ye

Huazhong University of Science and Technology

Shujian Yu

UiT The Arctic University of Norway

yu wang

Huazhong University of Science and Technology

Xinge You

Huazhong University of Science and Technology

Abstract

Fine-grained visual categorization (FGVC) aims to discriminate similar subcategories, whose main challenge is the large intraclass diversities and subtle interclass differences. Existing FGVC methods usually select discriminant regions found by a trained model, which is prone to neglect other potential discriminant information. On the other hand, the massive interactions between the sequence of image patches in ViT make the resulting class token contain lots of redundant information, which may also impact FGVC performance. In this paper, we present a novel approach for FGVC, which can simultaneously make use of partial yet sufficient discriminative information in environmental cues and also compress the redundant information in class-token with respect to the target. Specifically, our model calculates the ratio of high-weight regions in a batch, adaptively adjusts the masking threshold, and achieves moderate extraction of background information in the input space. Moreover, we also use the Information Bottleneck (IB) approach to guide our network to learn a minimum sufficient representations in the feature space. Experimental results on three widely-used benchmark datasets verify that our approach can achieve better performance than other stateof-the-art approaches and baseline models. The code of our model is available at: https://github.com/SYe-hub/R-2-Trans.

Keywords: fine-grained visual categorization, batch-based dynamic mask, information bottleneck

Suggested Citation

ye, shuo and Yu, Shujian and wang, yu and You, Xinge, R2-Trans: Fine-Grained Visual Categorization with Redundancy Reduction. Available at SSRN: https://ssrn.com/abstract=4604467 or http://dx.doi.org/10.2139/ssrn.4604467

Shuo Ye (Contact Author)

Huazhong University of Science and Technology ( email )

1037 Luoyu Rd
Wuhan, 430074
China

Shujian Yu

UiT The Arctic University of Norway ( email )

Norway

Yu Wang

Huazhong University of Science and Technology ( email )

1037 Luoyu Rd
Wuhan, 430074
China

Xinge You

Huazhong University of Science and Technology ( email )

1037 Luoyu Rd
Wuhan, 430074
China

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
42
Abstract Views
183
PlumX Metrics