DEMF-Net: A Dual Encoder Multi-Scale Feature Fusion Network for Polyp Segmentation

20 Pages Posted: 24 Jan 2024

See all articles by Xiaorui Cao

Xiaorui Cao

Changchun University

He Yu

Changchun University

Kang Yan

Changchun University

Rong Cui

Changchun University

Jinming Guo

Changchun University

Xuan Li

Changchun University

Xiaoxue Xing

Changchun University

Tao Huang

James Cook University - College of Science and Engineering

Abstract

Colorectal cancer is a common malignant tumour of the gastrointestinal tract. Studies have shown that colonoscopy can be an effective screening method for detecting colon polyps and removing them to prevent the development of colorectal cancer. In this study, we propose a new approach called the Dual Encoder Multi-Scale Feature Fusion Network (DEMF-Net). This approach uses a dual-scale Swin Transformer and CNN as an encoder to extract semantic features at different scales. To fully fuse the different scale features, we propose a Dual-Branch Attention Fusion Module (DAF) that captures different shapes of target features through the attention mechanism and assigns higher weights to feature channels with high contributions. Additionally, we use an Advanced Feature Fusion Module (AFFM) to establish long-range dependencies and strengthen the target region. We also propose Characterization Supplementary Blocks (CSB) for colorectal polyp images with irregular shapes and unclear boundaries to capture the structure and details of images and enhance model accuracy. We conducted experiments on five widely adopted polyp datasets and showed that our method achieved superior results in terms of both segmentation accuracy and edge details.

Note:
Funding declaration: This research was supported by the Natural Science Foundation of Jilin Provincial Science and Technology Department under (No: 20220101133JC).

Conflict of Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Keywords: polyp segmentation, Feature Fusion, Swin Transformer, Medical Image Segmentation

Suggested Citation

Cao, Xiaorui and Yu, He and Yan, Kang and Cui, Rong and Guo, Jinming and Li, Xuan and Xing, Xiaoxue and Huang, Tao, DEMF-Net: A Dual Encoder Multi-Scale Feature Fusion Network for Polyp Segmentation. Available at SSRN: https://ssrn.com/abstract=4700957 or http://dx.doi.org/10.2139/ssrn.4700957

Xiaorui Cao

Changchun University ( email )

He Yu

Changchun University ( email )

Kang Yan

Changchun University ( email )

Rong Cui

Changchun University ( email )

Jinming Guo

Changchun University ( email )

Xuan Li

Changchun University ( email )

Xiaoxue Xing

Changchun University ( email )

Tao Huang (Contact Author)

James Cook University - College of Science and Engineering ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
47
Abstract Views
147
PlumX Metrics