Optimized Cross Alignment Based Multimodal Radiology Report Summarization

80 Pages Posted: 8 Apr 2025

See all articles by Somenath Nag Choudhury

Somenath Nag Choudhury

affiliation not provided to SSRN

Asif Ekbal

Indian Institute of Technology (IIT), Patna

Amit Kumar Verma

affiliation not provided to SSRN

Abstract

Multimodal Radiology Report Summarization (MRRS) aims to summarize the radiology report text with the assistance of paired images. This inclusion of vision has already proven its importance and improvements over text-only summarization approaches. We realized that local and global features of the modalities and their alignments play a pivotal role in forming a better couple and thus improving the summarization. Given a pair of images (PA+LAT) and the report text, we have used radiology-specific knowledge, sourced and verified by radiologists, as the marginal representative text and the concatenated image as the marginal representative image. Then we optimized the image-text local-global joint representation and injected them into a transformer encoder-decoder model to improve the summary text generation. Our intermediate fusion approach reflects a minimum improvement over the SOTA text-only and multi-modal approaches for the Open-I dataset, augmented to 6,569 samples, not only in METEOR, SPICE, and sacreBLEU by 3%, 5.44%, and 4.5% respectively in terms of generation, but also in ROUGE-(F1), ROUGE-2(F1), ROUGE-L(F1) and BERTScore of 8%, 2%, 1.15% and 0.04% for summarization. Our model also reflects good qualitative evidence as compared to the baselines.

Keywords: Radiology Report Summarization, Biomedical Text Summarization, Abstractive text summarization, Medical Information Fusion, Cross-modal Alignment, Fusion Optimization

Suggested Citation

Nag Choudhury, Somenath and Ekbal, Asif and Verma, Amit Kumar, Optimized Cross Alignment Based Multimodal Radiology Report Summarization. Available at SSRN: https://ssrn.com/abstract=5210097 or http://dx.doi.org/10.2139/ssrn.5210097

Somenath Nag Choudhury (Contact Author)

affiliation not provided to SSRN ( email )

Asif Ekbal

Indian Institute of Technology (IIT), Patna ( email )

Bihta

Amit Kumar Verma

affiliation not provided to SSRN ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
12
Abstract Views
120
PlumX Metrics