Toward Real Text Manipulation Detection: New Dataset and New Solution

35 Pages Posted: 16 Nov 2023

See all articles by Dongliang Luo

Dongliang Luo

Huazhong University of Science and Technology

Yuliang Liu

Huazhong University of Science and Technology

Rui Yang

Alibaba Group

Xianjin Liu

Alibaba Group

Jishen Zeng

Alibaba Group

Yu Zhou

Huazhong University of Science and Technology

Xiang Bai

Huazhong University of Science and Technology - School of Artificial Intelligence and Automation

Abstract

With the surge in realistic text tampering, detecting fraudulent text in images has gained prominence for maintaining information security. However, the high costs associated with professional text manipulation and annotation limit the availability of real-world datasets, with most relying on synthetic tampering, which inadequately replicates real-world tampering attributes. To address this issue, we present the Real Text Manipulation (RTM) dataset, encompassing 14,250 text images, which include 5,986 manually and 5,258 automatically tampered images, created using a variety of techniques, alongside 3,006 unaltered text images for evaluating solution stability. Our evaluations indicate that existing methods falter in text forgery detection on the RTM dataset. We propose a robust baseline solution featuring a Consistency-aware Aggregation Hub and a Gated Cross Neighborhood-attention Fusion module for efficient multi-modal information fusion, supplemented by a Tampered-Authentic Contrastive Learning module during training, enriching feature representation distinction. This framework, extendable to other dual-stream architectures, demonstrated notable localization performance improvements of 7.33% and 6.38% on manual and overall manipulations, respectively. Our contributions aim to propel advancements in real-world text tampering detection. Code and dataset will be made available at https://github.com/DrLuo/RTM.

Keywords: Tampered text detection, Document forgery, Image forensics, Image manipulation detection, Multi-modal

Suggested Citation

Luo, Dongliang and Liu, Yuliang and Yang, Rui and Liu, Xianjin and Zeng, Jishen and Zhou, Yu and Bai, Xiang, Toward Real Text Manipulation Detection: New Dataset and New Solution. Available at SSRN: https://ssrn.com/abstract=4634959 or http://dx.doi.org/10.2139/ssrn.4634959

Dongliang Luo

Huazhong University of Science and Technology ( email )

1037 Luoyu Rd
Wuhan, 430074
China

Yuliang Liu (Contact Author)

Huazhong University of Science and Technology ( email )

1037 Luoyu Rd
Wuhan, 430074
China

Rui Yang

Alibaba Group ( email )

Xianjin Liu

Alibaba Group ( email )

Jishen Zeng

Alibaba Group ( email )

Yu Zhou

Huazhong University of Science and Technology ( email )

1037 Luoyu Rd
Wuhan, 430074
China

Xiang Bai

Huazhong University of Science and Technology - School of Artificial Intelligence and Automation ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
103
Abstract Views
185
Rank
572,368
PlumX Metrics