DeepTranSeq: An Image-Based Approach for Bacterial Sigma 70 Promoter Sequence Identification Using Deep Transfer Learning

22 Pages Posted: 14 Mar 2024

See all articles by Mustain Billah

Mustain Billah

Jashore University of Science and Technology

Md Easin Arafat

Eötvös Loránd University; Jahangirnagar University

Nazrul Islam

Jahangirnagar University

Al Imtiaz

University of Information Technology and Sciences

Swakkhar Shatabda

United International University

Shamim Kaiser

Jahangirnagar University

Tamás Orosz

Eötvös Loránd University

Abstract

Identification of biological sequences and their functions is one of the core tasks in bioinformatics. The identification and classification of such biological sequences are essential to facilitate the study of different organisms and their continuous evolution. Moreover, Escherichia coli (E. coli) bacteria is currently the best-understood organism, the preferred host for gene cloning and protein production. Sequence analysis of E. coli bacteria is carried out in most molecular biology and biotechnology laboratories. To cope with the stressful conditions of the environment, E. coli bacteria alter their gene expression patterns. The process of choosing which genes will be transcribed is extremely dependent on particular sequences of nucleotides that are referred to as promoters. Among different sigma promoters of E. coli, the sigma 70 promoter is responsible for starting the transcription of nearly all genes in growing cells. However, due to the high time complexity and expenses of conventional laboratory methods, computational methods offer alternate solutions to the problem of identifying such bacterial promoter sequences. In this paper, we propose a new approach called DeepTranSeq for transforming promoter sequences into image representations with the help of CNN to identify bacterial sigma 70 promoters using a deep transfer learning network-based D-LeNet model. The proposed method obtains 99.32% accuracy compared to a standard benchmark dataset, significantly outperforming other state-of-the-art methods.

Keywords: CNN, Deep Transfer Learning, Promoter Identification, Sigma 70 Promoter, Image-based Approach

Suggested Citation

Billah, Mustain and Arafat, Md Easin and Islam, Nazrul and Imtiaz, Al and Shatabda, Swakkhar and Kaiser, Shamim and Orosz, Tamás, DeepTranSeq: An Image-Based Approach for Bacterial Sigma 70 Promoter Sequence Identification Using Deep Transfer Learning. Available at SSRN: https://ssrn.com/abstract=4750303 or http://dx.doi.org/10.2139/ssrn.4750303

Mustain Billah

Jashore University of Science and Technology ( email )

Bangladesh

Md Easin Arafat

Eötvös Loránd University ( email )

Jahangirnagar University ( email )

Savar
Social Science Faculty, Savar
Dhaka, 1342
Bangladesh

Nazrul Islam (Contact Author)

Jahangirnagar University ( email )

Savar
Social Science Faculty, Savar
Dhaka, 1342
Bangladesh

Al Imtiaz

University of Information Technology and Sciences ( email )

Holding 190, Road 5, Block J, Baridhara, Maddha Na
Dhaka, 1212
Bangladesh

Swakkhar Shatabda

United International University ( email )

Madani Avenue, Dhaka, Bangladesh
Dhaka, Dhaka 1216
Bangladesh

Shamim Kaiser

Jahangirnagar University ( email )

Savar
Social Science Faculty, Savar
Dhaka, 1342
Bangladesh

Tamás Orosz

Eötvös Loránd University ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
65
Abstract Views
247
Rank
752,983
PlumX Metrics