Machine Learning-Guided Engineering of T7 Rna Polymerase and Mrna Capping Enzymes for Enhanced Gene Expression in Eukaryotic Systems

38 Pages Posted: 12 Apr 2025

See all articles by Seung Gyun Woo

Seung Gyun Woo

affiliation not provided to SSRN

Danny J. Diaz

affiliation not provided to SSRN

Wantae Kim

affiliation not provided to SSRN

Mason Galliver

affiliation not provided to SSRN

Andrew D. Ellington

University of Texas at Austin - Department of Molecular Biosciences

Abstract

The integration of synthetic biology tools into eukaryotic systems offers both significant opportunities and challenges, particularly in optimizing transcriptional and post-transcriptional processes. T7 RNA polymerase (T7 RNAP) and mRNA capping enzymes (CEs) have been fused to enable eukaryotic mRNA production within a single construct. However, the activity of the fusion construct between the African Swine Fever Virus capping enzyme (ASFVCE) and T7 RNAP was relatively low. To address this, we fused the Brazilian Marseillevirus capping enzyme (BMCE) to T7 RNAP and developed a machine learning (ML) pipeline to engineer greatly improved fusion variants. This approach enabled the additive integration of nine predicted single substitutions that improved gene expression in yeast, thereby generating fusion polymerases that exhibited over 10-fold improvements in gene expression efficiency relative to the original fusion enzyme. Not only were ML substitutions additive for gene expression, they could be further combined with variants identified via directed evolution for even higher activities. By allowing ML predictions to guide validations we could rapidly explore the sequence landscape for enzyme optimization, achieving superior results even when compared to directed evolution. The improved enzymes have potential impact for numerous synthetic biology applications, including metabolic engineering, mRNA therapeutics, and cell free systems.

Keywords: T7 RNA Polymerase (T7 RNAP), mRNA Capping Enzyme (CE), Machine Learning (ML), protein engineering, Yeast Expression System, Synthetic Biology

Suggested Citation

Woo, Seung Gyun and Diaz, Danny J. and Kim, Wantae and Galliver, Mason and Ellington, Andrew D., Machine Learning-Guided Engineering of T7 Rna Polymerase and Mrna Capping Enzymes for Enhanced Gene Expression in Eukaryotic Systems. Available at SSRN: https://ssrn.com/abstract=5196047 or http://dx.doi.org/10.2139/ssrn.5196047

Seung Gyun Woo

affiliation not provided to SSRN ( email )

No Address Available

Danny J. Diaz

affiliation not provided to SSRN ( email )

No Address Available

Wantae Kim

affiliation not provided to SSRN ( email )

No Address Available

Mason Galliver

affiliation not provided to SSRN ( email )

No Address Available

Andrew D. Ellington (Contact Author)

University of Texas at Austin - Department of Molecular Biosciences ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
15
Abstract Views
104
PlumX Metrics