Investigating cis-regulatory elements and gene expression in multiple tomato varieties using interpretable deep learning
Xiang Ji, Cong Wang, He Zhang, Xiuyi Gong, Chunyu Zhu, Yanshen Zhao, Yang Liu, Zhongzhi Han & Hang He
Theoretical and Applied Genetics; December 23 2025; vol. 139; article 16
ABSTRACT
Cis-regulatory elements (CREs) govern gene expression, and the relationship between non-coding regulatory elements and gene expression is inherently complex. To further elucidate how these elements influence gene expression, we refined previous models and developed an interpretable deep learning model, termed L-CRE. By analyzing the flanking regions of genes from four distinct tomato varieties, this model successfully predicted high and low levels of gene expression, achieving a peak accuracy of 86.9%, demonstrating robust performance. To gain deeper insights into the model’s predictive mechanisms, we conducted an interpretability analysis, calculating and evaluating the contribution scores of different genomic regions to the prediction outcomes. Through further exploration of these contribution scores, we identified several critical genomic regions that significantly influence the prediction of gene expression levels. Notably, these regions frequently encompass transcription factor binding sites. Moreover, based on the analysis of contribution scores, we successfully identified several experimentally validated regulatory elements. This study not only enhances our understanding of gene regulatory mechanisms in tomatoes but also provides novel insights and methodologies for future research in crop genetic improvement and functional genomics.
See https://link.springer.com/article/10.1007/s00122-025-05109-1
Views: 135


