Investigating cis-regulatory elements and gene expression in multiple tomato varieties using interpretable deep learning

Update date: 29 December 2025
Share

Xiang JiCong WangHe ZhangXiuyi GongChunyu ZhuYanshen ZhaoYang LiuZhongzhi Han & Hang He

Theoretical and Applied Genetics; December 23 2025; vol. 139; article 16

ABSTRACT

Cis-regulatory elements (CREs) govern gene expression, and the relationship between non-coding regulatory elements and gene expression is inherently complex. To further elucidate how these elements influence gene expression, we refined previous models and developed an interpretable deep learning model, termed L-CRE. By analyzing the flanking regions of genes from four distinct tomato varieties, this model successfully predicted high and low levels of gene expression, achieving a peak accuracy of 86.9%, demonstrating robust performance. To gain deeper insights into the model’s predictive mechanisms, we conducted an interpretability analysis, calculating and evaluating the contribution scores of different genomic regions to the prediction outcomes. Through further exploration of these contribution scores, we identified several critical genomic regions that significantly influence the prediction of gene expression levels. Notably, these regions frequently encompass transcription factor binding sites. Moreover, based on the analysis of contribution scores, we successfully identified several experimentally validated regulatory elements. This study not only enhances our understanding of gene regulatory mechanisms in tomatoes but also provides novel insights and methodologies for future research in crop genetic improvement and functional genomics.

See https://link.springer.com/article/10.1007/s00122-025-05109-1

Views: 135

Institute of Agricultural Sciences For Southern Vietnam
Address: 121 Nguyen Binh Khiem, Tan Đinh Ward, HCM City, Vietnam
Tel: +84.8. 38291746 –  38228371
Website : http://iasvn.org - Email: iasvn@vnn.vn