[期刊论文]


Integration of eQTL and machine learning to dissect causal genes with pleiotropic effects in genetic regulation networks of seed cotton yield

作   者:
Ting Zhao;Hongyu Wu;Xutong Wang;Yongyan Zhao;Luyao Wang;Jiaying Pan;Huan Mei;Jin Han;Siyuan Wang;Kening Lu;Menglin Li;Mengtao Gao;Zeyi Cao;Hailin Zhang;Ke Wan;Jie Li;Lei Fang;Tianzhen Zhang;Xueying Guan;

出版年:2023

页    码:113111 - 113111
出版社:Elsevier


摘   要:

The dissection of a gene regulatory network (GRN) that complements the genome-wide association study (GWAS) locus and the crosstalk underlying multiple agronomical traits remains a major challenge. In this study, we generate 558 transcriptional profiles of lint-bearing ovules at one day post-anthesis from a selective core cotton germplasm, from which 12,207 expression quantitative trait loci (eQTLs) are identified. Sixty-six known phenotypic GWAS loci are colocalized with 1,090 eQTLs, forming 38 functional GRNs associated predominantly with seed yield. Of the eGenes, 34 exhibit pleiotropic effects. Combining the eQTLs within the seed yield GRNs significantly increases the portion of narrow-sense heritability. The extreme gradient boosting (XGBoost) machine learning approach is applied to predict seed cotton yield phenotypes on the basis of gene expression. Top-ranking eGenes (NF-YB3, FLA2, and GRDP1) derived with pleiotropic effects on yield traits are validated, along with their potential roles by correlation analysis, domestication selection analysis, and transgenic plants.

Copyright © 2023 The Authors. Published by Elsevier Inc. All rights reserved.



关键字:

CP: Genomics;CP: Plants;GWAS;XGBoost;cotton;eQTL;machine learning;seed size;yield


全文
所属期刊
Cell Reports
ISSN: 2211-1247
来自:Elsevier