Machine learning algorithms translate big data into predictive breeding accuracy

Share

Citation

Crossa, J., Montesinos-Lopez, O. A., Costa-Neto, G., Vitale, P., Martini, J. W. R., Runcie, D., Fritsche-Neto, R., Montesinos-Lopez, A., Pérez-Rodríguez, P., Gerard, G., Dreisigacker, S., Crespo-Herrera, L., Pierre, C.S., Lillemo, M., Cuevas, J., Bentley, A., & Ortiz, R. (2024). Machine learning algorithms translate big data into predictive breeding accuracy. Trends in Plant Science. https://doi.org/10.1016/j.tplants.2024.09.011

Permanent link to cite or share this item

External link to download this item

Abstract/Description

Statistical machine learning (ML) extracts patterns from extensive genomic, phenotypic, and environmental data. ML algorithms automatically identify relevant features and use cross-validation to ensure robust models and improve prediction reliability in new lines. Furthermore, ML analyses of genotype-by-environment (G×E) interactions can offer insights into the genetic factors that affect performance in specific environments. By leveraging historical breeding data, ML streamlines strategies and automates analyses to reveal genomic patterns. In this review we examine the transformative impact of big data, including multi-trait genomics, phenomics, and environmental covariables, on genomic-enabled prediction in plant breeding. We discuss how big data and ML are revolutionizing the field by enhancing prediction accuracy, deepening our understanding of G×E interactions, and optimizing breeding strategies through the analysis of extensive and diverse datasets.

Investors/sponsors
CGIAR Action Areas
CGIAR Initiatives