Data augmentation enhances plant-genomic-enabled predictions

Montesinos-López, O. A., Solis-Camacho, M. A., Crespo-Herrera, L., Saint Pierre, C., Huerta Prado, G. I., Ramos-Pulido, S., Al-Nowibet, K., Fritsche-Neto, R., Gerard, G. S., Montesinos-López, A., & Crossa, J. (2024). Data augmentation enhances plant-genomic-enabled predictions. Genes, 15(3), 286. https://doi.org/10.3390/genes15030286

Permanent link to cite or share this item

https://hdl.handle.net/10568/159826

External link to download this item

https://hdl.handle.net/10883/23127

DOI

https://doi.org/10.3390/genes15030286

Abstract/Description

Genomic selection (GS) is revolutionizing plant breeding. However, its practical implementation is still challenging, since there are many factors that affect its accuracy. For this reason, this research explores data augmentation with the goal of improving its accuracy. Deep neural networks with data augmentation (DA) generate synthetic data from the original training set to increase the training set and to improve the prediction performance of any statistical or machine learning algorithm. There is much empirical evidence of their success in many computer vision applications. Due to this, DA was explored in the context of GS using 14 real datasets. We found empirical evidence that DA is a powerful tool to improve the prediction accuracy, since we improved the prediction accuracy of the top lines in the 14 datasets under study. On average, across datasets and traits, the gain in prediction performance of the DA approach regarding the Conventional method in the top 20% of lines in the testing set was 108.4% in terms of the NRMSE and 107.4% in terms of the MAAPE, but a worse performance was observed on the whole testing set. We encourage more empirical evaluations to support our findings.

Author ORCID identifiers

Collections

CGIAR Initiative on Accelerated Breeding

Data augmentation enhances plant-genomic-enabled predictions

Files

Authors

Date Issued

Date Online

Language

Type

Review Status

Access Rights

Usage Rights

Metadata

Share

Citation

Permanent link to cite or share this item

External link to download this item

DOI

Abstract/Description

Author ORCID identifiers

AGROVOC Keywords

Organizations Affiliated to the Authors

Investors/sponsors

CGIAR Action Areas

CGIAR Impact Areas

CGIAR Initiatives

Collections