%0 Journal Article %A Jeffrey L. Neyhart %A Tyler Tiede %A Aaron J. Lorenz %A Kevin P. Smith %T Evaluating Methods of Updating Training Data in Long-Term Genomewide Selection %D 2016 %R 10.1101/087163 %J bioRxiv %P 087163 %X Genomewide selection is hailed for its ability to facilitate greater genetic gains per unit time. Over breeding cycles, the requisite linkage disequilibrium (LD) between quantitative trait loci (QTL) and markers is expected to change as a result of recombination, selection, and drift, leading to a decay in prediction accuracy. Previous research has identified the need to update the training population using data that may capture new LD generated over breeding cycles, however optimal methods of updating have not been explored. In a barley (Hordeum vulgare L.) breeding simulation experiment, we examined prediction accuracy and response to selection when updating the training population each cycle with the best predicted lines, the worst predicted lines, random lines, criterion-selected lines, or no lines. In the short-term, we found that updating with the best predicted lines resulted in greater prediction accuracy and genetic gain, but in the long-term, all methods (besides not updating) performed similarly. We also examined the impact of including all data in the training population or only the most recent data. Though patterns among update methods were similar, using a smaller, but more recent training population provided a slight advantage in prediction accuracy and genetic gain. In an actual breeding program, a breeder might desire to gather phenotypic data on lines predicted to be the best, perhaps to evaluate possible cultivars. Therefore, our results suggest that the most optimal method of updating the training population is also the most practical. %U https://www.biorxiv.org/content/biorxiv/early/2016/11/10/087163.full.pdf