Please use this identifier to cite or link to this item: http://www.alice.cnptia.embrapa.br/alice/handle/doc/1110534
Research center of Embrapa/Collection: Embrapa Pecuária Sudeste - Article in indexed journal (ALICE)
Date Issued: 2019
Type of Material: Article in indexed journal (ALICE)
Authors: FERREIRA FILHO, D.
BUENO FILHO, J. S. de S.
REGITANO, L. C. de A.
ALENCAR, M. M. de
ALVES, R. R.
BAENA, M. M.
MEIRELLES, S. L. C.
Additional Information: Diógenes Ferreira Filho, UFRRJ; Júlio Sílvio de Sousa Bueno Filho, UFLA; Luciana Correia de Almeida Regitano, CPPSE; Mauricio Mello de Alencar, CPPSE; Rosiana Rodrigues Alves, CNPASA; Marielle Moura Baena, UFLA; Sarah Laguna Conceição Meirelles, UFLA.
Title: Tournaments between markers as a strategy to enhance genomic predictions.
Publisher: PLOS ONE, v. 14, n. 7, e0219448, p. 1-17, 2019.
Language: en
Keywords: Genome-wide
GWAS
GWS
SNPs
Genomic Breeding Values
Description: Analysis of a large number of markers is crucial in both genome-wide association studies (GWAS) and genome-wide selection (GWS). However, two methodological issues restrict statistical analysis: high dimensionality (p>>n) and multicollinearity. Although some methodologies can fit models to high-dimensional data (e.g., the Bayesian Lasso), a major problem in these cases is that the model may predict well for the individuals used to fit it but poorly for other individuals, which restricts its applicability. This problem can be circumvented by applying a selection methodology that reduces the number of markers, while keeping those associated with the phenotypic trait, before fitting a model to predict genomic breeding values (GBVs). We revisit a tournament-based strategy between marker samples, where each sample has good statistical properties for estimation: n>p and low collinearity. The tournaments use multiple linear regression to eliminate markers. This method is adapted from previous work in the literature. We used simulated data as well as real data derived from a study with SNPs in beef cattle. Tournament strategies not only circumvent the p>>n issue but also minimize spurious associations. For the real data, selecting slightly more than 20 markers gave correlations greater than 0.70 between predicted GBVs and phenotypes in the validation groups of a cross-validation scheme; selecting a larger number of markers (more than 100) gave correlations exceeding 0.90, showing the efficiency of the approach in identifying relevant SNPs (or segregations) for both GWAS and GWS. The simulation study gave similar results.
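The description above does not spell out the tournament rules, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' procedure. It assumes markers are split at random into groups smaller than the number of individuals (so each regression has n>p), each group is fitted by ordinary multiple linear regression, markers are ranked by absolute t-statistic, and the top half of each group advances to the next round. The function names `ols_abs_t` and `tournament_select`, the parameters `group_size` and `n_final`, and the halving rule are all hypothetical; screening groups for low collinearity, which the description mentions, is omitted here.

```python
import numpy as np


def ols_abs_t(X, y):
    """Absolute t-statistics of the slopes from an OLS fit of y on X plus an intercept."""
    n, p = X.shape
    A = np.column_stack([np.ones(n), X])           # design matrix with intercept
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)   # least-squares coefficients
    resid = y - A @ beta
    sigma2 = resid @ resid / (n - p - 1)           # residual variance estimate
    cov = sigma2 * np.linalg.pinv(A.T @ A)         # coefficient covariance matrix
    se = np.sqrt(np.clip(np.diag(cov), 1e-12, None))
    return np.abs(beta[1:] / se[1:])               # drop the intercept


def tournament_select(X, y, group_size=20, n_final=20, seed=0):
    """Return the indices of at most n_final markers that survive the tournament.

    Assumed rule (illustrative only): random groups, top half of each group
    by |t| advances; a final single-group round keeps the n_final best.
    """
    n, p = X.shape
    if not 2 <= group_size < n - 1:
        raise ValueError("group_size must satisfy 2 <= group_size < n - 1")
    rng = np.random.default_rng(seed)
    survivors = np.arange(p)
    while survivors.size > n_final:
        rng.shuffle(survivors)                     # new random grouping each round
        if survivors.size <= group_size:
            # Final round: everything fits in one regression.
            t = ols_abs_t(X[:, survivors], y)
            survivors = survivors[np.argsort(t)[::-1][:n_final]]
            break
        winners = []
        for start in range(0, survivors.size, group_size):
            group = survivors[start:start + group_size]
            t = ols_abs_t(X[:, group], y)
            k = max(1, group.size // 2)            # top half advances
            winners.append(group[np.argsort(t)[::-1][:k]])
        survivors = np.concatenate(winners)
    return np.sort(survivors)
```

Calling `tournament_select(X, y, group_size=20, n_final=10)` on an individuals-by-markers genotype matrix `X` and a phenotype vector `y` returns the column indices of the retained markers, which could then be passed to whatever prediction model is used for GBVs. Because halving can undershoot, the function may return fewer than `n_final` markers.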
Thesagro: Genótipo
Genoma
NAL Thesaurus: Genotyping
Date Created: 2019-07-10
Appears in Collections: Article in indexed journal (CPPSE)

Files in This Item:
File: Tournamentsbetweenmarkersasastrategycorrecao.pdf; Size: 2.7 MB; Format: Adobe PDF
File: TournamentsBetweenMarkers.pdf; Size: 2.08 MB; Format: Adobe PDF
