Warning: file_get_contents(https://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=27286957
&cmd=llinks): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests
in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 215
Deprecated: Implicit conversion from float 211.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 211.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 211.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\27286957
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 Genet+Sel+Evol
2016 ; 48
(1
): 42
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Genome-wide prediction using Bayesian additive regression trees
#MMPMID27286957
Waldmann P
Genet Sel Evol
2016[Jun]; 48
(1
): 42
PMID27286957
show ga
BACKGROUND: The goal of genome-wide prediction (GWP) is to predict phenotypes
based on marker genotypes, often obtained through single nucleotide polymorphism
(SNP) chips. The major problem with GWP is high-dimensional data from many
thousands of SNPs scored on several thousands of individuals. A large number of
methods have been developed for GWP, which are mostly parametric methods that
assume statistical linearity and only additive genetic effects. The Bayesian
additive regression trees (BART) method was recently proposed and is based on the
sum of nonparametric regression trees with the priors being used to regularize
the parameters. Each regression tree is based on a recursive binary partitioning
of the predictor space that approximates an unknown function, which will
automatically model nonlinearities within SNPs (dominance) and interactions
between SNPs (epistasis). In this study, we introduced BART and compared its
predictive performance with that of the LASSO, Bayesian LASSO (BLASSO), genomic
best linear unbiased prediction (GBLUP), reproducing kernel Hilbert space (RKHS)
regression and random forest (RF) methods. RESULTS: Tests on the QTLMAS2010
simulated data, which are mainly based on additive genetic effects, show that
cross-validated optimization of BART provides a smaller prediction error than the
RF, BLASSO, GBLUP and RKHS methods, and is almost as accurate as the LASSO
method. If dominance and epistasis effects are added to the QTLMAS2010 data, the
accuracy of BART relative to the other methods was increased. We also showed that
BART can produce importance measures on the SNPs through variable inclusion
proportions. In evaluations using real data on pigs, the prediction error was
smaller with BART than with the other methods. CONCLUSIONS: BART was shown to be
an accurate method for GWP, in which the regression trees guarantee a very sparse
representation of additive and complex non-additive genetic effects. Moreover,
the Markov chain Monte Carlo algorithm with Bayesian back-fitting provides a
computationally efficient procedure that is suitable for high-dimensional genomic
data.