Warning: file_get_contents(https://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=28472344
&cmd=llinks): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests
in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 215
Deprecated: Implicit conversion from float 209.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 209.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\28472344
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 Nucleic+Acids+Res
2017 ; 45
(12
): e114
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Gene set selection via LASSO penalized regression (SLPR)
#MMPMID28472344
Frost HR
; Amos CI
Nucleic Acids Res
2017[Jul]; 45
(12
): e114
PMID28472344
show ga
Gene set testing is an important bioinformatics technique that addresses the
challenges of power, interpretation and replication. To better support the
analysis of large and highly overlapping gene set collections, researchers have
recently developed a number of multiset methods that jointly evaluate all gene
sets in a collection to identify a parsimonious group of functionally independent
sets. Unfortunately, current multiset methods all use binary indicators for gene
and gene set activity and assume that a gene is active if any containing gene set
is active. This simplistic model limits performance on many types of genomic
data. To address this limitation, we developed gene set Selection via LASSO
Penalized Regression (SLPR), a novel mapping of multiset gene set testing to
penalized multiple linear regression. The SLPR method assumes a linear
relationship between continuous measures of gene activity and the activity of all
gene sets in the collection. As we demonstrate via simulation studies and the
analysis of TCGA data using MSigDB gene sets, the SLPR method outperforms
existing multiset methods when the true biological process is well approximated
by continuous activity measures and a linear association between genes and gene
sets.