Warning: file_get_contents(https://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=29108274
&cmd=llinks): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests
in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 215
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 219.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 253.2 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 253.2 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\29108274
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 Oncotarget
2017 ; 8
(45
): 78901-78916
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Chromosome preference of disease genes and vectorization for the prediction of
non-coding disease genes
#MMPMID29108274
Peng H
; Lan C
; Liu Y
; Liu T
; Blumenstein M
; Li J
Oncotarget
2017[Oct]; 8
(45
): 78901-78916
PMID29108274
show ga
Disease-related protein-coding genes have been widely studied, but
disease-related non-coding genes remain largely unknown. This work introduces a
new vector to represent diseases, and applies the newly vectorized data for a
positive-unlabeled learning algorithm to predict and rank disease-related long
non-coding RNA (lncRNA) genes. This novel vector representation for diseases
consists of two sub-vectors, one is composed of 45 elements, characterizing the
information entropies of the disease genes distribution over 45 chromosome
substructures. This idea is supported by our observation that some substructures
(e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein
coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second
sub-vector is 30-dimensional, characterizing the distribution of disease gene
enriched KEGG pathways in comparison with our manually created pathway groups.
The second sub-vector complements with the first one to differentiate between
various diseases. Our prediction method outperforms the state-of-the-art methods
on benchmark datasets for prioritizing disease related lncRNA genes. The method
also works well when only the sequence information of an lncRNA gene is known, or
even when a given disease has no currently recognized long non-coding genes.