Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 233.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 267.2 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 267.2 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 267.2 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\29650041
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 J+Biomed+Semantics
2018 ; 9
(1
): 13
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Deep learning meets ontologies: experiments to anchor the cardiovascular disease
ontology in the biomedical literature
#MMPMID29650041
Arguello Casteleiro M
; Demetriou G
; Read W
; Fernandez Prieto MJ
; Maroto N
; Maseda Fernandez D
; Nenadic G
; Klein J
; Keane J
; Stevens R
J Biomed Semantics
2018[Apr]; 9
(1
): 13
PMID29650041
show ga
BACKGROUND: Automatic identification of term variants or acceptable alternative
free-text terms for gene and protein names from the millions of biomedical
publications is a challenging task. Ontologies, such as the Cardiovascular
Disease Ontology (CVDO), capture domain knowledge in a computational form and can
provide context for gene/protein names as written in the literature. This study
investigates: 1) if word embeddings from Deep Learning algorithms can provide a
list of term variants for a given gene/protein of interest; and 2) if biological
knowledge from the CVDO can improve such a list without modifying the word
embeddings created. METHODS: We have manually annotated 105 gene/protein names
from 25 PubMed titles/abstracts and mapped them to 79 unique UniProtKB entries
corresponding to gene and protein classes from the CVDO. Using more than 14 M
PubMed articles (titles and available abstracts), word embeddings were generated
with CBOW and Skip-gram. We setup two experiments for a synonym detection task,
each with four raters, and 3672 pairs of terms (target term and candidate term)
from the word embeddings created. For Experiment I, the target terms for 64
UniProtKB entries were those that appear in the titles/abstracts; Experiment II
involves 63 UniProtKB entries and the target terms are a combination of terms
from PubMed titles/abstracts with terms (i.e. increased context) from the CVDO
protein class expressions and labels. RESULTS: In Experiment I, Skip-gram finds
term variants (full and/or partial) for 89% of the 64 UniProtKB entries, while
CBOW finds term variants for 67%. In Experiment II (with the aid of the CVDO),
Skip-gram finds term variants for 95% of the 63 UniProtKB entries, while CBOW
finds term variants for 78%. Combining the results of both experiments, Skip-gram
finds term variants for 97% of the 79 UniProtKB entries, while CBOW finds term
variants for 81%. CONCLUSIONS: This study shows performance improvements for both
CBOW and Skip-gram on a gene/protein synonym detection task by adding knowledge
formalised in the CVDO and without modifying the word embeddings created. Hence,
the CVDO supplies context that is effective in inducing term variability for both
CBOW and Skip-gram while reducing ambiguity. Skip-gram outperforms CBOW and finds
more pertinent term variants for gene/protein names annotated from the scientific
literature.