Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 217.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\27855707
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 Genome+Biol
2016 ; 17
(1
): 232
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
A novel codon-based de Bruijn graph algorithm for gene construction from
unassembled transcriptomes
#MMPMID27855707
Peng G
; Ji P
; Zhao F
Genome Biol
2016[Nov]; 17
(1
): 232
PMID27855707
show ga
Most gene prediction methods detect coding sequences from transcriptome
assemblies in the absence of closely related reference genomes. Such methods are
of limited application due to high transcript fragmentation and extensive
assembly errors, which may lead to redundant or false coding sequence
predictions. We present inGAP-CDG, which can construct full-length and
non-redundant coding sequences from unassembled transcriptomes by using a
codon-based de Bruijn graph to simplify the assembly process and a machine
learning-based approach to filter false positives. Compared with other methods,
inGAP-CDG exhibits a significant increase in predicted coding sequence length and
robustness to sequencing errors and varied read length.