Deprecated: Implicit conversion from float 231.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 231.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534 Nature 2015 ; 517 (7536): 608-11 Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Resolving the complexity of the human genome using single-molecule sequencing #MMPMID25383537
Chaisson MJ; Huddleston J; Dennis MY; Sudmant PH; Malig M; Hormozdiari F; Antonacci F; Surti U; Sandstrom R; Boitano M; Landolin JM; Stamatoyannopoulos JA; Hunkapiller MW; Korlach J; Eichler EE
Nature 2015[Jan]; 517 (7536): 608-11 PMID25383537show ga
The human genome is arguably the most complete mammalian reference assembly1?3 yet more than 160 euchromatic gaps remain4?6 and aspects of its structural variation remain poorly understood ten years after its completion7?9. In order to identify missing sequence and genetic variation, we sequenced and analyzed a haploid human genome (CHM1) using single-molecule, real-time (SMRT) DNA sequencing10. We closed or extended 55% of the remaining interstitial gaps in the human GRCh37 reference genome?78% of which carried long runs of degenerate short tandem repeats (STRs) often multiple kilobases in length embedded within GC-rich genomic regions. We resolved the complete sequence of 26,079 euchromatic structural variants at the basepair level, including inversions, complex insertions, and long tracts of tandem repeats. Most have not been previously reported with the greatest increases in sensitivity occurring for events less than 5 kbp in size. Compared to the human reference, we find a significant insertional bias (3:1) in regions corresponding to complex insertions and long STRs. Our results suggest a greater complexity of the human genome in the form of variation of longer and more complex repetitive DNA that can now be largely resolved with the application of this longer-read sequencing technology.