Deprecated: Implicit conversion from float 209.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 209.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Deprecated: Implicit conversion from float 209.6 to int loses precision in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 534
Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\28249565
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 BMC+Bioinformatics
2017 ; 18
(1
): 139
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Statistical method to compare massive parallel sequencing pipelines
#MMPMID28249565
Elsensohn MH
; Leblay N
; Dimassi S
; Campan-Fournier A
; Labalme A
; Roucher-Boulez F
; Sanlaville D
; Lesca G
; Bardel C
; Roy P
BMC Bioinformatics
2017[Mar]; 18
(1
): 139
PMID28249565
show ga
BACKGROUND: Today, sequencing is frequently carried out by Massive Parallel
Sequencing (MPS) that cuts drastically sequencing time and expenses.
Nevertheless, Sanger sequencing remains the main validation method to confirm the
presence of variants. The analysis of MPS data involves the development of
several bioinformatic tools, academic or commercial. We present here a
statistical method to compare MPS pipelines and test it in a comparison between
an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and
without reference to a gold standard (here, Sanger sequencing), on a panel of 41
genes in 43 epileptic patients. This method used the number of variants to fit
log-linear models for pairwise agreements between pipelines. To assess the
heterogeneity of the margins and the odds ratios of agreement, four log-linear
models were used: a full model, a homogeneous-margin model, a model with single
odds ratio for all patients, and a model with single intercept. Then a log-linear
mixed model was fitted considering the biological variability as a random effect.
RESULTS: Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK
found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and
indels), respectively. Against the gold standard, the pipelines had similar
sensitivities (63.47% vs. 63.42%) and close but significantly different
specificities (99.57% vs. 99.65%; p?0.001). Same-trend results were obtained
when only single nucleotide variants were considered (99.98% specificity and
76.81% sensitivity for both pipelines). CONCLUSIONS: The method allows thus
pipeline comparison and selection. It is generalizable to all types of MPS data
and all pipelines.