Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\29360259
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 Mol+Inform
2018 ; 37
(1-2
): ä Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Classifiers and their Metrics Quantified
#MMPMID29360259
Brown JB
Mol Inform
2018[Jan]; 37
(1-2
): ä PMID29360259
show ga
Molecular modeling frequently constructs classification models for the prediction
of two-class entities, such as compound bio(in)activity, chemical property
(non)existence, protein (non)interaction, and so forth. The models are evaluated
using well known metrics such as accuracy or true positive rates. However, these
frequently used metrics applied to retrospective and/or artificially generated
prediction datasets can potentially overestimate true performance in actual
prospective experiments. Here, we systematically consider metric value surface
generation as a consequence of data balance, and propose the computation of an
inverse cumulative distribution function taken over a metric surface. The
proposed distribution analysis can aid in the selection of metrics when
formulating study design. In addition to theoretical analyses, a practical
example in chemogenomic virtual screening highlights the care required in metric
selection and interpretation.