Abstract
We compared two algorithms for ligand-target prediction, namely, the Laplacian-modified Bayesian classifier and the Winnow algorithm. A dataset derived from the WOMBAT database, spanning 20 pharmaceutically relevant activity classes with 13000 compounds, was used for performance assessment in 24 different experiments, each of which was assessed using a 15-fold Monte Carlo cross-validation. Compounds were described by different circular fingerprints, ECFP_4 and MOLPRINT 2D. A detailed analysis of the resulting 2.4 million predictions led to very similar measures for overall accuracy for both classifiers, whereas we observed significant differences for individual activity classes. Moreover, we analyzed our data with respect to the numbers of compounds which are exclusively retrieved by either of the algorithms - but never by the other - or by neither of them. This provided detailed information that can never be obtained by considering the overall performance statistics alone.
Original language | English |
---|---|
Pages (from-to) | 2313-2325 |
Number of pages | 13 |
Journal | Journal of Chemical Information and Modeling |
Volume | 48 |
Issue number | 12 |
DOIs | |
Publication status | Published - Dec 2008 |
Keywords
- MOLECULAR SIMILARITY
- QSPR
- FINGERPRINTS
- DESCRIPTORS
- CLASSIFIER
- SOLUBILITY
- CHEMISTRY
- MACHINE
- SPECTRA
- MODELS