Is experimental data quality the limiting factor in predicting the aqueous solubility of druglike molecules?

David S. Palmer*, John B.O. Mitchell*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

We report the results of testing quantitative structure-property relationships (QSPR) that were trained upon the same druglike molecules but two different sets of solubility data: (i) data extracted from several different sources from the published literature, for which the experimental uncertainty is estimated to be 0.6-0.7 log S units (referred to mol/L); (ii) data measured by a single accurate experimental method (CheqSol), for which experimental uncertainty is typically <0.05 log S units. Contrary to what might be expected, the models derived from the CheqSol experimental data are not more accurate than those derived from the "noisy" literature data. The results suggest that, at the present time, it is the deficiency of QSPR methods (algorithms and/or descriptor sets), and not, as is commonly quoted, the uncertainty in the experimental measurements, which is the limiting factor in accurately predicting aqueous solubility for pharmaceutical molecules.

Original languageEnglish
Pages (from-to)2962-2972
Number of pages11
JournalMolecular Pharmaceutics
Volume11
Issue number8
Early online date9 Jul 2014
DOIs
Publication statusPublished - 4 Aug 2014

Keywords

  • Solubility
  • Bioavailability
  • QSPR
  • QSAR
  • Druglike
  • ADME
  • Random Forest
  • Dissolution
  • Experimental error
  • CheqSol
  • Noyes-Whitney
  • Henderson-Hasselbalch
  • Polymorph
  • Crystal
  • Machine learning
  • General solubility equation
  • ADMET
  • Pharmaceutical
  • Rule-of-five

Fingerprint

Dive into the research topics of 'Is experimental data quality the limiting factor in predicting the aqueous solubility of druglike molecules?'. Together they form a unique fingerprint.

Cite this