Abstract
We report the results of testing quantitative structure-property relationships (QSPR) that were trained upon the same druglike molecules but two different sets of solubility data: (i) data extracted from several different sources from the published literature, for which the experimental uncertainty is estimated to be 0.6-0.7 log S units (referred to mol/L); (ii) data measured by a single accurate experimental method (CheqSol), for which experimental uncertainty is typically <0.05 log S units. Contrary to what might be expected, the models derived from the CheqSol experimental data are not more accurate than those derived from the "noisy" literature data. The results suggest that, at the present time, it is the deficiency of QSPR methods (algorithms and/or descriptor sets), and not, as is commonly quoted, the uncertainty in the experimental measurements, which is the limiting factor in accurately predicting aqueous solubility for pharmaceutical molecules.
Original language | English |
---|---|
Pages (from-to) | 2962-2972 |
Number of pages | 11 |
Journal | Molecular Pharmaceutics |
Volume | 11 |
Issue number | 8 |
Early online date | 9 Jul 2014 |
DOIs | |
Publication status | Published - 4 Aug 2014 |
Keywords
- Solubility
- Bioavailability
- QSPR
- QSAR
- Druglike
- ADME
- Random Forest
- Dissolution
- Experimental error
- CheqSol
- Noyes-Whitney
- Henderson-Hasselbalch
- Polymorph
- Crystal
- Machine learning
- General solubility equation
- ADMET
- Pharmaceutical
- Rule-of-five
Fingerprint
Dive into the research topics of 'Is experimental data quality the limiting factor in predicting the aqueous solubility of druglike molecules?'. Together they form a unique fingerprint.Profiles
Datasets
-
Data underpinning: Is Experimental Data Quality the Limiting Factor in Predicting the Aqueous Solubility of Druglike Molecules?
Mitchell, J. B. O. (Creator) & Palmer, D. (Creator), University of St Andrews, 11 Jun 2014
DOI: 10.17630/1a4dbdf0-b2ba-42f6-9408-e5895ccb9faf
Dataset
File