Machine learning methods in chemoinformatics

Research output: Contribution to journalArticlepeer-review

Abstract

Machine learning algorithms are generally developed in computer science or adjacent disciplines and find their way into chemical modeling by a process of diffusion. Though particular machine learning methods are popular in chemoinformatics and quantitative structure-activity relationships (QSAR), many others exist in the technical literature. This discussion is methods-based and focused on some algorithms that chemoinformatics researchers frequently use. It makes no claim to be exhaustive. We concentrate on methods for supervised learning, predicting the unknown property values of a test set of instances, usually molecules, based on the known values for a training set. Particularly relevant approaches include Artificial Neural Networks, Random Forest, Support Vector Machine, k-Nearest Neighbors and naïve Bayes classifiers.
Original languageEnglish
Pages (from-to)468–481
JournalWiley Interdisciplinary Reviews: Computational Molecular Science
Volume4
Issue number5
Early online date24 Feb 2014
DOIs
Publication statusPublished - 24 Feb 2014

Keywords

  • Machine learning
  • Quantitative structure–activity relationships (QSAR)
  • Chemoinformatics
  • Algorithm
  • Artificial Neural Networks
  • Random Forest
  • Support Vector Machine
  • k-Nearest Neighbours
  • Naïve Bayes classifiers

Fingerprint

Dive into the research topics of 'Machine learning methods in chemoinformatics'. Together they form a unique fingerprint.

Cite this