Statistical and machine learning approaches to predicting protein-ligand interactions

Curr Opin Struct Biol. 2018 Apr:49:123-128. doi: 10.1016/j.sbi.2018.01.006. Epub 2018 Feb 20.

Abstract

Data driven computational approaches to predicting protein-ligand binding are currently achieving unprecedented levels of accuracy on held-out test datasets. Up until now, however, this has not led to corresponding breakthroughs in our ability to design novel ligands for protein targets of interest. This review summarizes the current state of the art in this field, emphasizing the recent development of deep neural networks for predicting protein-ligand binding. We explain the major technical challenges that have caused difficulty with predicting novel ligands, including the problems of sampling noise and the challenge of using benchmark datasets that are sufficiently unbiased that they allow the model to extrapolate to new regimes.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Computational Biology* / methods
  • Humans
  • Ligands*
  • Machine Learning*
  • Models, Statistical*
  • Protein Binding
  • Proteins / chemistry*
  • Quantitative Structure-Activity Relationship*

Substances

  • Ligands
  • Proteins