QuantumTox: Utilizing quantum chemistry with ensemble learning for molecular toxicity prediction
Computers in Biology and Medicine, 2023•Elsevier
Molecular toxicity prediction plays an important role in drug discovery, which is directly
related to human health and drug fate. Accurately determining the toxicity of molecules can
help weed out low-quality molecules in the early stage of drug discovery process and avoid
depletion later in the drug development process. Nowadays, more and more researchers
are starting to use machine learning methods to predict the toxicity of molecules, but these
models do not fully exploit the 3D information of molecules. Quantum chemical information …
related to human health and drug fate. Accurately determining the toxicity of molecules can
help weed out low-quality molecules in the early stage of drug discovery process and avoid
depletion later in the drug development process. Nowadays, more and more researchers
are starting to use machine learning methods to predict the toxicity of molecules, but these
models do not fully exploit the 3D information of molecules. Quantum chemical information …
Abstract
Molecular toxicity prediction plays an important role in drug discovery, which is directly related to human health and drug fate. Accurately determining the toxicity of molecules can help weed out low-quality molecules in the early stage of drug discovery process and avoid depletion later in the drug development process. Nowadays, more and more researchers are starting to use machine learning methods to predict the toxicity of molecules, but these models do not fully exploit the 3D information of molecules. Quantum chemical information, which provides stereo structural information of molecules, can influence their toxicity. To this end, we propose QuantumTox, the first application of quantum chemistry in the field of drug molecule toxicity prediction compared to existing work. We extract the quantum chemical information of molecules as their 3D features. In the downstream prediction phase, we use Gradient Boosting Decision Tree and Bagging ensemble learning methods together to improve the accuracy and generalization of the model. A series of experiments on various tasks show that our model consistently outperforms the baseline model and that the model still performs well on small datasets of less than 300.
Elsevier
Showing the best result for this search. See all results