How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?

Goh, Garrett B.; Siegel, Charles; Vishnu, Abhinav; Hodas, Nathan O.; Baker, Nathan

arXiv:1710.02238 (stat)

[Submitted on 5 Oct 2017 (v1), last revised 18 Mar 2018 (this version, v2)]

Abstract: The meteoric rise of deep learning models in computer vision research, having achieved human-level accuracy in image recognition tasks, is firm evidence of the impact of representation learning in deep neural networks. In the chemistry domain, recent advances have also led to the development of similar CNN models, such as Chemception, which is trained to predict chemical properties using images of molecular drawings. In this work, we investigate the effects of systematically removing and adding localized domain-specific information to the image channels of the training data. By augmenting images with only 3 additional pieces of basic information, and without introducing any architectural changes, we demonstrate that an augmented Chemception (AugChemception) outperforms the original model in the prediction of toxicity, activity, and solvation free energy. Then, by altering the information content in the images and examining the resulting model's performance, we also identify two distinct learning patterns in predicting toxicity/activity as compared to solvation free energy. These patterns suggest that Chemception is learning about its tasks in a manner that is consistent with established knowledge. Thus, our work demonstrates that advanced chemical knowledge is not a prerequisite for deep learning models to accurately predict complex chemical properties.
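The idea of adding or removing localized information in image channels can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which properties are encoded, so the per-atom feature values, the grid size, the coordinate extent, and the helper names `render_channels` and `zero_channel` are all hypothetical. Each atom writes its feature values into the pixel at its 2D position, one channel per feature, and a channel can later be blanked to imitate removing that piece of information.

```python
import math
from typing import List, Sequence, Tuple

# image[channel][row][col]; plain lists keep the sketch dependency-free.
Image = List[List[List[float]]]
# (x, y, per-atom feature values); all values here are illustrative placeholders.
Atom = Tuple[float, float, Sequence[float]]


def render_channels(atoms: Sequence[Atom], size: int, extent: float) -> Image:
    """Rasterise atoms onto a size x size grid, one channel per feature.

    Coordinates in [0, extent) map linearly onto pixel indices; atoms that
    fall outside the grid are skipped.
    """
    if not atoms:
        raise ValueError("need at least one atom")
    n_channels = len(atoms[0][2])
    image = [[[0.0] * size for _ in range(size)] for _ in range(n_channels)]
    for x, y, features in atoms:
        if len(features) != n_channels:
            raise ValueError("every atom must carry the same number of features")
        col = math.floor(x / extent * size)
        row = math.floor(y / extent * size)
        if 0 <= row < size and 0 <= col < size:
            for channel, value in enumerate(features):
                image[channel][row][col] = float(value)
    return image


def zero_channel(image: Image, index: int) -> Image:
    """Return a copy of the image with one channel blanked (an ablation)."""
    result = []
    for channel, plane in enumerate(image):
        if channel == index:
            result.append([[0.0] * len(row) for row in plane])
        else:
            result.append([list(row) for row in plane])
    return result
```

Because the information lives in extra channels of the input, a model that accepts multi-channel images needs no architectural change beyond its input depth, which is consistent with the abstract's remark that no architectural changes were introduced.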

Comments:	In Proceedings of 2018 IEEE Winter Conference on Applications of Computer Vision (WACV)
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1710.02238 [stat.ML]
	(or arXiv:1710.02238v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1710.02238

Submission history

From: Garrett Goh [view email]
[v1] Thu, 5 Oct 2017 23:53:59 UTC (1,020 KB)
[v2] Sun, 18 Mar 2018 14:03:12 UTC (251 KB)
