Showing 1–28 of 28 results for author: Tamblyn, I

Searching in archive cs.
  1. arXiv:2307.02620  [pdf, other]

    cs.LG cs.AI

    Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning

    Authors: Colin Bellinger, Mark Crowley, Isaac Tamblyn

    Abstract: Reinforcement learning (RL) has been shown to learn sophisticated control policies for complex tasks including games, robotics, heating and cooling systems and text generation. The action-perception cycle in RL, however, generally assumes that a measurement of the state of the environment is available at each time step without a cost. In applications such as materials design, deep-sea and planetar…

    Submitted 18 April, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 Workshop WANT

    MSC Class: 68T01 ACM Class: I.2.0

  2. arXiv:2305.14177  [pdf, other]

    cs.LG physics.chem-ph

    ChemGymRL: An Interactive Framework for Reinforcement Learning for Digital Chemistry

    Authors: Chris Beeler, Sriram Ganapathi Subramanian, Kyle Sprague, Nouha Chatti, Colin Bellinger, Mitchell Shahen, Nicholas Paquin, Mark Baula, Amanuel Dawit, Zihan Yang, Xinkai Li, Mark Crowley, Isaac Tamblyn

    Abstract: This paper provides a simulated laboratory for making use of Reinforcement Learning (RL) for chemical discovery. Since RL is fairly data intensive, training agents 'on-the-fly' by taking actions in the real world is infeasible and possibly dangerous. Moreover, chemical processing and discovery involves challenges which are not commonly found in RL benchmarks and therefore offer a rich space to wor…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 19 pages, 13 figures, 2 tables

  3. arXiv:2301.01807  [pdf, other]

    cs.LG cs.MA q-fin.CP

    fintech-kMC: Agent based simulations of financial platforms for design and testing of machine learning systems

    Authors: Isaac Tamblyn, Tengkai Yu, Ian Benlolo

    Abstract: We discuss our simulation tool, fintech-kMC, which is designed to generate synthetic data for machine learning model development and testing. fintech-kMC is an agent-based model driven by a kinetic Monte Carlo (a.k.a. continuous time Monte Carlo) engine which simulates the behaviour of customers using an online digital financial platform. The tool provides an interpretable, reproducible, and reali…

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: To appear at AAAI-23 Bridge Program: AI for Financial Services, Washington D.C., February 7 - 8, 2023

  4. arXiv:2205.07408  [pdf, other]

    cs.LG cond-mat.stat-mech cs.NE

    Training neural networks using Metropolis Monte Carlo and an adaptive variant

    Authors: Stephen Whitelam, Viktor Selin, Ian Benlolo, Corneel Casert, Isaac Tamblyn

    Abstract: We examine the zero-temperature Metropolis Monte Carlo algorithm as a tool for training a neural network by minimizing a loss function. We find that, as expected on theoretical grounds and shown empirically by other authors, Metropolis Monte Carlo can train a neural net with an accuracy comparable to that of gradient descent, if not necessarily as quickly. The Metropolis algorithm does not fail au…

    Submitted 9 August, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

  5. arXiv:2205.04547  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci cs.AI

    Machine Learning Diffusion Monte Carlo Energies

    Authors: Kevin Ryczko, Jaron T. Krogel, Isaac Tamblyn

    Abstract: We present two machine learning methodologies that are capable of predicting diffusion Monte Carlo (DMC) energies with small datasets (~60 DMC calculations in total). The first uses voxel deep neural networks (VDNNs) to predict DMC energy densities using Kohn-Sham density functional theory (DFT) electron densities as input. The second uses kernel ridge regression (KRR) to predict atomic contributi…

    Submitted 5 October, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

  6. arXiv:2204.02474  [pdf, other]

    q-bio.BM cs.LG

    Generative Enriched Sequential Learning (ESL) Approach for Molecular Design via Augmented Domain Knowledge

    Authors: Mohammad Sajjad Ghaemi, Karl Grantham, Isaac Tamblyn, Yifeng Li, Hsu Kiang Ooi

    Abstract: Deploying generative machine learning techniques to generate novel chemical structures based on molecular fingerprint representation has been well established in molecular design. Typically, sequential learning (SL) schemes such as hidden Markov models (HMM) and, more recently, in the sequential deep learning context, recurrent neural network (RNN) and long short-term memory (LSTM) were used exten…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: 6 pages

  7. arXiv:2203.05551  [pdf, other]

    cs.NE cond-mat.stat-mech

    Cellular automata can classify data by inducing trajectory phase coexistence

    Authors: Stephen Whitelam, Isaac Tamblyn

    Abstract: We show that cellular automata can classify data by inducing a form of dynamical phase coexistence. We use Monte Carlo methods to search for general two-dimensional deterministic automata that classify images on the basis of activity, the number of state changes that occur in a trajectory initiated from the image. When the number of timesteps of the automaton is a trainable parameter, the search s…

    Submitted 25 July, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

  8. arXiv:2202.08708  [pdf, other]

    cond-mat.stat-mech cond-mat.soft cs.LG

    Learning stochastic dynamics and predicting emergent behavior using transformers

    Authors: Corneel Casert, Isaac Tamblyn, Stephen Whitelam

    Abstract: We show that a neural network originally designed for language processing can learn the dynamical rules of a stochastic system by observation of a single dynamical trajectory of the system, and can accurately predict its emergent behavior under conditions not observed during training. We consider a lattice model of active matter undergoing continuous-time Monte Carlo dynamics, simulated at a densi…

    Submitted 17 February, 2022; originally announced February 2022.

  9. arXiv:2112.14657  [pdf, other]

    math.OC cs.AI

    Dynamic programming with incomplete information to overcome navigational uncertainty in a nautical environment

    Authors: Chris Beeler, Xinkai Li, Colin Bellinger, Mark Crowley, Maia Fraser, Isaac Tamblyn

    Abstract: Using a novel toy nautical navigation environment, we show that dynamic programming can be used when only incomplete information about a partially observed Markov decision process (POMDP) is known. By incorporating uncertainty into our model, we show that navigation policies can be constructed that maintain safety, outperforming the baseline performance of traditional dynamic programming for Marko…

    Submitted 19 July, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

    Comments: 11 pages, 5 figures

  10. arXiv:2112.07535  [pdf, other]

    cs.LG cs.AI

    Scientific Discovery and the Cost of Measurement -- Balancing Information and Cost in Reinforcement Learning

    Authors: Colin Bellinger, Andriy Drozdyuk, Mark Crowley, Isaac Tamblyn

    Abstract: The use of reinforcement learning (RL) in scientific applications, such as materials design and automated chemistry, is increasing. A major challenge, however, lies in the fact that measuring the state of the system is often costly and time consuming in scientific applications, whereas policy learning with RL requires a measurement after each time step. In this work, we make the measurement costs expl…

    Submitted 6 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: To appear in: 1st Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)

  11. Twin Neural Network Regression is a Semi-Supervised Regression Algorithm

    Authors: Sebastian J. Wetzel, Roger G. Melko, Isaac Tamblyn

    Abstract: Twin neural network regression (TNNR) is a semi-supervised regression algorithm: it can be trained on unlabelled data points as long as other, labelled anchor data points are present. TNNR is trained to predict differences between the target values of two different data points rather than the targets themselves. By ensembling predicted differences between the targets of an unseen data point and a…

    Submitted 10 June, 2021; originally announced June 2021.

  12. arXiv:2103.03716  [pdf, other]

    math.OC cs.LG physics.chem-ph

    Golem: An algorithm for robust experiment and process optimization

    Authors: Matteo Aldeghi, Florian Häse, Riley J. Hickman, Isaac Tamblyn, Alán Aspuru-Guzik

    Abstract: Numerous challenges in science and engineering can be framed as optimization tasks, including the maximization of reaction yields, the optimization of molecular and materials properties, and the fine-tuning of automated hardware protocols. Design of experiment and optimization algorithms are often adopted to solve these tasks efficiently. Increasingly, these experiment planning strategies are coup…

    Submitted 12 October, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: 37 pages, 25 figures; additional experiments, expanded discussions and references

    Journal ref: Chemical Science, 2021, 12, 14792 - 14807

  13. arXiv:2102.11743  [pdf, other]

    cs.CV cs.AI

    Weakly-supervised multi-class object localization using only object counts as labels

    Authors: Kyle Mills, Isaac Tamblyn

    Abstract: We demonstrate the use of an extensive deep neural network to localize instances of objects in images. The EDNN is naturally able to accurately perform multi-class counting using only ground truth count values as labels. Without providing any conceptual information, object annotations, or pixel segmentation information, the neural network is able to formulate its own conceptual representation of t…

    Submitted 23 February, 2021; originally announced February 2021.

  14. arXiv:2101.04383  [pdf]

    cond-mat.mtrl-sci cs.LG

    Interpretable discovery of new semiconductors with machine learning

    Authors: Hitarth Choubisa, Petar Todorović, Joao M. Pina, Darshan H. Parmar, Ziliang Li, Oleksandr Voznyy, Isaac Tamblyn, Edward Sargent

    Abstract: Machine learning models of materials$^{1-5}$ accelerate discovery compared to ab initio methods: deep learning models now reproduce density functional theory (DFT)-calculated results at one hundred thousandths of the cost of DFT$^{6}$. To provide guidance in experimental materials synthesis, these need to be coupled with an accurate yet effective search algorithm and training data consistent with…

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 25 pages, 4 figures, 1 table

  15. arXiv:2012.14873  [pdf, other]

    cs.LG stat.ML

    Twin Neural Network Regression

    Authors: Sebastian J. Wetzel, Kevin Ryczko, Roger G. Melko, Isaac Tamblyn

    Abstract: We introduce twin neural network (TNN) regression. This method predicts differences between the target values of two different data points rather than the targets themselves. The solution of a traditional regression problem is then obtained by averaging over an ensemble of all predicted differences between the targets of an unseen data point and all training data points. Whereas ensembles are norm…

    Submitted 29 December, 2020; originally announced December 2020.

  16. arXiv:2012.11832  [pdf, other]

    cond-mat.stat-mech cond-mat.soft cs.NE

    Neuroevolutionary learning of particles and protocols for self-assembly

    Authors: Stephen Whitelam, Isaac Tamblyn

    Abstract: Within simulations of molecules deposited on a surface we show that neuroevolutionary learning can design particles and time-dependent protocols to promote self-assembly, without input from physical concepts such as thermal equilibrium or mechanical stability and without prior knowledge of candidate or competing structures. The learning algorithm is capable of both directed and exploratory design:…

    Submitted 22 December, 2020; originally announced December 2020.

    Journal ref: Phys. Rev. Lett. 127, 018003 (2021)

  17. arXiv:2012.10328  [pdf, ps, other]

    physics.optics cs.LG physics.atom-ph

    Deep learning and high harmonic generation

    Authors: M. Lytova, M. Spanner, I. Tamblyn

    Abstract: Using machine learning, we explore the utility of various deep neural networks (NN) when applied to high harmonic generation (HHG) scenarios. First, we train the NNs to predict the time-dependent dipole and spectra of HHG emission from reduced-dimensionality models of di- and triatomic systems based on sets of randomly generated parameters (laser pulse intensity, internuclear distance, and mole…

    Submitted 4 January, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

    Journal ref: Can. J. Phys. 101, 132 (2023)

  18. arXiv:2011.08657  [pdf, other]

    cond-mat.stat-mech cond-mat.dis-nn cs.LG

    Dynamical large deviations of two-dimensional kinetically constrained models using a neural-network state ansatz

    Authors: Corneel Casert, Tom Vieijra, Stephen Whitelam, Isaac Tamblyn

    Abstract: We use a neural network ansatz originally designed for the variational optimization of quantum systems to study dynamical large deviations in classical ones. We obtain the scaled cumulant-generating function for the dynamical activity of the Fredrickson-Andersen model, a prototypical kinetically constrained model, in one and two dimensions, and present the first size-scaling analysis of the dynami…

    Submitted 17 November, 2020; originally announced November 2020.

    Journal ref: Phys. Rev. Lett. 127, 120602 (2021)

  19. arXiv:2010.14236  [pdf, other]

    cs.LG cs.AI cs.CE physics.chem-ph quant-ph

    Scientific intuition inspired by machine learning generated hypotheses

    Authors: Pascal Friederich, Mario Krenn, Isaac Tamblyn, Alan Aspuru-Guzik

    Abstract: Machine learning with application to questions in the physical sciences has become a widely used tool, successfully applied to classification, regression and optimization tasks in many areas. Research focus mostly lies in improving the accuracy of the machine learning models in numerical predictions, while scientific understanding is still almost exclusively generated by human researchers analysin…

    Submitted 14 December, 2020; v1 submitted 27 October, 2020; originally announced October 2020.

    Journal ref: Machine Learning: Science and Technology 2, 025027 (2021)

  20. arXiv:2008.06643  [pdf, other]

    cs.NE cond-mat.stat-mech

    Correspondence between neuroevolution and gradient descent

    Authors: Stephen Whitelam, Viktor Selin, Sang-Won Park, Isaac Tamblyn

    Abstract: We show analytically that training a neural network by conditioned stochastic mutation or neuroevolution of its weights is equivalent, in the limit of small mutations, to gradient descent on the loss function in the presence of Gaussian white noise. Averaged over independent realizations of the learning process, neuroevolution is equivalent to gradient descent on the loss function. We use numerica…

    Submitted 10 September, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

  21. arXiv:2005.12697  [pdf, other]

    cs.AI

    Active Measure Reinforcement Learning for Observation Cost Minimization

    Authors: Colin Bellinger, Rory Coles, Mark Crowley, Isaac Tamblyn

    Abstract: Standard reinforcement learning (RL) algorithms assume that the observation of the next state comes instantaneously and at no cost. In a wide variety of sequential decision making tasks ranging from medical treatment to scientific discovery, however, multiple classes of state observations are possible, each of which has an associated cost. We propose the active measure RL framework (Amrl) as an in…

    Submitted 26 May, 2020; originally announced May 2020.

    Comments: Under review at NeurIPS 2020

    MSC Class: 68T01

  22. arXiv:2004.07333  [pdf, other]

    cs.LG cs.AI

    Reinforcement Learning in a Physics-Inspired Semi-Markov Environment

    Authors: Colin Bellinger, Rory Coles, Mark Crowley, Isaac Tamblyn

    Abstract: Reinforcement learning (RL) has been demonstrated to have great potential in many applications of scientific discovery and design. Recent work includes, for example, the design of new structures and compositions of molecules for therapeutic drugs. Much of the existing work related to the application of RL to scientific domains, however, assumes that the available state representation obeys the Mar…

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: To appear in the Canadian Conference on Artificial Intelligence, 2020

    ACM Class: I.2; J.2

  23. arXiv:2003.02647  [pdf, other]

    physics.data-an cs.LG physics.comp-ph

    Watch and learn -- a generalized approach for transferrable learning in deep neural networks via physical principles

    Authors: Kyle Sprague, Juan Carrasquilla, Steve Whitelam, Isaac Tamblyn

    Abstract: Transfer learning refers to the use of knowledge gained while solving a machine learning task and applying it to the solution of a closely related problem. Such an approach has enabled scientific breakthroughs in computer vision and natural language processing where the weights learned in state-of-the-art models can be used to initialize models for other tasks which dramatically improve their perf…

    Submitted 3 March, 2020; originally announced March 2020.

  24. arXiv:1912.08333  [pdf, other]

    cond-mat.stat-mech cs.LG cs.NE

    Learning to grow: control of material self-assembly using evolutionary reinforcement learning

    Authors: Stephen Whitelam, Isaac Tamblyn

    Abstract: We show that neural networks trained by evolutionary reinforcement learning can enact efficient molecular self-assembly protocols. Presented with molecular simulation trajectories, networks learn to change temperature and chemical potential in order to promote the assembly of desired structures or choose between competing polymorphs. In the first case, networks reproduce in a qualitative sense the…

    Submitted 28 May, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

    Journal ref: Phys. Rev. E 101, 052604 (2020)

  25. arXiv:1909.00835  [pdf, other]

    cond-mat.stat-mech cs.LG cs.NE physics.comp-ph

    Evolutionary reinforcement learning of dynamical large deviations

    Authors: Stephen Whitelam, Daniel Jacobson, Isaac Tamblyn

    Abstract: We show how to calculate the likelihood of dynamical large deviations using evolutionary reinforcement learning. An agent, a stochastic model, propagates a continuous-time Monte Carlo trajectory and receives a reward conditioned upon the values of certain path-extensive quantities. Evolution produces progressively fitter agents, eventually allowing the calculation of a piece of a large-deviation r…

    Submitted 21 February, 2020; v1 submitted 2 September, 2019; originally announced September 2019.

  26. arXiv:1903.08543  [pdf, other]

    cs.NE cond-mat.stat-mech cs.LG physics.comp-ph

    Optimizing thermodynamic trajectories using evolutionary and gradient-based reinforcement learning

    Authors: Chris Beeler, Uladzimir Yahorau, Rory Coles, Kyle Mills, Stephen Whitelam, Isaac Tamblyn

    Abstract: Using a model heat engine, we show that neural network-based reinforcement learning can identify thermodynamic trajectories of maximal efficiency. We consider both gradient and gradient-free reinforcement learning. We use an evolutionary learning algorithm to evolve a population of neural networks, subject to a directive to maximize the efficiency of a trajectory composed of a set of elementary th…

    Submitted 22 November, 2021; v1 submitted 20 March, 2019; originally announced March 2019.

    Comments: 11 pages, 5 figures

    Journal ref: Phys. Rev. E 104, 064128 (2021)

  27. arXiv:1702.01361  [pdf, other]

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Deep learning and the Schrödinger equation

    Authors: Kyle Mills, Michael Spanner, Isaac Tamblyn

    Abstract: We have trained a deep (convolutional) neural network to predict the ground-state energy of an electron in four classes of confining two-dimensional electrostatic potentials. On randomly generated potentials, for which there is no analytic form for either the potential or the ground-state energy, the neural network model was able to predict the ground-state energy to within chemical accuracy, with…

    Submitted 3 November, 2017; v1 submitted 4 February, 2017; originally announced February 2017.

    Journal ref: Phys. Rev. A 96, 042113 (2017)

  28. arXiv:1610.07458  [pdf, other]

    cs.SI physics.soc-ph

    Hashkat: Large-scale simulations of online social networks

    Authors: Kevin Ryczko, Adam Domurad, Nicholas Buhagiar, Isaac Tamblyn

    Abstract: Hashkat (http://hashkat.org) is a free, open source, agent based simulation software package designed to simulate large-scale online social networks (e.g. Twitter, Facebook, LinkedIn, etc). It allows for dynamic agent generation, edge creation, and information propagation. The purpose of hashkat is to study the growth of online social networks and how information flows within them. Like real life…

    Submitted 24 October, 2016; originally announced October 2016.