Skip to main content

Showing 1–50 of 55 results for author: Baydin, A G

.
  1. arXiv:2402.04830  [pdf, other

    cs.LG astro-ph.EP

    Closing the Gap Between SGP4 and High-Precision Propagation via Differentiable Programming

    Authors: Giacomo Acciarini, Atılım Güneş Baydin, Dario Izzo

    Abstract: The Simplified General Perturbations 4 (SGP4) orbital propagation method is widely used for predicting the positions and velocities of Earth-orbiting objects rapidly and reliably. Despite continuous refinement, SGP models still lack the precision of numerical propagators, which offer significantly smaller errors. This study presents dSGP4, a novel differentiable version of SGP4 implemented using P… ▽ More

    Submitted 7 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  2. arXiv:2312.06845  [pdf, other

    physics.space-ph astro-ph.EP cs.LG

    High-Cadence Thermospheric Density Estimation enabled by Machine Learning on Solar Imagery

    Authors: Shreshth A. Malik, James Walsh, Giacomo Acciarini, Thomas E. Berger, Atılım Güneş Baydin

    Abstract: Accurate estimation of thermospheric density is critical for precise modeling of satellite drag forces in low Earth orbit (LEO). Improving this estimation is crucial to tasks such as state estimation, collision avoidance, and re-entry calculations. The largest source of uncertainty in determining thermospheric density is modeling the effects of space weather driven by solar and geomagnetic activit… ▽ More

    Submitted 12 November, 2023; originally announced December 2023.

    Comments: Accepted at the Machine Learning and the Physical Sciences workshop, NeurIPS 2023

  3. arXiv:2310.17688  [pdf, other

    cs.CY cs.AI cs.CL cs.LG

    Managing extreme AI risks amid rapid progress

    Authors: Yoshua Bengio, Geoffrey Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Trevor Darrell, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atılım Güneş Baydin, Sheila McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca Dragan, Philip Torr, Stuart Russell, Daniel Kahneman, Jan Brauner, Sören Mindermann

    Abstract: Artificial Intelligence (AI) is progressing rapidly, and companies are shifting their focus to develo** generalist AI systems that can autonomously act and pursue goals. Increases in capabilities and autonomy may soon massively amplify AI's impact, with risks that include large-scale social harms, malicious uses, and an irreversible loss of human control over autonomous AI systems. Although rese… ▽ More

    Submitted 22 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Published in Science: https://www.science.org/doi/10.1126/science.adn0117

  4. arXiv:2308.10624  [pdf, other

    astro-ph.EP astro-ph.IM

    PyATMOS: A Scalable Grid of Hypothetical Planetary Atmospheres

    Authors: Aditya Chopra, Aaron C Bell, William Fawcett, Rodd Talebi, Daniel Angerhausen, Atılım Güneş Baydin, Anamaria Berea, Nathalie A. Cabrol, Christopher Kempes, Massimo Mascaro

    Abstract: Cloud computing offers an opportunity to run compute-resource intensive climate models at scale by parallelising model runs such that datasets useful to the exoplanet community can be produced efficiently. To better understand the statistical distributions and properties of potentially habitable planetary atmospheres we implemented a parallelised climate modelling tool to scan a range of hypotheti… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 9 pages, 6 figures

  5. arXiv:2304.00073  [pdf, other

    astro-ph.EP astro-ph.IM

    Towards 3D Retrieval of Exoplanet Atmospheres: Assessing Thermochemical Equilibrium Estimation Methods

    Authors: Michael D. Himes, Joseph Harrington, Atilim Gunes Baydin

    Abstract: Characterizing exoplanetary atmospheres via Bayesian retrievals requires assuming some chemistry model, such as thermochemical equilibrium or parameterized abundances. The higher-resolution data offered by upcoming telescopes enables more complex chemistry models within retrieval frameworks. Yet, many chemistry codes that model more complex processes like photochemistry and vertical transport are… ▽ More

    Submitted 9 April, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: 22 pages, 14 figures, submitted to PSJ 2022/11/22, revised 2023/3/7, accepted 2023/3/23. Updated to add Zenodo link to Reproducible Research Compendium

  6. arXiv:2208.09512  [pdf, other

    astro-ph.SR astro-ph.IM cs.CV cs.LG

    Exploring the Limits of Synthetic Creation of Solar EUV Images via Image-to-Image Translation

    Authors: Valentina Salvatelli, Luiz F. G. dos Santos, Souvik Bose, Brad Neuberg, Mark C. M. Cheung, Miho Janvier, Meng **, Yarin Gal, Atilim Gunes Baydin

    Abstract: The Solar Dynamics Observatory (SDO), a NASA multi-spectral decade-long mission that has been daily producing terabytes of observational data from the Sun, has been recently used as a use-case to demonstrate the potential of machine learning methodologies and to pave the way for future deep-space mission planning. In particular, the idea of using image-to-image translation to virtually produce ext… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: 16 pages, 8 figures. To be published on ApJ (submitted on Feb 21st, accepted on July 28th)

    Journal ref: ApJ 937 (2022) 100

  7. arXiv:2203.13818  [pdf, other

    physics.ins-det

    Toward the End-to-End Optimization of Particle Physics Instruments with Differentiable Programming: a White Paper

    Authors: Tommaso Dorigo, Andrea Giammanco, Pietro Vischia, Max Aehle, Mateusz Bawaj, Alexey Boldyrev, Pablo de Castro Manzano, Denis Derkach, Julien Donini, Auralee Edelen, Federica Fanzago, Nicolas R. Gauger, Christian Glaser, Atılım G. Baydin, Lukas Heinrich, Ralf Keidel, Jan Kieseler, Claudius Krause, Maxime Lagrange, Max Lamparth, Lukas Layer, Gernot Maier, Federico Nardi, Helge E. S. Pettersen, Alberto Ramos , et al. (11 additional authors not shown)

    Abstract: The full optimization of the design and operation of instruments whose functioning relies on the interaction of radiation with matter is a super-human task, given the large dimensionality of the space of possible choices for geometry, detection technology, materials, data-acquisition, and information-extraction techniques, and the interdependence of the related parameters. On the other hand, massi… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 109 pages, 32 figures. To be submitted to Reviews in Physics

  8. arXiv:2202.08587  [pdf, other

    cs.LG stat.ML

    Gradients without Backpropagation

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Don Syme, Frank Wood, Philip Torr

    Abstract: Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning. Backpropagation, or reverse-mode differentiation, is a special case within the general family of automatic differentiation algorithms that also includes the forward mode. We present a method to compute gradients based solely on the directional derivative that one can comp… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: 10 pages, 6 figures

    MSC Class: 68T07 ACM Class: I.2.6; I.2.5

  9. arXiv:2112.09051  [pdf

    astro-ph.SR cs.LG

    Simultaneous Multivariate Forecast of Space Weather Indices using Deep Neural Network Ensembles

    Authors: Bernard Benson, Edward Brown, Stefano Bonasera, Giacomo Acciarini, Jorge A. Pérez-Hernández, Eric Sutton, Moriba K. Jah, Christopher Bridges, Meng **, Atılım Güneş Baydin

    Abstract: Solar radio flux along with geomagnetic indices are important indicators of solar activity and its effects. Extreme solar events such as flares and geomagnetic storms can negatively affect the space environment including satellites in low-Earth orbit. Therefore, forecasting these space weather indices is of great importance in space operations and science. In this study, we propose a model based o… ▽ More

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: Fourth Workshop on Machine Learning and the Physical Sciences (NeurIPS 2021)

  10. arXiv:2112.03235  [pdf, other

    cs.AI cs.CE cs.LG cs.MS

    Simulation Intelligence: Towards a New Generation of Scientific Methods

    Authors: Alexander Lavin, David Krakauer, Hector Zenil, Justin Gottschlich, Tim Mattson, Johann Brehmer, Anima Anandkumar, Sanjay Choudry, Kamil Rocki, Atılım Güneş Baydin, Carina Prunkl, Brooks Paige, Olexandr Isayev, Erik Peterson, Peter L. McMahon, Jakob Macke, Kyle Cranmer, Jiaxin Zhang, Haruko Wainwright, Adi Hanuka, Manuela Veloso, Samuel Assefa, Stephan Zheng, Avi Pfeffer

    Abstract: The original "Seven Motifs" set forth a roadmap of essential methods for the field of scientific computing, where a motif is an algorithmic method that captures a pattern of computation and data movement. We present the "Nine Motifs of Simulation Intelligence", a roadmap for the development and integration of the essential algorithms necessary for a merger of scientific computing, scientific simul… ▽ More

    Submitted 27 November, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  11. arXiv:2110.02483  [pdf, other

    stat.ML cs.CR cs.LG stat.AP

    Detecting and Quantifying Malicious Activity with Simulation-based Inference

    Authors: Andrew Gambardella, Bogdan State, Naeemullah Khan, Leo Tsourides, Philip H. S. Torr, Atılım Güneş Baydin

    Abstract: We propose the use of probabilistic programming techniques to tackle the malicious user identification problem in a recommendation algorithm. Probabilistic programming provides numerous advantages over other techniques, including but not limited to providing a disentangled representation of how malicious users acted under a structured model, as well as allowing for the quantification of damage cau… ▽ More

    Submitted 7 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Short version, appeared at ICML workshop on Socially Responsible Machine Learning 2021

  12. arXiv:2106.07780  [pdf, other

    cs.LG

    KL Guided Domain Adaptation

    Authors: A. Tuan Nguyen, Toan Tran, Yarin Gal, Philip H. S. Torr, Atılım Güneş Baydin

    Abstract: Domain adaptation is an important problem and often needed for real-world applications. In this problem, instead of i.i.d. training and testing datapoints, we assume that the source (training) data and the target (testing) data have different distributions. With that setting, the empirical risk minimization training procedure often does not perform well, since it does not account for the change in… ▽ More

    Submitted 14 March, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR2022

  13. arXiv:2102.05082  [pdf, other

    cs.LG

    Domain Invariant Representation Learning with Domain Density Transformations

    Authors: A. Tuan Nguyen, Toan Tran, Yarin Gal, Atılım Güneş Baydin

    Abstract: Domain generalization refers to the problem where we aim to train a model on data from a set of source domains so that the model can generalize to unseen target domains. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalize imperfectly to targ… ▽ More

    Submitted 15 February, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021

  14. Technology Readiness Levels for Machine Learning Systems

    Authors: Alexander Lavin, Ciarán M. Gilligan-Lee, Alessya Visnjic, Siddha Ganju, Dava Newman, Atılım Güneş Baydin, Sujoy Ganguly, Danny Lange, Amit Sharma, Stephan Zheng, Eric P. Xing, Adam Gibson, James Parr, Chris Mattmann, Yarin Gal

    Abstract: The development and deployment of machine learning (ML) systems can be executed easily with modern tools, but the process is typically rushed and means-to-an-end. The lack of diligence can lead to technical debt, scope creep and misaligned objectives, model misuse and failures, and expensive consequences. Engineering systems, on the other hand, follow well-defined processes and testing standards t… ▽ More

    Submitted 29 November, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

  15. arXiv:2012.14023  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG physics.data-an physics.space-ph

    Multi-Channel Auto-Calibration for the Atmospheric Imaging Assembly using Machine Learning

    Authors: Luiz F. G. dos Santos, Souvik Bose, Valentina Salvatelli, Brad Neuberg, Mark C. M. Cheung, Miho Janvier, Meng **, Yarin Gal, Paul Boerner, Atılım Güneş Baydin

    Abstract: Solar activity plays a quintessential role in influencing the interplanetary medium and space-weather around the Earth. Remote sensing instruments onboard heliophysics space missions provide a pool of information about the Sun's activity via the measurement of its magnetic field and the emission of light from the multi-layered, multi-thermal, and dynamic solar atmosphere. Extreme UV (EUV) waveleng… ▽ More

    Submitted 1 February, 2021; v1 submitted 27 December, 2020; originally announced December 2020.

    Comments: 12 pages, 7 figures, 8 tables. This is a pre-print of an article submitted and accepted by A&A Journal

    Journal ref: A&A 648, A53 (2021)

  16. arXiv:2012.12450  [pdf, other

    cs.LG stat.ML

    Towards Automated Satellite Conjunction Management with Bayesian Deep Learning

    Authors: Francesco Pinto, Giacomo Acciarini, Sascha Metz, Sarah Boufelja, Sylvester Kaczmarek, Klaus Merz, José A. Martinez-Heras, Francesca Letizia, Christopher Bridges, Atılım Güneş Baydin

    Abstract: After decades of space travel, low Earth orbit is a junkyard of discarded rocket bodies, dead satellites, and millions of pieces of debris from collisions and explosions. Objects in high enough altitudes do not re-enter and burn up in the atmosphere, but stay in orbit around Earth for a long time. With a speed of 28,000 km/h, collisions in these orbits can generate fragments and potentially trigge… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

    Comments: 7 pages, 2 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: AI for Earth Sciences Workshop at NeurIPS 2020, Vancouver, Canada

  17. arXiv:2012.10260  [pdf, other

    cs.LG physics.app-ph

    Spacecraft Collision Risk Assessment with Probabilistic Programming

    Authors: Giacomo Acciarini, Francesco Pinto, Sascha Metz, Sarah Boufelja, Sylvester Kaczmarek, Klaus Merz, José A. Martinez-Heras, Francesca Letizia, Christopher Bridges, Atılım Güneş Baydin

    Abstract: Over 34,000 objects bigger than 10 cm in length are known to orbit Earth. Among them, only a small percentage are active satellites, while the rest of the population is made of dead satellites, rocket bodies, and debris that pose a collision threat to operational spacecraft. Furthermore, the predicted growth of the space sector and the planned launch of megaconstellations will add even more comple… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 8 pages, 2 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020), Vancouver, Canada

  18. arXiv:2008.08424  [pdf, other

    cs.CV cs.GR cs.LG stat.ML

    AutoSimulate: (Quickly) Learning Synthetic Data Generation

    Authors: Harkirat Singh Behl, Atılım Güneş Baydin, Ran Gal, Philip H. S. Torr, Vibhav Vineet

    Abstract: Simulation is increasingly being used for generating large labelled datasets in many machine learning problems. Recent methods have focused on adjusting simulator parameters with the goal of maximising accuracy on a validation task, usually relying on REINFORCE-like gradient estimators. However these approaches are very expensive as they treat the entire data generation, model training, and valida… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

    Journal ref: European Conference on Computer Vision (ECCV) 2020

  19. arXiv:2005.07062  [pdf, other

    cs.LG stat.AP stat.ML

    Simulation-Based Inference for Global Health Decisions

    Authors: Christian Schroeder de Witt, Bradley Gram-Hansen, Nantas Nardelli, Andrew Gambardella, Rob Zinkov, Puneet Dokania, N. Siddharth, Ana Belen Espinosa-Gonzalez, Ara Darzi, Philip Torr, Atılım Güneş Baydin

    Abstract: The COVID-19 pandemic has highlighted the importance of in-silico epidemiological modelling in predicting the dynamics of infectious diseases to inform health policy and decision makers about suitable prevention and containment strategies. Work in this setting involves solving challenging inference and control problems in individual-based models of ever increasing complexity. Here we discuss recen… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Journal ref: ICML Workshop on Machine Learning for Global Health, Thirty-Seventh International Conference on Machine Learning (ICML 2020)

  20. arXiv:2003.02430  [pdf, other

    astro-ph.IM astro-ph.EP

    Accurate Machine Learning Atmospheric Retrieval via a Neural Network Surrogate Model for Radiative Transfer

    Authors: Michael D. Himes, Joseph Harrington, Adam D. Cobb, Atilim Gunes Baydin, Frank Soboczenski, Molly D. O'Beirne, Simone Zorzan, David C. Wright, Zacchaeus Scheffer, Shawn D. Domagal-Goldman, Giada N. Arney

    Abstract: Atmospheric retrieval determines the properties of an atmosphere based on its measured spectrum. The low signal-to-noise ratio of exoplanet observations require a Bayesian approach to determine posterior probability distributions of each model parameter, given observed spectra. This inference is computationally expensive, as it requires many executions of a costly radiative transfer (RT) simulatio… ▽ More

    Submitted 3 May, 2022; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: 16 pages, 4 figures, submitted to PSJ 3/4/2020, revised 1/22/2021, accepted 2/4/2021, published 4/25/2022. Updated to match the published manuscript. Himes et al. 2022, PSJ, 3, 91

    Journal ref: Planet. Sci. J. 3 (2022) 91

  21. arXiv:2002.04632  [pdf, other

    cs.LG hep-ex physics.data-an stat.ML

    Black-Box Optimization with Local Generative Surrogates

    Authors: Sergey Shirobokov, Vladislav Belavin, Michael Kagan, Andrey Ustyuzhanin, Atılım Güneş Baydin

    Abstract: We propose a novel method for gradient-based optimization of black-box simulators using differentiable local surrogate models. In fields such as physics and engineering, many processes are modeled with non-differentiable simulators with intractable likelihoods. Optimization of these forward models is particularly challenging, especially when the simulator is stochastic. To address such cases, we i… ▽ More

    Submitted 15 June, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Journal ref: In Advances in Neural Information Processing Systems 34 (NeurIPS), 2020

  22. arXiv:1911.13270  [pdf, other

    cs.LG cs.CV stat.ML

    Transflow Learning: Repurposing Flow Models Without Retraining

    Authors: Andrew Gambardella, Atılım Güneş Baydin, Philip H. S. Torr

    Abstract: It is well known that deep generative models have a rich latent space, and that it is possible to smoothly manipulate their outputs by traversing this latent space. Recently, architectures have emerged that allow for more complex manipulations, such as making an image look as though it were from a different class, or painted in a certain style. These methods typically require large amounts of trai… ▽ More

    Submitted 5 December, 2019; v1 submitted 29 November, 2019; originally announced November 2019.

  23. arXiv:1911.04008  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG physics.space-ph

    Auto-Calibration of Remote Sensing Solar Telescopes with Deep Learning

    Authors: Brad Neuberg, Souvik Bose, Valentina Salvatelli, Luiz F. G. dos Santos, Mark Cheung, Miho Janvier, Atilim Gunes Baydin, Yarin Gal, Meng **

    Abstract: As a part of NASA's Heliophysics System Observatory (HSO) fleet of satellites,the Solar Dynamics Observatory (SDO) has continuously monitored the Sun since2010. Ultraviolet (UV) and Extreme UV (EUV) instruments in orbit, such asSDO's Atmospheric Imaging Assembly (AIA) instrument, suffer time-dependent degradation which reduces instrument sensitivity. Accurate calibration for (E)UV instruments curr… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: 6 pages, 3 figures, Accepted at NeurIPS 2019 Workshop ML4PS

  24. arXiv:1911.04006  [pdf, other

    astro-ph.SR astro-ph.IM cs.LG physics.space-ph

    Using U-Nets to Create High-Fidelity Virtual Observations of the Solar Corona

    Authors: Valentina Salvatelli, Souvik Bose, Brad Neuberg, Luiz F. G. dos Santos, Mark Cheung, Miho Janvier, Atilim Gunes Baydin, Yarin Gal, Meng **

    Abstract: Understanding and monitoring the complex and dynamic processes of the Sun is important for a number of human activities on Earth and in space. For this reason, NASA's Solar Dynamics Observatory (SDO) has been continuously monitoring the multi-layered Sun's atmosphere in high-resolution since its launch in 2010, generating terabytes of observational data every day. The synergy between machine learn… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: 5 pages, 6 figures, Accepted at the NeurIPS 2019 Workshop ML4PS

  25. arXiv:1911.01490  [pdf, other

    astro-ph.SR cs.LG

    Single-Frame Super-Resolution of Solar Magnetograms: Investigating Physics-Based Metrics \& Losses

    Authors: Anna Jungbluth, Xavier Gitiaux, Shane A. Maloney, Carl Shneider, Paul J. Wright, Alfredo Kalaitzis, Michel Deudon, Atılım Güneş Baydin, Yarin Gal, Andrés Muñoz-Jaramillo

    Abstract: Breakthroughs in our understanding of physical phenomena have traditionally followed improvements in instrumentation. Studies of the magnetic field of the Sun, and its influence on the solar dynamo and space weather events, have benefited from improvements in resolution and measurement frequency of new instruments. However, in order to fully understand the solar cycle, high-quality data across tim… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

  26. arXiv:1911.01486  [pdf, other

    cs.LG astro-ph.SR eess.IV stat.ML

    Probabilistic Super-Resolution of Solar Magnetograms: Generating Many Explanations and Measuring Uncertainties

    Authors: Xavier Gitiaux, Shane A. Maloney, Anna Jungbluth, Carl Shneider, Paul J. Wright, Atılım Güneş Baydin, Michel Deudon, Yarin Gal, Alfredo Kalaitzis, Andrés Muñoz-Jaramillo

    Abstract: Machine learning techniques have been successfully applied to super-resolution tasks on natural images where visually pleasing results are sufficient. However in many scientific domains this is not adequate and estimations of errors and uncertainties are crucial. To address this issue we propose a Bayesian framework that decomposes uncertainties into epistemic and aleatoric uncertainties. We test… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

  27. arXiv:1910.11961  [pdf, other

    cs.LG stat.ML

    Attention for Inference Compilation

    Authors: William Harvey, Andreas Munk, Atılım Güneş Baydin, Alexander Bergholm, Frank Wood

    Abstract: We present a new approach to automatic amortized inference in universal probabilistic programs which improves performance compared to current methods. Our approach is a variation of inference compilation (IC) which leverages deep neural networks to approximate a posterior distribution over latent variables in a probabilistic program. A challenge with existing IC network architectures is that they… ▽ More

    Submitted 25 October, 2019; originally announced October 2019.

  28. arXiv:1910.11950  [pdf, other

    cs.LG stat.ML

    Probabilistic Surrogate Networks for Simulators with Unbounded Randomness

    Authors: Andreas Munk, Berend Zwartsenberg, Adam Ścibior, Atılım Güneş Baydin, Andrew Stewart, Goran Fernlund, Anoush Poursartip, Frank Wood

    Abstract: We present a framework for automatically structuring and training fast, approximate, deep neural surrogates of stochastic simulators. Unlike traditional approaches to surrogate modeling, our surrogates retain the interpretable structure and control flow of the reference simulator. Our surrogates target stochastic simulators where the number of random variables itself can be stochastic and potentia… ▽ More

    Submitted 20 January, 2023; v1 submitted 25 October, 2019; originally announced October 2019.

  29. arXiv:1910.09056  [pdf, other

    cs.LG cs.AI stat.ML

    Amortized Rejection Sampling in Universal Probabilistic Programming

    Authors: Saeid Naderiparizi, Adam Ścibior, Andreas Munk, Mehrdad Ghadiri, Atılım Güneş Baydin, Bradley Gram-Hansen, Christian Schroeder de Witt, Robert Zinkov, Philip H. S. Torr, Tom Rainforth, Yee Whye Teh, Frank Wood

    Abstract: Naive approaches to amortized inference in probabilistic programs with unbounded loops can produce estimators with infinite variance. This is particularly true of importance sampling inference in programs that explicitly include rejection sampling as part of the user-programmed generative procedure. In this paper we develop a new and efficient amortized importance sampling estimator. We prove fini… ▽ More

    Submitted 28 March, 2022; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: AISTATS 2022 camera ready

  30. arXiv:1910.06243  [pdf, other

    stat.ML cs.LG

    Introducing an Explicit Symplectic Integration Scheme for Riemannian Manifold Hamiltonian Monte Carlo

    Authors: Adam D. Cobb, Atılım Güneş Baydin, Andrew Markham, Stephen J. Roberts

    Abstract: We introduce a recent symplectic integration scheme derived for solving physically motivated systems with non-separable Hamiltonians. We show its relevance to Riemannian manifold Hamiltonian Monte Carlo (RMHMC) and provide an alternative to the currently used generalised leapfrog symplectic integrator, which relies on solving multiple fixed point iterations to convergence. Via this approach, we ar… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

  31. arXiv:1910.03085  [pdf, other

    physics.space-ph astro-ph.IM cs.LG

    Correlation of Auroral Dynamics and GNSS Scintillation with an Autoencoder

    Authors: Kara Lamb, Garima Malhotra, Athanasios Vlontzos, Edward Wagstaff, Atılım Günes Baydin, Anahita Bhiwandiwalla, Yarin Gal, Alfredo Kalaitzis, Anthony Reina, Asti Bhatt

    Abstract: High energy particles originating from solar activity travel along the the Earth's magnetic field and interact with the atmosphere around the higher latitudes. These interactions often manifest as aurora in the form of visible light in the Earth's ionosphere. These interactions also result in irregularities in the electron density, which cause disruptions in the amplitude and phase of the radio si… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: Four first authors contributed equally; Paper accepted in Machine Learning for the Physical Sciences workshop of NeurIPS 2019; Camera Ready Version to Follow

  32. arXiv:1910.03019  [pdf, other

    eess.IV cs.LG stat.ML

    Flood Detection On Low Cost Orbital Hardware

    Authors: Gonzalo Mateo-Garcia, Silviu Oprea, Lewis Smith, Josh Veitch-Michaelis, Guy Schumann, Yarin Gal, Atılım Güneş Baydin, Dietmar Backes

    Abstract: Satellite imaging is a critical technology for monitoring and responding to natural disasters such as flooding. Despite the capabilities of modern satellites, there is still much to be desired from the perspective of first response organisations like UNICEF. Two main challenges are rapid access to data, and the ability to automatically identify flooded regions in images. We describe a prototypical… ▽ More

    Submitted 15 January, 2020; v1 submitted 4 October, 2019; originally announced October 2019.

    Journal ref: Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop, 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  33. arXiv:1910.01570  [pdf, other

    cs.LG stat.ML

    Prediction of GNSS Phase Scintillations: A Machine Learning Approach

    Authors: Kara Lamb, Garima Malhotra, Athanasios Vlontzos, Edward Wagstaff, Atılım Günes Baydin, Anahita Bhiwandiwalla, Yarin Gal, Alfredo Kalaitzis, Anthony Reina, Asti Bhatt

    Abstract: A Global Navigation Satellite System (GNSS) uses a constellation of satellites around the earth for accurate navigation, timing, and positioning. Natural phenomena like space weather introduce irregularities in the Earth's ionosphere, disrupting the propagation of the radio signals that GNSS relies upon. Such disruptions affect both the amplitude and the phase of the propagated waves. No physics-b… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: First 4 authors contributed equally Paper accepted in Machine Learning for the Physical Sciences workshop of NeurIPS 2019 Camera Ready Version to Follow

  34. arXiv:1907.03382  [pdf, other

    cs.LG cs.PF stat.ML

    Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

    Authors: Atılım Güneş Baydin, Lei Shao, Wahid Bhimji, Lukas Heinrich, Lawrence Meadows, Jialin Liu, Andreas Munk, Saeid Naderiparizi, Bradley Gram-Hansen, Gilles Louppe, Mingfei Ma, Xiaohui Zhao, Philip Torr, Victor Lee, Kyle Cranmer, Prabhat, Frank Wood

    Abstract: Probabilistic programming languages (PPLs) are receiving widespread attention for performing Bayesian inference in complex generative models. However, applications to science remain limited because of the impracticability of rewriting complex scientific simulators in a PPL, the computational cost of inference, and the lack of scalable implementations. To address these, we present a novel PPL frame… ▽ More

    Submitted 27 August, 2019; v1 submitted 7 July, 2019; originally announced July 2019.

    Comments: 14 pages, 8 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC19), November 17--22, 2019

  35. arXiv:1905.12432  [pdf, other

    stat.ML cs.LG

    Hijacking Malaria Simulators with Probabilistic Programming

    Authors: Bradley Gram-Hansen, Christian Schröder de Witt, Tom Rainforth, Philip H. S. Torr, Yee Whye Teh, Atılım Güneş Baydin

    Abstract: Epidemiology simulations have become a fundamental tool in the fight against the epidemics of various infectious diseases like AIDS and malaria. However, the complicated and stochastic nature of these simulators can mean their output is difficult to interpret, which reduces their usefulness to policymakers. In this paper, we introduce an approach that allows one to treat a large class of populatio… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 6 pages, 3 figures, Accepted at the International Conference on Machine Learning AI for Social Good Workshop, Long Beach, United States, 2019

    Journal ref: ICML Workshop on AI for Social Good, 2018

  36. arXiv:1905.10659  [pdf, other

    astro-ph.EP cs.LG

    An Ensemble of Bayesian Neural Networks for Exoplanetary Atmospheric Retrieval

    Authors: Adam D. Cobb, Michael D. Himes, Frank Soboczenski, Simone Zorzan, Molly D. O'Beirne, Atılım Güneş Baydin, Yarin Gal, Shawn D. Domagal-Goldman, Giada N. Arney, Daniel Angerhausen

    Abstract: Machine learning is now used in many areas of astrophysics, from detecting exoplanets in Kepler transit signals to removing telescope systematics. Recent work demonstrated the potential of using machine learning algorithms for atmospheric retrieval by implementing a random forest to perform retrievals in seconds that are consistent with the traditional, computationally-expensive nested-sampling re… ▽ More

    Submitted 25 May, 2019; originally announced May 2019.

  37. arXiv:1905.07435  [pdf, other

    cs.LG cs.AI stat.ML

    Alpha MAML: Adaptive Model-Agnostic Meta-Learning

    Authors: Harkirat Singh Behl, Atılım Güneş Baydin, Philip H. S. Torr

    Abstract: Model-agnostic meta-learning (MAML) is a meta-learning technique to train a model on a multitude of learning tasks in a way that primes the model for few-shot learning of new tasks. The MAML algorithm performs well on few-shot learning problems in classification, regression, and fine-tuning of policy gradients in reinforcement learning, but comes with the need for costly hyperparameter tuning for… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 6th ICML Workshop on Automated Machine Learning (2019)

    Journal ref: ICML Workshop on Automated Machine Learning (2019)

  38. arXiv:1811.03390  [pdf, other

    astro-ph.EP cs.LG

    Bayesian Deep Learning for Exoplanet Atmospheric Retrieval

    Authors: Frank Soboczenski, Michael D. Himes, Molly D. O'Beirne, Simone Zorzan, Atilim Gunes Baydin, Adam D. Cobb, Yarin Gal, Daniel Angerhausen, Massimo Mascaro, Giada N. Arney, Shawn D. Domagal-Goldman

    Abstract: Over the past decade, the study of extrasolar planets has evolved rapidly from plain detection and identification to comprehensive categorization and characterization of exoplanet systems and their atmospheres. Atmospheric retrieval, the inverse modeling technique used to determine an exoplanetary atmosphere's temperature structure and composition from an observed spectrum, is both time-consuming… ▽ More

    Submitted 2 December, 2018; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Third workshop on Bayesian Deep Learning (NeurIPS 2018), Montreal, Canada

    MSC Class: 85A20; 68T05 ACM Class: J.2; I.2.6

  39. arXiv:1807.07706  [pdf, other

    cs.LG hep-ph physics.data-an stat.ML

    Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

    Authors: Atılım Güneş Baydin, Lukas Heinrich, Wahid Bhimji, Lei Shao, Saeid Naderiparizi, Andreas Munk, Jialin Liu, Bradley Gram-Hansen, Gilles Louppe, Lawrence Meadows, Philip Torr, Victor Lee, Prabhat, Kyle Cranmer, Frank Wood

    Abstract: We present a novel probabilistic programming framework that couples directly to existing large-scale simulators through a cross-platform probabilistic execution protocol, which allows general-purpose inference engines to record and control random number draws within simulators in a language-agnostic way. The execution of existing simulators as probabilistic programs enables highly interpretable po… ▽ More

    Submitted 17 February, 2020; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: 20 pages, 9 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: In Advances in Neural Information Processing Systems 33 (NeurIPS), Vancouver, Canada, 2019

  40. arXiv:1712.07901  [pdf, other

    cs.AI physics.data-an

    Improvements to Inference Compilation for Probabilistic Programming in Large-Scale Scientific Simulators

    Authors: Mario Lezcano Casado, Atilim Gunes Baydin, David Martinez Rubio, Tuan Anh Le, Frank Wood, Lukas Heinrich, Gilles Louppe, Kyle Cranmer, Karen Ng, Wahid Bhimji, Prabhat

    Abstract: We consider the problem of Bayesian inference in the family of probabilistic models implicitly defined by stochastic generative models of data. In scientific fields ranging from population biology to cosmology, low-level mechanistic components are composed to create complex generative models. These models lead to intractable likelihoods and are typically non-differentiable, which poses challenges… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: 7 pages, 2 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

  41. arXiv:1703.04782  [pdf, other

    cs.LG stat.ML

    Online Learning Rate Adaptation with Hypergradient Descent

    Authors: Atilim Gunes Baydin, Robert Cornish, David Martinez Rubio, Mark Schmidt, Frank Wood

    Abstract: We introduce a general method for improving the convergence rate of gradient-based optimizers that is easy to implement and works well in practice. We demonstrate the effectiveness of the method in a range of optimization problems by applying it to stochastic gradient descent, stochastic gradient descent with Nesterov momentum, and Adam, showing that it significantly reduces the need for the manua… ▽ More

    Submitted 25 February, 2018; v1 submitted 14 March, 2017; originally announced March 2017.

    Comments: 11 pages, 4 figures

    MSC Class: 68T05 ACM Class: G.1.6; I.2.6

    Journal ref: In Sixth International Conference on Learning Representations (ICLR), Vancouver, Canada, April 30 -- May 3, 2018. https://openreview.net/forum?id=BkrsAzWAb

  42. arXiv:1703.00868  [pdf, other

    cs.LG cs.CV stat.ML

    Using Synthetic Data to Train Neural Networks is Model-Based Reasoning

    Authors: Tuan Anh Le, Atilim Gunes Baydin, Robert Zinkov, Frank Wood

    Abstract: We draw a formal connection between using synthetic training data to optimize neural network parameters and approximate, Bayesian, model-based reasoning. In particular, training a neural network using synthetic data can be viewed as learning a proposal distribution generator for approximate inference in the synthetic-data generative model. We demonstrate this connection in a recognition task where… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.

    Comments: 8 pages, 4 figures

    MSC Class: 68T05; 68T10 ACM Class: I.2.6; I.7.5

  43. arXiv:1611.03777  [pdf, ps, other

    cs.LG stat.ML

    Tricks from Deep Learning

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: The deep learning community has devised a diverse set of methods to make gradient optimization, using large datasets, of large and highly complex models with deeply cascaded nonlinearities, practical. Taken as a whole, these methods constitute a breakthrough, allowing computational structures which are quite wide, very deep, and with an enormous number and variety of free parameters to be effectiv… ▽ More

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  44. arXiv:1611.03423  [pdf, ps, other

    cs.MS cs.LG

    DiffSharp: An AD Library for .NET Languages

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: DiffSharp is an algorithmic differentiation or automatic differentiation (AD) library for the .NET ecosystem, which is targeted by the C# and F# languages, among others. The library has been designed with machine learning applications in mind, allowing very succinct implementations of models and optimization routines. DiffSharp is implemented in F# and exposes forward and reverse AD operators as g… ▽ More

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  45. arXiv:1610.09900  [pdf, other

    cs.AI cs.LG stat.ML

    Inference Compilation and Universal Probabilistic Programming

    Authors: Tuan Anh Le, Atilim Gunes Baydin, Frank Wood

    Abstract: We introduce a method for using deep neural networks to amortize the cost of inference in models from the family induced by universal probabilistic programming languages, establishing a framework that combines the strengths of probabilistic programming and deep learning methods. We call what we do "compilation of inference" because our method transforms a denotational specification of an inference… ▽ More

    Submitted 2 March, 2017; v1 submitted 31 October, 2016; originally announced October 2016.

    Comments: 11 pages, 6 figures

    MSC Class: 68T37; 68T05 ACM Class: G.3; I.2.6

    Journal ref: In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS 2017), 54:1338--1348. Proceedings of Machine Learning Research. Fort Lauderdale, FL, USA: PMLR

  46. arXiv:1511.07727  [pdf, ps, other

    cs.MS

    DiffSharp: Automatic Differentiation Library

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: In this paper we introduce DiffSharp, an automatic differentiation (AD) library designed with machine learning in mind. AD is a family of techniques that evaluate derivatives at machine precision with only a small constant factor of overhead, by systematically applying the chain rule of calculus at the elementary operator level. DiffSharp aims to make an extensive array of AD techniques available,… ▽ More

    Submitted 26 November, 2015; v1 submitted 24 November, 2015; originally announced November 2015.

    Comments: 5 pages, 1 figure, minor fixes, added coauthor

    MSC Class: 68T05; 68W30 ACM Class: I.2.6; G.1.4

  47. arXiv:1502.05767  [pdf, ps, other

    cs.SC cs.LG stat.ML

    Automatic differentiation in machine learning: a survey

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter, Alexey Andreyevich Radul, Jeffrey Mark Siskind

    Abstract: Derivatives, mostly in the form of gradients and Hessians, are ubiquitous in machine learning. Automatic differentiation (AD), also called algorithmic differentiation or simply "autodiff", is a family of techniques similar to but more general than backpropagation for efficiently and accurately evaluating derivatives of numeric functions expressed as computer programs. AD is a small but established… ▽ More

    Submitted 5 February, 2018; v1 submitted 19 February, 2015; originally announced February 2015.

    Comments: 43 pages, 5 figures

    MSC Class: 68W30; 65D25; 68T05 ACM Class: G.1.4; I.2.6

    Journal ref: Atilim Gunes Baydin, Barak A. Pearlmutter, Alexey Andreyevich Radul, Jeffrey Mark Siskind. Automatic differentiation in machine learning: a survey. The Journal of Machine Learning Research, 18(153):1--43, 2018

  48. arXiv:1409.7316  [pdf, other

    cs.DL cs.MS

    An Analysis of Publication Venues for Automatic Differentiation Research

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter

    Abstract: We present the results of our analysis of publication venues for papers on automatic differentiation (AD), covering academic journals and conference proceedings. Our data are collected from the AD publications database maintained by the autodiff.org community website. The database is purpose-built for the AD field and is expanding via submissions by AD researchers. Therefore, it provides a relativ… ▽ More

    Submitted 25 September, 2014; originally announced September 2014.

    Comments: 6 pages, 3 figures

    MSC Class: 00A15

  49. A semantic network-based evolutionary algorithm for computational creativity

    Authors: Atilim Gunes Baydin, Ramon Lopez de Mantaras, Santiago Ontanon

    Abstract: We introduce a novel evolutionary algorithm (EA) with a semantic network-based representation. For enabling this, we establish new formulations of EA variation operators, crossover and mutation, that we adapt to work on semantic networks. The algorithm employs commonsense reasoning to ensure all operations preserve the meaningfulness of the networks, using ConceptNet and WordNet knowledge bases. T… ▽ More

    Submitted 14 July, 2014; v1 submitted 30 April, 2014; originally announced April 2014.

    Comments: 20 pages, 14 figures, revision after reviews, changed title

    MSC Class: 92D15; 91E40; 68T20; 68T30 ACM Class: I.2.4; I.2.6; G.1.6; J.4; J.3

    Journal ref: Evolutionary Intelligence, 8(1):3-21 (2015)

  50. arXiv:1404.7456  [pdf, other

    cs.LG cs.SC stat.ML

    Automatic Differentiation of Algorithms for Machine Learning

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter

    Abstract: Automatic differentiation---the mechanical transformation of numeric computer programs to calculate derivatives efficiently and accurately---dates to the origin of the computer age. Reverse mode automatic differentiation both antedates and generalizes the method of backwards propagation of errors used in machine learning. Despite this, practitioners in a variety of fields, including machine learni… ▽ More

    Submitted 28 April, 2014; originally announced April 2014.

    Comments: 7 pages, 1 figure

    MSC Class: 68W30; 65D25; 68T05 ACM Class: G.1.4; I.2.6