Skip to main content

Showing 1–10 of 10 results for author: Char, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12416  [pdf, other

    physics.plasm-ph cs.LG

    Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks

    Authors: Ian Char, Youngseog Chung, Joseph Abbate, Egemen Kolemen, Jeff Schneider

    Abstract: Although tokamaks are one of the most promising devices for realizing nuclear fusion as an energy source, there are still key obstacles when it comes to understanding the dynamics of the plasma and controlling it. As such, it is crucial that high quality models are developed to assist in overcoming these obstacles. In this work, we take an entirely data driven approach to learn such a model. In pa… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  2. arXiv:2307.05891  [pdf, other

    cs.LG cs.AI

    PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks

    Authors: Ian Char, Jeff Schneider

    Abstract: Deep reinforcement learning (RL) has shown immense potential for learning to control systems through data alone. However, one challenge deep RL faces is that the full state of the system is often not observable. When this is the case, the policy needs to leverage the history of observations to infer the current state. At the same time, differences between the training and testing environments make… ▽ More

    Submitted 25 October, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  3. arXiv:2212.09510  [pdf, other

    stat.ML cs.AI cs.LG

    Near-optimal Policy Identification in Active Reinforcement Learning

    Authors: Xiang Li, Viraj Mehta, Johannes Kirschner, Ian Char, Willie Neiswanger, Jeff Schneider, Andreas Krause, Ilija Bogunovic

    Abstract: Many real-world reinforcement learning tasks require control of complex dynamical systems that involve both costly data acquisition processes and large state spaces. In cases where the transition dynamics can be readily evaluated at specified states (e.g., via a simulator), agents can operate in what is often referred to as planning with a \emph{generative model}. We propose the AE-LSVI algorithm… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  4. arXiv:2210.04642  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Exploration via Planning for Information about the Optimal Trajectory

    Authors: Viraj Mehta, Ian Char, Joseph Abbate, Rory Conlin, Mark D. Boyer, Stefano Ermon, Jeff Schneider, Willie Neiswanger

    Abstract: Many potential applications of reinforcement learning (RL) are stymied by the large numbers of samples required to learn an effective policy. This is especially true when applying RL to real-world control tasks, e.g. in the sciences or robotics, where executing a policy in the environment is costly. In popular RL algorithms, agents typically explore either by adding stochasticity to a reward-maxim… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Conference paper at Neurips 2022. Code available at https://github.com/fusion-ml/trajectory-information-rl. arXiv admin note: text overlap with arXiv:2112.05244

  5. arXiv:2205.10439  [pdf, other

    cs.LG

    How Useful are Gradients for OOD Detection Really?

    Authors: Conor Igoe, Youngseog Chung, Ian Char, Jeff Schneider

    Abstract: One critical challenge in deploying highly performant machine learning models in real-life applications is out of distribution (OOD) detection. Given a predictive model which is accurate on in distribution (ID) data, an OOD detection system will further equip the model with the option to defer prediction when the input is novel and the model has little confidence in prediction. There has been some… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  6. arXiv:2204.12026  [pdf, other

    cs.LG

    BATS: Best Action Trajectory Stitching

    Authors: Ian Char, Viraj Mehta, Adam Villaflor, John M. Dolan, Jeff Schneider

    Abstract: The problem of offline reinforcement learning focuses on learning a good policy from a log of environment interactions. Past efforts for develo** algorithms in this area have revolved around introducing constraints to online reinforcement learning algorithms to ensure the actions of the learned policy are constrained to the logged data. In this work, we explore an alternative approach by plannin… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to NeurIPS Offline RL Workshop 2021

  7. arXiv:2109.10254  [pdf, other

    cs.LG stat.ML

    Uncertainty Toolbox: an Open-Source Library for Assessing, Visualizing, and Improving Uncertainty Quantification

    Authors: Youngseog Chung, Ian Char, Han Guo, Jeff Schneider, Willie Neiswanger

    Abstract: With increasing deployment of machine learning systems in various real-world tasks, there is a greater need for accurate quantification of predictive uncertainty. While the common goal in uncertainty quantification (UQ) in machine learning is to approximate the true distribution of the target data, many works in UQ tend to be disjoint in the evaluation metrics utilized, and disparate implementatio… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

  8. arXiv:2011.09588  [pdf, other

    cs.LG stat.ML

    Beyond Pinball Loss: Quantile Methods for Calibrated Uncertainty Quantification

    Authors: Youngseog Chung, Willie Neiswanger, Ian Char, Jeff Schneider

    Abstract: Among the many ways of quantifying uncertainty in a regression setting, specifying the full quantile function is attractive, as quantiles are amenable to interpretation and evaluation. A model that predicts the true conditional quantiles for each input, at all quantile levels, presents a correct and efficient representation of the underlying uncertainty. To achieve this, many current quantile-base… ▽ More

    Submitted 9 December, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: Appears in Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  9. Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction

    Authors: Viraj Mehta, Ian Char, Willie Neiswanger, Youngseog Chung, Andrew Oakleigh Nelson, Mark D Boyer, Egemen Kolemen, Jeff Schneider

    Abstract: We introduce Neural Dynamical Systems (NDS), a method of learning dynamical models in various gray-box settings which incorporates prior knowledge in the form of systems of ordinary differential equations. NDS uses neural networks to estimate free parameters of the system, predicts residual terms, and numerically integrates over time to predict future states. A key insight is that many real dynami… ▽ More

    Submitted 27 April, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

  10. arXiv:2001.01793  [pdf, other

    cs.LG stat.ML

    Offline Contextual Bayesian Optimization for Nuclear Fusion

    Authors: Youngseog Chung, Ian Char, Willie Neiswanger, Kirthevasan Kandasamy, Andrew Oakleigh Nelson, Mark D Boyer, Egemen Kolemen, Jeff Schneider

    Abstract: Nuclear fusion is regarded as the energy of the future since it presents the possibility of unlimited clean energy. One obstacle in utilizing fusion as a feasible energy source is the stability of the reaction. Ideally, one would have a controller for the reactor that makes actions in response to the current state of the plasma in order to prolong the reaction as long as possible. In this work, we… ▽ More

    Submitted 6 January, 2020; originally announced January 2020.

    Comments: 6 pages, 2 figures, Machine Learning and Physical Sciences workshop