Skip to main content

Showing 1–16 of 16 results for author: Tigas, P

.
  1. arXiv:2406.10023  [pdf, other

    cs.LG cs.CL stat.ML

    Deep Bayesian Active Learning for Preference Modeling in Large Language Models

    Authors: Luckeciano C. Melo, Panagiotis Tigas, Alessandro Abate, Yarin Gal

    Abstract: Leveraging human preferences for steering the behavior of Large Language Models (LLMs) has demonstrated notable success in recent years. Nonetheless, data selection and labeling are still a bottleneck for these systems, particularly at large scale. Hence, selecting the most informative points for acquiring human feedback may considerably reduce the cost of preference labeling and unleash the furth… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.03209  [pdf, other

    cs.LG cs.AI

    Challenges and Considerations in the Evaluation of Bayesian Causal Discovery

    Authors: Amir Mohammad Karimi Mamaghan, Panagiotis Tigas, Karl Henrik Johansson, Yarin Gal, Yashas Annadani, Stefan Bauer

    Abstract: Representing uncertainty in causal discovery is a crucial component for experimental design, and more broadly, for safe and reliable causal decision making. Bayesian Causal Discovery (BCD) offers a principled approach to encapsulating this uncertainty. Unlike non-Bayesian causal discovery, which relies on a single estimated causal graph and model parameters for assessment, evaluating BCD presents… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2405.16718  [pdf, other

    cs.LG cs.AI

    Amortized Active Causal Induction with Deep Reinforcement Learning

    Authors: Yashas Annadani, Panagiotis Tigas, Stefan Bauer, Adam Foster

    Abstract: We present Causal Amortized Active Structure Learning (CAASL), an active intervention design policy that can select interventions that are adaptive, real-time and that does not require access to the likelihood. This policy, an amortized network based on the transformer, is trained with reinforcement learning on a simulator of the design environment, and a reward function that measures how close th… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  4. arXiv:2302.10607  [pdf, other

    cs.LG cs.AI stat.ME

    Differentiable Multi-Target Causal Bayesian Experimental Design

    Authors: Yashas Annadani, Panagiotis Tigas, Desi R. Ivanova, Andrew Jesson, Yarin Gal, Adam Foster, Stefan Bauer

    Abstract: We introduce a gradient-based approach for the problem of Bayesian optimal experimental design to learn causal models in a batch setting -- a critical component for causal discovery from finite data where interventions can be costly or risky. Existing methods rely on greedy approximations to construct a batch of experiments while using black-box methods to optimize over a single target-state pair… ▽ More

    Submitted 2 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Camera-ready version ICML 2023

  5. arXiv:2207.13699  [pdf, other

    cs.LG cs.AI

    Modelling non-reinforced preferences using selective attention

    Authors: Noor Sajid, Panagiotis Tigas, Zafeirios Fountas, Qinghai Guo, Alexey Zakharov, Lancelot Da Costa

    Abstract: How can artificial agents learn non-reinforced preferences to continuously adapt their behaviour to a changing environment? We decompose this question into two challenges: ($i$) encoding diverse memories and ($ii$) selectively attending to these for preference formation. Our proposed \emph{no}n-\emph{re}inforced preference learning mechanism using selective attention, \textsc{Nore}, addresses both… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 4 pages, 3 figures - Workshop Track: 1st Conference on Lifelong Learning Agents, 2022

  6. arXiv:2205.12734  [pdf, other

    physics.space-ph astro-ph.IM astro-ph.SR cs.LG

    Global geomagnetic perturbation forecasting using Deep Learning

    Authors: Vishal Upendran, Panagiotis Tigas, Banafsheh Ferdousi, Teo Bloch, Mark C. M. Cheung, Siddha Ganju, Asti Bhatt, Ryan M. McGranaghan, Yarin Gal

    Abstract: Geomagnetically Induced Currents (GICs) arise from spatio-temporal changes to Earth's magnetic field which arise from the interaction of the solar wind with Earth's magnetosphere, and drive catastrophic destruction to our technologically dependent society. Hence, computational models to forecast GICs globally with large forecast horizon, high spatial resolution and temporal cadence are of increasi… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: 23 pages, 8 figures, 5 tables; accepted for publication in AGU: Spaceweather

  7. arXiv:2203.02016  [pdf, other

    cs.LG cs.AI stat.ML

    Interventions, Where and How? Experimental Design for Causal Models at Scale

    Authors: Panagiotis Tigas, Yashas Annadani, Andrew Jesson, Bernhard Schölkopf, Yarin Gal, Stefan Bauer

    Abstract: Causal discovery from observational and interventional data is challenging due to limited data and non-identifiability: factors that introduce uncertainty in estimating the underlying structural causal model (SCM). Selecting experiments (interventions) based on the uncertainty arising from both factors can expedite the identification of the SCM. Existing methods in experimental design for causal d… ▽ More

    Submitted 21 October, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Presented at the thirty-sixth Conference on Neural Information Processing Systems (2022)

  8. arXiv:2111.02275  [pdf, other

    cs.LG stat.ML

    Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

    Authors: Andrew Jesson, Panagiotis Tigas, Joost van Amersfoort, Andreas Kirsch, Uri Shalit, Yarin Gal

    Abstract: Estimating personalized treatment effects from high-dimensional observational data is essential in situations where experimental designs are infeasible, unethical, or expensive. Existing approaches rely on fitting deep models on outcomes observed for treated and control populations. However, when measuring individual outcomes is costly, as is the case of a tumor biopsy, a sample-efficient strategy… ▽ More

    Submitted 1 February, 2022; v1 submitted 3 November, 2021; originally announced November 2021.

    Comments: 24 pages, 8 Figures, 5 tables, NeurIPS 2021

  9. arXiv:2107.07455  [pdf, other

    cs.LG cs.AI stat.ML

    Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

    Authors: Andrey Malinin, Neil Band, Ganshin, Alexander, German Chesnokov, Yarin Gal, Mark J. F. Gales, Alexey Noskov, Andrey Ploskonosov, Liudmila Prokhorenkova, Ivan Provilkov, Vatsal Raina, Vyas Raina, Roginskiy, Denis, Mariya Shmatova, Panos Tigas, Boris Yangel

    Abstract: There has been significant research done on develo** methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined develo** standard datasets and benchmarks for assessing these approaches. Additionally, most work on uncertainty estimation and robustness has developed new techniques based on small-scale regression or image class… ▽ More

    Submitted 11 February, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

  10. arXiv:2106.08867  [pdf

    cs.HC cs.MM

    Latent Map**s: Generating Open-Ended Expressive Map**s Using Variational Autoencoders

    Authors: Tim Murray-Browne, Panagiotis Tigas

    Abstract: In many contexts, creating map**s for gestural interactions can form part of an artistic process. Creators seeking a map** that is expressive, novel, and affords them a sense of authorship may not know how to program it up in a signal processing patch. Tools like Wekinator and MIMIC allow creators to use supervised machine learning to learn map**s from example input/output pairings. However,… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Published at the International Conference on New Interfaces for Musical Expression, June 2021. 3000 word short paper. 5 figures plus video which may be seen at https://timmb.com/sonified-body-r-and-d-lab

  11. arXiv:2106.04316  [pdf, other

    cs.AI q-bio.NC

    Exploration and preference satisfaction trade-off in reward-free learning

    Authors: Noor Sajid, Panagiotis Tigas, Alexey Zakharov, Zafeirios Fountas, Karl Friston

    Abstract: Biological agents have meaningful interactions with their environment despite the absence of immediate reward signals. In such instances, the agent can learn preferred modes of behaviour that lead to predictable states -- necessary for survival. In this paper, we pursue the notion that this learnt behaviour can be a consequence of reward-free preference learning that ensures an appropriate trade-o… ▽ More

    Submitted 18 July, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: 23 pages, 15 figures

    Journal ref: Proceedings of the Unsupervised Reinforcement Learning Workshop ICML 2021

  12. arXiv:2102.01447  [pdf, other

    physics.geo-ph astro-ph.SR cs.LG physics.space-ph

    Global Earth Magnetic Field Modeling and Forecasting with Spherical Harmonics Decomposition

    Authors: Panagiotis Tigas, Téo Bloch, Vishal Upendran, Banafsheh Ferdoushi, Mark C. M. Cheung, Siddha Ganju, Ryan M. McGranaghan, Yarin Gal, Asti Bhatt

    Abstract: Modeling and forecasting the solar wind-driven global magnetic field perturbations is an open challenge. Current approaches depend on simulations of computationally demanding models like the Magnetohydrodynamics (MHD) model or sampling spatially and temporally through sparse ground-based stations (SuperMAG). In this paper, we develop a Deep Learning model that forecasts in Spherical Harmonics spac… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020), Vancouver, Canada

  13. arXiv:2101.07579  [pdf, other

    cs.AI cs.GR cs.LG

    Spatial Assembly: Generative Architecture With Reinforcement Learning, Self Play and Tree Search

    Authors: Panagiotis Tigas, Tyson Hosmer

    Abstract: With this work, we investigate the use of Reinforcement Learning (RL) for the generation of spatial assemblies, by combining ideas from Procedural Generation algorithms (Wave Function Collapse algorithm (WFC)) and RL for Game Solving. WFC is a Generative Design algorithm, inspired by Constraint Solving. In WFC, one defines a set of tiles/blocks and constraints and the algorithm generates an assemb… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: Workshop on Machine Learning for Creativity and Design at the 34rd Conference on Neural Information Processing Systems (NeurIPS 2020)

  14. arXiv:2006.14911  [pdf, other

    cs.LG cs.RO stat.ML

    Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts?

    Authors: Angelos Filos, Panagiotis Tigas, Rowan McAllister, Nicholas Rhinehart, Sergey Levine, Yarin Gal

    Abstract: Out-of-training-distribution (OOD) scenarios are a common challenge of learning agents at deployment, typically leading to arbitrary deductions and poorly-informed decisions. In principle, detection of and adaptation to OOD scenes can mitigate their adverse effects. In this paper, we highlight the limitations of current approaches to novel driving scenes and propose an epistemic uncertainty-aware… ▽ More

    Submitted 2 September, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: The first two authors contributed equally. Accepted at ICML 2020. Supplementary videos and code available at: https://sites.google.com/view/av-detect-recover-adapt

  15. arXiv:1905.07444  [pdf, other

    cs.CR cs.LG stat.ML

    Percival: Making In-Browser Perceptual Ad Blocking Practical With Deep Learning

    Authors: Zain ul abi Din, Panagiotis Tigas, Samuel T. King, Benjamin Livshits

    Abstract: In this paper we present Percival, a browser-embedded, lightweight, deep learning-powered ad blocker. Percival embeds itself within the browser's image rendering pipeline, which makes it possible to intercept every image obtained during page execution and to perform blocking based on applying machine learning for image classification to flag potential ads. Our implementation inside both Chromium a… ▽ More

    Submitted 19 May, 2020; v1 submitted 17 May, 2019; originally announced May 2019.

    Comments: 13 Pages

  16. arXiv:1201.6251  [pdf, other

    cs.HC cs.LG cs.SD

    Real-time jam-session support system

    Authors: Panagiotis Tigas

    Abstract: We propose a method for the problem of real time chord accompaniment of improvised music. Our implementation can learn an underlying structure of the musical performance and predict next chord. The system uses Hidden Markov Model to find the most probable chord sequence for the played melody and then a Variable Order Markov Model is used to a) learn the structure (if any) and b) predict next chord… ▽ More

    Submitted 27 January, 2012; originally announced January 2012.