Skip to main content

Showing 1–12 of 12 results for author: Hansen, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.15585  [pdf, other

    econ.EM stat.ML

    Inference for Regression with Variables Generated from Unstructured Data

    Authors: Laura Battaglia, Timothy Christensen, Stephen Hansen, Szymon Sacher

    Abstract: The leading strategy for analyzing unstructured data uses two steps. First, latent variables of economic interest are estimated with an upstream information retrieval model. Second, the estimates are treated as "data" in a downstream econometric model. We establish theoretical arguments for why this two-step strategy leads to biased inference in empirically plausible settings. More constructively,… ▽ More

    Submitted 9 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  2. arXiv:2303.04209  [pdf, other

    cs.LG cs.AI stat.ML

    Causal Dependence Plots

    Authors: Joshua R. Loftus, Lucius E. J. Bynum, Sakina Hansen

    Abstract: Explaining artificial intelligence or machine learning models is increasingly important. To use such data-driven systems wisely we must understand how they interact with the world, including how they depend causally on data inputs. In this work we develop Causal Dependence Plots (CDPs) to visualize how one variable--an outcome--depends on changes in another variable--a predictor--… ▽ More

    Submitted 5 July, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  3. arXiv:2210.11107  [pdf, other

    stat.AP

    Graphical model inference with external network data

    Authors: Jack Jewson, Li Li, Laura Battaglia, Stephen Hansen, David Rossell, Piotr Zwiernik

    Abstract: We consider two applications where we study how dependence structure between many variables is linked to external network data. We first study the interplay between social media connectedness and the co-evolution of the COVID-19 pandemic across USA counties. We next study study how the dependence between stock market returns across firms relates to similarities in economic and policy indicators fr… ▽ More

    Submitted 13 November, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

  4. arXiv:2107.14226  [pdf, other

    cs.LG cs.AI stat.ML

    Learning more skills through optimistic exploration

    Authors: DJ Strouse, Kate Baumli, David Warde-Farley, Vlad Mnih, Steven Hansen

    Abstract: Unsupervised skill learning objectives (Gregor et al., 2016, Eysenbach et al., 2018) allow agents to learn rich repertoires of behavior in the absence of extrinsic rewards. They work by simultaneously training a policy to produce distinguishable latent-conditioned trajectories, and a discriminator to evaluate distinguishability by trying to infer latents from trajectories. The hope is for the agen… ▽ More

    Submitted 12 May, 2022; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: Accepted at ICLR 2022 (spotlight)

  5. arXiv:2107.08112  [pdf, other

    econ.EM stat.ME

    Hamiltonian Monte Carlo for Regression with High-Dimensional Categorical Data

    Authors: Szymon Sacher, Laura Battaglia, Stephen Hansen

    Abstract: Latent variable models are increasingly used in economics for high-dimensional categorical data like text and surveys. We demonstrate the effectiveness of Hamiltonian Monte Carlo (HMC) with parallelized automatic differentiation for analyzing such data in a computationally efficient and methodologically sound manner. Our new model, Supervised Topic Model with Covariates, shows that carefully model… ▽ More

    Submitted 29 February, 2024; v1 submitted 16 July, 2021; originally announced July 2021.

    Comments: 20 pages 5 figures, 2 tables

  6. arXiv:2102.13515  [pdf, other

    cs.LG cs.AI stat.ML

    Beyond Fine-Tuning: Transferring Behavior in Reinforcement Learning

    Authors: Víctor Campos, Pablo Sprechmann, Steven Hansen, Andre Barreto, Steven Kapturowski, Alex Vitvitskyi, Adrià Puigdomènech Badia, Charles Blundell

    Abstract: Designing agents that acquire knowledge autonomously and use it to solve new tasks efficiently is an important challenge in reinforcement learning. Knowledge acquired during an unsupervised pre-training phase is often transferred by fine-tuning neural network weights once rewards are exposed, as is common practice in supervised domains. Given the nature of the reinforcement learning problem, we ar… ▽ More

    Submitted 8 June, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

  7. arXiv:1910.13406  [pdf, other

    cs.LG cs.AI stat.ML

    Generalization of Reinforcement Learners with Working and Episodic Memory

    Authors: Meire Fortunato, Melissa Tan, Ryan Faulkner, Steven Hansen, Adrià Puigdomènech Badia, Gavin Buttimore, Charlie Deck, Joel Z Leibo, Charles Blundell

    Abstract: Memory is an important aspect of intelligence and plays a role in many deep reinforcement learning models. However, little progress has been made in understanding when specific memory systems help more than others and how well they generalize. The field also has yet to see a prevalent consistent and rigorous approach for evaluating agent performance on holdout data. In this paper, we aim to develo… ▽ More

    Submitted 18 February, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019. Equal contribution of first 4 authors

    Journal ref: 33rd Conference on Neural Information Processing Systems (Neurips 2019)

  8. arXiv:1906.05030  [pdf, other

    cs.LG cs.AI stat.ML

    Fast Task Inference with Variational Intrinsic Successor Features

    Authors: Steven Hansen, Will Dabney, Andre Barreto, Tom Van de Wiele, David Warde-Farley, Volodymyr Mnih

    Abstract: It has been established that diverse behaviors spanning the controllable subspace of an Markov decision process can be trained by rewarding a policy for being distinguishable from other policies \citep{gregor2016variational, eysenbach2018diversity, warde2018unsupervised}. However, one limitation of this formulation is generalizing behaviors beyond the finite set being explicitly learned, as is nee… ▽ More

    Submitted 27 January, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Accepted for publication at ICLR 2020

  9. arXiv:1811.11359  [pdf, other

    cs.LG cs.AI stat.ML

    Unsupervised Control Through Non-Parametric Discriminative Rewards

    Authors: David Warde-Farley, Tom Van de Wiele, Tejas Kulkarni, Catalin Ionescu, Steven Hansen, Volodymyr Mnih

    Abstract: Learning to control an environment without hand-crafted rewards or expert data remains challenging and is at the frontier of reinforcement learning research. We present an unsupervised learning algorithm to train agents to achieve perceptually-specified goals using only a stream of observations and actions. Our agent simultaneously learns a goal-conditioned policy and a goal achievement reward fun… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: 10 pages + references & 5 page appendix

  10. arXiv:1705.03562  [pdf, other

    stat.ML cs.AI cs.LG

    Deep Episodic Value Iteration for Model-based Meta-Reinforcement Learning

    Authors: Steven Stenberg Hansen

    Abstract: We present a new deep meta reinforcement learner, which we call Deep Episodic Value Iteration (DEVI). DEVI uses a deep neural network to learn a similarity metric for a non-parametric model-based reinforcement learning algorithm. Our model is trained end-to-end via back-propagation. Despite being trained using the model-free Q-learning objective, we show that DEVI's model-based internal structure… ▽ More

    Submitted 9 May, 2017; originally announced May 2017.

  11. Identification of release sources in advection-diffusion system by machine learning combined with Green function inverse method

    Authors: Valentin G. Stanev, Filip L. Iliev, Scott Hansen, Velimir V. Vesselinov, Boian S. Alexandrov

    Abstract: The identification of sources of advection-diffusion transport is based usually on solving complex ill-posed inverse models against the available state- variable data records. However, if there are several sources with different locations and strengths, the data records represent mixtures rather than the separate influences of the original sources. Importantly, the number of these original release… ▽ More

    Submitted 23 March, 2018; v1 submitted 12 December, 2016; originally announced December 2016.

    Report number: LA-UR-16-27231 MSC Class: 68T10

  12. arXiv:1401.7020  [pdf, other

    math.OC cs.LG stat.ML

    A Stochastic Quasi-Newton Method for Large-Scale Optimization

    Authors: R. H. Byrd, S. L. Hansen, J. Nocedal, Y. Singer

    Abstract: The question of how to incorporate curvature information in stochastic approximation methods is challenging. The direct application of classical quasi- Newton updating techniques for deterministic optimization leads to noisy curvature estimates that have harmful effects on the robustness of the iteration. In this paper, we propose a stochastic quasi-Newton method that is efficient, robust and scal… ▽ More

    Submitted 18 February, 2015; v1 submitted 27 January, 2014; originally announced January 2014.