Showing 1–10 of 10 results for author: Georgescu, R

  1. arXiv:2312.02312  [pdf, other]

    cs.LG cs.AI cs.CV

    Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games

    Authors: Lukas Schäfer, Logan Jones, Anssi Kanervisto, Yuhan Cao, Tabish Rashid, Raluca Georgescu, Dave Bignell, Siddhartha Sen, Andrea Treviño Gavito, Sam Devlin

    Abstract: Video games have served as useful benchmarks for the decision making community, but going beyond Atari games towards training agents in modern games has been prohibitively expensive for the vast majority of the research community. Recent progress in the research, development and open release of large vision models has the potential to amortize some of these costs across the community. However, it…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Preprint

  2. arXiv:2303.02160  [pdf, other]

    cs.HC cs.LG cs.RO

    Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games

    Authors: Stephanie Milani, Arthur Juliani, Ida Momennejad, Raluca Georgescu, Jaroslaw Rzepecki, Alison Shaw, Gavin Costello, Fei Fang, Sam Devlin, Katja Hofmann

    Abstract: We aim to understand how people assess human likeness in navigation produced by people and artificially intelligent (AI) agents in a video game. To this end, we propose a novel AI agent with the goal of generating more human-like behavior. We collect hundreds of crowd-sourced assessments comparing the human-likeness of navigation behavior generated by our agent and baseline AI agents with human-ge…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 18 pages; accepted at CHI 2023

  3. arXiv:2301.10677  [pdf, other]

    cs.AI cs.LG stat.ML

    Imitating Human Behaviour with Diffusion Models

    Authors: Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

    Abstract: Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their ex…

    Submitted 3 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Published in ICLR 2023

    Journal ref: ICLR 2023

  4. arXiv:2211.10869  [pdf, other]

    cs.LG

    UniMASK: Unified Inference in Sequential Decision Problems

    Authors: Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

    Abstract: Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks. In this work, we observe that the same idea also applies naturally to sequential decision-making, where many well-studied tasks like behavior cloning, offline reinforcement learning, inverse dynamics, and waypoint conditioning correspond to different sequenc…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 (Oral). A prior version was published at an ICML Workshop, available at arXiv:2204.13326

  5. arXiv:2209.00570  [pdf, other]

    cs.AI cs.LG cs.SE

    Go-Explore Complex 3D Game Environments for Automated Reachability Testing

    Authors: Cong Lu, Raluca Georgescu, Johan Verwey

    Abstract: Modern AAA video games feature huge game levels and maps which are increasingly hard for level testers to cover exhaustively. As a result, games often ship with catastrophic bugs such as the player falling through the floor or being stuck in walls. We propose an approach specifically targeted at reachability bugs in simulated 3D environments based on the powerful exploration algorithm, Go-Explore,…

    Submitted 1 September, 2022; originally announced September 2022.

  6. arXiv:2204.13326  [pdf, other]

    cs.LG

    Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

    Authors: Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

    Abstract: Randomly masking and predicting word tokens has been a successful approach in pre-training language models for a variety of downstream tasks. In this work, we observe that the same idea also applies naturally to sequential decision making, where many well-studied tasks like behavior cloning, offline RL, inverse dynamics, and waypoint conditioning correspond to different sequence maskings over a se…

    Submitted 9 December, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Superseded by arXiv:2211.10869

  7. arXiv:2105.09637  [pdf, other]

    cs.AI cs.LG

    Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation

    Authors: Sam Devlin, Raluca Georgescu, Ida Momennejad, Jaroslaw Rzepecki, Evelyn Zuniga, Gavin Costello, Guy Leroy, Ali Shaw, Katja Hofmann

    Abstract: A key challenge on the path to developing agents that learn complex human-like behavior is the need to quickly and accurately quantify human-likeness. While human assessments of such behavior can be highly accurate, speed and scalability are limited. We address these limitations through a novel automated Navigation Turing Test (ANTT) that learns to predict human judgments of human-likeness. We dem…

    Submitted 28 July, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: All data collected throughout this study, plus the code to reproduce our analysis and ANTT are available at https://github.com/microsoft/NTT

    Journal ref: Proceedings of the 38th International Conference on Machine Learning (ICML), 139:2644-2653, 2021

  8. arXiv:1208.1707  [pdf, ps, other]

    math.DS nlin.CD

    Numerical investigation of the Bautin bifurcation in a delay differential equation modeling leukemia

    Authors: Anca Veronica Ion, Raluca Mihaela Georgescu

    Abstract: In a previous work we investigated the existence of Hopf degenerate bifurcation points for a differential delay equation modeling leukemia and we actually found Hopf points of codimension two for the considered problem. If around the parameters corresponding to such a point we vary two parameters (the considered problem has five parameters), then a Bautin bifurcation should occur. In this work we…

    Submitted 8 August, 2012; originally announced August 2012.

    Comments: To be presented at CAIM 2012 (Conference on Applied and Industrial Mathematics), 23-25 August 2012, Chisinau, Republic of Moldova

    MSC Class: 37C75; 65L03; 37G05; 37G15

  9. arXiv:1205.3917  [pdf, ps, other]

    math.DS

    Hopf points of codimension two in a delay differential equation modeling leukemia

    Authors: Anca Veronica Ion, Raluca Mihaela Georgescu

    Abstract: This paper continues the work contained in two previous papers, devoted to the study of the dynamical system generated by a delay differential equation that models leukemia. Here our aim is to identify degenerate Hopf bifurcation points. By using an approximation of the center manifold, we compute the first Lyapunov coefficient for Hopf bifurcation points. We find by direct computation, in some zo…

    Submitted 17 May, 2012; originally announced May 2012.

    MSC Class: 65L03; 37C75; 37G05; 37G15

  10. arXiv:1001.5354  [pdf, ps, other]

    math.DS

    Stability of equilibrium and periodic solutions of a delay equation modeling leukemia

    Authors: Anca-Veronica Ion, Raluca-Mihaela Georgescu

    Abstract: We consider a delay differential equation that occurs in the study of chronic myelogenous leukemia. After shortly reminding some previous results concerning the stability of equilibrium solutions, we concentrate on the study of stability of periodic solutions emerged by Hopf bifurcation from a certain equilibrium point. We give the algorithm for approximating a center manifold at a typical point…

    Submitted 22 March, 2010; v1 submitted 29 January, 2010; originally announced January 2010.

    MSC Class: 65L03; 37C75; 37G05; 37G15

    Journal ref: "Journal of Middle Volga Mathematical Society", 11, 2(2009), 146-157