Showing 1–21 of 21 results for author: Nardelli, N

  1. arXiv:2404.14310  [pdf, other]

    physics.optics physics.atom-ph

    Cryogenic sapphire optical reference cavity with crystalline coatings at $1 \times 10^{-16}$ fractional instability

    Authors: Jose Valencia, George Iskander, Nicholas V. Nardelli, David R. Leibrandt, David B. Hume

    Abstract: The frequency stability of a laser locked to an optical reference cavity is fundamentally limited by thermal noise in the cavity length. These fluctuations are linked to material dissipation, which depends both on the temperature of the optical components and the material properties. Here, the design and experimental characterization of a sapphire optical cavity operated at 10 K with crystalline c…

    Submitted 22 April, 2024; originally announced April 2024.

  2. arXiv:2312.06527  [pdf, other]

    cs.AI cs.CE

    Can Reinforcement Learning support policy makers? A preliminary study with Integrated Assessment Models

    Authors: Theodore Wolf, Nantas Nardelli, John Shawe-Taylor, Maria Perez-Ortiz

    Abstract: Governments around the world aspire to ground decision-making on evidence. Many of the foundations of policy making - e.g. sensing patterns that relate to societal needs, developing evidence-based programs, forecasting potential outcomes of policy changes, and monitoring effectiveness of policy programs - have the potential to benefit from the use of large-scale datasets or simulations together wi…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Published at NeurIPS'23 Workshop on Tackling Climate Change with Machine Learning

  3. arXiv:2208.12869  [pdf, other]

    physics.optics physics.ins-det

    Optical and microwave metrology at the $10^{-18}$ level with an Er/Yb:glass frequency comb

    Authors: Nicholas V. Nardelli, Holly Leopardi, Thomas R. Schibli, Tara M. Fortier

    Abstract: Optical frequency combs are an essential tool for precision metrology experiments ranging in application from remote spectroscopic sensing of trace gases to the characterization and comparison of optical atomic clocks for precision time-keeping and searches for physics beyond the standard model. Here we describe the architecture and fully characterize a telecom-band, self-modelocking frequency com…

    Submitted 26 August, 2022; originally announced August 2022.

  4. arXiv:2203.11889  [pdf, other]

    cs.LG cs.AI cs.NE cs.SC stat.ML

    Insights From the NeurIPS 2021 NetHack Challenge

    Authors: Eric Hambro, Sharada Mohanty, Dmitrii Babaev, Minwoo Byeon, Dipam Chakraborty, Edward Grefenstette, Minqi Jiang, Daejin Jo, Anssi Kanervisto, Jongmin Kim, Sungwoong Kim, Robert Kirk, Vitaly Kurin, Heinrich Küttler, Taehwon Kwon, Donghoon Lee, Vegard Mella, Nantas Nardelli, Ivan Nazarov, Nikita Ovsov, Jack Parker-Holder, Roberta Raileanu, Karolis Ramanauskas, Tim Rocktäschel, Danielle Rothermel, et al. (4 additional authors not shown)

    Abstract: In this report, we summarize the takeaways from the first NeurIPS 2021 NetHack Challenge. Participants were tasked with developing a program or agent that can win (i.e., 'ascend' in) the popular dungeon-crawler game of NetHack by interacting with the NetHack Learning Environment (NLE), a scalable, procedurally generated, and challenging Gym environment for reinforcement learning (RL). The challeng…

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: Under review at PMLR for the NeurIPS 2021 Competition Workshop Track, 10 pages + 10 in appendices

  5. arXiv:2110.00593  [pdf, other]

    physics.optics physics.ins-det

    10 GHz Generation with Ultra-Low Phase Noise via the Transfer Oscillator Technique

    Authors: Nicholas V. Nardelli, Tara M. Fortier, Marco Pomponio, Esther Baumann, Craig Nelson, Thomas R. Schibli, Archita Hati

    Abstract: Coherent frequency division of high-stability optical sources permits the extraction of microwave signals with ultra-low phase noise, enabling their application to systems with stringent timing precision. To date, the highest performance systems have required tight phase stabilization of laboratory grade optical frequency combs to Fabry-Perot optical reference cavities for faithful optical-to-micr…

    Submitted 1 October, 2021; originally announced October 2021.

  6. arXiv:2109.09540  [pdf, other]

    physics.atom-ph physics.optics

    Optical coherence between atomic species at the second scale: improved clock comparisons via differential spectroscopy

    Authors: May E. Kim, William F. McGrew, Nicholas V. Nardelli, Ethan R. Clements, Youssef S. Hassan, Xiaogang Zhang, Jose L. Valencia, Holly Leopardi, David B. Hume, Tara M. Fortier, Andrew D. Ludlow, David R. Leibrandt

    Abstract: Comparisons of high-accuracy optical atomic clocks \cite{Ludlow2015} are essential for precision tests of fundamental physics \cite{Safronova2018}, relativistic geodesy \cite{McGrew2018, Grotti2018, Delva2019}, and the anticipated redefinition of the SI second \cite{Riehle2018}. The scientific reach of these applications is restricted by the statistical precision of interspecies comparison measure…

    Submitted 24 November, 2021; v1 submitted 20 September, 2021; originally announced September 2021.

  7. arXiv:2007.09185  [pdf, other]

    cs.AI cs.CL cs.LG

    WordCraft: An Environment for Benchmarking Commonsense Agents

    Authors: Minqi Jiang, Jelena Luketina, Nantas Nardelli, Pasquale Minervini, Philip H. S. Torr, Shimon Whiteson, Tim Rocktäschel

    Abstract: The ability to quickly solve a wide range of real-world tasks requires a commonsense understanding of the world. Yet, how to best extract such knowledge from natural language corpora and integrate it with reinforcement learning (RL) agents remains an open challenge. This is partly due to the lack of lightweight simulation environments that sufficiently reflect the semantics of the real world and p…

    Submitted 17 July, 2020; originally announced July 2020.

  8. arXiv:2006.13760  [pdf, other]

    cs.LG cs.AI cs.CL cs.NE stat.ML

    The NetHack Learning Environment

    Authors: Heinrich Küttler, Nantas Nardelli, Alexander H. Miller, Roberta Raileanu, Marco Selvatici, Edward Grefenstette, Tim Rocktäschel

    Abstract: Progress in Reinforcement Learning (RL) algorithms goes hand-in-hand with the development of challenging environments that test the limits of current methods. While existing RL environments are either sufficiently complex or based on fast simulation, they are rarely both. Here, we present the NetHack Learning Environment (NLE), a scalable, procedurally generated, stochastic, rich, and challenging…

    Submitted 1 December, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 28 pages. Accepted at NeurIPS 2020

  9. arXiv:2005.07062  [pdf, other]

    cs.LG stat.AP stat.ML

    Simulation-Based Inference for Global Health Decisions

    Authors: Christian Schroeder de Witt, Bradley Gram-Hansen, Nantas Nardelli, Andrew Gambardella, Rob Zinkov, Puneet Dokania, N. Siddharth, Ana Belen Espinosa-Gonzalez, Ara Darzi, Philip Torr, Atılım Güneş Baydin

    Abstract: The COVID-19 pandemic has highlighted the importance of in-silico epidemiological modelling in predicting the dynamics of infectious diseases to inform health policy and decision makers about suitable prevention and containment strategies. Work in this setting involves solving challenging inference and control problems in individual-based models of ever increasing complexity. Here we discuss recen…

    Submitted 14 May, 2020; originally announced May 2020.

    Journal ref: ICML Workshop on Machine Learning for Global Health, Thirty-Seventh International Conference on Machine Learning (ICML 2020)

  10. arXiv:1912.06615  [pdf, other]

    q-bio.NC

    Lessons from reinforcement learning for biological representations of space

    Authors: Alex Muryy, N. Siddharth, Nantas Nardelli, Philip H. S. Torr, Andrew Glennerster

    Abstract: Neuroscientists postulate 3D representations in the brain in a variety of different coordinate frames (e.g. 'head-centred', 'hand-centred' and 'world-based'). Recent advances in reinforcement learning demonstrate a quite different approach that may provide a more promising model for biological representations underlying spatial perception and navigation. In this paper, we focus on reinforcement le…

    Submitted 6 July, 2020; v1 submitted 13 December, 2019; originally announced December 2019.

    Comments: 40 pages including Appendix, 6 figures plus 3 figures in Appendix. Accepted for publication in Vision Research

  11. arXiv:1910.04054  [pdf, other]

    cs.LG cs.DC cs.NI stat.ML

    MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions

    Authors: Viswanath Sivakumar, Olivier Delalleau, Tim Rocktäschel, Alexander H. Miller, Heinrich Küttler, Nantas Nardelli, Mike Rabbat, Joelle Pineau, Sebastian Riedel

    Abstract: Effective network congestion control strategies are key to keeping the Internet (or any large computer network) operational. Network congestion control has been dominated by hand-crafted heuristics for decades. Recently, Reinforcement Learning (RL) has emerged as an alternative to automatically optimize such control strategies. Research so far has primarily considered RL interfaces which block the…

    Submitted 26 May, 2021; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Workshop on ML for Systems at NeurIPS 2019

  12. arXiv:1910.03552  [pdf, other]

    cs.LG stat.ML

    TorchBeast: A PyTorch Platform for Distributed RL

    Authors: Heinrich Küttler, Nantas Nardelli, Thibaut Lavril, Marco Selvatici, Viswanath Sivakumar, Tim Rocktäschel, Edward Grefenstette

    Abstract: TorchBeast is a platform for reinforcement learning (RL) research in PyTorch. It implements a version of the popular IMPALA algorithm for fast, asynchronous, parallel training of RL agents. Additionally, TorchBeast has simplicity as an explicit design goal: We provide both a pure-Python implementation ("MonoBeast") as well as a multi-machine high-performance version ("PolyBeast"). In the latter, p…

    Submitted 8 October, 2019; originally announced October 2019.

  13. arXiv:1906.03926  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    A Survey of Reinforcement Learning Informed by Natural Language

    Authors: Jelena Luketina, Nantas Nardelli, Gregory Farquhar, Jakob Foerster, Jacob Andreas, Edward Grefenstette, Shimon Whiteson, Tim Rocktäschel

    Abstract: To be successful in real-world tasks, Reinforcement Learning (RL) needs to exploit the compositional, relational, and hierarchical structure of the world, and learn to transfer it to the task at hand. Recent advances in representation learning for language make it possible to build models that acquire world knowledge from text corpora and integrate this knowledge into downstream decision making pr…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: Published at IJCAI'19

  14. arXiv:1904.01033  [pdf, other]

    cs.LG stat.ML

    Multitask Soft Option Learning

    Authors: Maximilian Igl, Andrew Gambardella, Jinke He, Nantas Nardelli, N. Siddharth, Wendelin Böhmer, Shimon Whiteson

    Abstract: We present Multitask Soft Option Learning (MSOL), a hierarchical multitask framework based on Planning as Inference. MSOL extends the concept of options, using separate variational posteriors for each task, regularized by a shared prior. This "soft" version of options avoids several instabilities during training in a multitask setting, and provides a natural way to learn both intra-option policie…

    Submitted 21 June, 2020; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: Published at UAI 2020

  15. arXiv:1902.04043  [pdf, other]

    cs.LG cs.MA stat.ML

    The StarCraft Multi-Agent Challenge

    Authors: Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob Foerster, Shimon Whiteson

    Abstract: In the last few years, deep multi-agent reinforcement learning (RL) has become a highly active area of research. A particularly challenging class of problems in this area is partially observable, cooperative, multi-agent learning, in which teams of agents must learn to coordinate their behaviour while conditioning only on their private observations. This is an attractive research area since such p…

    Submitted 9 December, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

  16. arXiv:1805.11199  [pdf, other]

    cs.AI cs.LG

    Value Propagation Networks

    Authors: Nantas Nardelli, Gabriel Synnaeve, Zeming Lin, Pushmeet Kohli, Philip H. S. Torr, Nicolas Usunier

    Abstract: We present Value Propagation (VProp), a set of parameter-efficient differentiable planning modules built on Value Iteration which can successfully be trained using reinforcement learning to solve unseen tasks, has the capability to generalize to larger map sizes, and can learn to navigate in dynamic environments. We show that the modules enable learning to plan when the environment also includes s…

    Submitted 25 March, 2019; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Updated to match ICLR 2019 OpenReview's version

  17. arXiv:1705.08926  [pdf, other]

    cs.AI cs.MA

    Counterfactual Multi-Agent Policy Gradients

    Authors: Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, Shimon Whiteson

    Abstract: Cooperative multi-agent systems can be naturally used to model many real world problems, such as network packet routing and the coordination of autonomous vehicles. There is a great need for new reinforcement learning methods that can efficiently learn decentralised policies for such systems. To this end, we propose a new multi-agent actor-critic method called counterfactual multi-agent (COMA) pol…

    Submitted 14 December, 2017; v1 submitted 24 May, 2017; originally announced May 2017.

  18. arXiv:1702.08887  [pdf, other]

    cs.AI cs.LG cs.MA

    Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

    Authors: Jakob Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson

    Abstract: Many real-world problems, such as network packet routing and urban traffic control, are naturally modeled as multi-agent reinforcement learning (RL) problems. However, existing multi-agent RL methods typically scale poorly in the problem size. Therefore, a key challenge is to translate the success of deep learning on single-agent RL to the multi-agent setting. A major stumbling block is that indep…

    Submitted 21 May, 2018; v1 submitted 28 February, 2017; originally announced February 2017.

    Comments: Camera-ready version, International Conference of Machine Learning 2017; updated to fix print-breaking image

  19. arXiv:1612.00380  [pdf, other]

    cs.AI cs.CV stat.ML

    Playing Doom with SLAM-Augmented Deep Reinforcement Learning

    Authors: Shehroze Bhatti, Alban Desmaison, Ondrej Miksik, Nantas Nardelli, N. Siddharth, Philip H. S. Torr

    Abstract: A number of recent approaches to policy learning in 2D game domains have been successful going directly from raw input images to actions. However when employed in complex 3D environments, they typically suffer from challenges related to partial observability, combinatorial exploration spaces, path planning, and a scarcity of rewarding scenarios. Inspired from prior work in human cognition that ind…

    Submitted 1 December, 2016; originally announced December 2016.

  20. arXiv:1611.00625  [pdf, other]

    cs.LG cs.AI

    TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games

    Authors: Gabriel Synnaeve, Nantas Nardelli, Alex Auvolat, Soumith Chintala, Timothée Lacroix, Zeming Lin, Florian Richoux, Nicolas Usunier

    Abstract: We present TorchCraft, a library that enables deep learning research on Real-Time Strategy (RTS) games such as StarCraft: Brood War, by making it easier to control these games from a machine learning framework, here Torch. This white paper argues for using RTS games as a benchmark for AI research, and describes the design and components of TorchCraft.

    Submitted 3 November, 2016; v1 submitted 1 November, 2016; originally announced November 2016.

    ACM Class: I.2.1

  21. Counterfactual Reasoning about Intent for Interactive Navigation in Dynamic Environments

    Authors: A. Bordallo, F. Previtali, N. Nardelli, S. Ramamoorthy

    Abstract: Many modern robotics applications require robots to function autonomously in dynamic environments including other decision making agents, such as people or other robots. This calls for fast and scalable interactive motion planning. This requires models that take into consideration the other agent's intended actions in one's own planning. We present a real-time motion planning framework that brings…

    Submitted 26 October, 2016; originally announced October 2016.

    Journal ref: 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2015), pp. 2943-2950