Showing 1–50 of 65 results for author: Côté, M

Searching in archive cs. Search in all archives.
  1. arXiv:2406.06769  [pdf, other]

    cs.AI cs.CL

    DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents

    Authors: Peter Jansen, Marc-Alexandre Côté, Tushar Khot, Erin Bransom, Bhavana Dalvi Mishra, Bodhisattwa Prasad Majumder, Oyvind Tafjord, Peter Clark

    Abstract: Automated scientific discovery promises to accelerate progress across scientific domains. However, developing and evaluating an AI agent's capacity for end-to-end scientific reasoning is challenging as running real-world experiments is often prohibitively expensive or infeasible. In this work we introduce DISCOVERYWORLD, the first virtual environment for developing and benchmarking an agent's abil…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures. Preprint, under review

  2. arXiv:2406.06485  [pdf, other]

    cs.CL cs.AI

    Can Language Models Serve as Text-Based World Simulators?

    Authors: Ruoyao Wang, Graham Todd, Ziang Xiao, Xingdi Yuan, Marc-Alexandre Côté, Peter Clark, Peter Jansen

    Abstract: Virtual environments play a key role in benchmarking advances in complex planning and decision-making tasks but are expensive and complicated to build by hand. Can current language models themselves serve as world simulators, correctly predicting how actions change different world states, thus bypassing the need for extensive manual coding? Our goal is to answer this question in the context of tex…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: ACL 2024

  3. arXiv:2405.02749  [pdf, other]

    cs.LG

    Sub-goal Distillation: A Method to Improve Small Language Agents

    Authors: Maryam Hashemzadeh, Elias Stengel-Eskin, Sarath Chandar, Marc-Alexandre Cote

    Abstract: While Large Language Models (LLMs) have demonstrated significant promise as agents in interactive tasks, their substantial computational requirements and restricted number of calls constrain their practical utility, especially in long-horizon interactive tasks such as decision-making or in scenarios involving continuous ongoing tasks. To address these constraints, we propose a method for transferr…

    Submitted 4 May, 2024; originally announced May 2024.

  4. arXiv:2403.03017  [pdf, other]

    cs.AI

    OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following

    Authors: Haochen Shi, Zhiyuan Sun, Xingdi Yuan, Marc-Alexandre Côté, Bang Liu

    Abstract: Embodied Instruction Following (EIF) is a crucial task in embodied learning, requiring agents to interact with their environment through egocentric observations to fulfill natural language instructions. Recent advancements have seen a surge in employing large language models (LLMs) within a framework-centric approach to enhance performance in embodied learning tasks, including EIF. Despite these e…

    Submitted 5 March, 2024; originally announced March 2024.

  5. arXiv:2402.16354  [pdf, other]

    cs.LG cs.AI cs.CL

    Language-guided Skill Learning with Temporal Variational Inference

    Authors: Haotian Fu, Pratyusha Sharma, Elias Stengel-Eskin, George Konidaris, Nicolas Le Roux, Marc-Alexandre Côté, Xingdi Yuan

    Abstract: We present an algorithm for skill discovery from expert demonstrations. The algorithm first utilizes Large Language Models (LLMs) to propose an initial segmentation of the trajectories. Following that, a hierarchical variational inference framework incorporates the LLM-generated segmentation information to discover reusable skills by merging trajectory segments. To further control the trade-off be…

    Submitted 27 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  6. arXiv:2402.07876  [pdf, other]

    cs.LG cs.AI cs.CL

    Policy Improvement using Language Feedback Models

    Authors: Victor Zhong, Dipendra Misra, Xingdi Yuan, Marc-Alexandre Côté

    Abstract: We introduce Language Feedback Models (LFMs) that identify desirable behaviour - actions that help achieve tasks specified in the instruction - for imitation learning in instruction following. To train LFMs, we obtain feedback from Large Language Models (LLMs) on visual trajectories verbalized to language descriptions. First, by using LFMs to identify desirable behaviour to imitate, we improve in…

    Submitted 18 April, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  7. arXiv:2310.13724  [pdf, other]

    cs.HC cs.AI cs.CV cs.GR cs.MA cs.RO

    Habitat 3.0: A Co-Habitat for Humans, Avatars and Robots

    Authors: Xavier Puig, Eric Undersander, Andrew Szot, Mikael Dallaire Cote, Tsung-Yen Yang, Ruslan Partsey, Ruta Desai, Alexander William Clegg, Michal Hlavac, So Yeon Min, Vladimír Vondruš, Theophile Gervet, Vincent-Pierre Berges, John M. Turner, Oleksandr Maksymets, Zsolt Kira, Mrinal Kalakrishnan, Jitendra Malik, Devendra Singh Chaplot, Unnat Jain, Dhruv Batra, Akshara Rai, Roozbeh Mottaghi

    Abstract: We present Habitat 3.0: a simulation platform for studying collaborative human-robot tasks in home environments. Habitat 3.0 offers contributions across three dimensions: (1) Accurate humanoid simulation: addressing challenges in modeling complex deformable bodies and diversity in appearance and motion, all while ensuring high simulation speed. (2) Human-in-the-loop infrastructure: enabling real h…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Project page: http://aihabitat.org/habitat3

  8. arXiv:2306.12509  [pdf, other]

    cs.CL cs.LG

    Joint Prompt Optimization of Stacked LLMs using Variational Inference

    Authors: Alessandro Sordoni, Xingdi Yuan, Marc-Alexandre Côté, Matheus Pereira, Adam Trischler, Ziang Xiao, Arian Hosseini, Friederike Niedtner, Nicolas Le Roux

    Abstract: Large language models (LLMs) can be seen as atomic units of computation mapping sequences to a distribution over sequences. Thus, they can be seen as stochastic language layers in a language network, where the learnable parameters are the natural language prompts at each layer. By stacking two such layers and feeding the output of one layer to the next, we obtain a Deep Language Network (DLN). We…

    Submitted 4 December, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  9. Who are CUIs Really For? Representation and Accessibility in the Conversational User Interface Literature

    Authors: William Seymour, Xiao Zhan, Mark Cote, Jose Such

    Abstract: The theme for CUI 2023 is 'designing for inclusive conversation', but who are CUIs really designed for? The field has its roots in computer science, which has a long acknowledged diversity problem. Inspired by studies mapping out the diversity of the CHI and voice assistant literature, we set out to investigate how these issues have (or have not) shaped the CUI literature. To do this we reviewed t…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: To appear in the Proceedings of the 2023 ACM conference on Conversational User Interfaces (CUI 23)

  10. arXiv:2305.14879  [pdf, other]

    cs.CL cs.AI

    ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games

    Authors: Ruoyao Wang, Graham Todd, Eric Yuan, Ziang Xiao, Marc-Alexandre Côté, Peter Jansen

    Abstract: In this work, we investigate the capacity of language models to generate explicit, interpretable, and interactive world models of scientific and common-sense reasoning tasks. We operationalize this as a task of generating text games, expressed as hundreds of lines of Python code. To facilitate this task, we introduce ByteSized32 (Code: github.com/cognitiveailab/BYTESIZED32), a corpus of 32 reasoni…

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023

  11. arXiv:2305.12487  [pdf, other]

    cs.AI cs.CL cs.LG

    Augmenting Autotelic Agents with Large Language Models

    Authors: Cédric Colas, Laetitia Teodorescu, Pierre-Yves Oudeyer, Xingdi Yuan, Marc-Alexandre Côté

    Abstract: Humans learn to master open-ended repertoires of skills by imagining and practicing their own goals. This autotelic learning process, literally the pursuit of self-generated (auto) goals (telos), becomes more and more open-ended as the goals become more diverse, abstract and creative. The resulting exploration of the space of possible skills is supported by an inter-individual exploration: goal re…

    Submitted 21 May, 2023; originally announced May 2023.

  12. arXiv:2302.05244  [pdf, other]

    cs.AI cs.CL cs.LG

    A Song of Ice and Fire: Analyzing Textual Autotelic Agents in ScienceWorld

    Authors: Laetitia Teodorescu, Xingdi Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer

    Abstract: Building open-ended agents that can autonomously discover a diversity of behaviours is one of the long-standing goals of artificial intelligence. This challenge can be studied in the framework of autotelic RL agents, i.e. agents that learn by selecting and pursuing their own goals, self-organizing a learning curriculum. Recent work identified language as a key dimension of autotelic learning, in p…

    Submitted 24 February, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: In review at ICML 2023

  13. Legal Obligation and Ethical Best Practice: Towards Meaningful Verbal Consent for Voice Assistants

    Authors: William Seymour, Mark Cote, Jose Such

    Abstract: To improve user experience, Alexa now allows users to consent to data sharing via voice rather than directing them to the companion smartphone app. While verbal consent mechanisms for voice assistants (VAs) can increase usability, they can also undermine principles core to informed consent. We conducted a Delphi study with experts from academia, industry, and the public sector on requirements for…

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: To appear in the Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23)

  14. arXiv:2211.06552  [pdf, other]

    cs.CL cs.AI

    Collecting Interactive Multi-modal Datasets for Grounded Language Understanding

    Authors: Shrestha Mohanty, Negar Arabzadeh, Milagro Teruel, Yuxuan Sun, Artem Zholus, Alexey Skrynnik, Mikhail Burtsev, Kavya Srinet, Aleksandr Panov, Arthur Szlam, Marc-Alexandre Côté, Julia Kiseleva

    Abstract: Human intelligence can remarkably adapt quickly to new tasks and environments. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research which can enable similar capabilities in machines, we made the following contributions (1) formalized the co…

    Submitted 21 March, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Journal ref: Interactive Learning for Natural Language Processing NeurIPS 2022 Workshop

  15. A Systematic Review of Ethical Concerns with Voice Assistants

    Authors: William Seymour, Xiao Zhan, Mark Cote, Jose Such

    Abstract: Since Siri's release in 2011 there have been a growing number of AI-driven domestic voice assistants that are increasingly being integrated into devices such as smartphones and TVs. But as their presence has expanded, a range of ethical concerns has been identified around the use of voice assistants, such as the privacy implications of having devices that are always listening and the ways that the…

    Submitted 23 June, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted to AIES 2023

  16. arXiv:2211.00688  [pdf, other]

    cs.AI cs.CL

    Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

    Authors: Alexey Skrynnik, Zoya Volovikova, Marc-Alexandre Côté, Anton Voronov, Artem Zholus, Negar Arabzadeh, Shrestha Mohanty, Milagro Teruel, Ahmed Awadallah, Aleksandr Panov, Mikhail Burtsev, Julia Kiseleva

    Abstract: The adoption of pre-trained language models to generate action plans for embodied agents is a promising research strategy. However, execution of instructions in real or simulated environments requires verification of the feasibility of actions as well as their relevance to the completion of a goal. We propose a new method that combines a language model and reinforcement learning for the task of bu…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 6 pages, 3 figures

  17. arXiv:2210.07382  [pdf, other]

    cs.CL cs.AI

    Behavior Cloned Transformers are Neurosymbolic Reasoners

    Authors: Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté, Prithviraj Ammanabrolu

    Abstract: In this work, we explore techniques for augmenting interactive agents with information from symbolic modules, much like humans use tools like calculators and GPS systems to assist with arithmetic and navigation. We test our agent's abilities in text games -- challenging benchmarks for evaluating the multi-step reasoning abilities of game agents in grounded, language-based environments. Our experim…

    Submitted 11 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to EACL 2023

  18. arXiv:2208.01174  [pdf, other]

    cs.CL cs.AI

    TextWorldExpress: Simulating Text Games at One Million Steps Per Second

    Authors: Peter A. Jansen, Marc-Alexandre Côté

    Abstract: Text-based games offer a challenging test bed to evaluate virtual agents at language understanding, multi-step problem-solving, and common-sense reasoning. However, speed is a major limitation of current text-based games, capping at 300 steps per second, mainly due to the use of legacy tooling. In this work we present TextWorldExpress, a high-performance simulator that includes implementations of…

    Submitted 2 March, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted to EACL 2023

  19. arXiv:2207.04118  [pdf, other]

    cs.AI

    Automatic Exploration of Textual Environments with Language-Conditioned Autotelic Agents

    Authors: Laetitia Teodorescu, Eric Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer

    Abstract: In this extended abstract we discuss the opportunities and challenges of studying intrinsically-motivated agents for exploration in textual environments. We argue that there is important synergy between text environments and autonomous agents. We identify key properties of text worlds that make them suitable for exploration by autonomous agents, namely, depth, breadth, progress niches and the ease…

    Submitted 8 July, 2022; originally announced July 2022.

  20. When It's Not Worth the Paper It's Written On: A Provocation on the Certification of Skills in the Alexa and Google Assistant Ecosystems

    Authors: William Seymour, Mark Cote, Jose Such

    Abstract: The increasing reach and functionality of voice assistants has allowed them to become a general-purpose platform for tasks like playing music, accessing information, and controlling smart home devices. In order to maintain the quality of third-party skills and to protect children and other members of the public from inappropriate or malicious skills, platform providers have developed content polic…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: To appear in the Proceedings of the 4th Conference on Conversational User Interfaces (CUI 2022)

  21. Can you meaningfully consent in eight seconds? Identifying Ethical Issues with Verbal Consent for Voice Assistants

    Authors: William Seymour, Mark Cote, Jose Such

    Abstract: Determining how voice assistants should broker consent to share data with third party software has proven to be a complex problem. Devices often require users to switch to companion smartphone apps in order to navigate permissions menus for their otherwise hands-free voice assistant. More in line with smartphone app stores, Alexa now offers "voice-forward consent", allowing users to grant skills a…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: To appear in the Proceedings of the 4th Conference on Conversational User Interfaces (CUI 2022). arXiv admin note: substantial text overlap with arXiv:2204.10058

  22. arXiv:2206.00142  [pdf, other]

    cs.LG cs.AI cs.CL

    IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents

    Authors: Artem Zholus, Alexey Skrynnik, Shrestha Mohanty, Zoya Volovikova, Julia Kiseleva, Artur Szlam, Marc-Alexandre Coté, Aleksandr I. Panov

    Abstract: We present the IGLU Gridworld: a reinforcement learning environment for building and evaluating language conditioned embodied agents in a scalable way. The environment features visual agent embodiment, interactive learning through collaboration, language conditioned RL, and combinatorically hard task (3d blocks building) space.

    Submitted 31 May, 2022; originally announced June 2022.

  23. arXiv:2205.13771  [pdf, other]

    cs.CL

    IGLU 2022: Interactive Grounded Language Understanding in a Collaborative Environment at NeurIPS 2022

    Authors: Julia Kiseleva, Alexey Skrynnik, Artem Zholus, Shrestha Mohanty, Negar Arabzadeh, Marc-Alexandre Côté, Mohammad Aliannejadi, Milagro Teruel, Ziming Li, Mikhail Burtsev, Maartje ter Hoeve, Zoya Volovikova, Aleksandr Panov, Yuxuan Sun, Kavya Srinet, Arthur Szlam, Ahmed Awadallah

    Abstract: Human intelligence has the remarkable ability to adapt to new tasks and environments quickly. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research in this direction, we propose IGLU: Interactive Grounded Language Understanding in a Collabor…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2110.06536

  24. arXiv:2205.06111  [pdf, other]

    cs.AI cs.CL

    Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language

    Authors: Iou-Jen Liu, Xingdi Yuan, Marc-Alexandre Côté, Pierre-Yves Oudeyer, Alexander G. Schwing

    Abstract: To solve difficult tasks, humans ask questions to acquire knowledge from external sources. In contrast, classical reinforcement learning agents lack such an ability and often resort to exploratory behavior. This is exacerbated as few present-day environments support querying for knowledge. In order to study how agents can be taught to query external knowledge via language, we first introduce two n…

    Submitted 3 July, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: ICML 2022; Project page: https://ioujenliu.github.io/AFK/

  25. arXiv:2205.02388  [pdf, other]

    cs.CL cs.AI

    Interactive Grounded Language Understanding in a Collaborative Environment: IGLU 2021

    Authors: Julia Kiseleva, Ziming Li, Mohammad Aliannejadi, Shrestha Mohanty, Maartje ter Hoeve, Mikhail Burtsev, Alexey Skrynnik, Artem Zholus, Aleksandr Panov, Kavya Srinet, Arthur Szlam, Yuxuan Sun, Marc-Alexandre Côté, Katja Hofmann, Ahmed Awadallah, Linar Abdrazakov, Igor Churin, Putra Manggala, Kata Naszadi, Michiel van der Meer, Taewoon Kim

    Abstract: Human intelligence has the remarkable ability to quickly adapt to new tasks and environments. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research in this direction, we propose IGLU: Interactive Grounded Language Understanding in a Co…

    Submitted 27 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.06536

    Journal ref: Proceedings of Machine Learning Research NeurIPS 2021 Competition and Demonstration Track

  26. arXiv:2204.10058  [pdf, other]

    cs.HC cs.CY

    Consent on the Fly: Developing Ethical Verbal Consent for Voice Assistants

    Authors: William Seymour, Mark Cote, Jose Such

    Abstract: Determining how voice assistants should broker consent to share data with third party software has proven to be a complex problem. Devices often require users to switch to companion smartphone apps in order to navigate permissions menus for their otherwise hands-free voice assistant. More in line with smartphone app stores, Alexa now offers "voice-forward consent", allowing users to grant skills a…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Accepted to the CHI'22 Workshop on the Ethics of Conversational User Interfaces

  27. arXiv:2203.07540  [pdf, other]

    cs.CL cs.AI

    ScienceWorld: Is your Agent Smarter than a 5th Grader?

    Authors: Ruoyao Wang, Peter Jansen, Marc-Alexandre Côté, Prithviraj Ammanabrolu

    Abstract: We present ScienceWorld, a benchmark to test agents' scientific reasoning abilities in a new interactive text environment at the level of a standard elementary school science curriculum. Despite the transformer-based progress seen in question-answering and scientific text processing, we find that current models cannot reason about or explain learned science concepts in novel contexts. For instance…

    Submitted 14 November, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted to EMNLP 2022

  28. arXiv:2203.04806  [pdf, other]

    cs.CL

    One-Shot Learning from a Demonstration with Hierarchical Latent Language

    Authors: Nathaniel Weir, Xingdi Yuan, Marc-Alexandre Côté, Matthew Hausknecht, Romain Laroche, Ida Momennejad, Harm Van Seijen, Benjamin Van Durme

    Abstract: Humans have the capability, aided by the expressive compositionality of their language, to learn quickly by demonstration. They are able to describe unseen task-performing procedures and generalize their execution to other contexts. In this work, we introduce DescribeWorld, an environment designed to test this sort of generalization skill in grounded agents, where tasks are linguistically and proc…

    Submitted 9 March, 2022; originally announced March 2022.

  29. arXiv:2202.10977  [pdf, other]

    cs.RO

    Organ Shape Sensing using Pneumatically Attachable Flexible Rails in Robotic-Assisted Laparoscopic Surgery

    Authors: Aoife McDonald-Bowyer, Solène Dietsch, Emmanouil Dimitrakakis, Joanna M Coote, Lukas Lindenroth, Danail Stoyanov, Agostino Stilli

    Abstract: In robotic-assisted partial nephrectomy, surgeons remove a part of a kidney often due to the presence of a mass. A drop-in ultrasound probe paired to a surgical robot is deployed to execute multiple swipes over the kidney surface to localise the mass and define the margins of resection. This sub-task is challenging and must be performed by a highly skilled surgeon. Automating this sub-task may red…

    Submitted 21 November, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 9 pages, 11 figures

  30. arXiv:2201.13267  [pdf, other]

    cs.LG econ.EM stat.AP

    Micro-level Reserving for General Insurance Claims using a Long Short-Term Memory Network

    Authors: Ihsan Chaoubi, Camille Besse, Hélène Cossette, Marie-Pier Côté

    Abstract: Detailed information about individual claims are completely ignored when insurance claims data are aggregated and structured in development triangles for loss reserving. In the hope of extracting predictive power from the individual claims characteristics, researchers have recently proposed to move away from these macro-level methods in favor of micro-level loss reserving approaches. We introduce…

    Submitted 26 January, 2022; originally announced January 2022.

  31. arXiv:2110.12306  [pdf, other]

    cs.LG cs.DC cs.NE eess.SY

    Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning

    Authors: Sergio Valcarcel Macua, Ian Davies, Aleksi Tukiainen, Enrique Munoz de Cote

    Abstract: We propose a fully distributed actor-critic architecture, named Diff-DAC, with application to multitask reinforcement learning (MRL). During the learning process, agents communicate their value and policy parameters to their neighbours, diffusing the information across a network of agents with no need for a central station. Each agent can only access data from its local task, but aims to learn a c…

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: 27 pages, 8 figures

    Journal ref: The Knowledge Engineering Review, 36, E6 (2021)

  32. arXiv:2010.03768  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG cs.RO

    ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

    Authors: Mohit Shridhar, Xingdi Yuan, Marc-Alexandre Côté, Yonatan Bisk, Adam Trischler, Matthew Hausknecht

    Abstract: Given a simple request like Put a washed apple in the kitchen fridge, humans can reason in purely abstract terms by imagining action sequences and scoring their likelihood of success, prototypicality, and efficiency, all without moving a muscle. Once we see the kitchen in question, we can update our abstract plans to fit the scene. Embodied agents require the same abilities, but existing work does…

    Submitted 14 March, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: ICLR 2021; Data, code, and videos are available at alfworld.github.io

  33. Bias and Discrimination in AI: a cross-disciplinary perspective

    Authors: Xavier Ferrer, Tom van Nuenen, Jose M. Such, Mark Coté, Natalia Criado

    Abstract: With the widespread and pervasive use of Artificial Intelligence (AI) for automated decision-making systems, AI bias is becoming more apparent and problematic. One of its negative consequences is discrimination: the unfair, or unequal treatment of individuals based on certain characteristics. However, the relationship between bias and discrimination is not always clear. In this paper, we survey re…

    Submitted 11 August, 2020; originally announced August 2020.

    MSC Class: 68T01

  34. arXiv:2008.06110  [pdf, other]

    stat.ML cs.LG

    Synthesizing Property & Casualty Ratemaking Datasets using Generative Adversarial Networks

    Authors: Marie-Pier Cote, Brian Hartman, Olivier Mercier, Joshua Meyers, Jared Cummings, Elijah Harmon

    Abstract: Due to confidentiality issues, it can be difficult to access or share interesting datasets for methodological development in actuarial science, or other fields where personal data are important. We show how to design three different types of generative adversarial networks (GANs) that can build a synthetic insurance dataset from a confidential original dataset. The goal is to obtain synthetic data…

    Submitted 13 August, 2020; originally announced August 2020.

  35. arXiv:2007.06894  [pdf, other]

    stat.ML cs.LG

    When stakes are high: balancing accuracy and transparency with Model-Agnostic Interpretable Data-driven suRRogates

    Authors: Roel Henckaerts, Katrien Antonio, Marie-Pier Côté

    Abstract: Highly regulated industries, like banking and insurance, ask for transparent decision-making algorithms. At the same time, competitive markets are pushing for the use of complex black box models. We therefore present a procedure to develop a Model-Agnostic Interpretable Data-driven suRRogate (maidrr) suited for structured tabular data. Knowledge is extracted from a black box via partial dependence…

    Submitted 10 December, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

  36. arXiv:2006.13463  [pdf, other]

    cs.LG cs.AI stat.ML

    Graph Policy Network for Transferable Active Learning on Graphs

    Authors: Shengding Hu, Zheng Xiong, Meng Qu, Xingdi Yuan, Marc-Alexandre Côté, Zhiyuan Liu, Jian Tang

    Abstract: Graph neural networks (GNNs) have been attracting increasing popularity due to their simplicity and effectiveness in a variety of fields. However, a large number of labeled data is generally required to train these networks, which could be very expensive to obtain in some domains. In this paper, we study active learning for GNNs, i.e., how to efficiently label the nodes on a graph to reduce the an…

    Submitted 23 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    ACM Class: I.2

  37. arXiv:2006.00684  [pdf, other]

    cs.CV cs.LG

    Symbol Spotting on Digital Architectural Floor Plans Using a Deep Learning-based Framework

    Authors: Alireza Rezvanifar, Melissa Cote, Alexandra Branzan Albu

    Abstract: This paper focuses on symbol spotting on real-world digital architectural floor plans with a deep learning (DL)-based framework. Traditional on-the-fly symbol spotting methods are unable to address the semantic challenge of graphical notation variability, i.e. low intra-class symbol similarity, an issue that is particularly important in architectural floor plan analysis. The presence of occlusion…

    Submitted 31 May, 2020; originally announced June 2020.

    Comments: Accepted to CVPR2020 Workshop on Text and Documents in the Deep Learning Era

  38. arXiv:2004.05222  [pdf]

    cs.CY cs.SI

    Give more data, awareness and control to individual citizens, and they will help COVID-19 containment

    Authors: Mirco Nanni, Gennady Andrienko, Albert-László Barabási, Chiara Boldrini, Francesco Bonchi, Ciro Cattuto, Francesca Chiaromonte, Giovanni Comandé, Marco Conti, Mark Coté, Frank Dignum, Virginia Dignum, Josep Domingo-Ferrer, Paolo Ferragina, Fosca Giannotti, Riccardo Guidotti, Dirk Helbing, Kimmo Kaski, Janos Kertesz, Sune Lehmann, Bruno Lepri, Paul Lukowicz, Stan Matwin, David Megías Jiménez, Anna Monreale, et al. (14 additional authors not shown)

    Abstract: The rapid dynamics of COVID-19 calls for quick and effective tracking of virus transmission chains and early detection of outbreaks, especially in the phase 2 of the pandemic, when lockdown and other restriction measures are progressively withdrawn, in order to avoid or minimize contagion resurgence. For this purpose, contact-tracing apps are being proposed for large scale adoption by many countri…

    Submitted 16 April, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: Revised text. Additional authors

    Journal ref: Transactions on Data Privacy 13(1): 61-66 (2020), http://www.tdp.cat/issues16/abs.a389a20.php

  39. arXiv:2002.09127  [pdf, other]

    cs.CL cs.LG

    Learning Dynamic Belief Graphs to Generalize on Text-Based Games

    Authors: Ashutosh Adhikari, Xingdi Yuan, Marc-Alexandre Côté, Mikuláš Zelinka, Marc-Antoine Rondeau, Romain Laroche, Pascal Poupart, Jian Tang, Adam Trischler, William L. Hamilton

    Abstract: Playing text-based games requires skills in processing natural language and sequential decision making. Achieving human-level performance on text-based games remains an open challenge, and prior research has largely relied on hand-crafted structured representations and heuristics. In this work, we investigate how an agent can plan and generalize in text-based games using graph-structured represent…

    Submitted 11 May, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Bug fixed in Table 1

  40. arXiv:1910.09532  [pdf, other]

    cs.CL cs.LG

    Building Dynamic Knowledge Graphs from Text-based Games

    Authors: Mikuláš Zelinka, Xingdi Yuan, Marc-Alexandre Côté, Romain Laroche, Adam Trischler

    Abstract: We are interested in learning how to update Knowledge Graphs (KG) from text. In this preliminary work, we propose a novel Sequence-to-Sequence (Seq2Seq) architecture to generate elementary KG operations. Furthermore, we introduce a new dataset for KG extraction built upon text-based game transitions (over 300k data points). We conduct experiments and discuss the results.

    Submitted 23 January, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019, Graph Representation Learning (GRL) Workshop

  41. arXiv:1910.08215  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    A Deep Learning-based Framework for the Detection of Schools of Herring in Echograms

    Authors: Alireza Rezvanifar, Tunai Porto Marques, Melissa Cote, Alexandra Branzan Albu, Alex Slonimer, Thomas Tolhurst, Kaan Ersahin, Todd Mudge, Stephane Gauthier

    Abstract: Tracking the abundance of underwater species is crucial for understanding the effects of climate change on marine ecosystems. Biologists typically monitor underwater sites with echosounders and visualize data as 2D images (echograms); they interpret these data manually or semi-automatically, which is time-consuming and prone to inconsistencies. This paper proposes a deep learning framework for the…

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted to NeurIPS 2019 workshop on Tackling Climate Change with Machine Learning, Vancouver, Canada

  42. arXiv:1910.03880  [pdf, other]

    cs.LG cs.AI stat.ML

    Compatible features for Monotonic Policy Improvement

    Authors: Marcin B. Tomczak, Sergio Valcarcel Macua, Enrique Munoz de Cote, Peter Vrancx

    Abstract: Recent policy optimization approaches have achieved substantial empirical success by constructing surrogate optimization objectives. The Approximate Policy Iteration objective (Schulman et al., 2015a; Kakade and Langford, 2002) has become a standard optimization target for reinforcement learning problems. Using this objective in practice requires an estimator of the advantage function. Policy opti…

    Submitted 30 October, 2019; v1 submitted 9 October, 2019; originally announced October 2019.

  43. arXiv:1909.05398  [pdf, other]

    cs.AI cs.CL

    Interactive Fiction Games: A Colossal Adventure

    Authors: Matthew Hausknecht, Prithviraj Ammanabrolu, Marc-Alexandre Côté, Xingdi Yuan

    Abstract: A hallmark of human intelligence is the ability to understand and communicate with language. Interactive Fiction games are fully text-based simulation environments where a player issues text commands to effect change in the environment and progress through the story. We argue that IF games are an excellent testbed for studying language-based autonomous agents. In particular, IF games combine chall…

    Submitted 25 February, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  44. arXiv:1908.10909  [pdf, other]

    cs.CL cs.LG

    Interactive Language Learning by Question Answering

    Authors: Xingdi Yuan, Marc-Alexandre Cote, Jie Fu, Zhouhan Lin, Christopher Pal, Yoshua Bengio, Adam Trischler

    Abstract: Humans observe and interact with the world to acquire knowledge. However, most existing machine reading comprehension (MRC) tasks miss the interactive, information-seeking component of comprehension. Such tasks present models with static documents that contain all necessary information, usually concentrated in a single short substring. Thus, models can achieve strong performance through simple wor…

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: EMNLP 2019

  45. arXiv:1908.10449  [pdf, other]

    cs.CL cs.LG stat.ML

    Interactive Machine Comprehension with Information Seeking Agents

    Authors: Xingdi Yuan, Jie Fu, Marc-Alexandre Cote, Yi Tay, Christopher Pal, Adam Trischler

    Abstract: Existing machine reading comprehension (MRC) models do not scale effectively to real-world applications like web-level information retrieval and question answering (QA). We argue that this stems from the nature of MRC datasets: most of these are static environments wherein the supporting documents and all necessary information are fully observed. In this paper, we propose a simple method that refr…

    Submitted 16 April, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: ACL 2020

  46. arXiv:1906.08226  [pdf, other]

    cs.LG stat.ML

    Unsupervised State Representation Learning in Atari

    Authors: Ankesh Anand, Evan Racah, Sherjil Ozair, Yoshua Bengio, Marc-Alexandre Côté, R Devon Hjelm

    Abstract: State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporall…

    Submitted 5 November, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019; v6 fixes a broken figure reference

  47. arXiv:1905.06821  [pdf, other]

    stat.ML cs.LG

    Adaptive Sensor Placement for Continuous Spaces

    Authors: James A Grant, Alexis Boukouvalas, Ryan-Rhys Griffiths, David S Leslie, Sattar Vakili, Enrique Munoz de Cote

    Abstract: We consider the problem of adaptively placing sensors along an interval to detect stochastically-generated events. We present a new formulation of the problem as a continuum-armed bandit problem with feedback in the form of partial observations of realisations of an inhomogeneous Poisson process. We design a solution method by combining Thompson sampling with nonparametric inference via increasing…

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: 13 pages, accepted to ICML 2019

  48. arXiv:1904.10890  [pdf, other]

    stat.AP cs.LG

    Boosting insights in insurance tariff plans with tree-based machine learning methods

    Authors: Roel Henckaerts, Marie-Pier Côté, Katrien Antonio, Roel Verbelen

    Abstract: Pricing actuaries typically operate within the framework of generalized linear models (GLMs). With the upswing of data analytics, our study puts focus on machine learning methods to develop full tariff plans built from both the frequency and severity of claims. We adapt the loss functions used in the algorithms such that the specific characteristics of insurance data are carefully incorporated: hi…

    Submitted 2 March, 2020; v1 submitted 12 April, 2019; originally announced April 2019.

  49. arXiv:1901.10923  [pdf, other]

    cs.MA cs.GT

    Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative Systems

    Authors: David Mguni, Joel Jennings, Sergio Valcarcel Macua, Emilio Sison, Sofia Ceppi, Enrique Munoz de Cote

    Abstract: Many real-world systems such as taxi systems, traffic networks and smart grids involve self-interested actors that perform individual tasks in a shared environment. However, in such systems, the self-interested behaviour of agents produces welfare inefficient and globally suboptimal outcomes that are detrimental to all - some common examples are congestion in traffic networks, demand spikes for re…

    Submitted 30 January, 2019; originally announced January 2019.

  50. arXiv:1812.00855  [pdf, other]

    cs.LG cs.CL stat.ML

    Towards Solving Text-based Games by Producing Adaptive Action Spaces

    Authors: Ruo Yu Tao, Marc-Alexandre Côté, Xingdi Yuan, Layla El Asri

    Abstract: To solve a text-based game, an agent needs to formulate valid text commands for a given context and find the ones that lead to success. Recent attempts at solving text-based games with deep reinforcement learning have focused on the latter, i.e., learning to act optimally when valid actions are known in advance. In this work, we propose to tackle the first task and train a model that generates the…

    Submitted 3 December, 2018; originally announced December 2018.