Search | arXiv e-print repository

doi 10.1609/aaai.v37i6.25881

Exploiting Multiple Abstractions in Episodic RL via Reward Sha**

Authors: Roberto Cipollone, Giuseppe De Giacomo, Marco Favorito, Luca Iocchi, Fabio Patrizi

Abstract: One major limitation to the applicability of Reinforcement Learning (RL) to many practical domains is the large number of samples required to learn an optimal policy. To address this problem and improve learning efficiency, we consider a linear hierarchy of abstraction layers of the Markov Decision Process (MDP) underlying the target domain. Each layer is an MDP representing a coarser model of the… ▽ More One major limitation to the applicability of Reinforcement Learning (RL) to many practical domains is the large number of samples required to learn an optimal policy. To address this problem and improve learning efficiency, we consider a linear hierarchy of abstraction layers of the Markov Decision Process (MDP) underlying the target domain. Each layer is an MDP representing a coarser model of the one immediately below in the hierarchy. In this work, we propose a novel form of Reward Sha** where the solution obtained at the abstract level is used to offer rewards to the more concrete MDP, in such a way that the abstract solution guides the learning in the more complex domain. In contrast with other works in Hierarchical RL, our technique has few requirements in the design of the abstract models and it is also tolerant to modeling errors, thus making the proposed approach practical. We formally analyze the relationship between the abstract models and the exploration heuristic induced in the lower-level domain. Moreover, we prove that the method guarantees optimal convergence and we demonstrate its effectiveness experimentally. △ Less

Submitted 4 August, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

Comments: This is an extended version of the paper presented at AAAI 2023, https://doi.org/10.1609/aaai.v37i6.25881

ACM Class: I.2

arXiv:2212.12478 [pdf, other]

doi 10.1109/ACCESS.2022.3172939

Image Classification with Small Datasets: Overview and Benchmark

Authors: L. Brigato, B. Barz, L. Iocchi, J. Denzler

Abstract: Image classification with small datasets has been an active research area in the recent past. However, as research in this scope is still in its infancy, two key ingredients are missing for ensuring reliable and truthful progress: a systematic and extensive overview of the state of the art, and a common benchmark to allow for objective comparisons between published methods. This article addresses… ▽ More Image classification with small datasets has been an active research area in the recent past. However, as research in this scope is still in its infancy, two key ingredients are missing for ensuring reliable and truthful progress: a systematic and extensive overview of the state of the art, and a common benchmark to allow for objective comparisons between published methods. This article addresses both issues. First, we systematically organize and connect past studies to consolidate a community that is currently fragmented and scattered. Second, we propose a common benchmark that allows for an objective comparison of approaches. It consists of five datasets spanning various domains (e.g., natural images, medical imagery, satellite data) and data types (RGB, grayscale, multispectral). We use this benchmark to re-evaluate the standard cross-entropy baseline and ten existing methods published between 2017 and 2021 at renowned venues. Surprisingly, we find that thorough hyper-parameter tuning on held-out validation data results in a highly competitive baseline and highlights a stunted growth of performance over the years. Indeed, only a single specialized method dating back to 2019 clearly wins our benchmark and outperforms the baseline classifier. △ Less

Submitted 23 December, 2022; originally announced December 2022.

Comments: arXiv admin note: text overlap with arXiv:2108.13122

arXiv:2212.08659 [pdf]

A Hierarchical Framework for Collaborative Artificial Intelligence

Authors: James L. Crowley, Joëlle L Coutaz, Jasmin Grosinger, Javier Vázquez-Salceda, Cecilio Angulo, Alberto Sanfeliu, Luca Iocchi, Anthony G. Cohn

Abstract: We propose a hierarchical framework for collaborative intelligent systems. This framework organizes research challenges based on the nature of the collaborative activity and the information that must be shared, with each level building on capabilities provided by lower levels. We review research paradigms at each level, with a description of classical engineering-based approaches and modern altern… ▽ More We propose a hierarchical framework for collaborative intelligent systems. This framework organizes research challenges based on the nature of the collaborative activity and the information that must be shared, with each level building on capabilities provided by lower levels. We review research paradigms at each level, with a description of classical engineering-based approaches and modern alternatives based on machine learning, illustrated with a running example using a hypothetical personal service robot. We discuss cross-cutting issues that occur at all levels, focusing on the problem of communicating and sharing comprehension, the role of explanation and the social nature of collaboration. We conclude with a summary of research challenges and a discussion of the potential for economic and societal impact provided by technologies that enhance human abilities and empower people and society through collaboration with Intelligent Systems. △ Less

Submitted 14 December, 2022; originally announced December 2022.

Journal ref: IEEE Pervasive Computing, 2022

arXiv:2112.14480 [pdf, ps, other]

On some Foundational Aspects of Human-Centered Artificial Intelligence

Authors: Luciano Serafini, Raul Barbosa, Jasmin Grosinger, Luca Iocchi, Christian Napoli, Salvatore Rinzivillo, Jacques Robin, Alessandro Saffiotti, Teresa Scantamburlo, Peter Schueller, Paolo Traverso, Javier Vazquez-Salceda

Abstract: The burgeoning of AI has prompted recommendations that AI techniques should be "human-centered". However, there is no clear definition of what is meant by Human Centered Artificial Intelligence, or for short, HCAI. This paper aims to improve this situation by addressing some foundational aspects of HCAI. To do so, we introduce the term HCAI agent to refer to any physical or software computational… ▽ More The burgeoning of AI has prompted recommendations that AI techniques should be "human-centered". However, there is no clear definition of what is meant by Human Centered Artificial Intelligence, or for short, HCAI. This paper aims to improve this situation by addressing some foundational aspects of HCAI. To do so, we introduce the term HCAI agent to refer to any physical or software computational agent equipped with AI components and that interacts and/or collaborates with humans. This article identifies five main conceptual components that participate in an HCAI agent: Observations, Requirements, Actions, Explanations and Models. We see the notion of HCAI agent, together with its components and functions, as a way to bridge the technical and non-technical discussions on human-centered AI. In this paper, we focus our analysis on scenarios consisting of a single agent operating in dynamic environments in presence of humans. △ Less

Submitted 29 December, 2021; originally announced December 2021.

arXiv:2111.14493 [pdf, other]

On the Effectiveness of Neural Ensembles for Image Classification with Small Datasets

Authors: Lorenzo Brigato, Luca Iocchi

Abstract: Deep neural networks represent the gold standard for image classification. However, they usually need large amounts of data to reach superior performance. In this work, we focus on image classification problems with a few labeled examples per class and improve data efficiency by using an ensemble of relatively small networks. For the first time, our work broadly studies the existing concept of neu… ▽ More Deep neural networks represent the gold standard for image classification. However, they usually need large amounts of data to reach superior performance. In this work, we focus on image classification problems with a few labeled examples per class and improve data efficiency by using an ensemble of relatively small networks. For the first time, our work broadly studies the existing concept of neural ensembling in domains with small data, through extensive validation using popular datasets and architectures. We compare ensembles of networks to their deeper or wider single competitors given a total fixed computational budget. We show that ensembling relatively shallow networks is a simple yet effective technique that is generally better than current state-of-the-art approaches for learning from small datasets. Finally, we present our interpretation according to which neural ensembles are more sample efficient because they learn simpler functions. △ Less

Submitted 29 November, 2021; originally announced November 2021.

arXiv:2109.13561 [pdf, other]

A Strong Baseline for the VIPriors Data-Efficient Image Classification Challenge

Authors: Björn Barz, Lorenzo Brigato, Luca Iocchi, Joachim Denzler

Abstract: Learning from limited amounts of data is the hallmark of intelligence, requiring strong generalization and abstraction skills. In a machine learning context, data-efficient methods are of high practical importance since data collection and annotation are prohibitively expensive in many domains. Thus, coordinated efforts to foster progress in this area emerged recently, e.g., in the form of dedicat… ▽ More Learning from limited amounts of data is the hallmark of intelligence, requiring strong generalization and abstraction skills. In a machine learning context, data-efficient methods are of high practical importance since data collection and annotation are prohibitively expensive in many domains. Thus, coordinated efforts to foster progress in this area emerged recently, e.g., in the form of dedicated workshops and competitions. Besides a common benchmark, measuring progress requires strong baselines. We present such a strong baseline for data-efficient image classification on the VIPriors challenge dataset, which is a sub-sampled version of ImageNet-1k with 100 images per class. We do not use any methods tailored to data-efficient classification but only standard models and techniques as well as common competition tricks and thorough hyper-parameter tuning. Our baseline achieves 69.7% accuracy on the VIPriors image classification dataset and outperforms 50% of submissions to the VIPriors 2021 challenge. △ Less

Submitted 28 September, 2021; originally announced September 2021.

arXiv:2108.13122 [pdf, other]

Tune It or Don't Use It: Benchmarking Data-Efficient Image Classification

Authors: Lorenzo Brigato, Björn Barz, Luca Iocchi, Joachim Denzler

Abstract: Data-efficient image classification using deep neural networks in settings, where only small amounts of labeled data are available, has been an active research area in the recent past. However, an objective comparison between published methods is difficult, since existing works use different datasets for evaluation and often compare against untuned baselines with default hyper-parameters. We desig… ▽ More Data-efficient image classification using deep neural networks in settings, where only small amounts of labeled data are available, has been an active research area in the recent past. However, an objective comparison between published methods is difficult, since existing works use different datasets for evaluation and often compare against untuned baselines with default hyper-parameters. We design a benchmark for data-efficient image classification consisting of six diverse datasets spanning various domains (e.g., natural images, medical imagery, satellite data) and data types (RGB, grayscale, multispectral). Using this benchmark, we re-evaluate the standard cross-entropy baseline and eight methods for data-efficient deep learning published between 2017 and 2021 at renowned venues. For a fair and realistic comparison, we carefully tune the hyper-parameters of all methods on each dataset. Surprisingly, we find that tuning learning rate, weight decay, and batch size on a separate validation split results in a highly competitive baseline, which outperforms all but one specialized method and performs competitively to the remaining one. △ Less

Submitted 30 August, 2021; originally announced August 2021.

Comments: Accepted at the 2nd Visual Inductive Priors for Data-Efficient Deep Learning Workshop (ICCV 2021)

arXiv:2106.08308 [pdf]

Il progetto Lab2Go per la diffusione della pratica laboratoriale nelle Scuole Secondarie di II grado

Authors: Mirco Andreotti, Pia Astone, Donatella Campana, Antonella Cartoni, Fausto Casaburo, Francesca Cavanna, Gianluigi Cibinetto, Antonella Dalla Cort, Giulia De Bonis, Marta Della Seta, Francesca Di Mauro, Giuseppe Di Sciascio, Riccardo Faccini, Federica Favino, Luca Iocchi, Marcello Lissia, Giulia Morganti, Mauro Mancini, Giovanni Organtini, Francesco Pennazio, Francesco Piacentini, Alina Piras, Maria Ragosta, Lorenzo Roberti, Anna Rita Rossi , et al. (2 additional authors not shown)

Abstract: Even if laboratory practice is essential for all scientific branches of knowledge, it is often neglected at High School, due to lack of time and/or resources. To establish a closer contact between school and experimental sciences, the University Sapienza of Roma and the Istituto Nazionale di Fisica Nucleare (INFN) launched the Lab2Go project, with the goal of spreading laboratory practice among st… ▽ More Even if laboratory practice is essential for all scientific branches of knowledge, it is often neglected at High School, due to lack of time and/or resources. To establish a closer contact between school and experimental sciences, the University Sapienza of Roma and the Istituto Nazionale di Fisica Nucleare (INFN) launched the Lab2Go project, with the goal of spreading laboratory practice among students and teachers in high schools. △ Less

Submitted 20 November, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

Comments: in Italian

arXiv:2106.05018 [pdf, other]

RLupus: Cooperation through emergent communication in The Werewolf social deduction game

Authors: Nicolo' Brandizzi, Davide Grossi, Luca Iocchi

Abstract: This paper focuses on the emergence of communication to support cooperation in environments modeled as social deduction games (SDG), that are games where players communicate freely to deduce each others' hidden intentions. We first state the problem by giving a general formalization of SDG and a possible solution framework based on reinforcement learning. Next, we focus on a specific SDG, known as… ▽ More This paper focuses on the emergence of communication to support cooperation in environments modeled as social deduction games (SDG), that are games where players communicate freely to deduce each others' hidden intentions. We first state the problem by giving a general formalization of SDG and a possible solution framework based on reinforcement learning. Next, we focus on a specific SDG, known as The Werewolf, and study if and how various forms of communication influence the outcome of the game. Experimental results show that introducing a communication signal greatly increases the winning chances of a class of players. We also study the effect of the signal's length and range on the overall performance showing a non-linear relationship. △ Less

Submitted 8 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

arXiv:2105.05094 [pdf, other]

doi 10.1109/PerComWorkshops51409.2021.9431048

A Reinforcement Learning Environment for Multi-Service UAV-enabled Wireless Systems

Authors: Damiano Brunori, Stefania Colonnese, Francesca Cuomo, Luca Iocchi

Abstract: We design a multi-purpose environment for autonomous UAVs offering different communication services in a variety of application contexts (e.g., wireless mobile connectivity services, edge computing, data gathering). We develop the environment, based on OpenAI Gym framework, in order to simulate different characteristics of real operational environments and we adopt the Reinforcement Learning to ge… ▽ More We design a multi-purpose environment for autonomous UAVs offering different communication services in a variety of application contexts (e.g., wireless mobile connectivity services, edge computing, data gathering). We develop the environment, based on OpenAI Gym framework, in order to simulate different characteristics of real operational environments and we adopt the Reinforcement Learning to generate policies that maximize some desired performance.The quality of the resulting policies are compared with a simple baseline to evaluate the system and derive guidelines to adopt this technique in different use cases. The main contribution of this paper is a flexible and extensible OpenAI Gym environment, which allows to generate, evaluate, and compare policies for autonomous multi-drone systems in multi-service applications. This environment allows for comparative evaluation and benchmarking of different approaches in a variety of application contexts. △ Less

Submitted 11 May, 2021; originally announced May 2021.

Journal ref: 2021 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)

arXiv:2003.12843 [pdf, other]

A Close Look at Deep Learning with Small Data

Authors: L. Brigato, L. Iocchi

Abstract: In this work, we perform a wide variety of experiments with different deep learning architectures on datasets of limited size. According to our study, we show that model complexity is a critical factor when only a few samples per class are available. Differently from the literature, we show that in some configurations, the state of the art can be improved using low complexity models. For instance,… ▽ More In this work, we perform a wide variety of experiments with different deep learning architectures on datasets of limited size. According to our study, we show that model complexity is a critical factor when only a few samples per class are available. Differently from the literature, we show that in some configurations, the state of the art can be improved using low complexity models. For instance, in problems with scarce training samples and without data augmentation, low-complexity convolutional neural networks perform comparably well or better than state-of-the-art architectures. Moreover, we show that even standard data augmentation can boost recognition performance by large margins. This result suggests the development of more complex data generation/augmentation pipelines for cases when data is limited. Finally, we show that dropout, a widely used regularization technique, maintains its role as a good regularizer even when data is scarce. Our findings are empirically validated on the sub-sampled versions of popular CIFAR-10, Fashion-MNIST and, SVHN benchmarks. △ Less

Submitted 25 October, 2020; v1 submitted 28 March, 2020; originally announced March 2020.

Comments: Published at ICPR 2020

arXiv:1909.04812

Proceedings of the AI-HRI Symposium at AAAI-FSS 2019

Authors: Justin W. Hart, Nick DePalma, Richard G. Freedman, Luca Iocchi, Matteo Leonetti, Katrin Lohan, Ross Mead, Emmanuel Senft, Jivko Sinapov, Elin A. Topp, Tom Williams

Abstract: The past few years have seen rapid progress in the development of service robots. Universities and companies alike have launched major research efforts toward the deployment of ambitious systems designed to aid human operators performing a variety of tasks. These robots are intended to make those who may otherwise need to live in assisted care facilities more independent, to help workers perform t… ▽ More The past few years have seen rapid progress in the development of service robots. Universities and companies alike have launched major research efforts toward the deployment of ambitious systems designed to aid human operators performing a variety of tasks. These robots are intended to make those who may otherwise need to live in assisted care facilities more independent, to help workers perform their jobs, or simply to make life more convenient. Service robots provide a powerful platform on which to study Artificial Intelligence (AI) and Human-Robot Interaction (HRI) in the real world. Research sitting at the intersection of AI and HRI is crucial to the success of service robots if they are to fulfill their mission. This symposium seeks to highlight research enabling robots to effectively interact with people autonomously while modeling, planning, and reasoning about the environment that the robot operates in and the tasks that it must perform. AI-HRI deals with the challenge of interacting with humans in environments that are relatively unstructured or which are structured around people rather than machines, as well as the possibility that the robot may need to interact naturally with people rather than through teach pendants, programming, or similar interfaces. △ Less

Submitted 19 September, 2019; v1 submitted 10 September, 2019; originally announced September 2019.

Comments: HTML file with clickable links to papers - All papers have been reviewed by at least two reviewers in a single blind fashion - Symposium website: https://ai-hri.github.io/2019/

arXiv:1809.06606

Proceedings of the AI-HRI Symposium at AAAI-FSS 2018

Authors: Kalesha Bullard, Nick DePalma, Richard G. Freedman, Bradley Hayes, Luca Iocchi, Katrin Lohan, Ross Mead, Emmanuel Senft, Tom Williams

Abstract: The goal of the Interactive Learning for Artificial Intelligence (AI) for Human-Robot Interaction (HRI) symposium is to bring together the large community of researchers working on interactive learning scenarios for interactive robotics. While current HRI research involves investigating ways for robots to effectively interact with people, HRI's overarching goal is to develop robots that are autono… ▽ More The goal of the Interactive Learning for Artificial Intelligence (AI) for Human-Robot Interaction (HRI) symposium is to bring together the large community of researchers working on interactive learning scenarios for interactive robotics. While current HRI research involves investigating ways for robots to effectively interact with people, HRI's overarching goal is to develop robots that are autonomous while intelligently modeling and learning from humans. These goals greatly overlap with some central goals of AI and interactive machine learning, such that HRI is an extremely challenging problem domain for interactive learning and will elicit fresh problem areas for robotics research. Present-day AI research still does not widely consider situations for interacting directly with humans and within human-populated environments, which present inherent uncertainty in dynamics, structure, and interaction. We believe that the HRI community already offers a rich set of principles and observations that can be used to structure new models of interaction. The human-aware AI initiative has primarily been approached through human-in-the-loop methods that use people's data and feedback to improve refinement and performance of the algorithms, learned functions, and personalization. We thus believe that HRI is an important component to furthering AI and robotics research. △ Less

Submitted 18 September, 2018; originally announced September 2018.

Comments: HTML file with clickable links to papers - All papers have been reviewed by two reviewers and a meta reviewer in a single blind fashion - Symposium website: https://ai-hri.github.io/2018/

arXiv:1807.06333 [pdf, other]

Foundations for Restraining Bolts: Reinforcement Learning with LTLf/LDLf restraining specifications

Authors: Giuseppe De Giacomo, Luca Iocchi, Marco Favorito, Fabio Patrizi

Abstract: In this work we investigate on the concept of "restraining bolt", envisioned in Science Fiction. Specifically we introduce a novel problem in AI. We have two distinct sets of features extracted from the world, one by the agent and one by the authority imposing restraining specifications (the "restraining bolt"). The two sets are apparently unrelated since of interest to independent parties, howeve… ▽ More In this work we investigate on the concept of "restraining bolt", envisioned in Science Fiction. Specifically we introduce a novel problem in AI. We have two distinct sets of features extracted from the world, one by the agent and one by the authority imposing restraining specifications (the "restraining bolt"). The two sets are apparently unrelated since of interest to independent parties, however they both account for (aspects of) the same world. We consider the case in which the agent is a reinforcement learning agent on the first set of features, while the restraining bolt is specified logically using linear time logic on finite traces LTLf/LDLf over the second set of features. We show formally, and illustrate with examples, that, under general circumstances, the agent can learn while sha** its goals to suitably conform (as much as possible) to the restraining bolt specifications. △ Less

Submitted 11 November, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

Journal ref: ICAPS 2019: 128-136

arXiv:1606.03719 [pdf, other]

doi 10.1109/ECMR.2015.7324198

A Proposal for Semantic Map Representation and Evaluation

Authors: Roberto Capobianco, Jacopo Serafin, Johann Dichtl, Giorgio Grisetti, Luca Iocchi, Daniele Nardi

Abstract: Semantic map** is the incremental process of "map**" relevant information of the world (i.e., spatial information, temporal events, agents and actions) to a formal description supported by a reasoning engine. Current research focuses on learning the semantic of environments based on their spatial location, geometry and appearance. Many methods to tackle this problem have been proposed, but the… ▽ More Semantic map** is the incremental process of "map**" relevant information of the world (i.e., spatial information, temporal events, agents and actions) to a formal description supported by a reasoning engine. Current research focuses on learning the semantic of environments based on their spatial location, geometry and appearance. Many methods to tackle this problem have been proposed, but the lack of a uniform representation, as well as standard benchmarking suites, prevents their direct comparison. In this paper, we propose a standardization in the representation of semantic maps, by defining an easily extensible formalism to be used on top of metric maps of the environments. Based on this, we describe the procedure to build a dataset (based on real sensor data) for benchmarking semantic map** techniques, also hypothesizing some possible evaluation metrics. Nevertheless, by providing a tool for the construction of a semantic map ground truth, we aim at the contribution of the scientific community in acquiring data for populating the dataset. △ Less

Submitted 12 June, 2016; originally announced June 2016.

arXiv:1307.7351 [pdf]

Knowledge Representation for Robots through Human-Robot Interaction

Authors: Emanuele Bastianelli, Domenico Bloisi, Roberto Capobianco, Guglielmo Gemignani, Luca Iocchi, Daniele Nardi

Abstract: The representation of the knowledge needed by a robot to perform complex tasks is restricted by the limitations of perception. One possible way of overcoming this situation and designing "knowledgeable" robots is to rely on the interaction with the user. We propose a multi-modal interaction framework that allows to effectively acquire knowledge about the environment where the robot operates. In pa… ▽ More The representation of the knowledge needed by a robot to perform complex tasks is restricted by the limitations of perception. One possible way of overcoming this situation and designing "knowledgeable" robots is to rely on the interaction with the user. We propose a multi-modal interaction framework that allows to effectively acquire knowledge about the environment where the robot operates. In particular, in this paper we present a rich representation framework that can be automatically built from the metric map annotated with the indications provided by the user. Such a representation, allows then the robot to ground complex referential expressions for motion commands and to devise topological navigation plans to achieve the target locations. △ Less

Submitted 1 August, 2013; v1 submitted 28 July, 2013; originally announced July 2013.

Comments: Knowledge Representation and Reasoning in Robotics Workshop at ICLP 2013

Showing 1–16 of 16 results for author: Iocchi, L