-
CNL2ASP: converting controlled natural language sentences into ASP
Authors:
Simone Caruso,
Carmine Dodaro,
Marco Maratea,
Marco Mochi,
Francesco Riccio
Abstract:
Answer Set Programming (ASP) is a popular declarative programming language for solving hard combinatorial problems. Although ASP has gained widespread acceptance in academic and industrial contexts, there are certain user groups who may find it more advantageous to employ a higher-level language that closely resembles natural language when specifying ASP programs. In this paper, we propose a novel…
▽ More
Answer Set Programming (ASP) is a popular declarative programming language for solving hard combinatorial problems. Although ASP has gained widespread acceptance in academic and industrial contexts, there are certain user groups who may find it more advantageous to employ a higher-level language that closely resembles natural language when specifying ASP programs. In this paper, we propose a novel tool, called CNL2ASP, for translating English sentences expressed in a controlled natural language (CNL) form into ASP. In particular, we first provide a definition of the type of sentences allowed by our CNL and their translation as ASP rules, and then exemplify the usage of the CNL for the specification of both synthetic and real-world combinatorial problems. Finally, we report the results of an experimental analysis conducted on the real-world problems to compare the performance of automatically generated encodings with the ones written by ASP practitioners, showing that our tool can obtain satisfactory performance on these benchmarks. Under consideration in Theory and Practice of Logic Programming (TPLP).
△ Less
Submitted 17 November, 2023;
originally announced November 2023.
-
RoSmEEry: Robotic Simulated Environment for Evaluation and Benchmarking of Semantic Map** Algorithms
Authors:
Sara Kaszuba,
Sandeep Reddy Sabbella,
Vincenzo Suriani,
Francesco Riccio,
Daniele Nardi
Abstract:
Human-robot interaction requires a common understanding of the operational environment, which can be provided by a representation that blends geometric and symbolic knowledge: a semantic map. Through a semantic map the robot can interpret user commands by grounding them to its sensory observations. Semantic map** is the process that builds such a representation. Despite being fundamental to enab…
▽ More
Human-robot interaction requires a common understanding of the operational environment, which can be provided by a representation that blends geometric and symbolic knowledge: a semantic map. Through a semantic map the robot can interpret user commands by grounding them to its sensory observations. Semantic map** is the process that builds such a representation. Despite being fundamental to enable cognition and high-level reasoning in robotics, semantic map** is a challenging task due to generalization to different scenarios and sensory data types. In fact, it is difficult to obtain a rich and accurate semantic map of the environment and of the objects therein. Moreover, to date, there are no frameworks that allow for a comparison of the performance in building semantic maps for a given environment. To tackle these issues we design RoSmEEry, a novel framework based on the Gazebo simulator, where we introduce an accessible and ready-to-use methodology for a systematic evaluation of semantic map** algorithms. We release our framework, as an open-source package, with multiple simulation environments with the aim to provide a general set-up to quantitatively measure the performances in acquiring semantic knowledge about the environment.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
DOP: Deep Optimistic Planning with Approximate Value Function Evaluation
Authors:
Francesco Riccio,
Roberto Capobianco,
Daniele Nardi
Abstract:
Research on reinforcement learning has demonstrated promising results in manifold applications and domains. Still, efficiently learning effective robot behaviors is very difficult, due to unstructured scenarios, high uncertainties, and large state dimensionality (e.g. multi-agent systems or hyper-redundant robots). To alleviate this problem, we present DOP, a deep model-based reinforcement learnin…
▽ More
Research on reinforcement learning has demonstrated promising results in manifold applications and domains. Still, efficiently learning effective robot behaviors is very difficult, due to unstructured scenarios, high uncertainties, and large state dimensionality (e.g. multi-agent systems or hyper-redundant robots). To alleviate this problem, we present DOP, a deep model-based reinforcement learning algorithm, which exploits action values to both (1) guide the exploration of the state space and (2) plan effective policies. Specifically, we exploit deep neural networks to learn Q-functions that are used to attack the curse of dimensionality during a Monte-Carlo tree search. Our algorithm, in fact, constructs upper confidence bounds on the learned value function to select actions optimistically. We implement and evaluate DOP on different scenarios: (1) a cooperative navigation problem, (2) a fetching task for a 7-DOF KUKA robot, and (3) a human-robot handover with a humanoid robot (both in simulation and real). The obtained results show the effectiveness of DOP in the chosen applications, where action values drive the exploration and reduce the computational demand of the planning process while achieving good performance.
△ Less
Submitted 22 March, 2018;
originally announced March 2018.
-
Q-CP: Learning Action Values for Cooperative Planning
Authors:
Francesco Riccio,
Roberto Capobianco,
Daniele Nardi
Abstract:
Research on multi-robot systems has demonstrated promising results in manifold applications and domains. Still, efficiently learning an effective robot behaviors is very difficult, due to unstructured scenarios, high uncertainties, and large state dimensionality (e.g. hyper-redundant and groups of robot). To alleviate this problem, we present Q-CP a cooperative model-based reinforcement learning a…
▽ More
Research on multi-robot systems has demonstrated promising results in manifold applications and domains. Still, efficiently learning an effective robot behaviors is very difficult, due to unstructured scenarios, high uncertainties, and large state dimensionality (e.g. hyper-redundant and groups of robot). To alleviate this problem, we present Q-CP a cooperative model-based reinforcement learning algorithm, which exploits action values to both (1) guide the exploration of the state space and (2) generate effective policies. Specifically, we exploit Q-learning to attack the curse-of-dimensionality in the iterations of a Monte-Carlo Tree Search. We implement and evaluate Q-CP on different stochastic cooperative (general-sum) games: (1) a simple cooperative navigation problem among 3 robots, (2) a cooperation scenario between a pair of KUKA YouBots performing hand-overs, and (3) a coordination task between two mobile robots entering a door. The obtained results show the effectiveness of Q-CP in the chosen applications, where action values drive the exploration and reduce the computational demand of the planning process while achieving good performance.
△ Less
Submitted 1 March, 2018;
originally announced March 2018.
-
Learning Human-Robot Handovers Through $π$-STAM: Policy Improvement With Spatio-Temporal Affordance Maps
Authors:
Francesco Riccio,
Roberto Capobianco,
Daniele Nardi
Abstract:
Human-robot handovers are characterized by high uncertainty and poor structure of the problem that make them difficult tasks. While machine learning methods have shown promising results, their application to problems with large state dimensionality, such as in the case of humanoid robots, is still limited. Additionally, by using these methods and during the interaction with the human operator, no…
▽ More
Human-robot handovers are characterized by high uncertainty and poor structure of the problem that make them difficult tasks. While machine learning methods have shown promising results, their application to problems with large state dimensionality, such as in the case of humanoid robots, is still limited. Additionally, by using these methods and during the interaction with the human operator, no guarantees can be obtained on the correct interpretation of spatial constraints (e.g., from social rules). In this paper, we present Policy Improvement with Spatio-Temporal Affordance Maps -- $π$-STAM, a novel iterative algorithm to learn spatial affordances and generate robot behaviors. Our goal consists in generating a policy that adapts to the unknown action semantics by using affordances. In this way, while learning to perform a human-robot handover task, we can (1) efficiently generate good policies with few training episodes, and (2) easily encode action semantics and, if available, enforce prior knowledge in it. We experimentally validate our approach both in simulation and on a real NAO robot whose task consists in taking an object from the hands of a human. The obtained results show that our algorithm obtains a good policy while reducing the computational load and time duration of the learning process.
△ Less
Submitted 15 October, 2016; v1 submitted 8 October, 2016;
originally announced October 2016.
-
Results of the ASY-EOS experiment at GSI: The symmetry energy at suprasaturation density
Authors:
P. Russotto,
S. Gannon,
S. Kupny,
P. Lasko,
L. Acosta,
M. Adamczyk,
A. Al-Ajlan,
M. Al-Garawi,
S. Al-Homaidhi,
F. Amorini,
L. Auditore,
T. Aumann,
Y. Ayyad,
Z. Basrak,
J. Benlliure,
M. Boisjoli,
K. Boretzky,
J. Brzychczyk,
A. Budzanowski,
C. Caesar,
G. Cardella,
P. Cammarata,
Z. Chajecki,
M. Chartier,
A. Chbihi
, et al. (67 additional authors not shown)
Abstract:
Directed and elliptic flows of neutrons and light charged particles were measured for the reaction 197Au+197Au at 400 MeV/nucleon incident energy within the ASY-EOS experimental campaign at the GSI laboratory. The detection system consisted of the Large Area Neutron Detector LAND, combined with parts of the CHIMERA multidetector, of the ALADIN Time-of-flight Wall, and of the Washington-University…
▽ More
Directed and elliptic flows of neutrons and light charged particles were measured for the reaction 197Au+197Au at 400 MeV/nucleon incident energy within the ASY-EOS experimental campaign at the GSI laboratory. The detection system consisted of the Large Area Neutron Detector LAND, combined with parts of the CHIMERA multidetector, of the ALADIN Time-of-flight Wall, and of the Washington-University Microball detector. The latter three arrays were used for the event characterization and reaction-plane reconstruction. In addition, an array of triple telescopes, KRATTA, was used for complementary measurements of the isotopic composition and flows of light charged particles. From the comparison of the elliptic flow ratio of neutrons with respect to charged particles with UrQMD predictions, a value γ= 0.72 \pm 0.19 is obtained for the power-law coefficient describing the density dependence of the potential part in the parametrization of the symmetry energy. It represents a new and more stringent constraint for the regime of supra-saturation density and confirms, with a considerably smaller uncertainty, the moderately soft to linear density dependence deduced from the earlier FOPI-LAND data. The densities probed are shown to reach beyond twice saturation.
△ Less
Submitted 27 September, 2016; v1 submitted 15 August, 2016;
originally announced August 2016.
-
STAM: A Framework for Spatio-Temporal Affordance Maps
Authors:
Francesco Riccio,
Roberto Capobianco,
Marc Hanheide,
Daniele Nardi
Abstract:
Affordances have been introduced in literature as action opportunities that objects offer, and used in robotics to semantically represent their interconnection. However, when considering an environment instead of an object, the problem becomes more complex due to the dynamism of its state. To tackle this issue, we introduce the concept of Spatio-Temporal Affordances (STA) and Spatio-Temporal Affor…
▽ More
Affordances have been introduced in literature as action opportunities that objects offer, and used in robotics to semantically represent their interconnection. However, when considering an environment instead of an object, the problem becomes more complex due to the dynamism of its state. To tackle this issue, we introduce the concept of Spatio-Temporal Affordances (STA) and Spatio-Temporal Affordance Map (STAM). Using this formalism, we encode action semantics related to the environment to improve task execution capabilities of an autonomous robot. We experimentally validate our approach to support the execution of robot tasks by showing that affordances encode accurate semantics of the environment.
△ Less
Submitted 1 July, 2016;
originally announced July 2016.
-
Using Monte Carlo Search With Data Aggregation to Improve Robot Soccer Policies
Authors:
Francesco Riccio,
Roberto Capobianco,
Daniele Nardi
Abstract:
RoboCup soccer competitions are considered among the most challenging multi-robot adversarial environments, due to their high dynamism and the partial observability of the environment. In this paper we introduce a method based on a combination of Monte Carlo search and data aggregation (MCSDA) to adapt discrete-action soccer policies for a defender robot to the strategy of the opponent team. By ex…
▽ More
RoboCup soccer competitions are considered among the most challenging multi-robot adversarial environments, due to their high dynamism and the partial observability of the environment. In this paper we introduce a method based on a combination of Monte Carlo search and data aggregation (MCSDA) to adapt discrete-action soccer policies for a defender robot to the strategy of the opponent team. By exploiting a simple representation of the domain, a supervised learning algorithm is trained over an initial collection of data consisting of several simulations of human expert policies. Monte Carlo policy rollouts are then generated and aggregated to previous data to improve the learned policy over multiple epochs and games. The proposed approach has been extensively tested both on a soccer-dedicated simulator and on real robots. Using this method, our learning robot soccer team achieves an improvement in ball interceptions, as well as a reduction in the number of opponents' goals. Together with a better performance, an overall more efficient positioning of the whole team within the field is achieved.
△ Less
Submitted 1 June, 2016;
originally announced June 2016.
-
The ASY-EOS experiment at GSI: investigating the symmetry energy at supra-saturation densities
Authors:
P. Russotto,
M. Chartier,
E. De Filippo,
A. Le Févre,
S. Gannon,
I. Gašparić,
M. Kiš,
S. Kupny,
Y. Leifels,
R. C. Lemmon,
J. Łukasik,
P. Marini,
A. Pagano,
P. Pawłowski,
S. Santoro,
W. Trautmann,
M. Veselsky,
L. Acosta,
M. Adamczyk,
A. Al-Ajlan,
M. Al-Garawi,
S. Al-Homaidhi,
F. Amorini,
L. Auditore,
T. Aumann
, et al. (67 additional authors not shown)
Abstract:
The elliptic-flow ratio of neutrons with respect to protons in reactions of neutron rich heavy-ions systems at intermediate energies has been proposed as an observable sensitive to the strength of the symmetry term in the nuclear Equation Of State (EOS) at supra-saturation densities. The recent results obtained from the existing FOPI/LAND data for $^{197}$Au+$^{197}$Au collisions at 400 MeV/nucleo…
▽ More
The elliptic-flow ratio of neutrons with respect to protons in reactions of neutron rich heavy-ions systems at intermediate energies has been proposed as an observable sensitive to the strength of the symmetry term in the nuclear Equation Of State (EOS) at supra-saturation densities. The recent results obtained from the existing FOPI/LAND data for $^{197}$Au+$^{197}$Au collisions at 400 MeV/nucleon in comparison with the UrQMD model allowed a first estimate of the symmetry term of the EOS but suffer from a considerable statistical uncertainty. In order to obtain an improved data set for Au+Au collisions and to extend the study to other systems, a new experiment was carried out at the GSI laboratory by the ASY-EOS collaboration in May 2011.
△ Less
Submitted 26 September, 2012;
originally announced September 2012.