-
Towards Human-AI Complementarity with Predictions Sets
Authors:
Giovanni De Toni,
Nastaran Okati,
Suhas Thejaswi,
Eleni Straitouri,
Manuel Gomez-Rodriguez
Abstract:
Decision support systems based on prediction sets have proven to be effective at hel** human experts solve classification tasks. Rather than providing single-label predictions, these systems provide sets of label predictions constructed using conformal prediction, namely prediction sets, and ask human experts to predict label values from these sets. In this paper, we first show that the predicti…
▽ More
Decision support systems based on prediction sets have proven to be effective at hel** human experts solve classification tasks. Rather than providing single-label predictions, these systems provide sets of label predictions constructed using conformal prediction, namely prediction sets, and ask human experts to predict label values from these sets. In this paper, we first show that the prediction sets constructed using conformal prediction are, in general, suboptimal in terms of average accuracy. Then, we show that the problem of finding the optimal prediction sets under which the human experts achieve the highest average accuracy is NP-hard. More strongly, unless P = NP, we show that the problem is hard to approximate to any factor less than the size of the label set. However, we introduce a simple and efficient greedy algorithm that, for a large class of expert models and non-conformity scores, is guaranteed to find prediction sets that provably offer equal or greater performance than those constructed using conformal prediction. Further, using a simulation study with both synthetic and real expert predictions, we demonstrate that, in practice, our greedy algorithm finds near-optimal prediction sets offering greater performance than conformal prediction.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Exploiting Preference Elicitation in Interactive and User-centered Algorithmic Recourse: An Initial Exploration
Authors:
Seyedehdelaram Esfahani,
Giovanni De Toni,
Bruno Lepri,
Andrea Passerini,
Katya Tentori,
Massimo Zancanaro
Abstract:
Algorithmic Recourse aims to provide actionable explanations, or recourse plans, to overturn potentially unfavourable decisions taken by automated machine learning models. In this paper, we propose an interaction paradigm based on a guided interaction pattern aimed at both eliciting the users' preferences and heading them toward effective recourse interventions. In a fictional task of money lendin…
▽ More
Algorithmic Recourse aims to provide actionable explanations, or recourse plans, to overturn potentially unfavourable decisions taken by automated machine learning models. In this paper, we propose an interaction paradigm based on a guided interaction pattern aimed at both eliciting the users' preferences and heading them toward effective recourse interventions. In a fictional task of money lending, we compare this approach with an exploratory interaction pattern based on a combination of alternative plans and the possibility of freely changing the configurations by the users themselves. Our results suggest that users may recognize that the guided interaction paradigm improves efficiency. However, they also feel less freedom to experiment with "what-if" scenarios. Nevertheless, the time spent on the purely exploratory interface tends to be perceived as a lack of efficiency, which reduces attractiveness, perspicuity, and dependability. Conversely, for the guided interface, more time on the interface seems to increase its attractiveness, perspicuity, and dependability while not impacting the perceived efficiency. That might suggest that this type of interfaces should combine these two approaches by trying to support exploratory behavior while gently pushing toward a guided effective solution.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
AMICO-COSMOS galaxy cluster and group catalogue up to $z = 2$: Sample properties and X-ray counterparts
Authors:
Greta Toni,
Matteo Maturi,
Alexis Finoguenov,
Lauro Moscardini,
Gianluca Castignani
Abstract:
We present a new galaxy cluster search in the COSMOS field through the use of the Adaptive Matched Identifier of Clustered Objects (AMICO). We produced a new cluster and group catalogue up to $z=2$, by performing an innovative application of AMICO with respect to previous successful applications to wide-field surveys, in terms of depth (down to $r < 26.7$), small area covered ($1.69 deg^2$ of unma…
▽ More
We present a new galaxy cluster search in the COSMOS field through the use of the Adaptive Matched Identifier of Clustered Objects (AMICO). We produced a new cluster and group catalogue up to $z=2$, by performing an innovative application of AMICO with respect to previous successful applications to wide-field surveys, in terms of depth (down to $r < 26.7$), small area covered ($1.69 deg^2$ of unmasked area) and redshift extent. This sample, and the comparative analysis we performed with the X-rays, allowed for the calibration of mass-proxy scaling relations up to $z=2$ and down to less than $10^{13} M_{sun}$ and constitutes the base for the refinement of the cluster model for future applications of AMICO, like the analysis of upcoming Euclid data. AMICO is based on an optimal linear matched filter and detects clusters in photometric galaxy catalogues using galaxy location, photometric redshift and, in the simplest case, one galaxy property. We used one magnitude as galaxy property, avoiding explicit use of galaxy colour, and performed 3 independent runs in the r, Y and H bands using both COSMOS2020 and COSMOS2015 galaxy catalogues. The final catalogue resulting from matching the results of the three runs contains 1269 and 666 candidate clusters with $S/N >3.0$ and $>3.5$, respectively. Most of the unmatched ones have $S/N <3.5$ which can be chosen as cut for a more robust sample. We assigned X-ray properties to our detections via matching with a public X-ray group sample and by estimating, for unmatched detections, X-ray properties at the location of AMICO candidates based on Chandra+XMM-Newton data. 622 are the candidates with X-ray flux estimate. This large sample allowed for the calibration of the scaling relations between AMICO mass-proxies and X-ray mass and the study of their redshift dependence for the selection of the most stable photometric bands.
△ Less
Submitted 19 December, 2023;
originally announced December 2023.
-
Personalized Algorithmic Recourse with Preference Elicitation
Authors:
Giovanni De Toni,
Paolo Viappiani,
Stefano Teso,
Bruno Lepri,
Andrea Passerini
Abstract:
Algorithmic Recourse (AR) is the problem of computing a sequence of actions that -- once performed by a user -- overturns an undesirable machine decision. It is paramount that the sequence of actions does not require too much effort for users to implement. Yet, most approaches to AR assume that actions cost the same for all users, and thus may recommend unfairly expensive recourse plans to certain…
▽ More
Algorithmic Recourse (AR) is the problem of computing a sequence of actions that -- once performed by a user -- overturns an undesirable machine decision. It is paramount that the sequence of actions does not require too much effort for users to implement. Yet, most approaches to AR assume that actions cost the same for all users, and thus may recommend unfairly expensive recourse plans to certain users. Prompted by this observation, we introduce PEAR, the first human-in-the-loop approach capable of providing personalized algorithmic recourse tailored to the needs of any end-user. PEAR builds on insights from Bayesian Preference Elicitation to iteratively refine an estimate of the costs of actions by asking choice set queries to the target user. The queries themselves are computed by maximizing the Expected Utility of Selection, a principled measure of information gain accounting for uncertainty on both the cost estimate and the user's responses. PEAR integrates elicitation into a Reinforcement Learning agent coupled with Monte Carlo Tree Search to quickly identify promising recourse plans. Our empirical evaluation on real-world datasets highlights how PEAR produces high-quality personalized recourse in only a handful of iterations.
△ Less
Submitted 23 January, 2024; v1 submitted 26 May, 2022;
originally announced May 2022.
-
Synthesizing explainable counterfactual policies for algorithmic recourse with program synthesis
Authors:
Giovanni De Toni,
Bruno Lepri,
Andrea Passerini
Abstract:
Being able to provide counterfactual interventions - sequences of actions we would have had to take for a desirable outcome to happen - is essential to explain how to change an unfavourable decision by a black-box machine learning model (e.g., being denied a loan request). Existing solutions have mainly focused on generating feasible interventions without providing explanations on their rationale.…
▽ More
Being able to provide counterfactual interventions - sequences of actions we would have had to take for a desirable outcome to happen - is essential to explain how to change an unfavourable decision by a black-box machine learning model (e.g., being denied a loan request). Existing solutions have mainly focused on generating feasible interventions without providing explanations on their rationale. Moreover, they need to solve a separate optimization problem for each user. In this paper, we take a different approach and learn a program that outputs a sequence of explainable counterfactual actions given a user description and a causal graph. We leverage program synthesis techniques, reinforcement learning coupled with Monte Carlo Tree Search for efficient exploration, and rule learning to extract explanations for each recommended action. An experimental evaluation on synthetic and real-world datasets shows how our approach generates effective interventions by making orders of magnitude fewer queries to the black-box classifier with respect to existing solutions, with the additional benefit of complementing them with interpretable explanations.
△ Less
Submitted 12 October, 2022; v1 submitted 18 January, 2022;
originally announced January 2022.
-
Learning compositional programs with arguments and sampling
Authors:
Giovanni De Toni,
Luca Erculiani,
Andrea Passerini
Abstract:
One of the most challenging goals in designing intelligent systems is empowering them with the ability to synthesize programs from data. Namely, given specific requirements in the form of input/output pairs, the goal is to train a machine learning model to discover a program that satisfies those requirements. A recent class of methods exploits combinatorial search procedures and deep learning to l…
▽ More
One of the most challenging goals in designing intelligent systems is empowering them with the ability to synthesize programs from data. Namely, given specific requirements in the form of input/output pairs, the goal is to train a machine learning model to discover a program that satisfies those requirements. A recent class of methods exploits combinatorial search procedures and deep learning to learn compositional programs. However, they usually generate only toy programs using a domain-specific language that does not provide any high-level feature, such as function arguments, which reduces their applicability in real-world settings. We extend upon a state of the art model, AlphaNPI, by learning to generate functions that can accept arguments. This improvement will enable us to move closer to real computer programs. Moreover, we investigate employing an Approximate version of Monte Carlo Tree Search (A-MCTS) to speed up convergence. We showcase the potential of our approach by learning the Quicksort algorithm, showing how the ability to deal with arguments is crucial for learning and generalization.
△ Less
Submitted 15 October, 2021; v1 submitted 1 September, 2021;
originally announced September 2021.
-
A general method for estimating the prevalence of Influenza-Like-Symptoms with Wikipedia data
Authors:
Giovanni De Toni,
Cristian Consonni,
Alberto Montresor
Abstract:
Influenza is an acute respiratory seasonal disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Being able to estimate in a fast and reliable way the impact of an illness on a given country is essential to plan and organize effective countermeasures, which is now possible by leveraging unconventional data sources like web searches and visits. In this st…
▽ More
Influenza is an acute respiratory seasonal disease that affects millions of people worldwide and causes thousands of deaths in Europe alone. Being able to estimate in a fast and reliable way the impact of an illness on a given country is essential to plan and organize effective countermeasures, which is now possible by leveraging unconventional data sources like web searches and visits. In this study, we show the feasibility of exploiting information about Wikipedia's page views of a selected group of articles and machine learning models to obtain accurate estimates of influenza-like illnesses incidence in four European countries: Italy, Germany, Belgium, and the Netherlands. We propose a novel language-agnostic method, based on two algorithms, Personalized PageRank and CycleRank, to automatically select the most relevant Wikipedia pages to be monitored without the need for expert supervision. We then show how our model is able to reach state-of-the-art results by comparing it with previous solutions.
△ Less
Submitted 28 October, 2020;
originally announced October 2020.