-
Simultaneous Parameter Estimation and Variable Selection via the LN-CASS Prior
Authors:
William Thomson,
Sara Jabbari,
Angela Taylor,
Wiebke Arlt,
David Smith
Abstract:
We introduce a Bayesian prior distribution, the Logit-Normal continuous analogue of the spike-and-slab (LN-CASS), which enables flexible parameter estimation and variable/model selection in a variety of settings. We demonstrate its use and efficacy in three case studies -- a simulation study and two studies on real biological data from the fields of metabolomics and genomics. The prior allows the…
▽ More
We introduce a Bayesian prior distribution, the Logit-Normal continuous analogue of the spike-and-slab (LN-CASS), which enables flexible parameter estimation and variable/model selection in a variety of settings. We demonstrate its use and efficacy in three case studies -- a simulation study and two studies on real biological data from the fields of metabolomics and genomics. The prior allows the use of classical statistical models, which are easily interpretable and well-known to applied scientists, but performs comparably to common machine learning methods in terms of generalisability to previously unseen data.
△ Less
Submitted 3 October, 2018;
originally announced October 2018.
-
Fair Algorithms for Learning in Allocation Problems
Authors:
Hadi Elzayn,
Shahin Jabbari,
Christopher Jung,
Michael Kearns,
Seth Neel,
Aaron Roth,
Zachary Schutzman
Abstract:
Settings such as lending and policing can be modeled by a centralized agent allocating a resource (loans or police officers) amongst several groups, in order to maximize some objective (loans given that are repaid or criminals that are apprehended). Often in such problems fairness is also a concern. A natural notion of fairness, based on general principles of equality of opportunity, asks that con…
▽ More
Settings such as lending and policing can be modeled by a centralized agent allocating a resource (loans or police officers) amongst several groups, in order to maximize some objective (loans given that are repaid or criminals that are apprehended). Often in such problems fairness is also a concern. A natural notion of fairness, based on general principles of equality of opportunity, asks that conditional on an individual being a candidate for the resource, the probability of actually receiving it is approximately independent of the individual's group. In lending this means that equally creditworthy individuals in different racial groups have roughly equal chances of receiving a loan. In policing it means that two individuals committing the same crime in different districts would have roughly equal chances of being arrested.
We formalize this fairness notion for allocation problems and investigate its algorithmic consequences. Our main technical results include an efficient learning algorithm that converges to an optimal fair allocation even when the frequency of candidates (creditworthy individuals or criminals) in each group is unknown. The algorithm operates in a censored feedback model in which only the number of candidates who received the resource in a given allocation can be observed, rather than the true number of candidates. This models the fact that we do not learn the creditworthiness of individuals we do not give loans to nor learn about crimes committed if the police presence in a district is low.
As an application of our framework, we consider the predictive policing problem. The learning algorithm is trained on arrest data gathered from its own deployments on previous days, resulting in a potential feedback loop that our algorithm provably overcomes. We empirically investigate the performance of our algorithm on the Philadelphia Crime Incidents dataset.
△ Less
Submitted 14 November, 2018; v1 submitted 30 August, 2018;
originally announced August 2018.
-
A Convex Framework for Fair Regression
Authors:
Richard Berk,
Hoda Heidari,
Shahin Jabbari,
Matthew Joseph,
Michael Kearns,
Jamie Morgenstern,
Seth Neel,
Aaron Roth
Abstract:
We introduce a flexible family of fairness regularizers for (linear and logistic) regression problems. These regularizers all enjoy convexity, permitting fast optimization, and they span the rang from notions of group fairness to strong individual fairness. By varying the weight on the fairness regularizer, we can compute the efficient frontier of the accuracy-fairness trade-off on any given datas…
▽ More
We introduce a flexible family of fairness regularizers for (linear and logistic) regression problems. These regularizers all enjoy convexity, permitting fast optimization, and they span the rang from notions of group fairness to strong individual fairness. By varying the weight on the fairness regularizer, we can compute the efficient frontier of the accuracy-fairness trade-off on any given dataset, and we measure the severity of this trade-off via a numerical quantity we call the Price of Fairness (PoF). The centerpiece of our results is an extensive comparative study of the PoF across six different datasets in which fairness is a primary consideration.
△ Less
Submitted 7 June, 2017;
originally announced June 2017.
-
Fairness in Criminal Justice Risk Assessments: The State of the Art
Authors:
Richard Berk,
Hoda Heidari,
Shahin Jabbari,
Michael Kearns,
Aaron Roth
Abstract:
Objectives: Discussions of fairness in criminal justice risk assessments typically lack conceptual precision. Rhetoric too often substitutes for careful analysis. In this paper, we seek to clarify the tradeoffs between different kinds of fairness and between fairness and accuracy.
Methods: We draw on the existing literatures in criminology, computer science and statistics to provide an integrate…
▽ More
Objectives: Discussions of fairness in criminal justice risk assessments typically lack conceptual precision. Rhetoric too often substitutes for careful analysis. In this paper, we seek to clarify the tradeoffs between different kinds of fairness and between fairness and accuracy.
Methods: We draw on the existing literatures in criminology, computer science and statistics to provide an integrated examination of fairness and accuracy in criminal justice risk assessments. We also provide an empirical illustration using data from arraignments.
Results: We show that there are at least six kinds of fairness, some of which are incompatible with one another and with accuracy.
Conclusions: Except in trivial cases, it is impossible to maximize accuracy and fairness at the same time, and impossible simultaneously to satisfy all kinds of fairness. In practice, a major complication is different base rates across different legally protected groups. There is a need to consider challenging tradeoffs.
△ Less
Submitted 27 May, 2017; v1 submitted 27 March, 2017;
originally announced March 2017.