-
Model-agnostic bias mitigation methods with regressor distribution control for Wasserstein-based fairness metrics
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Ryan Franks,
Arjun Ravi Kannan
Abstract:
This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to…
▽ More
This article is a companion paper to our earlier work Miroshnikov et al. (2021) on fairness interpretability, which introduces bias explanations. In the current work, we propose a bias mitigation methodology based upon the construction of post-processed models with fairer regressor distributions for Wasserstein-based fairness metrics. By identifying the list of predictors contributing the most to the bias, we reduce the dimensionality of the problem by mitigating the bias originating from those predictors. The post-processing methodology involves resha** the predictor distributions by balancing the positive and negative bias explanations and allows for the regressor bias to decrease. We design an algorithm that uses Bayesian optimization to construct the bias-performance efficient frontier over the family of post-processed models, from which an optimal model is selected. Our novel methodology performs optimization in low-dimensional spaces and avoids expensive model retraining.
△ Less
Submitted 19 November, 2021;
originally announced November 2021.
-
Wasserstein-based fairness interpretability framework for machine learning models
Authors:
Alexey Miroshnikov,
Konstandinos Kotsiopoulos,
Ryan Franks,
Arjun Ravi Kannan
Abstract:
The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the fa…
▽ More
The objective of this article is to introduce a fairness interpretability framework for measuring and explaining the bias in classification and regression models at the level of a distribution. In our work, we measure the model bias across sub-population distributions in the model output using the Wasserstein metric. To properly quantify the contributions of predictors, we take into account the favorability of both the model and predictors with respect to the non-protected class. The quantification is accomplished by the use of transport theory, which gives rise to the decomposition of the model bias and bias explanations to positive and negative contributions. To gain more insight into the role of favorability and allow for additivity of bias explanations, we adapt techniques from cooperative game theory.
△ Less
Submitted 8 March, 2022; v1 submitted 5 November, 2020;
originally announced November 2020.
-
The fluid dynamics of collective vortex structures of plant-animal worms
Authors:
George T. Fortune,
Alan Worley,
Ana B. Sendova-Franks,
Nigel R. Franks,
Kyriacos C. Leptos,
Eric Lauga,
Raymond E. Goldstein
Abstract:
Circular milling, a stunning manifestation of collective motion, is found across the natural world, from fish shoals to army ants. It has been observed recently that the plant-animal worm $Symsagittifera~roscoffensis$ exhibits circular milling behaviour, both in shallow pools at the beach and in Petri dishes in the laboratory. Here we investigate this phenomenon, through experiment and theory, fro…
▽ More
Circular milling, a stunning manifestation of collective motion, is found across the natural world, from fish shoals to army ants. It has been observed recently that the plant-animal worm $Symsagittifera~roscoffensis$ exhibits circular milling behaviour, both in shallow pools at the beach and in Petri dishes in the laboratory. Here we investigate this phenomenon, through experiment and theory, from a fluid dynamical viewpoint, focusing on the effect that an established circular mill has on the surrounding fluid. Unlike systems such as confined bacterial suspensions and collections of molecular motors and filaments that exhibit spontaneous circulatory behaviour, and which are modelled as force dipoles, the front-back symmetry of individual worms precludes a stresslet contribution. Instead, singularities such as source dipoles and Stokes quadrupoles are expected to dominate. A series of models is analyzed to understand the contributions of these singularities to the azimuthal flow fields generated by a mill, in light of the particular boundary conditions that hold for flow in a Petri dish. A model that treats a circular mill as a rigid rotating disc that generates a Stokes flow is shown to capture basic experimental results well, and gives insights into the emergence and stability of multiple mill systems.
△ Less
Submitted 5 November, 2020; v1 submitted 4 November, 2020;
originally announced November 2020.
-
A statistical method for revealing form-function relations in biological networks
Authors:
Andrew Mugler,
Boris Grinshpun,
Riley Franks,
Chris H. Wiggins
Abstract:
Over the past decade, a number of researchers in systems biology have sought to relate the function of biological systems to their network-level descriptions -- lists of the most important players and the pairwise interactions between them. Both for large networks (in which statistical analysis is often framed in terms of the abundance of repeated small subgraphs) and for small networks which can…
▽ More
Over the past decade, a number of researchers in systems biology have sought to relate the function of biological systems to their network-level descriptions -- lists of the most important players and the pairwise interactions between them. Both for large networks (in which statistical analysis is often framed in terms of the abundance of repeated small subgraphs) and for small networks which can be analyzed in greater detail (or even synthesized in vivo and subjected to experiment), revealing the relationship between the topology of small subgraphs and their biological function has been a central goal. We here seek to pose this revelation as a statistical task, illustrated using a particular setup which has been constructed experimentally and for which parameterized models of transcriptional regulation have been studied extensively. The question "how does function follow form" is here mathematized by identifying which topological attributes correlate with the diverse possible information-processing tasks which a transcriptional regulatory network can realize. The resulting method reveals one form-function relationship which had earlier been predicted based on analytic results, and reveals a second for which we can provide an analytic interpretation. Resulting source code is distributed via http://formfunction.sourceforge.net.
△ Less
Submitted 30 November, 2010;
originally announced December 2010.