Skip to main content

Showing 1–50 of 62 results for author: Kearns, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20272  [pdf, other

    cs.LG cs.CR

    Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

    Authors: Martin Bertran, Shuai Tang, Michael Kearns, Jamie Morgenstern, Aaron Roth, Zhiwei Steven Wu

    Abstract: Machine unlearning is motivated by desire for data autonomy: a person can request to have their data's influence removed from deployed models, and those models should be updated as if they were retrained without the person's data. We show that, counter-intuitively, these updates expose individuals to high-accuracy reconstruction attacks which allow the attacker to recover their data in its entiret… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.16752  [pdf, other

    cs.LG cs.AI

    Model Ensembling for Constrained Optimization

    Authors: Ira Globus-Harris, Varun Gupta, Michael Kearns, Aaron Roth

    Abstract: There is a long history in machine learning of model ensembling, beginning with boosting and bagging and continuing to the present day. Much of this history has focused on combining models for classification and regression, but recently there is interest in more complex settings such as ensembling policies in reinforcement learning. Strong connections have also emerged between ensembling and multi… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  3. arXiv:2405.16739  [pdf, other

    cs.LG cs.AI eess.SY

    Oracle-Efficient Reinforcement Learning for Max Value Ensembles

    Authors: Marcel Hussing, Michael Kearns, Aaron Roth, Sikata Bela Sengupta, Jessica Sorrell

    Abstract: Reinforcement learning (RL) in large or infinite state spaces is notoriously challenging, both theoretically (where worst-case sample and computational complexities must scale with state space cardinality) and experimentally (where function approximation and policy gradient techniques often scale poorly and suffer from instability and high variance). One line of research attempting to address thes… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  4. arXiv:2402.10795  [pdf, other

    cs.LG cs.CY cs.HC

    Diversified Ensembling: An Experiment in Crowdsourced Machine Learning

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Pietro Perona, Aaron Roth

    Abstract: Crowdsourced machine learning on competition platforms such as Kaggle is a popular and often effective method for generating accurate models. Typically, teams vie for the most accurate model, as measured by overall error on a holdout set, and it is common towards the end of such competitions for teams at the top of the leaderboard to ensemble or average their models outside the platform mechanism… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  5. arXiv:2312.05140  [pdf, other

    cs.LG cs.CR

    Membership Inference Attacks on Diffusion Models via Quantile Regression

    Authors: Shuai Tang, Zhiwei Steven Wu, Sergul Aydore, Michael Kearns, Aaron Roth

    Abstract: Recently, diffusion models have become popular tools for image synthesis because of their high-quality outputs. However, like other large-scale models, they may leak private information about their training data. Here, we demonstrate a privacy vulnerability of diffusion models through a \emph{membership inference (MI) attack}, which aims to identify whether a target example belongs to the training… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  6. arXiv:2307.03694  [pdf, other

    cs.LG cs.AI cs.CR

    Scalable Membership Inference Attacks via Quantile Regression

    Authors: Martin Bertran, Shuai Tang, Michael Kearns, Jamie Morgenstern, Aaron Roth, Zhiwei Steven Wu

    Abstract: Membership inference attacks are designed to determine, using black box access to trained models, whether a particular example was used in training or not. Membership inference can be formalized as a hypothesis testing problem. The most effective existing attacks estimate the distribution of some test statistic (usually the model's confidence on the true label) on points that were (and were not) u… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  7. Balanced Filtering via Disclosure-Controlled Proxies

    Authors: Siqi Deng, Emily Diana, Michael Kearns, Aaron Roth

    Abstract: We study the problem of collecting a cohort or set that is balanced with respect to sensitive groups when group membership is unavailable or prohibited from use at deployment time. Specifically, our deployment-time collection mechanism does not reveal significantly more about the group membership of any individual sample than can be ascertained from base rates alone. To do this, we study a learner… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Journal ref: 5th Symposium on Foundations of Responsible Computing (FORC 2024)

  8. arXiv:2305.15284  [pdf, other

    cs.LG

    Replicable Reinforcement Learning

    Authors: Eric Eaton, Marcel Hussing, Michael Kearns, Jessica Sorrell

    Abstract: The replicability crisis in the social, behavioral, and data sciences has led to the formulation of algorithm frameworks for replicability -- i.e., a requirement that an algorithm produce identical outputs (with high probability) when run on two different samples from the same underlying distribution. While still in its infancy, provably replicable algorithms have been developed for many fundament… ▽ More

    Submitted 31 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  9. arXiv:2304.03545  [pdf, other

    cs.LG cs.CR

    AI Model Disgorgement: Methods and Choices

    Authors: Alessandro Achille, Michael Kearns, Carson Klingenberg, Stefano Soatto

    Abstract: Responsible use of data is an indispensable part of any machine learning (ML) implementation. ML developers must carefully collect and curate their datasets, and document their provenance. They must also make sure to respect intellectual property rights, preserve individual privacy, and use data in an ethical way. Over the past few years, ML models have significantly increased in size and complexi… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  10. arXiv:2303.03451  [pdf, other

    cs.LG cs.CR

    Improved Differentially Private Regression via Gradient Boosting

    Authors: Shuai Tang, Sergul Aydore, Michael Kearns, Saeyoung Rho, Aaron Roth, Yichen Wang, Yu-Xiang Wang, Zhiwei Steven Wu

    Abstract: We revisit the problem of differentially private squared error linear regression. We observe that existing state-of-the-art methods are sensitive to the choice of hyperparameters -- including the ``clip** threshold'' that cannot be set optimally in a data-independent way. We give a new algorithm for private linear regression based on gradient boosting. We show that our method consistently improv… ▽ More

    Submitted 20 May, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  11. arXiv:2301.13767  [pdf, other

    cs.LG cs.DS

    Multicalibration as Boosting for Regression

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Aaron Roth, Jessica Sorrell

    Abstract: We study the connection between multicalibration and boosting for squared error regression. First we prove a useful characterization of multicalibration in terms of a ``swap regret'' like condition on squared error. Using this characterization, we give an exceedingly simple algorithm that can be analyzed both as a boosting algorithm for regression and as a multicalibration algorithm for a class H… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Code available here: https://github.com/Declancharrison/Level-Set-Boosting

  12. arXiv:2211.03128  [pdf, other

    cs.CY cs.CR cs.LG

    Confidence-Ranked Reconstruction of Census Microdata from Published Statistics

    Authors: Travis Dick, Cynthia Dwork, Michael Kearns, Terrance Liu, Aaron Roth, Giuseppe Vietri, Zhiwei Steven Wu

    Abstract: A reconstruction attack on a private dataset $D$ takes as input some publicly accessible information about the dataset and produces a list of candidate elements of $D$. We introduce a new class of data reconstruction attacks based on randomized methods for non-convex optimization. We empirically demonstrate that our attacks can not only reconstruct full rows of $D$ from aggregate query statistics… ▽ More

    Submitted 6 February, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

  13. arXiv:2209.07400  [pdf, other

    cs.LG

    Private Synthetic Data for Multitask Learning and Marginal Queries

    Authors: Giuseppe Vietri, Cedric Archambeau, Sergul Aydore, William Brown, Michael Kearns, Aaron Roth, Ankit Siva, Shuai Tang, Zhiwei Steven Wu

    Abstract: We provide a differentially private algorithm for producing synthetic data simultaneously useful for multiple tasks: marginal queries and multitask machine learning (ML). A key innovation in our algorithm is the ability to directly handle numerical features, in contrast to a number of related prior approaches which require numerical features to be first converted into {high cardinality} categorica… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: The short version of this paper appears in the proceedings of NeurIPS-22

  14. arXiv:2209.07312  [pdf, other

    cs.LG cs.DS

    Multicalibrated Regression for Downstream Fairness

    Authors: Ira Globus-Harris, Varun Gupta, Christopher Jung, Michael Kearns, Jamie Morgenstern, Aaron Roth

    Abstract: We show how to take a regression function $\hat{f}$ that is appropriately ``multicalibrated'' and efficiently post-process it into an approximately error minimizing classifier satisfying a large variety of fairness constraints. The post-processing requires no labeled data, and only a modest amount of unlabeled data and computation. The computational and sample complexity requirements of computing… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  15. arXiv:2207.04192  [pdf, ps, other

    cs.GT

    Efficient Stackelberg Strategies for Finitely Repeated Games

    Authors: Natalie Collina, Eshwar Ram Arunachaleswaran, Michael Kearns

    Abstract: We study Stackelberg equilibria in finitely repeated games, where the leader commits to a strategy that picks actions in each round and can be adaptive to the history of play (i.e. they commit to an algorithm). In particular, we study static repeated games with no discounting. We give efficient algorithms for finding approximate Stackelberg equilibria in this setting, along with rates of convergen… ▽ More

    Submitted 6 March, 2024; v1 submitted 9 July, 2022; originally announced July 2022.

  16. arXiv:2203.11481  [pdf, other

    cs.CV cs.CR

    Mixed Differential Privacy in Computer Vision

    Authors: Aditya Golatkar, Alessandro Achille, Yu-Xiang Wang, Aaron Roth, Michael Kearns, Stefano Soatto

    Abstract: We introduce AdaMix, an adaptive differentially private algorithm for training deep neural network classifiers using both private and public image data. While pre-training language models on large public datasets has enabled strong differential privacy (DP) guarantees with minor loss of accuracy, a similar practice yields punishing trade-offs in vision tasks. A few-shot or even zero-shot learning… ▽ More

    Submitted 28 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  17. arXiv:2201.10408  [pdf, other

    cs.LG cs.CY cs.DS

    An Algorithmic Framework for Bias Bounties

    Authors: Ira Globus-Harris, Michael Kearns, Aaron Roth

    Abstract: We propose and analyze an algorithmic framework for "bias bounties": events in which external participants are invited to propose improvements to a trained model, akin to bug bounty events in software and security. Our framework allows participants to submit arbitrary subgroup improvements, which are then algorithmically incorporated into an updated model. Our algorithm has the property that there… ▽ More

    Submitted 9 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

  18. arXiv:2107.04423  [pdf, other

    cs.LG cs.DS

    Multiaccurate Proxies for Downstream Fairness

    Authors: Emily Diana, Wesley Gill, Michael Kearns, Krishnaram Kenthapadi, Aaron Roth, Saeed Sharifi-Malvajerdi

    Abstract: We study the problem of training a model that must obey demographic fairness conditions when the sensitive features are not available at training time -- in other words, how can we train a model to be fair by race when we don't have data about race? We adopt a fairness pipeline perspective, in which an "upstream" learner that does have access to the sensitive features will learn a proxy model for… ▽ More

    Submitted 25 January, 2022; v1 submitted 9 July, 2021; originally announced July 2021.

  19. arXiv:2103.06641  [pdf, other

    cs.LG cs.CR

    Differentially Private Query Release Through Adaptive Projection

    Authors: Sergul Aydore, William Brown, Michael Kearns, Krishnaram Kenthapadi, Luca Melis, Aaron Roth, Ankit Siva

    Abstract: We propose, implement, and evaluate a new algorithm for releasing answers to very large numbers of statistical queries like $k$-way marginals, subject to differential privacy. Our algorithm makes adaptive use of a continuous relaxation of the Projection Mechanism, which answers queries on the private dataset using simple perturbation, and then attempts to find the synthetic dataset that most close… ▽ More

    Submitted 23 June, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

  20. arXiv:2102.08454  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Lexicographically Fair Learning: Algorithms and Generalization

    Authors: Emily Diana, Wesley Gill, Ira Globus-Harris, Michael Kearns, Aaron Roth, Saeed Sharifi-Malvajerdi

    Abstract: We extend the notion of minimax fairness in supervised learning problems to its natural conclusion: lexicographic minimax fairness (or lexifairness for short). Informally, given a collection of demographic groups of interest, minimax fairness asks that the error of the group with the highest error be minimized. Lexifairness goes further and asks that amongst all minimax fair solutions, the error o… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  21. arXiv:2011.03108  [pdf, other

    cs.LG

    Minimax Group Fairness: Algorithms and Experiments

    Authors: Emily Diana, Wesley Gill, Michael Kearns, Krishnaram Kenthapadi, Aaron Roth

    Abstract: We consider a recently introduced framework in which fairness is measured by worst-case outcomes across groups, rather than by the more standard differences between group outcomes. In this framework we provide provably convergent oracle-efficient learning algorithms (or equivalently, reductions to non-fair learning) for minimax group fairness. Here the goal is that of minimizing the maximum loss a… ▽ More

    Submitted 7 March, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

  22. arXiv:2007.03661  [pdf

    cs.CY cs.HC

    Mathematical Foundations for Social Computing

    Authors: Yiling Chen, Arpita Ghosh, Michael Kearns, Tim Roughgarden, Jennifer Wortman Vaughan

    Abstract: Social computing encompasses the mechanisms through which people interact with computational systems: crowdsourcing systems, ranking and recommendation systems, online prediction markets, citizen science projects, and collaboratively edited wikis, to name a few. These systems share the common feature that humans are active participants, making choices that determine the input to, and therefore the… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: A Computing Community Consortium (CCC) workshop report, 15 pages

    Report number: ccc2014report_5

  23. arXiv:2006.07281  [pdf, other

    cs.LG cs.CE cs.GT stat.ML

    Algorithms and Learning for Fair Portfolio Design

    Authors: Emily Diana, Travis Dick, Hadi Elzayn, Michael Kearns, Aaron Roth, Zachary Schutzman, Saeed Sharifi-Malvajerdi, Juba Ziani

    Abstract: We consider a variation on the classical finance problem of optimal portfolio design. In our setting, a large population of consumers is drawn from some distribution over risk tolerances, and each consumer must be assigned to a portfolio of lower risk than her tolerance. The consumers may also belong to underlying groups (for instance, of demographic properties or wealth), and the goal is to desig… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  24. arXiv:2002.05699  [pdf, other

    cs.GT cs.CE

    Differentially Private Call Auctions and Market Impact

    Authors: Emily Diana, Hadi Elzayn, Michael Kearns, Aaron Roth, Saeed Sharifi-Malvajerdi, Juba Ziani

    Abstract: We propose and analyze differentially private (DP) mechanisms for call auctions as an alternative to the complex and ad-hoc privacy efforts that are common in modern electronic markets. We prove that the number of shares cleared in the DP mechanisms compares favorably to the non-private optimal and provide a matching lower bound. We analyze the incentive properties of our mechanisms and their beha… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  25. arXiv:1912.06202  [pdf, ps, other

    cs.GT cs.DS cs.LG q-fin.CP

    Optimal, Truthful, and Private Securities Lending

    Authors: Emily Diana, Michael Kearns, Seth Neel, Aaron Roth

    Abstract: We consider a fundamental dynamic allocation problem motivated by the problem of $\textit{securities lending}$ in financial markets, the mechanism underlying the short selling of stocks. A lender would like to distribute a finite number of identical copies of some scarce resource to $n$ clients, each of whom has a private demand that is unknown to the lender. The lender would like to maximize the… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

  26. arXiv:1906.00241  [pdf, ps, other

    cs.GT

    Network Formation under Random Attack and Probabilistic Spread

    Authors: Yu Chen, Shahin Jabbari, Michael Kearns, Sanjeev Khanna, Jamie Morgenstern

    Abstract: We study a network formation game where agents receive benefits by forming connections to other agents but also incur both direct and indirect costs from the formed connections. Specifically, once the agents have purchased their connections, an attack starts at a randomly chosen vertex in the network and spreads according to the independent cascade model with a fixed probability, destroying any in… ▽ More

    Submitted 1 June, 2019; originally announced June 2019.

    Comments: The short version of this paper appears in the proceedings of IJCAI-19

  27. arXiv:1905.10660  [pdf, other

    cs.LG stat.ML

    An Algorithmic Framework for Fairness Elicitation

    Authors: Christopher Jung, Michael Kearns, Seth Neel, Aaron Roth, Logan Stapleton, Zhiwei Steven Wu

    Abstract: We consider settings in which the right notion of fairness is not captured by simple mathematical definitions (such as equality of error rates across groups), but might be more complex and nuanced and thus require elicitation from individual or collective stakeholders. We introduce a framework in which pairs of individuals can be identified as requiring (approximately) equal treatment under a lear… ▽ More

    Submitted 14 October, 2020; v1 submitted 25 May, 2019; originally announced May 2019.

  28. arXiv:1905.10607  [pdf, other

    cs.LG cs.GT stat.ML

    Average Individual Fairness: Algorithms, Generalization and Experiments

    Authors: Michael Kearns, Aaron Roth, Saeed Sharifi-Malvajerdi

    Abstract: We propose a new family of fairness definitions for classification problems that combine some of the best properties of both statistical and individual notions of fairness. We posit not only a distribution over individuals, but also a distribution over (or collection of) classification tasks. We then ask that standard statistics (such as error or false positive/negative rates) be (approximately) e… ▽ More

    Submitted 16 December, 2019; v1 submitted 25 May, 2019; originally announced May 2019.

  29. arXiv:1905.08909  [pdf, other

    cs.GT cs.LG

    Equilibrium Characterization for Data Acquisition Games

    Authors: **shuo Dong, Hadi Elzayn, Shahin Jabbari, Michael Kearns, Zachary Schutzman

    Abstract: We study a game between two firms in which each provide a service based on machine learning. The firms are presented with the opportunity to purchase a new corpus of data, which will allow them to potentially improve the quality of their products. The firms can decide whether or not they want to buy the data, as well as which learning model to build with that data. We demonstrate a reduction from… ▽ More

    Submitted 23 May, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: The short version of this paper appears in the proceedings of IJCAI-19

  30. arXiv:1812.02696  [pdf, other

    cs.LG cs.DS cs.GT stat.ML

    Differentially Private Fair Learning

    Authors: Matthew Jagielski, Michael Kearns, Jieming Mao, Alina Oprea, Aaron Roth, Saeed Sharifi-Malvajerdi, Jonathan Ullman

    Abstract: Motivated by settings in which predictive models may be required to be non-discriminatory with respect to certain attributes (such as race), but even collecting the sensitive attribute may be forbidden or restricted, we initiate the study of fair learning under the constraint of differential privacy. We design two learning algorithms that simultaneously promise differential privacy and equalized o… ▽ More

    Submitted 31 May, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

  31. arXiv:1808.10549  [pdf, other

    cs.LG stat.ML

    Fair Algorithms for Learning in Allocation Problems

    Authors: Hadi Elzayn, Shahin Jabbari, Christopher Jung, Michael Kearns, Seth Neel, Aaron Roth, Zachary Schutzman

    Abstract: Settings such as lending and policing can be modeled by a centralized agent allocating a resource (loans or police officers) amongst several groups, in order to maximize some objective (loans given that are repaid or criminals that are apprehended). Often in such problems fairness is also a concern. A natural notion of fairness, based on general principles of equality of opportunity, asks that con… ▽ More

    Submitted 14 November, 2018; v1 submitted 30 August, 2018; originally announced August 2018.

    Comments: The short version of this paper appears in the proceedings of ACM FAT*-19

  32. arXiv:1808.08166  [pdf, other

    cs.LG stat.ML

    An Empirical Study of Rich Subgroup Fairness for Machine Learning

    Authors: Michael Kearns, Seth Neel, Aaron Roth, Zhiwei Steven Wu

    Abstract: Kearns et al. [2018] recently proposed a notion of rich subgroup fairness intended to bridge the gap between statistical and individual notions of fairness. Rich subgroup fairness picks a statistical fairness constraint (say, equalizing false positive rates across protected groups), but then asks that this constraint hold over an exponentially or infinitely large collection of subgroups defined by… ▽ More

    Submitted 24 August, 2018; originally announced August 2018.

  33. arXiv:1802.06936  [pdf, ps, other

    cs.LG

    Online Learning with an Unknown Fairness Metric

    Authors: Stephen Gillen, Christopher Jung, Michael Kearns, Aaron Roth

    Abstract: We consider the problem of online learning in the linear contextual bandits setting, but in which there are also strong individual fairness constraints governed by an unknown similarity metric. These constraints demand that we select similar actions or individuals with approximately equal probability (arXiv:1104.3913), which may be at odds with optimizing reward, thus modeling settings where profi… ▽ More

    Submitted 17 September, 2018; v1 submitted 19 February, 2018; originally announced February 2018.

  34. arXiv:1711.05144  [pdf, other

    cs.LG cs.DS cs.GT

    Preventing Fairness Gerrymandering: Auditing and Learning for Subgroup Fairness

    Authors: Michael Kearns, Seth Neel, Aaron Roth, Zhiwei Steven Wu

    Abstract: The most prevalent notions of fairness in machine learning are statistical definitions: they fix a small collection of pre-defined groups, and then ask for parity of some statistic of the classifier across these groups. Constraints of this form are susceptible to intentional or inadvertent "fairness gerrymandering", in which a classifier appears to be fair on each individual group, but badly viola… ▽ More

    Submitted 3 December, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: Added new experimental results and a slightly modified fairness definition

  35. arXiv:1706.02409  [pdf, other

    cs.LG stat.ML

    A Convex Framework for Fair Regression

    Authors: Richard Berk, Hoda Heidari, Shahin Jabbari, Matthew Joseph, Michael Kearns, Jamie Morgenstern, Seth Neel, Aaron Roth

    Abstract: We introduce a flexible family of fairness regularizers for (linear and logistic) regression problems. These regularizers all enjoy convexity, permitting fast optimization, and they span the rang from notions of group fairness to strong individual fairness. By varying the weight on the fairness regularizer, we can compute the efficient frontier of the accuracy-fairness trade-off on any given datas… ▽ More

    Submitted 7 June, 2017; originally announced June 2017.

  36. arXiv:1705.02321  [pdf, ps, other

    cs.GT

    Fairness Incentives for Myopic Agents

    Authors: Sampath Kannan, Michael Kearns, Jamie Morgenstern, Mallesh Pai, Aaron Roth, Rakesh Vohra, Z. Steven Wu

    Abstract: We consider settings in which we wish to incentivize myopic agents (such as Airbnb landlords, who may emphasize short-term profits and property safety) to treat arriving clients fairly, in order to prevent overall discrimination against individuals or groups. We model such settings in both classical and contextual bandit models in which the myopic agents maximize rewards according to current empir… ▽ More

    Submitted 5 May, 2017; originally announced May 2017.

  37. arXiv:1611.03071  [pdf, other

    cs.LG

    Fairness in Reinforcement Learning

    Authors: Shahin Jabbari, Matthew Joseph, Michael Kearns, Jamie Morgenstern, Aaron Roth

    Abstract: We initiate the study of fairness in reinforcement learning, where the actions of a learning algorithm may affect its environment and future rewards. Our fairness constraint requires that an algorithm never prefers one action over another if the long-term (discounted) reward of choosing the latter action is higher. Our first result is negative: despite the fact that fairness is consistent with the… ▽ More

    Submitted 5 August, 2017; v1 submitted 9 November, 2016; originally announced November 2016.

    Comments: The short version of this paper appears in the proceedings of ICML-17

  38. arXiv:1610.09559  [pdf, other

    cs.LG

    Fair Algorithms for Infinite and Contextual Bandits

    Authors: Matthew Joseph, Michael Kearns, Jamie Morgenstern, Seth Neel, Aaron Roth

    Abstract: We study fairness in linear bandit problems. Starting from the notion of meritocratic fairness introduced in Joseph et al. [2016], we carry out a more refined analysis of a more general problem, achieving better performance guarantees with fewer modelling assumptions on the number and structure of available choices as well as the number selected. We also analyze the previously-unstudied question o… ▽ More

    Submitted 29 June, 2017; v1 submitted 29 October, 2016; originally announced October 2016.

  39. arXiv:1606.01275  [pdf, ps, other

    cs.DS cs.LG

    Predicting with Distributions

    Authors: Michael Kearns, Zhiwei Steven Wu

    Abstract: We consider a new learning model in which a joint distribution over vector pairs $(x,y)$ is determined by an unknown function $c(x)$ that maps input vectors $x$ not to individual outputs, but to entire {\em distributions\/} over output vectors $y$. Our main results take the form of rather general reductions from our model to algorithms for PAC learning the function class and the distribution class… ▽ More

    Submitted 9 June, 2017; v1 submitted 3 June, 2016; originally announced June 2016.

  40. arXiv:1605.07139  [pdf, other

    cs.LG stat.ML

    Fairness in Learning: Classic and Contextual Bandits

    Authors: Matthew Joseph, Michael Kearns, Jamie Morgenstern, Aaron Roth

    Abstract: We introduce the study of fairness in multi-armed bandit problems. Our fairness definition can be interpreted as demanding that given a pool of applicants (say, for college admission or mortgages), a worse applicant is never favored over a better one, despite a learning algorithm's uncertainty over the true payoffs. We prove results of two types. First, in the important special case of the class… ▽ More

    Submitted 7 November, 2016; v1 submitted 23 May, 2016; originally announced May 2016.

    Comments: A condensed version of this work appears in the 30th Annual Conference on Neural Information Processing Systems (NIPS), 2016

  41. arXiv:1512.02698  [pdf, ps, other

    cs.GT

    Robust Mediators in Large Games

    Authors: Michael Kearns, Mallesh M. Pai, Ryan Rogers, Aaron Roth, Jonathan Ullman

    Abstract: A mediator is a mechanism that can only suggest actions to players, as a function of all agents' reported types, in a given game of incomplete information. We study what is achievable by two kinds of mediators, "strong" and "weak." Players can choose to opt-out of using a strong mediator but cannot misrepresent their type if they opt-in. Such a mediator is "strong" because we can view it as having… ▽ More

    Submitted 10 December, 2015; v1 submitted 8 December, 2015; originally announced December 2015.

    Comments: This work unifies and subsumes the two papers "Mechanism design in large games: incentives and privacy" ITCS'14 (arXiv:1207.4084) and "Asymptotically truthful equilibrium selection in large congestion games" EC '14 (arXiv:1311.2625)

  42. arXiv:1511.05196  [pdf, other

    cs.GT

    Strategic Network Formation with Attack and Immunization

    Authors: Sanjeev Goyal, Shahin Jabbari, Michael Kearns, Sanjeev Khanna, Jamie Morgenstern

    Abstract: Strategic network formation arises where agents receive benefit from connections to other agents, but also incur costs for forming links. We consider a new network formation game that incorporates an adversarial attack, as well as immunization against attack. An agent's benefit is the expected size of her connected component post-attack, and agents may also choose to immunize themselves from attac… ▽ More

    Submitted 9 November, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: The short version of this paper appears in the proceedings of WINE-16

  43. arXiv:1506.00242  [pdf, other

    cs.DS cs.CR cs.CY cs.SI

    Privacy for the Protected (Only)

    Authors: Michael Kearns, Aaron Roth, Zhiwei Steven Wu, Grigory Yaroslavtsev

    Abstract: Motivated by tensions between data privacy for individual citizens, and societal priorities such as counterterrorism and the containment of infectious disease, we introduce a computational model that distinguishes between parties for whom privacy is explicitly protected, and those for whom it is not (the targeted subpopulation). The goal is the development of algorithms that can effectively identi… ▽ More

    Submitted 31 May, 2015; originally announced June 2015.

  44. arXiv:1407.7740  [pdf, other

    cs.DS cs.GT

    Privacy and Truthful Equilibrium Selection for Aggregative Games

    Authors: Rachel Cummings, Michael Kearns, Aaron Roth, Zhiwei Steven Wu

    Abstract: We study a very general class of games --- multi-dimensional aggregative games --- which in particular generalize both anonymous games and weighted congestion games. For any such game that is also large, we solve the equilibrium selection problem in a strong sense. In particular, we give an efficient weak mediator: a mechanism which has only the power to listen to reported types and provide non-bi… ▽ More

    Submitted 24 February, 2015; v1 submitted 29 July, 2014; originally announced July 2014.

  45. arXiv:1407.7294  [pdf, other

    cs.DS cs.GT cs.LG

    Online Learning and Profit Maximization from Revealed Preferences

    Authors: Kareem Amin, Rachel Cummings, Lili Dworkin, Michael Kearns, Aaron Roth

    Abstract: We consider the problem of learning from revealed preferences in an online setting. In our framework, each period a consumer buys an optimal bundle of goods from a merchant according to her (linear) utility function and current prices, subject to a budget constraint. The merchant observes only the purchased goods, and seeks to adapt prices to optimize his profits. We give an efficient algorithm fo… ▽ More

    Submitted 28 November, 2014; v1 submitted 27 July, 2014; originally announced July 2014.

  46. arXiv:1406.4837  [pdf, other

    cs.GT

    A Computational Study of Feasible Repackings in the FCC Incentive Auctions

    Authors: Michael Kearns, Lili Dworkin

    Abstract: We report the results of a computational study of repacking in the FCC Incentive Auctions. Our interest lies in the structure and constraints of the solution space of feasible repackings. Our analyses are "mechanism-free", in the sense that they identify constraints that must hold regardless of the reverse auction mechanism chosen or the prices offered for broadcaster clearing. We examine topics s… ▽ More

    Submitted 18 June, 2014; originally announced June 2014.

  47. arXiv:1308.3822  [pdf

    cs.DS

    Sublinear Matching With Finite Automata Using Reverse Suffix Scanning

    Authors: Steven M. Kearns

    Abstract: We give algorithms to accelerate the computation of deterministic finite automata (DFA) by calculating the state of a DFA n positions ahead utilizing a reverse scan of the next n characters. Often this requires scanning fewer than n characters resulting in a fraction of the input being skipped and a commensurate increase in processing speed. The skipped fraction is > 80% in several of our examples… ▽ More

    Submitted 15 January, 2015; v1 submitted 17 August, 2013; originally announced August 2013.

    Comments: This version of the paper is a streamlined presentation that includes the definition of Offsetting Finite Automata, which replaces the name Accelerated Finite Automata in previous version

  48. arXiv:1302.1552  [pdf

    cs.LG stat.ML

    An Information-Theoretic Analysis of Hard and Soft Assignment Methods for Clustering

    Authors: Michael Kearns, Yishay Mansour, Andrew Y. Ng

    Abstract: Assignment methods are at the heart of many algorithms for unsupervised learning and clustering - in particular, the well-known K-means and Expectation-Maximization (EM) algorithms. In this work, we study several different methods of assignment, including the "hard" assignments used by K-means and the ?soft' assignments used by EM. While it is known that K-means minimizes the distortion on the da… ▽ More

    Submitted 6 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

    Report number: UAI-P-1997-PG-282-293

  49. arXiv:1301.7392  [pdf

    cs.LG stat.ML

    Large Deviation Methods for Approximate Probabilistic Inference

    Authors: Michael Kearns, Lawrence Saul

    Abstract: We study two-layer belief networks of binary random variables in which the conditional probabilities Pr[childlparents] depend monotonically on weighted sums of the parents. In large networks where exact probabilistic inference is intractable, we show how to compute upper and lower bounds on many probabilities of interest. In particular, using methods from large deviation theory, we derive rigorous… ▽ More

    Submitted 30 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

    Report number: UAI-P-1998-PG-311-319

  50. arXiv:1301.7391  [pdf

    cs.AI

    Exact Inference of Hidden Structure from Sample Data in Noisy-OR Networks

    Authors: Michael Kearns, Yishay Mansour

    Abstract: In the literature on graphical models, there has been increased attention paid to the problems of learning hidden structure (see Heckerman [H96] for survey) and causal mechanisms from sample data [H96, P88, S93, P95, F98]. In most settings we should expect the former to be difficult, and the latter potentially impossible without experimental intervention. In this work, we examine some restricted… ▽ More

    Submitted 30 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

    Report number: UAI-P-1998-PG-304-310