Skip to main content

Showing 1–18 of 18 results for author: Ghani, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05809  [pdf

    cs.LG cs.AI cs.CY

    Aequitas Flow: Streamlining Fair ML Experimentation

    Authors: Sérgio Jesus, Pedro Saleiro, Inês Oliveira e Silva, Beatriz M. Jorge, Rita P. Ribeiro, João Gama, Pedro Bizarro, Rayid Ghani

    Abstract: Aequitas Flow is an open-source framework for end-to-end Fair Machine Learning (ML) experimentation in Python. This package fills the existing integration gaps in other Fair ML packages of complete and accessible experimentation. It provides a pipeline for fairness-aware model training, hyperparameter optimization, and evaluation, enabling rapid and simple experiments and result analysis. Aimed at… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  2. arXiv:2403.12599  [pdf, other

    cs.CY cs.LG

    Preventing Eviction-Caused Homelessness through ML-Informed Distribution of Rental Assistance

    Authors: Catalina Vajiac, Arun Frey, Joachim Baumann, Abigail Smith, Kasun Amarasinghe, Alice Lai, Kit Rodolfa, Rayid Ghani

    Abstract: Rental assistance programs provide individuals with financial assistance to prevent housing instabilities caused by evictions and avert homelessness. Since these programs operate under resource constraints, they must decide who to prioritize. Typically, funding is distributed by a reactive or first-come-first serve allocation process that does not systematically consider risk of future homelessnes… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Published at AAAI 2024

  3. arXiv:2309.17337  [pdf, other

    cs.LG cs.AI cs.CY

    Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Develo** Practical Guidelines and Tools

    Authors: Emily Black, Rakshit Naidu, Rayid Ghani, Kit T. Rodolfa, Daniel E. Ho, Hoda Heidari

    Abstract: While algorithmic fairness is a thriving area of research, in practice, mitigating issues of bias often gets reduced to enforcing an arbitrarily chosen fairness metric, either by enforcing fairness constraints during the optimization step, post-processing model outputs, or by manipulating the training data. Recent work has called on the ML community to take a more holistic approach to tackle fairn… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: EAAMO'23 (Archival)

  4. arXiv:2306.15994  [pdf, other

    cs.LG cs.CY

    Systematic analysis of the impact of label noise correction on ML Fairness

    Authors: I. Oliveira e Silva, C. Soares, I. Sousa, R. Ghani

    Abstract: Arbitrary, inconsistent, or faulty decision-making raises serious concerns, and preventing unfair models is an increasingly important challenge in Machine Learning. Data often reflect past discriminatory behavior, and models trained on such data may reflect bias on sensitive attributes, such as gender, race, or age. One approach to develo** fair models is to preprocess the training data to remov… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  5. arXiv:2207.05855  [pdf

    cs.CY cs.LG

    A Conceptual Framework for Using Machine Learning to Support Child Welfare Decisions

    Authors: Ka Ho Brian Chor, Kit T. Rodolfa, Rayid Ghani

    Abstract: Human services systems make key decisions that impact individuals in the society. The U.S. child welfare system makes such decisions, from screening-in hotline reports of suspected abuse or neglect for child protective investigations, placing children in foster care, to returning children to permanent home settings. These complex and impactful decisions on children's lives rely on the judgment of… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: 69 pages, 1 table, 5 figures, 1 appendix

    MSC Class: 91C99; 62P25 ACM Class: J.4; K.4.1

  6. arXiv:2206.13503  [pdf, other

    cs.LG cs.HC

    On the Importance of Application-Grounded Experimental Design for Evaluating Explainable ML Methods

    Authors: Kasun Amarasinghe, Kit T. Rodolfa, Sérgio Jesus, Valerie Chen, Vladimir Balayan, Pedro Saleiro, Pedro Bizarro, Ameet Talwalkar, Rayid Ghani

    Abstract: Most existing evaluations of explainable machine learning (ML) methods rely on simplifying assumptions or proxies that do not reflect real-world use cases; the handful of more robust evaluations on real-world settings have shortcomings in their design, resulting in limited conclusions of methods' real-world utility. In this work, we seek to bridge this gap by conducting a study that evaluates thre… ▽ More

    Submitted 21 February, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

  7. arXiv:2203.01363  [pdf, other

    cs.LG stat.AP

    Faking feature importance: A cautionary tale on the use of differentially-private synthetic data

    Authors: Oscar Giles, Kasra Hosseini, Grigorios Mingas, Oliver Strickson, Louise Bowler, Camila Rangel Smith, Harrison Wilde, Jen Ning Lim, Bilal Mateen, Kasun Amarasinghe, Rayid Ghani, Alison Heppenstall, Nik Lomax, Nick Malleson, Martin O'Reilly, Sebastian Vollmerteke

    Abstract: Synthetic datasets are often presented as a silver-bullet solution to the problem of privacy-preserving data publishing. However, for many applications, synthetic data has been shown to have limited utility when used to train predictive models. One promising potential application of these data is in the exploratory phase of the machine learning workflow, which involves understanding, engineering a… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 27 pages, 8 figures

  8. arXiv:2105.06442  [pdf, other

    cs.LG cs.CY

    An Empirical Comparison of Bias Reduction Methods on Real-World Problems in High-Stakes Policy Settings

    Authors: Hemank Lamba, Kit T. Rodolfa, Rayid Ghani

    Abstract: Applications of machine learning (ML) to high-stakes policy settings -- such as education, criminal justice, healthcare, and social service delivery -- have grown rapidly in recent years, sparking important conversations about how to ensure fair outcomes from these systems. The machine learning research community has responded to this challenge with a wide array of proposed fairness-enhancing stra… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

    Comments: 17 pages, 9 figures, 2 tables

  9. Empirical observation of negligible fairness-accuracy trade-offs in machine learning for public policy

    Authors: Kit T. Rodolfa, Hemank Lamba, Rayid Ghani

    Abstract: Growing use of machine learning in policy and social impact settings have raised concerns for fairness implications, especially for racial minorities. These concerns have generated considerable interest among machine learning and artificial intelligence researchers, who have developed new methods and established theoretical bounds for improving fairness, focusing on the source data, regularization… ▽ More

    Submitted 3 September, 2021; v1 submitted 5 December, 2020; originally announced December 2020.

    Comments: 40 pages, 4 figures, 2 tables, 7 supplementary figures, 4 supplementary tables; revised to improve clarity and discussion

    Journal ref: Nat Mach Intell 3, 896-904 (2021)

  10. Explainable Machine Learning for Public Policy: Use Cases, Gaps, and Research Directions

    Authors: Kasun Amarasinghe, Kit Rodolfa, Hemank Lamba, Rayid Ghani

    Abstract: Explainability is highly-desired in Machine Learning (ML) systems supporting high-stakes policy decisions in areas such as health, criminal justice, education, and employment. While the field of explainable ML has expanded in recent years, much of this work has not taken real-world needs into account. A majority of proposed methods are designed with \textit{generic} explainability goals without we… ▽ More

    Submitted 16 February, 2023; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: In press at Data & Policy

    Journal ref: Data & Policy , Volume 5 , 2023 , e5

  11. Map** New Informal Settlements using Machine Learning and Time Series Satellite Images: An Application in the Venezuelan Migration Crisis

    Authors: Isabelle Tingzon, Niccolo Dejito, Ren Avell Flores, Rodolfo De Guzman, Liliana Carvajal, Katerine Zapata Erazo, Ivan Enrique Contreras Cala, Jeffrey Villaveces, Daniela Rubio, Rayid Ghani

    Abstract: Since 2014, nearly 2 million Venezuelans have fled to Colombia to escape an economically devastated country during what is one of the largest humanitarian crises in modern history. Non-government organizations and local government units are faced with the challenge of identifying, assessing, and monitoring rapidly growing migrant communities in order to provide urgent humanitarian aid. However, wi… ▽ More

    Submitted 15 December, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

  12. arXiv:2008.11707  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Bandit Data-Driven Optimization

    Authors: Zheyuan Ryan Shi, Zhiwei Steven Wu, Rayid Ghani, Fei Fang

    Abstract: Applications of machine learning in the non-profit and public sectors often feature an iterative workflow of data acquisition, prediction, and optimization of interventions. There are four major pain points that a machine learning pipeline must overcome in order to be actually useful in these settings: small data, data collected only under the default intervention, unmodeled objectives due to comm… ▽ More

    Submitted 14 January, 2022; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: This is the complete version of the paper. A version of this paper is also published at AAAI-22

  13. arXiv:2006.04944  [pdf, other

    cs.CY stat.ML

    A Machine Learning System for Retaining Patients in HIV Care

    Authors: Avishek Kumar, Arthi Ramachandran, Adolfo De Unanue, Christina Sung, Joe Walsh, John Schneider, Jessica Ridgway, Stephanie Masiello Schuette, Jeff Lauritsen, Rayid Ghani

    Abstract: Retaining persons living with HIV (PLWH) in medical care is paramount to preventing new transmissions of the virus and allowing PLWH to live normal and healthy lifespans. Maintaining regular appointments with an HIV provider and taking medication daily for a lifetime is exceedingly difficult. 51% of PLWH are non-adherent with their medications and eventually drop out of medical care. Current metho… ▽ More

    Submitted 31 May, 2020; originally announced June 2020.

  14. Case Study: Predictive Fairness to Reduce Misdemeanor Recidivism Through Social Service Interventions

    Authors: Kit T. Rodolfa, Erika Salomon, Lauren Haynes, Ivan Higuera Mendieta, Jamie Larson, Rayid Ghani

    Abstract: The criminal justice system is currently ill-equipped to improve outcomes of individuals who cycle in and out of the system with a series of misdemeanor offenses. Often due to constraints of caseload and poor record linkage, prior interactions with an individual may not be considered when an individual comes back into the system, let alone in a proactive manner through the application of diversion… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

    Comments: 12 pages, 4 figures, 1 algorithm. The definitive Version of Record will be published in the proceedings of the Conference on Fairness, Accountability, and Transparency (FAT* '20), January 27-30, 2020, Barcelona, Spain

    ACM Class: K.4.1; K.4.2; K.5.0

  15. A Clinical Approach to Training Effective Data Scientists

    Authors: Kit T Rodolfa, Adolfo De Unanue, Matt Gee, Rayid Ghani

    Abstract: Like medicine, psychology, or education, data science is fundamentally an applied discipline, with most students who receive advanced degrees in the field going on to work on practical problems. Unlike these disciplines, however, data science education remains heavily focused on theory and methods, and practical coursework typically revolves around cleaned or simplified data sets that have little… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: 18 pages, 3 figures, 2 tables

    Journal ref: Big Data 7:4, 249-261 (2019)

  16. arXiv:1901.05406  [pdf

    cs.CY

    Artificial Intelligence for Social Good

    Authors: Gregory D. Hager, Ann Drobnis, Fei Fang, Rayid Ghani, Amy Greenwald, Terah Lyons, David C. Parkes, Jason Schultz, Suchi Saria, Stephen F. Smith, Milind Tambe

    Abstract: The Computing Community Consortium (CCC), along with the White House Office of Science and Technology Policy (OSTP), and the Association for the Advancement of Artificial Intelligence (AAAI), co-sponsored a public workshop on Artificial Intelligence for Social Good on June 7th, 2016 in Washington, DC. This was one of five workshops that OSTP co-sponsored and held around the country to spur public… ▽ More

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: A Computing Community Consortium (CCC) workshop report, 22 pages

    Report number: ccc2016report_1

  17. arXiv:1812.10404  [pdf

    cs.CY cs.LG stat.AP stat.ML

    Machine learning and AI research for Patient Benefit: 20 Critical Questions on Transparency, Replicability, Ethics and Effectiveness

    Authors: Sebastian Vollmer, Bilal A. Mateen, Gergo Bohner, Franz J Király, Rayid Ghani, Pall Jonsson, Sarah Cumbers, Adrian Jonas, Katherine S. L. McAllister, Puja Myles, David Granger, Mark Birse, Richard Branson, Karel GM Moons, Gary S Collins, John P. A. Ioannidis, Chris Holmes, Harry Hemingway

    Abstract: Machine learning (ML), artificial intelligence (AI) and other modern statistical methods are providing new opportunities to operationalize previously untapped and rapidly growing sources of data for patient benefit. Whilst there is a lot of promising research currently being undertaken, the literature as a whole lacks: transparency; clear reporting to facilitate replicability; exploration for pote… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: 25 pages, 2 boxes, 1 figure

    MSC Class: 68T01

  18. arXiv:1811.05577  [pdf, other

    cs.LG cs.AI cs.CY

    Aequitas: A Bias and Fairness Audit Toolkit

    Authors: Pedro Saleiro, Benedict Kuester, Loren Hinkson, Jesse London, Abby Stevens, Ari Anisfeld, Kit T. Rodolfa, Rayid Ghani

    Abstract: Recent work has raised concerns on the risk of unintended bias in AI systems being used nowadays that can affect individuals unfairly based on race, gender or religion, among other possible characteristics. While a lot of bias metrics and fairness definitions have been proposed in recent years, there is no consensus on which metric/definition should be used and there are very few available resourc… ▽ More

    Submitted 29 April, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: Aequitas website: http://dsapp.uchicago.edu/aequitas