Skip to main content

Showing 1–7 of 7 results for author: Feffer, M

.
  1. arXiv:2405.11083  [pdf, other

    cs.CL cs.LG

    Prompt Exploration with Prompt Regression

    Authors: Michael Feffer, Ronald Xu, Yuekai Sun, Mikhail Yurochkin

    Abstract: In the advent of democratized usage of large language models (LLMs), there is a growing desire to systematize LLM prompt creation and selection processes beyond iterative trial-and-error. Prior works majorly focus on searching the space of prompts without accounting for relations between prompt variations. Here we propose a framework, Prompt Exploration with Prompt Regression (PEPR), to predict… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  2. arXiv:2401.15897  [pdf, other

    cs.CY cs.HC cs.LG

    Red-Teaming for Generative AI: Silver Bullet or Security Theater?

    Authors: Michael Feffer, Anusha Sinha, Wesley Hanwen Deng, Zachary C. Lipton, Hoda Heidari

    Abstract: In response to rising concerns surrounding the safety, security, and trustworthiness of Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red-teaming as a key component of their strategies for identifying and mitigating these risks. However, despite AI red-teaming's central role in policy discussions and corporate messaging, significant questions remain about what… ▽ More

    Submitted 15 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  3. arXiv:2310.06269  [pdf, other

    cs.CY cs.AI cs.HC

    The AI Incident Database as an Educational Tool to Raise Awareness of AI Harms: A Classroom Exploration of Efficacy, Limitations, & Future Improvements

    Authors: Michael Feffer, Nikolas Martelaro, Hoda Heidari

    Abstract: Prior work has established the importance of integrating AI ethics topics into computer and data sciences curricula. We provide evidence suggesting that one of the critical objectives of AI Ethics education must be to raise awareness of AI harms. While there are various sources to learn about such harms, The AI Incident Database (AIID) is one of the few attempts at offering a relatively comprehens… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 37 pages, 11 figures; To appear in the proceedings of EAAMO 2023

  4. arXiv:2308.00133  [pdf, other

    cs.LG cs.AI cs.CY

    A Suite of Fairness Datasets for Tabular Classification

    Authors: Martin Hirzel, Michael Feffer

    Abstract: There have been many papers with algorithms for improving fairness of machine-learning classifiers for tabular data. Unfortunately, most use only very few datasets for their experimental evaluation. We introduce a suite of functions for fetching 20 fairness datasets and providing associated fairness metadata. Hopefully, these will lead to more rigorous experimental evaluations in future fairness-a… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

  5. arXiv:2305.17319  [pdf, other

    cs.CY cs.AI cs.GT

    Moral Machine or Tyranny of the Majority?

    Authors: Michael Feffer, Hoda Heidari, Zachary C. Lipton

    Abstract: With Artificial Intelligence systems increasingly applied in consequential domains, researchers have begun to ask how these systems ought to act in ethically charged situations where even humans lack consensus. In the Moral Machine project, researchers crowdsourced answers to "Trolley Problems" concerning autonomous vehicles. Subsequently, Noothigattu et al. (2018) proposed inferring linear functi… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: To appear in the proceedings of AAAI 2023

  6. arXiv:2210.05594  [pdf, other

    cs.LG cs.CY

    Navigating Ensemble Configurations for Algorithmic Fairness

    Authors: Michael Feffer, Martin Hirzel, Samuel C. Hoffman, Kiran Kate, Parikshit Ram, Avraham Shinnar

    Abstract: Bias mitigators can improve algorithmic fairness in machine learning models, but their effect on fairness is often not stable across data splits. A popular approach to train more stable models is ensemble learning, but unfortunately, it is unclear how to combine ensembles with mitigators to best navigate trade-offs between fairness and predictive performance. To that end, we built an open-source l… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.00751

  7. arXiv:2202.00751  [pdf, other

    cs.LG cs.CY

    An Empirical Study of Modular Bias Mitigators and Ensembles

    Authors: Michael Feffer, Martin Hirzel, Samuel C. Hoffman, Kiran Kate, Parikshit Ram, Avraham Shinnar

    Abstract: There are several bias mitigators that can reduce algorithmic bias in machine learning models but, unfortunately, the effect of mitigators on fairness is often not stable when measured across different data splits. A popular approach to train more stable models is ensemble learning. Ensembles, such as bagging, boosting, voting, or stacking, have been successful at making predictive performance mor… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.