Skip to main content

Showing 1–17 of 17 results for author: Baldini, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05918  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Why Don't Prompt-Based Fairness Metrics Correlate?

    Authors: Abdelrahman Zayed, Goncalo Mordido, Ioana Baldini, Sarath Chandar

    Abstract: The widespread use of large language models has brought up essential questions about the potential biases these models might learn. This led to the development of several metrics aimed at evaluating and mitigating these biases. In this paper, we first demonstrate that prompt-based fairness metrics exhibit poor agreement, as measured by correlation, raising important questions about the reliability… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: In Proceedings of ACL main 2024

  2. arXiv:2403.09704  [pdf, other

    cs.CL cs.AI cs.LG

    Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations

    Authors: Swapnaja Achintalwar, Ioana Baldini, Djallel Bouneffouf, Joan Byamugisha, Maria Chang, Pierre Dognin, Eitan Farchi, Ndivhuwo Makondo, Aleksandra Mojsilovic, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Inkit Padhi, Orna Raz, Jesus Rios, Prasanna Sattigeri, Moninder Singh, Siphiwe Thwala, Rosario A. Uceda-Sosa, Kush R. Varshney

    Abstract: The alignment of large language models is usually done by model providers to add or control behaviors that are common or universally understood across use cases and contexts. In contrast, in this article, we present an approach and architecture that empowers application developers to tune a model to their particular values, social norms, laws and other regulations, and orchestrate between potentia… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures

  3. arXiv:2403.06009  [pdf, other

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  4. arXiv:2312.15398  [pdf, other

    cs.CL cs.CY cs.LG

    Fairness-Aware Structured Pruning in Transformers

    Authors: Abdelrahman Zayed, Goncalo Mordido, Samira Shabanian, Ioana Baldini, Sarath Chandar

    Abstract: The increasing size of large language models (LLMs) has introduced challenges in their training and inference. Removing model components is perceived as a solution to tackle the large model sizes, however, existing pruning methods solely focus on performance, without considering an essential aspect for the responsible use of LLMs: model fairness. It is crucial to address the fairness of LLMs towar… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: In Proceedings of AAAI 2024

  5. arXiv:2312.07492  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models

    Authors: Manish Nagireddy, Lamogha Chiazor, Moninder Singh, Ioana Baldini

    Abstract: Current datasets for unwanted social bias auditing are limited to studying protected demographic features such as race and gender. In this work, we introduce a comprehensive benchmark that is meant to capture the amplification of social bias, via stigmas, in generative language models. Taking inspiration from social science research, we start with a documented list of 93 US-centric stigmas and cur… ▽ More

    Submitted 27 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  6. arXiv:2311.09443  [pdf, other

    cs.CL

    Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset

    Authors: Brooklyn Sheppard, Anna Richter, Allison Cohen, Elizabeth Allyn Smith, Tamara Kneese, Carolyne Pelletier, Ioana Baldini, Yue Dong

    Abstract: Using novel approaches to dataset development, the Biasly dataset captures the nuance and subtlety of misogyny in ways that are unique within the literature. Built in collaboration with multi-disciplinary experts and annotators themselves, the dataset contains annotations of movie subtitles, capturing colloquial expressions of misogyny in North American film. The dataset can be used for a range of… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures

  7. arXiv:2305.12620  [pdf, other

    cs.CL

    Kee** Up with the Language Models: Robustness-Bias Interplay in NLI Data and Models

    Authors: Ioana Baldini, Chhavi Yadav, Payel Das, Kush R. Varshney

    Abstract: Auditing unwanted social bias in language models (LMs) is inherently hard due to the multidisciplinary nature of the work. In addition, the rapid evolution of LMs can make benchmarks irrelevant in no time. Bias auditing is further complicated by LM brittleness: when a presumably biased outcome is observed, is it due to model bias or model brittleness? We propose enlisting the models themselves to… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

  8. Write It Like You See It: Detectable Differences in Clinical Notes By Race Lead To Differential Model Recommendations

    Authors: Hammaad Adam, Ming Ying Yang, Kenrick Cato, Ioana Baldini, Charles Senteio, Leo Anthony Celi, Jiaming Zeng, Moninder Singh, Marzyeh Ghassemi

    Abstract: Clinical notes are becoming an increasingly important data source for machine learning (ML) applications in healthcare. Prior research has shown that deploying ML models can perpetuate existing biases against racial minorities, as bias can be implicitly embedded in data. In this study, we investigate the level of implicit race information available to ML models and human experts and the implicatio… ▽ More

    Submitted 1 November, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

    Journal ref: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society (AIES 2022)

  9. arXiv:2203.04462  [pdf, other

    cs.LG

    Downstream Fairness Caveats with Synthetic Healthcare Data

    Authors: Karan Bhanot, Ioana Baldini, Dennis Wei, Jiaming Zeng, Kristin P. Bennett

    Abstract: This paper evaluates synthetically generated healthcare data for biases and investigates the effect of fairness mitigation techniques on utility-fairness. Privacy laws limit access to health data such as Electronic Medical Records (EMRs) to preserve patient privacy. Albeit essential, these laws hinder research reproducibility. Synthetic data is a viable solution that can enable access to data simi… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

  10. arXiv:2112.03529  [pdf, ps, other

    cs.CL

    Ground-Truth, Whose Truth? -- Examining the Challenges with Annotating Toxic Text Datasets

    Authors: Kofi Arhin, Ioana Baldini, Dennis Wei, Karthikeyan Natesan Ramamurthy, Moninder Singh

    Abstract: The use of machine learning (ML)-based language models (LMs) to monitor content online is on the rise. For toxic text identification, task-specific fine-tuning of these models are performed using datasets labeled by annotators who provide ground-truth labels in an effort to distinguish between offensive and normal content. These projects have led to the development, improvement, and expansion of l… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 15 pages

  11. arXiv:2108.01250  [pdf, other

    cs.CL cs.LG

    Your fairness may vary: Pretrained language model fairness in toxic text classification

    Authors: Ioana Baldini, Dennis Wei, Karthikeyan Natesan Ramamurthy, Mikhail Yurochkin, Moninder Singh

    Abstract: The popularity of pretrained language models in natural language processing systems calls for a careful evaluation of such models in down-stream tasks, which have a higher potential for societal impact. The evaluation of such systems usually focuses on accuracy measures. Our findings in this paper call for attention to be paid to fairness measures as well. Through the analysis of more than a dozen… ▽ More

    Submitted 13 April, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Findings of ACL 2022

  12. arXiv:2106.09502  [pdf, other

    cs.CL cs.LG

    Biomedical Interpretable Entity Representations

    Authors: Diego Garcia-Olano, Yasumasa Onoe, Ioana Baldini, Joydeep Ghosh, Byron C. Wallace, Kush R. Varshney

    Abstract: Pre-trained language models induce dense entity representations that offer strong performance on entity-centric NLP tasks, but such representations are not immediately interpretable. This can be a barrier to model uptake in important domains such as biomedicine. There has been recent work on general interpretable representation learning (Onoe and Durrett, 2020), but these domain-agnostic represent… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted into Findings of ACL-IJCNLP 2021

  13. arXiv:2104.04633  [pdf, other

    cs.CY

    Automated Meta-Analysis: A Causal Learning Perspective

    Authors: Lu Cheng, Dmitriy A. Katz-Rogozhnikov, Kush R. Varshney, Ioana Baldini

    Abstract: Meta-analysis is a systematic approach for understanding a phenomenon by analyzing the results of many previously published experimental studies. It is central to deriving conclusions about the summary effect of treatments and interventions in medicine, poverty alleviation, and other applications with social impact. Unfortunately, meta-analysis involves great human effort, rendering a process that… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: 11 pages, 6 figures

  14. arXiv:1911.07819  [pdf, other

    cs.CL cs.LG stat.ML

    Drug Repurposing for Cancer: An NLP Approach to Identify Low-Cost Therapies

    Authors: Shivashankar Subramanian, Ioana Baldini, Sushma Ravichandran, Dmitriy A. Katz-Rogozhnikov, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Kush R. Varshney, Annmarie Wang, Pradeep Mangalath, Laura B. Kleiman

    Abstract: More than 200 generic drugs approved by the U.S. Food and Drug Administration for non-cancer indications have shown promise for treating cancer. Due to their long history of safe patient use, low cost, and widespread availability, repurposing of generic drugs represents a major opportunity to rapidly improve outcomes for cancer patients and reduce healthcare costs worldwide. Evidence on the effica… ▽ More

    Submitted 5 December, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

  15. arXiv:1909.03486  [pdf, other

    cs.CY cs.AI cs.HC

    How Data Scientists Work Together With Domain Experts in Scientific Collaborations: To Find The Right Answer Or To Ask The Right Question?

    Authors: Yaoli Mao, Dakuo Wang, Michael Muller, Kush R. Varshney, Ioana Baldini, Casey Dugan, AleksandraMojsilović

    Abstract: In recent years there has been an increasing trend in which data scientists and domain experts work together to tackle complex scientific questions. However, such collaborations often face challenges. In this paper, we aim to decipher this collaboration complexity through a semi-structured interview study with 22 interviewees from teams of bio-medical scientists collaborating with data scientists.… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.

  16. arXiv:1807.05691  [pdf, other

    cs.AI cs.SE

    Teaching machines to understand data science code by semantic enrichment of dataflow graphs

    Authors: Evan Patterson, Ioana Baldini, Aleksandra Mojsilovic, Kush R. Varshney

    Abstract: Your computer is continuously executing programs, but does it really understand them? Not in any meaningful sense. That burden falls upon human knowledge workers, who are increasingly asked to write and understand code. They deserve to have intelligent tools that reveal the connections between code and its subject matter. Towards this prospect, we develop an AI system that forms semantic represent… ▽ More

    Submitted 25 January, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

    Comments: 33 pages. Significantly expanded from previous version

  17. arXiv:1706.03178  [pdf, other

    cs.DC

    Serverless Computing: Current Trends and Open Problems

    Authors: Ioana Baldini, Paul Castro, Kerry Chang, Perry Cheng, Stephen Fink, Vatche Ishakian, Nick Mitchell, Vinod Muthusamy, Rodric Rabbah, Aleksander Slominski, Philippe Suter

    Abstract: Serverless computing has emerged as a new compelling paradigm for the deployment of applications and services. It represents an evolution of cloud programming models, abstractions, and platforms, and is a testament to the maturity and wide adoption of cloud technologies. In this chapter, we survey existing serverless platforms from industry, academia, and open source projects, identify key charact… ▽ More

    Submitted 10 June, 2017; originally announced June 2017.