Showing 1–10 of 10 results for author: Jesus, S

  1. arXiv:2405.05809

    cs.LG cs.AI cs.CY

    Aequitas Flow: Streamlining Fair ML Experimentation

    Authors: Sérgio Jesus, Pedro Saleiro, Inês Oliveira e Silva, Beatriz M. Jorge, Rita P. Ribeiro, João Gama, Pedro Bizarro, Rayid Ghani

    Abstract: Aequitas Flow is an open-source framework for end-to-end Fair Machine Learning (ML) experimentation in Python. This package fills the existing integration gaps in other Fair ML packages of complete and accessible experimentation. It provides a pipeline for fairness-aware model training, hyperparameter optimization, and evaluation, enabling rapid and simple experiments and result analysis. Aimed at…

    Submitted 9 May, 2024; originally announced May 2024.

  2. arXiv:2403.06906

    cs.LG cs.AI

    Cost-Sensitive Learning to Defer to Multiple Experts with Workload Constraints

    Authors: Jean V. Alves, Diogo Leitão, Sérgio Jesus, Marco O. P. Sampaio, Javier Liébana, Pedro Saleiro, Mário A. T. Figueiredo, Pedro Bizarro

    Abstract: Learning to defer (L2D) aims to improve human-AI collaboration systems by learning how to defer decisions to humans when they are more likely to be correct than an ML classifier. Existing research in L2D overlooks key aspects of real-world systems that impede its practical adoption, namely: i) neglecting cost-sensitive scenarios, where type 1 and type 2 errors have different costs; ii) requiring c…

    Submitted 21 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  3. arXiv:2312.13218

    cs.LG cs.AI

    FiFAR: A Fraud Detection Dataset for Learning to Defer

    Authors: Jean V. Alves, Diogo Leitão, Sérgio Jesus, Marco O. P. Sampaio, Pedro Saleiro, Mário A. T. Figueiredo, Pedro Bizarro

    Abstract: Public dataset limitations have significantly hindered the development and benchmarking of learning to defer (L2D) algorithms, which aim to optimally combine human and AI capabilities in hybrid decision-making systems. In such systems, human availability and domain-specific concerns introduce difficulties, while obtaining human predictions for training and evaluation is costly. Financial fraud det…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: The public dataset and detailed synthetic expert information are available at: https://github.com/feedzai/fifar-dataset

  4. arXiv:2310.04269

    cs.SE

    Detecting Semantic Conflicts using Static Analysis

    Authors: Galileu Santos de Jesus, Paulo Borba, Rodrigo Bonifácio, Matheus Barbosa de Oliveira

    Abstract: Version control system tools empower developers to independently work on their development tasks. These tools also facilitate the integration of changes through merging operations, and report textual conflicts. However, when developers integrate their changes, they might encounter other types of conflicts that are not detected by current merge tools. In this paper, we focus on dynamic semantic con…

    Submitted 6 October, 2023; originally announced October 2023.

  5. arXiv:2302.07444

    cs.LG cs.HC

    A Case Study on Designing Evaluations of ML Explanations with Simulated User Studies

    Authors: Ada Martin, Valerie Chen, Sérgio Jesus, Pedro Saleiro

    Abstract: When conducting user studies to ascertain the usefulness of model explanations in aiding human decision-making, it is important to use real-world use cases, data, and users. However, this process can be resource-intensive, allowing only a limited number of explanation methods to be evaluated. Simulated user evaluations (SimEvals), which use machine learning models as a proxy for human users, have…

    Submitted 20 March, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 9 pages, 2 figures. Will appear in ICLR 2023's TrustML-(un)Limited workshop

  6. arXiv:2211.13358

    cs.LG

    Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation

    Authors: Sérgio Jesus, José Pombal, Duarte Alves, André Cruz, Pedro Saleiro, Rita P. Ribeiro, João Gama, Pedro Bizarro

    Abstract: Evaluating new techniques on realistic datasets plays a crucial role in the development of ML research and its broader adoption by practitioners. In recent years, there has been a significant increase of publicly available unstructured data resources for computer vision and NLP tasks. However, tabular data -- which is prevalent in many high-stakes domains -- has been lagging behind. To bridge this…

    Submitted 28 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted at NeurIPS 2022. https://openreview.net/forum?id=UrAYT2QwOX8

  7. arXiv:2209.07850

    cs.LG cs.AI cs.CY

    FairGBM: Gradient Boosting with Fairness Constraints

    Authors: André F Cruz, Catarina Belém, Sérgio Jesus, João Bravo, Pedro Saleiro, Pedro Bizarro

    Abstract: Tabular data is prevalent in many high-stakes domains, such as financial services or public policy. Gradient Boosted Decision Trees (GBDT) are popular in these settings due to their scalability, performance, and low training cost. While fairness in these domains is a foremost concern, existing in-processing Fair ML methods are either incompatible with GBDT, or incur significant performance loss…

    Submitted 3 March, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Published as a conference paper at ICLR 2023

  8. arXiv:2206.13503

    cs.LG cs.HC

    On the Importance of Application-Grounded Experimental Design for Evaluating Explainable ML Methods

    Authors: Kasun Amarasinghe, Kit T. Rodolfa, Sérgio Jesus, Valerie Chen, Vladimir Balayan, Pedro Saleiro, Pedro Bizarro, Ameet Talwalkar, Rayid Ghani

    Abstract: Most existing evaluations of explainable machine learning (ML) methods rely on simplifying assumptions or proxies that do not reflect real-world use cases; the handful of more robust evaluations on real-world settings have shortcomings in their design, resulting in limited conclusions of methods' real-world utility. In this work, we seek to bridge this gap by conducting a study that evaluates thre…

    Submitted 21 February, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

  9. How can I choose an explainer? An Application-grounded Evaluation of Post-hoc Explanations

    Authors: Sérgio Jesus, Catarina Belém, Vladimir Balayan, João Bento, Pedro Saleiro, Pedro Bizarro, João Gama

    Abstract: There have been several research works proposing new Explainable AI (XAI) methods designed to generate model explanations having specific properties, or desiderata, such as fidelity, robustness, or human-interpretability. However, explanations are seldom evaluated based on their true practical impact on decision-making tasks. Without that assessment, explanations might be chosen that, in fact, hur…

    Submitted 22 January, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

    Comments: Accepted at FAccT'21, the ACM Conference on Fairness, Accountability, and Transparency

  10. arXiv:1906.12166

    cs.CE math.NA

    A study on Stokes-Brinkman dimensionless model for flow in porous media

    Authors: Anna Caroline Felix Santos de Jesus

    Abstract: In this work we propose a non-dimensionalization approach for the Stokes-Brinkman model for flow in porous media. We study the effect that the resulting dimensionless number, denoted by A and named Anna's number, has on the outflow and on the transition between the Darcy and Stokes regimes.

    Submitted 26 June, 2019; originally announced June 2019.