Skip to main content

Showing 1–13 of 13 results for author: Mougan, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.09145  [pdf, other

    cs.LG stat.ML

    Model Agnostic Explainable Selective Regression via Uncertainty Estimation

    Authors: Andrea Pugnana, Carlos Mougan, Dan Saattrup Nielsen

    Abstract: With the wide adoption of machine learning techniques, requirements have evolved beyond sheer high performance, often requiring models to be trustworthy. A common approach to increase the trustworthiness of such systems is to allow them to refrain from predicting. Such a framework is known as selective prediction. While selective prediction for classification tasks has been widely analyzed, the pr… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  2. arXiv:2311.05227  [pdf, ps, other

    cs.AI

    Kantian Deontology Meets AI Alignment: Towards Morally Grounded Fairness Metrics

    Authors: Carlos Mougan, Joshua Brand

    Abstract: Deontological ethics, specifically understood through Immanuel Kant, provides a moral framework that emphasizes the importance of duties and principles, rather than the consequences of action. Understanding that despite the prominence of deontology, it is currently an overlooked approach in fairness metrics, this paper explores the compatibility of a Kantian deontological framework in fairness met… ▽ More

    Submitted 26 February, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

  3. arXiv:2309.09770  [pdf, other

    cs.AI

    How to Data in Datathons

    Authors: Carlos Mougan, Richard Plant, Clare Teng, Marya Bazzi, Alvaro Cabrejas-Egea, Ryan Sze-Yin Chan, David Salvador Jasin, Martin Stoffel, Kirstie Jane Whitaker, Jules Manser

    Abstract: The rise of datathons, also known as data or data science hackathons, has provided a platform to collaborate, learn, and innovate in a short timeframe. Despite their significant potential benefits, organizations often struggle to effectively work with data due to a lack of clear guidelines and best practices for potential issues that might arise. Drawing on our own experiences and insights from or… ▽ More

    Submitted 25 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmark

  4. arXiv:2308.02033  [pdf, ps, other

    cs.CY cs.AI

    AI and the EU Digital Markets Act: Addressing the Risks of Bigness in Generative AI

    Authors: Ayse Gizem Yasar, Andrew Chong, Evan Dong, Thomas Krendl Gilbert, Sarah Hladikova, Roland Maio, Carlos Mougan, Xudong Shen, Shubham Singh, Ana-Andreea Stoica, Savannah Thais, Miri Zilka

    Abstract: As AI technology advances rapidly, concerns over the risks of bigness in digital markets are also growing. The EU's Digital Markets Act (DMA) aims to address these risks. Still, the current framework may not adequately cover generative AI systems that could become gateways for AI-based services. This paper argues for integrating certain AI software as core platform services and classifying certain… ▽ More

    Submitted 7 July, 2023; originally announced August 2023.

    Comments: ICML'23 Workshop Generative AI + Law (GenLaw)

  5. arXiv:2304.06030  [pdf, ps, other

    cs.CY

    The Role of Large Language Models in the Recognition of Territorial Sovereignty: An Analysis of the Construction of Legitimacy

    Authors: Francisco Castillo-Eslava, Carlos Mougan, Alejandro Romero-Reche, Steffen Staab

    Abstract: We examine the potential impact of Large Language Models (LLM) on the recognition of territorial sovereignty and its legitimization. We argue that while technology tools, such as Google Maps and Large Language Models (LLM) like OpenAI's ChatGPT, are often perceived as impartial and objective, this perception is flawed, as AI algorithms reflect the biases of their designers or the data they are bui… ▽ More

    Submitted 18 April, 2023; v1 submitted 17 March, 2023; originally announced April 2023.

    Comments: European Workshop of Algorithmic Fairness'23

  6. arXiv:2303.08081  [pdf, other

    cs.LG stat.ML

    Explanation Shift: How Did the Distribution Shift Impact the Model?

    Authors: Carlos Mougan, Klaus Broelemann, David Masip, Gjergji Kasneci, Thanassis Thiropanis, Steffen Staab

    Abstract: As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions or model prediction distributions and try to understand issues regarding the interactions between learned models and shifting distributions. We suggest a novel… ▽ More

    Submitted 7 September, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.12369

  7. arXiv:2303.08040  [pdf, other

    cs.LG cs.CY

    Beyond Demographic Parity: Redefining Equal Treatment

    Authors: Carlos Mougan, Laura State, Antonio Ferrara, Salvatore Ruggieri, Steffen Staab

    Abstract: Liberalism-oriented political philosophy reasons that all individuals should be treated equally independently of their protected characteristics. Related work in machine learning has translated the concept of \emph{equal treatment} into terms of \emph{equal outcome} and measured it as \emph{demographic parity} (also called \emph{statistical parity}). Our analysis reveals that the two concepts of e… ▽ More

    Submitted 1 October, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  8. arXiv:2210.12369  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Explanation Shift: Detecting distribution shifts on tabular data via the explanation space

    Authors: Carlos Mougan, Klaus Broelemann, Gjergji Kasneci, Thanassis Tiropanis, Steffen Staab

    Abstract: As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In the past, predictive performance was considered the key indicator to monitor. However, explanation aspects have come to attention within the last years. In this work, we investigate how model predictive performance and model explanation characteristics are affected under distribution… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Neural Information Processing Systems (NeurIPS 2022). Workshop on Distribution Shifts: Connecting Methods and Applications

  9. arXiv:2202.03212  [pdf, other

    cs.LG stat.AP stat.ML

    Introducing explainable supervised machine learning into interactive feedback loops for statistical production system

    Authors: Carlos Mougan, George Kanellos, Johannes Micheler, Jose Martinez, Thomas Gottron

    Abstract: Statistical production systems cover multiple steps from the collection, aggregation, and integration of data to tasks like data quality assurance and dissemination. While the context of data quality assurance is one of the most promising fields for applying machine learning, the lack of curated and labeled training data is often a limiting factor. The statistical production system for the Centr… ▽ More

    Submitted 18 February, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Irving Fisher Committee (IFC) - Bank of Italy workshop on Data science in central banking: Applications and tools. arXiv admin note: text overlap with arXiv:2107.08045

  10. arXiv:2201.11676  [pdf, other

    cs.LG stat.ML

    Monitoring Model Deterioration with Explainable Uncertainty Estimation via Non-parametric Bootstrap

    Authors: Carlos Mougan, Dan Saattrup Nielsen

    Abstract: Monitoring machine learning models once they are deployed is challenging. It is even more challenging to decide when to retrain models in real-case scenarios when labeled data is beyond reach, and monitoring performance metrics becomes unfeasible. In this work, we use non-parametric bootstrapped uncertainty estimates and SHAP values to provide explainable uncertainty estimation as a technique that… ▽ More

    Submitted 22 November, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 7+6 pages. Accepted at AAAI'23 Safe and Robust AI track

  11. arXiv:2201.11358  [pdf, other

    cs.LG cs.CY cs.DS stat.ML

    Fairness Implications of Encoding Protected Categorical Attributes

    Authors: Carlos Mougan, Jose M. Alvarez, Salvatore Ruggieri, Steffen Staab

    Abstract: Past research has demonstrated that the explicit use of protected attributes in machine learning can improve both performance and fairness. Many machine learning algorithms, however, cannot directly process categorical attributes, such as country of birth or ethnicity. Because protected attributes frequently are categorical, they must be encoded as features that can be input to a chosen machine le… ▽ More

    Submitted 5 May, 2023; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: AIES'23 6th AAAI/ACM Conference on AI, Ethics, and Society 22 pages

  12. arXiv:2107.08045  [pdf, other

    cs.CY cs.AI cs.LG

    Desiderata for Explainable AI in statistical production systems of the European Central Bank

    Authors: Carlos Mougan, Georgios Kanellos, Thomas Gottron

    Abstract: Explainable AI constitutes a fundamental step towards establishing fairness and addressing bias in algorithmic decision-making. Despite the large body of work on the topic, the benefit of solutions is mostly evaluated from a conceptual or theoretical point of view and the usefulness for real-world use cases remains uncertain. In this work, we aim to state clear user-centric desiderata for explaina… ▽ More

    Submitted 12 February, 2022; v1 submitted 18 July, 2021; originally announced July 2021.

    Comments: European Congress of Machine Learning (ECMLPKDD) - 2nd Workshop on bias and fairness in AI

    ACM Class: I.2

  13. arXiv:2105.13783  [pdf, other

    cs.LG

    Quantile Encoder: Tackling High Cardinality Categorical Features in Regression Problems

    Authors: Carlos Mougan, David Masip, Jordi Nin, Oriol Pujol

    Abstract: Regression problems have been widely studied in machinelearning literature resulting in a plethora of regression models and performance measures. However, there are few techniques specially dedicated to solve the problem of how to incorporate categorical features to regression problems. Usually, categorical feature encoders are general enough to cover both classification and regression problems. T… ▽ More

    Submitted 4 July, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted at The 18th International Conference on Modeling Decisions for Artificial Intelligence (MDAI)

    MSC Class: 68T05 ACM Class: I.2.6