Showing 1–28 of 28 results for author: Lozano, J A

  1. arXiv:2406.19066  [pdf, other]

    cs.LG cs.CY

    Dancing in the Shadows: Harnessing Ambiguity for Fairer Classifiers

    Authors: Ainhize Barrainkua, Paula Gordaliza, Jose A. Lozano, Novi Quadrianto

    Abstract: This paper introduces a novel approach to bolster algorithmic fairness in scenarios where sensitive information is only partially known. In particular, we propose to leverage instances with uncertain identity with regard to the sensitive attribute to train a conventional machine learning classifier. The enhanced fairness observed in the final predictions of this classifier highlights the promisin…

    Submitted 27 June, 2024; originally announced June 2024.

    MSC Class: 68T01; 68T37 ACM Class: A.0; I.2

    Journal ref: Presented at the XI Symposium of Theory and Applications of Data Mining from the XX Conference of the Spanish Association for Artificial Intelligence CAEPIA 2024

  2. arXiv:2403.13740  [pdf, other]

    cs.LG

    Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano, Marta Kwiatkowska

    Abstract: The lack of transparency of Deep Neural Networks continues to be a limitation that severely undermines their reliability and usage in high-stakes applications. Promising approaches to overcome such limitations are Prototype-Based Self-Explainable Neural Networks (PSENNs), whose predictions rely on the similarity between the input at hand and a set of prototypical representations of the output clas…

    Submitted 20 March, 2024; originally announced March 2024.

  3. arXiv:2311.09369  [pdf, other]

    stat.ML cs.CY cs.LG

    Time-dependent Probabilistic Generative Models for Disease Progression

    Authors: Onintze Zaballa, Aritz Pérez, Elisa Gómez-Inhiesto, Teresa Acaiturri-Ayesta, Jose A. Lozano

    Abstract: Electronic health records contain valuable information for monitoring patients' health trajectories over time. Disease progression models have been developed to understand the underlying patterns and dynamics of diseases using these data as sequences. However, analyzing temporal data from EHRs is challenging due to the variability and irregularities present in medical records. We propose a Markovi…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 17 pages

  4. arXiv:2310.15974  [pdf, ps, other]

    stat.ML cs.LG

    Minimax Forward and Backward Learning of Evolving Tasks with Performance Guarantees

    Authors: Verónica Álvarez, Santiago Mazuelas, Jose A. Lozano

    Abstract: For a sequence of classification tasks that arrive over time, it is common that tasks are evolving in the sense that consecutive tasks often have a higher similarity. The incremental learning of a growing sequence of tasks holds promise to enable accurate classification even with few samples per task by leveraging information from all the tasks in the sequence (forward and backward learning). Howe…

    Submitted 24 October, 2023; originally announced October 2023.

  5. arXiv:2303.02801  [pdf, ps, other]

    cs.NE cs.AI

    Neuroevolutionary algorithms driven by neuron coverage metrics for semi-supervised classification

    Authors: Roberto Santana, Ivan Hidalgo-Cenalmor, Unai Garciarena, Alexander Mendiburu, Jose Antonio Lozano

    Abstract: In some machine learning applications the availability of labeled instances for supervised classification is limited while unlabeled instances are abundant. Semi-supervised learning algorithms deal with these scenarios and attempt to exploit the information contained in the unlabeled examples. In this paper, we address the question of how to evolve neural networks for semi-supervised problems. We…

    Submitted 5 March, 2023; originally announced March 2023.

  6. arXiv:2302.01079  [pdf, other]

    cs.LG

    Uncertainty in Fairness Assessment: Maintaining Stable Conclusions Despite Fluctuations

    Authors: Ainhize Barrainkua, Paula Gordaliza, Jose A. Lozano, Novi Quadrianto

    Abstract: Several recent works encourage the use of a Bayesian framework when assessing performance and fairness metrics of a classification algorithm in a supervised setting. We propose the Uncertainty Matters (UM) framework that generalizes a Beta-Binomial approach to derive the posterior distribution of any criteria combination, allowing stable performance assessment in a bias-aware setting. We suggest mo…

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 25 pages (including references and appendix), 10 figures. Submitted to ICML 2023

    MSC Class: 62P99 ACM Class: G.3

  7. arXiv:2211.07530  [pdf, other]

    cs.LG cs.CY

    A Survey on Preserving Fairness Guarantees in Changing Environments

    Authors: Ainhize Barrainkua, Paula Gordaliza, Jose A. Lozano, Novi Quadrianto

    Abstract: Human lives are increasingly being affected by the outcomes of automated decision-making systems, and it is essential for the latter to be not only accurate but also fair. The literature of algorithmic fairness has grown considerably over the last decade, where most of the approaches are evaluated under the strong assumption that the train and test samples are independently and identically drawn…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 29 pages, 6 figures. Submitted to ACM Computing Surveys: Special Issue on Trustworthy AI

    MSC Class: 68-02; 68T05; 68T37; ACM Class: A.1; I.0

  8. arXiv:2205.15942  [pdf, ps, other]

    stat.ML cs.LG

    Minimax Classification under Concept Drift with Multidimensional Adaptation and Performance Guarantees

    Authors: Verónica Álvarez, Santiago Mazuelas, Jose A. Lozano

    Abstract: The statistical characteristics of instance-label pairs often change with time in practical scenarios of supervised classification. Conventional learning techniques adapt to such concept drift accounting for a scalar rate of change by means of a carefully chosen learning rate, forgetting factor, or window size. However, the time changes in common scenarios are multidimensional, i.e., different sta…

    Submitted 31 May, 2022; originally announced May 2022.

  9. arXiv:2205.12943  [pdf, other]

    cs.DM cs.CC

    Transitions from P to NP-hardness: the case of the Linear Ordering Problem

    Authors: Anne Elorza, Leticia Hernando, Jose A. Lozano

    Abstract: In this paper we evaluate how constructive heuristics degrade when a problem transits from P to NP-hard. This is done by means of the linear ordering problem. More specifically, for this problem we prove that the objective function can be expressed as the sum of two objective functions, one of which is associated with a P problem (an exact polynomial time algorithm is proposed to solve it), while…

    Submitted 25 May, 2022; originally announced May 2022.

  10. arXiv:2107.01943  [pdf, other]

    cs.LG cs.CR

    When and How to Fool Explainable Models (and Humans) with Adversarial Examples

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano

    Abstract: Reliable deployment of machine learning models such as neural networks continues to be challenging due to several limitations. Some of the main shortcomings are the lack of interpretability and the lack of robustness against adversarial examples or out-of-distribution inputs. In this exploratory review, we explore the possibilities and limits of adversarial attacks for explainable machine learning…

    Submitted 7 July, 2023; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: Updated version. 43 pages, 9 figures, 4 tables

  11. The EMPATHIC Project: Mid-term Achievements

    Authors: M. I. Torres, J. M. Olaso, C. Montenegro, R. Santana, A. Vázquez, R. Justo, J. A. Lozano, S. Schlögl, G. Chollet, N. Dugan, M. Irvine, N. Glackin, C. Pickard, A. Esposito, G. Cordasco, A. Troncone, D. Petrovska-Delacretaz, A. Mtibaa, M. A. Hmani, M. S. Korsnes, L. J. Martinussen, S. Escalera, C. Palmero Cantariño, O. Deroo, O. Gordeeva, et al. (4 additional authors not shown)

    Abstract: The goal of active aging is to promote changes in the elderly community so as to maintain an active, independent and socially-engaged lifestyle. Technological advancements currently provide the necessary tools to foster and monitor such processes. This paper reports on mid-term achievements of the European H2020 EMPATHIC project, which aims to research, innovate, explore and validate new interacti…

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 12 pages

  12. arXiv:2012.14352  [pdf, other]

    cs.LG

    Analysis of Dominant Classes in Universal Adversarial Perturbations

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano

    Abstract: The reasons why Deep Neural Networks are susceptible to being fooled by adversarial examples remain an open discussion. Indeed, many different strategies can be employed to efficiently generate adversarial attacks, some of them relying on different theoretical justifications. Among these strategies, universal (input-agnostic) perturbations are of particular interest, due to their capability to fo…

    Submitted 11 January, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: 20 pages, 10 figures, 4 tables

  13. Probabilistic Load Forecasting Based on Adaptive Online Learning

    Authors: Verónica Álvarez, Santiago Mazuelas, José A. Lozano

    Abstract: Load forecasting is crucial for multiple energy management tasks such as scheduling generation capacity, planning supply and demand, and minimizing energy trade costs. Such relevance has increased even more in recent years due to the integration of renewable energies, electric cars, and microgrids. Conventional load forecasting techniques obtain single-value load forecasts by exploiting consumptio…

    Submitted 15 January, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  14. arXiv:2011.02743  [pdf, other]

    math.OC cs.DS

    A revisited branch-and-cut algorithm for large-scale orienteering problems

    Authors: Gorka Kobeaga, María Merino, Jose A. Lozano

    Abstract: The orienteering problem is a route optimization problem which consists in finding a simple cycle that maximizes the total collected profit subject to a maximum distance limitation. In the last few decades, the occurrence of this problem in real-life applications has boosted the development of many heuristic algorithms to solve it. However, during the same period, not much research has been devote…

    Submitted 13 January, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

  15. On Solving Cycle Problems with Branch-and-Cut: Extending Shrinking and Exact Subcycle Elimination Separation Algorithms

    Authors: Gorka Kobeaga, María Merino, Jose A. Lozano

    Abstract: In this paper, we extend techniques developed in the context of the Travelling Salesperson Problem for cycle problems. Particularly, we study the shrinking of support graphs and the exact algorithms for subcycle elimination separation problems. The efficient application of the considered techniques has proved to be essential in the Travelling Salesperson Problem when solving large size problems by…

    Submitted 6 September, 2021; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: 23 pages + 3 appendices

    MSC Class: 05C38; 90C10; 90C57

  16. arXiv:2004.06383  [pdf, other]

    cs.LG stat.ML

    Extending Adversarial Attacks to Produce Adversarial Class Probability Distributions

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano

    Abstract: Despite the remarkable performance and generalization levels of deep learning models in a wide range of artificial intelligence tasks, it has been demonstrated that these models can be easily fooled by the addition of imperceptible yet malicious perturbations to natural inputs. These altered inputs are known in the literature as adversarial examples. In this paper, we propose a novel probabilistic…

    Submitted 25 January, 2023; v1 submitted 14 April, 2020; originally announced April 2020.

    Comments: Final version as accepted in JMLR. Attribution requirements are provided at http://jmlr.org/papers/v24/21-0326.html

    Journal ref: Journal of Machine Learning Research, 24(15):1-42, 2023

  17. arXiv:2002.04236  [pdf, other]

    cs.LG stat.ML

    A review on outlier/anomaly detection in time series data

    Authors: Ane Blázquez-García, Angel Conde, Usue Mori, Jose A. Lozano

    Abstract: Recent advances in technology have brought major breakthroughs in data collection, enabling a large amount of data to be gathered over time and thus generating time series. Mining this data has become an important task for researchers and practitioners in the past few years, including the detection of outliers or anomalies that may represent errors or events of interest. This review aims to provid…

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: 32 pages, 21 figures, submitted to ACM Computing Surveys (CSUR)

  18. arXiv:1910.05173  [pdf, other]

    cs.LG stat.ML

    Evolving Gaussian Process kernels from elementary mathematical expressions

    Authors: Ibai Roman, Roberto Santana, Alexander Mendiburu, Jose A. Lozano

    Abstract: Choosing the most adequate kernel is crucial in many Machine Learning applications. Gaussian Process is a state-of-the-art technique for regression and classification that heavily relies on a kernel function. However, in the Gaussian Process literature, kernels have usually been either ad hoc designed, selected from a predefined set, or searched for in a space of compositions of kernels which have…

    Submitted 14 October, 2019; v1 submitted 11 October, 2019; originally announced October 2019.

  19. arXiv:1905.10852  [pdf, ps, other]

    cs.DM math.OC

    Taxonomization of Combinatorial Optimization Problems in Fourier Space

    Authors: Anne Elorza, Leticia Hernando, Jose A. Lozano

    Abstract: We propose and develop a novel framework for analyzing permutation-based combinatorial optimization problems, which could eventually be extended to other types of problems. Our approach is based on the decomposition of the objective functions via the generalized Fourier transform. We characterize the Fourier coefficients of three different problems: the Traveling Salesman Problem, the Linear Order…

    Submitted 26 May, 2019; originally announced May 2019.

    Comments: 42 pages, 3 appendices

  20. arXiv:1904.00977  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Sentiment analysis with genetically evolved Gaussian kernels

    Authors: Ibai Roman, Alexander Mendiburu, Roberto Santana, Jose A. Lozano

    Abstract: Sentiment analysis consists of evaluating opinions or statements from the analysis of text. Among the methods used to estimate the degree to which a text expresses a given sentiment are those based on Gaussian Processes. However, traditional Gaussian Process methods use a predefined kernel with hyperparameters that can be tuned but whose structure cannot be adapted. In this paper, we propose t…

    Submitted 14 October, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

  21. Merge Non-Dominated Sorting Algorithm for Many-Objective Optimization

    Authors: Javier Moreno, Daniel Rodriguez, Antonio Nebro, Jose A. Lozano

    Abstract: Many Pareto-based multi-objective evolutionary algorithms require ranking the solutions of the population in each iteration according to the dominance principle, which can become a costly operation, particularly when dealing with many-objective optimization problems. In this paper, we present a new efficient algorithm for computing the non-dominated sorting procedure, called Merge Non-Domi…

    Submitted 17 September, 2018; originally announced September 2018.

  22. arXiv:1806.04509  [pdf, ps, other]

    stat.ML cs.LG

    A review on distance based time series classification

    Authors: Amaia Abanda, Usue Mori, Jose A. Lozano

    Abstract: Time series classification is a growing research topic due to the vast amount of time series data that are being created over a wide variety of fields. The particularity of the data makes it a challenging task and different approaches have been taken, including the distance based approach. 1-NN has been a widely used method within distance based time series classification due to its simplicity…

    Submitted 12 June, 2018; originally announced June 2018.

  23. arXiv:1801.02949  [pdf, other]

    stat.ML cs.LG

    An efficient K-means clustering algorithm for massive data

    Authors: Marco Capó, Aritz Pérez, Jose A. Lozano

    Abstract: The analysis of continuously larger datasets is a task of major importance in a wide variety of scientific fields. In this sense, cluster analysis algorithms are a key element of exploratory data analysis, due to their ease of implementation and relatively low computational cost. Among these algorithms, the K-means algorithm stands out as the most popular approach, besides its high depende…

    Submitted 9 January, 2018; originally announced January 2018.

  24. arXiv:1608.08984  [pdf, other]

    stat.ML cs.LG

    Towards Competitive Classifiers for Unbalanced Classification Problems: A Study on the Performance Scores

    Authors: Jonathan Ortigosa-Hernández, Iñaki Inza, Jose A. Lozano

    Abstract: Although a great methodological effort has been invested in proposing competitive solutions to the class-imbalance problem, little effort has been made in pursuing a theoretical understanding of this matter. In order to shed some light on this topic, we perform, through a novel framework, an exhaustive analysis of the adequateness of the most commonly used performance scores to assess this compl…

    Submitted 31 August, 2016; originally announced August 2016.

  25. arXiv:1605.02989  [pdf, ps, other]

    stat.ML cs.LG

    An efficient K-means algorithm for Massive Data

    Authors: Marco Capó, Aritz Pérez, José Antonio Lozano

    Abstract: Due to the progressive growth of the amount of data available in a wide variety of scientific fields, it has become more difficult to manipulate and analyze such information. Even though datasets have grown in size, the K-means algorithm remains one of the most popular clustering methods, in spite of its dependency on the initial settings and high computational cost, especially in terms of di…

    Submitted 10 May, 2016; originally announced May 2016.

    Comments: 38 pages, 10 figures

  26. arXiv:1512.03466  [pdf, ps, other]

    cs.NE

    Computing factorized approximations of Pareto-fronts using mNM-landscapes and Boltzmann distributions

    Authors: Roberto Santana, Alexander Mendiburu, Jose A. Lozano

    Abstract: NM-landscapes have been recently introduced as a class of tunable rugged models. They are a subset of the general interaction models where all the interactions are of order less than or equal to $M$. The Boltzmann distribution has been extensively applied in single-objective evolutionary algorithms to implement selection and study the theoretical properties of model-building algorithms. In this paper we p…

    Submitted 10 December, 2015; originally announced December 2015.

    Comments: Accepted for CAEPIA-2015 conference, Albacete, Spain. 11 pages, 3 figures

  27. arXiv:1405.5646  [pdf, other]

    cs.AI cs.DS

    Mathematical Programming Strategies for Solving the Minimum Common String Partition Problem

    Authors: Christian Blum, José A. Lozano, Pedro Pinacho Davidson

    Abstract: The minimum common string partition problem is an NP-hard combinatorial optimization problem with applications in computational biology. In this work we propose the first integer linear programming model for solving this problem. Moreover, on the basis of the integer linear programming model we develop a deterministic 2-phase heuristic which is applicable to larger problem instances. The results s…

    Submitted 22 May, 2014; originally announced May 2014.

    MSC Class: 90-08

  28. arXiv:1301.3871  [pdf]

    cs.AI cs.DS

    Combinatorial Optimization by Learning and Simulation of Bayesian Networks

    Authors: Pedro Larrañaga, Ramon Etxeberria, Jose A. Lozano, Jose M. Pena

    Abstract: This paper shows how the Bayesian network paradigm can be used to solve combinatorial optimization problems. To do so, some methods of structure learning from data and simulation of Bayesian networks are inserted inside Estimation of Distribution Algorithms (EDA). EDA are a new tool for evolutionary computation in which populations of individuals are created by estimation and simulation of…

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-343-352