Skip to main content

Showing 1–44 of 44 results for author: Santana, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10259  [pdf, other

    cs.CL cs.LG

    Optimal synthesis embeddings

    Authors: Roberto Santana, Mauricio Romero Sicre

    Abstract: In this paper we introduce a word embedding composition method based on the intuitive idea that a fair embedding representation for a given set of words should satisfy that the new vector will be at the same distance of the vector representation of each of its constituents, and this distance should be minimized. The embedding composition method can work with static and contextualized word represen… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2404.15118  [pdf, other

    cs.NE cond-mat.dis-nn cond-mat.stat-mech hep-lat physics.comp-ph

    Identifying phase transitions in physical systems with neural networks: a neural architecture search perspective

    Authors: Rodrigo Carmo Terin, Zochil González Arenas, Roberto Santana

    Abstract: The use of machine learning algorithms to investigate phase transitions in physical systems is a valuable way to better understand the characteristics of these systems. Neural networks have been used to extract information of phases and phase transitions directly from many-body configurations. However, one limitation of neural networks is that they require the definition of the model architecture… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 9 pages, 7 figures

  3. arXiv:2403.13740  [pdf, other

    cs.LG

    Uncertainty-Aware Explanations Through Probabilistic Self-Explainable Neural Networks

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano, Marta Kwiatkowska

    Abstract: The lack of transparency of Deep Neural Networks continues to be a limitation that severely undermines their reliability and usage in high-stakes applications. Promising approaches to overcome such limitations are Prototype-Based Self-Explainable Neural Networks (PSENNs), whose predictions rely on the similarity between the input at hand and a set of prototypical representations of the output clas… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  4. arXiv:2403.09249  [pdf, other

    cs.AI

    Leveraging Constraint Programming in a Deep Learning Approach for Dynamically Solving the Flexible Job-Shop Scheduling Problem

    Authors: Imanol Echeverria, Maialen Murua, Roberto Santana

    Abstract: Recent advancements in the flexible job-shop scheduling problem (FJSSP) are primarily based on deep reinforcement learning (DRL) due to its ability to generate high-quality, real-time solutions. However, DRL approaches often fail to fully harness the strengths of existing techniques such as exact methods or constraint programming (CP), which can excel at finding optimal or near-optimal solutions f… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  5. A Connector for Integrating NGSI-LD Data into Open Data Portals

    Authors: Laura Martín, Jorge Lanza, Víctor González, Juan Ramón Santana, Pablo Sotres, Luis Sánchez

    Abstract: Nowadays, there are plenty of data sources generating massive amounts of information that, combined with novel data analytics frameworks, are meant to support optimisation in many application domains. Nonetheless, there are still shortcomings in terms of data discoverability, accessibility and interoperability. Open Data portals have emerged as a shift towards openness and discoverability. However… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: This work belongs to the Special Issue Data Engineering in the Internet of Things of MDPI Sensors. This work has been partially supported by the project SALTED from the European Union's Connecting Europe Facility program under Action Number 2020-EU-IA-0274, and by the project SITED under Grant Agreement No. PID2021-125725OB-I00 funded by MCIN/AEI/10.13039/501100011033 and the European Union FEDER

    Journal ref: Sensors 2024, 24, 1695

  6. SmartSantander: IoT Experimentation over a Smart City Testbed

    Authors: Luis Sanchez, Luis Muñoz, Jose Antonio Galache, Pablo Sotres, Juan R. Santana, Veronica Gutierrez, Rajiv Ramdhany, Alex Gluhak, Srdjan Krco, Evangelos Theodoridis, Dennis Pfisterer

    Abstract: This paper describes the deployment and experimentation architecture of the Internet of Things experimentation facility being deployed at Santander city. The facility is implemented within the SmartSantander project, one of the projects of the Future Internet Research and Experimentation initiative of the European Commission and represents a unique in the world city-scale experimental research fac… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: This work is published in Elsevier Computer Networks. This work has been funded by research project SmartSantander, under FP7-ICT-2009-5 of the 7th Framework Programme of the European Community

    Journal ref: Computer Networks, Volume 61, 14 March 2014, Pages 217-238

  7. arXiv:2310.15706  [pdf, other

    cs.AI cs.LG

    Solving the flexible job-shop scheduling problem through an enhanced deep reinforcement learning approach

    Authors: Imanol Echeverria, Maialen Murua, Roberto Santana

    Abstract: In scheduling problems common in the industry and various real-world scenarios, responding in real-time to disruptive events is essential. Recent methods propose the use of deep reinforcement learning (DRL) to learn policies capable of generating solutions under this constraint. The objective of this paper is to introduce a new DRL method for solving the flexible job-shop scheduling problem, parti… ▽ More

    Submitted 30 January, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  8. arXiv:2306.09628  [pdf, other

    cs.CV stat.ML

    Structural Restricted Boltzmann Machine for image denoising and classification

    Authors: Arkaitz Bidaurrazaga, Aritz Pérez, Roberto Santana

    Abstract: Restricted Boltzmann Machines are generative models that consist of a layer of hidden variables connected to another layer of visible units, and they are used to model the distribution over visible variables. In order to gain a higher representability power, many hidden units are commonly used, which, in combination with a large number of visible units, leads to a high number of trainable paramete… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  9. arXiv:2305.03431  [pdf, other

    cs.SE

    Hearing the voice of experts: Unveiling Stack Exchange communities' knowledge of test smells

    Authors: Luana Martins, Denivan Campos, Railana Santana, Joselito Mota Junior, Heitor Costa, Ivan Machado

    Abstract: Refactorings are transformations to improve the code design without changing overall functionality and observable behavior. During the refactoring process of smelly test code, practitioners may struggle to identify refactoring candidates and define and apply corrective strategies. This paper reports on an empirical study aimed at understanding how test smells and test refactorings are discussed on… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Preprint of the manuscript accepted for publication at CHASE 2023

  10. arXiv:2303.02801  [pdf, ps, other

    cs.NE cs.AI

    Neuroevolutionary algorithms driven by neuron coverage metrics for semi-supervised classification

    Authors: Roberto Santana, Ivan Hidalgo-Cenalmor, Unai Garciarena, Alexander Mendiburu, Jose Antonio Lozano

    Abstract: In some machine learning applications the availability of labeled instances for supervised classification is limited while unlabeled instances are abundant. Semi-supervised learning algorithms deal with these scenarios and attempt to exploit the information contained in the unlabeled examples. In this paper, we address the question of how to evolve neural networks for semi-supervised problems. We… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

  11. arXiv:2302.12565  [pdf, other

    stat.ML cs.LG

    Variational Linearized Laplace Approximation for Bayesian Deep Learning

    Authors: Luis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato

    Abstract: The Linearized Laplace Approximation (LLA) has been recently used to perform uncertainty estimation on the predictions of pre-trained deep neural networks (DNNs). However, its widespread application is hindered by significant computational costs, particularly in scenarios with a large number of training points or DNN parameters. Consequently, additional approximations of LLA, such as Kronecker-fac… ▽ More

    Submitted 22 May, 2024; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 22 pages, 8 figures, ICML 2024

    Journal ref: PMLR 235 (2024)

  12. arXiv:2302.07557  [pdf, other

    cs.LG

    On the Generalization of PINNs outside the training domain and the Hyperparameters influencing it

    Authors: Andrea Bonfanti, Roberto Santana, Marco Ellero, Babak Gholami

    Abstract: Physics-Informed Neural Networks (PINNs) are Neural Network architectures trained to emulate solutions of differential equations without the necessity of solution data. They are currently ubiquitous in the scientific literature due to their flexible and promising settings. However, very little of the available research provides practical studies that aim for a better quantitative understanding of… ▽ More

    Submitted 24 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  13. arXiv:2207.10673  [pdf, other

    stat.ML cs.LG stat.CO

    Correcting Model Bias with Sparse Implicit Processes

    Authors: Simón Rodríguez Santana, Luis A. Ortega, Daniel Hernández-Lobato, Bryan Zaldívar

    Abstract: Model selection in machine learning (ML) is a crucial part of the Bayesian learning procedure. Model choice may impose strong biases on the resulting predictions, which can hinder the performance of methods such as Bayesian neural networks and neural samplers. On the other hand, newly proposed approaches for Bayesian ML exploit features of approximate inference in function space with implicit stoc… ▽ More

    Submitted 8 August, 2022; v1 submitted 21 July, 2022; originally announced July 2022.

    Comments: 4 pages, 1 double figure. Included in ICML 2022 workshop "Beyond Bayes: Paths Towards Universal Reasoning Systems". Extension of previous work on Sparse Implicit Processes (arXiv:2110.07618)

  14. arXiv:2207.05539  [pdf, other

    cs.SE

    Refactoring Assertion Roulette and Duplicate Assert test smells: a controlled experiment

    Authors: Railana Santana, Luana Martins, Tássio Virgínio, Larissa Soares, Heitor Costa, Ivan Machado

    Abstract: Test smells can reduce the developers' ability to interact with the test code. Refactoring test code offers a safe strategy to handle test smells. However, the manual refactoring activity is not a trivial process, and it is often tedious and error-prone. This study aims to evaluate RAIDE, a tool for automatic identification and refactoring of test smells. We present an empirical assessment of RAID… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Journal ref: XXV Ibero-American Conference on Software Engineering (CIbSE 2022)

  15. arXiv:2206.10160  [pdf, other

    cs.LG

    Predicting Parking Lot Availability by Graph-to-Sequence Model: A Case Study with SmartSantander

    Authors: Yuya Sasaki, Junya Takayama, Juan Ramón Santana, Shohei Yamasaki, Tomoya Okuno, Makoto Onizuka

    Abstract: Nowadays, so as to improve services and urban areas livability, multiple smart city initiatives are being carried out throughout the world. SmartSantander is a smart city project in Santander, Spain, which has relied on wireless sensor network technologies to deploy heterogeneous sensors within the city to measure multiple parameters, including outdoor parking information. In this paper, we study… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  16. arXiv:2206.06720  [pdf, other

    stat.ML cs.LG

    Deep Variational Implicit Processes

    Authors: Luis A. Ortega, Simón Rodríguez Santana, Daniel Hernández-Lobato

    Abstract: Implicit processes (IPs) are a generalization of Gaussian processes (GPs). IPs may lack a closed-form expression but are easy to sample from. Examples include, among others, Bayesian neural networks or neural samplers. IPs can be used as priors over functions, resulting in flexible models with well-calibrated prediction uncertainty estimates. Methods based on IPs usually carry out function-space a… ▽ More

    Submitted 16 February, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 19 pages, 6 figures, ICLR 2023

  17. arXiv:2204.01468  [pdf

    cs.CY cs.CL

    Criação e aplicação de ferramenta para auxiliar no ensino de algoritmos e programação de computadores

    Authors: Afonso Henriques Fontes Neto Segundo, Joel Sotero da Cunha Neto, Maria Daniela Santabaia Cavalcanti, Paulo Cirillo Souza Barbosa, Raul Fontenele Santana

    Abstract: Knowledge about programming is part of the knowledge matrix that will be required of the professionals of the future. Based on this, this work aims to report the development of a teaching tool developed during the monitoring program of the Algorithm and Computer Programming discipline of the University of Fortaleza. The tool combines the knowledge acquired in the books, with a language closer to t… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: in Portuguese language

  18. arXiv:2203.16927  [pdf

    cs.RO cs.CL

    Applying PBL in the Development and Modeling of kinematics for Robotic Manipulators with Interdisciplinarity between Computer-Assisted Project, Robotics, and Microcontrollers

    Authors: Afonso Henriques Fontes Neto Segundo, Joel Sotero da Cunha Neto, Paulo Cirillo Souza Barbosa, Raul Fontenele Santana

    Abstract: Considering the difficulty of students in calculating the direct and inverse kinematics of a robotic manipulator using only conventional tools of a classroom, this article proposes the application of Project Based Learning (ABP) through the design, development, mathematical modeling of a robotic manipulator as an integrative project of the disciplines of Industrial Robotics, Microcontrollers and C… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: in Portuguese language

  19. arXiv:2203.16924  [pdf

    cs.RO

    Development of a robotic manipulator: Applying interdisciplinarity in Computer Assister Project, Microcontrollers and Industrial Robotics

    Authors: Afonso Henriques Fontes Neto Segundo, Joel Sotero da Cunha Neto, Reginaldo Florencio da Silva, Paulo Cirillo Souza Barbosa, Raul Fontenele Santana

    Abstract: This work was conceived based on Project-Based Learning (ABP) and presents the design, development and mathematical modeling steps of a low-cost robotic manipulator with five degrees of freedom through an interdisciplinary project linking two very important disciplines of the course of Control Engineering and Automation of the University of Fortaleza: Computer Aided Design, Microcontrollers and In… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: in Portuguese language

  20. arXiv:2111.08165  [pdf, other

    cs.LG cs.CV eess.IV

    RapidRead: Global Deployment of State-of-the-art Radiology AI for a Large Veterinary Teleradiology Practice

    Authors: Michael Fitzke, Conrad Stack, Andre Dourson, Rodrigo M. B. Santana, Diane Wilson, Lisa Ziemer, Arjun Soin, Matthew P. Lungren, Paul Fisher, Mark Parkinson

    Abstract: This work describes the development and real-world deployment of a deep learning-based AI system for evaluating canine and feline radiographs across a broad range of findings and abnormalities. We describe a new semi-supervised learning approach that combines NLP-derived labels with self-supervised training leveraging more than 2.5 million x-ray images. Finally we describe the clinical deployment… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

  21. arXiv:2110.07618  [pdf, other

    stat.ML cs.LG

    Function-space Inference with Sparse Implicit Processes

    Authors: Simón Rodríguez Santana, Bryan Zaldivar, Daniel Hernández-Lobato

    Abstract: Implicit Processes (IPs) represent a flexible framework that can be used to describe a wide variety of models, from Bayesian neural networks, neural samplers and data generators to many others. IPs also allow for approximate inference in function-space. This change of formulation solves intrinsic degenerate problems of parameter-space approximate inference concerning the high number of parameters… ▽ More

    Submitted 21 July, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Published at ICML 2022 (long oral presentation). Code available at https://github.com/simonrsantana/sparse-implicit-processes

  22. arXiv:2107.01943  [pdf, other

    cs.LG cs.CR

    When and How to Fool Explainable Models (and Humans) with Adversarial Examples

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano

    Abstract: Reliable deployment of machine learning models such as neural networks continues to be challenging due to several limitations. Some of the main shortcomings are the lack of interpretability and the lack of robustness against adversarial examples or out-of-distribution inputs. In this exploratory review, we explore the possibilities and limits of adversarial attacks for explainable machine learning… ▽ More

    Submitted 7 July, 2023; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: Updated version. 43 pages, 9 figures, 4 tables

  23. arXiv:2106.08972  [pdf, other

    cs.NE

    Redefining Neural Architecture Search of Heterogeneous Multi-Network Models by Characterizing Variation Operators and Model Components

    Authors: Unai Garciarena, Roberto Santana, Alexander Mendiburu

    Abstract: With neural architecture search methods gaining ground on manually designed deep neural networks -even more rapidly as model sophistication escalates-, the research trend shifts towards arranging different and often increasingly complex neural architecture search spaces. In this conjuncture, delineating algorithms which can efficiently explore these search spaces can result in a significant improv… ▽ More

    Submitted 17 August, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

    MSC Class: 68T07 ACM Class: I.2.6

  24. arXiv:2105.12836  [pdf, other

    cs.NE

    On the Exploitation of Neuroevolutionary Information: Analyzing the Past for a More Efficient Future

    Authors: Unai Garciarena, Nuno Lourenço, Penousal Machado, Roberto Santana, Alexander Mendiburu

    Abstract: Neuroevolutionary algorithms, automatic searches of neural network structures by means of evolutionary techniques, are computationally costly procedures. In spite of this, due to the great performance provided by the architectures which are found, these methods are widely applied. The final outcome of neuroevolutionary processes is the best structure found during the search, and the rest of the pr… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  25. The EMPATHIC Project: Mid-term Achievements

    Authors: M. I. Torres, J. M. Olaso, C. Montenegro, R. Santana, A. Vázquez, R. Justo, J. A. Lozano, S. Schlögl, G. Chollet, N. Dugan, M. Irvine, N. Glackin, C. Pickard, A. Esposito, G. Cordasco, A. Troncone, D. Petrovska-Delacretaz, A. Mtibaa, M. A. Hmani, M. S. Korsnes, L. J. Martinussen, S. Escalera, C. Palmero Cantariño, O. Deroo, O. Gordeeva , et al. (4 additional authors not shown)

    Abstract: The goal of active aging is to promote changes in the elderly community so as to maintain an active, independent and socially-engaged lifestyle. Technological advancements currently provide the necessary tools to foster and monitor such processes. This paper reports on mid-term achievements of the European H2020 EMPATHIC project, which aims to research, innovate, explore and validate new interacti… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 12 pages

  26. arXiv:2103.06138  [pdf, other

    cs.IR cs.LG

    Hybrid Model with Time Modeling for Sequential Recommender Systems

    Authors: Marlesson R. O. Santana, Anderson Soares

    Abstract: Deep learning based methods have been used successfully in recommender system problems. Approaches using recurrent neural networks, transformers, and attention mechanisms are useful to model users' long- and short-term preferences in sequential interactions. To explore different session-based recommendation solutions, Booking.com recently organized the WSDM WebTour 2021 Challenge, which aims to be… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: 5 pages, 2 figures, WSDM Workshop on Web Tourism 2021

    ACM Class: I.2.1; H.4.2

    Journal ref: ACM WSDM Workshop on Web Tourism (WSDM Webtour'21), March 12, 2021, Jerusalem, Israel

  27. arXiv:2012.14352  [pdf, other

    cs.LG

    Analysis of Dominant Classes in Universal Adversarial Perturbations

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano

    Abstract: The reasons why Deep Neural Networks are susceptible to being fooled by adversarial examples remains an open discussion. Indeed, many different strategies can be employed to efficiently generate adversarial attacks, some of them relying on different theoretical justifications. Among these strategies, universal (input-agnostic) perturbations are of particular interest, due to their capability to fo… ▽ More

    Submitted 11 January, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: 20 pages, 10 figures, 4 tables

  28. arXiv:2010.07035  [pdf, other

    cs.IR cs.HC cs.LG stat.ML

    MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces

    Authors: Marlesson R. O. Santana, Luckeciano C. Melo, Fernando H. F. Camargo, Bruno Brandão, Anderson Soares, Renan M. Oliveira, Sandor Caetano

    Abstract: Recommender Systems are especially challenging for marketplaces since they must maximize user satisfaction while maintaining the healthiness and fairness of such ecosystems. In this context, we observed a lack of resources to design, train, and evaluate agents that learn by interacting within these environments. For this matter, we propose MARS-Gym, an open-source framework to empower researchers… ▽ More

    Submitted 30 September, 2020; originally announced October 2020.

    Comments: 15 pages, 14 figures, see https://github.com/deeplearningbrasil/mars-gym

    ACM Class: I.6.5; H.4.2

  29. arXiv:2004.06383  [pdf, other

    cs.LG stat.ML

    Extending Adversarial Attacks to Produce Adversarial Class Probability Distributions

    Authors: Jon Vadillo, Roberto Santana, Jose A. Lozano

    Abstract: Despite the remarkable performance and generalization levels of deep learning models in a wide range of artificial intelligence tasks, it has been demonstrated that these models can be easily fooled by the addition of imperceptible yet malicious perturbations to natural inputs. These altered inputs are known in the literature as adversarial examples. In this paper, we propose a novel probabilistic… ▽ More

    Submitted 25 January, 2023; v1 submitted 14 April, 2020; originally announced April 2020.

    Comments: Final version as accepted in JMLR. Attribution requirements are provided at http://jmlr.org/papers/v24/21-0326.html

    Journal ref: Journal of Machine Learning Research, 24(15):1-42, 2023

  30. arXiv:2001.08444  [pdf, other

    eess.AS cs.HC cs.LG cs.SD stat.ML

    On the human evaluation of audio adversarial examples

    Authors: Jon Vadillo, Roberto Santana

    Abstract: Human-machine interaction is increasingly dependent on speech communication. Machine Learning models are usually applied to interpret human speech commands. However, these models can be fooled by adversarial examples, which are inputs intentionally perturbed to produce a wrong prediction without being noticed. While much research has been focused on develo** new techniques to generate adversaria… ▽ More

    Submitted 12 February, 2021; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: Preprint. 17 pages, 7 figures, 4 tables

  31. arXiv:1911.10182  [pdf, other

    cs.LG eess.AS stat.ML

    Universal adversarial examples in speech command classification

    Authors: Jon Vadillo, Roberto Santana

    Abstract: Adversarial examples are inputs intentionally perturbed with the aim of forcing a machine learning model to produce a wrong prediction, while the changes are not easily detectable by a human. Although this topic has been intensively studied in the image domain, classification tasks in the audio domain have received less attention. In this paper we address the existence of universal perturbations f… ▽ More

    Submitted 13 February, 2021; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: 14 pages, 2 figures, 4 tables; Revised external links

  32. arXiv:1910.05173  [pdf, other

    cs.LG stat.ML

    Evolving Gaussian Process kernels from elementary mathematical expressions

    Authors: Ibai Roman, Roberto Santana, Alexander Mendiburu, Jose A. Lozano

    Abstract: Choosing the most adequate kernel is crucial in many Machine Learning applications. Gaussian Process is a state-of-the-art technique for regression and classification that heavily relies on a kernel function. However, in the Gaussian Process literature, kernels have usually been either ad hoc designed, selected from a predefined set, or searched for in a space of compositions of kernels which have… ▽ More

    Submitted 14 October, 2019; v1 submitted 11 October, 2019; originally announced October 2019.

  33. arXiv:1909.06945  [pdf, other

    stat.ML cs.LG stat.CO

    Adversarial $α$-divergence Minimization for Bayesian Approximate Inference

    Authors: Simón Rodríguez Santana, Daniel Hernández-Lobato

    Abstract: Neural networks are popular state-of-the-art models for many different tasks.They are often trained via back-propagation to find a value of the weights that correctly predicts the observed data. Although back-propagation has shown good performance in many applications, it cannot easily output an estimate of the uncertainty in the predictions made. Estimating the uncertainty in the predictions is a… ▽ More

    Submitted 30 January, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: 47 pages, 10 figures (41 pages for the main article, 6 for the supplementary material)

  34. arXiv:1904.00977  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Sentiment analysis with genetically evolved Gaussian kernels

    Authors: Ibai Roman, Alexander Mendiburu, Roberto Santana, Jose A. Lozano

    Abstract: Sentiment analysis consists of evaluating opinions or statements from the analysis of text. Among the methods used to estimate the degree in which a text expresses a given sentiment, are those based on Gaussian Processes. However, traditional Gaussian Processes methods use a predefined kernel with hyperparameters that can be tuned but whose structure can not be adapted. In this paper, we propose t… ▽ More

    Submitted 14 October, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

  35. arXiv:1903.09171  [pdf, other

    cs.LG cs.AI

    Towards automatic construction of multi-network models for heterogeneous multi-task learning

    Authors: Unai Garciarena, Alexander Mendiburu, Roberto Santana

    Abstract: Multi-task learning, as it is understood nowadays, consists of using one single model to carry out several similar tasks. From classifying hand-written characters of different alphabets to figuring out how to play several Atari games using reinforcement learning, multi-task models have been able to widen their performance range across different tasks, although these tasks are usually of a similar… ▽ More

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: Preprint

    MSC Class: 68T99 ACM Class: I.2.6

  36. arXiv:1806.09935  [pdf, ps, other

    cs.AI cs.DS

    On the performance of multi-objective estimation of distribution algorithms for combinatorial problems

    Authors: Marcella S. R. Martins, Mohamed El Yafrani, Roberto Santana, Myriam Delgado, Ricardo Lüders, Belaïd Ahiod

    Abstract: Fitness landscape analysis investigates features with a high influence on the performance of optimization algorithms, aiming to take advantage of the addressed problem characteristics. In this work, a fitness landscape analysis using problem features is performed for a Multi-objective Bayesian Optimization Algorithm (mBOA) on instances of MNK-landscape problem for 2, 3, 5 and 8 objectives. We also… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.

    Comments: Accepted in IEEE WCCI/CEC '2018

  37. arXiv:1801.04407  [pdf, other

    cs.LG

    Towards a more efficient representation of imputation operators in TPOT

    Authors: Unai Garciarena, Alexander Mendiburu, Roberto Santana

    Abstract: Automated Machine Learning encompasses a set of meta-algorithms intended to design and apply machine learning techniques (e.g., model selection, hyperparameter tuning, model assessment, etc.). TPOT, a software for optimizing machine learning pipelines based on genetic programming (GP), is a novel example of this kind of applications. Recently we have proposed a way to introduce imputation methods… ▽ More

    Submitted 13 January, 2018; originally announced January 2018.

    Comments: 13 pages, 4 figures. Continuation of a previous work

    MSC Class: 68T99 ACM Class: I.2.6

  38. arXiv:1707.03093  [pdf, ps, other

    cs.NE

    Gray-box optimization and factorized distribution algorithms: where two worlds collide

    Authors: Roberto Santana

    Abstract: The concept of gray-box optimization, in juxtaposition to black-box optimization, revolves about the idea of exploiting the problem structure to implement more efficient evolutionary algorithms (EAs). Work on factorized distribution algorithms (FDAs), whose factorizations are directly derived from the problem structure, has also contributed to show how exploiting the problem structure produces imp… ▽ More

    Submitted 10 July, 2017; originally announced July 2017.

    Comments: 33 pages, 9 tables, 3 figures. This paper covers some of the topics of the talk "When the gray box was opened, model-based evolutionary algorithms were already there" presented in the Model-Based Evolutionary Algorithms workshop on July 20, 2016, in Denver

  39. arXiv:1706.01120  [pdf, ps, other

    cs.LG stat.ML

    Evolving imputation strategies for missing data in classification problems with TPOT

    Authors: Unai Garciarena, Roberto Santana, Alexander Mendiburu

    Abstract: Missing data has a ubiquitous presence in real-life applications of machine learning techniques. Imputation methods are algorithms conceived for restoring missing values in the data, based on other entries in the database. The choice of the imputation method has an influence on the performance of the machine learning technique, e.g., it influences the accuracy of the classification algorithm appli… ▽ More

    Submitted 14 August, 2017; v1 submitted 4 June, 2017; originally announced June 2017.

    Comments: 15 pages, 4 figures

    MSC Class: 65C99 ACM Class: D.2.2

  40. arXiv:1702.05624  [pdf, ps, other

    cs.CL

    Reproducing and learning new algebraic operations on word embeddings using genetic programming

    Authors: Roberto Santana

    Abstract: Word-vector representations associate a high dimensional real-vector to every word from a corpus. Recently, neural-network based methods have been proposed for learning this representation from large corpora. This type of word-to-vector embedding is able to keep, in the learned vector space, some of the syntactic and semantic relationships present in the original word corpus. This, in turn, serves… ▽ More

    Submitted 18 February, 2017; originally announced February 2017.

    Comments: 17 pages, 7 tables, 8 figures. Python code available from https://github.com/rsantana-isg/GP_word2vec

  41. arXiv:1608.05105  [pdf, ps, other

    cond-mat.dis-nn cs.NE quant-ph

    Evolutionary Approaches to Optimization Problems in Chimera Topologies

    Authors: Roberto Santana, Zheng Zhu, Helmut G. Katzgraber

    Abstract: Chimera graphs define the topology of one of the first commercially available quantum computers. A variety of optimization problems have been mapped to this topology to evaluate the behavior of quantum enhanced optimization heuristics in relation to other optimizers, being able to efficiently solve problems classically to use them as benchmarks for quantum machines. In this paper we investigate fo… ▽ More

    Submitted 17 August, 2016; originally announced August 2016.

    Comments: 8 pages, 5 figures, 3 tables

    Journal ref: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2016), ACM Press, 397-404 (2016)

  42. arXiv:1512.03466  [pdf, ps, other

    cs.NE

    Computing factorized approximations of Pareto-fronts using mNM-landscapes and Boltzmann distributions

    Authors: Roberto Santana, Alexander Mendiburu, Jose A. Lozano

    Abstract: NM-landscapes have been recently introduced as a class of tunable rugged models. They are a subset of the general interaction models where all the interactions are of order less or equal $M$. The Boltzmann distribution has been extensively applied in single-objective evolutionary algorithms to implement selection and study the theoretical properties of model-building algorithms. In this paper we p… ▽ More

    Submitted 10 December, 2015; originally announced December 2015.

    Comments: Accepted for CAEPIA-2015 conference, Albacete, Spain. 11 pages, 3 figures

  43. arXiv:1511.05625  [pdf, ps, other

    cs.NE

    MOEA/D-GM: Using probabilistic graphical models in MOEA/D for solving combinatorial optimization problems

    Authors: Murilo Zangari de Souza, Roberto Santana, Aurora Trinidad Ramirez Pozo, Alexander Mendiburu

    Abstract: Evolutionary algorithms based on modeling the statistical dependencies (interactions) between the variables have been proposed to solve a wide range of complex problems. These algorithms learn and sample probabilistic graphical models able to encode and exploit the regularities of the problem. This paper investigates the effect of using probabilistic modeling techniques as a way to enhance the beh… ▽ More

    Submitted 17 November, 2015; originally announced November 2015.

    Comments: 13 pages, 4 figures

  44. A probabilistic evolutionary optimization approach to compute quasiparticle braids

    Authors: Roberto Santana, Ross B. McDonald, Helmut G. Katzgraber

    Abstract: Topological quantum computing is an alternative framework for avoiding the quantum decoherence problem in quantum computation. The problem of executing a gate in this framework can be posed as the problem of braiding quasiparticles. Because these are not Abelian, the problem can be reduced to finding an optimal product of braid generators where the optimality is defined in terms of the gate approx… ▽ More

    Submitted 2 October, 2014; originally announced October 2014.

    Comments: 9 pages,7 figures. Accepted at SEAL 2014

    Journal ref: Simulated Evolution and Learning, Lecture Notes in Computer Science 8886, 13 (2014)