Skip to main content

Showing 1–8 of 8 results for author: Charte, D

Searching in archive cs. Search in all archives.
.
  1. Reducing Data Complexity using Autoencoders with Class-informed Loss Functions

    Authors: David Charte, Francisco Charte, Francisco Herrera

    Abstract: Available data in machine learning applications is becoming increasingly complex, due to higher dimensionality and difficult classes. There exists a wide variety of approaches to measuring complexity of labeled data, according to class overlap, separability or boundary shapes, as well as group morphology. Many techniques can transform the data in order to find better features, but few focus on spe… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: This paper has been accepted for publication by IEEE Transactions on Pattern Analysis and Machine Intelligence

    MSC Class: 68T07 ACM Class: I.2.6; I.5.1

  2. Revisiting Data Complexity Metrics Based on Morphology for Overlap and Imbalance: Snapshot, New Overlap Number of Balls Metrics and Singular Problems Prospect

    Authors: José Daniel Pascual-Triana, David Charte, Marta Andrés Arroyo, Alberto Fernández, Francisco Herrera

    Abstract: Data Science and Machine Learning have become fundamental assets for companies and research institutions alike. As one of its fields, supervised classification allows for class prediction of new samples, learning from given training data. However, some properties can cause datasets to be problematic to classify. In order to evaluate a dataset a priori, data complexity metrics have been used exte… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 23 pages, 9 figures, preprint

    Journal ref: Knowledge and Information Systems (Knowl Inf Syst 63, 1961-1989 (2021))

  3. arXiv:2006.01409  [pdf, other

    eess.IV cs.CV

    COVIDGR dataset and COVID-SDNet methodology for predicting COVID-19 based on Chest X-Ray images

    Authors: S. Tabik, A. Gómez-Ríos, J. L. Martín-Rodríguez, I. Sevillano-García, M. Rey-Area, D. Charte, E. Guirado, J. L. Suárez, J. Luengo, M. A. Valero-González, P. García-Villanova, E. Olmedo-Sánchez, F. Herrera

    Abstract: Currently, Coronavirus disease (COVID-19), one of the most infectious diseases in the 21st century, is diagnosed using RT-PCR testing, CT scans and/or Chest X-Ray (CXR) images. CT (Computed Tomography) scanners and RT-PCR testing are not available in most medical centers and hence in many cases CXR images become the most time/cost effective tool for assisting clinicians in making decisions. Deep l… ▽ More

    Submitted 11 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: Paper accepted in Journal of Biomedical And Health Informatics

  4. An analysis on the use of autoencoders for representation learning: fundamentals, learning task case studies, explainability and challenges

    Authors: David Charte, Francisco Charte, María J. del Jesus, Francisco Herrera

    Abstract: In many machine learning tasks, learning a good representation of the data can be the key to building a well-performant solution. This is because most learning algorithms operate with the features in order to find models for the data. For instance, classification performance can improve if the data is mapped to a space where classes are easily separated, and regression can be facilitated by findin… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    MSC Class: 68T05

    Journal ref: Neurocomputing 404 (2020) 93-107

  5. A Showcase of the Use of Autoencoders in Feature Learning Applications

    Authors: David Charte, Francisco Charte, María J. del Jesus, Francisco Herrera

    Abstract: Autoencoders are techniques for data representation learning based on artificial neural networks. Differently to other feature learning methods which may be focused on finding specific transformations of the feature space, they can be adapted to fulfill many purposes, such as data visualization, denoising, anomaly detection and semantic hashing. This work presents these applications and provides d… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: This manuscript was accepted as conference paper in IWINAC 2019. The final authenticated publication is available online at https://doi.org/10.1007/978-3-030-19651-6_40

    Journal ref: In: From Bioinspired Systems and Biomedical Applications to Machine Learning/IWINAC 2019. LNCS vol 11487. Springer (2019)

  6. A snapshot on nonstandard supervised learning problems: taxonomy, relationships and methods

    Authors: David Charte, Francisco Charte, Salvador García, Francisco Herrera

    Abstract: Machine learning is a field which studies how machines can alter and adapt their behavior, improving their actions according to the information they are given. This field is subdivided into multiple areas, among which the best known are supervised learning (e.g. classification and regression) and unsupervised learning (e.g. clustering and association rules). Within supervised learning, most stud… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    MSC Class: 68T05; 68T10

    Journal ref: Charte, D., Charte, F., García, S. et al. Prog Artif Intell (2018)

  7. Tips, guidelines and tools for managing multi-label datasets: the mldr.datasets R package and the Cometa data repository

    Authors: Francisco Charte, Antonio J. Rivera, David Charte, María J. del Jesus, Francisco Herrera

    Abstract: New proposals in the field of multi-label learning algorithms have been growing in number steadily over the last few years. The experimentation associated with each of them always goes through the same phases: selection of datasets, partitioning, training, analysis of results and, finally, comparison with existing methods. This last step is often hampered since it involves using exactly the same d… ▽ More

    Submitted 10 February, 2018; originally announced February 2018.

  8. A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines

    Authors: David Charte, Francisco Charte, Salvador García, María J. del Jesus, Francisco Herrera

    Abstract: Many of the existing machine learning algorithms, both supervised and unsupervised, depend on the quality of the input characteristics to generate a good model. The amount of these variables is also important, since performance tends to decline as the input dimensionality increases, hence the interest in using feature fusion techniques, able to produce feature sets that are more compact and higher… ▽ More

    Submitted 4 January, 2018; originally announced January 2018.

    Journal ref: Information Fusion 44 (2018) 78-96