Showing 1–12 of 12 results for author: Charte, F

  1. arXiv:2312.14812  [pdf, other]

    cs.CV cs.LG

    PARDINUS: Weakly supervised discarding of photo-trapping empty images based on autoencoders

    Authors: David de la Rosa, Antonio J Rivera, María J del Jesus, Francisco Charte

    Abstract: Photo-trapping cameras are widely employed for wildlife monitoring. Those cameras take photographs when motion is detected to capture images where animals appear. A significant portion of these images are empty - no wildlife appears in the image. Filtering out those images is not a trivial task since it requires hours of manual work from biologists. Therefore, there is a notable interest in automa…

    Submitted 22 December, 2023; originally announced December 2023.

  2. arXiv:2305.17152  [pdf, other]

    cs.LG cs.AI

    mldr.resampling: Efficient Reference Implementations of Multilabel Resampling Algorithms

    Authors: Antonio J. Rivera, Miguel A. Dávila, David Elizondo, María J. del Jesus, Francisco Charte

    Abstract: Resampling algorithms are a useful approach to deal with imbalanced learning in multilabel scenarios. These methods have to deal with singularities in the multilabel data, such as the occurrence of frequent and infrequent labels in the same instance. Implementations of these methods are sometimes limited to the pseudocode provided by their authors in a paper. This Original Software Publication pre…

    Submitted 30 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  3. arXiv:2301.06047  [pdf, other]

    cs.NE cs.AI cs.LG

    EvoAAA: An evolutionary methodology for automated neural autoencoder architecture search

    Authors: Francisco Charte, Antonio J. Rivera, Francisco Martínez, María J. del Jesus

    Abstract: Machine learning models work better when curated features are provided to them. Feature engineering methods have been usually used as a preprocessing step to obtain or build a proper feature set. In recent years, autoencoders (a specific type of symmetrical neural network) have been widely used to perform representation learning, proving their competitiveness against classical feature engineering al…

    Submitted 15 January, 2023; originally announced January 2023.

    Comments: Paper submitted to Integrated Computer-Aided Engineering

  4. Reducing Data Complexity using Autoencoders with Class-informed Loss Functions

    Authors: David Charte, Francisco Charte, Francisco Herrera

    Abstract: Available data in machine learning applications is becoming increasingly complex, due to higher dimensionality and difficult classes. There exists a wide variety of approaches to measuring complexity of labeled data, according to class overlap, separability or boundary shapes, as well as group morphology. Many techniques can transform the data in order to find better features, but few focus on spe…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: This paper has been accepted for publication by IEEE Transactions on Pattern Analysis and Machine Intelligence

    MSC Class: 68T07 ACM Class: I.2.6; I.5.1

  5. An analysis on the use of autoencoders for representation learning: fundamentals, learning task case studies, explainability and challenges

    Authors: David Charte, Francisco Charte, María J. del Jesus, Francisco Herrera

    Abstract: In many machine learning tasks, learning a good representation of the data can be the key to building a well-performant solution. This is because most learning algorithms operate with the features in order to find models for the data. For instance, classification performance can improve if the data is mapped to a space where classes are easily separated, and regression can be facilitated by findin…

    Submitted 21 May, 2020; originally announced May 2020.

    MSC Class: 68T05

    Journal ref: Neurocomputing 404 (2020) 93-107

  6. A Showcase of the Use of Autoencoders in Feature Learning Applications

    Authors: David Charte, Francisco Charte, María J. del Jesus, Francisco Herrera

    Abstract: Autoencoders are techniques for data representation learning based on artificial neural networks. Differently to other feature learning methods which may be focused on finding specific transformations of the feature space, they can be adapted to fulfill many purposes, such as data visualization, denoising, anomaly detection and semantic hashing. This work presents these applications and provides d…

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: This manuscript was accepted as conference paper in IWINAC 2019. The final authenticated publication is available online at https://doi.org/10.1007/978-3-030-19651-6_40

    Journal ref: In: From Bioinspired Systems and Biomedical Applications to Machine Learning/IWINAC 2019. LNCS vol 11487. Springer (2019)

  7. A snapshot on nonstandard supervised learning problems: taxonomy, relationships and methods

    Authors: David Charte, Francisco Charte, Salvador García, Francisco Herrera

    Abstract: Machine learning is a field which studies how machines can alter and adapt their behavior, improving their actions according to the information they are given. This field is subdivided into multiple areas, among which the best known are supervised learning (e.g. classification and regression) and unsupervised learning (e.g. clustering and association rules). Within supervised learning, most stud…

    Submitted 29 November, 2018; originally announced November 2018.

    MSC Class: 68T05; 68T10

    Journal ref: Charte, D., Charte, F., García, S. et al. Prog Artif Intell (2018)

  8. arXiv:1802.08465  [pdf, other]

    cs.LG cs.NE

    AEkNN: An AutoEncoder kNN-based classifier with built-in dimensionality reduction

    Authors: Francisco J. Pulgar, Francisco Charte, Antonio J. Rivera, María J. del Jesus

    Abstract: High dimensionality, i.e. data having a large number of variables, tends to be a challenge for most machine learning tasks, including classification. A classifier usually builds a model representing how a set of inputs explain the outputs. The larger is the set of inputs and/or outputs, the more complex would be that model. There is a family of classification algorithms, known as lazy learning met…

    Submitted 9 March, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: 35 pages, 13 figures, 12 tables

  9. Dealing with Difficult Minority Labels in Imbalanced Multilabel Data Sets

    Authors: Francisco Charte, Antonio J. Rivera, María J. del Jesus, Francisco Herrera

    Abstract: Multilabel classification is an emergent data mining task with a broad range of real world applications. Learning from imbalanced multilabel data is being deeply studied latterly, and several resampling methods have been proposed in the literature. The unequal label distribution in most multilabel datasets, with disparate imbalance levels, could be a handicap while learning new classifiers. In add…

    Submitted 14 February, 2018; originally announced February 2018.

  10. Tackling Multilabel Imbalance through Label Decoupling and Data Resampling Hybridization

    Authors: Francisco Charte, Antonio J. Rivera, María J. del Jesus, Francisco Herrera

    Abstract: The learning from imbalanced data is a deeply studied problem in standard classification and, in recent times, also in multilabel classification. A handful of multilabel resampling methods have been proposed in recent years, aiming to balance the labels distribution. However these methods have to face a new obstacle, specific for multilabel data, as is the joint appearance of minority and majority l…

    Submitted 14 February, 2018; originally announced February 2018.

  11. Tips, guidelines and tools for managing multi-label datasets: the mldr.datasets R package and the Cometa data repository

    Authors: Francisco Charte, Antonio J. Rivera, David Charte, María J. del Jesus, Francisco Herrera

    Abstract: New proposals in the field of multi-label learning algorithms have been growing in number steadily over the last few years. The experimentation associated with each of them always goes through the same phases: selection of datasets, partitioning, training, analysis of results and, finally, comparison with existing methods. This last step is often hampered since it involves using exactly the same d…

    Submitted 10 February, 2018; originally announced February 2018.

  12. A practical tutorial on autoencoders for nonlinear feature fusion: Taxonomy, models, software and guidelines

    Authors: David Charte, Francisco Charte, Salvador García, María J. del Jesus, Francisco Herrera

    Abstract: Many of the existing machine learning algorithms, both supervised and unsupervised, depend on the quality of the input characteristics to generate a good model. The amount of these variables is also important, since performance tends to decline as the input dimensionality increases, hence the interest in using feature fusion techniques, able to produce feature sets that are more compact and higher…

    Submitted 4 January, 2018; originally announced January 2018.

    Journal ref: Information Fusion 44 (2018) 78-96