Skip to main content

Showing 1–11 of 11 results for author: Martins, R M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.06885  [pdf, other

    cs.HC cs.LG stat.CO

    DimVis: Interpreting Visual Clusters in Dimensionality Reduction With Explainable Boosting Machine

    Authors: Parisa Salmanian, Angelos Chatzimparmpas, Ali Can Karaca, Rafael M. Martins

    Abstract: Dimensionality Reduction (DR) techniques such as t-SNE and UMAP are popular for transforming complex datasets into simpler visual representations. However, while effective in uncovering general dataset patterns, these methods may introduce artifacts and suffer from interpretability issues. This paper presents DimVis, a visualization tool that employs supervised Explainable Boosting Machine (EBM) m… ▽ More

    Submitted 18 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: This manuscript is accepted for publication in EuroVis 2024 MLVis Workshop

  2. DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Alexandru C. Telea, Andreas Kerren

    Abstract: As the complexity of machine learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model-agnostic, way to interpret such models is to train surrogate models-such as rule sets and decision trees-that sufficiently approximate the original ones while being simpler and easier-to-expl… ▽ More

    Submitted 18 April, 2024; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: This manuscript is accepted for publication in Computer Graphics Forum (CGF)

  3. arXiv:2112.00334  [pdf, other

    cs.LG cs.HC stat.ML

    VisRuler: Visual Analytics for Extracting Decision Rules from Bagged and Boosted Decision Trees

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Andreas Kerren

    Abstract: Bagging and boosting are two popular ensemble methods in machine learning (ML) that produce many individual decision trees. Due to the inherent ensemble characteristic of these methods, they typically outperform single decision trees or other ML models in predictive performance. However, numerous decision paths are generated for each decision tree, increasing the overall complexity of the model an… ▽ More

    Submitted 18 April, 2024; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: This manuscript is accepted for publication in the Information Visualization (IV) - SAGE Journals

    Journal ref: Information Visualization, 2021, 22(2), 115-139

  4. arXiv:2106.07718  [pdf, other

    cs.LG cs.GR

    HUMAP: Hierarchical Uniform Manifold Approximation and Projection

    Authors: Wilson E. Marcílio-Jr, Danilo M. Eler, Fernando V. Paulovich, Rafael M. Martins

    Abstract: Dimensionality reduction (DR) techniques help analysts understand patterns in high-dimensional spaces. These techniques, often represented by scatter plots, are employed in diverse science domains and facilitate similarity analysis among clusters and data samples. For datasets containing many granularities or when analysis follows the information visualization mantra, hierarchical DR techniques ar… ▽ More

    Submitted 24 April, 2023; v1 submitted 14 June, 2021; originally announced June 2021.

  5. arXiv:2103.14539  [pdf, other

    cs.LG cs.HC stat.ML

    FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Kostiantyn Kucher, Andreas Kerren

    Abstract: The machine learning (ML) life cycle involves a series of iterative steps, from the effective gathering and preparation of the data, including complex feature engineering processes, to the presentation and improvement of results, with various algorithms to choose from in every step. Feature engineering in particular can be very beneficial for ML, leading to numerous improvements such as boosting t… ▽ More

    Submitted 18 April, 2024; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: This manuscript is accepted for publication in the IEEE Transactions on Visualization and Computer Graphics Journal (IEEE TVCG)

    Journal ref: IEEE TVCG 2022, 28(4), 1773-1791

  6. Using Visual Text Mining to Support the Study Selection Activity in Systematic Literature Reviews

    Authors: Katia Romero Felizardo, Norsaremah Salleh, Rafael M. Martins, Emília Mendes, Stephen G. MacDonell, José Carlos Maldonado

    Abstract: Background: A systematic literature review (SLR) is a methodology used to aggregate all relevant existing evidence to answer a research question of interest. Although crucial, the process used to select primary studies can be arduous, time consuming, and must often be conducted manually. Objective: We propose a novel approach, known as 'Systematic Literature Review based on Visual Text Mining' or… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: Conference paper, 11 pages, 1 table, 6 figures

    Journal ref: Proceedings of the 5th International Symposium on Empirical Software Engineering and Measurement (ESEM2011)

  7. arXiv:2012.01205  [pdf, other

    cs.LG cs.HC stat.ML

    VisEvol: Visual Analytics to Support Hyperparameter Search through Evolutionary Optimization

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Kostiantyn Kucher, Andreas Kerren

    Abstract: During the training phase of machine learning (ML) models, it is usually necessary to configure several hyperparameters. This process is computationally intensive and requires an extensive search to infer the best hyperparameter set for the given problem. The challenge is exacerbated by the fact that most ML models are complex internally, and training involves trial-and-error processes that could… ▽ More

    Submitted 18 April, 2024; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: This manuscript is accepted for publication in a special issue of Computer Graphics Forum (CGF)

    Journal ref: Computer Graphics Forum 2021, 40(3), 201-214

  8. arXiv:2005.01575  [pdf, other

    cs.LG cs.HC stat.ML

    StackGenVis: Alignment of Data, Algorithms, and Models for Stacking Ensemble Learning Using Performance Metrics

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Kostiantyn Kucher, Andreas Kerren

    Abstract: In machine learning (ML), ensemble methods such as bagging, boosting, and stacking are widely-established approaches that regularly achieve top-notch predictive performance. Stacking (also called "stacked generalization") is an ensemble method that combines heterogeneous base models, arranged in at least one layer, and then employs another metamodel to summarize the predictions of those models. Al… ▽ More

    Submitted 18 April, 2024; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: This manuscript is accepted for publication in a special issue of IEEE Transactions on Visualization and Computer Graphics Journal (IEEE TVCG)

    Journal ref: IEEE TVCG 2021, 27(2), 1547-1557

  9. arXiv:2003.09017  [pdf, other

    eess.SP cs.LG

    Xtreaming: an incremental multidimensional projection technique and its application to streaming data

    Authors: Tácito T. A. T. Neves, Rafael M. Martins, Danilo B. Coimbra, Kostiantyn Kucher, Andreas Kerren, Fernando V. Paulovich

    Abstract: Streaming data applications are becoming more common due to the ability of different information sources to continuously capture or produce data, such as sensors and social media. Despite recent advances, most visualization approaches, in particular, multidimensional projection or dimensionality reduction techniques, cannot be directly applied in such scenarios due to the transient nature of strea… ▽ More

    Submitted 7 March, 2020; originally announced March 2020.

    Comments: 12 pages, 11 figures

  10. arXiv:2002.06910  [pdf, other

    cs.LG cs.HC stat.ML

    t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Andreas Kerren

    Abstract: t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of multidimensional data has proven to be a popular approach, with successful applications in a wide range of domains. Despite their usefulness, t-SNE projections can be hard to interpret or even misleading, which hurts the trustworthiness of the results. Understanding the details of t-SNE itself and the reasons behind speci… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: This manuscript is published in the IEEE Transactions on Visualization and Computer Graphics Journal (IEEE TVCG)

    Journal ref: IEEE TVCG 2020, 26(8), 2696-2714

  11. arXiv:1903.06262  [pdf, other

    cs.CV

    A Grid-based Method for Removing Overlaps of Dimensionality Reduction Scatterplot Layouts

    Authors: Gladys M. Hilasaca, Wilson E. Marcílio-Jr, Danilo M. Eler, Rafael M. Martins, Fernando V. Paulovich

    Abstract: Dimensionality Reduction (DR) scatterplot layouts have become a ubiquitous visualization tool for analyzing multidimensional datasets. Despite their popularity, such scatterplots suffer from occlusion, especially when informative glyphs are used to represent data instances, potentially obfuscating critical information for the analysis under execution. Different strategies have been devised to addr… ▽ More

    Submitted 10 October, 2023; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: 14 pages, 10 figures. A preprint version of a publication at IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023