Skip to main content

Showing 1–14 of 14 results for author: Chatzimparmpas, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08651  [pdf

    cs.HC cs.AI cs.CV

    How to Distinguish AI-Generated Images from Authentic Photographs

    Authors: Negar Kamali, Karyn Nakamura, Angelos Chatzimparmpas, Jessica Hullman, Matthew Groh

    Abstract: The high level of photorealism in state-of-the-art diffusion models like Midjourney, Stable Diffusion, and Firefly makes it difficult for untrained humans to distinguish between real photographs and AI-generated images. To address this problem, we designed a guide to help readers develop a more critical eye toward identifying artifacts, inconsistencies, and implausibilities that often appear in AI… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 54 pages, 189 Figures

  2. arXiv:2403.12005  [pdf, other

    cs.HC cs.LG stat.ML

    Visualization for Trust in Machine Learning Revisited: The State of the Field in 2023

    Authors: Angelos Chatzimparmpas, Kostiantyn Kucher, Andreas Kerren

    Abstract: Visualization for explainable and trustworthy machine learning remains one of the most important and heavily researched fields within information visualization and visual analytics with various application domains, such as medicine, finance, and bioinformatics. After our 2020 state-of-the-art report comprising 200 techniques, we have persistently collected peer-reviewed articles describing visuali… ▽ More

    Submitted 18 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: This manuscript is accepted for publication in the IEEE Computer Graphics and Applications Journal (IEEE CG&A)

  3. arXiv:2402.06885  [pdf, other

    cs.HC cs.LG stat.CO

    DimVis: Interpreting Visual Clusters in Dimensionality Reduction With Explainable Boosting Machine

    Authors: Parisa Salmanian, Angelos Chatzimparmpas, Ali Can Karaca, Rafael M. Martins

    Abstract: Dimensionality Reduction (DR) techniques such as t-SNE and UMAP are popular for transforming complex datasets into simpler visual representations. However, while effective in uncovering general dataset patterns, these methods may introduce artifacts and suffer from interpretability issues. This paper presents DimVis, a visualization tool that employs supervised Explainable Boosting Machine (EBM) m… ▽ More

    Submitted 18 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: This manuscript is accepted for publication in EuroVis 2024 MLVis Workshop

  4. arXiv:2401.08876  [pdf, other

    cs.HC cs.CV cs.LG

    Evaluating the Utility of Conformal Prediction Sets for AI-Advised Image Labeling

    Authors: Dong** Zhang, Angelos Chatzimparmpas, Negar Kamali, Jessica Hullman

    Abstract: As deep neural networks are more commonly deployed in high-stakes domains, their black-box nature makes uncertainty quantification challenging. We investigate the presentation of conformal prediction sets--a distribution-free class of methods for generating prediction sets with specified coverage--to express uncertainty in AI-advised decision-making. Through a large online experiment, we compare t… ▽ More

    Submitted 25 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 19 pages, 11 figures, 10 tables. Accepted by ACM CHI 2024

  5. arXiv:2311.18807  [pdf, other

    cs.LG stat.ME

    Pre-registration for Predictive Modeling

    Authors: Jake M. Hofman, Angelos Chatzimparmpas, Amit Sharma, Duncan J. Watts, Jessica Hullman

    Abstract: Amid rising concerns of reproducibility and generalizability in predictive modeling, we explore the possibility and potential benefits of introducing pre-registration to the field. Despite notable advancements in predictive modeling, spanning core machine learning tasks to various scientific applications, challenges such as overlooked contextual factors, data-dependent decision-making, and uninten… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  6. DeforestVis: Behavior Analysis of Machine Learning Models with Surrogate Decision Stumps

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Alexandru C. Telea, Andreas Kerren

    Abstract: As the complexity of machine learning (ML) models increases and their application in different (and critical) domains grows, there is a strong demand for more interpretable and trustworthy ML. A direct, model-agnostic, way to interpret such models is to train surrogate models-such as rule sets and decision trees-that sufficiently approximate the original ones while being simpler and easier-to-expl… ▽ More

    Submitted 18 April, 2024; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: This manuscript is accepted for publication in Computer Graphics Forum (CGF)

  7. arXiv:2212.11737  [pdf, other

    cs.LG cs.HC stat.ML

    The State of the Art in Enhancing Trust in Machine Learning Models with the Use of Visualizations

    Authors: A. Chatzimparmpas, R. Martins, I. Jusufi, K. Kucher, Fabrice Rossi, A. Kerren

    Abstract: Machine learning (ML) models are nowadays used in complex applications in various domains, such as medicine, bioinformatics, and other sciences. Due to their black box nature, however, it may sometimes be hard to understand and trust the results they provide. This has increased the demand for reliable visualization tools related to enhancing trust in ML models, which has become a prominent topic o… ▽ More

    Submitted 18 April, 2024; v1 submitted 22 December, 2022; originally announced December 2022.

    Journal ref: Computer Graphics Forum 2020, 39(3), 713-756

  8. arXiv:2212.03539  [pdf, other

    cs.LG cs.HC stat.ML

    MetaStackVis: Visually-Assisted Performance Evaluation of Metamodels

    Authors: Ilya Ploshchik, Angelos Chatzimparmpas, Andreas Kerren

    Abstract: Stacking (or stacked generalization) is an ensemble learning method with one main distinctiveness from the rest: even though several base models are trained on the original data set, their predictions are further used as input data for one or more metamodels arranged in at least one extra layer. Composing a stack of models can produce high-performance outcomes, but it usually involves a trial-and-… ▽ More

    Submitted 18 April, 2024; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: This manuscript is accepted for publication in Proceedings of the 16th IEEE Pacific Visualization Symposium (PacificVis '23)

  9. arXiv:2203.15753  [pdf, other

    cs.LG cs.HC stat.ML

    HardVis: Visual Analytics to Handle Instance Hardness Using Undersampling and Oversampling Techniques

    Authors: Angelos Chatzimparmpas, Fernando V. Paulovich, Andreas Kerren

    Abstract: Despite the tremendous advances in machine learning (ML), training with imbalanced data still poses challenges in many real-world applications. Among a series of diverse techniques to solve this problem, sampling algorithms are regarded as an efficient solution. However, the problem is more fundamental, with many works emphasizing the importance of instance hardness. This issue refers to the signi… ▽ More

    Submitted 18 April, 2024; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: This manuscript is accepted for publication in Computer Graphics Forum (CGF)

    Journal ref: Computer Graphics Forum 2023, 42(1), 135-154

  10. arXiv:2112.00334  [pdf, other

    cs.LG cs.HC stat.ML

    VisRuler: Visual Analytics for Extracting Decision Rules from Bagged and Boosted Decision Trees

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Andreas Kerren

    Abstract: Bagging and boosting are two popular ensemble methods in machine learning (ML) that produce many individual decision trees. Due to the inherent ensemble characteristic of these methods, they typically outperform single decision trees or other ML models in predictive performance. However, numerous decision paths are generated for each decision tree, increasing the overall complexity of the model an… ▽ More

    Submitted 18 April, 2024; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: This manuscript is accepted for publication in the Information Visualization (IV) - SAGE Journals

    Journal ref: Information Visualization, 2021, 22(2), 115-139

  11. arXiv:2103.14539  [pdf, other

    cs.LG cs.HC stat.ML

    FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Kostiantyn Kucher, Andreas Kerren

    Abstract: The machine learning (ML) life cycle involves a series of iterative steps, from the effective gathering and preparation of the data, including complex feature engineering processes, to the presentation and improvement of results, with various algorithms to choose from in every step. Feature engineering in particular can be very beneficial for ML, leading to numerous improvements such as boosting t… ▽ More

    Submitted 18 April, 2024; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: This manuscript is accepted for publication in the IEEE Transactions on Visualization and Computer Graphics Journal (IEEE TVCG)

    Journal ref: IEEE TVCG 2022, 28(4), 1773-1791

  12. arXiv:2012.01205  [pdf, other

    cs.LG cs.HC stat.ML

    VisEvol: Visual Analytics to Support Hyperparameter Search through Evolutionary Optimization

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Kostiantyn Kucher, Andreas Kerren

    Abstract: During the training phase of machine learning (ML) models, it is usually necessary to configure several hyperparameters. This process is computationally intensive and requires an extensive search to infer the best hyperparameter set for the given problem. The challenge is exacerbated by the fact that most ML models are complex internally, and training involves trial-and-error processes that could… ▽ More

    Submitted 18 April, 2024; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: This manuscript is accepted for publication in a special issue of Computer Graphics Forum (CGF)

    Journal ref: Computer Graphics Forum 2021, 40(3), 201-214

  13. arXiv:2005.01575  [pdf, other

    cs.LG cs.HC stat.ML

    StackGenVis: Alignment of Data, Algorithms, and Models for Stacking Ensemble Learning Using Performance Metrics

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Kostiantyn Kucher, Andreas Kerren

    Abstract: In machine learning (ML), ensemble methods such as bagging, boosting, and stacking are widely-established approaches that regularly achieve top-notch predictive performance. Stacking (also called "stacked generalization") is an ensemble method that combines heterogeneous base models, arranged in at least one layer, and then employs another metamodel to summarize the predictions of those models. Al… ▽ More

    Submitted 18 April, 2024; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: This manuscript is accepted for publication in a special issue of IEEE Transactions on Visualization and Computer Graphics Journal (IEEE TVCG)

    Journal ref: IEEE TVCG 2021, 27(2), 1547-1557

  14. arXiv:2002.06910  [pdf, other

    cs.LG cs.HC stat.ML

    t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections

    Authors: Angelos Chatzimparmpas, Rafael M. Martins, Andreas Kerren

    Abstract: t-Distributed Stochastic Neighbor Embedding (t-SNE) for the visualization of multidimensional data has proven to be a popular approach, with successful applications in a wide range of domains. Despite their usefulness, t-SNE projections can be hard to interpret or even misleading, which hurts the trustworthiness of the results. Understanding the details of t-SNE itself and the reasons behind speci… ▽ More

    Submitted 18 April, 2024; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: This manuscript is published in the IEEE Transactions on Visualization and Computer Graphics Journal (IEEE TVCG)

    Journal ref: IEEE TVCG 2020, 26(8), 2696-2714