-
PADTHAI-MM: A Principled Approach for Designing Trustable, Human-centered AI systems using the MAST Methodology
Authors:
Nayoung Kim,
Myke C. Cohen,
Yang Ba,
Anna Pan,
Shawaiz Bhatti,
Pouria Salehi,
James Sung,
Erik Blasch,
Michelle V. Mancenido,
Erin K. Chiou
Abstract:
Designing for AI trustworthiness is challenging, with a lack of practical guidance despite extensive literature on trust. The Multisource AI Scorecard Table (MAST), a checklist rating system, addresses this gap in designing and evaluating AI-enabled decision support systems. We propose the Principled Approach for Designing Trustable Human-centered AI systems using MAST Methodology (PADTHAI-MM), a…
▽ More
Designing for AI trustworthiness is challenging, with a lack of practical guidance despite extensive literature on trust. The Multisource AI Scorecard Table (MAST), a checklist rating system, addresses this gap in designing and evaluating AI-enabled decision support systems. We propose the Principled Approach for Designing Trustable Human-centered AI systems using MAST Methodology (PADTHAI-MM), a nine-step framework what we demonstrate through the iterative design of a text analysis platform called the REporting Assistant for Defense and Intelligence Tasks (READIT). We designed two versions of READIT, high-MAST including AI context and explanations, and low-MAST resembling a "black box" type system. Participant feedback and state-of-the-art AI knowledge was integrated in the design process, leading to a redesigned prototype tested by participants in an intelligence reporting task. Results show that MAST-guided design can improve trust perceptions, and that MAST criteria can be linked to performance, process, and purpose information, providing a practical and theory-informed basis for AI system design.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Evaluating Trustworthiness of AI-Enabled Decision Support Systems: Validation of the Multisource AI Scorecard Table (MAST)
Authors:
Pouria Salehi,
Yang Ba,
Nayoung Kim,
Ahmadreza Mosallanezhad,
Anna Pan,
Myke C. Cohen,
Yixuan Wang,
Jieqiong Zhao,
Shawaiz Bhatti,
James Sung,
Erik Blasch,
Michelle V. Mancenido,
Erin K. Chiou
Abstract:
The Multisource AI Scorecard Table (MAST) is a checklist tool based on analytic tradecraft standards to inform the design and evaluation of trustworthy AI systems. In this study, we evaluate whether MAST is associated with people's trust perceptions in AI-enabled decision support systems (AI-DSSs). Evaluating trust in AI-DSSs poses challenges to researchers and practitioners. These challenges incl…
▽ More
The Multisource AI Scorecard Table (MAST) is a checklist tool based on analytic tradecraft standards to inform the design and evaluation of trustworthy AI systems. In this study, we evaluate whether MAST is associated with people's trust perceptions in AI-enabled decision support systems (AI-DSSs). Evaluating trust in AI-DSSs poses challenges to researchers and practitioners. These challenges include identifying the components, capabilities, and potential of these systems, many of which are based on the complex deep learning algorithms that drive DSS performance and preclude complete manual inspection. We developed two interactive, AI-DSS test environments using the MAST criteria. One emulated an identity verification task in security screening, and another emulated a text summarization system to aid in an investigative reporting task. Each test environment had one version designed to match low-MAST ratings, and another designed to match high-MAST ratings, with the hypothesis that MAST ratings would be positively related to the trust ratings of these systems. A total of 177 subject matter experts were recruited to interact with and evaluate these systems. Results generally show higher MAST ratings for the high-MAST conditions compared to the low-MAST groups, and that measures of trust perception are highly correlated with the MAST ratings. We conclude that MAST can be a useful tool for designing and evaluating systems that will engender high trust perceptions, including AI-DSS that may be used to support visual screening and text summarization tasks. However, higher MAST ratings may not translate to higher joint performance.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
An updated nuclear-physics and multi-messenger astrophysics framework for binary neutron star mergers
Authors:
Peter T. H. Pang,
Tim Dietrich,
Michael W. Coughlin,
Mattia Bulla,
Ingo Tews,
Mouza Almualla,
Tyler Barna,
Weizmann Kiendrebeogo,
Nina Kunert,
Gargi Mansingh,
Brandon Reed,
Niharika Sravan,
Andrew Toivonen,
Sarah Antier,
Robert O. VandenBerg,
Jack Heinzel,
Vsevolod Nedora,
Pouyan Salehi,
Ritwik Sharma,
Rahul Somasundaram,
Chris Van Den Broeck
Abstract:
The multi-messenger detection of the gravitational-wave signal GW170817, the corresponding kilonova AT2017gfo and the short gamma-ray burst GRB170817A, as well as the observed afterglow has delivered a scientific breakthrough. For an accurate interpretation of all these different messengers, one requires robust theoretical models that describe the emitted gravitational-wave, the electromagnetic em…
▽ More
The multi-messenger detection of the gravitational-wave signal GW170817, the corresponding kilonova AT2017gfo and the short gamma-ray burst GRB170817A, as well as the observed afterglow has delivered a scientific breakthrough. For an accurate interpretation of all these different messengers, one requires robust theoretical models that describe the emitted gravitational-wave, the electromagnetic emission, and dense matter reliably. In addition, one needs efficient and accurate computational tools to ensure a correct cross-correlation between the models and the observational data. For this purpose, we have developed the Nuclear-physics and Multi-Messenger Astrophysics framework NMMA. The code allows incorporation of nuclear-physics constraints at low densities as well as X-ray and radio observations of isolated neutron stars. In previous works, the NMMA code has allowed us to constrain the equation of state of supranuclear dense matter, to measure the Hubble constant, and to compare dense-matter physics probed in neutron-star mergers and in heavy-ion collisions, and to classify electromagnetic observations and perform model selection. Here, we show an extension of the NMMA code as a first attempt of analyzing the gravitational-wave signal, the kilonova, and the gamma-ray burst afterglow simultaneously. Incorporating all available information, we estimate the radius of a $1.4M_\odot$ neutron star to be $R=11.98^{+0.35}_{-0.40}$km.
△ Less
Submitted 8 January, 2024; v1 submitted 17 May, 2022;
originally announced May 2022.
-
Using Neural Networks to Perform Rapid High-Dimensional Kilonova Parameter Inference
Authors:
Mouza Almualla,
Yuhong Ning,
Pouyan Salehi,
Mattia Bulla,
Tim Dietrich,
Michael W. Coughlin,
Nidhal Guessoum
Abstract:
On the 17th of August, 2017 came the simultaneous detections of GW170817, a gravitational wave that originated from the coalescence of two neutron stars, along with the gamma-ray burst GRB170817A, and the kilonova counterpart AT2017gfo. Since then, there has been much excitement surrounding the study of neutron star mergers, both observationally, using a variety of tools, and theoretically, with t…
▽ More
On the 17th of August, 2017 came the simultaneous detections of GW170817, a gravitational wave that originated from the coalescence of two neutron stars, along with the gamma-ray burst GRB170817A, and the kilonova counterpart AT2017gfo. Since then, there has been much excitement surrounding the study of neutron star mergers, both observationally, using a variety of tools, and theoretically, with the development of complex models describing the gravitational-wave and electromagnetic signals. In this work, we improve upon our pipeline to infer kilonova properties from observed light-curves by employing a Neural-Network framework that reduces execution time and handles much larger simulation sets than previously possible. In particular, we use the radiative transfer code POSSIS to construct 5-dimensional kilonova grids where we employ different functional forms for the angular dependence of the dynamical ejecta component. We find that incorporating an angular dependence improves the fit to the AT2017gfo light-curves by up to ~50% when quantified in terms of the weighted Mean Square Error.
△ Less
Submitted 12 April, 2023; v1 submitted 31 December, 2021;
originally announced December 2021.
-
SinGAN-Seg: Synthetic training data generation for medical image segmentation
Authors:
Vajira Thambawita,
Pegah Salehi,
Sajad Amouei Sheshkal,
Steven A. Hicks,
Hugo L. Hammer,
Sravanthi Parasa,
Thomas de Lange,
Pål Halvorsen,
Michael A. Riegler
Abstract:
Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the da…
▽ More
Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the data used to train them. Large amounts of data can be difficult to obtain in medicine due to privacy, expensive and time-consuming annotations, and a general lack of data samples for infrequent lesions. Here, we present a novel synthetic data generation pipeline, called SinGAN-Seg, to produce synthetic medical images with corresponding masks using a single training image. Our method is different from the traditional GANs because our model needs only a single image and the corresponding ground truth to train. Our method produces alternative artificial segmentation datasets with ground truth masks when real datasets are not allowed to share. The pipeline is evaluated using qualitative and quantitative comparisons between real and synthetic data to show that the style transfer technique used in our pipeline significantly improves the quality of the generated data and our method is better than other state-of-the-art GANs to prepare synthetic images when the size of training datasets are limited. By training UNet++ using both real and the synthetic data generated from the SinGAN-Seg pipeline, we show that models trained with synthetic data have very close performances to those trained on real data when the datasets have a considerable amount of data. In contrast, Synthetic data generated from the SinGAN-Seg pipeline can improve the performance of segmentation models when training datasets do not have a considerable amount of data. The code is available on GitHub.
△ Less
Submitted 25 April, 2022; v1 submitted 29 June, 2021;
originally announced July 2021.
-
Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments
Authors:
Pegah Salehi,
Abdolah Chalechale,
Maryam Taghizadeh
Abstract:
One of the most significant challenges in statistical signal processing and machine learning is how to obtain a generative model that can produce samples of large-scale data distribution, such as images and speeches. Generative Adversarial Network (GAN) is an effective method to address this problem. The GANs provide an appropriate way to learn deep representations without widespread use of labele…
▽ More
One of the most significant challenges in statistical signal processing and machine learning is how to obtain a generative model that can produce samples of large-scale data distribution, such as images and speeches. Generative Adversarial Network (GAN) is an effective method to address this problem. The GANs provide an appropriate way to learn deep representations without widespread use of labeled training data. This approach has attracted the attention of many researchers in computer vision since it can generate a large amount of data without precise modeling of the probability density function (PDF). In GANs, the generative model is estimated via a competitive process where the generator and discriminator networks are trained simultaneously. The generator learns to generate plausible data, and the discriminator learns to distinguish fake data created by the generator from real data samples. Given the rapid growth of GANs over the last few years and their application in various fields, it is necessary to investigate these networks accurately. In this paper, after introducing the main concepts and the theory of GAN, two new deep generative models are compared, the evaluation metrics utilized in the literature and challenges of GANs are also explained. Moreover, the most remarkable GAN architectures are categorized and discussed. Finally, the essential applications in computer vision are examined.
△ Less
Submitted 27 May, 2020;
originally announced May 2020.
-
Pix2Pix-based Stain-to-Stain Translation: A Solution for Robust Stain Normalization in Histopathology Images Analysis
Authors:
Pegah Salehi,
Abdolah Chalechale
Abstract:
The diagnosis of cancer is mainly performed by visual analysis of the pathologists, through examining the morphology of the tissue slices and the spatial arrangement of the cells. If the microscopic image of a specimen is not stained, it will look colorless and textured. Therefore, chemical staining is required to create contrast and help identify specific tissue components. During tissue preparat…
▽ More
The diagnosis of cancer is mainly performed by visual analysis of the pathologists, through examining the morphology of the tissue slices and the spatial arrangement of the cells. If the microscopic image of a specimen is not stained, it will look colorless and textured. Therefore, chemical staining is required to create contrast and help identify specific tissue components. During tissue preparation due to differences in chemicals, scanners, cutting thicknesses, and laboratory protocols, similar tissues are usually varied significantly in appearance. This diversity in staining, in addition to Interpretive disparity among pathologists more is one of the main challenges in designing robust and flexible systems for automated analysis. To address the staining color variations, several methods for normalizing stain have been proposed. In our proposed method, a Stain-to-Stain Translation (STST) approach is used to stain normalization for Hematoxylin and Eosin (H&E) stained histopathology images, which learns not only the specific color distribution but also the preserves corresponding histopathological pattern. We perform the process of translation based on the pix2pix framework, which uses the conditional generator adversarial networks (cGANs). Our approach showed excellent results, both mathematically and experimentally against the state of the art methods. We have made the source code publicly available.
△ Less
Submitted 3 February, 2020;
originally announced February 2020.