Skip to main content

Showing 1–18 of 18 results for author: Patel, A B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.02656  [pdf, other

    cs.CL q-bio.QM

    RACER: An LLM-powered Methodology for Scalable Analysis of Semi-structured Mental Health Interviews

    Authors: Satpreet Harcharan Singh, Kevin Jiang, Kanchan Bhasin, Ashutosh Sabharwal, Nidal Moukaddam, Ankit B Patel

    Abstract: Semi-structured interviews (SSIs) are a commonly employed data-collection method in healthcare research, offering in-depth qualitative insights into subject experiences. Despite their value, the manual analysis of SSIs is notoriously time-consuming and labor-intensive, in part due to the difficulty of extracting and categorizing emotional responses, and challenges in scaling human evaluation for l… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  2. arXiv:2311.13717  [pdf, ps, other

    cs.CV

    Feature Extraction for Generative Medical Imaging Evaluation: New Evidence Against an Evolving Trend

    Authors: McKell Woodland, Austin Castelo, Mais Al Taie, Jessica Albuquerque Marques Silva, Mohamed Eltaher, Frank Mohn, Alexander Shieh, Austin Castelo, Suprateek Kundu, Joshua P. Yung, Ankit B. Patel, Kristy K. Brock

    Abstract: Fréchet Inception Distance (FID) is a widely used metric for assessing synthetic image quality. It relies on an ImageNet-based feature extractor, making its applicability to medical imaging unclear. A recent trend is to adapt FID to medical imaging through feature extractors trained on medical images. Our study challenges this practice by demonstrating that ImageNet-based extractors are more consi… ▽ More

    Submitted 29 May, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Preprint of manuscript early accepted to MICCAI 2024

  3. Dimensionality Reduction for Improving Out-of-Distribution Detection in Medical Image Segmentation

    Authors: McKell Woodland, Nihil Patel, Mais Al Taie, Joshua P. Yung, Tucker J. Netherton, Ankit B. Patel, Kristy K. Brock

    Abstract: Clinically deployed segmentation models are known to fail on data outside of their training distribution. As these models perform well on most cases, it is imperative to detect out-of-distribution (OOD) images at inference to protect against automation bias. This work applies the Mahalanobis distance post hoc to the bottleneck features of a Swin UNETR model that segments the liver on T1-weighted m… ▽ More

    Submitted 19 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in the proceedings of UNSURE 2023, Lecture Notes in Computer Science, vol 14291, and is available online at https://doi.org/10.1007/978-3-031-44336-7_15

    Journal ref: In: UNSURE 2023. LNCS, vol 14291. Springer, Cham (2023)

  4. arXiv:2307.10193  [pdf, ps, other

    eess.IV cs.LG

    StyleGAN2-based Out-of-Distribution Detection for Medical Imaging

    Authors: McKell Woodland, John Wood, Caleb O'Connor, Ankit B. Patel, Kristy K. Brock

    Abstract: One barrier to the clinical deployment of deep learning-based models is the presence of images at runtime that lie far outside the training distribution of a given model. We aim to detect these out-of-distribution (OOD) images with a generative adversarial network (GAN). Our training dataset was comprised of 3,234 liver-containing computed tomography (CT) scans from 456 patients. Our OOD test data… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Extended abstract published in the "Medical Imaging Meets NeurIPS" workshop at NeurIPS 2022. Original abstract can be found at http://www.cse.cuhk.edu.hk/~qdou/public/medneurips2022/125.pdf

    Journal ref: Proceedings of Med-NeurIPS 2022

  5. arXiv:2307.07575  [pdf, other

    cs.LG cs.NE

    A Quantitative Approach to Predicting Representational Learning and Performance in Neural Networks

    Authors: Ryan Pyle, Sebastian Musslick, Jonathan D. Cohen, Ankit B. Patel

    Abstract: A key property of neural networks (both biological and artificial) is how they learn to represent and manipulate input information in order to solve a task. Different types of representations may be suited to different types of tasks, making identifying and understanding learned representations a critical part of understanding and designing useful networks. In this paper, we introduce a new pseudo… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 30 pages, 16 figures

  6. arXiv:2302.03750  [pdf, other

    cs.CV cs.LG stat.ME

    Linking convolutional kernel size to generalization bias in face analysis CNNs

    Authors: Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan

    Abstract: Training dataset biases are by far the most scrutinized factors when explaining algorithmic biases of neural networks. In contrast, hyperparameters related to the neural network architecture have largely been ignored even though different network parameterizations are known to induce different implicit biases over learned features. For example, convolutional kernel size is known to affect the freq… ▽ More

    Submitted 3 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: WACV 2024

  7. arXiv:2210.03786  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Evaluating the Performance of StyleGAN2-ADA on Medical Images

    Authors: McKell Woodland, John Wood, Brian M. Anderson, Suprateek Kundu, Ethan Lin, Eugene Koay, Bruno Odisio, Caroline Chung, Hyunseon Christine Kang, Aradhana M. Venkatesan, Sireesha Yedururi, Brian De, Yuan-Mao Lin, Ankit B. Patel, Kristy K. Brock

    Abstract: Although generative adversarial networks (GANs) have shown promise in medical imaging, they have four main limitations that impeded their utility: computational cost, data requirements, reliable evaluation measures, and training complexity. Our work investigates each of these obstacles in a novel application of StyleGAN2-ADA to high-resolution medical imaging datasets. Our dataset is comprised of… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: This preprint has not undergone post-submission improvements or corrections. The Version of Record of this contribution is published in LNCS, volume 13570, and is available online at https://doi.org/10.1007/978-3-031-16980-9_14

    Journal ref: Lecture Notes in Computer Science 13570 (2022)

  8. arXiv:2209.03901  [pdf, other

    cs.SD cs.AI eess.AS

    Dyadic Interaction Assessment from Free-living Audio for Depression Severity Assessment

    Authors: Bishal Lamichhane, Nidal Moukaddam, Ankit B. Patel, Ashutosh Sabharwal

    Abstract: Psychomotor retardation in depression has been associated with speech timing changes from dyadic clinical interviews. In this work, we investigate speech timing features from free-living dyadic interactions. Apart from the possibility of continuous monitoring to complement clinical visits, a study in free-living conditions would also allow inferring sociability features such as dyadic interaction… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Comments: Accepted to INTERSPEECH 2022

  9. arXiv:2203.08822  [pdf, other

    cs.CV cs.LG eess.IV

    Understanding robustness and generalization of artificial neural networks through Fourier masks

    Authors: Nikos Karantzas, Emma Besier, Josue Ortega Caro, Xaq Pitkow, Andreas S. Tolias, Ankit B. Patel, Fabio Anselmi

    Abstract: Despite the enormous success of artificial neural networks (ANNs) in many disciplines, the characterization of their computations and the origin of key properties such as generalization and robustness remain open questions. Recent literature suggests that robust networks with good generalization properties tend to be biased towards processing low frequencies in images. To explore the frequency bia… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

  10. arXiv:2010.00763  [pdf, other

    cs.AI cs.CV cs.LG

    Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

    Authors: Weili Nie, Zhiding Yu, Lei Mao, Ankit B. Patel, Yuke Zhu, Animashree Anandkumar

    Abstract: Humans have an inherent ability to learn novel concepts from only a few samples and generalize these concepts to different situations. Even though today's machine learning models excel with a plethora of training data on standard recognition tasks, a considerable gap exists between machine-level pattern recognition and human-level concept learning. To narrow this gap, the Bongard problems (BPs) we… ▽ More

    Submitted 4 January, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: 22 pages, NeurIPS 2020

  11. arXiv:2006.07460  [pdf, other

    cs.LG stat.ML

    An Improved Semi-Supervised VAE for Learning Disentangled Representations

    Authors: Weili Nie, Zichao Wang, Ankit B. Patel, Richard G. Baraniuk

    Abstract: Learning interpretable and disentangled representations is a crucial yet challenging task in representation learning. In this work, we focus on semi-supervised disentanglement learning and extend work by Locatello et al. (2019) by introducing another source of supervision that we denote as label replacement. Specifically, during training, we replace the inferred representation associated with a da… ▽ More

    Submitted 22 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  12. arXiv:2003.03461  [pdf, other

    cs.CV cs.LG

    Semi-Supervised StyleGAN for Disentanglement Learning

    Authors: Weili Nie, Tero Karras, Animesh Garg, Shoubhik Debnath, Anjul Patney, Ankit B. Patel, Anima Anandkumar

    Abstract: Disentanglement learning is crucial for obtaining disentangled representations and controllable generation. Current disentanglement methods face several inherent limitations: difficulty with high-resolution images, primarily focusing on learning disentangled representations, and non-identifiability due to the unsupervised setting. To alleviate these limitations, we design new architectures and los… ▽ More

    Submitted 25 November, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: ICML 2020, 21 pages. Project page: https://sites.google.com/nvidia.com/semi-stylegan

  13. arXiv:2002.09565  [pdf, other

    cs.LG cs.CR q-fin.ST

    Adversarial Attacks on Machine Learning Systems for High-Frequency Trading

    Authors: Micah Goldblum, Avi Schwarzschild, Ankit B. Patel, Tom Goldstein

    Abstract: Algorithmic trading systems are often completely automated, and deep learning is increasingly receiving attention in this domain. Nonetheless, little is known about the robustness properties of these models. We study valuation models for algorithmic trading from the perspective of adversarial machine learning. We introduce new attacks specific to this domain with size constraints that minimize att… ▽ More

    Submitted 29 October, 2021; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: ACM International Conference on AI in Finance (ICAIF) 2021

  14. arXiv:1902.10297  [pdf, other

    cs.LG cs.FL

    Representing Formal Languages: A Comparison Between Finite Automata and Recurrent Neural Networks

    Authors: Joshua J. Michalenko, Ameesh Shah, Abhinav Verma, Richard G. Baraniuk, Swarat Chaudhuri, Ankit B. Patel

    Abstract: We investigate the internal representations that a recurrent neural network (RNN) uses while learning to recognize a regular formal language. Specifically, we train a RNN on positive and negative examples from a regular language, and ask if there is a simple decoding function that maps states of this RNN to states of the minimal deterministic finite automaton (MDFA) for the language. Our experimen… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: 15 Pages, 13 Figures, Accepted to ICLR 2019

  15. arXiv:1612.01942  [pdf, other

    stat.ML cs.LG cs.NE

    Semi-Supervised Learning with the Deep Rendering Mixture Model

    Authors: Tan Nguyen, Wanjia Liu, Ethan Perez, Richard G. Baraniuk, Ankit B. Patel

    Abstract: Semi-supervised learning algorithms reduce the high cost of acquiring labeled training data by using both labeled and unlabeled data during learning. Deep Convolutional Networks (DCNs) have achieved great success in supervised tasks and as such have been widely employed in the semi-supervised learning. In this paper we leverage the recently developed Deep Rendering Mixture Model (DRMM), a probabil… ▽ More

    Submitted 6 December, 2016; originally announced December 2016.

  16. arXiv:1612.01936  [pdf, other

    stat.ML cs.LG cs.NE

    A Probabilistic Framework for Deep Learning

    Authors: Ankit B. Patel, Tan Nguyen, Richard G. Baraniuk

    Abstract: We develop a probabilistic framework for deep learning based on the Deep Rendering Mixture Model (DRMM), a new generative probabilistic model that explicitly capture variations in data due to latent task nuisance variables. We demonstrate that max-sum inference in the DRMM yields an algorithm that exactly reproduces the operations in deep convolutional neural networks (DCNs), providing a first pri… ▽ More

    Submitted 6 December, 2016; originally announced December 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1504.00641

  17. A Deep Learning Approach to Structured Signal Recovery

    Authors: Ali Mousavi, Ankit B. Patel, Richard G. Baraniuk

    Abstract: In this paper, we develop a new framework for sensing and recovering structured signals. In contrast to compressive sensing (CS) systems that employ linear measurements, sparse representations, and computationally complex convex/greedy algorithms, we introduce a deep learning framework that supports both linear and mildly nonlinear measurements, that learns a structured representation from trainin… ▽ More

    Submitted 17 August, 2015; originally announced August 2015.

    Journal ref: In Proceeding of 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton)

  18. arXiv:1504.00641  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    A Probabilistic Theory of Deep Learning

    Authors: Ankit B. Patel, Tan Nguyen, Richard G. Baraniuk

    Abstract: A grand challenge in machine learning is the development of computational algorithms that match or outperform humans in perceptual inference tasks that are complicated by nuisance variation. For instance, visual object recognition involves the unknown object position, orientation, and scale in object recognition while speech recognition involves the unknown voice pronunciation, pitch, and speed. R… ▽ More

    Submitted 2 April, 2015; originally announced April 2015.

    Comments: 56 pages, 6 figures, 2 tables

    Report number: Rice University Electrical and Computer Engineering Dept. Technical Report No 2015-1