Skip to main content

Showing 1–18 of 18 results for author: Saha, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.18135  [pdf

    cs.CL cs.SD eess.AS

    Automatic Speech Recognition for Hindi

    Authors: Anish Saha, A. G. Ramakrishnan

    Abstract: Automatic speech recognition (ASR) is a key area in computational linguistics, focusing on develo** technologies that enable computers to convert spoken language into text. This field combines linguistics and machine learning. ASR models, which map speech audio to transcripts through supervised learning, require handling real and unrestricted text. Text-to-speech systems directly work with real… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  3. arXiv:2404.09666  [pdf, other

    eess.IV cs.CV q-bio.QM

    Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis

    Authors: Alessa Hering, Sarah de Boer, Anindo Saha, Jasper J. Twilt, Mattias P. Heinrich, Derya Yakar, Maarten de Rooij, Henkjan Huisman, Joeran S. Bosma

    Abstract: The PI-CAI (Prostate Imaging: Cancer AI) challenge led to expert-level diagnostic algorithms for clinically significant prostate cancer detection. The algorithms receive biparametric MRI scans as input, which consist of T2-weighted and diffusion-weighted scans. These scans can be misaligned due to multiple factors in the scanning process. Image registration can alleviate this issue by predicting t… ▽ More

    Submitted 28 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2311.11059  [pdf, other

    cs.CV cs.MM eess.IV

    HIDRO-VQA: High Dynamic Range Oracle for Video Quality Assessment

    Authors: Shreshth Saini, Avinab Saha, Alan C. Bovik

    Abstract: We introduce HIDRO-VQA, a no-reference (NR) video quality assessment model designed to provide precise quality evaluations of High Dynamic Range (HDR) videos. HDR videos exhibit a broader spectrum of luminance, detail, and color than Standard Dynamic Range (SDR) videos. As HDR content becomes increasingly popular, there is a growing demand for video quality assessment (VQA) algorithms that effecti… ▽ More

    Submitted 20 December, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: WACV 2024 Workshop Paper. Shreshth Saini, Avinab Saha contributed equally to this work

  5. arXiv:2306.00838  [pdf, other

    q-bio.OT eess.IV

    The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI

    Authors: Ahmed W. Moawad, Anastasia Janas, Ujjwal Baid, Divya Ramakrishnan, Rachit Saluja, Nader Ashraf, Leon Jekel, Raisa Amiruddin, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Sanjay Aneja, Syed Muhammad Anwar, Timothy Bergquist, Evan Calabrese, Veronica Chiang, Verena Chung, Gian Marco Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Juan Eugenio Iglesias, Zhifan Jiang , et al. (206 additional authors not shown)

    Abstract: The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and chara… ▽ More

    Submitted 17 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  6. arXiv:2305.05984  [pdf, other

    eess.IV cs.CV

    Uncertainty-Aware Semi-Supervised Learning for Prostate MRI Zonal Segmentation

    Authors: Matin Hosseinzadeh, Anindo Saha, Joeran Bosma, Henkjan Huisman

    Abstract: Quality of deep convolutional neural network predictions strongly depends on the size of the training dataset and the quality of the annotations. Creating annotations, especially for 3D medical image segmentation, is time-consuming and requires expert knowledge. We propose a novel semi-supervised learning (SSL) approach that requires only a relatively small number of annotations while being able t… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 9 pages

  7. arXiv:2305.02422  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    GAMIVAL: Video Quality Prediction on Mobile Cloud Gaming Content

    Authors: Yu-Chih Chen, Avinab Saha, Chase Davis, Bo Qiu, Xiaoming Wang, Rahul Gowda, Ioannis Katsavounidis, Alan C. Bovik

    Abstract: The mobile cloud gaming industry has been rapidly growing over the last decade. When streaming gaming videos are transmitted to customers' client devices from cloud servers, algorithms that can monitor distorted video quality without having any reference video available are desirable tools. However, creating No-Reference Video Quality Assessment (NR VQA) models that can accurately predict the qual… ▽ More

    Submitted 29 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE SPL 2023. The implementation of GAMIVAL has been made available online: https://github.com/lskdream/GAMIVAL

    MSC Class: 68U10

    Journal ref: IEEE Signal Processing Letters, vol. 30, pp. 324-328, 2023

  8. arXiv:2303.13430  [pdf, other

    cs.CV eess.IV

    Medical diffusion on a budget: textual inversion for medical image generation

    Authors: Bram de Wilde, Anindo Saha, Richard P. G. ten Broek, Henkjan Huisman

    Abstract: Diffusion-based models for text-to-image generation have gained immense popularity due to recent advancements in efficiency, accessibility, and quality. Although it is becoming increasingly feasible to perform inference with these systems using consumer-grade GPUs, training them from scratch still requires access to large datasets and significant computational resources. In the case of medical ima… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  9. arXiv:2209.09825  [pdf, other

    eess.IV

    De-speckling of Optical Coherence Tomography Images Using Anscombe Transform and a Noisier2noise Model

    Authors: Arka Saha, Sourya Sengupta

    Abstract: Optical Coherence Tomography (OCT) image denoising is a fundamental problem as OCT images suffer from multiplicative speckle noise, resulting in poor visibility of retinal layers. The traditional denoising methods consider specific statistical properties of the noise, which are not always known. Furthermore, recent deep learning-based denoising methods require paired noisy and clean images, which… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted to MICCAI OMIA workshop 2022

  10. arXiv:2206.15311  [pdf, other

    eess.IV

    Performance Analysis of Optimized Versatile Video Coding Software Decoders on Embedded Platforms

    Authors: Anup Saha, Wassim Hamidouche, Miguel Chavarrías, Guillaume Gautier, Fernando Pescador, Ibrahim Farhat

    Abstract: In recent years, the global demand for high-resolution videos and the emergence of new multimedia applications have created the need for a new video coding standard. Hence, in July 2020 the Versatile Video Coding (VVC) standard was released providing up to 50% bit-rate saving for the same video quality compared to its predecessor High Efficiency Video Coding (HEVC). However, this bit-rate saving c… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

  11. Annotation-efficient cancer detection with report-guided lesion annotation for deep learning-based prostate cancer detection in bpMRI

    Authors: Joeran S. Bosma, Anindo Saha, Matin Hosseinzadeh, Ilse Slootweg, Maarten de Rooij, Henkjan Huisman

    Abstract: Deep learning-based diagnostic performance increases with more annotated data, but large-scale manual annotations are expensive and labour-intensive. Experts evaluate diagnostic images during clinical routine, and write their findings in reports. Leveraging unlabelled exams paired with clinical reports could overcome the manual labelling bottleneck. We hypothesise that detection models can be trai… ▽ More

    Submitted 19 February, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Journal ref: Radiology: Artificial Intelligence, 2023:e230031

  12. arXiv:2110.12889  [pdf, other

    eess.IV cs.CV

    Anatomical and Diagnostic Bayesian Segmentation in Prostate MRI $-$Should Different Clinical Objectives Mandate Different Loss Functions?

    Authors: Anindo Saha, Joeran Bosma, Jasper Linmans, Matin Hosseinzadeh, Henkjan Huisman

    Abstract: We hypothesize that probabilistic voxel-level classification of anatomy and malignancy in prostate MRI, although typically posed as near-identical segmentation tasks via U-Nets, require different loss functions for optimal performance due to inherent differences in their clinical objectives. We investigate distribution, region and boundary-based loss functions for both tasks across 200 patient exa… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: Accepted to Medical Imaging Meets NeurIPS Workshop of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  13. arXiv:2104.05642  [pdf, other

    eess.IV cs.CV

    Common Limitations of Image Processing Metrics: A Picture Story

    Authors: Annika Reinke, Minu D. Tizabi, Carole H. Sudre, Matthias Eisenmann, Tim Rädsch, Michael Baumgartner, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Peter Bankhead, Arriel Benis, Matthew Blaschko, Florian Buettner, M. Jorge Cardoso, Jianxu Chen, Veronika Cheplygina, Evangelia Christodoulou, Beth Cimini, Gary S. Collins, Sandy Engelhardt, Keyvan Farahani, Luciana Ferrer, Adrian Galdran, Bram van Ginneken , et al. (68 additional authors not shown)

    Abstract: While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using spe… ▽ More

    Submitted 6 December, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Shared first authors: Annika Reinke and Minu D. Tizabi. This is a dynamic paper on limitations of commonly used metrics. It discusses metrics for image-level classification, semantic and instance segmentation, and object detection. For missing use cases, comments or questions, please contact [email protected]. Substantial contributions to this document will be acknowledged with a co-authorship

  14. End-to-end Prostate Cancer Detection in bpMRI via 3D CNNs: Effects of Attention Mechanisms, Clinical Priori and Decoupled False Positive Reduction

    Authors: Anindo Saha, Matin Hosseinzadeh, Henkjan Huisman

    Abstract: We present a multi-stage 3D computer-aided detection and diagnosis (CAD) model for automated localization of clinically significant prostate cancer (csPCa) in bi-parametric MR imaging (bpMRI). Deep attention mechanisms drive its detection network, targeting salient structures and highly discriminative feature dimensions across multiple resolutions. Its goal is to accurately identify csPCa lesions… ▽ More

    Submitted 30 June, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

    Comments: Accepted to MedIA: Medical Image Analysis. This manuscript incorporates and expands upon our 2020 Medical Imaging Meets NeurIPS Workshop paper (arXiv:2011.00263)

  15. Detection of masses and architectural distortions in digital breast tomosynthesis: a publicly available dataset of 5,060 patients and a deep learning model

    Authors: Mateusz Buda, Ashirbani Saha, Ruth Walsh, Sujata Ghate, Nianyi Li, Albert Święcicki, Joseph Y. Lo, Maciej A. Mazurowski

    Abstract: Breast cancer screening is one of the most common radiological tasks with over 39 million exams performed each year. While breast cancer screening has been one of the most studied medical imaging applications of artificial intelligence, the development and evaluation of the algorithms are hindered due to the lack of well-annotated large-scale publicly available datasets. This is particularly an is… ▽ More

    Submitted 20 November, 2022; v1 submitted 13 November, 2020; originally announced November 2020.

    Journal ref: JAMA Netw Open. 2021;4(8):e2119100

  16. arXiv:2011.00263  [pdf, other

    eess.IV cs.CV

    Encoding Clinical Priori in 3D Convolutional Neural Networks for Prostate Cancer Detection in bpMRI

    Authors: Anindo Saha, Matin Hosseinzadeh, Henkjan Huisman

    Abstract: We hypothesize that anatomical priors can be viable mediums to infuse domain-specific clinical knowledge into state-of-the-art convolutional neural networks (CNN) based on the U-Net architecture. We introduce a probabilistic population prior which captures the spatial prevalence and zonal distinction of clinically significant prostate cancer (csPCa), in order to improve its computer-aided detectio… ▽ More

    Submitted 21 September, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

    Comments: Accepted to Medical Imaging Meets NeurIPS Workshop of the 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  17. arXiv:2006.08925  [pdf, other

    cs.NI eess.SP

    Improving the Performance of Deep Learning for Wireless Localization

    Authors: Ramdoot Pydipaty, Johnu George, Krishna Selvaraju, Amit Saha

    Abstract: Indoor localization systems are most commonly based on Received Signal Strength Indicator (RSSI) measurements of either WiFi or Bluetooth-Low-Energy (BLE) beacons. In such systems, the two most common techniques are trilateration and fingerprinting, with the latter providing higher accuracy. In the fingerprinting technique, Deep Learning (DL) algorithms are often used to predict the location of th… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

  18. Association of genomic subtypes of lower-grade gliomas with shape features automatically extracted by a deep learning algorithm

    Authors: Mateusz Buda, Ashirbani Saha, Maciej A Mazurowski

    Abstract: Recent analysis identified distinct genomic subtypes of lower-grade glioma tumors which are associated with shape features. In this study, we propose a fully automatic way to quantify tumor imaging characteristics using deep learning-based segmentation and test whether these characteristics are predictive of tumor genomic subtypes. We used preoperative imaging and genomic data of 110 patients from… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Journal ref: Computers in Biology and Medicine, 109, 2019, 218-225