-
MFAAN: Unveiling Audio Deepfakes with a Multi-Feature Authenticity Network
Authors:
Karthik Sivarama Krishnan,
Koushik Sivarama Krishnan
Abstract:
In the contemporary digital age, the proliferation of deepfakes presents a formidable challenge to the sanctity of information dissemination. Audio deepfakes, in particular, can be deceptively realistic, posing significant risks in misinformation campaigns. To address this threat, we introduce the Multi-Feature Audio Authenticity Network (MFAAN), an advanced architecture tailored for the detection of fabricated audio content. MFAAN incorporates multiple parallel paths designed to harness the strengths of different audio representations, including Mel-frequency cepstral coefficients (MFCC), linear-frequency cepstral coefficients (LFCC), and Chroma Short Time Fourier Transform (Chroma-STFT). By synergistically fusing these features, MFAAN achieves a nuanced understanding of audio content, facilitating robust differentiation between genuine and manipulated recordings. Preliminary evaluations of MFAAN on two benchmark datasets, 'In-the-Wild' Audio Deepfake Data and The Fake-or-Real Dataset, demonstrate its superior performance, achieving accuracies of 98.93% and 94.47% respectively. Such results not only underscore the efficacy of MFAAN but also highlight its potential as a pivotal tool in the ongoing battle against deepfake audio content.
Submitted 6 November, 2023;
originally announced November 2023.
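The parallel-path idea in the MFAAN abstract can be illustrated with a minimal sketch. The abstract does not state how the paths are fused, so concatenating time-averaged features is an assumption here, and the function names and toy values are hypothetical stand-ins for real MFCC, LFCC, and Chroma-STFT matrices.

```python
from statistics import fmean

def pool_over_time(frames):
    """Average a (time x coefficients) feature matrix over its time axis."""
    return [fmean(column) for column in zip(*frames)]

def fuse_features(mfcc, lfcc, chroma):
    """Concatenate the pooled output of each path (assumed fusion operator)."""
    return pool_over_time(mfcc) + pool_over_time(lfcc) + pool_over_time(chroma)

# Toy inputs: 2 time frames each, with 3, 2, and 2 coefficients per frame.
mfcc = [[1.0, 2.0, 3.0], [3.0, 4.0, 5.0]]
lfcc = [[0.0, 1.0], [2.0, 3.0]]
chroma = [[0.5, 0.5], [1.5, 0.5]]

fused = fuse_features(mfcc, lfcc, chroma)
print(fused)  # one vector holding all three representations
```

In a trained network each path would be a stack of learned layers rather than a simple average; the sketch only shows how separate representations end up in one joint vector for a downstream classifier.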
-
Advancing Ischemic Stroke Diagnosis: A Novel Two-Stage Approach for Blood Clot Origin Identification
Authors:
Koushik Sivarama Krishnan,
P. J. Joe Nikesh,
Swathi Gnanasekar,
Karthik Sivarama Krishnan
Abstract:
An innovative two-stage methodology for categorizing blood clot origins is presented in this paper, which is important for the diagnosis and treatment of ischemic stroke. First, a background classifier based on MobileNetV3 segments large whole-slide digital pathology images into numerous tiles to detect the presence of cellular material. After that, different pre-trained image classification algorithms are fine-tuned to determine the origin of blood clots. Due to complex blood flow dynamics and limitations in conventional imaging methods such as computed tomography (CT), magnetic resonance imaging (MRI), and ultrasound, identifying the sources of blood clots is a challenging task. Although these techniques are useful for identifying blood clots, they are poorly suited to determining where the clots originated. To address these challenges, our method makes use of robust computer vision models that have been refined using information from whole-slide digital pathology images. Out of all the models tested, the PoolFormer \cite{yu2022metaformer} performs better than the others, with 93.4% accuracy, 93.4% precision, 93.4% recall, and 93.4% F1-score. Moreover, it achieves a weighted multi-class logarithmic loss (WMCLL) of 0.4361, which emphasizes how effective it is in this particular application. These encouraging findings suggest that our approach can successfully identify the origin of blood clots in a variety of vascular locations, potentially advancing ischemic stroke diagnosis and treatment approaches.
Submitted 5 January, 2024; v1 submitted 26 April, 2023;
originally announced April 2023.
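The two-stage flow described in this abstract (tile the slide, filter out background, classify the remaining tiles) can be sketched roughly as follows. Everything here is a simplified assumption: a pixel-count rule stands in for the MobileNetV3 background classifier, a dummy threshold stands in for the fine-tuned model, the class names are placeholders, and the majority vote over tiles is not taken from the abstract.

```python
def split_into_tiles(image, tile_size):
    """Cut a 2D grid of pixel intensities into non-overlapping square tiles."""
    tiles = []
    for top in range(0, len(image) - tile_size + 1, tile_size):
        for left in range(0, len(image[0]) - tile_size + 1, tile_size):
            tiles.append([row[left:left + tile_size]
                          for row in image[top:top + tile_size]])
    return tiles

def has_tissue(tile, background=255, min_fraction=0.25):
    """Stage 1 stand-in: keep tiles with enough non-background pixels."""
    pixels = [p for row in tile for p in row]
    return sum(p != background for p in pixels) / len(pixels) >= min_fraction

def classify_origin(tile):
    """Stage 2 stand-in: a dummy rule in place of a fine-tuned classifier."""
    pixels = [p for row in tile for p in row]
    return "class_a" if sum(pixels) / len(pixels) < 128 else "class_b"

def predict_slide(image, tile_size):
    """Run both stages and return the majority label over tissue tiles."""
    labels = [classify_origin(t)
              for t in split_into_tiles(image, tile_size) if has_tissue(t)]
    return max(set(labels), key=labels.count) if labels else None

# Toy 4x4 "slide": the top-left tile is pure background and gets discarded.
slide = [
    [255, 255, 10, 20],
    [255, 255, 30, 40],
    [200, 255, 50, 60],
    [255, 255, 70, 80],
]
prediction = predict_slide(slide, tile_size=2)
print(prediction)
```

Filtering before classification keeps the expensive second-stage model away from empty regions, which is the practical motivation for splitting the work into two stages.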
-
Lodestar: Supporting Independent Learning and Rapid Experimentation Through Data-Driven Analysis Recommendations
Authors:
Deepthi Raghunandan,
Zhe Cui,
Kartik Krishnan,
Segen Tirfe,
Shenzhi Shi,
Tejaswi Darshan Shrestha,
Leilani Battle,
Niklas Elmqvist
Abstract:
Keeping abreast of current trends, technologies, and best practices in visualization and data analysis is becoming increasingly difficult, especially for fledgling data scientists. In this paper, we propose Lodestar, an interactive computational notebook that allows users to quickly explore and construct new data science workflows by selecting from a list of automated analysis recommendations. We derive our recommendations from directed graphs of known analysis states, with two input sources: one manually curated from online data science tutorials, and another extracted through semi-automatic analysis of a corpus of over 6,000 Jupyter notebooks. We evaluate Lodestar in a formative study guiding our next set of improvements to the tool. Our results suggest that users find Lodestar useful for rapidly creating data science workflows.
Submitted 16 April, 2022;
originally announced April 2022.
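One way to picture recommendations drawn from a directed graph of analysis states is the sketch below. It is not Lodestar's implementation: the step names, the toy workflows, and the choice to rank successors by how often they follow the current state are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

def build_transition_graph(workflows):
    """Count how often each analysis step is directly followed by another."""
    graph = defaultdict(Counter)
    for steps in workflows:
        for current, following in zip(steps, steps[1:]):
            graph[current][following] += 1
    return graph

def recommend(graph, state, k=2):
    """Return up to k successor steps of `state`, most frequent first."""
    return [step for step, _ in graph.get(state, Counter()).most_common(k)]

# Hypothetical workflows, e.g. as they might be extracted from notebooks.
workflows = [
    ["load_csv", "drop_missing", "plot_histogram"],
    ["load_csv", "drop_missing", "fit_regression"],
    ["load_csv", "describe", "plot_histogram"],
]
graph = build_transition_graph(workflows)
suggestions = recommend(graph, "load_csv")
print(suggestions)
```

A curated source and a mined source could each contribute their own graph of this shape, with the notebook interface showing the successors of whatever state the user has reached.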
-
SwiftSRGAN -- Rethinking Super-Resolution for Efficient and Real-time Inference
Authors:
Koushik Sivarama Krishnan,
Karthik Sivarama Krishnan
Abstract:
In recent years, there have been several advances in image super-resolution using state-of-the-art deep learning architectures. Many previously published super-resolution techniques require high-end Graphics Processing Units (GPUs) to perform image super-resolution. With the increasing advancements in deep learning approaches, neural networks have become more and more compute-hungry. We took a step back and focused on creating a real-time, efficient solution. We present an architecture that is faster and has a smaller memory footprint. The proposed architecture uses depth-wise separable convolutions to extract features, and it performs on par with other super-resolution GANs (Generative Adversarial Networks) while maintaining real-time inference and a low memory footprint. Real-time super-resolution enables streaming high-resolution media content even under poor bandwidth conditions. While maintaining an efficient trade-off between accuracy and latency, we produce a model with comparable performance that is one-eighth (1/8) the size of super-resolution GANs and computes 74 times faster than super-resolution GANs.
Submitted 28 November, 2021;
originally announced November 2021.
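The saving that depth-wise separable convolutions offer can be seen from a simple weight count. The sketch below compares one standard convolution layer with its depth-wise separable counterpart; the kernel size and channel counts are illustrative assumptions, not values from the paper, and bias terms are ignored.

```python
def standard_conv_params(kernel, c_in, c_out):
    """Weights in a standard convolution: one k x k filter per (in, out) pair."""
    return kernel * kernel * c_in * c_out

def depthwise_separable_params(kernel, c_in, c_out):
    """Weights in a depth-wise (k x k per input channel) plus 1x1 point-wise conv."""
    depthwise = kernel * kernel * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise

# Assumed layer shape: 3x3 kernel, 64 input channels, 64 output channels.
standard = standard_conv_params(3, 64, 64)
separable = depthwise_separable_params(3, 64, 64)
print(standard, separable, standard / separable)
```

For this assumed layer the separable form needs several times fewer weights; the reduction for a full network depends on which layers are replaced and on their shapes.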
-
Vision Transformer based COVID-19 Detection using Chest X-rays
Authors:
Koushik Sivarama Krishnan,
Karthik Sivarama Krishnan
Abstract:
COVID-19 is a global pandemic, and detecting it is a momentous task for medical professionals today due to its rapid mutations. Current methods of examining chest X-rays and CT scans require profound knowledge and are time-consuming, taking up the precious time of medical practitioners when people's lives are at stake. This study tries to assist this process by achieving state-of-the-art performance in classifying chest X-rays by fine-tuning a Vision Transformer (ViT). The proposed approach uses pretrained models, fine-tuned for detecting the presence of COVID-19 disease on chest X-rays. This approach achieves an accuracy of 97.61%, a precision of 95.34%, a recall of 93.84%, and an F1-score of 94.58%. This result demonstrates the performance of transformer-based models on chest X-rays.
Submitted 9 October, 2021;
originally announced October 2021.
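The reported F1-score can be cross-checked against the reported precision and recall, since F1 is their harmonic mean. The short sketch below assumes the three figures were computed on the same predictions without per-class averaging; the function name is illustrative.

```python
def f1_from_precision_recall(precision, recall):
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)

# Precision and recall as stated in the abstract, in percent.
f1 = f1_from_precision_recall(95.34, 93.84)
print(round(f1, 2))
```

The result rounds to the 94.58% given in the abstract, so the three reported figures are mutually consistent.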