Showing 1–12 of 12 results for author: Sankarasubbu, M

  1. arXiv:2402.07023  [pdf, other]

    cs.CL cs.AI cs.CV cs.HC cs.LG

    Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations

    Authors: Ankit Pal, Malaikannan Sankarasubbu

    Abstract: Large language models have the potential to be valuable in the healthcare industry, but it's crucial to verify their safety and effectiveness through rigorous evaluation. For this purpose, we comprehensively evaluated both open-source LLMs and Google's new multimodal LLM called Gemini across Medical reasoning, hallucination detection, and Medical Visual Question Answering tasks. While Gemini showe…

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: Preprint version, Under Review

  2. arXiv:2307.15343  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Med-HALT: Medical Domain Hallucination Test for Large Language Models

    Authors: Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu

    Abstract: This research paper focuses on the challenges posed by hallucinations in large language models (LLMs), particularly in the context of the medical domain. Hallucination, wherein these models generate plausible yet unverified or incorrect information, can have serious consequences in healthcare applications. We propose a new benchmark and dataset, Med-HALT (Medical Domain Hallucination Test), design…

    Submitted 14 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted at EMNLP 2023 (The SIGNLL Conference on Computational Natural Language Learning)

  3. arXiv:2211.07893  [pdf, other]

    cs.LG cs.AI cs.CR cs.DC

    Federated Learning for Healthcare Domain - Pipeline, Applications and Challenges

    Authors: Madhura Joshi, Ankit Pal, Malaikannan Sankarasubbu

    Abstract: Federated learning is the process of developing machine learning models over datasets distributed across data centers such as hospitals, clinical research labs, and mobile devices while preventing data leakage. This survey examines previous research and studies on federated learning in the healthcare sector across a range of use cases and applications. Our survey shows what challenges, methods, an…

    Submitted 19 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: ACM Transactions on Computing for Healthcare, Vol. 3, No. 4, Article 40. Publication date: October 2022

    Journal ref: ACM Transactions on Computing for Healthcare, Vol. 3, No. 4, Article 40. Publication date: October 2022

  4. arXiv:2203.14371  [pdf, other]

    cs.CL cs.AI cs.LG

    MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering

    Authors: Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu

    Abstract: This paper introduces MedMCQA, a new large-scale, Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions. More than 194k high-quality AIIMS & NEET PG entrance exam MCQs covering 2.4k healthcare topics and 21 medical subjects are collected with an average token length of 12.77 and high topical diversity. Each sample contains a question, cor…

    Submitted 27 March, 2022; originally announced March 2022.

    Comments: Proceedings of Machine Learning Research (PMLR), ACM Conference on Health, Inference, and Learning (CHIL) 2022

    Journal ref: ACM Conference on Health, Inference, and Learning (CHIL) 2022

  5. arXiv:2110.06606  [pdf, other]

    cond-mat.quant-gas

    Bayesian Optimization of Bose-Einstein Condensates

    Authors: Tamil Arasan Bakthavatchalam, Suriyadeepan Ramamoorthy, Malaikannan Sankarasubbu, Radha Ramaswamy, Vijayalakshmi Sethuraman

    Abstract: Machine Learning methods are emerging as faster and efficient alternatives to numerical simulation techniques. The field of Scientific Computing has started adopting these data-driven approaches to faithfully model physical phenomena using scattered, noisy observations from coarse-grained grid-based simulations. In this paper, we investigate data-driven modelling of Bose-Einstein Condensates (BECs…

    Submitted 13 October, 2021; originally announced October 2021.

    Report number: Scientific Reports 11, Article number: 5054 (2021)

    Journal ref: Sci Rep 11, 5054 (2021)

  6. arXiv:2109.10847  [pdf, other]

    cs.LG cs.CL

    Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing

    Authors: Kamal Raj Kanakarajan, Bhuvana Kundumani, Malaikannan Sankarasubbu

    Abstract: Recent progress in the Natural Language Processing domain has given us several State-of-the-Art (SOTA) pretrained models which can be finetuned for specific tasks. These large models with billions of parameters trained on numerous GPUs/TPUs over weeks are leading in the benchmark leaderboards. In this paper, we discuss the need for a benchmark for cost and time effective smaller models trained on…

    Submitted 23 September, 2021; v1 submitted 22 September, 2021; originally announced September 2021.

  7. arXiv:2010.02417  [pdf, other]

    cs.LG cs.SD eess.AS

    Pay Attention to the cough: Early Diagnosis of COVID-19 using Interpretable Symptoms Embeddings with Cough Sound Signal Processing

    Authors: Ankit Pal, Malaikannan Sankarasubbu

    Abstract: The COVID-19 (coronavirus disease 2019) pandemic caused by SARS-CoV-2 has led to a treacherous and devastating catastrophe for humanity. At the time of writing, no specific antiviral drugs or vaccines are recommended to control infection transmission and spread. The current diagnosis of COVID-19 is done by Reverse-Transcription Polymerase Chain Reaction (RT-PCR) testing. However, this method is expensive…

    Submitted 11 October, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Preprint Version

  8. arXiv:2003.11644  [pdf, other]

    cs.CL cs.LG stat.ML

    Multi-Label Text Classification using Attention-based Graph Neural Network

    Authors: Ankit Pal, Muru Selvakumar, Malaikannan Sankarasubbu

    Abstract: In Multi-Label Text Classification (MLTC), one sample can belong to more than one class. It is observed that in most MLTC tasks, there are dependencies or correlations among labels. Existing methods tend to ignore the relationship among labels. In this paper, a graph attention network-based model is proposed to capture the attentive dependency structure among the labels. The graph attention network u…

    Submitted 22 March, 2020; originally announced March 2020.

    Journal ref: 12th International Conference on Agents and Artificial Intelligence (ICAART 2020)

  9. arXiv:1909.05624  [pdf, other]

    cs.CV cs.LG stat.ML

    Detecting Parking Spaces in a Parcel using Satellite Images

    Authors: Murugesan Vadivel, SelvaKumar Murugan, Suriyadeepan Ramamoorthy, Vaidheeswaran Archana, Malaikannan Sankarasubbu

    Abstract: Remote Sensing Images from satellites have been used in various domains for detecting and understanding structures on the ground surface. In this work, satellite images were used for localizing parking spaces and vehicles in parking lots for a given parcel using RCNN-based neural network architectures. Parcel shapefiles and raster images from USGS image archive were used for developing images f…

    Submitted 30 January, 2020; v1 submitted 28 August, 2019; originally announced September 2019.

  10. arXiv:1810.12698  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Compositional Attention Networks for Interpretability in Natural Language Question Answering

    Authors: Muru Selvakumar, Suriyadeepan Ramamoorthy, Vaidheeswaran Archana, Malaikannan Sankarasubbu

    Abstract: MAC Net is a compositional attention network designed for Visual Question Answering. We propose a modified MAC net architecture for Natural Language Question Answering. Question Answering typically requires Language Understanding and multi-step Reasoning. MAC net's unique architecture - the separation between memory and control, facilitates data-driven iterative reasoning. This makes it an ideal c…

    Submitted 30 October, 2018; originally announced October 2018.

    Comments: 8 pages, 10 figures, 1 table

  11. arXiv:1808.01128  [pdf, other]

    cs.LG stat.ML

    PHI Scrubber: A Deep Learning Approach

    Authors: Abhai Kollara Dilip, Kamal Raj K, Malaikannan Sankarasubbu

    Abstract: Confidentiality of patient information is an essential part of an Electronic Health Record system. Patient information, if exposed, can cause serious damage to the privacy of individuals receiving healthcare. Hence it is important to remove such details from physician notes. A system is proposed which consists of a deep learning model where a de-convolutional neural network and bi-directional LSTM-…

    Submitted 3 August, 2018; originally announced August 2018.

  12. arXiv:1807.09617  [pdf]

    q-bio.GN cs.LG stat.ML

    Convolutional Neural Networks In Classifying Cancer Through DNA Methylation

    Authors: Soham Chatterjee, Archana Iyer, Satya Avva, Abhai Kollara, Malaikannan Sankarasubbu

    Abstract: DNA Methylation has been the most extensively studied epigenetic mark. Usually a change in the genotype, DNA sequence, leads to a change in the phenotype, observable characteristics of the individual. But DNA methylation, which happens in the context of CpG (cytosine and guanine bases linked by phosphate backbone) dinucleotides, does not lead to a change in the original DNA sequence but has the po…

    Submitted 24 July, 2018; originally announced July 2018.