Skip to main content

Showing 1–50 of 156 results for author: Sudarshan

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19580  [pdf, other

    cs.AR cs.LG

    FRED: Flexible REduction-Distribution Interconnect and Communication Implementation for Wafer-Scale Distributed Training of DNN Models

    Authors: Saeed Rashidi, William Won, Sudarshan Srinivasan, Puneet Gupta, Tushar Krishna

    Abstract: Distributed Deep Neural Network (DNN) training is a technique to reduce the training overhead by distributing the training tasks into multiple accelerators, according to a parallelization strategy. However, high-performance compute and interconnects are needed for maximum speed-up and linear scaling of the system. Wafer-scale systems are a promising technology that allows for tightly integrating h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.03688  [pdf, other

    eess.IV cs.CV

    Shadow and Light: Digitally Reconstructed Radiographs for Disease Classification

    Authors: Benjamin Hou, Qingqing Zhu, Tejas Sudarshan Mathai, Qiao **, Zhiyong Lu, Ronald M. Summers

    Abstract: In this paper, we introduce DRR-RATE, a large-scale synthetic chest X-ray dataset derived from the recently released CT-RATE dataset. DRR-RATE comprises of 50,188 frontal Digitally Reconstructed Radiographs (DRRs) from 21,304 unique patients. Each image is paired with a corresponding radiology text report and binary labels for 18 pathology classes. Given the controllable nature of DRR generation,… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2406.03088  [pdf, other

    cs.AR cs.LG

    HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator

    Authors: Zhewen Yu, Sudarshan Sreeram, Krish Agrawal, Junyi Wu, Alexander Montgomerie-Corcoran, Cheng Zhang, Jianyi Cheng, Christos-Savvas Bouganis, Yiren Zhao

    Abstract: Deep Neural Networks (DNNs) excel in learning hierarchical representations from raw data, such as images, audio, and text. To compute these DNN models with high performance and energy efficiency, these models are usually deployed onto customized hardware accelerators. Among various accelerator designs, dataflow architecture has shown promising performance due to its layer-pipelined structure and i… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: accepted to FPL2024

  4. arXiv:2406.01698  [pdf, other

    cs.AR cs.AI cs.DC cs.LG

    Demystifying Platform Requirements for Diverse LLM Inference Use Cases

    Authors: Abhimanyu Bambhaniya, Ritik Raj, Geonhwa Jeong, Souvik Kundu, Sudarshan Srinivasan, Midhilesh Elavazhagan, Madhu Kumar, Tushar Krishna

    Abstract: Large language models (LLMs) have shown remarkable performance across a wide range of applications, often outperforming human experts. However, deploying these parameter-heavy models efficiently for diverse inference use cases requires carefully designed hardware platforms with ample computing, memory, and network resources. With LLM deployment scenarios and models evolving at breakneck speed, the… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 12 Pages, https://github.com/abhibambhaniya/GenZ-LLM-Analyzer

  5. arXiv:2405.13048  [pdf

    cs.HC cs.AI

    Human-Generative AI Collaborative Problem Solving Who Leads and How Students Perceive the Interactions

    Authors: Gaoxia Zhu, Vidya Sudarshan, Jason Fok Kow, Yew Soon Ong

    Abstract: This research investigates distinct human-generative AI collaboration types and students' interaction experiences when collaborating with generative AI (i.e., ChatGPT) for problem-solving tasks and how these factors relate to students' sense of agency and perceived collaborative problem solving. By analyzing the surveys and reflections of 79 undergraduate students, we identified three human-genera… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: This paper appears at the IEEE Conference on Artificial Intelligence (CAI) 2024

  6. arXiv:2405.11344  [pdf

    cs.LG cs.AI

    Improved Content Understanding With Effective Use of Multi-task Contrastive Learning

    Authors: Akanksha Bindal, Sudarshan Ramanujam, Dave Golland, TJ Hazen, Tina Jiang, Fengyu Zhang, Peng Yan

    Abstract: In enhancing LinkedIn core content recommendation models, a significant challenge lies in improving their semantic understanding capabilities. This paper addresses the problem by leveraging multi-task learning, a method that has shown promise in various domains. We fine-tune a pre-trained, transformer-based LLM using multi-task contrastive learning with data from a diverse set of semantic labeling… ▽ More

    Submitted 21 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  7. arXiv:2405.11195  [pdf, other

    cs.LG cs.AI cs.IT

    Trustworthy Actionable Perturbations

    Authors: Jesse Friedbaum, Sudarshan Adiga, Ravi Tandon

    Abstract: Counterfactuals, or modified inputs that lead to a different outcome, are an important tool for understanding the logic used by machine learning classifiers and how to change an undesirable classification. Even if a counterfactual changes a classifier's decision, however, it may not affect the true underlying class probabilities, i.e. the counterfactual may act like an adversarial attack and ``foo… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted at the 41st International Conference on Machine Learning (ICML) 2024

  8. arXiv:2404.14586  [pdf, other

    cs.IT cs.LG cs.NI

    Latency-Distortion Tradeoffs in Communicating Classification Results over Noisy Channels

    Authors: Noel Teku, Sudarshan Adiga, Ravi Tandon

    Abstract: In this work, the problem of communicating decisions of a classifier over a noisy channel is considered. With machine learning based models being used in variety of time-sensitive applications, transmission of these decisions in a reliable and timely manner is of significant importance. To this end, we study the scenario where a probability vector (representing the decisions of a classifier) at th… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE Transactions on Communications

  9. arXiv:2404.13125  [pdf, other

    cs.CR cs.LG

    Towards Robust Real-Time Hardware-based Mobile Malware Detection using Multiple Instance Learning Formulation

    Authors: Harshit Kumar, Sudarshan Sharma, Biswadeep Chakraborty, Saibal Mukhopadhyay

    Abstract: This study introduces RT-HMD, a Hardware-based Malware Detector (HMD) for mobile devices, that refines malware representation in segmented time-series through a Multiple Instance Learning (MIL) approach. We address the mislabeling issue in real-time HMDs, where benign segments in malware time-series incorrectly inherit malware labels, leading to increased false positives. Utilizing the proposed Ma… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Under peer review

  10. arXiv:2404.03219  [pdf, other

    cs.CV cs.GR

    iSeg: Interactive 3D Segmentation via Interactive Attention

    Authors: Itai Lang, Fei Xu, Dale Decatur, Sudarshan Babu, Rana Hanocka

    Abstract: We present iSeg, a new interactive technique for segmenting 3D shapes. Previous works have focused mainly on leveraging pre-trained 2D foundation models for 3D segmentation based on text. However, text may be insufficient for accurately describing fine-grained spatial segmentations. Moreover, achieving a consistent 3D segmentation using a 2D model is challenging since occluded areas of the same se… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Project page: https://threedle.github.io/iSeg/

  11. arXiv:2403.12297  [pdf, other

    cs.CL cs.AI

    Leveraging Large Language Models to Extract Information on Substance Use Disorder Severity from Clinical Notes: A Zero-shot Learning Approach

    Authors: Maria Mahbub, Gregory M. Dams, Sudarshan Srinivasan, Caitlin Rizy, Ioana Danciu, Jodie Trafton, Kathryn Knight

    Abstract: Substance use disorder (SUD) poses a major concern due to its detrimental effects on health and society. SUD identification and treatment depend on a variety of factors such as severity, co-determinants (e.g., withdrawal symptoms), and social determinants of health. Existing diagnostic coding systems used by American insurance providers, like the International Classification of Diseases (ICD-10),… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures, 2 tables

  12. arXiv:2402.04035  [pdf, ps, other

    cs.GT

    Low-Distortion Clustering with Ordinal and Limited Cardinal Information

    Authors: Jakob Burkhardt, Ioannis Caragiannis, Karl Fehrs, Matteo Russo, Chris Schwiegelshohn, Sudarshan Shyam

    Abstract: Motivated by recent work in computational social choice, we extend the metric distortion framework to clustering problems. Given a set of $n$ agents located in an underlying metric space, our goal is to partition them into $k$ clusters, optimizing some social cost objective. The metric space is defined by a distance function $d$ between the agent locations. Information about $d$ is available only… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: to appear in AAAI 2024

  13. arXiv:2402.00637  [pdf, other

    cs.CV

    Fisheye Camera and Ultrasonic Sensor Fusion For Near-Field Obstacle Perception in Bird's-Eye-View

    Authors: Arindam Das, Sudarshan Paul, Niko Scholz, Akhilesh Kumar Malviya, Ganesh Sistu, Ujjwal Bhattacharya, Ciarán Eising

    Abstract: Accurate obstacle identification represents a fundamental challenge within the scope of near-field perception for autonomous driving. Conventionally, fisheye cameras are frequently employed for comprehensive surround-view perception, including rear-view obstacle localization. However, the performance of such cameras can significantly deteriorate in low-light conditions, during nighttime, or when s… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 16 pages, 12 Figures, 6 tables

  14. arXiv:2401.10733  [pdf, other

    cs.IR cs.AI

    Dynamic Q&A of Clinical Documents with Large Language Models

    Authors: Ran Elgedawy, Ioana Danciu, Maria Mahbub, Sudarshan Srinivasan

    Abstract: Electronic health records (EHRs) house crucial patient data in clinical notes. As these notes grow in volume and complexity, manual extraction becomes challenging. This work introduces a natural language interface using large language models (LLMs) for dynamic question-answering on clinical notes. Our chatbot, powered by Langchain and transformer-based LLMs, allows users to query in natural langua… ▽ More

    Submitted 2 July, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 15 pages, 4 figures

  15. arXiv:2401.10580  [pdf, other

    cs.CL

    PHOENIX: Open-Source Language Adaption for Direct Preference Optimization

    Authors: Matthias Uhlig, Sigurd Schacht, Sudarshan Kamath Barkur

    Abstract: Large language models have gained immense importance in recent years and have demonstrated outstanding results in solving various tasks. However, despite these achievements, many questions remain unanswered in the context of large language models. Besides the optimal use of the models for inference and the alignment of the results to the desired specifications, the transfer of models to other lang… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  16. Efficient Neural Representation of Volumetric Data using Coordinate-Based Networks

    Authors: Sudarshan Devkota, Sumanta Pattanaik

    Abstract: In this paper, we propose an efficient approach for the compression and representation of volumetric data utilizing coordinate-based networks and multi-resolution hash encoding. Efficient compression of volumetric data is crucial for various applications, such as medical imaging and scientific simulations. Our approach enables effective compression by learning a map** between spatial coordinates… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Journal ref: Computer Graphics Forum (2023), 42: e14955

  17. arXiv:2312.04874  [pdf, other

    cs.CV

    Interpretable Underwater Diver Gesture Recognition

    Authors: Sudeep Mangalvedhekar, Shreyas Nahar, Sudarshan Maskare, Kaushal Mahajan, Dr. Anant Bagade

    Abstract: In recent years, usage and applications of Autonomous Underwater Vehicles has grown rapidly. Interaction of divers with the AUVs remains an integral part of the usage of AUVs for various applications and makes building robust and efficient underwater gesture recognition systems extremely important. In this paper, we propose an Underwater Gesture Recognition system trained on the Cognitive Autonomo… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  18. arXiv:2312.04071  [pdf, other

    cs.IR cs.AI

    Synergistic Signals: Exploiting Co-Engagement and Semantic Links via Graph Neural Networks

    Authors: Zijie Huang, Baolin Li, Hafez Asgharzadeh, Anne Cocos, Lingyi Liu, Evan Cox, Colby Wise, Sudarshan Lamkhede

    Abstract: Given a set of candidate entities (e.g. movie titles), the ability to identify similar entities is a core capability of many recommender systems. Most often this is achieved by collaborative filtering approaches, i.e. if users co-engage with a pair of entities frequently enough, the embeddings should be similar. However, relying on co-engagement data alone can result in lower-quality embeddings fo… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  19. arXiv:2311.12396  [pdf, other

    cs.AR

    GreenFPGA: Evaluating FPGAs as Environmentally Sustainable Computing Solutions

    Authors: Chetan Choppali Sudarshan, Aman Arora, Vidya A. Chhabria

    Abstract: Growing global concerns about climate change highlight the need for environmentally sustainable computing. The ecological impact of computing, including operational and embodied, is a key consideration. Field Programmable Gate Arrays (FPGAs) stand out as promising sustainable computing platforms due to their reconfigurability across various applications. This paper introduces GreenFPGA, a tool est… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: Under review at DAC 2024

  20. arXiv:2310.17075  [pdf, other

    cs.CV

    HyperFields: Towards Zero-Shot Generation of NeRFs from Text

    Authors: Sudarshan Babu, Richard Liu, Avery Zhou, Michael Maire, Greg Shakhnarovich, Rana Hanocka

    Abstract: We introduce HyperFields, a method for generating text-conditioned Neural Radiance Fields (NeRFs) with a single forward pass and (optionally) some fine-tuning. Key to our approach are: (i) a dynamic hypernetwork, which learns a smooth map** from text token embeddings to the space of NeRFs; (ii) NeRF distillation training, which distills scenes encoded in individual NeRFs into one dynamic hyperne… ▽ More

    Submitted 13 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to ICML 2024, Project page: https://threedle.github.io/hyperfields/

  21. arXiv:2310.09680  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring

    Authors: Ankitha Sudarshan, Vinay Samuel, Parth Patwa, Ibtihel Amara, Aman Chadha

    Abstract: Automatic Speech Recognition (ASR) has witnessed a profound research interest. Recent breakthroughs have given ASR systems different prospects such as faithfully transcribing spoken language, which is a pivotal advancement in building conversational agents. However, there is still an imminent challenge of accurately discerning context-dependent words and phrases. In this work, we propose a novel a… ▽ More

    Submitted 3 March, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

  22. arXiv:2309.05090  [pdf, other

    cs.CV

    Sculpting Efficiency: Pruning Medical Imaging Models for On-Device Inference

    Authors: Sudarshan Sreeram, Bernhard Kainz

    Abstract: Leveraging ML advancements to augment healthcare systems can improve patient outcomes. Yet, uninformed engineering decisions in early-stage research inadvertently hinder the feasibility of such solutions for high-throughput, on-device inference, particularly in settings involving legacy hardware and multi-modal gigapixel images. Through a preliminary case study concerning segmentation in cardiolog… ▽ More

    Submitted 1 November, 2023; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted at MedNeurIPS 2023

  23. arXiv:2307.02367  [pdf, other

    cs.LG physics.acc-ph

    Distance Preserving Machine Learning for Uncertainty Aware Accelerator Capacitance Predictions

    Authors: Steven Goldenberg, Malachi Schram, Kishansingh Rajput, Thomas Britton, Chris Pappas, Dan Lu, Jared Walden, Majdi I. Radaideh, Sarah Cousineau, Sudarshan Harave

    Abstract: Providing accurate uncertainty estimations is essential for producing reliable machine learning models, especially in safety-critical applications such as accelerator systems. Gaussian process models are generally regarded as the gold standard method for this task, but they can struggle with large, high-dimensional datasets. Combining deep neural networks with Gaussian process approximation techni… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  24. arXiv:2306.13203  [pdf, other

    cs.CV

    Neural Network Pruning for Real-time Polyp Segmentation

    Authors: Suman Sapkota, Pranav Poudel, Sudarshan Regmi, Bibek Panthi, Binod Bhattarai

    Abstract: Computer-assisted treatment has emerged as a viable application of medical imaging, owing to the efficacy of deep learning models. Real-time inference speed remains a key requirement for such applications to help medical personnel. Even though there generally exists a trade-off between performance and model size, impressive efforts have been made to retain near-original performance by compromising… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  25. arXiv:2306.09434  [pdf, other

    cs.AR

    ECO-CHIP: Estimation of Carbon Footprint of Chiplet-based Architectures for Sustainable VLSI

    Authors: Chetan Choppali Sudarshan, Nikhil Matkar, Sarma Vrudhula, Sachin S. Sapatnekar, Vidya A. Chhabria

    Abstract: Decades of progress in energy-efficient and low-power design have successfully reduced the operational carbon footprint in the semiconductor industry. However, this has led to an increase in embodied emissions, encompassing carbon emissions arising from design, manufacturing, packaging, and other infrastructural activities. While existing research has developed tools to analyze embodied carbon at… ▽ More

    Submitted 14 February, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted at International Symposium on High-Performance Computer Architecture (HPCA)

  26. arXiv:2306.06678  [pdf, other

    cs.DB

    Scheduling of Intermittent Query Processing

    Authors: Saranya C, Sudarshan S

    Abstract: Stream processing is usually done either on a tuple-by-tuple basis or in micro-batches. There are many applications where tuples over a predefined duration/window must be processed within certain deadlines. Processing such queries using stream processing engines can be very inefficient since there is often a significant overhead per tuple or micro-batch. The cost of computation can be significantl… ▽ More

    Submitted 21 April, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

  27. arXiv:2305.17797  [pdf, other

    cs.CV

    T2FNorm: Extremely Simple Scaled Train-time Feature Normalization for OOD Detection

    Authors: Sudarshan Regmi, Bibek Panthi, Sakar Dotel, Prashnna K. Gyawali, Danail Stoyanov, Binod Bhattarai

    Abstract: Neural networks are notorious for being overconfident predictors, posing a significant challenge to their safe deployment in real-world applications. While feature normalization has garnered considerable attention within the deep learning literature, current train-time regularization methods for Out-of-Distribution(OOD) detection are yet to fully exploit this potential. Indeed, the naive incorpora… ▽ More

    Submitted 8 June, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

  28. Generalization Bounds for Neural Belief Propagation Decoders

    Authors: Sudarshan Adiga, Xin Xiao, Ravi Tandon, Bane Vasic, Tamal Bose

    Abstract: Machine learning based approaches are being increasingly used for designing decoders for next generation communication systems. One widely used framework is neural belief propagation (NBP), which unfolds the belief propagation (BP) iterations into a deep neural network and the parameters are trained in a data-driven manner. NBP decoders have been shown to improve upon classical decoding algorithms… ▽ More

    Submitted 20 April, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Published in IEEE Transactions on Information Theory (2024)

  29. arXiv:2305.08777  [pdf, other

    cs.AI cs.CL cs.LG

    Question-Answering System Extracts Information on Injection Drug Use from Clinical Notes

    Authors: Maria Mahbub, Ian Goethert, Ioana Danciu, Kathryn Knight, Sudarshan Srinivasan, Suzanne Tamang, Karine Rozenberg-Ben-Dror, Hugo Solares, Susana Martins, Jodie Trafton, Edmon Begoli, Gregory Peterson

    Abstract: Background: Injection drug use (IDU) is a dangerous health behavior that increases mortality and morbidity. Identifying IDU early and initiating harm reduction interventions can benefit individuals at risk. However, extracting IDU behaviors from patients' electronic health records (EHR) is difficult because there is no International Classification of Disease (ICD) code and the only place IDU infor… ▽ More

    Submitted 28 December, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 31 pages, 11 tables, 7 figures

  30. arXiv:2305.01339  [pdf, ps, other

    cs.GT

    Guaranteeing Envy-Freeness under Generalized Assignment Constraints

    Authors: Siddharth Barman, Arindam Khan, Sudarshan Shyam, K. V. N. Sreenivas

    Abstract: We study fair division of goods under the broad class of generalized assignment constraints. In this constraint framework, the sizes and values of the goods are agent-specific, and one needs to allocate the goods among the agents fairly while further ensuring that each agent receives a bundle of total size at most the corresponding budget of the agent. Since, in such a constraint setting, it may n… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 29 pages

  31. arXiv:2304.05301  [pdf, other

    cs.DC cs.LG

    TACOS: Topology-Aware Collective Algorithm Synthesizer for Distributed Machine Learning

    Authors: William Won, Midhilesh Elavazhagan, Sudarshan Srinivasan, Ajaya Durg, Samvit Kaul, Swati Gupta, Tushar Krishna

    Abstract: The surge of artificial intelligence, specifically large language models, has led to a rapid advent towards the development of large-scale machine learning training clusters. Collective communications within these clusters tend to be heavily bandwidth-bound, necessitating techniques to optimally utilize the available network bandwidth. This puts the routing algorithm for the collective at the fore… ▽ More

    Submitted 29 March, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

  32. arXiv:2304.01074  [pdf, other

    cs.RO

    FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation

    Authors: Sudarshan S Harithas, Gurkirat Singh, Aneesh Chavan, Sarthak Sharma, Suraj Patni, Chetan Arora, K. Madhava Krishna

    Abstract: We focus on the problem of LiDAR point cloud based loop detection (or Finding) and closure (LDC) in a multi-agent setting. State-of-the-art (SOTA) techniques directly generate learned embeddings of a given point cloud, require large data transfers, and are not robust to wide variations in 6 Degrees-of-Freedom (DOF) viewpoint. Moreover, absence of strong priors in an unstructured point cloud leads… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  33. arXiv:2303.14006  [pdf, other

    cs.DC cs.LG

    ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

    Authors: William Won, Taekyung Heo, Saeed Rashidi, Srinivas Sridharan, Sudarshan Srinivasan, Tushar Krishna

    Abstract: As deep learning models and input data are scaling at an unprecedented rate, it is inevitable to move towards distributed training platforms to fit the model and increase training throughput. State-of-the-art approaches and techniques, such as wafer-scale nodes, multi-dimensional network topologies, disaggregated memory systems, and parallelization strategies, have been actively adopted by emergin… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  34. arXiv:2303.12888  [pdf, other

    cs.LG cs.AI

    A dynamic risk score for early prediction of cardiogenic shock using machine learning

    Authors: Yuxuan Hu, Albert Lui, Mark Goldstein, Mukund Sudarshan, Andrea Tinsay, Cindy Tsui, Samuel Maidman, John Medamana, Neil Jethani, Aahlad Puli, Vuthy Nguy, Yindalon Aphinyanaphongs, Nicholas Kiefer, Nathaniel Smilowitz, James Horowitz, Tania Ahuja, Glenn I Fishman, Judith Hochman, Stuart Katz, Samuel Bernard, Rajesh Ranganath

    Abstract: Myocardial infarction and heart failure are major cardiovascular diseases that affect millions of people in the US. The morbidity and mortality are highest among patients who develop cardiogenic shock. Early recognition of cardiogenic shock is critical. Prompt implementation of treatment measures can prevent the deleterious spiral of ischemia, low blood pressure, and reduced cardiac output due to… ▽ More

    Submitted 28 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  35. arXiv:2303.10195  [pdf, other

    cs.CV cs.RO

    Remote Task-oriented Grasp Area Teaching By Non-Experts through Interactive Segmentation and Few-Shot Learning

    Authors: Furkan Kaynar, Sudarshan Rajagopalan, Shaobo Zhou, Eckehard Steinbach

    Abstract: A robot operating in unstructured environments must be able to discriminate between different gras** styles depending on the prospective manipulation task. Having a system that allows learning from remote non-expert demonstrations can very feasibly extend the cognitive skills of a robot for task-oriented gras**. We propose a novel two-step framework towards this aim. The first step involves gr… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: Presented at the AAAI Workshop on Artificial Intelligence for User-Centric Assistance in at-Home Tasks (2023)

  36. Spectrum-inspired Low-light Image Translation for Saliency Detection

    Authors: Kitty Varghese, Sudarshan Rajagopalan, Mohit Lamba, Kaushik Mitra

    Abstract: Saliency detection methods are central to several real-world applications such as robot navigation and satellite imagery. However, the performance of existing methods deteriorate under low-light conditions because training datasets mostly comprise of well-lit images. One possible solution is to collect a new dataset for low-light conditions. This involves pixel-level annotations, which is not only… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: Presented at The Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP) 2022

  37. arXiv:2303.09293  [pdf

    cs.CV eess.IV

    A transformer-based approach to video frame-level prediction in Affective Behaviour Analysis In-the-wild

    Authors: Dang-Khanh Nguyen, Ngoc-Huynh Ho, Sudarshan Pant, Hyung-Jeong Yang

    Abstract: In recent years, transformer architecture has been a dominating paradigm in many applications, including affective computing. In this report, we propose our transformer-based model to handle Emotion Classification Task in the 5th Affective Behavior Analysis In-the-wild Competition. By leveraging the attentive model and the synthetic dataset, we attain a score of 0.4775 on the validation set of Aff… ▽ More

    Submitted 19 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: 3 pages, 1 figure

  38. arXiv:2302.06294  [pdf, other

    eess.IV cs.CV cs.LG

    CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

    Authors: Chinedu Innocent Nwoye, Tong Yu, Saurav Sharma, Aditya Murali, Deepak Alapatt, Armine Vardazaryan, Kun Yuan, Jonas Hajek, Wolfgang Reiter, Amine Yamlahi, Finn-Henri Smidt, Xiaoyang Zou, Guoyan Zheng, Bruno Oliveira, Helena R. Torres, Satoshi Kondo, Satoshi Kasai, Felix Holm, Ege Özsoy, Shuangchun Gui, Han Li, Sista Raviteja, Rachana Sathish, Pranav Poudel, Binod Bhattarai , et al. (24 additional authors not shown)

    Abstract: Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier effor… ▽ More

    Submitted 14 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: MICCAI EndoVis CholecTriplet2022 challenge report. Published at Elsevier journal of Medical Image Analysis. 25 pages, 15 figures, 8 tables

    Journal ref: Medical Image Analysis, Volume 89, 2023, 102888, ISSN 1361-8415

  39. arXiv:2301.11962  [pdf, other

    cs.LG

    On the Feasibility of Machine Learning Augmented Magnetic Resonance for Point-of-Care Identification of Disease

    Authors: Raghav Singhal, Mukund Sudarshan, Anish Mahishi, Sri Kaushik, Luke Ginocchio, Angela Tong, Hersh Chandarana, Daniel K. Sodickson, Rajesh Ranganath, Sumit Chopra

    Abstract: Early detection of many life-threatening diseases (e.g., prostate and breast cancer) within at-risk population can improve clinical outcomes and reduce cost of care. While numerous disease-specific "screening" tests that are closer to Point-of-Care (POC) are in use for this task, their low specificity results in unnecessary biopsies, leading to avoidable patient trauma and wasteful healthcare spen… ▽ More

    Submitted 2 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  40. arXiv:2301.00737  [pdf, other

    quant-ph cs.LO

    Rotational Abstractions for Verification of Quantum Fourier Transform Circuits

    Authors: Arun Govindankutty, Sudarshan K. Srinivasan, Nimish Mathure

    Abstract: With the race to build large-scale quantum computers and efforts to exploit quantum algorithms for efficient problem solving in science and engineering disciplines, the requirement to have efficient and scalable verification methods are of vital importance. We propose a novel formal verification method that is targeted at Quantum Fourier Transform (QFT) circuits. QFT is a fundamental quantum algor… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Journal ref: IET Quantum Communication, March 2023

  41. arXiv:2210.08083  [pdf, other

    cs.CV cs.GR cs.LG

    Reference Based Color Transfer for Medical Volume Rendering

    Authors: Sudarshan Devkota, Summanta Pattanaik

    Abstract: The benefits of medical imaging are enormous. Medical images provide considerable amounts of anatomical information and this facilitates medical practitioners in performing effective disease diagnosis and deciding upon the best course of medical treatment. A transition from traditional monochromatic medical images like CT scans, X-Rays or MRI images to a colored 3D representation of the anatomical… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  42. arXiv:2210.08080  [pdf, other

    cs.GR cs.CV cs.LG

    Deep Learning based Super-Resolution for Medical Volume Visualization with Direct Volume Rendering

    Authors: Sudarshan Devkota, Sumanta Pattanaik

    Abstract: Modern-day display systems demand high-quality rendering. However, rendering at higher resolution requires a large number of data samples and is computationally expensive. Recent advances in deep learning-based image and video super-resolution techniques motivate us to investigate such networks for high-fidelity upscaling of frames rendered at a lower resolution to a higher resolution. While our w… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  43. arXiv:2210.00263  [pdf, other

    eess.AS cs.LG cs.SD

    Fine-tuning Wav2vec for Vocal-burst Emotion Recognition

    Authors: Dang-Khanh Nguyen, Sudarshan Pant, Ngoc-Huynh Ho, Guee-Sang Lee, Soo-Huyng Kim, Hyung-Jeong Yang

    Abstract: The ACII Affective Vocal Bursts (A-VB) competition introduces a new topic in affective computing, which is understanding emotional expression using the non-verbal sound of humans. We are familiar with emotion recognition via verbal vocal or facial expression. However, the vocal bursts such as laughs, cries, and signs, are not exploited even though they are very informative for behavior analysis. T… ▽ More

    Submitted 1 October, 2022; originally announced October 2022.

  44. A Framework for Formal Verification of DRAM Controllers

    Authors: Lukas Steiner, Chirag Sudarshan, Matthias Jung, Dominik Stoffel, Norbert Wehn

    Abstract: The large number of recent JEDEC DRAM standard releases and their increasing feature set makes it difficult for designers to rapidly upgrade the memory controller IPs to each new standard. Especially the hardware verification is challenging due to the higher protocol complexity of standards like DDR5, LPDDR5 or HBM3 in comparison with their predecessors. With traditional simulation-based verificat… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: ACM/IEEE International Symposium on Memory Systems (MEMSYS 2022)

  45. arXiv:2209.10001  [pdf, other

    cs.NI

    Building Flexible, Low-Cost Wireless Access Networks With Magma

    Authors: Shaddi Hasan, Amar Padmanabhan, Bruce Davie, Jennifer Rexford, Ulas Kozat, Hunter Gatewood, Shruti Sanadhya, Nick Yurchenko, Tariq Al-Khasib, Oriol Batalla, Marie Bremner, Andrei Lee, Evgeniy Makeev, Scott Moeller, Alex Rodriguez, Pravin Shelar, Karthik Subraveti, Sudarshan Kandi, Alejandro Xoconostle, Praveen Kumar Ramakrishnan, Xiaochen Tian, Anoop Tomar

    Abstract: Billions of people remain without Internet access due to availability or affordability of service. In this paper, we present Magma, an open and flexible system for building low-cost wireless access networks. Magma aims to connect users where operator economics are difficult due to issues such as low population density or income levels, while preserving features expected in cellular networks such a… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 15 pages, 10 figures, to be published in the 20th USENIX Symposium on Networked Systems Design and Implementation (2023), source code available at https://github.com/magma/magma

  46. arXiv:2209.04430  [pdf, other

    astro-ph.IM cs.AI cs.LG

    Investigation of a Machine learning methodology for the SKA pulsar search pipeline

    Authors: Shashank Sanjay Bhat, Thiagaraj Prabu, Ben Stappers, Atul Ghalame, Snehanshu Saha, T. S. B Sudarshan, Zafiirah Hosenie

    Abstract: The SKA pulsar search pipeline will be used for real time detection of pulsars. Modern radio telescopes such as SKA will be generating petabytes of data in their full scale of operation. Hence experience-based and data-driven algorithms become indispensable for applications such as candidate detection. Here we describe our findings from testing a state of the art object detection algorithm called… ▽ More

    Submitted 17 January, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Journal ref: Journal of Astronomy and Astrophysics SKA special issue 2022-23 (Under review)

  47. arXiv:2208.10784  [pdf, other

    physics.chem-ph cond-mat.soft cs.LG

    Building Robust Machine Learning Models for Small Chemical Science Data: The Case of Shear Viscosity

    Authors: Nikhil V. S. Avula, Shivanand K. Veesam, Sudarshan Behera, Sundaram Balasubramanian

    Abstract: Shear viscosity, though being a fundamental property of all liquids, is computationally expensive to estimate from equilibrium molecular dynamics simulations. Recently, Machine Learning (ML) methods have been used to augment molecular simulations in many contexts, thus showing promise to estimate viscosity too in a relatively inexpensive manner. However, ML methods face significant challenges like… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: main: 17 pages, 11 figures ; SI: 55 pages, 29 figures ; to be submitted to Journal of Chemical Physics

    Journal ref: Mach. Learn.: Sci. Technol. 3 (2022) 045032

  48. arXiv:2208.08579  [pdf, other

    stat.ME cs.LG stat.ML

    DIET: Conditional independence testing with marginal dependence measures of residual information

    Authors: Mukund Sudarshan, Aahlad Manas Puli, Wesley Tansey, Rajesh Ranganath

    Abstract: Conditional randomization tests (CRTs) assess whether a variable $x$ is predictive of another variable $y$, having observed covariates $z$. CRTs require fitting a large number of predictive models, which is often computationally intractable. Existing solutions to reduce the cost of CRTs typically split the dataset into a train and test portion, or rely on heuristics for interactions, both of which… ▽ More

    Submitted 11 April, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

  49. arXiv:2208.08168  [pdf, other

    cs.GT

    Finding Fair Allocations under Budget Constraints

    Authors: Siddharth Barman, Arindam Khan, Sudarshan Shyam, K. V. N. Sreenivas

    Abstract: We study the fair allocation of indivisible goods among agents with identical, additive valuations but individual budget constraints. Here, the indivisible goods--each with a specific size and value--need to be allocated such that the bundle assigned to each agent is of total size at most the agent's budget. Since envy-free allocations do not necessarily exist in the indivisible goods context, com… ▽ More

    Submitted 17 March, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: 22 pages

  50. arXiv:2207.10293  [pdf, other

    cs.CV

    Affective Behavior Analysis using Action Unit Relation Graph and Multi-task Cross Attention

    Authors: Dang-Khanh Nguyen, Sudarshan Pant, Ngoc-Huynh Ho, Guee-Sang Lee, Soo-Huyng Kim, Hyung-Jeong Yang

    Abstract: Facial behavior analysis is a broad topic with various categories such as facial emotion recognition, age, and gender recognition. Many studies focus on individual tasks while the multi-task learning approach is still an open research issue and requires more research. In this paper, we present our solution and experiment result for the Multi-Task Learning challenge of the Affective Behavior Analys… ▽ More

    Submitted 3 October, 2022; v1 submitted 21 July, 2022; originally announced July 2022.