Skip to main content

Showing 1–50 of 340 results for author: Vishnu

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01739  [pdf, other

    cs.RO

    Enabling Tactile Feedback for Robotic Strawberry Handling using AST Skin

    Authors: Vishnu Rajendran, Kiyanoush Nazari, Simon Parsons, Amir Ghalamzan

    Abstract: Acoustic Soft Tactile (AST) skin is a novel sensing technology which derives tactile information from the modulation of acoustic waves travelling through the skin's embedded acoustic channels. A generalisable data-driven calibration model maps the acoustic modulations to the corresponding tactile information in the form of contact forces with their contact locations and contact geometries. AST ski… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: TAROS 2024

  2. arXiv:2406.17654  [pdf, other

    cs.RO cs.AI

    MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection

    Authors: Michelle Adeline, Junn Yong Loo, Vishnu Monn Baskaran

    Abstract: Multi-view 3D object detection is a crucial component of autonomous driving systems. Contemporary query-based methods primarily depend either on dataset-specific initialization of 3D anchors, introducing bias, or utilize dense attention mechanisms, which are computationally inefficient and unscalable. To overcome these issues, we present MDHA, a novel sparse query-based framework, which constructs… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.16625  [pdf, other

    cs.RO

    GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection

    Authors: Harnaik Dhami, Charith Reddy, Vishnu Dutt Sharma, Troi Williams, Pratap Tokekar

    Abstract: We study the problem of visual surface inspection of infrastructure for defects using an Unmanned Aerial Vehicle (UAV). We do not assume that the geometric model of the infrastructure is known beforehand. Our planner, termed GATSBI, plans a path in a receding horizon fashion to inspect all points on the surface of the infrastructure. The input to GATSBI consists of a 3D occupancy map created onlin… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures, 2 tables. Submitted to IEEE TAES. arXiv admin note: text overlap with arXiv:2012.04803

  4. arXiv:2406.10918  [pdf, other

    cs.LG cs.AI cs.CL

    Embodied Question Answering via Multi-LLM Systems

    Authors: Bhrij Patel, Vishnu Sashank Dorbala, Dinesh Manocha, Amrit Singh Bedi

    Abstract: Embodied Question Answering (EQA) is an important problem, which involves an agent exploring the environment to answer user queries. In the existing literature, EQA has exclusively been studied in single-agent scenarios, where exploration can be time-consuming and costly. In this work, we consider EQA in a multi-agent framework involving multiple large language models (LLM) based agents independen… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 17 pages, 13 Figures, 4 Tables

  5. arXiv:2406.10231  [pdf

    cs.CV eess.IV

    Sign Language Recognition based on YOLOv5 Algorithm for the Telugu Sign Language

    Authors: Vipul Reddy. P, Vishnu Vardhan Reddy. B, Sukriti

    Abstract: Sign language recognition (SLR) technology has enormous promise to improve communication and accessibility for the difficulty of hearing. This paper presents a novel approach for identifying gestures in TSL using the YOLOv5 object identification framework. The main goal is to create an accurate and successful method for identifying TSL gestures so that the deaf community can use slr. After that, a… ▽ More

    Submitted 24 April, 2024; originally announced June 2024.

    Comments: 11 pages, 9 figures

  6. arXiv:2406.08488  [pdf, other

    cs.CV cs.AI cs.LG

    ICE-G: Image Conditional Editing of 3D Gaussian Splats

    Authors: Vishnu Jaganathan, Hannah Hanyun Huang, Muhammad Zubair Irshad, Varun Jampani, Amit Raj, Zsolt Kira

    Abstract: Recently many techniques have emerged to create high quality 3D assets and scenes. When it comes to editing of these objects, however, existing approaches are either slow, compromise on quality, or do not provide enough customization. We introduce a novel approach to quickly edit a 3D model from a single reference view. Our technique first segments the edit image, and then matches semantically cor… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR AI4CC Workshop 2024. Project page: https://ice-gaussian.github.io

  7. arXiv:2406.06654  [pdf, other

    cs.LG stat.ME

    Training and Validating a Treatment Recommender with Partial Verification Evidence

    Authors: Vishnu Unnikrishnan, Clara Puga, Miro Schleicher, Uli Niemann, Berthod Langguth, Stefan Schoisswohl, Birgit Mazurek, Rilana Cima, Jose Antonio Lopez-Escamez, Dimitris Kikidis, Eleftheria Vellidou, Ruediger Pryss, Winfried Schlee, Myra Spiliopoulou

    Abstract: Current clinical decision support systems (DSS) are trained and validated on observational data from the target clinic. This is problematic for treatments validated in a randomized clinical trial (RCT), but not yet introduced in any clinic. In this work, we report on a method for training and validating the DSS using the RCT data. The key challenges we address are of missingness -- missing rationa… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  8. arXiv:2406.00985  [pdf, other

    cs.CV

    MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models

    Authors: Mingzhen Huang, Jialing Cai, Shan Jia, Vishnu Suresh Lokhande, Siwei Lyu

    Abstract: Text-driven image synthesis has made significant advancements with the development of diffusion models, transforming how visual content is generated from text prompts. Despite these advances, text-driven image editing, a key area in computer graphics, faces unique challenges. A major challenge is making simultaneous edits across multiple objects or attributes. Applying these methods sequentially f… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2405.12168  [pdf, other

    cs.IT

    WiDRa -- Enabling Millimeter-Level Differential Ranging Accuracy in Wi-Fi Using Carrier Phase

    Authors: Vishnu V. Ratnam, Bilal Sadiq, Hao Chen, Wei Sun, Shunyao Wu, Boon L. Ng, Jianzhong, Zhang

    Abstract: Although Wi-Fi is an ideal technology for many ranging applications, the performance of current methods is limited by the system bandwidth, leading to low accuracy of $\sim 1$ m. For many applications, measuring differential range, viz., the change in the range between adjacent measurements, is sufficient. Correspondingly, this work proposes WiDRa - a Wi-Fi based Differential Ranging solution that… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted to IEEE JSAC special issue on Positioning and Sensing Over Wireless Networks, 2024

  10. arXiv:2405.11511  [pdf, other

    cs.CV

    Online Action Representation using Change Detection and Symbolic Programming

    Authors: Vishnu S Nair, Sneha Sree, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: This paper addresses the critical need for online action representation, which is essential for various applications like rehabilitation, surveillance, etc. The task can be defined as representation of actions as soon as they happen in a streaming video without access to video frames in the future. Most of the existing methods use predefined window sizes for video segments, which is a restrictive… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  11. arXiv:2405.07896  [pdf, other

    cs.AI cs.HC cs.IR cs.LG

    Almanac Copilot: Towards Autonomous Electronic Health Record Navigation

    Authors: Cyril Zakka, Joseph Cho, Gracia Fahed, Rohan Shad, Michael Moor, Robyn Fong, Dhamanpreet Kaur, Vishnu Ravi, Oliver Aalami, Roxana Daneshjou, Akshay Chaudhari, William Hiesinger

    Abstract: Clinicians spend large amounts of time on clinical documentation, and inefficiencies impact quality of care and increase clinician burnout. Despite the promise of electronic medical records (EMR), the transition from paper-based records has been negatively associated with clinician wellness, in part due to poor user experience, increased burden of documentation, and alert fatigue. In this study, w… ▽ More

    Submitted 14 May, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

  12. arXiv:2405.04732  [pdf, other

    cs.RO cs.AI

    S-EQA: Tackling Situational Queries in Embodied Question Answering

    Authors: Vishnu Sashank Dorbala, Prasoon Goyal, Robinson Piramuthu, Michael Johnston, Dinesh Manocha, Reza Ghanadhan

    Abstract: We present and tackle the problem of Embodied Question Answering (EQA) with Situational Queries (S-EQA) in a household environment. Unlike prior EQA work tackling simple queries that directly reference target objects and quantifiable properties pertaining them, EQA with situational queries (such as "Is the bathroom clean and dry?") is more challenging, as the agent needs to figure out not just wha… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 8 Pages

  13. The Dark Side of Dataset Scaling: Evaluating Racial Classification in Multimodal Models

    Authors: Abeba Birhane, Sepehr Dehdashtian, Vinay Uday Prabhu, Vishnu Boddeti

    Abstract: Scale the model, scale the data, scale the GPU farms is the reigning sentiment in the world of generative AI today. While model scaling has been extensively studied, data scaling and its downstream impacts on model performance remain under-explored. This is particularly important in the context of multimodal datasets whose main source is the World Wide Web, condensed and packaged as the Common Cra… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: To appear in the proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT 24), June 3 to 6, 2024, Rio de Janeiro, Brazil. arXiv admin note: text overlap with arXiv:2306.13141

  14. arXiv:2405.03961  [pdf, other

    cs.LG q-bio.BM

    Structure-based drug design by denoising voxel grids

    Authors: Pedro O. Pinheiro, Arian Jamasb, Omar Mahmood, Vishnu Sresht, Saeed Saremi

    Abstract: We present VoxBind, a new score-based generative model for 3D molecules conditioned on protein structures. Our approach represents molecules as 3D atomic density grids and leverages a 3D voxel-denoising network for learning and generation. We extend the neural empirical Bayes formalism (Saremi & Hyvarinen, 2019) to the conditional setting and generate structure-conditioned molecules with a two-ste… ▽ More

    Submitted 2 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  15. arXiv:2405.01568  [pdf

    cs.SE

    Convert any android device into a programmable IoT device with the help of IoT Everywhere Framework

    Authors: Vishnu Joshi

    Abstract: The world around us is transforming as the field of the Internet of Things is taking over the world faster than we thought. Everyone in the tech industry is building wonderful things with the help of IoT. Smartwatches, smart coffee machines, smart television, smart homes are some of the examples. Building IoT sensor modules with sensors that connect to the internet can be very intimidating for peo… ▽ More

    Submitted 14 April, 2024; originally announced May 2024.

    Comments: 4 pages, 10 figures

  16. arXiv:2404.16255  [pdf, other

    cs.CR cs.CV

    Enhancing Privacy in Face Analytics Using Fully Homomorphic Encryption

    Authors: Bharat Yalavarthi, Arjun Ramesh Kaushik, Arun Ross, Vishnu Boddeti, Nalini Ratha

    Abstract: Modern face recognition systems utilize deep neural networks to extract salient features from a face. These features denote embeddings in latent space and are often stored as templates in a face recognition system. These embeddings are susceptible to data leakage and, in some cases, can even be used to reconstruct the original face image. To prevent compromising identities, template protection sch… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  17. arXiv:2404.09454  [pdf, other

    cs.CV cs.CY cs.LG

    Utility-Fairness Trade-Offs and How to Find Them

    Authors: Sepehr Dehdashtian, Bashir Sadeghi, Vishnu Naresh Boddeti

    Abstract: When building classification systems with demographic fairness considerations, there are two objectives to satisfy: 1) maximizing utility for the specific task and 2) ensuring fairness w.r.t. a known demographic attribute. These objectives often compete, so optimizing both can lead to a trade-off between utility and fairness. While existing works acknowledge the trade-offs and study their limits,… ▽ More

    Submitted 23 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

  18. arXiv:2404.04752  [pdf, other

    cs.AI cs.MA

    Challenges Faced by Large Language Models in Solving Multi-Agent Flocking

    Authors: Peihan Li, Vishnu Menon, Bhavanaraj Gudiguntla, Daniel Ting, Lifeng Zhou

    Abstract: Flocking is a behavior where multiple agents in a system attempt to stay close to each other while avoiding collision and maintaining a desired formation. This is observed in the natural world and has applications in robotics, including natural disaster search and rescue, wild animal tracking, and perimeter surveillance and patrol. Recently, large language models (LLMs) have displayed an impressiv… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  19. arXiv:2404.03813  [pdf, ps, other

    quant-ph cs.LG

    Agnostic Tomography of Stabilizer Product States

    Authors: Sabee Grewal, Vishnu Iyer, William Kretschmer, Daniel Liang

    Abstract: We define a quantum learning task called agnostic tomography, where given copies of an arbitrary state $ρ$ and a class of quantum states $\mathcal{C}$, the goal is to output a succinct description of a state that approximates $ρ$ at least as well as any state in $\mathcal{C}$ (up to some small error $\varepsilon$). This task generalizes ordinary quantum tomography of states in $\mathcal{C}$ and is… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 20 pages

  20. arXiv:2404.00126  [pdf, ps, other

    quant-ph cs.CC

    Pseudoentanglement Ain't Cheap

    Authors: Sabee Grewal, Vishnu Iyer, William Kretschmer, Daniel Liang

    Abstract: We show that any pseudoentangled state ensemble with a gap of $t$ bits of entropy requires $Ω(t)$ non-Clifford gates to prepare. This bound is tight up to polylogarithmic factors if linear-time quantum-secure pseudorandom functions exist. Our result follows from a polynomial-time algorithm to estimate the entanglement entropy of a quantum state across any cut of qubits. When run on an $n$-qubit st… ▽ More

    Submitted 11 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: 15 pages; v2: slight edits to concurrent work section

  21. arXiv:2403.15593  [pdf, other

    cs.CV cs.LG

    FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs

    Authors: Sepehr Dehdashtian, Lan Wang, Vishnu Naresh Boddeti

    Abstract: Large pre-trained vision-language models such as CLIP provide compact and general-purpose representations of text and images that are demonstrably effective across multiple downstream zero-shot prediction tasks. However, owing to the nature of their training process, these models have the potential to 1) propagate or amplify societal biases in the training data and 2) learn to rely on spurious fea… ▽ More

    Submitted 16 May, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: The Twelfth International Conference on Learning Representations (ICLR) 2024

  22. arXiv:2403.13247  [pdf, other

    cs.LG cs.DC

    FedNMUT -- Federated Noisy Model Update Tracking Convergence Analysis

    Authors: Vishnu Pandi Chellapandi, Antesh Upadhyay, Abolfazl Hashemi, Stanislaw H. Żak

    Abstract: A novel Decentralized Noisy Model Update Tracking Federated Learning algorithm (FedNMUT) is proposed that is tailored to function efficiently in the presence of noisy communication channels that reflect imperfect information exchange. This algorithm uses gradient tracking to minimize the impact of data heterogeneity while minimizing communication overhead. The proposed algorithm incorporates noise… ▽ More

    Submitted 24 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2303.10695

  23. arXiv:2403.12876  [pdf, other

    cs.RO cs.HC

    LAVA: Long-horizon Visual Action based Food Acquisition

    Authors: Amisha Bhaskar, Rui Liu, Vishnu D. Sharma, Guangyao Shi, Pratap Tokekar

    Abstract: Robotic Assisted Feeding (RAF) addresses the fundamental need for individuals with mobility impairments to regain autonomy in feeding themselves. The goal of RAF is to use a robot arm to acquire and transfer food to individuals from the table. Existing RAF methods primarily focus on solid foods, leaving a gap in manipulation strategies for semi-solid and deformable foods. This study introduces Lon… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

  24. arXiv:2403.11487  [pdf, other

    cs.RO cs.AI

    Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis

    Authors: Vishnu Sashank Dorbala, Sanjoy Chowdhury, Dinesh Manocha

    Abstract: We present a novel approach to automatically synthesize "wayfinding instructions" for an embodied robot agent. In contrast to prior approaches that are heavily reliant on human-annotated datasets designed exclusively for specific simulation platforms, our algorithm uses in-context learning to condition an LLM to generate instructions using just a few references. Using an LLM-based Visual Question… ▽ More

    Submitted 2 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 14 Pages

  25. Surveyor: Facilitating Discovery Within Video Games for Blind and Low Vision Players

    Authors: Vishnu Nair, Hanxiu 'Hazel' Zhu, Peize Song, Jizhong Wang, Brian A. Smith

    Abstract: Video games are increasingly accessible to blind and low vision (BLV) players, yet many aspects remain inaccessible. One aspect is the joy players feel when they explore environments and make new discoveries, which is integral to many games. Sighted players experience discovery by surveying environments and identifying unexplored areas. Current accessibility tools, however, guide BLV players direc… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Journal ref: Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems (CHI '24), May 2024

  26. arXiv:2403.09905  [pdf, other

    cs.RO cs.CV

    Right Place, Right Time! Towards ObjectNav for Non-Stationary Goals

    Authors: Vishnu Sashank Dorbala, Bhrij Patel, Amrit Singh Bedi, Dinesh Manocha

    Abstract: We present a novel approach to tackle the ObjectNav task for non-stationary and potentially occluded targets in an indoor environment. We refer to this task Portable ObjectNav (or P-ObjectNav), and in this work, present its formulation, feasibility, and a navigation benchmark using a novel memory-enhanced LLM-based policy. In contrast to ObjNav where target object locations are fixed for each epis… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 32

  27. arXiv:2403.09762  [pdf, other

    cs.CL cs.AI cs.HC cs.LG cs.NE

    Emotional Intelligence Through Artificial Intelligence : NLP and Deep Learning in the Analysis of Healthcare Texts

    Authors: Prashant Kumar Nag, Amit Bhagat, R. Vishnu Priya, Deepak kumar Khare

    Abstract: This manuscript presents a methodical examination of the utilization of Artificial Intelligence in the assessment of emotions in texts related to healthcare, with a particular focus on the incorporation of Natural Language Processing and deep learning technologies. We scrutinize numerous research studies that employ AI to augment sentiment analysis, categorize emotions, and forecast patient outcom… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  28. arXiv:2403.08360  [pdf, other

    cs.CV cs.RO

    Improved Image-based Pose Regressor Models for Underwater Environments

    Authors: Luyuan Peng, Hari Vishnu, Mandar Chitre, Yuen Min Too, Bharath Kalyan, Rajat Mishra

    Abstract: We investigate the performance of image-based pose regressor models in underwater environments for relocalization. Leveraging PoseNet and PoseLSTM, we regress a 6-degree-of-freedom pose from single RGB images with high accuracy. Additionally, we explore data augmentation with stereo camera images to improve model accuracy. Experimental results demonstrate that the models achieve high accuracy in b… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Presented at AUV Symposium 2022

  29. arXiv:2403.07198  [pdf, other

    cs.CV

    Action Reimagined: Text-to-Pose Video Editing for Dynamic Human Actions

    Authors: Lan Wang, Vishnu Boddeti, Sernam Lim

    Abstract: We introduce a novel text-to-pose video editing method, ReimaginedAct. While existing video editing tasks are limited to changes in attributes, backgrounds, and styles, our method aims to predict open-ended human action changes in video. Moreover, our method can accept not only direct instructional text prompts but also `what if' questions to predict possible action changes. ReimaginedAct comprise… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  30. arXiv:2403.02598   

    cs.LG cs.CV

    Pooling Image Datasets With Multiple Covariate Shift and Imbalance

    Authors: Sotirios Panagiotis Chytas, Vishnu Suresh Lokhande, Peiran Li, Vikas Singh

    Abstract: Small sample sizes are common in many disciplines, which necessitates pooling roughly similar datasets across multiple institutions to study weak but relevant associations between images and disease outcomes. Such data often manifest shift/imbalance in covariates (i.e., secondary non-imaging data). Controlling for such nuisance variables is common within standard statistical analysis, but the idea… ▽ More

    Submitted 14 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: We need to do some fixes of references to make them more precise. This paper will be corrected and uploaded again by another group member

  31. arXiv:2403.02543  [pdf, ps, other

    quant-ph cs.CC

    PDQMA = DQMA = NEXP: QMA With Hidden Variables and Non-collapsing Measurements

    Authors: Scott Aaronson, Sabee Grewal, Vishnu Iyer, Simon C. Marshall, Ronak Ramachandran

    Abstract: We define and study a variant of QMA (Quantum Merlin Arthur) in which Arthur can make multiple non-collapsing measurements to Merlin's witness state, in addition to ordinary collapsing measurements. By analogy to the class PDQP defined by Aaronson, Bouland, Fitzsimons, and Lee (2014), we call this class PDQMA. Our main result is that PDQMA = NEXP; this result builds on the MIP = NEXP Theorem and c… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages

  32. Complete and Near-Optimal Robotic Crack Coverage and Filling in Civil Infrastructure

    Authors: Vishnu Veeraraghavan, Kyle Hunte, **gang Yi, Kaiyan Yu

    Abstract: We present a simultaneous sensor-based inspection and footprint coverage (SIFC) planning and control design with applications to autonomous robotic crack map** and filling. The main challenge of the SIFC problem lies in the coupling of complete sensing (for map**) and robotic footprint (for filling) coverage tasks. Initially, we assume known target information (e.g., crack) and employ classic… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Journal ref: in IEEE Transactions on Robotics, vol. 40, pp. 2850-2867, 2024

  33. arXiv:2402.18116  [pdf, other

    cs.GR cs.CV

    Block and Detail: Scaffolding Sketch-to-Image Generation

    Authors: Vishnu Sarukkai, Lu Yuan, Mia Tang, Maneesh Agrawala, Kayvon Fatahalian

    Abstract: We introduce a novel sketch-to-image tool that aligns with the iterative refinement process of artists. Our tool lets users sketch blocking strokes to coarsely represent the placement and form of objects and detail strokes to refine their shape and silhouettes. We develop a two-pass algorithm for generating high-fidelity images from such sketches at any point in the iterative process. In the first… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 12 pages, 13 figures

  34. arXiv:2402.15492  [pdf, other

    cs.LG eess.SP

    Mechanics-Informed Autoencoder Enables Automated Detection and Localization of Unforeseen Structural Damage

    Authors: Xuyang Li, Hamed Bolandi, Mahdi Masmoudi, Talal Salem, Nizar Lajnef, Vishnu Naresh Boddeti

    Abstract: Structural health monitoring (SHM) is vital for ensuring the safety and longevity of structures like buildings and bridges. As the volume and scale of structures and the impact of their failure continue to grow, there is a dire need for SHM techniques that are scalable, inexpensive, operate passively without human intervention, and customized for each mechanical structure without the need for comp… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  35. Publicly auditable privacy-preserving electoral rolls

    Authors: Prashant Agrawal, Mahabir Prasad Jhanwar, Subodh Vishnu Sharma, Subhashis Banerjee

    Abstract: While existing literature on electronic voting has extensively addressed verifiability of voting protocols, the vulnerability of electoral rolls in large public elections remains a critical concern. To ensure integrity of electoral rolls, the current practice is to either make electoral rolls public or share them with the political parties. However, this enables construction of detailed voter prof… ▽ More

    Submitted 2 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Report number: CSF 2024

    Journal ref: 2024 IEEE 37th Computer Security Foundations Symposium (CSF)

  36. arXiv:2402.08648  [pdf, other

    cs.LG cs.AI

    Generating Universal Adversarial Perturbations for Quantum Classifiers

    Authors: Gautham Anil, Vishnu Vinod, Apurva Narayan

    Abstract: Quantum Machine Learning (QML) has emerged as a promising field of research, aiming to leverage the capabilities of quantum computing to enhance existing machine learning methodologies. Recent studies have revealed that, like their classical counterparts, QML models based on Parametrized Quantum Circuits (PQCs) are also vulnerable to adversarial attacks. Moreover, the existence of Universal Advers… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted at AAAI 2024

  37. arXiv:2402.02603  [pdf

    cs.RO

    A Review of Full-Sized Autonomous Racing Vehicle Sensor Architecture

    Authors: Manuel Mar, Vishnu Chellapandi, Liangqi Yuan, Ziran Wang, Eric Dietz

    Abstract: In the landscape of technological innovation, autonomous racing is a dynamic and challenging domain that not only pushes the limits of technology, but also plays a crucial role in advancing and fostering a greater acceptance of autonomous systems. This paper thoroughly explores challenges and advances in autonomous racing vehicle design and performance, focusing on Roborace and the Indy Autonomous… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  38. arXiv:2402.01711  [pdf, other

    cs.CY cs.AI

    LLM on FHIR -- Demystifying Health Records

    Authors: Paul Schmiedmayer, Adrit Rao, Philipp Zagar, Vishnu Ravi, Aydin Zahedivash, Arash Fereydooni, Oliver Aalami

    Abstract: Objective: To enhance health literacy and accessibility of health information for a diverse patient population by develo** a patient-centered artificial intelligence (AI) solution using large language models (LLMs) and Fast Healthcare Interoperability Resources (FHIR) application programming interfaces (APIs). Materials and Methods: The research involved develo** LLM on FHIR, an open-source mo… ▽ More

    Submitted 25 January, 2024; originally announced February 2024.

    Comments: Pre-print of the paper submitted to the Call for Papers for the Special Focus Issue on ChatGPT and Large Language Models (LLMs) in Biomedicine and Health at the Journal of the American Medical Informatics Association: https://academic.oup.com/jamia/pages/call-for-papers-for-special-focus-issue

  39. Determination of Trace Organic Contaminant Concentration via Machine Classification of Surface-Enhanced Raman Spectra

    Authors: Vishnu Jayaprakash, Jae Bem You, Chiranjeevi Kanike, **feng Liu, Christopher McCallum, Xuehua Zhang

    Abstract: Accurate detection and analysis of traces of persistent organic pollutants in water is important in many areas, including environmental monitoring and food quality control, due to their long environmental stability and potential bioaccumulation. While conventional analysis of organic pollutants requires expensive equipment, surface enhanced Raman spectroscopy (SERS) has demonstrated great potentia… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  40. arXiv:2402.00090  [pdf

    q-bio.NC cs.HC

    Classification of attention performance post-longitudinal tDCS via functional connectivity and machine learning methods

    Authors: Akash K Rao, Vishnu K Menon, Arnav Bhavsar, Shubhajit Roy Chowdhury, Ramsingh Negi, Varun Dutt

    Abstract: Attention is the brain's mechanism for selectively processing specific stimuli while filtering out irrelevant information. Characterizing changes in attention following long-term interventions (such as transcranial direct current stimulation (tDCS)) has seldom been emphasized in the literature. To classify attention performance post-tDCS, this study uses functional connectivity and machine learnin… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 6 pages, to be presented in the IEEE 9th International Conference for Convergence in Technology (I2CT),Pune, April 2024. arXiv admin note: substantial text overlap with arXiv:2401.17700

  41. arXiv:2401.17745  [pdf

    cs.RO

    Gesture Controlled Robot For Human Detection

    Authors: Athira T. S, Honey Manoj, R S Vishnu Priya, Vishnu K Menon, Srilekshmi M

    Abstract: It is very important to locate survivors from collapsed buildings so that rescue operations can be arranged. Many lives are lost due to lack of competent systems to detect people in these collapsed buildings at the right time. So here we have designed a hand gesture controlled robot which is capable of detecting humans under these collapsed building parts. The proposed work can be used to access s… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 6 pages, presented at the 2nd International Conference on IoT Based Control Networks and Intelligent Systems(ICICNIS 2021)

    Journal ref: proceedings of International Conference on IoT Based Control Networks & Intelligent Systems - ICICNIS 2021, 6 pages,2021

  42. arXiv:2401.17711  [pdf

    cs.HC cs.AI

    Prediction of multitasking performance post-longitudinal tDCS via EEG-based functional connectivity and machine learning methods

    Authors: Akash K Rao, Shashank Uttrani, Vishnu K Menon, Darshil Shah, Arnav Bhavsar, Shubhajit Roy Chowdhury, Varun Dutt

    Abstract: Predicting and understanding the changes in cognitive performance, especially after a longitudinal intervention, is a fundamental goal in neuroscience. Longitudinal brain stimulation-based interventions like transcranial direct current stimulation (tDCS) induce short-term changes in the resting membrane potential and influence cognitive processes. However, very little research has been conducted o… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 16 pages, presented at the 30th International Conference on Neural Information Processing (ICONIP2023), Changsha, China, November 2023

  43. arXiv:2401.17705  [pdf

    cs.LG cs.HC

    Predicting suicidal behavior among Indian adults using childhood trauma, mental health questionnaires and machine learning cascade ensembles

    Authors: Akash K Rao, Gunjan Y Trivedi, Riri G Trivedi, Anshika Bajpai, Gajraj Singh Chauhan, Vishnu K Menon, Kathirvel Soundappan, Hemalatha Ramani, Neha Pandya, Varun Dutt

    Abstract: Among young adults, suicide is India's leading cause of death, accounting for an alarming national suicide rate of around 16%. In recent years, machine learning algorithms have emerged to predict suicidal behavior using various behavioral traits. But to date, the efficacy of machine learning algorithms in predicting suicidal behavior in the Indian context has not been explored in literature. In th… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 11 pages, presnted at the 4th International Conference on Frontiers in Computing and Systems (COMSYS 2023), Himachal Pradesh, October 2023

  44. arXiv:2401.17700  [pdf

    cs.HC cs.AI

    Classification of executive functioning performance post-longitudinal tDCS using functional connectivity and machine learning methods

    Authors: Akash K Rao, Vishnu K Menon, Shashank Uttrani, Ayushman Dixit, Dipanshu Verma, Varun Dutt

    Abstract: Executive functioning is a cognitive process that enables humans to plan, organize, and regulate their behavior in a goal-directed manner. Understanding and classifying the changes in executive functioning after longitudinal interventions (like transcranial direct current stimulation (tDCS)) has not been explored in the literature. This study employs functional connectivity and machine learning al… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 7 pages, presented at the IEEE 20th India Council International Conference (INDICON 2023), Hyderabad, India, December 2023

  45. arXiv:2401.14292  [pdf, other

    cs.RO cs.AI

    Single and bi-layered 2-D acoustic soft tactile skin (AST2)

    Authors: Vishnu Rajendran, Simon Parsons, Amir Ghalamzan E

    Abstract: This paper aims to present an innovative and cost-effective design for Acoustic Soft Tactile (AST) Skin, with the primary goal of significantly enhancing the accuracy of 2-D tactile feature estimation. The existing challenge lies in achieving precise tactile feature estimation, especially concerning contact geometry characteristics, using cost-effective solutions. We hypothesise that by harnessing… ▽ More

    Submitted 29 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: IEEE Robosoft conference 2024 (accepted)

  46. arXiv:2401.02677  [pdf, other

    cs.CV cs.AI

    Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

    Authors: Yatharth Gupta, Vishnu V. Jaddipal, Harish Prabhala, Sayak Paul, Patrick Von Platen

    Abstract: Stable Diffusion XL (SDXL) has become the best open source text-to-image model (T2I) for its versatility and top-notch image quality. Efficiently addressing the computational demands of SDXL models is crucial for wider reach and applicability. In this work, we introduce two scaled-down variants, Segmind Stable Diffusion (SSD-1B) and Segmind-Vega, with 1.3B and 0.74B parameter UNets, respectively,… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  47. Joint Phase-Time Arrays: A Paradigm for Frequency-Dependent Analog Beamforming in 6G

    Authors: Vishnu V. Ratnam, Jianhua Mo, Ahmad AlAmmouri, Boon L. Ng, Jianzhong, Zhang, Andreas F. Molisch

    Abstract: Hybrid beamforming is an attractive solution to build cost-effective and energy-efficient transceivers for millimeter-wave and terahertz systems. However, conventional hybrid beamforming techniques rely on analog components that generate a frequency flat response such as phase-shifters and switches, which limits the flexibility of the achievable beam patterns. As a novel alternative, this paper pr… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: The paper is a revised version of the IEEE Access paper, that includes the full operation of Algorithms 1-3 to help curtail incorrect implementations

    Journal ref: IEEE Access, vol. 10, pp. 73364-73377, 2022

  48. arXiv:2312.05299  [pdf, other

    cs.LG hep-th math-ph math.GR

    Learning to be Simple

    Authors: Yang-Hui He, Vishnu Jejjala, Challenger Mishra, Em Sharnoff

    Abstract: In this work we employ machine learning to understand structured mathematical data involving finite groups and derive a theorem about necessary properties of generators of finite simple groups. We create a database of all 2-generated subgroups of the symmetric group on n-objects and conduct a classification of finite simple groups among them using shallow feed-forward neural networks. We show that… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 25 pages, 6 figures and 5 tables

  49. arXiv:2312.01874  [pdf, ps, other

    cs.GT

    Fair Division via Quantile Shares

    Authors: Yakov Babichenko, Michal Feldman, Ron Holzman, Vishnu V. Narayan

    Abstract: We consider the problem of fair division, where a set of indivisible goods should be distributed fairly among a set of agents with combinatorial valuations. To capture fairness, we adopt the notion of shares, where each agent is entitled to a fair share, based on some fairness criterion, and an allocation is considered fair if the value of every agent (weakly) exceeds her fair share. A share-based… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 23 pages, no figures

  50. arXiv:2311.16484  [pdf, other

    cs.CV

    Eye vs. AI: Human Gaze and Model Attention in Video Memorability

    Authors: Prajneya Kumar, Eshika Khandelwal, Makarand Tapaswi, Vishnu Sreekumar

    Abstract: Understanding the factors that determine video memorability has important applications in areas such as educational technology and advertising. Towards this goal, we investigate the semantic and temporal attention mechanisms underlying video memorability. We propose a Transformer-based model with spatio-temporal attention that matches SoTA performance on video memorability prediction on a large na… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.