Skip to main content

Showing 1–50 of 1,391 results for author: Vijay

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02233  [pdf, other

    cs.CL cs.AI cs.LG

    Synthetic Multimodal Question Generation

    Authors: Ian Wu, Sravan Jayanthi, Vijay Viswanathan, Simon Rosenberg, Sina Pakazad, Tongshuang Wu, Graham Neubig

    Abstract: Multimodal Retrieval Augmented Generation (MMRAG) is a powerful approach to question-answering over multimodal documents. A key challenge with evaluating MMRAG is the paucity of high-quality datasets matching the question styles and modalities of interest. In light of this, we propose SMMQG, a synthetic data generation framework. SMMQG leverages interplay between a retriever, large language model… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Submitted to ARR June 2024

  2. arXiv:2407.01781  [pdf, other

    cs.CV cs.GR cs.LG

    fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence

    Authors: Francis Williams, Jiahui Huang, Jonathan Swartz, Gergely Klár, Vijay Thakkar, Matthew Cong, Xuanchi Ren, Ruilong Li, Clement Fuji-Tsang, Sanja Fidler, Eftychios Sifakis, Ken Museth

    Abstract: We present fVDB, a novel GPU-optimized framework for deep learning on large-scale 3D data. fVDB provides a complete set of differentiable primitives to build deep learning architectures for common tasks in 3D learning such as convolution, pooling, attention, ray-tracing, meshing, etc. fVDB simultaneously provides a much larger feature set (primitives and operators) than established frameworks wi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2407.01481  [pdf, other

    cs.DC cs.PF

    LLload: Simplifying Real-Time Job Monitoring for HPC Users

    Authors: Chansup Byun, Julia Mullen, Albert Reuther, William Arcand, William Bergeron, David Bestor, Daniel Burrill, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Peter Michaleas, Guillermo Morales, Andrew Prout, Antonio Rosa, Charles Yee, Jeremy Kepner, Lauren Milechin

    Abstract: One of the more complex tasks for researchers using HPC systems is performance monitoring and tuning of their applications. Develo** a practice of continuous performance improvement, both for speed-up and efficient use of resources is essential to the long term success of both the HPC practitioner and the research project. Profiling tools provide a nice view of the performance of an application… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2407.00557  [pdf, other

    cs.CV

    Explaining Chest X-ray Pathology Models using Textual Concepts

    Authors: Vijay Sadashivaiah, Mannudeep K. Kalra, **kun Yan, James A. Hendler

    Abstract: Deep learning models have revolutionized medical imaging and diagnostics, yet their opaque nature poses challenges for clinical adoption and trust. Amongst approaches to improve model interpretability, concept-based explanations aim to provide concise and human understandable explanations of any arbitrary classifier. However, such methods usually require a large amount of manually collected data w… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  5. arXiv:2406.17990  [pdf, other

    cs.CL cs.AI cs.LG

    Explicit Diversity Conditions for Effective Question Answer Generation with Large Language Models

    Authors: Vikas Yadav, Hyuk Joon Kwon, Vijay Srinivasan, Hongxia **

    Abstract: Question Answer Generation (QAG) is an effective data augmentation technique to improve the accuracy of question answering systems, especially in low-resource domains. While recent pretrained and large language model-based QAG methods have made substantial progress, they face the critical issue of redundant QA pair generation, affecting downstream QA systems. Implicit diversity techniques such as… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Published at COLING 2024

  6. arXiv:2406.17652  [pdf, other

    cs.GR cs.CG cs.CV

    Time-varying Extremum Graphs

    Authors: Somenath Das, Raghavendra Sridharamurthy, Vijay Natarajan

    Abstract: We introduce time-varying extremum graph (TVEG), a topological structure to support visualization and analysis of a time-varying scalar field. The extremum graph is a substructure of the Morse-Smale complex. It captures the adjacency relationship between cells in the Morse decomposition of a scalar field. We define the TVEG as a time-varying extension of the extremum graph and demonstrate how it c… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  7. arXiv:2406.17249  [pdf, other

    cs.RO

    SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation

    Authors: Xu Liu, Jiuzhou Lei, Ankit Prabhu, Yuezhan Tao, Igor Spasojevic, Pratik Chaudhari, Nikolay Atanasov, Vijay Kumar

    Abstract: This paper develops a real-time decentralized metric-semantic Simultaneous Localization and Map** (SLAM) approach that leverages a sparse and lightweight object-based representation to enable a heterogeneous robot team to autonomously explore 3D environments featuring indoor, urban, and forested areas without relying on GPS. We use a hierarchical metric-semantic representation of the environment… ▽ More

    Submitted 2 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Preliminary release

  8. arXiv:2406.17163  [pdf, other

    cs.CL cs.AI cs.LG

    Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors

    Authors: Vikas Yadav, Zheng Tang, Vijay Srinivasan

    Abstract: Large language models (LLM) have achieved remarkable success in natural language generation but lesser focus has been given to their applicability in decision making tasks such as classification. We show that LLMs like LLaMa can achieve high performance on large multi-class classification tasks but still make classification errors and worse, generate out-of-vocabulary class labels. To address thes… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted at SIGIR 2024

  9. arXiv:2406.14190  [pdf, other

    cs.LO

    Extended Resolution Clause Learning via Dual Implication Points

    Authors: Sam Buss, Jonathan Chung, Vijay Ganesh, Albert Oliveras

    Abstract: We present a new extended resolution clause learning (ERCL) algorithm, implemented as part of a conflict-driven clause-learning (CDCL) SAT solver, wherein new variables are dynamically introduced as definitions for {\it Dual Implication Points} (DIPs) in the implication graph constructed by the solver at runtime. DIPs are generalizations of unique implication points and can be informally viewed as… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  10. arXiv:2406.12800  [pdf, other

    cs.CR

    Supporting Human Raters with the Detection of Harmful Content using Large Language Models

    Authors: Kurt Thomas, Patrick Gage Kelley, David Tao, Sarah Meiklejohn, Owen Vallis, Shunwen Tan, Blaž Bratanič, Felipe Tiengo Ferreira, Vijay Kumar Eranti, Elie Bursztein

    Abstract: In this paper, we explore the feasibility of leveraging large language models (LLMs) to automate or otherwise assist human raters with identifying harmful content including hate speech, harassment, violent extremism, and election misinformation. Using a dataset of 50,000 comments, we demonstrate that LLMs can achieve 90% accuracy when compared to human verdicts. We explore how to best leverage the… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  11. arXiv:2406.09905  [pdf, other

    cs.CV cs.GR

    Nymeria: A Massive Collection of Multimodal Egocentric Daily Motion in the Wild

    Authors: Lingni Ma, Yuting Ye, Fangzhou Hong, Vladimir Guzov, Yifeng Jiang, Rowan Postyeni, Luis Pesqueira, Alexander Gamino, Vijay Baiyya, Hyo ** Kim, Kevin Bailey, David Soriano Fosas, C. Karen Liu, Ziwei Liu, Jakob Engel, Renzo De Nardi, Richard Newcombe

    Abstract: We introduce Nymeria - a large-scale, diverse, richly annotated human motion dataset collected in the wild with multiple multimodal egocentric devices. The dataset comes with a) full-body 3D motion ground truth; b) egocentric multimodal recordings from Project Aria devices with RGB, grayscale, eye-tracking cameras, IMUs, magnetometer, barometer, and microphones; and c) an additional "observer" dev… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  12. arXiv:2406.09722  [pdf, other

    cs.CV cs.LG

    Cross-view geo-localization: a survey

    Authors: Abhilash Durgam, Sidike Paheding, Vikas Dhiman, Vijay Devabhaktuni

    Abstract: Cross-view geo-localization has garnered notable attention in the realm of computer vision, spurred by the widespread availability of copious geotagged datasets and the advancements in machine learning techniques. This paper provides a thorough survey of cutting-edge methodologies, techniques, and associated challenges that are integral to this domain, with a focus on feature-based and deep learni… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  13. arXiv:2406.09649  [pdf, other

    cs.OS

    SquirrelFS: using the Rust compiler to check file-system crash consistency

    Authors: Hayley LeBlanc, Nathan Taylor, James Bornholt, Vijay Chidambaram

    Abstract: This work introduces a new approach to building crash-safe file systems for persistent memory. We exploit the fact that Rust's typestate pattern allows compile-time enforcement of a specific order of operations. We introduce a novel crash-consistency mechanism, Synchronous Soft Updates, that boils down crash safety to enforcing ordering among updates to file-system metadata. We employ this approac… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  14. arXiv:2406.09631  [pdf, other

    cs.RO

    Optimal Convex Cover as Collision-free Space Approximation for Trajectory Generation

    Authors: Yuwei Wu, Igor Spasojevic, Pratik Chaudhari, Vijay Kumar

    Abstract: We propose an online iterative algorithm to find a suitable convex cover to under-approximate the free space for autonomous navigation to delineate Safe Flight Corridors (SFC). The convex cover consists of a set of polytopes such that the union of the polytopes represents obstacle-free space, allowing us to find trajectories for robots that lie within the convex cover. In order to find the SFC tha… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  15. arXiv:2406.07887  [pdf, other

    cs.LG cs.CL

    An Empirical Study of Mamba-based Language Models

    Authors: Roger Waleffe, Wonmin Byeon, Duncan Riach, Brandon Norick, Vijay Korthikanti, Tri Dao, Albert Gu, Ali Hatamizadeh, Sudhakar Singh, Deepak Narayanan, Garvit Kulshreshtha, Vartika Singh, Jared Casper, Jan Kautz, Mohammad Shoeybi, Bryan Catanzaro

    Abstract: Selective state-space models (SSMs) like Mamba overcome some of the shortcomings of Transformers, such as quadratic computational complexity with sequence length and large inference-time memory requirements from the key-value cache. Moreover, recent studies have shown that SSMs can match or exceed the language modeling capabilities of Transformers, making them an attractive alternative. In a contr… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  16. arXiv:2406.06025  [pdf, other

    cs.SE cs.CL cs.LG

    RepoQA: Evaluating Long Context Code Understanding

    Authors: Jiawei Liu, Jia Le Tian, Vijay Daita, Yuxiang Wei, Yifeng Ding, Yuhan Katherine Wang, Jun Yang, Lingming Zhang

    Abstract: Recent advances have been improving the context windows of Large Language Models (LLMs). To quantify the real long-context capabilities of LLMs, evaluators such as the popular Needle in a Haystack have been developed to test LLMs over a large chunk of raw texts. While effective, current evaluations overlook the insight of how LLMs work with long-context code, i.e., repositories. To this end, we in… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  17. arXiv:2406.04927  [pdf, other

    eess.AS cs.CL

    LLM-based speaker diarization correction: A generalizable approach

    Authors: Georgios Efstathiadis, Vijay Yadav, Anzar Abbas

    Abstract: Speaker diarization is necessary for interpreting conversations transcribed using automated speech recognition (ASR) tools. Despite significant developments in diarization methods, diarization accuracy remains an issue. Here, we investigate the use of large language models (LLMs) for diarization correction as a post-processing step. LLMs were fine-tuned using the Fisher corpus, a large dataset of… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  18. arXiv:2406.03699  [pdf, other

    cs.CL

    M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering

    Authors: Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler

    Abstract: There is vivid research on adapting Large Language Models (LLMs) to perform a variety of tasks in high-stakes domains such as healthcare. Despite their popularity, there is a lack of understanding of the extent and contributing factors that allow LLMs to recall relevant knowledge and combine it with presented information in the clinical and biomedical domain: a fundamental pre-requisite for succes… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 (Findings)

  19. arXiv:2406.03183  [pdf, other

    cs.CG cs.CV cs.GR

    Geometric Localization of Homology Cycles

    Authors: Amritendu Dhar, Vijay Natarajan, Abhishek Rathod

    Abstract: Computing an optimal cycle in a given homology class, also referred to as the homology localization problem, is known to be an NP-hard problem in general. Furthermore, there is currently no known optimality criterion that localizes classes geometrically and admits a stability property under the setting of persistent homology. We present a geometric optimization of the cycles that is computable in… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: To Appear in CCCG 2024 : Proc. 36th Canadian Conference on Computational Geometry

    ACM Class: I.3.5

  20. arXiv:2406.03162  [pdf, other

    eess.SP cs.IT

    The Curse of Beam-Squint in ISAC: Causes, Implications, and Mitigation Strategies

    Authors: Ahmet M. Elbir, Kumar Vijay Mishra, Abdulkadir Celik, Ahmed M. Eltawil

    Abstract: Integrated sensing and communications (ISAC) has emerged as a means to efficiently utilize spectrum and thereby save cost and power. At the higher end of the spectrum, ISAC systems operate at wideband using large antenna arrays to meet the stringent demands for high-resolution sensing and enhanced communications capacity. However, the wideband implementation entails beam-squint, that is, deviation… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted Paper in IEEE Communications Magazine

  21. arXiv:2406.02877  [pdf, other

    cs.LG cs.DC

    FedStaleWeight: Buffered Asynchronous Federated Learning with Fair Aggregation via Staleness Reweighting

    Authors: Jeffrey Ma, Alan Tu, Yiling Chen, Vijay Janapa Reddi

    Abstract: Federated Learning (FL) endeavors to harness decentralized data while preserving privacy, facing challenges of performance, scalability, and collaboration. Asynchronous Federated Learning (AFL) methods have emerged as promising alternatives to their synchronous counterparts bounded by the slowest agent, yet they add additional challenges in convergence guarantees, fairness with respect to compute… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  22. arXiv:2406.02290  [pdf, other

    cs.LG

    A Study of Optimizations for Fine-tuning Large Language Models

    Authors: Arjun Singh, Nikhil Pandey, Anup Shirgaonkar, Pavan Manoj, Vijay Aski

    Abstract: Fine-tuning large language models is a popular choice among users trying to adapt them for specific applications. However, fine-tuning these models is a demanding task because the user has to examine several factors, such as resource budget, runtime, model size and context length among others. A specific challenge is that fine-tuning is memory intensive, imposing constraints on the required hardwa… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures. Revised text for clarity, updated references

  23. arXiv:2406.01636  [pdf

    q-bio.QM cs.AI

    COVID-19: post infection implications in different age groups, mechanism, diagnosis, effective prevention, treatment, and recommendations

    Authors: Muhammad Akmal Raheem, Muhammad Ajwad Rahim, Ijaz Gul, Md. Reyad-ul-Ferdous, Liyan Le, Junguo Hui, Shuiwei Xia, Minjiang Chen, Dongmei Yu, Vijay Pandey, Peiwu Qin, Jiansong Ji

    Abstract: SARS-CoV-2, the highly contagious pathogen responsible for the COVID-19 pandemic, has persistent effects that begin four weeks after initial infection and last for an undetermined duration. These chronic effects are more harmful than acute ones. This review explores the long-term impact of the virus on various human organs, including the pulmonary, cardiovascular, neurological, reproductive, gastr… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  24. arXiv:2405.20213  [pdf, other

    cs.AI cs.CL cs.LG

    PostDoc: Generating Poster from a Long Multimodal Document Using Deep Submodular Optimization

    Authors: Vijay Jaisankar, Sambaran Bandyopadhyay, Kalp Vyas, Varre Chaitanya, Shwetha Somasundaram

    Abstract: A poster from a long input document can be considered as a one-page easy-to-read multimodal (text and images) summary presented on a nice template with good design elements. Automatic transformation of a long document into a poster is a very less studied but challenging task. It involves content summarization of the input document followed by template generation and harmonization. In this work, we… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  25. arXiv:2405.19597  [pdf, other

    cs.LG cs.AI cs.CL

    SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors

    Authors: Vijay Lingam, Atula Tejaswi, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Alex Dimakis, Eunsol Choi, Aleksandar Bojchevski, Sujay Sanghavi

    Abstract: Popular parameter-efficient fine-tuning (PEFT) methods, such as LoRA and its variants, freeze pre-trained model weights \(W\) and inject learnable matrices \(ΔW\). These \(ΔW\) matrices are structured for efficient parameterization, often using techniques like low-rank approximations or scaling vectors. However, these methods typically show a performance gap compared to full fine-tuning. Although… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages, 5 figures, 14 tables

  26. arXiv:2405.16661  [pdf, other

    cs.CL cs.AI cs.LG cs.LO

    RLSF: Reinforcement Learning via Symbolic Feedback

    Authors: Piyush Jha, Prithwish Jana, Arnav Arora, Vijay Ganesh

    Abstract: In recent years, large language models (LLMs) have had a dramatic impact on various sub-fields of AI, most notably on natural language understanding tasks. However, there is widespread agreement that the logical reasoning capabilities of contemporary LLMs are, at best, fragmentary (i.e., may work well on some problem instances but fail dramatically on others). While traditional LLM fine-tuning app… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  27. arXiv:2405.10391  [pdf, other

    cs.RO cs.AI eess.IV

    Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance

    Authors: Anish Bhattacharya, Nishanth Rao, Dhruv Parikh, Pratik Kunapuli, Nikolai Matni, Vijay Kumar

    Abstract: We demonstrate the capabilities of an attention-based end-to-end approach for high-speed quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional vision-based navigation via independent map**, p… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 10 figures, 3 tables

  28. arXiv:2405.07169  [pdf, other

    cs.RO

    Challenges and Opportunities for Large-Scale Exploration with Air-Ground Teams using Semantics

    Authors: Fernando Cladera, Ian D. Miller, Zachary Ravichandran, Varun Murali, Jason Hughes, M. Ani Hsieh, C. J. Taylor, Vijay Kumar

    Abstract: One common and desirable application of robots is exploring potentially hazardous and unstructured environments. Air-ground collaboration offers a synergistic approach to addressing such exploration challenges. In this paper, we demonstrate a system for large-scale exploration using a team of aerial and ground robots. Our system uses semantics as lingua franca, and relies on fully opportunistic co… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 6 pages, 5 figres

  29. arXiv:2405.06641  [pdf, other

    cs.IT

    On Existence of Latency Optimal Uncoded Storage Schemes in Geo-Distributed Data Storage Systems

    Authors: Srivathsa Acharya, P. Vijay Kumar, Viveck R. Cadambe

    Abstract: We consider the problem of geographically distributed data storage in a network of servers (or nodes) where the nodes are connected to each other via communication links having certain round-trip times (RTTs). Each node serves a specific set of clients, where a client can request for any of the files available in the distributed system. The parent node provides the requested file if available loca… ▽ More

    Submitted 13 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  30. arXiv:2405.06621  [pdf, other

    cs.IT

    On Streaming Codes for Simultaneously Correcting Burst and Random Erasures

    Authors: Shobhit Bhatnagar, Biswadip Chakraborty, P. Vijay Kumar

    Abstract: Streaming codes are packet-level codes that recover dropped packets within a strict decoding-delay constraint. We study streaming codes over a sliding-window (SW) channel model which admits only those erasure patterns which allow either a single burst erasure of $\le b$ packets along with $\le e$ random packet erasures, or else, $\le a$ random packet erasures, in any sliding-window of $w$ time slo… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  31. arXiv:2405.06606  [pdf, other

    cs.IT

    On Streaming Codes for Burst and Random Errors

    Authors: Shobhit Bhatnagar, P. Vijay Kumar

    Abstract: Streaming codes (SCs) are packet-level codes that recover erased packets within a strict decoding-delay deadline. Streaming codes for various packet erasure channel models such as sliding-window (SW) channel models that admit random or burst erasures in any SW of a fixed length have been studied in the literature, and the optimal rate as well as rate-optimal code constructions of SCs over such cha… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  32. arXiv:2405.05376  [pdf, other

    cs.CL

    Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages

    Authors: Nathaniel R. Robinson, Raj Dabre, Ammon Shurtz, Rasul Dent, Onenamiyi Onesi, Claire Bizon Monroc, Loïc Grobol, Hasan Muhammad, Ashi Garg, Naome A. Etori, Vijay Murari Tiyyala, Olanrewaju Samuel, Matthew Dean Stutzman, Bismarck Bamfo Odoom, Sanjeev Khudanpur, Stephen D. Richardson, Kenton Murray

    Abstract: A majority of language technologies are tailored for a small number of high-resource languages, while relatively many low-resource languages are neglected. One such group, Creole languages, have long been marginalized in academic study, though their speakers could benefit from machine translation (MT). These languages are predominantly used in much of Latin America, Africa and the Caribbean. We pr… ▽ More

    Submitted 13 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: NAACL 2024

  33. arXiv:2405.00892  [pdf, other

    cs.CV cs.AI

    Wake Vision: A Large-scale, Diverse Dataset and Benchmark Suite for TinyML Person Detection

    Authors: Colby Banbury, Emil Njor, Matthew Stewart, Pete Warden, Manjunath Kudlur, Nat Jeffries, Xenofon Fafoutis, Vijay Janapa Reddi

    Abstract: Tiny machine learning (TinyML), which enables machine learning applications on extremely low-power devices, suffers from limited size and quality of relevant datasets. To address this issue, we introduce Wake Vision, a large-scale, diverse dataset tailored for person detection, the canonical task for TinyML visual sensing. Wake Vision comprises over 6 million images, representing a hundredfold inc… ▽ More

    Submitted 6 June, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  34. arXiv:2404.18276  [pdf

    cs.CL cs.AI

    Bias Neutralization Framework: Measuring Fairness in Large Language Models with Bias Intelligence Quotient (BiQ)

    Authors: Malur Narayan, John Pasmore, Elton Sampaio, Vijay Raghavan, Gabriella Waters

    Abstract: The burgeoning influence of Large Language Models (LLMs) in sha** public discourse and decision-making underscores the imperative to address inherent biases within these AI systems. In the wake of AI's expansive integration across sectors, addressing racial bias in LLMs has never been more critical. This paper introduces a novel framework called Comprehensive Bias Neutralization Framework (CBNF)… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 41 pages

    ACM Class: D.1; I.2

  35. arXiv:2404.16244  [pdf, other

    cs.CY

    The Ethics of Advanced AI Assistants

    Authors: Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks, Verena Rieser, Hasan Iqbal, Nenad Tomašev, Ira Ktena, Zachary Kenton, Mikel Rodriguez, Seliem El-Sayed, Sasha Brown, Canfer Akbulut, Andrew Trask, Edward Hughes, A. Stevie Bergman, Renee Shelby, Nahema Marchal, Conor Griffin, Juan Mateos-Garcia, Laura Weidinger, Winnie Street, Benjamin Lange, Alex Ingerman, Alison Lentz , et al. (32 additional authors not shown)

    Abstract: This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, pro… ▽ More

    Submitted 28 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  36. arXiv:2404.16112  [pdf, other

    cs.LG cs.AI cs.CV cs.MM eess.IV

    Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges

    Authors: Badri Narayana Patro, Vijay Srinivas Agneeswaran

    Abstract: Sequence modeling is a crucial area across various domains, including Natural Language Processing (NLP), speech recognition, time series forecasting, music generation, and bioinformatics. Recurrent Neural Networks (RNNs) and Long Short Term Memory Networks (LSTMs) have historically dominated sequence modeling tasks like Machine Translation, Named Entity Recognition (NER), etc. However, the advance… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  37. arXiv:2404.15578  [pdf

    cs.CL

    Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations?

    Authors: Hossein Salami, Brandye Smith-Goettler, Vijay Yadav

    Abstract: General purpose Large Language Models (LLM) such as the Generative Pretrained Transformer (GPT) and Large Language Model Meta AI (LLaMA) have attracted much attention in recent years. There is strong evidence that these models can perform remarkably well in various natural language processing tasks. However, how to leverage them to approach domain-specific use cases and drive value remains an open… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 13 pages, 3 figures

  38. arXiv:2404.14643  [pdf, other

    cs.CR cs.CY cs.GR cs.NI cs.SI

    Teaching Network Traffic Matrices in an Interactive Game Environment

    Authors: Chasen Milner, Hayden Jananthan, Jeremy Kepner, Vijay Gadepally, Michael Jones, Peter Michaleas, Ritesh Patel, Sandeep Pisharody, Gabriel Wachman, Alex Pentland

    Abstract: The Internet has become a critical domain for modern society that requires ongoing efforts for its improvement and protection. Network traffic matrices are a powerful tool for understanding and analyzing networks and are broadly taught in online graph theory educational resources. Network traffic matrix concepts are rarely available in online computer network and cybersecurity educational resource… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 9 pages, 10 figures, 52 references; accepted to IEEE GrAPL

  39. arXiv:2404.14361  [pdf, other

    cs.CL

    Better Synthetic Data by Retrieving and Transforming Existing Datasets

    Authors: Saumya Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig

    Abstract: Despite recent advances in large language models, building dependable and deployable NLP models typically requires abundant, high-quality training data. However, task-specific data is not available for many use cases, and manually curating task-specific data is labor-intensive. Recent work has studied prompt-driven synthetic data generation using large language models, but these generated datasets… ▽ More

    Submitted 26 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: PDF fixed in v3

  40. arXiv:2404.09302  [pdf, other

    cs.LG cs.AI cs.DC

    High Significant Fault Detection in Azure Core Workload Insights

    Authors: Pranay Lohia, Laurent Boue, Sharath Rangappa, Vijay Agneeswaran

    Abstract: Azure Core workload insights have time-series data with different metric units. Faults or Anomalies are observed in these time-series data owing to faults observed with respect to metric name, resources region, dimensions, and its dimension value associated with the data. For Azure Core, an important task is to highlight faults or anomalies to the user on a dashboard that they can perceive easily.… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  41. arXiv:2404.07880  [pdf, other

    cs.RO

    Multi-Robot Target Tracking with Sensing and Communication Danger Zones

    Authors: Jiazhen Liu, Peihan Li, Yuwei Wu, Gaurav S. Sukhatme, Vijay Kumar, Lifeng Zhou

    Abstract: Multi-robot target tracking finds extensive applications in different scenarios, such as environmental surveillance and wildfire management, which require the robustness of the practical deployment of multi-robot systems in uncertain and dangerous environments. Traditional approaches often focus on the performance of tracking accuracy with no modeling and assumption of the environments, neglecting… ▽ More

    Submitted 20 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  42. arXiv:2404.04627  [pdf, other

    cs.CV

    Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement

    Authors: Zaid Khan, Vijay Kumar BG, Samuel Schulter, Yun Fu, Manmohan Chandraker

    Abstract: Visual program synthesis is a promising approach to exploit the reasoning abilities of large language models for compositional computer vision tasks. Previous work has used few-shot prompting with frozen LLMs to synthesize visual programs. Training an LLM to write better visual programs is an attractive prospect, but it is unclear how to accomplish this. No dataset of visual programs for training… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  43. arXiv:2404.03753  [pdf, other

    cs.LO cs.AI cs.LG

    A Reinforcement Learning based Reset Policy for CDCL SAT Solvers

    Authors: Chunxiao Li, Charlie Liu, Jonathan Chung, Zhengyang Lu, Piyush Jha, Vijay Ganesh

    Abstract: Restart policy is an important technique used in modern Conflict-Driven Clause Learning (CDCL) solvers, wherein some parts of the solver state are erased at certain intervals during the run of the solver. In most solvers, variable activities are preserved across restart boundaries, resulting in solvers continuing to search parts of the assignment tree that are not far from the one immediately prio… ▽ More

    Submitted 19 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  44. arXiv:2404.02807  [pdf, other

    physics.med-ph cs.AI

    An Optimization Framework to Personalize Passive Cardiac Mechanics

    Authors: Lei Shi, Ian Chen, Hiroo Takayama, Vijay Vedula

    Abstract: Personalized cardiac mechanics modeling is a powerful tool for understanding the biomechanics of cardiac function in health and disease and assisting in treatment planning. However, current models are limited to using medical images acquired at a single cardiac phase, often limiting their applicability for processing dynamic image acquisitions. This study introduces an inverse finite element analy… ▽ More

    Submitted 5 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  45. arXiv:2404.00769  [pdf, other

    cs.RO

    An Active Perception Game for Robust Autonomous Exploration

    Authors: Siming He, Yuezhan Tao, Igor Spasojevic, Vijay Kumar, Pratik Chaudhari

    Abstract: We formulate active perception for an autonomous agent that explores an unknown environment as a two-player zero-sum game: the agent aims to maximize information gained from the environment while the environment aims to minimize the information gained by the agent. In each episode, the environment reveals a set of actions with their potentially erroneous information gain. In order to select the be… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  46. arXiv:2404.00213  [pdf, other

    cs.CL

    Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning

    Authors: Nick Mecklenburg, Yiyou Lin, Xiaoxiao Li, Daniel Holstein, Leonardo Nunes, Sara Malvar, Bruno Silva, Ranveer Chandra, Vijay Aski, Pavan Kumar Reddy Yannam, Tolga Aktas, Todd Hendry

    Abstract: In recent years, Large Language Models (LLMs) have shown remarkable performance in generating human-like text, proving to be a valuable asset across various applications. However, adapting these models to incorporate new, out-of-domain knowledge remains a challenge, particularly for facts and events that occur after the model's knowledge cutoff date. This paper investigates the effectiveness of Su… ▽ More

    Submitted 2 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: 16 pages; 7 figures. updated authors list

  47. arXiv:2403.19120  [pdf, other

    cs.IT eess.SP

    Co-Designing Statistical MIMO Radar and In-band Full-Duplex Multi-User MIMO Communications -- Part III: Multi-Target Tracking

    Authors: Sk Nayemuzzaman, Kumar Vijay Mishra, Jiawei Liu, Mohammad Saquib

    Abstract: As a next-generation wireless technology, the in-band full-duplex (IBFD) transmission enables simultaneous transmission and reception of signals over the same frequency, thereby doubling spectral efficiency. Further, a continuous up-scaling of wireless network carrier frequencies arising from ever-increasing data traffic is driving research on integrated sensing and communications (ISAC) systems.… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 29 pages, 8 figures, 1 table

  48. arXiv:2403.19119  [pdf, other

    cs.IT eess.SP

    Co-Designing Statistical MIMO Radar and In-band Full-Duplex Multi-User MIMO Communications -- Part II: Joint Precoder, Radar Code, and Receive Filters Design

    Authors: Jiawei Liu, Kumar Vijay Mishra, Mohammad Saquib

    Abstract: We address the challenge of spectral sharing between a statistical multiple-input multiple-output (MIMO) radar and an in-band full-duplex (IBFD) multi-user MIMO (MU-MIMO) communications system operating simultaneously in the same frequency band. Existing research on joint MIMO-radar-MIMO-communications (MRMC) systems has limitations, such as focusing on colocated MIMO radars, half-duplex MIMO comm… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 25 pages, 5 figures. arXiv admin note: text overlap with arXiv:2006.14774

  49. arXiv:2403.18063  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Heracles: A Hybrid SSM-Transformer Model for High-Resolution Image and Time-Series Analysis

    Authors: Badri N. Patro, Suhas Ranganath, Vinay P. Namboodiri, Vijay S. Agneeswaran

    Abstract: Transformers have revolutionized image modeling tasks with adaptations like DeIT, Swin, SVT, Biformer, STVit, and FDVIT. However, these models often face challenges with inductive bias and high quadratic complexity, making them less efficient for high-resolution images. State space models (SSMs) such as Mamba, V-Mamba, ViM, and SiMBA offer an alternative to handle high resolution images in compute… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  50. arXiv:2403.17067  [pdf, other

    cs.RO

    Trajectory Optimization with Global Yaw Parameterization for Field-of-View Constrained Autonomous Flight

    Authors: Yuwei Wu, Yuezhan Tao, Igor Spasojevic, Vijay Kumar

    Abstract: Trajectory generation for quadrotors with limited field-of-view sensors has numerous applications such as aerial exploration, coverage, inspection, videography, and target tracking. Most previous works simplify the task of optimizing yaw trajectories by either aligning the heading of the robot with its velocity, or potentially restricting the feasible space of candidate trajectories by using a lim… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.