Showing 1–50 of 570 results for author: Arjun

Searching in archive cs.
  1. arXiv:2407.08242

    cs.ET

    Efficient Reinforcement Learning On Passive RRAM Crossbar Array

    Authors: Arjun Tyagi, Shubham Sahay

    Abstract: The unprecedented growth in the field of machine learning has led to the development of deep neuromorphic networks trained on labelled dataset with capability to mimic or even exceed human capabilities. However, for applications involving continuous decision making in unknown environments, such as rovers for space exploration, robots, unmanned aerial vehicles, etc., explicit supervision and genera…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 5 pages, 5 figures

  2. arXiv:2407.08185

    cs.CR cs.CL cs.CY cs.NI

    Automatic Generation of Web Censorship Probe Lists

    Authors: Jenny Tang, Leo Alvarez, Arjun Brar, Nguyen Phong Hoang, Nicolas Christin

    Abstract: Domain probe lists--used to determine which URLs to probe for Web censorship--play a critical role in Internet censorship measurement studies. Indeed, the size and accuracy of the domain probe list limits the set of censored pages that can be detected; inaccurate lists can lead to an incomplete view of the censorship landscape or biased results. Previous efforts to generate domain probe lists have…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: To appear in the Proceedings on Privacy Enhancing Technologies 2024

  3. arXiv:2407.06939

    cs.RO cs.CV

    Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

    Authors: Sriram Yenamandra, Arun Ramachandran, Mukul Khanna, Karmesh Yadav, Jay Vakil, Andrew Melnik, Michael Büttner, Leon Harz, Lyon Brown, Gora Chand Nandi, Arjun PS, Gaurav Kumar Yadav, Rahul Kala, Robert Haschke, Yang Luo, **xin Zhu, Yansen Han, Bingyi Lu, Xuan Gu, Qinyuan Liu, Ya** Zhao, Qiting Ye, Chenxiao Dou, Yansong Chua, Volodymyr Kuzma, et al. (20 additional authors not shown)

    Abstract: In order to develop robots that can effectively serve as versatile and capable home assistants, it is crucial for them to reliably perceive and interact with a wide variety of objects across diverse environments. To this end, we proposed Open Vocabulary Mobile Manipulation as a key benchmark task for robotics: finding any object in a novel environment and placing it on any receptacle surface withi…

    Submitted 9 July, 2024; originally announced July 2024.

  4. arXiv:2407.06324

    cs.LG cs.CL cs.NE

    B'MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading Memory

    Authors: Luca Zancato, Arjun Seshadri, Yonatan Dukler, Aditya Golatkar, Yantao Shen, Benjamin Bowman, Matthew Trager, Alessandro Achille, Stefano Soatto

    Abstract: We describe a family of architectures to support transductive inference by allowing memory to grow to a finite but a-priori unknown bound while making efficient use of finite resources for inference. Current architectures use such resources to represent data either eidetically over a finite span ("context" in Transformers), or fading over an infinite span (in State Space Models, or SSMs). Recent h…

    Submitted 8 July, 2024; originally announced July 2024.

  5. arXiv:2407.04620

    cs.LG cs.AI cs.CL

    Learning to (Learn at Test Time): RNNs with Expressive Hidden States

    Authors: Yu Sun, Xinhao Li, Karan Dalal, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen, Xiaolong Wang, Sanmi Koyejo, Tatsunori Hashimoto, Carlos Guestrin

    Abstract: Self-attention performs well in long context but has quadratic complexity. Existing RNN layers have linear complexity, but their performance in long context is limited by the expressive power of their hidden state. We propose a new class of sequence modeling layers with linear complexity and an expressive hidden state. The key idea is to make the hidden state a machine learning model itself, and t…

    Submitted 5 July, 2024; originally announced July 2024.

  6. arXiv:2407.04108

    cs.CR cs.CL cs.LG

    Future Events as Backdoor Triggers: Investigating Temporal Vulnerabilities in LLMs

    Authors: Sara Price, Arjun Panickssery, Sam Bowman, Asa Cooper Stickland

    Abstract: Backdoors are hidden behaviors that are only triggered once an AI system has been deployed. Bad actors looking to create successful backdoors must design them to avoid activation during training and evaluation. Since data used in these stages often only contains information about events that have already occurred, a component of a simple backdoor trigger could be a model recognizing data that is i…

    Submitted 4 July, 2024; originally announced July 2024.

  7. arXiv:2407.03864

    cs.LG cs.AI

    Adversarial Robustness of VAEs across Intersectional Subgroups

    Authors: Chethan Krishnamurthy Ramanaik, Arjun Roy, Eirini Ntoutsi

    Abstract: Despite advancements in Autoencoders (AEs) for tasks like dimensionality reduction, representation learning and data generation, they remain vulnerable to adversarial attacks. Variational Autoencoders (VAEs), with their probabilistic approach to disentangling latent spaces, show stronger resistance to such perturbations compared to deterministic AEs; however, their resilience against adversarial i…

    Submitted 4 July, 2024; originally announced July 2024.

  8. arXiv:2407.03068

    cs.NI cs.AI

    xApp Distillation: AI-based Conflict Mitigation in B5G O-RAN

    Authors: Hakan Erdol, Xiaoyang Wang, Robert Piechocki, George Oikonomou, Arjun Parekh

    Abstract: The advancements of machine learning-based (ML) decision-making algorithms created various research and industrial opportunities. One of these areas is ML-based near-real-time network management applications (xApps) in Open-Radio Access Network (O-RAN). Normally, xApps are designed solely for the desired objectives, and fine-tuned for deployment. However, telecommunication companies can employ mul…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 5 Pages, 4 figures

  9. arXiv:2407.02514

    cs.LO cs.AI cs.CL

    LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations

    Authors: Shashank Kirtania, Priyanshu Gupta, Arjun Radhakrishna

    Abstract: In this paper we examine the limitations of Large Language Models (LLMs) for complex reasoning tasks. Although recent works have started to employ formal languages as an intermediate representation for reasoning tasks, they often face challenges in accurately generating and refining these formal specifications to ensure correctness. To address these issues, this paper proposes Logic-LM++, an impro…

    Submitted 4 July, 2024; v1 submitted 22 June, 2024; originally announced July 2024.

  10. arXiv:2406.14020

    cs.CR cs.AI cs.ET cs.NI

    Leveraging eBPF and AI for Ransomware Nose Out

    Authors: Arjun Sekar, Sameer G. Kulkarni, Joy Kuri

    Abstract: In this work, we propose a two-phased approach for real-time detection and deterrence of ransomware. To achieve this, we leverage the capabilities of eBPF (Extended Berkeley Packet Filter) and artificial intelligence to develop both proactive and reactive methods. In the first phase, we utilize signature based detection, where we employ custom eBPF programs to trace the execution of new processes…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 7 pages

  11. arXiv:2406.14012

    cs.CL cs.AI

    Seeing Through AI's Lens: Enhancing Human Skepticism Towards LLM-Generated Fake News

    Authors: Navid Ayoobi, Sadat Shahriar, Arjun Mukherjee

    Abstract: LLMs offer valuable capabilities, yet they can be utilized by malicious users to disseminate deceptive information and generate fake news. The growing prevalence of LLMs poses difficulties in crafting detection approaches that remain effective across various text domains. Additionally, the absence of precautionary measures for AI-generated news on online social platforms is concerning. Therefore,…

    Submitted 20 June, 2024; originally announced June 2024.

  12. arXiv:2406.11598

    cs.CL cs.CY

    Understanding "Democratization" in NLP and ML Research

    Authors: Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat

    Abstract: Recent improvements in natural language processing (NLP) and machine learning (ML) and increased mainstream adoption have led to researchers frequently discussing the "democratization" of artificial intelligence. In this paper, we seek to clarify how democratization is understood in NLP and ML publications, through large-scale mixed-methods analyses of papers using the keyword "democra*" published…

    Submitted 17 June, 2024; originally announced June 2024.

  13. arXiv:2406.08431

    cs.CV cs.AI cs.CR cs.LG

    Diffusion Soup: Model Merging for Text-to-Image Diffusion Models

    Authors: Benjamin Biggs, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

    Abstract: We present Diffusion Soup, a compartmentalization method for Text-to-Image Generation that averages the weights of diffusion models trained on sharded data. By construction, our approach enables training-free continual learning and unlearning with no additional memory or inference costs, since models corresponding to data shards can be added or removed by re-averaging. We show that Diffusion Soup…

    Submitted 12 June, 2024; originally announced June 2024.

  14. arXiv:2406.06613

    cs.CL cs.AI

    GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents

    Authors: Anthony Costarelli, Mat Allen, Roman Hauksson, Grace Sodunke, Suhas Hariharan, Carlson Cheng, Wenjie Li, Arjun Yadav

    Abstract: Large language models have demonstrated remarkable few-shot performance on many natural language understanding tasks. Despite several demonstrations of using large language models in complex, strategic scenarios, there lacks a comprehensive framework for evaluating agents' performance across various types of reasoning found in games. To address this gap, we introduce GameBench, a cross-domain benc…

    Submitted 6 June, 2024; originally announced June 2024.

  15. arXiv:2406.03415

    cs.HC

    RemixTape: Enriching Narratives about Metrics with Semantic Alignment and Contextual Recommendation

    Authors: Matthew Brehmer, Margaret Drouhard, Arjun Srinivasan

    Abstract: The temporal dynamics of quantitative metrics or key performance indicators (KPIs) are central to decision making within enterprise organizations. Recently, major business intelligence providers have introduced new infrastructure for defining, sharing, and monitoring metric values. However, these values are often presented in isolation and appropriate context is seldom externalized. In this design…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: SUPPLEMENTAL MATERIAL. The scenario video (Sec. 4.4): https://youtu.be/9h6aHvqg9vA; the tutorial video (Sec. 5.2): https://youtu.be/uZPZ5-oiqZk

  16. arXiv:2406.02784

    cs.NI

    Feasibility of State Space Models for Network Traffic Generation

    Authors: Andrew Chu, Xi Jiang, Shinan Liu, Arjun Bhagoji, Francesco Bronzino, Paul Schmitt, Nick Feamster

    Abstract: Many problems in computer networking rely on parsing collections of network traces (e.g., traffic prioritization, intrusion detection). Unfortunately, the availability and utility of these collections is limited due to privacy concerns, data staleness, and low representativeness. While methods for generating data to augment collections exist, they often fall short in replicating the quality of rea…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 7 pages, 3 figures, 4 tables

  17. arXiv:2406.02778

    cs.LG

    MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

    Authors: Shay Deutsch, Lionel Yelibi, Alex Tong Lin, Arjun Ravi Kannan

    Abstract: Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence…

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  18. arXiv:2406.02290

    cs.LG

    A Study of Optimizations for Fine-tuning Large Language Models

    Authors: Arjun Singh, Nikhil Pandey, Anup Shirgaonkar, Pavan Manoj, Vijay Aski

    Abstract: Fine-tuning large language models is a popular choice among users trying to adapt them for specific applications. However, fine-tuning these models is a demanding task because the user has to examine several factors, such as resource budget, runtime, model size and context length among others. A specific challenge is that fine-tuning is memory intensive, imposing constraints on the required hardwa…

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures. Revised text for clarity, updated references

  19. arXiv:2406.00724

    cs.HC cs.RO

    Exploring Child-Robot Interaction in Individual and Group settings in India

    Authors: Gayathri Manikutty, Sai Ankith Potapragada, Devasena Pasupuleti, Mahesh S. Unnithan, Arjun Venugopal, Pranav Prabha, Arunav H., Vyshnavi Anil Kumar, Rthuraj P. R., Rao R Bhavani

    Abstract: This study evaluates the effectiveness of child-robot interactions with the HaKsh-E social robot in India, examining both individual and group interaction settings. The research centers on game-based interactions designed to teach hand hygiene to children aged 7-11. Utilizing video analysis, rubric assessments, and post-study questionnaires, the study gathered data from 36 participants. Findings i…

    Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 6 pages, 6 figures, Accepted for presentation at ICRAS 2024 (https://www.icras.org/)

  20. arXiv:2405.20179

    cs.CL cs.AI cs.RO

    Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning CodeLLMs

    Authors: Zichao Hu, Junyi Jessy Li, Arjun Guha, Joydeep Biswas

    Abstract: Large language models (LLMs) have shown great promise at generating robot programs from natural language given domain-specific robot application programming interfaces (APIs). However, the performance gap between proprietary LLMs and smaller open-weight LLMs remains wide. This raises a question: Can we fine-tune smaller open-weight LLMs for generating domain-specific robot programs to close the pe…

    Submitted 30 May, 2024; originally announced May 2024.

  21. arXiv:2405.17159

    cs.CL cs.CY cs.HC

    Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP

    Authors: Vagrant Gautam, Arjun Subramonian, Anne Lauscher, Os Keyes

    Abstract: Personal names simultaneously differentiate individuals and categorize them in ways that are important in a given society. While the natural language processing community has thus associated personal names with sociodemographic characteristics in a variety of tasks, researchers have engaged to varying degrees with the established methodological problems in doing so. To guide future work, we presen…

    Submitted 27 May, 2024; originally announced May 2024.

  22. arXiv:2405.13009

    cs.CL cs.AI

    METAREFLECTION: Learning Instructions for Language Agents using Past Reflections

    Authors: Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Sherry Shi, Gustavo Soares

    Abstract: Despite the popularity of Large Language Models (LLMs), crafting specific prompts for LLMs to perform particular tasks remains challenging. Users often engage in multiple conversational turns with an LLM-based agent to accomplish their intended task. Recent studies have demonstrated that linguistic feedback, in the form of self-reflections generated by the model, can work as reinforcement during t…

    Submitted 13 May, 2024; originally announced May 2024.

  23. arXiv:2405.12979

    cs.CV

    OmniGlue: Generalizable Feature Matching with Foundation Model Guidance

    Authors: Hanwen Jiang, Arjun Karpur, Bingyi Cao, Qixing Huang, Andre Araujo

    Abstract: The image matching field has been witnessing a continuous emergence of novel learnable feature matching techniques, with ever-improving performance on conventional benchmarks. However, our investigation shows that despite these gains, their potential for real-world applications is restricted by their limited generalization capabilities to novel image domains. In this paper, we introduce OmniGlue,…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  24. arXiv:2405.11659

    cs.RO cs.CV cs.LG

    Auto-Platoon : Freight by example

    Authors: Tharun V. Puthanveettil, Abhijay Singh, Yashveer Jain, Vinay Bukka, Sameer Arjun S

    Abstract: The work introduces a bio-inspired leader-follower system based on an innovative mechanism proposed as software latching that aims to improve collaboration and coordination between a leader agent and the associated autonomous followers. The system utilizes software latching to establish real-time communication and synchronization between the leader and followers. A layered architecture is proposed…

    Submitted 19 May, 2024; originally announced May 2024.

  25. arXiv:2405.11244

    cs.SC cs.PL

    Strided Difference Bound Matrices

    Authors: Arjun Pitchanathan, Albert Cohen, Oleksandr Zinenko, Tobias Grosser

    Abstract: A wide range of symbolic analysis and optimization problems can be formalized using polyhedra. Sub-classes of polyhedra, also known as sub-polyhedral domains, are sought for their lower space and time complexity. We introduce the Strided Difference Bound Matrix (SDBM) domain, which represents a sweet spot in the context of optimizing compilers. Its expressiveness and efficient algorithms are parti…

    Submitted 4 July, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: Preprint and extended from the CAV 2024 conference version. Fixed issue in arxiv version where URLs were not wrapped

  26. arXiv:2405.07518

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain, et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert…

    Submitted 13 May, 2024; originally announced May 2024.

  27. "Community Guidelines Make this the Best Party on the Internet": An In-Depth Study of Online Platforms' Content Moderation Policies

    Authors: Brennan Schaffner, Arjun Nitin Bhagoji, Siyuan Cheng, Jacqueline Mei, Jay L. Shen, Grace Wang, Marshini Chetty, Nick Feamster, Genevieve Lakier, Chenhao Tan

    Abstract: Moderating user-generated content on online platforms is crucial for balancing user safety and freedom of speech. Particularly in the United States, platforms are not subject to legal constraints prescribing permissible content. Each platform has thus developed bespoke content moderation policies, but there is little work towards a comparative understanding of these policies across platforms and t…

    Submitted 8 May, 2024; originally announced May 2024.

  28. arXiv:2405.02790

    cs.CR cs.LG

    Confidential and Protected Disease Classifier using Fully Homomorphic Encryption

    Authors: Aditya Malik, Nalini Ratha, Bharat Yalavarthi, Tilak Sharma, Arjun Kaushik, Charanjit Jutla

    Abstract: With the rapid surge in the prevalence of Large Language Models (LLMs), individuals are increasingly turning to conversational AI for initial insights across various domains, including health-related inquiries such as disease diagnosis. Many users seek potential causes on platforms like ChatGPT or Bard before consulting a medical professional for their ailment. These platforms offer valuable benef…

    Submitted 4 May, 2024; originally announced May 2024.

  29. arXiv:2405.00172

    cs.LG cs.SI stat.ML

    Re-visiting Skip-Gram Negative Sampling: Dimension Regularization for More Efficient Dissimilarity Preservation in Graph Embeddings

    Authors: David Liu, Arjun Seshadri, Tina Eliassi-Rad, Johan Ugander

    Abstract: A wide range of graph embedding objectives decompose into two components: one that attracts the embeddings of nodes that are perceived as similar, and another that repels embeddings of nodes that are perceived as dissimilar. Because real-world graphs are sparse and the number of dissimilar pairs grows quadratically with the number of nodes, Skip-Gram Negative Sampling (SGNS) has emerged as a popul…

    Submitted 30 April, 2024; originally announced May 2024.

  30. arXiv:2404.17225

    cs.CR cs.AI cs.RO

    Enhancing Privacy and Security of Autonomous UAV Navigation

    Authors: Vatsal Aggarwal, Arjun Ramesh Kaushik, Charanjit Jutla, Nalini Ratha

    Abstract: Autonomous Unmanned Aerial Vehicles (UAVs) have become essential tools in defense, law enforcement, disaster response, and product delivery. These autonomous navigation systems require a wireless communication network, and of late are deep learning based. In critical scenarios such as border protection or disaster response, ensuring the secure navigation of autonomous UAVs is paramount. But, these…

    Submitted 26 April, 2024; originally announced April 2024.

  31. arXiv:2404.16255

    cs.CR cs.CV

    Enhancing Privacy in Face Analytics Using Fully Homomorphic Encryption

    Authors: Bharat Yalavarthi, Arjun Ramesh Kaushik, Arun Ross, Vishnu Boddeti, Nalini Ratha

    Abstract: Modern face recognition systems utilize deep neural networks to extract salient features from a face. These features denote embeddings in latent space and are often stored as templates in a face recognition system. These embeddings are susceptible to data leakage and, in some cases, can even be used to reconstruct the original face image. To prevent compromising identities, template protection sch…

    Submitted 24 April, 2024; originally announced April 2024.

  32. arXiv:2404.16216

    cs.CV cs.RO cs.SD eess.AS

    ActiveRIR: Active Audio-Visual Exploration for Acoustic Environment Modeling

    Authors: Arjun Somayazulu, Sagnik Majumder, Changan Chen, Kristen Grauman

    Abstract: An environment acoustic model represents how sound is transformed by the physical characteristics of an indoor environment, for any given source/receiver location. Traditional methods for constructing acoustic models involve expensive and time-consuming collection of large quantities of acoustic data at dense spatial locations in the space, or rely on privileged knowledge of scene geometry to inte…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Project page: https://vision.cs.utexas.edu/projects/active_rir/

  33. arXiv:2404.13076

    cs.CL cs.AI

    LLM Evaluators Recognize and Favor Their Own Generations

    Authors: Arjun Panickssery, Samuel R. Bowman, Shi Feng

    Abstract: Self-evaluation using large language models (LLMs) has proven valuable not only in benchmarking but also methods like reward modeling, constitutional AI, and self-refinement. But new biases are introduced due to the same LLM acting as both the evaluator and the evaluatee. One such bias is self-preference, where an LLM evaluator scores its own outputs higher than others' while human annotators cons…

    Submitted 15 April, 2024; originally announced April 2024.

  34. arXiv:2404.13008

    cs.SD eess.AS

    Enhancing Generalization in Audio Deepfake Detection: A Neural Collapse based Sampling and Training Approach

    Authors: Mohammed Yousif, Jonat John Mathew, Huzaifa Pallan, Agamjeet Singh Padda, Syed Daniyal Shah, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

    Abstract: Generalization in audio deepfake detection presents a significant challenge, with models trained on specific datasets often struggling to detect deepfakes generated under varying conditions and unknown algorithms. While collectively training a model using diverse datasets can enhance its generalization ability, it comes with high computational costs. To address this, we propose a neural collapse-b…

    Submitted 19 April, 2024; originally announced April 2024.

  35. arXiv:2404.08017

    cs.CV cs.AI

    AI-Guided Feature Segmentation Techniques to Model Features from Single Crystal Diamond Growth

    Authors: Rohan Reddy Mekala, Elias Garratt, Matthias Muehle, Arjun Srinivasan, Adam Porter, Mikael Lindvall

    Abstract: Process refinement to consistently produce high-quality material over a large area of the grown crystal, enabling various applications from optics crystals to quantum detectors, has long been a goal for diamond growth. Machine learning offers a promising path toward this goal, but faces challenges such as the complexity of features within datasets, their time-dependency, and the volume of data pro…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures, ACMME 2024. arXiv admin note: substantial text overlap with arXiv:2404.07306

  36. arXiv:2404.07306

    cs.CV cs.AI

    AI-Guided Defect Detection Techniques to Model Single Crystal Diamond Growth

    Authors: Rohan Reddy Mekala, Elias Garratt, Matthias Muehle, Arjun Srinivasan, Adam Porter, Mikael Lindvall

    Abstract: From a process development perspective, diamond growth via chemical vapor deposition has made significant strides. However, challenges persist in achieving high quality and large-area material production. These difficulties include controlling conditions to maintain uniform growth rates for the entire growth surface. As growth progresses, various factors or defect states emerge, altering the unifo…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures, ACMME 2024

  37. arXiv:2404.07228

    cs.AR

    Block-SSD: A New Block-Based Blocking SSD Architecture

    Authors: Ryan Wong, Arjun Tyagi, Sungjun Cho, Pratik Sampat, Yiqiu Sun

    Abstract: Computer science and related fields (e.g., computer engineering, computer hardware engineering, electrical engineering, electrical and computer engineering, computer systems engineering) often draw inspiration from other fields, areas, and the real world in order to describe topics in their area. One cross-domain example is the idea of a block. The idea of blocks comes in many flavors, including s…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: This is an April Fools submission

  38. arXiv:2404.06407

    cs.CL cs.AI cs.CR cs.LG

    Rethinking How to Evaluate Language Model Jailbreak

    Authors: Hongyu Cai, Arjun Arunasalam, Leo Y. Lin, Antonio Bianchi, Z. Berkay Celik

    Abstract: Large language models (LLMs) have become increasingly integrated with various applications. To ensure that LLMs do not generate unsafe responses, they are aligned with safeguards that specify what content is restricted. However, such alignment can be bypassed to produce prohibited content using a technique commonly referred to as jailbreak. Different systems have been proposed to perform the jailb…

    Submitted 7 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  39. arXiv:2404.03139

    cs.LG cs.SI

    Theoretical and Empirical Insights into the Origins of Degree Bias in Graph Neural Networks

    Authors: Arjun Subramonian, Jian Kang, Yizhou Sun

    Abstract: Graph Neural Networks (GNNs) often perform better for high-degree nodes than low-degree nodes on node classification tasks. This degree bias can reinforce social marginalization by, e.g., sidelining authors of lowly-cited papers when predicting paper topics in citation networks. While researchers have proposed numerous hypotheses for why GNN degree bias occurs, we find via a survey of 38 degree bi…

    Submitted 3 April, 2024; originally announced April 2024.

  40. arXiv:2404.01903

    cs.CL cs.LG cs.PL

    Activation Steering for Robust Type Prediction in CodeLLMs

    Authors: Francesca Lucchetti, Arjun Guha

    Abstract: Contemporary LLMs pretrained on code are capable of succeeding at a wide variety of programming tasks. However, their performance is very sensitive to syntactic features, such as the names of variables and types, the structure of code, and presence of type hints. We contribute an inference-time technique to make CodeLLMs more robust to syntactic distractors that are semantically irrelevant. Our me…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 16 pages, 7 figures

  41. arXiv:2404.01030

    cs.CV cs.AI cs.CY

    Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation

    Authors: Yixin Wan, Arjun Subramonian, Anaelia Ovalle, Zongyu Lin, Ashima Suvarna, Christina Chance, Hritik Bansal, Rebecca Pattichis, Kai-Wei Chang

    Abstract: The recent advancement of large and powerful models with Text-to-Image (T2I) generation abilities -- such as OpenAI's DALLE-3 and Google's Gemini -- enables users to generate high-quality images from textual prompts. However, it has become increasingly evident that even simple prompts could cause T2I models to exhibit conspicuous social bias in generated images. Such bias might lead to both alloca…

    Submitted 1 May, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  42. arXiv:2404.00318

    cs.RO cs.CV

    Exploring Unseen Environments with Robots using Large Language and Vision Models through a Procedurally Generated 3D Scene Representation

    Authors: Arjun P S, Andrew Melnik, Gora Chand Nandi

    Abstract: Recent advancements in Generative Artificial Intelligence, particularly in the realm of Large Language Models (LLMs) and Large Vision Language Models (LVLMs), have enabled the prospect of leveraging cognitive planners within robotic systems. This work focuses on solving the object goal navigation problem by mimicking human cognition to attend, perceive and store task specific information and gener…

    Submitted 30 March, 2024; originally announced April 2024.

  43. DataAgent: Evaluating Large Language Models' Ability to Answer Zero-Shot, Natural Language Queries

    Authors: Manit Mishra, Abderrahman Braham, Charles Marsom, Bryan Chung, Gavin Griffin, Dakshesh Sidnerlikar, Chatanya Sarin, Arjun Rajaram

    Abstract: Conventional processes for analyzing datasets and extracting meaningful information are often time-consuming and laborious. Previous work has identified manual, repetitive coding and data collection as major obstacles that hinder data scientists from undertaking more nuanced labor and high-level projects. To combat this, we evaluated OpenAI's GPT-3.5 as a "Language Data Scientist" (LDS) that can e…

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: 5 pages, Submitted to International Conference on AI in Cybersecurity

  44. arXiv:2403.11778  [pdf, other]

    cs.SD cs.CR cs.LG eess.AS

    Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

    Authors: Jonat John Mathew, Rakin Ahsan, Sae Furukawa, Jagdish Gautham Krishna Kumar, Huzaifa Pallan, Agamjeet Singh Padda, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

    Abstract: Deepfake audio poses a rising threat in communication platforms, necessitating real-time detection for audio stream integrity. Unlike traditional non-real-time approaches, this study assesses the viability of employing static deepfake audio detection models in real-time communication platforms. An executable software is developed for cross-platform compatibility, enabling real-time execution. Two…

    Submitted 18 March, 2024; originally announced March 2024.

  45. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  46. arXiv:2402.19432  [pdf, other]

    cs.RO

    Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation

    Authors: Jonathan Yang, Catherine Glossop, Arjun Bhorkar, Dhruv Shah, Quan Vuong, Chelsea Finn, Dorsa Sadigh, Sergey Levine

    Abstract: Recent years in robotics and imitation learning have shown remarkable progress in training large-scale foundation models by leveraging data across a multitude of embodiments. The success of such policies might lead us to wonder: just how diverse can the robots in the training set be while still facilitating positive transfer? In this work, we study this question in the context of heterogeneous emb…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 16 pages, 9 figures

    MSC Class: 68T40; ACM Class: I.2.9

  47. arXiv:2402.19173  [pdf, other]

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data…

    Submitted 29 February, 2024; originally announced February 2024.

  48. arXiv:2402.17767  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Opening Cabinets and Drawers in the Real World using a Commodity Mobile Manipulator

    Authors: Arjun Gupta, Michelle Zhang, Rishik Sathua, Saurabh Gupta

    Abstract: Pulling open cabinets and drawers presents many difficult technical challenges in perception (inferring articulation parameters for objects from onboard sensors), planning (producing motion plans that conform to tight task constraints), and control (making and maintaining contact while applying forces on the environment). In this work, we build an end-to-end system that enables a commodity mobile…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Project webpage: https://arjung128.github.io/opening-cabinets-and-drawers

  49. arXiv:2402.11729  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Prospector Heads: Generalized Feature Attribution for Large Models & Data

    Authors: Gautam Machiraju, Alexander Derry, Arjun Desai, Neel Guha, Amir-Hossein Karimi, James Zou, Russ Altman, Christopher Ré, Parag Mallick

    Abstract: Feature attribution, the ability to localize regions of the input data that are relevant for classification, is an important capability for ML models in scientific and biomedical domains. Current methods for feature attribution, which rely on "explaining" the predictions of end-to-end classifiers, suffer from imprecise feature localization and are inadequate for use with small sample sizes and hig…

    Submitted 19 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 30 pages, 16 figures, 8 tables. Accepted to ICML 2024

  50. arXiv:2402.10163  [pdf, other]

    cs.NE

    Hidden Traveling Waves bind Working Memory Variables in Recurrent Neural Networks

    Authors: Arjun Karuvally, Terrence J. Sejnowski, Hava T. Siegelmann

    Abstract: Traveling waves are a fundamental phenomenon in the brain, playing a crucial role in short-term information storage. In this study, we leverage the concept of traveling wave dynamics within a neural lattice to formulate a theoretical model of neural working memory, study its properties, and its real world implications in AI. The proposed model diverges from traditional approaches, which assume inf…

    Submitted 7 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.