Skip to main content

Showing 1–50 of 128 results for author: Narayanan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01502  [pdf, other

    cs.LG cs.AI

    AI Agents That Matter

    Authors: Sayash Kapoor, Benedikt Stroebl, Zachary S. Siegel, Nitya Nadgir, Arvind Narayanan

    Abstract: AI agents are an exciting new research direction, and agent development is driven by benchmarks. Our analysis of current agent benchmarks and evaluation practices reveals several shortcomings that hinder their usefulness in real-world applications. First, there is a narrow focus on accuracy without attention to other metrics. As a result, SOTA agents are needlessly complex and costly, and the comm… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.16746  [pdf, other

    cs.LG cs.AI cs.CL

    The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

    Authors: Shayne Longpre, Stella Biderman, Alon Albalak, Hailey Schoelkopf, Daniel McDuff, Sayash Kapoor, Kevin Klyman, Kyle Lo, Gabriel Ilharco, Nay San, Maribeth Rauh, Aviya Skowron, Bertie Vidgen, Laura Weidinger, Arvind Narayanan, Victor Sanh, David Adelani, Percy Liang, Rishi Bommasani, Peter Henderson, Sasha Luccioni, Yacine Jernite, Luca Soldaini

    Abstract: Foundation model development attracts a rapidly expanding body of contributors, scientists, and applications. To help shape responsible development practices, we introduce the Foundation Model Development Cheatsheet: a growing collection of 250+ tools and resources spanning text, vision, and speech modalities. We draw on a large body of prior work to survey resources (e.g. software, documentation,… ▽ More

    Submitted 25 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.09296  [pdf, other

    cs.CV cs.AI

    Parameter-Efficient Active Learning for Foundational models

    Authors: Athmanarayanan Lakshmi Narayanan, Ranganath Krishnan, Amrutha Machireddy, Mahesh Subedar

    Abstract: Foundational vision transformer models have shown impressive few shot performance on many vision tasks. This research presents a novel investigation into the application of parameter efficient fine-tuning methods within an active learning (AL) framework, to advance the sampling selection process in extremely budget constrained classification tasks. The focus on image datasets, known for their out-… ▽ More

    Submitted 14 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted for CVPR2024 Transformers for Vision Workshop

  4. arXiv:2406.02004  [pdf, ps, other

    cs.CR cs.CL cs.SD eess.AS

    Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clip**

    Authors: Lun Wang, Om Thakkar, Zhong Meng, Nicole Rafidi, Rohit Prabhavalkar, Arun Narayanan

    Abstract: Gradient clip** plays a vital role in training large-scale automatic speech recognition (ASR) models. It is typically applied to minibatch gradients to prevent gradient explosion, and to the individual sample gradients to mitigate unintended memorization. This work systematically investigates the impact of a specific granularity of gradient clip**, namely per-core clip-** (PCC), across train… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech'24

  5. arXiv:2405.19524  [pdf, other

    cs.CR cs.AI

    AI Risk Management Should Incorporate Both Safety and Security

    Authors: Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Gei**, Luxi He, Kaixuan Huang, Udari Madhushani, Vikash Sehwag, Weijia Shi, Boyi Wei, Tinghao Xie, Danqi Chen, Pin-Yu Chen, Jeffrey Ding, Ruoxi Jia, Jiaqi Ma, Arvind Narayanan, Weijie J Su, Mengdi Wang, Chaowei Xiao, Bo Li, Dawn Song, Peter Henderson, Prateek Mittal

    Abstract: The exposure of security vulnerabilities in safety-aligned language models, e.g., susceptibility to adversarial attacks, has shed light on the intricate interplay between AI safety and AI security. Although the two disciplines now come together under the overarching goal of AI risk management, they have historically evolved separately, giving rise to differing perspectives. Therefore, in this pape… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2403.07918  [pdf, other

    cs.CY cs.AI cs.LG

    On the Societal Impact of Open Foundation Models

    Authors: Sayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, Arvind Narayanan

    Abstract: Foundation models are powerful technologies: how they are released publicly directly shapes their societal impact. In this position paper, we focus on open foundation models, defined here as those with broadly available model weights (e.g. Llama 2, Stable Diffusion XL). We identify five distinctive properties (e.g. greater customizability, poor monitoring) of open foundation models that lead to bo… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

  7. arXiv:2403.04893  [pdf, other

    cs.AI

    A Safe Harbor for AI Evaluation and Red Teaming

    Authors: Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng-Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Sandy Pentland, Arvind Narayanan, Percy Liang, Peter Henderson

    Abstract: Independent evaluation and red teaming are critical for identifying the risks posed by generative AI systems. However, the terms of service and enforcement strategies used by prominent AI companies to deter model misuse have disincentives on good faith safety evaluations. This causes some researchers to fear that conducting such research or releasing their findings will result in account suspensio… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  8. arXiv:2403.04781  [pdf

    cs.CR cs.CV cs.LG eess.IV

    Selective Encryption using Segmentation Mask with Chaotic Henon Map for Multidimensional Medical Images

    Authors: S Arut Prakash, Aditya Ganesh Kumar, Prabhu Shankar K. C., Lithicka Anandavel, Aditya Lakshmi Narayanan

    Abstract: A user-centric design and resource optimization should be at the center of any technology or innovation. The user-centric perspective gives the developer the opportunity to develop with task-based optimization. The user in the medical image field is a medical professional who analyzes the medical images and gives their diagnosis results to the patient. This scheme, having the medical professional… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  9. arXiv:2402.17932  [pdf, other

    cs.MA q-fin.GN

    A Heterogeneous Agent Model of Mortgage Servicing: An Income-based Relief Analysis

    Authors: Deepeka Garg, Benjamin Patrick Evans, Leo Ardon, Annapoorani Lakshmi Narayanan, Jared Vann, Udari Madhushani, Makada Henry-Nickie, Sumitra Ganesh

    Abstract: Mortgages account for the largest portion of household debt in the United States, totaling around \$12 trillion nationwide. In times of financial hardship, alleviating mortgage burdens is essential for supporting affected households. The mortgage servicing industry plays a vital role in offering this assistance, yet there has been limited research modelling the complex relationship between househo… ▽ More

    Submitted 29 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: AAAI 2024 - AI in Finance for Social Impact

  10. arXiv:2402.17184  [pdf, other

    cs.CL cs.SD eess.AS

    Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

    Authors: Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno

    Abstract: The accuracy of end-to-end (E2E) automatic speech recognition (ASR) models continues to improve as they are scaled to larger sizes, with some now reaching billions of parameters. Widespread deployment and adoption of these models, however, requires computationally efficient strategies for decoding. In the present work, we study one such strategy: applying multiple frame reduction layers in the enc… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  11. arXiv:2402.16268  [pdf, other

    cs.LG cs.AI cs.CY

    Foundation Model Transparency Reports

    Authors: Rishi Bommasani, Kevin Klyman, Shayne Longpre, Betty Xiong, Sayash Kapoor, Nestor Maslej, Arvind Narayanan, Percy Liang

    Abstract: Foundation models are critical digital technologies with swee** societal impact that necessitates transparency. To codify how foundation model developers should provide transparency about the development and deployment of their models, we propose Foundation Model Transparency Reports, drawing upon the transparency reporting practices in social media. While external documentation of societal harm… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  12. arXiv:2402.16235  [pdf, other

    cs.HC cs.AI

    Human-AI Co-Creation of Worked Examples for Programming Classes

    Authors: Mohammad Hassany, Peter Brusilovsky, Jiaze Ke, Kamil Akhuseyinoglu, Arun Balajiee Lekshmi Narayanan

    Abstract: Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarel… ▽ More

    Submitted 29 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2312.02105

  13. arXiv:2402.08570  [pdf, other

    cs.RO cs.AI cs.LG

    Online Foundation Model Selection in Robotics

    Authors: Po-han Li, Oyku Selin Toprak, Aditya Narayanan, Ufuk Topcu, Sandeep Chinchali

    Abstract: Foundation models have recently expanded into robotics after excelling in computer vision and natural language processing. The models are accessible in two ways: open-source or paid, closed-source options. Users with access to both face a problem when deciding between effective yet costly closed-source models and free but less powerful open-source alternatives. We call it the model selection probl… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  14. arXiv:2402.01656  [pdf, other

    cs.CY cs.AI

    Promises and pitfalls of artificial intelligence for legal applications

    Authors: Sayash Kapoor, Peter Henderson, Arvind Narayanan

    Abstract: Is AI set to redefine the legal profession? We argue that this claim is not supported by the current evidence. We dive into AI's increasingly prevalent roles in three types of legal tasks: information processing; tasks involving creativity, reasoning, or judgment; and predictions about the future. We find that the ease of evaluating legal applications varies greatly across legal tasks, based on th… ▽ More

    Submitted 10 January, 2024; originally announced February 2024.

  15. arXiv:2401.13081  [pdf, other

    cs.CV cs.AI

    Free Form Medical Visual Question Answering in Radiology

    Authors: Abhishek Narayanan, Rushabh Musthyala, Rahul Sankar, Anirudh Prasad Nistala, Pranav Singh, Jacopo Cirrone

    Abstract: Visual Question Answering (VQA) in the medical domain presents a unique, interdisciplinary challenge, combining fields such as Computer Vision, Natural Language Processing, and Knowledge Representation. Despite its importance, research in medical VQA has been scant, only gaining momentum since 2018. Addressing this gap, our research delves into the effective representation of radiology images and… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 6 pages and 4 figures

  16. arXiv:2401.08738  [pdf

    q-bio.GN cs.LG

    Machine Learning-Based Analysis of Ebola Virus' Impact on Gene Expression in Nonhuman Primates

    Authors: Mostafa Rezapour, Muhammad Khalid Khan Niazi, Hao Lu, Aarthi Narayanan, Metin Nafi Gurcan

    Abstract: This study introduces the Supervised Magnitude-Altitude Scoring (SMAS) methodology, a machine learning-based approach, for analyzing gene expression data obtained from nonhuman primates (NHPs) infected with Ebola virus (EBOV). We utilize a comprehensive dataset of NanoString gene expression profiles from Ebola-infected NHPs, deploying the SMAS system for nuanced host-pathogen interaction analysis.… ▽ More

    Submitted 22 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 28 pages, 8 figures, 2 tables

  17. arXiv:2312.17372  [pdf, other

    cs.LG cs.AI physics.acc-ph

    Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e

    Authors: Chenwei Xu, Jerry Yao-Chieh Hu, Aakaash Narayanan, Mattson Thieme, Vladimir Nagaslaev, Mark Austin, Jeremy Arnold, Jose Berlioz, Pierrick Hanlet, Aisha Ibrahim, Dennis Nicklaus, Jovan Mitrevski, Jason Michael St. John, Gauri Pradhan, Andrea Saewert, Kiyomi Seiya, Brian Schupbach, Randy Thurman-Keup, Nhan Tran, Rui Shi, Seda Ogrenci, Alexis Maya-Isabelle Shu**, Kyle Hazelwood, Han Liu

    Abstract: We introduce a novel Proximal Policy Optimization (PPO) algorithm aimed at addressing the challenge of maintaining a uniform proton beam intensity delivery in the Muon to Electron Conversion Experiment (Mu2e) at Fermi National Accelerator Laboratory (Fermilab). Our primary objective is to regulate the spill process to ensure a consistent intensity profile, with the ultimate goal of creating an aut… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 10 pages, accepted at NeurIPS 2023 ML4Phy Workshop

  18. arXiv:2312.02105  [pdf, other

    cs.HC cs.AI

    Authoring Worked Examples for Java Programming with Human-AI Collaboration

    Authors: Mohammad Hassany, Peter Brusilovsky, Jiaze Ke, Kamil Akhuseyinoglu, Arun Balajiee Lekshmi Narayanan

    Abstract: Worked examples (solutions to typical programming problems presented as a source code in a certain language and are used to explain the topics from a programming class) are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are based on line-by-line explanations of the example code. However, instructors rarel… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 2 papers both same content

  19. arXiv:2312.01532  [pdf, other

    cs.HC cs.CL

    Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments

    Authors: Shanqing Cai, Subhashini Venugopalan, Katie Seaver, Xiang Xiao, Katrin Tomanek, Sri Jalasutram, Meredith Ringel Morris, Shaun Kane, Ajit Narayanan, Robert L. MacDonald, Emily Kornman, Daniel Vance, Blair Casey, Steve M. Gleason, Philip Q. Nelson, Michael P. Brenner

    Abstract: Finding ways to accelerate text input for individuals with profound motor impairments has been a long-standing area of research. Closing the speed gap for augmentative and alternative communication (AAC) devices such as eye-tracking keyboards is important for improving the quality of life for such individuals. Recent advances in neural networks of natural language pose new opportunities for re-thi… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  20. arXiv:2311.13489  [pdf, other

    cs.MA

    Large-scale Package Deliveries with Unmanned Aerial Vehicles using Collective Learning

    Authors: Arun Narayanan, Evangelos Pournaras, Pedro H. J. Nardelli

    Abstract: Unmanned aerial vehicles (UAVs) have significant practical advantages for delivering packages, and many logistics companies have begun deploying UAVs for commercial package deliveries. To deliver packages quickly and cost-effectively, the routes taken by UAVs from depots to customers must be optimized. This route optimization problem, a type of capacitated vehicle routing problem, has recently att… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  21. arXiv:2310.20065  [pdf, other

    cs.CV cs.LG

    LinFlo-Net: A two-stage deep learning method to generate simulation ready meshes of the heart

    Authors: Arjun Narayanan, Fanwei Kong, Shawn Shadden

    Abstract: We present a deep learning model to automatically generate computer models of the human heart from patient imaging data with an emphasis on its capability to generate thin-walled cardiac structures. Our method works by deforming a template mesh to fit the cardiac structures to the given image. Compared with prior deep learning methods that adopted this approach, our framework is designed to minimi… ▽ More

    Submitted 3 January, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted manuscript in the Journal of Biomechanical Engineering

  22. arXiv:2310.18772  [pdf, other

    cs.LG cs.NE

    A Data-driven Recommendation Framework for Optimal Walker Designs

    Authors: Advaith Narayanan

    Abstract: The rapidly advancing fields of statistical modeling and machine learning have significantly enhanced data-driven design and optimization. This paper focuses on leveraging these design algorithms to optimize a medical walker, an integral part of gait rehabilitation and physiological therapy of the lower extremities. To achieve the desirable qualities of a walker, we train a predictive machine-lear… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: 13 pages, 12 figures

  23. arXiv:2310.16132  [pdf, other

    cs.SE

    Diversity in Software Engineering Conferences and Journals

    Authors: Aditya Shankar Narayanan, Dheeraj Vagavolu, Nancy A Day, Meiyappan Nagappan

    Abstract: Diversity with respect to ethnicity and gender has been studied in open-source and industrial settings for software development. Publication avenues such as academic conferences and journals contribute to the growing technology industry. However, there have been very few diversity-related studies conducted in the context of academia. In this paper, we study the ethnic, gender, and geographical div… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 13 pages, 10 figures, 4 tables

  24. arXiv:2309.16256  [pdf, other

    quant-ph cs.CC cs.CR

    On finding dense sub-lattices as low energy states of a quantum Hamiltonian

    Authors: Júlia Barberà Rodríguez, Nicolas Gama, Anand Kumar Narayanan, David Joseph

    Abstract: Lattice-based cryptography has emerged as one of the most prominent candidates for post-quantum cryptography, projected to be secure against the imminent threat of large-scale fault-tolerant quantum computers. The Shortest Vector Problem (SVP) is to find the shortest non-zero vector in a given lattice. It is fundamental to lattice-based cryptography and believed to be hard even for quantum compute… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  25. arXiv:2309.08733  [pdf, other

    cs.RO

    Optimal path planning of multi-agent cooperative systems with rigid formation

    Authors: Ananda Rangan Narayanan, Mi Zhou, Erik Verriest

    Abstract: In this article, we consider the path-planning problem of a cooperative homogeneous robotic system with rigid formation. An optimal controller is designed for each agent in such rigid systems based on Pontryagin's minimum principle theory. We found that the optimal control for each agent is equivalent to the optimal control for the Center of Mass (CoM). This equivalence is then proved by using som… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  26. arXiv:2309.06484  [pdf, other

    cs.CG cs.LG

    Learning topological operations on meshes with application to block decomposition of polygons

    Authors: Arjun Narayanan, Yulong Pan, Per-Olof Persson

    Abstract: We present a learning based framework for mesh quality improvement on unstructured triangular and quadrilateral meshes. Our model learns to improve mesh quality according to a prescribed objective function purely via self-play reinforcement learning with no prior heuristics. The actions performed on the mesh are standard local and global element operations. The goal is to minimize the deviation of… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Submitted to Computer-Aided Design Journal. Presented at 17th US National Conference on Computational Mechanics, Albuquerque, NM

    ACM Class: I.2.6; I.3.5; J.6

  27. arXiv:2308.07832  [pdf, ps, other

    cs.LG cs.AI stat.ME

    REFORMS: Reporting Standards for Machine Learning Based Science

    Authors: Sayash Kapoor, Emily Cantrell, Kenny Peng, Thanh Hien Pham, Christopher A. Bail, Odd Erik Gundersen, Jake M. Hofman, Jessica Hullman, Michael A. Lones, Momin M. Malik, Priyanka Nanayakkara, Russell A. Poldrack, Inioluwa Deborah Raji, Michael Roberts, Matthew J. Salganik, Marta Serra-Garcia, Brandon M. Stewart, Gilles Vandewiele, Arvind Narayanan

    Abstract: Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways acros… ▽ More

    Submitted 19 September, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  28. arXiv:2307.00404  [pdf, other

    cs.SE

    Automatic Unit Test Generation for Deep Learning Frameworks based on API Knowledge

    Authors: Arunkaleeshwaran Narayanan, Nima Shiri harzevili, Junjie Wang, Lin Shi, Moshi Wei, Song Wang

    Abstract: Many automatic unit test generation tools that can generate unit test cases with high coverage over a program have been proposed. However, most of these tools are ineffective on deep learning (DL) frameworks due to the fact that many of deep learning APIs expect inputs that follow specific API knowledge. To fill this gap, we propose MUTester to generate unit test cases for APIs of deep learning fr… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

  29. arXiv:2306.00551  [pdf, other

    cs.CL cs.HC

    Enhancing Programming eTextbooks with ChatGPT Generated Counterfactual-Thinking-Inspired Questions

    Authors: Arun Balajiee Lekshmi Narayanan, Rully Agus Hendrawan, Venktesh V

    Abstract: Digital textbooks have become an integral part of everyday learning tasks. In this work, we consider the use of digital textbooks for programming classes. Generally, students struggle with utilizing textbooks on programming to the maximum, with a possible reason being that the example programs provided as illustration of concepts in these textbooks don't offer sufficient interactivity for students… ▽ More

    Submitted 6 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Paper Under Review

  30. arXiv:2305.16809  [pdf

    cs.CL cs.AI cs.HC

    GenQ: Automated Question Generation to Support Caregivers While Reading Stories with Children

    Authors: Arun Balajiee Lekshmi Narayanan, Ligia E. Gomez, Martha Michelle Soto Fernandez, Tri Nguyen, Chris Blais, M. Adelaida Restrepo, Art Glenberg

    Abstract: When caregivers ask open--ended questions to motivate dialogue with children, it facilitates the child's reading comprehension skills.Although there is scope for use of technological tools, referred here as "intelligent tutoring systems", to scaffold this process, it is currently unclear whether existing intelligent systems that generate human--language like questions is beneficial. Additionally,… ▽ More

    Submitted 25 September, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  31. arXiv:2302.14491  [pdf, other

    math.NT cs.LO

    Formalization of $p$-adic $L$-functions in Lean 3

    Authors: Ashvni Narayanan

    Abstract: The Euler--Riemann zeta function is a largely studied numbertheoretic object, and the birthplace of several conjectures, such as the Riemann Hypothesis. Different approaches are used to study it, including $p$-adic analysis : deriving information from $p$-adic zeta functions. A generalized version of $p$-adic zeta functions (Riemann zeta function) are $p$-adic $L$-functions (resp. Dirichlet $L$-fu… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  32. arXiv:2212.09006  [pdf, other

    cs.SD cs.LG eess.AS

    A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness

    Authors: Tiantian Feng, Rajat Hebbar, Nicholas Mehlman, Xuan Shi, Aditya Kommineni, and Shrikanth Narayanan

    Abstract: Speech-centric machine learning systems have revolutionized many leading domains ranging from transportation and healthcare to education and defense, profoundly changing how people live, work, and interact with each other. However, recent studies have demonstrated that many speech-centric ML systems may need to be considered more trustworthy for broader deployment. Specifically, concerns over priv… ▽ More

    Submitted 16 April, 2023; v1 submitted 17 December, 2022; originally announced December 2022.

    Journal ref: APSIPA Transactions on Signal and Information Processing, vol. 12, no. 3, 2023

  33. arXiv:2211.07524  [pdf, other

    cs.CL cs.AI

    Towards a Mathematics Formalisation Assistant using Large Language Models

    Authors: Ayush Agrawal, Siddhartha Gadgil, Navin Goyal, Ashvni Narayanan, Anand Tadipatri

    Abstract: Mathematics formalisation is the task of writing mathematics (i.e., definitions, theorem statements, proofs) in natural language, as found in books and papers, into a formal language that can then be checked for correctness by a program. It is a thriving activity today, however formalisation remains cumbersome. In this paper, we explore the abilities of a large language model (Codex) to help with… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  34. arXiv:2210.13079  [pdf, other

    cs.CV cs.AI

    mm-Wave Radar Hand Shape Classification Using Deformable Transformers

    Authors: Athmanarayanan Lakshmi Narayanan, Asma Beevi K. T, Haoyang Wu, **gyi Ma, W. Margaret Huang

    Abstract: A novel, real-time, mm-Wave radar-based static hand shape classification algorithm and implementation are proposed. The method finds several applications in low cost and privacy sensitive touchless control technology using 60 Ghz radar as the sensor input. As opposed to prior Range-Doppler image based 2D classification solutions, our method converts raw radar data to 3D sparse cartesian point clou… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  35. arXiv:2209.06410  [pdf, other

    eess.AS cs.SD

    A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation

    Authors: Tom O'Malley, Arun Narayanan, Quan Wang

    Abstract: Recent work has shown that it is possible to train a single model to perform joint acoustic echo cancellation (AEC), speech enhancement, and voice separation, thereby serving as a unified frontend for robust automatic speech recognition (ASR). The joint model uses contextual information, such as a reference of the playback audio, noise context, and speaker embedding. In this work, we propose a num… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  36. Security policy audits: why and how

    Authors: Arvind Narayanan, Kevin Lee

    Abstract: Information security isn't just about software and hardware -- it's at least as much about policies and processes. But the research community overwhelmingly focuses on the former over the latter, while ga** policy and process problems persist. In this experience paper, we describe a series of security policy audits that we conducted, exposing policy flaws affecting billions of users that can be… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  37. arXiv:2207.07048  [pdf, other

    cs.LG cs.AI stat.ME

    Leakage and the Reproducibility Crisis in ML-based Science

    Authors: Sayash Kapoor, Arvind Narayanan

    Abstract: The use of machine learning (ML) methods for prediction and forecasting has become widespread across the quantitative sciences. However, there are many known methodological pitfalls, including data leakage, in ML-based science. In this paper, we systematically investigate reproducibility issues in ML-based science. We show that data leakage is indeed a widespread problem and has led to severe repr… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

  38. LaMPost: Design and Evaluation of an AI-assisted Email Writing Prototype for Adults with Dyslexia

    Authors: Steven M. Goodman, Erin Buehler, Patrick Clary, Andy Coenen, Aaron Donsbach, Tiffanie N. Horne, Michal Lahav, Robert Macdonald, Rain Breaw Michaels, Ajit Narayanan, Mahima Pushkarna, Joel Riley, Alex Santana, Lei Shi, Rachel Sweeney, Phil Weaver, Ann Yuan, Meredith Ringel Morris

    Abstract: Prior work has explored the writing challenges experienced by people with dyslexia, and the potential for new spelling, grammar, and word retrieval technologies to address these challenges. However, the capabilities for natural language generation demonstrated by the latest class of large language models (LLMs) highlight an opportunity to explore new forms of human-AI writing support tools. In thi… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: To appear at The 24th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '22), October 23-26, 2022, Athens, Greece. 26 pages

  39. arXiv:2206.04720  [pdf, other

    cs.SI cs.CY cs.HC

    How Algorithms Shape the Distribution of Political Advertising: Case Studies of Facebook, Google, and TikTok

    Authors: Orestis Papakyriakopoulos, Christelle Tessono, Arvind Narayanan, Mihir Kshirsagar

    Abstract: Online platforms play an increasingly important role in sha** democracy by influencing the distribution of political information to the electorate. In recent years, political campaigns have spent heavily on the platforms' algorithmic tools to target voters with online advertising. While the public interest in understanding how platforms perform the task of sha** the political discourse has nev… ▽ More

    Submitted 13 July, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Forthcoming in: Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society (AIES'22), August 1-3, 2022, Oxford, United Kingdom. ACM, New York, NY, USA, 15 pages

  40. arXiv:2205.08555  [pdf, ps, other

    eess.AS cs.SD

    Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments

    Authors: Joe Caroselli, Arun Narayanan, Yiteng Huang

    Abstract: One of the most challenging scenarios for smart speakers is multi-talker, when target speech from the desired speaker is mixed with interfering speech from one or more speakers. A smart assistant needs to determine which voice to recognize and which to ignore and it needs to do so in a streaming, low-latency manner. This work presents two multi-microphone speech enhancement algorithms targeted at… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: Submitted to IWAENC 2022

  41. arXiv:2205.06107  [pdf, other

    econ.TH cs.AI cs.LG

    Social learning via actions in bandit environments

    Authors: Aroon Narayanan

    Abstract: I study a game of strategic exploration with private payoffs and public actions in a Bayesian bandit setting. In particular, I look at cascade equilibria, in which agents switch over time from the risky action to the riskless action only when they become sufficiently pessimistic. I show that these equilibria exist under some conditions and establish their salient properties. Individual exploration… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

  42. arXiv:2205.04619  [pdf, other

    cs.LG cs.AI econ.TH

    Risk Preferences of Learning Algorithms

    Authors: Andreas Haupt, Aroon Narayanan

    Abstract: Agents' learning from feedback shapes economic outcomes, and many economic decision-makers today employ learning algorithms to make consequential choices. This note shows that a widely used learning algorithm, $\varepsilon$-Greedy, exhibits emergent risk aversion: it prefers actions with lower variance. When presented with actions of the same expectation, under a wide range of conditions,… ▽ More

    Submitted 12 December, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 11 pages, 6 figures

  43. arXiv:2205.03767  [pdf, other

    cs.CL

    Context-Aware Abbreviation Expansion Using Large Language Models

    Authors: Shanqing Cai, Subhashini Venugopalan, Katrin Tomanek, Ajit Narayanan, Meredith Ringel Morris, Michael P. Brenner

    Abstract: Motivated by the need for accelerating text entry in augmentative and alternative communication (AAC) for people with severe motor impairments, we propose a paradigm in which phrases are abbreviated aggressively as primarily word-initial letters. Our approach is to expand the abbreviations into full-phrase options by leveraging conversation context with the power of pretrained large language model… ▽ More

    Submitted 10 May, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: 15 pages, 7 figures, 8 tables. Accepted as a long paper at NAACL 2022

  44. arXiv:2205.03481  [pdf, other

    eess.AS cs.SD eess.SP

    A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

    Authors: Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary, Shuai Shao, Nathan Howard, Alex Park, James Walker, Alexander Gruenstein

    Abstract: Acoustic Echo Cancellation (AEC) is essential for accurate recognition of queries spoken to a smart speaker that is playing out audio. Previous work has shown that a neural AEC model operating on log-mel spectral features (denoted "logmel" hereafter) can greatly improve Automatic Speech Recognition (ASR) accuracy when optimized with an auxiliary loss utilizing a pre-trained ASR model encoder. In t… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: Submitted to Interspeech 2022

  45. arXiv:2204.12092  [pdf, other

    eess.AS cs.SD

    Mask scalar prediction for improving robust automatic speech recognition

    Authors: Arun Narayanan, James Walker, Sankaran Panchapagesan, Nathan Howard, Yuma Koizumi

    Abstract: Using neural network based acoustic frontends for improving robustness of streaming automatic speech recognition (ASR) systems is challenging because of the causality constraints and the resulting distortion that the frontend processing introduces in speech. Time-frequency masking based approaches have been shown to work well, but they need additional hyper-parameters to scale the mask to limit sp… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Submitted to Interspeech 2022

  46. arXiv:2204.11933  [pdf, other

    eess.AS cs.SD

    Cleanformer: A multichannel array configuration-invariant neural enhancement frontend for ASR in smart speakers

    Authors: Joseph Caroselli, Arun Narayanan, Nathan Howard, Tom O'Malley

    Abstract: This work introduces the Cleanformer, a streaming multichannel neural based enhancement frontend for automatic speech recognition (ASR). This model has a conformer-based architecture which takes as inputs a single channel each of raw and enhanced signals, and uses self-attention to derive a time-frequency mask. The enhanced input is generated by a multichannel adaptive noise cancellation algorithm… ▽ More

    Submitted 3 May, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to ICASSP 2023

  47. arXiv:2204.08345  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Extracting Targeted Training Data from ASR Models, and How to Mitigate It

    Authors: Ehsan Amid, Om Thakkar, Arun Narayanan, Rajiv Mathews, Françoise Beaufays

    Abstract: Recent work has designed methods to demonstrate that model updates in ASR training can leak potentially sensitive attributes of the utterances used in computing the updates. In this work, we design the first method to demonstrate information leakage about training data from trained ASR models. We design Noise Masking, a fill-in-the-blank style method for extracting targeted parts of training data… ▽ More

    Submitted 27 June, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: Accepted to appear at Interspeech'22

  48. arXiv:2204.05545  [pdf, other

    cs.AI

    A Reinforcement Learning Approach for Electric Vehicle Routing Problem with Vehicle-to-Grid Supply

    Authors: Ajay Narayanan, Prasant Misra, Ankush Ojha, Vivek Bandhu, Supratim Ghosh, Arunchandar Vasan

    Abstract: The use of electric vehicles (EV) in the last mile is appealing from both sustainability and operational cost perspectives. In addition to the inherent cost efficiency of EVs, selling energy back to the grid during peak grid demand, is a potential source of additional revenue to a fleet operator. To achieve this, EVs have to be at specific locations (discharge points) during specific points in tim… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 6 pages; 1 figure; Proc. of the Adaptive and Learning Agents Workshop (ALA 2022), Cruz, Hayes, da Silva, Santos (eds.), May 9-10, 2022, Online, https:// ala2022.github.io/.2022

  49. arXiv:2204.03793  [pdf, other

    eess.AS cs.LG cs.SD

    Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

    Authors: Shao** Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw

    Abstract: Personalization of on-device speech recognition (ASR) has seen explosive growth in recent years, largely due to the increasing popularity of personal assistant features on mobile devices and smart home speakers. In this work, we present Personal VAD 2.0, a personalized voice activity detector that detects the voice activity of a target speaker, as part of a streaming on-device ASR system. Although… ▽ More

    Submitted 24 June, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Accepted by INTERSPEECH 2022

  50. The worst of both worlds: A comparative analysis of errors in learning from data in psychology and machine learning

    Authors: Jessica Hullman, Sayash Kapoor, Priyanka Nanayakkara, Andrew Gelman, Arvind Narayanan

    Abstract: Recent arguments that machine learning (ML) is facing a reproducibility and replication crisis suggest that some published claims in ML research cannot be taken at face value. These concerns inspire analogies to the replication crisis affecting the social and medical sciences. They also inspire calls for the integration of statistical approaches to causal inference and predictive modeling. A deepe… ▽ More

    Submitted 2 June, 2022; v1 submitted 12 March, 2022; originally announced March 2022.