Skip to main content

Showing 1–50 of 199 results for author: Shah, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17232  [pdf, other

    cs.CL

    Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

    Authors: Yun-Shiuan Chuang, Zach Studdiford, Krirk Nirunwiroj, Agam Goyal, Vincent V. Frigo, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Creating human-like large language model (LLM) agents is crucial for faithful social simulation. Having LLMs role-play based on demographic information sometimes improves human likeness but often does not. This study assessed whether LLM alignment with human behavior can be improved by integrating information from empirically-derived human belief networks. Using data from a human survey, we estima… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2405.18334  [pdf, other

    cs.DB cs.CV cs.LG

    SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

    Authors: Renzhi Wu, Pramod Chunduri, Dristi J Shah, Ashmitha Julius Aravind, Ali Payani, Xu Chu, Joy Arulraj, Kexin Rong

    Abstract: In this paper, we will present SketchQL, a video database management system (VDBMS) for retrieving video moments with a sketch-based query interface. This novel interface allows users to specify object trajectory events with simple mouse drag-and-drop operations. Users can use trajectories of single objects as building blocks to compose complex events. Using a pre-trained model that encodes trajec… ▽ More

    Submitted 30 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Journal ref: Published on International Conference on Very Large Databases 2024

  3. arXiv:2405.09530  [pdf, other

    cs.CY cs.CV cs.LG

    A community palm model

    Authors: Nicholas Clinton, Andreas Vollrath, Remi D'annunzio, Desheng Liu, Henry B. Glick, Adrià Descals, Alicia Sullivan, Oliver Guinan, Jacob Abramowitz, Fred Stolle, Chris Goodman, Tanya Birch, David Quinn, Olga Danylo, Tijs Lips, Daniel Coelho, Enikoe Bihari, Bryce Cronkite-Ratcliff, Ate Poortinga, Atena Haghighattalab, Evan Notman, Michael DeWitt, Aaron Yonas, Gennadii Donchyts, Devaja Shah , et al. (5 additional authors not shown)

    Abstract: Palm oil production has been identified as one of the major drivers of deforestation for tropical countries. To meet supply chain objectives, commodity producers and other stakeholders need timely information of land cover dynamics in their supply shed. However, such data are difficult to obtain from suppliers who may lack digital geographic representations of their supply sheds and production loc… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: v0

  4. arXiv:2405.09155  [pdf, other

    cs.ET

    TunnelSense: Low-power, Non-Contact Sensing using Tunnel Diodes

    Authors: Lim Chang Quan Thaddeus, C. Rajashekar Reddy, Yuvraj Singh Bhadauria, Dhairya Shah, Manoj Gulati, Ambuj Varshney

    Abstract: Sensing the motion of physical objects in an environment enables numerous applications, from tracking occupancy in buildings and monitoring vital signs to diagnosing faults in machines. Typically, these application scenarios involve attaching a sensor, such as an accelerometer, to the object of interest, like a wearable device that tracks our steps. However, many of these scenarios require trackin… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: This work is accepted at IEEE RFID 2024

  5. arXiv:2404.13008  [pdf, other

    cs.SD eess.AS

    Enhancing Generalization in Audio Deepfake Detection: A Neural Collapse based Sampling and Training Approach

    Authors: Mohammed Yousif, Jonat John Mathew, Huzaifa Pallan, Agamjeet Singh Padda, Syed Daniyal Shah, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

    Abstract: Generalization in audio deepfake detection presents a significant challenge, with models trained on specific datasets often struggling to detect deepfakes generated under varying conditions and unknown algorithms. While collectively training a model using diverse datasets can enhance its generalization ability, it comes with high computational costs. To address this, we propose a neural collapse-b… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  6. arXiv:2403.10912  [pdf

    cs.CV cs.LG

    Automatic location detection based on deep learning

    Authors: Anjali Karangiya, Anirudh Sharma, Divax Shah, Kartavya Badgujar, Dr. Chintan Thacker, Dainik Dave

    Abstract: The proliferation of digital images and the advancements in deep learning have paved the way for innovative solutions in various domains, especially in the field of image classification. Our project presents an in-depth study and implementation of an image classification system specifically tailored to identify and classify images of Indian cities. Drawing from an extensive dataset, our model clas… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  7. arXiv:2403.09611  [pdf, other

    cs.CV cs.CL cs.LG

    MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

    Authors: Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman , et al. (7 additional authors not shown)

    Abstract: In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various architecture components and data choices. Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial design lessons. For example, we demonstrate that for la… ▽ More

    Submitted 18 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  8. arXiv:2403.00991  [pdf, other

    cs.RO cs.CV cs.LG

    SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation

    Authors: Noriaki Hirose, Dhruv Shah, Kyle Stachowicz, Ajay Sridhar, Sergey Levine

    Abstract: Autonomous self-improving robots that interact and improve with experience are key to the real-world deployment of robotic systems. In this paper, we propose an online learning method, SELFI, that leverages online robot experience to rapidly fine-tune pre-trained control policies efficiently. SELFI applies online model-free reinforcement learning on top of offline model-based learning to bring out… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 11pages, 13 figures, 2 tables

  9. arXiv:2402.19432  [pdf, other

    cs.RO

    Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation

    Authors: Jonathan Yang, Catherine Glossop, Arjun Bhorkar, Dhruv Shah, Quan Vuong, Chelsea Finn, Dorsa Sadigh, Sergey Levine

    Abstract: Recent years in robotics and imitation learning have shown remarkable progress in training large-scale foundation models by leveraging data across a multitude of embodiments. The success of such policies might lead us to wonder: just how diverse can the robots in the training set be while still facilitating positive transfer? In this work, we study this question in the context of heterogeneous emb… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 16 pages, 9 figures

    MSC Class: 68T40 ACM Class: I.2.9

  10. arXiv:2402.14959  [pdf, other

    stat.AP cs.CY stat.ML

    A Causal Framework to Evaluate Racial Bias in Law Enforcement Systems

    Authors: Jessy Xinyi Han, Andrew Miller, S. Craig Watkins, Christopher Winship, Fotini Christia, Devavrat Shah

    Abstract: We are interested in develo** a data-driven method to evaluate race-induced biases in law enforcement systems. While the recent works have addressed this question in the context of police-civilian interactions using police stop data, they have two key limitations. First, bias can only be properly quantified if true criminality is accounted for in addition to race, but it is absent in prior works… ▽ More

    Submitted 20 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  11. arXiv:2402.05983  [pdf, other

    eess.IV cs.LG physics.app-ph physics.ins-det

    Capability enhancement of the X-ray micro-tomography system via ML-assisted approaches

    Authors: Dhruvi Shah, Shruti Mehta, Ashish Agrawal, Shishir Purohit, Bhaskar Chaudhury

    Abstract: Ring artifacts in X-ray micro-CT images are one of the primary causes of concern in their accurate visual interpretation and quantitative analysis. The geometry of X-ray micro-CT scanners is similar to the medical CT machines, except the sample is rotated with a stationary source and detector. The ring artifacts are caused by a defect or non-linear responses in detector pixels during the MicroCT d… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  12. arXiv:2402.03390  [pdf, other

    eess.IV cs.AI cs.CV cs.NI

    PixelGen: Rethinking Embedded Camera Systems

    Authors: Kunjun Li, Manoj Gulati, Steven Waskito, Dhairya Shah, Shantanu Chakrabarty, Ambuj Varshney

    Abstract: Embedded camera systems are ubiquitous, representing the most widely deployed example of a wireless embedded system. They capture a representation of the world - the surroundings illuminated by visible or infrared light. Despite their widespread usage, the architecture of embedded camera systems has remained unchanged, which leads to limitations. They visualize only a tiny portion of the world. Ad… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  13. arXiv:2402.00793  [pdf, other

    cs.LG cs.AI cs.HC

    Human Expertise in Algorithmic Prediction

    Authors: Rohan Alur, Manish Raghavan, Devavrat Shah

    Abstract: We introduce a novel framework for incorporating human expertise into algorithmic predictions. Our approach focuses on the use of human judgment to distinguish inputs which `look the same' to any feasible predictive algorithm. We argue that this framing clarifies the problem of human/AI collaboration in prediction tasks, as experts often have access to information -- particularly subjective inform… ▽ More

    Submitted 22 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 35 pages, 13 figures

  14. arXiv:2401.17711  [pdf

    cs.HC cs.AI

    Prediction of multitasking performance post-longitudinal tDCS via EEG-based functional connectivity and machine learning methods

    Authors: Akash K Rao, Shashank Uttrani, Vishnu K Menon, Darshil Shah, Arnav Bhavsar, Shubhajit Roy Chowdhury, Varun Dutt

    Abstract: Predicting and understanding the changes in cognitive performance, especially after a longitudinal intervention, is a fundamental goal in neuroscience. Longitudinal brain stimulation-based interventions like transcranial direct current stimulation (tDCS) induce short-term changes in the resting membrane potential and influence cognitive processes. However, very little research has been conducted o… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 16 pages, presented at the 30th International Conference on Neural Information Processing (ICONIP2023), Changsha, China, November 2023

  15. arXiv:2312.17270  [pdf, other

    cs.CR cs.LG

    Anticipated Network Surveillance -- An extrapolated study to predict cyber-attacks using Machine Learning and Data Analytics

    Authors: Aviral Srivastava, Dhyan Thakkar, Dr. Sharda Valiveti, Dr. Pooja Shah, Dr. Gaurang Raval

    Abstract: Machine learning and data mining techniques are utiized for enhancement of the security of any network. Researchers used machine learning for pattern detection, anomaly detection, dynamic policy setting, etc. The methods allow the program to learn from data and make decisions without human intervention, consuming a huge training period and computation power. This paper discusses a novel technique… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  16. arXiv:2312.00894  [pdf, other

    cs.SE

    Leveraging Large Language Models to Improve REST API Testing

    Authors: Myeongsoo Kim, Tyler Stennett, Dhruv Shah, Saurabh Sinha, Alessandro Orso

    Abstract: The widespread adoption of REST APIs, coupled with their growing complexity and size, has led to the need for automated REST API testing tools. Current tools focus on the structured data in REST API specifications but often neglect valuable insights available in unstructured natural-language descriptions in the specifications, which leads to suboptimal test coverage. Recently, to address this gap,… ▽ More

    Submitted 29 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: To be published in the 46th IEEE/ACM International Conference on Software Engineering - New Ideas and Emerging Results Track (ICSE-NIER 2024)

  17. arXiv:2311.09665  [pdf, other

    cs.CL

    The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

    Authors: Yun-Shiuan Chuang, Siddharth Suresh, Nikunj Harlalka, Agam Goyal, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Human groups are able to converge on more accurate beliefs through deliberation, even in the presence of polarization and partisan bias -- a phenomenon known as the "wisdom of partisan crowds." Generated agents powered by Large Language Models (LLMs) are increasingly used to simulate human collective behavior, yet few benchmarks exist for evaluating their dynamics against the behavior of human gro… ▽ More

    Submitted 16 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  18. arXiv:2311.09618  [pdf, other

    physics.soc-ph cs.CL

    Simulating Opinion Dynamics with Networks of LLM-based Agents

    Authors: Yun-Shiuan Chuang, Agam Goyal, Nikunj Harlalka, Siddharth Suresh, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Accurately simulating human opinion dynamics is crucial for understanding a variety of societal phenomena, including polarization and the spread of misinformation. However, the agent-based models (ABMs) commonly used for such simulations often over-simplify human behavior. We propose a new approach to simulating opinion dynamics based on populations of Large Language Models (LLMs). Our findings re… ▽ More

    Submitted 31 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  19. arXiv:2311.06430  [pdf, other

    cs.RO

    GOAT: GO to Any Thing

    Authors: Matthew Chang, Theophile Gervet, Mukul Khanna, Sriram Yenamandra, Dhruv Shah, So Yeon Min, Kavit Shah, Chris Paxton, Saurabh Gupta, Dhruv Batra, Roozbeh Mottaghi, Jitendra Malik, Devendra Singh Chaplot

    Abstract: In deployment scenarios such as homes and warehouses, mobile robots are expected to autonomously navigate for extended periods, seamlessly executing tasks articulated in terms that are intuitively understandable by human operators. We present GO To Any Thing (GOAT), a universal navigation system capable of tackling these requirements with three key features: a) Multimodal: it can tackle goals spec… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  20. arXiv:2311.02287  [pdf, other

    cs.LG cs.AI

    Predicting Ground Reaction Force from Inertial Sensors

    Authors: Bowen Song, Marco Paolieri, Harper E. Stewart, Leana Golubchik, Jill L. McNitt-Gray, Vishal Misra, Devavrat Shah

    Abstract: The study of ground reaction forces (GRF) is used to characterize the mechanical loading experienced by individuals in movements such as running, which is clinically applicable to identify athletes at risk for stress-related injuries. Our aim in this paper is to determine if data collected with inertial measurement units (IMUs), that can be worn by athletes during outdoor runs, can be used to pred… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  21. arXiv:2310.12183  [pdf, other

    math.OC cs.AI

    An Optimistic-Robust Approach for Dynamic Positioning of Omnichannel Inventories

    Authors: Pavithra Harsha, Shivaram Subramanian, Ali Koc, Mahesh Ramakrishna, Brian Quanz, Dhruv Shah, Chandra Narayanaswami

    Abstract: We introduce a new class of data-driven and distribution-free optimistic-robust bimodal inventory optimization (BIO) strategy to effectively allocate inventory across a retail chain to meet time-varying, uncertain omnichannel demand. While prior Robust optimization (RO) methods emphasize the downside, i.e., worst-case adversarial demand, BIO also considers the upside to remain resilient like RO wh… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  22. arXiv:2310.10103  [pdf, other

    cs.RO cs.AI cs.CL cs.LG

    Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning

    Authors: Dhruv Shah, Michael Equi, Blazej Osinski, Fei Xia, Brian Ichter, Sergey Levine

    Abstract: Navigation in unfamiliar environments presents a major challenge for robots: while map** and planning techniques can be used to build up a representation of the world, quickly discovering a path to a desired goal in unfamiliar settings with such methods often requires lengthy map** and exploration. Humans can rapidly navigate new environments, particularly indoor environments that are laid out… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Videos, code, and an interactive Colab notebook that runs in your browser https://sites.google.com/view/lfg-nav/

  23. arXiv:2310.09277  [pdf, other

    cs.LG

    A Hybrid Approach for Depression Classification: Random Forest-ANN Ensemble on Motor Activity Signals

    Authors: Anket Patil, Dhairya Shah, Abhishek Shah, Mokshit Gala

    Abstract: Regarding the rising number of people suffering from mental health illnesses in today's society, the importance of mental health cannot be overstated. Wearable sensors, which are increasingly widely available, provide a potential way to track and comprehend mental health issues. These gadgets not only monitor everyday activities but also continuously record vital signs like heart rate, perhaps pro… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 8 pages

    MSC Class: 68T05

  24. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  25. arXiv:2310.07896  [pdf, other

    cs.RO cs.CV cs.LG

    NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration

    Authors: Ajay Sridhar, Dhruv Shah, Catherine Glossop, Sergey Levine

    Abstract: Robotic learning for navigation in unfamiliar environments needs to provide policies for both task-oriented navigation (i.e., reaching a goal that the robot has located), and task-agnostic exploration (i.e., searching for a goal in a novel setting). Typically, these roles are handled by separate models, for example by using subgoal proposals, planning, or separate navigation strategies. In this pa… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Project page https://general-navigation-models.github.io/nomad/

  26. arXiv:2310.04468  [pdf, other

    cs.CL cs.AI

    Validating transformers for redaction of text from electronic health records in real-world healthcare

    Authors: Zeljko Kraljevic, Anthony Shek, Joshua Au Yeung, Ewart Jonathan Sheldon, Mohammad Al-Agil, Haris Shuaib, Xi Bai, Kawsar Noor, Anoop D. Shah, Richard Dobson, James Teo

    Abstract: Protecting patient privacy in healthcare records is a top priority, and redaction is a commonly used method for obscuring directly identifiable information in text. Rule-based methods have been widely used, but their precision is often low causing over-redaction of text and frequently not being adaptable enough for non-standardised or unconventional structures of personal health information. Deep… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  27. arXiv:2310.00574  [pdf, other

    cs.AR cs.LG cs.PF

    YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs

    Authors: Cyrus Zhou, Zack Hassman, Ruize Xu, Dhirpal Shah, Vaugnn Richard, Yan**g Li

    Abstract: We address the challenges associated with deploying neural networks on CPUs, with a particular focus on minimizing inference time while maintaining accuracy. Our novel approach is to use the dataflow (i.e., computation order) of a neural network to explore data reuse opportunities using heuristic-guided analysis and a code generation framework, which enables exploration of various Single Instructi… ▽ More

    Submitted 23 November, 2023; v1 submitted 1 October, 2023; originally announced October 2023.

    ACM Class: B.8.2

  28. arXiv:2309.06413  [pdf, other

    cs.LG stat.ML

    On Computationally Efficient Learning of Exponential Family Distributions

    Authors: Abhin Shah, Devavrat Shah, Gregory W. Wornell

    Abstract: We consider the classical problem of learning, with arbitrary accuracy, the natural parameters of a $k$-parameter truncated \textit{minimal} exponential family from i.i.d. samples in a computationally and statistically efficient manner. We focus on the setting where the support as well as the natural parameters are appropriately bounded. While the traditional maximum likelihood estimator for this… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: An earlier version of this work arXiv:2110.15397 was presented at the Neural Information Processing Systems Conference in December 2021 titled "A Computationally Efficient Method for Learning Exponential Family Distributions"

  29. arXiv:2307.05538  [pdf, other

    cs.CL

    Advancements in Scientific Controllable Text Generation Methods

    Authors: Arnav Goel, Medha Hira, Avinash Anand, Siddhesh Bangar, Dr. Rajiv Ratn Shah

    Abstract: The previous work on controllable text generation is organized using a new schema we provide in this study. Seven components make up the schema, and each one is crucial to the creation process. To accomplish controlled generation for scientific literature, we describe the various modulation strategies utilised to modulate each of the seven components. We also offer a theoretical study and qualitat… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

  30. arXiv:2306.14846  [pdf, other

    cs.RO cs.CV cs.LG

    ViNT: A Foundation Model for Visual Navigation

    Authors: Dhruv Shah, Ajay Sridhar, Nitish Dashora, Kyle Stachowicz, Kevin Black, Noriaki Hirose, Sergey Levine

    Abstract: General-purpose pre-trained models ("foundation models") have enabled practitioners to produce generalizable solutions for individual machine learning problems with datasets that are significantly smaller than those required for learning from scratch. Such models are typically trained on large and diverse datasets with weak supervision, consuming much more training data than is available for any i… ▽ More

    Submitted 24 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted for oral presentation at CoRL 2023

  31. arXiv:2306.07305  [pdf, other

    cs.LG cs.AI q-fin.CP

    Making forecasting self-learning and adaptive -- Pilot forecasting rack

    Authors: Shaun D'Souza, Dheeraj Shah, Amareshwar Allati, Parikshit Soni

    Abstract: Retail sales and price projections are typically based on time series forecasting. For some product categories, the accuracy of demand forecasts achieved is low, negatively impacting inventory, transport, and replenishment planning. This paper presents our findings based on a proactive pilot exercise to explore ways to help retailers to improve forecast accuracy for such product categories. We e… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

  32. Machine Vision Using Cellphone Camera: A Comparison of deep networks for classifying three challenging denominations of Indian Coins

    Authors: Keyur D. Joshi, Dhruv Shah, Varshil Shah, Nilay Gandhi, Sanket J. Shah, Sanket B. Shah

    Abstract: Indian currency coins come in a variety of denominations. Off all the varieties Rs.1, RS.2, and Rs.5 have similar diameters. Majority of the coin styles in market circulation for denominations of Rs.1 and Rs.2 coins are nearly the same except for numerals on its reverse side. If a coin is resting on its obverse side, the correct denomination is not distinguishable by humans. Therefore, it was hypo… ▽ More

    Submitted 12 May, 2023; originally announced June 2023.

    Comments: 6 Pages, 4 Figures, 6 Tables, Conference paper

  33. arXiv:2306.04775  [pdf, other

    cs.LG stat.ML

    Exploiting Observation Bias to Improve Matrix Completion

    Authors: Yassir Jedra, Sean Mann, Charlotte Park, Devavrat Shah

    Abstract: We consider a variant of matrix completion where entries are revealed in a biased manner, adopting a model akin to that introduced by Ma and Chen. Instead of treating this observation bias as a disadvantage, as is typically the case, the goal is to exploit the shared information between the bias and the outcome of interest to improve predictions. Towards this, we consider a natural model where the… ▽ More

    Submitted 4 February, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  34. arXiv:2306.01874  [pdf, other

    cs.RO cs.CV cs.LG

    SACSoN: Scalable Autonomous Control for Social Navigation

    Authors: Noriaki Hirose, Dhruv Shah, Ajay Sridhar, Sergey Levine

    Abstract: Machine learning provides a powerful tool for building socially compliant robotic systems that go beyond simple predictive models of human behavior. By observing and understanding human interactions from past experiences, learning can enable effective social navigation behaviors directly from data. In this paper, our goal is to develop methods for training policies for socially unobtrusive navigat… ▽ More

    Submitted 25 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 11 pages, 15 figures, 4 tables

  35. arXiv:2306.01646  [pdf, other

    stat.ML cs.CY cs.LG

    Auditing for Human Expertise

    Authors: Rohan Alur, Loren Laine, Darrick K. Li, Manish Raghavan, Devavrat Shah, Dennis Shung

    Abstract: High-stakes prediction tasks (e.g., patient diagnosis) are often handled by trained human experts. A common source of concern about automation in these settings is that experts may exercise intuition that is difficult to model and/or have access to information (e.g., conversations with a patient) that is simply unavailable to a would-be algorithm. This raises a natural question whether human exper… ▽ More

    Submitted 27 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 30 pages, 10 figures. To appear in the proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  36. arXiv:2305.16491  [pdf, other

    cs.LG eess.SY stat.ML

    SAMoSSA: Multivariate Singular Spectrum Analysis with Stochastic Autoregressive Noise

    Authors: Abdullah Alomar, Munther Dahleh, Sean Mann, Devavrat Shah

    Abstract: The well-established practice of time series analysis involves estimating deterministic, non-stationary trend and seasonality components followed by learning the residual stochastic, stationary components. Recently, it has been shown that one can learn the deterministic non-stationary components accurately using multivariate Singular Spectrum Analysis (mSSA) in the absence of a correlated stationa… ▽ More

    Submitted 26 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  37. arXiv:2305.07168  [pdf, other

    cs.IR

    Local Life: Stay Informed Around You, A Scalable Geoparsing and Geotagging Approach to Serve Local News Worldwide

    Authors: Deven Santosh Shah, Gosuddin Kamaruddin Siddiqi, Shiying He, Radhika Bansal

    Abstract: Local news has become increasingly important in the news industry due to its various benefits. It offers local audiences information that helps them participate in their communities and interests. It also serves as a reliable source of factual reporting that can prevent misinformation. Moreover, it can influence national audiences as some local stories may have wider implications for politics, env… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 7 pages, 3 figures, 3 tables

  38. arXiv:2304.12404  [pdf, other

    cs.CL

    Semantic Tokenizer for Enhanced Natural Language Processing

    Authors: Sandeep Mehta, Darpan Shah, Ravindra Kulkarni, Cornelia Caragea

    Abstract: Traditionally, NLP performance improvement has been focused on improving models and increasing the number of model parameters. NLP vocabulary construction has remained focused on maximizing the number of words represented through subword regularization. We present a novel tokenizer that uses semantics to drive vocabulary construction. The tokenizer includes a trainer that uses stemming to enhance… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  39. arXiv:2304.10525  [pdf, other

    cs.CY cs.LG cs.SI stat.AP

    A User-Driven Framework for Regulating and Auditing Social Media

    Authors: Sarah H. Cen, Aleksander Madry, Devavrat Shah

    Abstract: People form judgments and make decisions based on the information that they observe. A growing portion of that information is not only provided, but carefully curated by social media platforms. Although lawmakers largely agree that platforms should not operate without any oversight, there is little consensus on how to regulate social media. There is consensus, however, that creating a strict, glob… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 21 pages, 4 figures

  40. arXiv:2304.09831  [pdf, other

    cs.RO cs.AI cs.LG

    FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing

    Authors: Kyle Stachowicz, Dhruv Shah, Arjun Bhorkar, Ilya Kostrikov, Sergey Levine

    Abstract: We present a system that enables an autonomous small-scale RC car to drive aggressively from visual observations using reinforcement learning (RL). Our system, FastRLAP (faster lap), trains autonomously in the real world, without human interventions, and without requiring any simulation or expert demonstrations. Our system integrates a number of important components to make this possible: we initi… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  41. arXiv:2303.13243  [pdf, other

    eess.AS cs.SD

    Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition

    Authors: Kai Liu, Hailiang Xiong, Gangqiang Yang, Zhengfeng Du, Yewen Cao, Danyal Shah

    Abstract: As one of the major branches of automatic speech recognition, attention-based models greatly improves the feature representation ability of the model. In particular, the multi-head mechanism is employed in the attention, ho** to learn speech features of more aspects in different attention subspaces. For speech recognition of complex languages, on the one hand, a small head size will lead to an o… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  42. arXiv:2303.02273  [pdf, other

    cs.LG cs.CV

    Learning Label Encodings for Deep Regression

    Authors: Deval Shah, Tor M. Aamodt

    Abstract: Deep regression networks are widely used to tackle the problem of predicting a continuous value for a given input. Task-specialized approaches for training regression networks have shown significant improvement over generic approaches, such as direct regression. More recently, a generic approach based on regression by binary classification using binary-encoded labels has shown significant improvem… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Published at ICLR 2023 (Notable top-25%)

    Journal ref: International Conference on Learning Representations 2023 (https://openreview.net/pdf?id=k60XE_b0Ix6)

  43. arXiv:2303.00983  [pdf, other

    cs.CV cs.GR eess.IV

    Using simulation to quantify the performance of automotive perception systems

    Authors: Zhenyi Liu, Devesh Shah, Alireza Rahimpour, Devesh Upadhyay, Joyce Farrell, Brian A Wandell

    Abstract: The design and evaluation of complex systems can benefit from a software simulation - sometimes called a digital twin. The simulation can be used to characterize system performance or to test its performance under conditions that are difficult to measure (e.g., nighttime for automotive perception systems). We describe the image system simulation software tools that we use to evaluate the performan… ▽ More

    Submitted 10 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  44. arXiv:2303.00855  [pdf

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents

    Authors: Wenlong Huang, Fei Xia, Dhruv Shah, Danny Driess, Andy Zeng, Yao Lu, Pete Florence, Igor Mordatch, Sergey Levine, Karol Hausman, Brian Ichter

    Abstract: Recent progress in large language models (LLMs) has demonstrated the ability to learn and leverage Internet-scale knowledge through pre-training with autoregressive models. Unfortunately, applying such models to settings with embodied agents, such as robots, is challenging due to their lack of experience with the physical world, inability to parse non-language observations, and ignorance of reward… ▽ More

    Submitted 11 December, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  45. arXiv:2302.11768  [pdf, other

    eess.AS cs.SD

    A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

    Authors: Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis

    Abstract: In this study, we present an approach to train a single speech enhancement network that can perform both personalized and non-personalized speech enhancement. This is achieved by incorporating a frame-wise conditioning input that specifies the type of enhancement output. To improve the quality of the enhanced output and mitigate oversuppression, we experiment with re-weighting frames by the presen… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  46. arXiv:2302.02228  [pdf, other

    stat.ML cs.LG

    Counterfactual Identifiability of Bijective Causal Models

    Authors: Arash Nasr-Esfahany, Mohammad Alizadeh, Devavrat Shah

    Abstract: We study counterfactual identifiability in causal models with bijective generation mechanisms (BGM), a class that generalizes several widely-used causal models in the literature. We establish their counterfactual identifiability for three common causal structures with unobserved confounding, and propose a practical learning method that casts learning a BGM as structured generative modeling. Learne… ▽ More

    Submitted 6 June, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

  47. arXiv:2302.02096  [pdf, other

    cs.LG stat.ML

    Matrix Estimation for Individual Fairness

    Authors: Cindy Y. Zhang, Sarah H. Cen, Devavrat Shah

    Abstract: In recent years, multiple notions of algorithmic fairness have arisen. One such notion is individual fairness (IF), which requires that individuals who are similar receive similar treatment. In parallel, matrix estimation (ME) has emerged as a natural paradigm for handling noisy data with missing values. In this work, we connect the two concepts. We show that pre-processing data using ME can impro… ▽ More

    Submitted 3 August, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: 23 pages, 3 figures, ICML 2023

  48. arXiv:2301.08146  [pdf, other

    cs.IR cs.CL cs.LG

    What's happening in your neighborhood? A Weakly Supervised Approach to Detect Local News

    Authors: Deven Santosh Shah, Shiying He, Gosuddin Kamaruddin Siddiqi, Radhika Bansal

    Abstract: Local news articles are a subset of news that impact users in a geographical area, such as a city, county, or state. Detecting local news (Step 1) and subsequently deciding its geographical location as well as radius of impact (Step 2) are two important steps towards accurate local news recommendation. Naive rule-based methods, such as detecting city names from the news title, tend to give erroneo… ▽ More

    Submitted 5 June, 2024; v1 submitted 14 January, 2023; originally announced January 2023.

    Comments: 8 pages, 2 figures, 5 tables

  49. arXiv:2212.08244  [pdf, other

    cs.RO cs.CV cs.LG

    Offline Reinforcement Learning for Visual Navigation

    Authors: Dhruv Shah, Arjun Bhorkar, Hrish Leen, Ilya Kostrikov, Nick Rhinehart, Sergey Levine

    Abstract: Reinforcement learning can enable robots to navigate to distant goals while optimizing user-specified reward functions, including preferences for following lanes, staying on paved paths, or avoiding freshly mowed grass. However, online learning from trial-and-error for real-world robots is logistically challenging, and methods that instead can utilize existing datasets of robotic navigation data c… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Project page https://sites.google.com/view/revind/home

  50. arXiv:2212.06759  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results

    Authors: Sergey Levine, Dhruv Shah

    Abstract: Navigation is one of the most heavily studied problems in robotics, and is conventionally approached as a geometric map** and planning problem. However, real-world navigation presents a complex set of physical challenges that defies simple geometric abstractions. Machine learning offers a promising way to go beyond geometry and conventional planning, allowing for navigational systems that make d… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: Final print version is here: https://royalsocietypublishing.org/doi/10.1098/rstb.2021.0447

    Journal ref: Philosophical Transactions of the Royal Society B, Volume 378 Issue 1869, 2022