Showing 1–50 of 306 results for author: Shah, D

  1. arXiv:2406.19287  [pdf, other]

    astro-ph.HE

    Isotropy of cosmic rays beyond $10^{20}$ eV favors their heavy mass composition

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He, et al. (118 additional authors not shown)

    Abstract: We report an estimation of the injected mass composition of ultra-high energy cosmic rays (UHECRs) at energies higher than 10 EeV. The composition is inferred from an energy-dependent sky distribution of UHECR events observed by the Telescope Array surface detector by comparing it to the Large Scale Structure of the local Universe. In the case of negligible extra-galactic magnetic fields the resul…

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures, accepted for publication in PRL

  2. arXiv:2406.19286  [pdf, other]

    astro-ph.HE

    Mass composition of ultra-high energy cosmic rays from distribution of their arrival directions with the Telescope Array

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He, et al. (118 additional authors not shown)

    Abstract: We use a new method to estimate the injected mass composition of ultrahigh-energy cosmic rays (UHECRs) at energies higher than 10 EeV. The method is based on comparison of the energy-dependent distribution of cosmic ray arrival directions as measured by the Telescope Array experiment (TA) with that calculated in a given putative model of UHECR under the assumption that sources trace the large-scale struc…

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 18 pages, 11 figures, accepted for publication in PRD

  3. arXiv:2406.17232  [pdf, other]

    cs.CL

    Beyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks

    Authors: Yun-Shiuan Chuang, Zach Studdiford, Krirk Nirunwiroj, Agam Goyal, Vincent V. Frigo, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Creating human-like large language model (LLM) agents is crucial for faithful social simulation. Having LLMs role-play based on demographic information sometimes improves human likeness but often does not. This study assessed whether LLM alignment with human behavior can be improved by integrating information from empirically-derived human belief networks. Using data from a human survey, we estima…

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2405.20088  [pdf, other]

    stat.AP stat.ME

    Personalized Predictions from Population Level Experiments: A Study on Alzheimer's Disease

    Authors: Dennis Shen, Anish Agarwal, Vishal Misra, Bjoern Schelter, Devavrat Shah, Helen Shiells, Claude Wischik

    Abstract: The purpose of this article is to infer patient level outcomes from population level randomized control trials (RCTs). In this pursuit, we utilize the recently proposed synthetic nearest neighbors (SNN) estimator. At its core, SNN leverages information across patients to impute missing data associated with each patient of interest. We focus on two types of missing data: (i) unrecorded outcomes fro…

    Submitted 30 May, 2024; originally announced May 2024.

  5. arXiv:2405.18343  [pdf, other]

    q-bio.TO

    On in-silico estimation of left ventricular end-diastolic pressure from cardiac strains

    Authors: Emilio A. Mendiola, Raza Rana Mehdi, Dipan J. Shah, Reza Avazmohammadi

    Abstract: Left ventricular diastolic dysfunction (LVDD) is a group of diseases that adversely affect the passive phase of the cardiac cycle and can lead to heart failure. While left ventricular end-diastolic pressure (LVEDP) is a valuable prognostic measure in LVDD patients, traditional invasive methods of measuring LVEDP present risks and limitations, highlighting the need for alternative approaches. This…

    Submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2405.18334  [pdf, other]

    cs.DB cs.CV cs.LG

    SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

    Authors: Renzhi Wu, Pramod Chunduri, Dristi J Shah, Ashmitha Julius Aravind, Ali Payani, Xu Chu, Joy Arulraj, Kexin Rong

    Abstract: In this paper, we will present SketchQL, a video database management system (VDBMS) for retrieving video moments with a sketch-based query interface. This novel interface allows users to specify object trajectory events with simple mouse drag-and-drop operations. Users can use trajectories of single objects as building blocks to compose complex events. Using a pre-trained model that encodes trajec…

    Submitted 30 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Journal ref: Published on International Conference on Very Large Databases 2024

  7. arXiv:2405.09530  [pdf, other]

    cs.CY cs.CV cs.LG

    A community palm model

    Authors: Nicholas Clinton, Andreas Vollrath, Remi D'annunzio, Desheng Liu, Henry B. Glick, Adrià Descals, Alicia Sullivan, Oliver Guinan, Jacob Abramowitz, Fred Stolle, Chris Goodman, Tanya Birch, David Quinn, Olga Danylo, Tijs Lips, Daniel Coelho, Enikoe Bihari, Bryce Cronkite-Ratcliff, Ate Poortinga, Atena Haghighattalab, Evan Notman, Michael DeWitt, Aaron Yonas, Gennadii Donchyts, Devaja Shah, et al. (5 additional authors not shown)

    Abstract: Palm oil production has been identified as one of the major drivers of deforestation for tropical countries. To meet supply chain objectives, commodity producers and other stakeholders need timely information of land cover dynamics in their supply shed. However, such data are difficult to obtain from suppliers who may lack digital geographic representations of their supply sheds and production loc…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: v0

  8. arXiv:2405.09155  [pdf, other]

    cs.ET

    TunnelSense: Low-power, Non-Contact Sensing using Tunnel Diodes

    Authors: Lim Chang Quan Thaddeus, C. Rajashekar Reddy, Yuvraj Singh Bhadauria, Dhairya Shah, Manoj Gulati, Ambuj Varshney

    Abstract: Sensing the motion of physical objects in an environment enables numerous applications, from tracking occupancy in buildings and monitoring vital signs to diagnosing faults in machines. Typically, these application scenarios involve attaching a sensor, such as an accelerometer, to the object of interest, like a wearable device that tracks our steps. However, many of these scenarios require trackin…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: This work is accepted at IEEE RFID 2024

  9. arXiv:2404.19478  [pdf, ps, other]

    cond-mat.stat-mech math-ph

    Non Gaussian statistics in static and dynamic Galton boards

    Authors: Dhruv Shah, R. K. Shishir, Manjaree, Shreya Pithva, T. Y. Booritth Balaji, Rahul Agarwal Singh

    Abstract: Perturbing the arrangements of pegs on a static Galton board can result in non-trivial stationary distributions, which in the continuum limit correspond to departure from regular Gaussian behavior. Two such distributions are obtained. Further, the distributions generated for a dynamic Galton board under external forcing in a general direction are obtained by solution of the corresponding stochasti…

    Submitted 2 July, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 10 pages

  10. arXiv:2404.17570  [pdf, other]

    quant-ph physics.app-ph physics.optics

    A manufacturable platform for photonic quantum computing

    Authors: Koen Alexander, Andrea Bahgat, Avishai Benyamini, Dylan Black, Damien Bonneau, Stanley Burgos, Ben Burridge, Geoff Campbell, Gabriel Catalano, Alex Ceballos, Chia-Ming Chang, CJ Chung, Fariba Danesh, Tom Dauer, Michael Davis, Eric Dudley, Ping Er-Xuan, Josep Fargas, Alessandro Farsi, Colleen Fenrich, Jonathan Frazer, Masaya Fukami, Yogeeswaran Ganesan, Gary Gibson, Mercedes Gimeno-Segovia, et al. (70 additional authors not shown)

    Abstract: Whilst holding great promise for low noise, ease of operation and networking, useful photonic quantum computing has been precluded by the need for beyond-state-of-the-art components, manufactured by the millions. Here we introduce a manufacturable platform for quantum computing with photons. We benchmark a set of monolithically-integrated silicon photonics-based modules to generate, manipulate, ne…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 8 pages, 5 figures

  11. arXiv:2404.13008  [pdf, other]

    cs.SD eess.AS

    Enhancing Generalization in Audio Deepfake Detection: A Neural Collapse based Sampling and Training Approach

    Authors: Mohammed Yousif, Jonat John Mathew, Huzaifa Pallan, Agamjeet Singh Padda, Syed Daniyal Shah, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

    Abstract: Generalization in audio deepfake detection presents a significant challenge, with models trained on specific datasets often struggling to detect deepfakes generated under varying conditions and unknown algorithms. While collectively training a model using diverse datasets can enhance its generalization ability, it comes with high computational costs. To address this, we propose a neural collapse-b…

    Submitted 19 April, 2024; originally announced April 2024.

  12. arXiv:2403.10912  [pdf]

    cs.CV cs.LG

    Automatic location detection based on deep learning

    Authors: Anjali Karangiya, Anirudh Sharma, Divax Shah, Kartavya Badgujar, Dr. Chintan Thacker, Dainik Dave

    Abstract: The proliferation of digital images and the advancements in deep learning have paved the way for innovative solutions in various domains, especially in the field of image classification. Our project presents an in-depth study and implementation of an image classification system specifically tailored to identify and classify images of Indian cities. Drawing from an extensive dataset, our model clas…

    Submitted 16 March, 2024; originally announced March 2024.

  13. arXiv:2403.09611  [pdf, other]

    cs.CV cs.CL cs.LG

    MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

    Authors: Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, et al. (7 additional authors not shown)

    Abstract: In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various architecture components and data choices. Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial design lessons. For example, we demonstrate that for la…

    Submitted 18 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  14. arXiv:2403.00991  [pdf, other]

    cs.RO cs.CV cs.LG

    SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation

    Authors: Noriaki Hirose, Dhruv Shah, Kyle Stachowicz, Ajay Sridhar, Sergey Levine

    Abstract: Autonomous self-improving robots that interact and improve with experience are key to the real-world deployment of robotic systems. In this paper, we propose an online learning method, SELFI, that leverages online robot experience to rapidly fine-tune pre-trained control policies efficiently. SELFI applies online model-free reinforcement learning on top of offline model-based learning to bring out…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 11 pages, 13 figures, 2 tables

  15. arXiv:2402.19432  [pdf, other]

    cs.RO

    Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation

    Authors: Jonathan Yang, Catherine Glossop, Arjun Bhorkar, Dhruv Shah, Quan Vuong, Chelsea Finn, Dorsa Sadigh, Sergey Levine

    Abstract: Recent years in robotics and imitation learning have shown remarkable progress in training large-scale foundation models by leveraging data across a multitude of embodiments. The success of such policies might lead us to wonder: just how diverse can the robots in the training set be while still facilitating positive transfer? In this work, we study this question in the context of heterogeneous emb…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 16 pages, 9 figures

    MSC Class: 68T40 ACM Class: I.2.9

  16. arXiv:2402.14959  [pdf, other]

    stat.AP cs.CY stat.ML

    A Causal Framework to Evaluate Racial Bias in Law Enforcement Systems

    Authors: Jessy Xinyi Han, Andrew Miller, S. Craig Watkins, Christopher Winship, Fotini Christia, Devavrat Shah

    Abstract: We are interested in developing a data-driven method to evaluate race-induced biases in law enforcement systems. While the recent works have addressed this question in the context of police-civilian interactions using police stop data, they have two key limitations. First, bias can only be properly quantified if true criminality is accounted for in addition to race, but it is absent in prior works…

    Submitted 20 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  17. arXiv:2402.05983  [pdf, other]

    eess.IV cs.LG physics.app-ph physics.ins-det

    Capability enhancement of the X-ray micro-tomography system via ML-assisted approaches

    Authors: Dhruvi Shah, Shruti Mehta, Ashish Agrawal, Shishir Purohit, Bhaskar Chaudhury

    Abstract: Ring artifacts in X-ray micro-CT images are one of the primary causes of concern in their accurate visual interpretation and quantitative analysis. The geometry of X-ray micro-CT scanners is similar to the medical CT machines, except the sample is rotated with a stationary source and detector. The ring artifacts are caused by a defect or non-linear responses in detector pixels during the MicroCT d…

    Submitted 8 February, 2024; originally announced February 2024.

  18. arXiv:2402.03390  [pdf, other]

    eess.IV cs.AI cs.CV cs.NI

    PixelGen: Rethinking Embedded Camera Systems

    Authors: Kunjun Li, Manoj Gulati, Steven Waskito, Dhairya Shah, Shantanu Chakrabarty, Ambuj Varshney

    Abstract: Embedded camera systems are ubiquitous, representing the most widely deployed example of a wireless embedded system. They capture a representation of the world - the surroundings illuminated by visible or infrared light. Despite their widespread usage, the architecture of embedded camera systems has remained unchanged, which leads to limitations. They visualize only a tiny portion of the world. Ad…

    Submitted 4 February, 2024; originally announced February 2024.

  19. arXiv:2402.00793  [pdf, other]

    cs.LG cs.AI cs.HC

    Human Expertise in Algorithmic Prediction

    Authors: Rohan Alur, Manish Raghavan, Devavrat Shah

    Abstract: We introduce a novel framework for incorporating human expertise into algorithmic predictions. Our approach focuses on the use of human judgment to distinguish inputs which `look the same' to any feasible predictive algorithm. We argue that this framing clarifies the problem of human/AI collaboration in prediction tasks, as experts often have access to information -- particularly subjective inform…

    Submitted 22 May, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 35 pages, 13 figures

  20. arXiv:2401.17711  [pdf]

    cs.HC cs.AI

    Prediction of multitasking performance post-longitudinal tDCS via EEG-based functional connectivity and machine learning methods

    Authors: Akash K Rao, Shashank Uttrani, Vishnu K Menon, Darshil Shah, Arnav Bhavsar, Shubhajit Roy Chowdhury, Varun Dutt

    Abstract: Predicting and understanding the changes in cognitive performance, especially after a longitudinal intervention, is a fundamental goal in neuroscience. Longitudinal brain stimulation-based interventions like transcranial direct current stimulation (tDCS) induce short-term changes in the resting membrane potential and influence cognitive processes. However, very little research has been conducted o…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 16 pages, presented at the 30th International Conference on Neural Information Processing (ICONIP2023), Changsha, China, November 2023

  21. arXiv:2312.17270  [pdf, other]

    cs.CR cs.LG

    Anticipated Network Surveillance -- An extrapolated study to predict cyber-attacks using Machine Learning and Data Analytics

    Authors: Aviral Srivastava, Dhyan Thakkar, Dr. Sharda Valiveti, Dr. Pooja Shah, Dr. Gaurang Raval

    Abstract: Machine learning and data mining techniques are utilized for enhancement of the security of any network. Researchers used machine learning for pattern detection, anomaly detection, dynamic policy setting, etc. The methods allow the program to learn from data and make decisions without human intervention, consuming a huge training period and computation power. This paper discusses a novel technique…

    Submitted 26 December, 2023; originally announced December 2023.

  22. arXiv:2312.00894  [pdf, other]

    cs.SE

    Leveraging Large Language Models to Improve REST API Testing

    Authors: Myeongsoo Kim, Tyler Stennett, Dhruv Shah, Saurabh Sinha, Alessandro Orso

    Abstract: The widespread adoption of REST APIs, coupled with their growing complexity and size, has led to the need for automated REST API testing tools. Current tools focus on the structured data in REST API specifications but often neglect valuable insights available in unstructured natural-language descriptions in the specifications, which leads to suboptimal test coverage. Recently, to address this gap,…

    Submitted 29 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: To be published in the 46th IEEE/ACM International Conference on Software Engineering - New Ideas and Emerging Results Track (ICSE-NIER 2024)

  23. arXiv:2311.09665  [pdf, other]

    cs.CL

    The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

    Authors: Yun-Shiuan Chuang, Siddharth Suresh, Nikunj Harlalka, Agam Goyal, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Human groups are able to converge on more accurate beliefs through deliberation, even in the presence of polarization and partisan bias -- a phenomenon known as the "wisdom of partisan crowds." Generated agents powered by Large Language Models (LLMs) are increasingly used to simulate human collective behavior, yet few benchmarks exist for evaluating their dynamics against the behavior of human gro…

    Submitted 16 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  24. arXiv:2311.09618  [pdf, other]

    physics.soc-ph cs.CL

    Simulating Opinion Dynamics with Networks of LLM-based Agents

    Authors: Yun-Shiuan Chuang, Agam Goyal, Nikunj Harlalka, Siddharth Suresh, Robert Hawkins, Sijia Yang, Dhavan Shah, Junjie Hu, Timothy T. Rogers

    Abstract: Accurately simulating human opinion dynamics is crucial for understanding a variety of societal phenomena, including polarization and the spread of misinformation. However, the agent-based models (ABMs) commonly used for such simulations often over-simplify human behavior. We propose a new approach to simulating opinion dynamics based on populations of Large Language Models (LLMs). Our findings re…

    Submitted 31 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  25. arXiv:2311.06430  [pdf, other]

    cs.RO

    GOAT: GO to Any Thing

    Authors: Matthew Chang, Theophile Gervet, Mukul Khanna, Sriram Yenamandra, Dhruv Shah, So Yeon Min, Kavit Shah, Chris Paxton, Saurabh Gupta, Dhruv Batra, Roozbeh Mottaghi, Jitendra Malik, Devendra Singh Chaplot

    Abstract: In deployment scenarios such as homes and warehouses, mobile robots are expected to autonomously navigate for extended periods, seamlessly executing tasks articulated in terms that are intuitively understandable by human operators. We present GO To Any Thing (GOAT), a universal navigation system capable of tackling these requirements with three key features: a) Multimodal: it can tackle goals spec…

    Submitted 10 November, 2023; originally announced November 2023.

  26. arXiv:2311.02287  [pdf, other]

    cs.LG cs.AI

    Predicting Ground Reaction Force from Inertial Sensors

    Authors: Bowen Song, Marco Paolieri, Harper E. Stewart, Leana Golubchik, Jill L. McNitt-Gray, Vishal Misra, Devavrat Shah

    Abstract: The study of ground reaction forces (GRF) is used to characterize the mechanical loading experienced by individuals in movements such as running, which is clinically applicable to identify athletes at risk for stress-related injuries. Our aim in this paper is to determine if data collected with inertial measurement units (IMUs), that can be worn by athletes during outdoor runs, can be used to pred…

    Submitted 3 November, 2023; originally announced November 2023.

  27. arXiv:2310.12183  [pdf, other]

    math.OC cs.AI

    An Optimistic-Robust Approach for Dynamic Positioning of Omnichannel Inventories

    Authors: Pavithra Harsha, Shivaram Subramanian, Ali Koc, Mahesh Ramakrishna, Brian Quanz, Dhruv Shah, Chandra Narayanaswami

    Abstract: We introduce a new class of data-driven and distribution-free optimistic-robust bimodal inventory optimization (BIO) strategy to effectively allocate inventory across a retail chain to meet time-varying, uncertain omnichannel demand. While prior Robust optimization (RO) methods emphasize the downside, i.e., worst-case adversarial demand, BIO also considers the upside to remain resilient like RO wh…

    Submitted 17 October, 2023; originally announced October 2023.

  28. arXiv:2310.10103  [pdf, other]

    cs.RO cs.AI cs.CL cs.LG

    Navigation with Large Language Models: Semantic Guesswork as a Heuristic for Planning

    Authors: Dhruv Shah, Michael Equi, Blazej Osinski, Fei Xia, Brian Ichter, Sergey Levine

    Abstract: Navigation in unfamiliar environments presents a major challenge for robots: while mapping and planning techniques can be used to build up a representation of the world, quickly discovering a path to a desired goal in unfamiliar settings with such methods often requires lengthy mapping and exploration. Humans can rapidly navigate new environments, particularly indoor environments that are laid out…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Videos, code, and an interactive Colab notebook that runs in your browser https://sites.google.com/view/lfg-nav/

  29. arXiv:2310.09277  [pdf, other]

    cs.LG

    A Hybrid Approach for Depression Classification: Random Forest-ANN Ensemble on Motor Activity Signals

    Authors: Anket Patil, Dhairya Shah, Abhishek Shah, Mokshit Gala

    Abstract: Regarding the rising number of people suffering from mental health illnesses in today's society, the importance of mental health cannot be overstated. Wearable sensors, which are increasingly widely available, provide a potential way to track and comprehend mental health issues. These gadgets not only monitor everyday activities but also continuously record vital signs like heart rate, perhaps pro…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 8 pages

    MSC Class: 68T05

  30. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  31. arXiv:2310.07896  [pdf, other]

    cs.RO cs.CV cs.LG

    NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration

    Authors: Ajay Sridhar, Dhruv Shah, Catherine Glossop, Sergey Levine

    Abstract: Robotic learning for navigation in unfamiliar environments needs to provide policies for both task-oriented navigation (i.e., reaching a goal that the robot has located), and task-agnostic exploration (i.e., searching for a goal in a novel setting). Typically, these roles are handled by separate models, for example by using subgoal proposals, planning, or separate navigation strategies. In this pa…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Project page https://general-navigation-models.github.io/nomad/

  32. arXiv:2310.04468  [pdf, other]

    cs.CL cs.AI

    Validating transformers for redaction of text from electronic health records in real-world healthcare

    Authors: Zeljko Kraljevic, Anthony Shek, Joshua Au Yeung, Ewart Jonathan Sheldon, Mohammad Al-Agil, Haris Shuaib, Xi Bai, Kawsar Noor, Anoop D. Shah, Richard Dobson, James Teo

    Abstract: Protecting patient privacy in healthcare records is a top priority, and redaction is a commonly used method for obscuring directly identifiable information in text. Rule-based methods have been widely used, but their precision is often low causing over-redaction of text and frequently not being adaptable enough for non-standardised or unconventional structures of personal health information. Deep…

    Submitted 5 October, 2023; originally announced October 2023.

  33. arXiv:2310.00574  [pdf, other]

    cs.AR cs.LG cs.PF

    YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs

    Authors: Cyrus Zhou, Zack Hassman, Ruize Xu, Dhirpal Shah, Vaugnn Richard, Yanjing Li

    Abstract: We address the challenges associated with deploying neural networks on CPUs, with a particular focus on minimizing inference time while maintaining accuracy. Our novel approach is to use the dataflow (i.e., computation order) of a neural network to explore data reuse opportunities using heuristic-guided analysis and a code generation framework, which enables exploration of various Single Instructi…

    Submitted 23 November, 2023; v1 submitted 1 October, 2023; originally announced October 2023.

    ACM Class: B.8.2

  34. arXiv:2309.06413  [pdf, other]

    cs.LG stat.ML

    On Computationally Efficient Learning of Exponential Family Distributions

    Authors: Abhin Shah, Devavrat Shah, Gregory W. Wornell

    Abstract: We consider the classical problem of learning, with arbitrary accuracy, the natural parameters of a $k$-parameter truncated \textit{minimal} exponential family from i.i.d. samples in a computationally and statistically efficient manner. We focus on the setting where the support as well as the natural parameters are appropriately bounded. While the traditional maximum likelihood estimator for this…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: An earlier version of this work arXiv:2110.15397 was presented at the Neural Information Processing Systems Conference in December 2021 titled "A Computationally Efficient Method for Learning Exponential Family Distributions"

  35. arXiv:2308.09207  [pdf, other]

    math.ST stat.ME

    Average partial effect estimation using double machine learning

    Authors: Harvey Klyne, Rajen D. Shah

    Abstract: Single-parameter summaries of variable effects are desirable for ease of interpretation, but linear models, which would deliver these, may fit poorly to the data. A modern approach is to estimate the average partial effect -- the average slope of the regression function with respect to the predictor of interest -- using a doubly robust semiparametric procedure. Most existing work has focused on sp…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 61 pages, 4 figures

  36. arXiv:2307.11401  [pdf, other]

    stat.ME math.ST

    Sandwich Boosting for Accurate Estimation in Partially Linear Models for Grouped Data

    Authors: Elliot H. Young, Rajen D. Shah

    Abstract: We study partially linear models in settings where observations are arranged in independent groups but may exhibit within-group dependence. Existing approaches estimate linear model parameters through weighted least squares, with optimal weights (given by the inverse covariance of the response, conditional on the covariates) typically estimated by maximising a (restricted) likelihood from random e…

    Submitted 15 April, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

  37. arXiv:2307.05538  [pdf, other]

    cs.CL

    Advancements in Scientific Controllable Text Generation Methods

    Authors: Arnav Goel, Medha Hira, Avinash Anand, Siddhesh Bangar, Dr. Rajiv Ratn Shah

    Abstract: The previous work on controllable text generation is organized using a new schema we provide in this study. Seven components make up the schema, and each one is crucial to the creation process. To accomplish controlled generation for scientific literature, we describe the various modulation strategies utilised to modulate each of the seven components. We also offer a theoretical study and qualitat…

    Submitted 8 July, 2023; originally announced July 2023.

  38. arXiv:2306.14846  [pdf, other]

    cs.RO cs.CV cs.LG

    ViNT: A Foundation Model for Visual Navigation

    Authors: Dhruv Shah, Ajay Sridhar, Nitish Dashora, Kyle Stachowicz, Kevin Black, Noriaki Hirose, Sergey Levine

    Abstract: General-purpose pre-trained models ("foundation models") have enabled practitioners to produce generalizable solutions for individual machine learning problems with datasets that are significantly smaller than those required for learning from scratch. Such models are typically trained on large and diverse datasets with weak supervision, consuming much more training data than is available for any i…

    Submitted 24 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted for oral presentation at CoRL 2023

  39. arXiv:2306.07305  [pdf, other]

    cs.LG cs.AI q-fin.CP

    Making forecasting self-learning and adaptive -- Pilot forecasting rack

    Authors: Shaun D'Souza, Dheeraj Shah, Amareshwar Allati, Parikshit Soni

    Abstract: Retail sales and price projections are typically based on time series forecasting. For some product categories, the accuracy of demand forecasts achieved is low, negatively impacting inventory, transport, and replenishment planning. This paper presents our findings based on a proactive pilot exercise to explore ways to help retailers to improve forecast accuracy for such product categories. We e…

    Submitted 11 June, 2023; originally announced June 2023.

  40. Machine Vision Using Cellphone Camera: A Comparison of deep networks for classifying three challenging denominations of Indian Coins

    Authors: Keyur D. Joshi, Dhruv Shah, Varshil Shah, Nilay Gandhi, Sanket J. Shah, Sanket B. Shah

    Abstract: Indian currency coins come in a variety of denominations. Of all the varieties, Rs.1, Rs.2, and Rs.5 have similar diameters. The majority of the coin styles in market circulation for the Rs.1 and Rs.2 denominations are nearly the same except for the numerals on their reverse side. If a coin is resting on its obverse side, the correct denomination is not distinguishable by humans. Therefore, it was hypo…

    Submitted 12 May, 2023; originally announced June 2023.

    Comments: 6 Pages, 4 Figures, 6 Tables, Conference paper

  41. arXiv:2306.04775  [pdf, other

    cs.LG stat.ML

    Exploiting Observation Bias to Improve Matrix Completion

    Authors: Yassir Jedra, Sean Mann, Charlotte Park, Devavrat Shah

    Abstract: We consider a variant of matrix completion where entries are revealed in a biased manner, adopting a model akin to that introduced by Ma and Chen. Instead of treating this observation bias as a disadvantage, as is typically the case, the goal is to exploit the shared information between the bias and the outcome of interest to improve predictions. Towards this, we consider a natural model where the…

    Submitted 4 February, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  42. arXiv:2306.01874  [pdf, other

    cs.RO cs.CV cs.LG

    SACSoN: Scalable Autonomous Control for Social Navigation

    Authors: Noriaki Hirose, Dhruv Shah, Ajay Sridhar, Sergey Levine

    Abstract: Machine learning provides a powerful tool for building socially compliant robotic systems that go beyond simple predictive models of human behavior. By observing and understanding human interactions from past experiences, learning can enable effective social navigation behaviors directly from data. In this paper, our goal is to develop methods for training policies for socially unobtrusive navigat…

    Submitted 25 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 11 pages, 15 figures, 4 tables

  43. arXiv:2306.01646  [pdf, other

    stat.ML cs.CY cs.LG

    Auditing for Human Expertise

    Authors: Rohan Alur, Loren Laine, Darrick K. Li, Manish Raghavan, Devavrat Shah, Dennis Shung

    Abstract: High-stakes prediction tasks (e.g., patient diagnosis) are often handled by trained human experts. A common source of concern about automation in these settings is that experts may exercise intuition that is difficult to model and/or have access to information (e.g., conversations with a patient) that is simply unavailable to a would-be algorithm. This raises a natural question whether human exper…

    Submitted 27 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 30 pages, 10 figures. To appear in the proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  44. arXiv:2305.16491  [pdf, other

    cs.LG eess.SY stat.ML

    SAMoSSA: Multivariate Singular Spectrum Analysis with Stochastic Autoregressive Noise

    Authors: Abdullah Alomar, Munther Dahleh, Sean Mann, Devavrat Shah

    Abstract: The well-established practice of time series analysis involves estimating deterministic, non-stationary trend and seasonality components followed by learning the residual stochastic, stationary components. Recently, it has been shown that one can learn the deterministic non-stationary components accurately using multivariate Singular Spectrum Analysis (mSSA) in the absence of a correlated stationa…

    Submitted 26 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  45. arXiv:2305.07168  [pdf, other

    cs.IR

    Local Life: Stay Informed Around You, A Scalable Geoparsing and Geotagging Approach to Serve Local News Worldwide

    Authors: Deven Santosh Shah, Gosuddin Kamaruddin Siddiqi, Shiying He, Radhika Bansal

    Abstract: Local news has become increasingly important in the news industry due to its various benefits. It offers local audiences information that helps them participate in their communities and interests. It also serves as a reliable source of factual reporting that can prevent misinformation. Moreover, it can influence national audiences as some local stories may have wider implications for politics, env…

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 7 pages, 3 figures, 3 tables

  46. arXiv:2304.12404  [pdf, other

    cs.CL

    Semantic Tokenizer for Enhanced Natural Language Processing

    Authors: Sandeep Mehta, Darpan Shah, Ravindra Kulkarni, Cornelia Caragea

    Abstract: Traditionally, NLP performance improvement has been focused on improving models and increasing the number of model parameters. NLP vocabulary construction has remained focused on maximizing the number of words represented through subword regularization. We present a novel tokenizer that uses semantics to drive vocabulary construction. The tokenizer includes a trainer that uses stemming to enhance…

    Submitted 24 April, 2023; originally announced April 2023.

  47. arXiv:2304.10525  [pdf, other

    cs.CY cs.LG cs.SI stat.AP

    A User-Driven Framework for Regulating and Auditing Social Media

    Authors: Sarah H. Cen, Aleksander Madry, Devavrat Shah

    Abstract: People form judgments and make decisions based on the information that they observe. A growing portion of that information is not only provided, but carefully curated by social media platforms. Although lawmakers largely agree that platforms should not operate without any oversight, there is little consensus on how to regulate social media. There is consensus, however, that creating a strict, glob…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 21 pages, 4 figures

  48. arXiv:2304.09831  [pdf, other

    cs.RO cs.AI cs.LG

    FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing

    Authors: Kyle Stachowicz, Dhruv Shah, Arjun Bhorkar, Ilya Kostrikov, Sergey Levine

    Abstract: We present a system that enables an autonomous small-scale RC car to drive aggressively from visual observations using reinforcement learning (RL). Our system, FastRLAP (faster lap), trains autonomously in the real world, without human interventions, and without requiring any simulation or expert demonstrations. Our system integrates a number of important components to make this possible: we initi…

    Submitted 19 April, 2023; originally announced April 2023.

  49. arXiv:2303.13243  [pdf, other

    eess.AS cs.SD

    Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition

    Authors: Kai Liu, Hailiang Xiong, Gangqiang Yang, Zhengfeng Du, Yewen Cao, Danyal Shah

    Abstract: As one of the major branches of automatic speech recognition, attention-based models greatly improve the feature representation ability of the model. In particular, the multi-head mechanism is employed in the attention, hoping to learn speech features of more aspects in different attention subspaces. For speech recognition of complex languages, on the one hand, a small head size will lead to an o…

    Submitted 23 March, 2023; originally announced March 2023.

  50. arXiv:2303.04558  [pdf, ps, other

    math.PR

    Remarks on Differential Inclusion limits of Stochastic Approximation

    Authors: Vivek Borkar, Dhruv Shah

    Abstract: For stochastic approximation algorithms with discontinuous dynamics, it is shown that under suitable distributional assumptions, the interpolated iterates track a Filippov solution of the limiting differential inclusion. In addition, we give an alternative control-theoretic approach to recent results of [7] on certain limiting empirical measures associated with the iteration.

    Submitted 1 September, 2023; v1 submitted 8 March, 2023; originally announced March 2023.