Showing 1–50 of 163 results for author: Srivastava, S

Searching in archive cs.
  1. arXiv:2405.20772  [pdf]

    cs.LG cs.CY

    Reinforcement Learning for Sociohydrology

    Authors: Tirthankar Roy, Shivendra Srivastava, Beichen Zhang

    Abstract: In this study, we discuss how reinforcement learning (RL) provides an effective and efficient framework for solving sociohydrology problems. The efficacy of RL for these types of problems is evident because of its ability to update policies in an iterative manner - something that is also foundational to sociohydrology, where we are interested in representing the co-evolution of human-water interac… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  2. arXiv:2405.15907  [pdf, other]

    cs.AI

    Belief-State Query Policies for Planning With Preferences Under Partial Observability

    Authors: Daniel Bramblett, Siddharth Srivastava

    Abstract: Planning in real-world settings often entails addressing partial observability while aligning with users' preferences. We present a novel framework for expressing users' preferences about agent behavior in a partially observable setting using parameterized belief-state query (BSQ) preferences in the setting of goal-oriented partially observable Markov decision processes (gPOMDPs). We present the f… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2405.13004  [pdf, other]

    cs.CL cs.AI

    MathDivide: Improved mathematical reasoning by large language models

    Authors: Saksham Sahai Srivastava, Ashutosh Gandhi

    Abstract: Large language models have been proven to be capable of handling complex linguistic and cognitive tasks. Therefore their usage has been extended to tasks requiring logical reasoning ability such as Mathematics. In this paper, we propose a prompting technique called MathDivide that breaks down the mathematical problem into simpler subproblems. Each of the subproblems is formulated as an algebraic e… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 10 pages, 3 figures

  4. arXiv:2405.09546  [pdf, other]

    cs.CV

    BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

    Authors: Yunhao Ge, Yihe Tang, Jiashu Xu, Cem Gokmen, Chengshu Li, Wensi Ai, Benjamin Jose Martinez, Arman Aydin, Mona Anvari, Ayush K Chakravarthy, Hong-Xing Yu, Josiah Wong, Sanjana Srivastava, Sharon Lee, Shengxin Zha, Laurent Itti, Yunzhu Li, Roberto Martín-Martín, Miao Liu, Pengchuan Zhang, Ruohan Zhang, Li Fei-Fei, Jiajun Wu

    Abstract: The systematic evaluation and understanding of computer vision models under varying conditions require large amounts of data with comprehensive and customized labels, which real-world vision datasets rarely satisfy. While current synthetic data generators offer a promising alternative, particularly for embodied AI tasks, they often fall short for computer vision tasks due to low asset and renderin… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: CVPR 2024 (Highlight). Project website: https://behavior-vision-suite.github.io/

  5. arXiv:2404.19095  [pdf]

    cs.HC cs.IR cs.LG cs.SI

    Catalyzing Social Interactions in Mixed Reality using ML Recommendation Systems

    Authors: Sparsh Srivastava, Rohan Arora

    Abstract: We create an innovative mixed reality-first social recommendation model, utilizing features uniquely collected through mixed reality (MR) systems to promote social interaction, such as gaze recognition, proximity, noise level, congestion level, and conversational intensity. We further extend these models to include right-time features to deliver timely notifications. We measure performance metrics… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  6. arXiv:2404.15325  [pdf]

    cs.HC

    Quantifying Social Presence in Mixed Reality: A Contemporary Review of Techniques and Innovations

    Authors: Sparsh Srivastava

    Abstract: This literature review investigates the transformative potential of mixed reality (MR) technology, where we explore the intersection of contemporary technological advancements, modern deep learning recommendation systems, and social psychology frameworks. This interdisciplinary study informs the understanding of MR's role in improving social presence, catalyzing novel social interactions, and enha… ▽ More

    Submitted 26 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

  7. arXiv:2404.12415  [pdf]

    eess.IV cs.CV cs.LG

    Soil Fertility Prediction Using Combined USB-microscope Based Soil Image, Auxiliary Variables, and Portable X-Ray Fluorescence Spectrometry

    Authors: Shubhadip Dasgupta, Satwik Pate, Divya Rathore, L. G. Divyanth, Ayan Das, Anshuman Nayak, Subhadip Dey, Asim Biswas, David C. Weindorf, Bin Li, Sergio Henrique Godinho Silva, Bruno Teixeira Ribeiro, Sanjay Srivastava, Somsubhra Chakraborty

    Abstract: This study explored the application of portable X-ray fluorescence (PXRF) spectrometry and soil image analysis to rapidly assess soil fertility, focusing on critical parameters such as available B, organic carbon (OC), available Mn, available S, and the sulfur availability index (SAI). Analyzing 1,133 soil samples from various agro-climatic zones in Eastern India, the research combined color and t… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 37 pages, 10 figures; manuscript under peer review for publication in the journal 'Computers and Electronics in Agriculture'

  8. arXiv:2404.00846  [pdf, other]

    cs.CV cs.LG

    Transfer Learning with Point Transformers

    Authors: Kartik Gupta, Rahul Vippala, Sahima Srivastava

    Abstract: Point Transformers are near state-of-the-art models for classification, segmentation, and detection tasks on Point Cloud data. They utilize a self attention based mechanism to model large range spatial dependencies between multiple point sets. In this project we explore two things: classification performance of these attention based networks on ModelNet10 dataset and then, we use the trained model… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  9. arXiv:2404.00808  [pdf, other]

    cs.RO

    Using Explainable AI and Hierarchical Planning for Outreach with Robots

    Authors: Daksh Dobhal, Jayesh Nagpal, Rushang Karia, Pulkit Verma, Rashmeet Kaur Nayyar, Naman Shah, Siddharth Srivastava

    Abstract: Understanding how robots plan and execute tasks is crucial in today's world, where they are becoming more prevalent in our daily lives. However, teaching non-experts the complexities of robot planning can be challenging. This work presents an open-source platform that simplifies the process using a visual interface that completely abstracts the complex internals of hierarchical planning that robot… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  10. arXiv:2403.18327  [pdf, other]

    cs.CL cs.AI

    Can LLMs Converse Formally? Automatically Assessing LLMs in Translating and Interpreting Formal Specifications

    Authors: Rushang Karia, Daksh Dobhal, Daniel Bramblett, Pulkit Verma, Siddharth Srivastava

    Abstract: Stakeholders often describe system requirements using natural language which are then converted to formal syntax by a domain-expert leading to increased design costs. This paper assesses the capabilities of Large Language Models (LLMs) in converting between natural language descriptions and formal specifications. Existing work has evaluated the capabilities of LLMs in generating formal syntax such… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  11. arXiv:2403.09227  [pdf, other]

    cs.RO cs.AI

    BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation

    Authors: Chengshu Li, Ruohan Zhang, Josiah Wong, Cem Gokmen, Sanjana Srivastava, Roberto Martín-Martín, Chen Wang, Gabrael Levine, Wensi Ai, Benjamin Martinez, Hang Yin, Michael Lingelbach, Minjune Hwang, Ayano Hiranaka, Sujay Garlanka, Arman Aydin, Sharon Lee, Jiankai Sun, Mona Anvari, Manasi Sharma, Dhruva Bansal, Samuel Hunter, Kyu-Young Kim, Alan Lou, Caleb R Matthews, et al. (10 additional authors not shown)

    Abstract: We present BEHAVIOR-1K, a comprehensive simulation benchmark for human-centered robotics. BEHAVIOR-1K includes two components, guided and motivated by the results of an extensive survey on "what do you want robots to do for you?". The first is the definition of 1,000 everyday activities, grounded in 50 scenes (houses, gardens, restaurants, offices, etc.) with more than 9,000 objects annotated with… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: A preliminary version was published at 6th Conference on Robot Learning (CoRL 2022)

  12. arXiv:2403.05022  [pdf, other]

    cs.SE

    Effective Fault Localization using Probabilistic and Grouping Approach

    Authors: Saksham Sahai Srivastava, Arpita Dutta, Rajib Mall

    Abstract: Context: Fault localization (FL) is the key activity while debugging a program. Any improvement to this activity leads to significant improvement in total software development cost. There is an internal linkage between the program spectrum and test execution result. Conditional probability in statistics captures the probability of one event occurring in relation to one or more other events. Ob… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  13. arXiv:2403.03360  [pdf, other]

    cs.CR

    Bridge the Future: High-Performance Networks in Confidential VMs without Trusted I/O devices

    Authors: Mengyuan Li, Shashvat Srivastava, Mengjia Yan

    Abstract: Trusted I/O (TIO) is an appealing solution to improve I/O performance for confidential VMs (CVMs), with the potential to eliminate broad sources of I/O overhead. However, this paper emphasizes that not all types of I/O can derive substantial benefits from TIO, particularly network I/O. Given the obligatory use of encryption protocols for network traffic in CVM's threat model, TIO's approach of I/O… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  14. arXiv:2402.19450  [pdf, other]

    cs.AI cs.CL

    Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap

    Authors: Saurabh Srivastava, Annarose M B, Anto P V, Shashank Menon, Ajay Sukumar, Adwaith Samod T, Alan Philipose, Stevin Prince, Sooraj Thomas

    Abstract: We propose a framework for robust evaluation of reasoning capabilities of language models, using functional variants of benchmarks. Models that solve a reasoning test should exhibit no difference in performance over the static version of a problem compared to a snapshot of the functional variant. We have rewritten the relevant fragment of the MATH benchmark into its functional variant MATH(), with… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 37 pages, 10 figures

  15. arXiv:2402.11871  [pdf, other]

    cs.RO cs.AI

    From Reals to Logic and Back: Inventing Symbolic Vocabularies, Actions, and Models for Planning from Raw Data

    Authors: Naman Shah, Jayesh Nagpal, Pulkit Verma, Siddharth Srivastava

    Abstract: Hand-crafted, logic-based state and action representations have been widely used to overcome the intractable computational complexity of long-horizon robot planning problems, including task and motion planning problems. However, creating such representations requires experts with strong intuitions and detailed knowledge about the robot and the tasks it may need to accomplish in a given setting. Re… ▽ More

    Submitted 4 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  16. Asynchronous Distributed Coordinated Hybrid Precoding in Multi-cell mmWave Wireless Networks

    Authors: Meesam Jafri, Suraj Srivastava, Sunil Kumar, Aditya K. Jagannatham, Lajos Hanzo

    Abstract: Asynchronous distributed hybrid beamformers (ADBF) are conceived for minimizing the total transmit power subject to signal-to-interference-plus-noise ratio (SINR) constraints at the users. Our design requires only limited information exchange between the base stations (BSs) of the mmWave multi-cell coordinated (MCC) networks considered. To begin with, a semidefinite relaxation (SDR)-based fully-di… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Journal ref: IEEE Open Journal of Vehicular Technology, vol. 5, pp. 200-218, 2024

  17. arXiv:2402.08145  [pdf, other]

    cs.AI

    Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings

    Authors: Rushang Karia, Pulkit Verma, Alberto Speranzon, Siddharth Srivastava

    Abstract: This paper introduces a new approach for continual planning and model learning in relational, non-stationary stochastic environments. Such capabilities are essential for the deployment of sequential decision-making systems in the uncertain and constantly evolving real world. Working in such practical settings with unknown (and non-stationary) transition systems and changing tasks, the proposed fra… ▽ More

    Submitted 6 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: To appear at ICAPS-24

  18. arXiv:2402.04489  [pdf, other]

    cs.LG cs.CR cs.CY stat.ME

    De-amplifying Bias from Differential Privacy in Language Model Fine-tuning

    Authors: Sanjari Srivastava, Piotr Mardziel, Zhikhun Zhang, Archana Ahlawat, Anupam Datta, John C Mitchell

    Abstract: Fairness and privacy are two important values machine learning (ML) practitioners often seek to operationalize in models. Fairness aims to reduce model bias for social/demographic sub-groups. Privacy via differential privacy (DP) mechanisms, on the other hand, limits the impact of any individual's training data on the resulting model. The trade-offs between privacy and fairness goals of trustworth… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  19. arXiv:2311.07682  [pdf, other]

    cs.CL cs.AI cs.LG

    Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion

    Authors: Kerem Zaman, Leshem Choshen, Shashank Srivastava

    Abstract: Model fusion research aims to aggregate the knowledge of multiple models to enhance performance by combining their weights. In this work, we study the inverse, investigating whether and how can model fusion interfere and reduce unwanted knowledge. We delve into the effects of model fusion on the evolution of learned shortcuts, social biases, and memorization capabilities in fine-tuned language mod… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 16 pages, 9 figures, 6 tables

  20. arXiv:2311.07538  [pdf, other]

    cs.CL cs.LG

    Leveraging Multiple Teachers for Test-Time Adaptation of Language-Guided Classifiers

    Authors: Kangda Wei, Sayan Ghosh, Rakesh R. Menon, Shashank Srivastava

    Abstract: Recent approaches have explored language-guided classifiers capable of classifying examples from novel tasks when provided with task-specific natural language explanations, instructions or prompts (Sanh et al., 2022; R. Menon et al., 2022). While these classifiers can generalize in zero-shot settings, their task performance often varies substantially between different language explanations in unpr… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  21. arXiv:2311.06221  [pdf, other]

    cs.CL

    A Comparison of Lexicon-Based and ML-Based Sentiment Analysis: Are There Outlier Words?

    Authors: Siddhant Jaydeep Mahajani, Shashank Srivastava, Alan F. Smeaton

    Abstract: Lexicon-based approaches to sentiment analysis of text are based on each word or lexical entry having a pre-defined weight indicating its sentiment polarity. These are usually manually assigned, but their accuracy when compared against machine learning based approaches to computing sentiment is not known. It may be that there are lexical entries whose sentiment values cause a lexicon-based… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: 4 pages, to appear in Proceedings of the 31st Irish Conference on Artificial Intelligence and Cognitive Science. December 7th-8th, 2023

  22. arXiv:2311.05709  [pdf, ps, other]

    cs.CV cs.LG

    OmniVec: Learning robust representations with cross modal sharing

    Authors: Siddharth Srivastava, Gaurav Sharma

    Abstract: Majority of research in learning based methods has been towards designing and training networks for specific tasks. However, many of the learning based tasks, across modalities, share commonalities and could be potentially tackled in a joint framework. We present an approach in such direction, to learn multiple tasks, in multiple modalities, with a unified architecture. The proposed network is com… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted to WACV 2024

  23. arXiv:2311.04659  [pdf, other]

    cs.AI

    Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models

    Authors: Yiyuan Li, Rakesh R. Menon, Sayan Ghosh, Shashank Srivastava

    Abstract: Generalized quantifiers (e.g., few, most) are used to indicate the proportions predicates are satisfied (for example, some apples are red). One way to interpret quantifier semantics is to explicitly bind these satisfactions with percentage scopes (e.g., 30%-40% of apples are red). This approach can be helpful for tasks like logic formalization and surface-form quantitative reasoning (Gordon and Sc… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  24. arXiv:2311.02263  [pdf, other]

    cs.DS cs.IT

    List Decoding of Tanner and Expander Amplified Codes from Distance Certificates

    Authors: Fernando Granha Jeronimo, Shashank Srivastava, Madhur Tulsiani

    Abstract: We develop new list decoding algorithms for Tanner codes and distance-amplified codes based on bipartite spectral expanders. We show that proofs exhibiting lower bounds on the minimum distance of these codes can be used as certificates discoverable by relaxations in the Sum-of-Squares (SoS) semidefinite programming hierarchy. Combining these certificates with certain entropic proxies to ensure tha… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: FOCS 2023

  25. arXiv:2310.02251  [pdf, other]

    cs.CV cs.RO

    Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving

    Authors: Tushar Choudhary, Vikrant Dewangan, Shivam Chandhok, Shubham Priyadarshan, Anushka Jain, Arun K. Singh, Siddharth Srivastava, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Talk2BEV is a large vision-language model (LVLM) interface for bird's-eye view (BEV) maps in autonomous driving contexts. While existing perception systems for autonomous driving scenarios have largely focused on a pre-defined (closed) set of object categories and driving scenarios, Talk2BEV blends recent advances in general-purpose language and vision models with BEV-structured map representation… ▽ More

    Submitted 14 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Project page at https://llmbev.github.io/talk2bev/

  26. arXiv:2310.02107  [pdf, other]

    cs.CL

    Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance

    Authors: Saurabh Srivastava, Chengyue Huang, Weiguo Fan, Ziyu Yao

    Abstract: Large language models (LLMs) have revolutionized zero-shot task performance, mitigating the need for task-specific annotations while enhancing task generalizability. Despite its advancements, current methods using trigger phrases such as "Let's think step by step" remain limited. This study introduces PRomPTed, an approach that optimizes the zero-shot prompts for individual task instances followin… ▽ More

    Submitted 11 June, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted at ACL 2024 - Findings

  27. arXiv:2308.13035  [pdf]

    q-bio.QM cs.LG

    The intersection of video capsule endoscopy and artificial intelligence: addressing unique challenges using machine learning

    Authors: Shan Guleria, Benjamin Schwartz, Yash Sharma, Philip Fernandes, James Jablonski, Sodiq Adewole, Sanjana Srivastava, Fisher Rhoads, Michael Porter, Michelle Yeghyayan, Dylan Hyatt, Andrew Copland, Lubaina Ehsan, Donald Brown, Sana Syed

    Abstract: Introduction: Technical burdens and time-intensive review processes limit the practical utility of video capsule endoscopy (VCE). Artificial intelligence (AI) is poised to address these limitations, but the intersection of AI and VCE reveals challenges that must first be overcome. We identified five challenges to address. Challenge #1: VCE data are stochastic and contain significant artifact. Cha… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  28. arXiv:2308.04624  [pdf, other]

    cs.CL cs.AI

    Benchmarking LLM powered Chatbots: Methods and Metrics

    Authors: Debarag Banerjee, Pooja Singh, Arjun Avadhanam, Saksham Srivastava

    Abstract: Autonomous conversational agents, i.e. chatbots, are becoming an increasingly common mechanism for enterprises to provide support to customers and partners. In order to rate chatbots, especially ones powered by Generative AI tools like Large Language Models (LLMs) we need to be able to accurately assess their performance. This is where chatbot benchmarking becomes important. In this paper, we prop… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 8 pages, 14 figures

  29. arXiv:2306.10407  [pdf, other]

    cs.LG cs.AI physics.bio-ph q-bio.CB

    FP-IRL: Fokker-Planck-based Inverse Reinforcement Learning -- A Physics-Constrained Approach to Markov Decision Processes

    Authors: Chengyang Huang, Siddhartha Srivastava, Xun Huan, Krishna Garikipati

    Abstract: Inverse Reinforcement Learning (IRL) is a compelling technique for revealing the rationale underlying the behavior of autonomous agents. IRL seeks to estimate the unknown reward function of a Markov decision process (MDP) from observed agent trajectories. However, IRL needs a transition function, and most algorithms assume it is known or can be estimated in advance from data. It therefore becomes… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

  30. arXiv:2306.04806  [pdf, other]

    cs.AI

    Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version)

    Authors: Pulkit Verma, Rushang Karia, Siddharth Srivastava

    Abstract: It is essential for users to understand what their AI systems can and can't do in order to use them safely. However, the problem of enabling users to assess AI systems with sequential decision-making (SDM) capabilities is relatively understudied. This paper presents a new approach for modeling the capabilities of black-box AI systems that can plan and act, along with the possible effects and requi… ▽ More

    Submitted 28 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  31. arXiv:2305.13646  [pdf]

    cs.LG physics.ao-ph

    An Autoencoder-based Snow Drought Index

    Authors: Sinan Rasiya Koya, Kanak Kanti Kar, Shivendra Srivastava, Tsegaye Tadesse, Mark Svoboda, Tirthankar Roy

    Abstract: In several regions across the globe, snow has a significant impact on hydrology. The amounts of water that infiltrate the ground and flow as runoff are driven by the melting of snow. Therefore, it is crucial to study the magnitude and effect of snowmelt. Snow droughts, resulting from reduced snow storage, can drastically impact the water supplies in basins where snow predominates, such as in the w… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  32. arXiv:2305.13469  [pdf, other]

    cs.CL cs.AI

    MAILEX: Email Event and Argument Extraction

    Authors: Saurabh Srivastava, Gaurav Singh, Shou Matsumoto, Ali Raz, Paulo Costa, Joshua Poore, Ziyu Yao

    Abstract: In this work, we present the first dataset, MailEx, for performing event extraction from conversational email threads. To this end, we first proposed a new taxonomy covering 10 event types and 76 arguments in the email domain. Our final dataset includes 1.5K email threads and ~4K emails, which are annotated with totally ~8K event instances. To understand the task challenges, we conducted a series… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023

  33. arXiv:2305.12995  [pdf, other]

    cs.CL cs.AI cs.LG

    MaNtLE: Model-agnostic Natural Language Explainer

    Authors: Rakesh R. Menon, Kerem Zaman, Shashank Srivastava

    Abstract: Understanding the internal reasoning behind the predictions of machine learning systems is increasingly vital, given their rising adoption and acceptance. While previous approaches, such as LIME, generate algorithmic explanations by attributing importance to input features for individual examples, recent research indicates that practitioners prefer examining language explanations that explain sub-… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 17 pages, 13 figures, 6 tables

  34. arXiv:2305.12710  [pdf, other]

    cs.CL

    Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture

    Authors: Bingsheng Yao, Ishan Jindal, Lucian Popa, Yannis Katsis, Sayan Ghosh, Lihong He, Yuxuan Lu, Shashank Srivastava, Yunyao Li, James Hendler, Dakuo Wang

    Abstract: Real-world domain experts (e.g., doctors) rarely annotate only a decision label in their day-to-day workflow without providing explanations. Yet, existing low-resource learning techniques, such as Active Learning (AL), that aim to support human annotators mostly focus on the label while neglecting the natural language explanation of a data point. This work proposes a novel AL architecture to suppo… ▽ More

    Submitted 23 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 Findings

  35. arXiv:2305.10887  [pdf, ps, other]

    cs.IT eess.SP

    Robust Hybrid Transceiver Designs for Linear Decentralized Estimation in mmWave MIMO IoT Networks in the Face of Imperfect CSI

    Authors: Priyanka Maity, Kunwar Pritiraj Rajput, Suraj Srivastava, Naveen K. D. Venkategowda, Aditya K. Jagannatham, Lajos Hanzo

    Abstract: Hybrid transceivers are designed for linear decentralized estimation (LDE) in a mmWave multiple-input multiple-output (MIMO) IoT network (IoTNe). For a noiseless fusion center (FC), it is demonstrated that the MSE performance is determined by the number of RF chains used at each IoT node (IoTNo). Next, the minimum-MSE RF transmit precoders (TPCs) and receive combiner (RC) matrices are designed for… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 16 pages, 8 figures

  36. arXiv:2305.08195  [pdf, other]

    cs.CL

    Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing

    Authors: Hao Yan, Saurabh Srivastava, Yintao Tai, Sida I. Wang, Wen-tau Yih, Ziyu Yao

    Abstract: Interactive semantic parsing based on natural language (NL) feedback, where users provide feedback to correct the parser mistakes, has emerged as a more practical scenario than the traditional one-shot semantic parsing. However, prior work has heavily relied on human-annotated feedback data to train the interactive semantic parser, which is prohibitively expensive and not scalable. In this work, w… ▽ More

    Submitted 4 June, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023. 18 pages, 6 figures

  37. arXiv:2304.13958  [pdf, other]

    cs.CL cs.CY cs.SI

    Learning and Reasoning Multifaceted and Longitudinal Data for Poverty Estimates and Livelihood Capabilities of Lagged Regions in Rural India

    Authors: Atharva Kulkarni, Raya Das, Ravi S. Srivastava, Tanmoy Chakraborty

    Abstract: Poverty is a multifaceted phenomenon linked to the lack of capabilities of households to earn a sustainable livelihood, increasingly being assessed using multidimensional indicators. Its spatial pattern depends on social, economic, political, and regional variables. Artificial intelligence has shown immense scope in analyzing the complexities and nuances of poverty. The proposed project aims to ex… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: Accepted to IJCAI 2023 Main Conference (AI for Social Good Track)

  38. arXiv:2304.11251  [pdf, other]

    stat.ML cs.LG

    Machine Learning and the Future of Bayesian Computation

    Authors: Steven Winter, Trevor Campbell, Lizhen Lin, Sanvesh Srivastava, David B. Dunson

    Abstract: Bayesian models are a powerful tool for studying complex data, allowing the analyst to encode rich hierarchical dependencies and leverage prior information. Most importantly, they facilitate a complete characterization of uncertainty through the posterior distribution. Practical posterior computation is commonly performed via MCMC, which can be computationally infeasible for high dimensional model… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  39. arXiv:2302.06227  [pdf, other]

    eess.AS cs.SD

    Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

    Authors: Sudhanshu Srivastava, Ishika Gupta, Anusha Prakash, Jom Kuriakose, Hema A. Murthy

    Abstract: Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles along with fast training and synthesis while being computationally less intense. HTS performs well even in low-resource scenarios. The primary drawback is that the voice quality is poor compared to that of E2E systems. A hybrid approach combining HMM-based feature generation and neural-network-based HiFi-GAN… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 5 pages, 5 figures

  40. arXiv:2212.10276  [pdf, other]

    cs.AI

    Identifying and Manipulating the Personality Traits of Language Models

    Authors: Graham Caron, Shashank Srivastava

    Abstract: Psychology research has long explored aspects of human personality such as extroversion, agreeableness and emotional stability. Categorizations like the `Big Five' personality traits are commonly used to assess and diagnose personality types. In this work, we explore the question of whether the perceived personality in language models is exhibited consistently in their language generation. For exa… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  41. arXiv:2212.09175  [pdf, other]

    cs.LG

    Predicting Citi Bike Demand Evolution Using Dynamic Graphs

    Authors: Alexander Saff, Mayur Bhandary, Siddharth Srivastava

    Abstract: Bike sharing systems often suffer from poor capacity management as a result of variable demand. These bike sharing systems would benefit from models to predict demand in order to moderate the number of bikes stored at each station. In this paper, we attempt to apply a graph neural network model to predict bike demand in the New York City, Citi Bike dataset.

    Submitted 18 December, 2022; originally announced December 2022.

  42. arXiv:2212.09104  [pdf, other]

    cs.CL

    LaSQuE: Improved Zero-Shot Classification from Explanations Through Quantifier Modeling and Curriculum Learning

    Authors: Sayan Ghosh, Rakesh R Menon, Shashank Srivastava

    Abstract: A hallmark of human intelligence is the ability to learn new concepts purely from language. Several recent approaches have explored training machine learning models via natural language supervision. However, these approaches fall short in leveraging linguistic quantifiers (such as 'always' or 'rarely') and mimicking humans in compositionally learning complex tasks. Here, we present LaSQuE, a metho…

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: Work in progress

  43. arXiv:2212.02823  [pdf, other]

    cs.AI

    Hierarchical Decomposition and Analysis for Generalized Planning

    Authors: Siddharth Srivastava

    Abstract: This paper presents new methods for analyzing and evaluating generalized plans that can solve broad classes of related planning problems. Although synthesis and learning of generalized plans has been a longstanding goal in AI, it remains challenging due to fundamental gaps in methods for analyzing the scope and utility of a given generalized plan. This paper addresses these gaps by developing a ne…

    Submitted 26 June, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: Accepted for publication at JAIR

  44. arXiv:2211.11602  [pdf, other]

    cs.LG cs.HC cs.MA

    Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

    Authors: Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Jirka Lhotka, Timothy Lillicrap, Alistair Muldal, George Powell, Adam Santoro, Guy Scully, Sanjana Srivastava, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

    Abstract: An important goal in artificial intelligence is to create agents that can both interact naturally with humans and learn from their feedback. Here we demonstrate how to use reinforcement learning from human feedback (RLHF) to improve upon simulated, embodied agents trained to a base level of competency with imitation learning. First, we collected data of humans interacting with agents in a simulate…

    Submitted 21 November, 2022; originally announced November 2022.

  45. arXiv:2211.01338  [pdf, other]

    eess.AS cs.CL cs.MM cs.SD eess.IV

    Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

    Authors: Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, et al. (2 additional authors not shown)

    Abstract: Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video. This task becomes challenging when the source and target languages…

    Submitted 1 November, 2022; originally announced November 2022.

  46. arXiv:2210.12530  [pdf, other]

    cs.LG cs.AI cs.CL

    LMPriors: Pre-Trained Language Models as Task-Specific Priors

    Authors: Kristy Choi, Chris Cundy, Sanjari Srivastava, Stefano Ermon

    Abstract: Particularly in low-data regimes, an outstanding challenge in machine learning is developing principled techniques for augmenting our models with suitable priors. This is to encourage them to learn in ways that are compatible with our understanding of the world. But in contrast to generic priors such as shrinkage or sparsity, we draw inspiration from the recent successes of large-scale language mo…

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: First two authors contributed equally

  47. arXiv:2210.12302  [pdf, other]

    cs.CL

    What do Large Language Models Learn beyond Language?

    Authors: Avinash Madasu, Shashank Srivastava

    Abstract: Large language models (LMs) have rapidly become a mainstay in Natural Language Processing. These models are known to acquire rich linguistic knowledge from training on large amounts of text. In this paper, we investigate if pre-training on text also confers these models with helpful 'inductive biases' for non-linguistic reasoning. On a set of 19 diverse non-linguistic tasks involving quantitative…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted at the Findings of EMNLP 2022

  48. arXiv:2210.01955  [pdf, other]

    cs.LG cs.AI

    Learning Dynamic Abstract Representations for Sample-Efficient Reinforcement Learning

    Authors: Mehdi Dadvar, Rashmeet Kaur Nayyar, Siddharth Srivastava

    Abstract: In many real-world problems, the learning agent needs to learn a problem's abstractions and solution simultaneously. However, most such abstractions need to be designed and refined by hand for different problems and domains of application. This paper presents a novel top-down approach for constructing state abstractions while carrying out reinforcement learning. Starting with state variables and a…

    Submitted 8 December, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

  49. arXiv:2210.00068  [pdf, other]

    cs.LG cs.AI cs.RO

    Multi-Task Option Learning and Discovery for Stochastic Path Planning

    Authors: Naman Shah, Siddharth Srivastava

    Abstract: This paper addresses the problem of reliably and efficiently solving broad classes of long-horizon stochastic path planning problems. Starting with a vanilla RL formulation with a stochastic dynamics simulator and an occupancy matrix of the environment, our approach computes useful options with policies as well as high-level paths that compose the discovered options. Our main contributions are (1)…

    Submitted 8 December, 2022; v1 submitted 30 September, 2022; originally announced October 2022.

  50. arXiv:2209.06970  [pdf, other]

    cs.CV cs.GR cs.LG

    Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

    Authors: Chen Henry Wu, Saman Motamed, Shaunak Srivastava, Fernando De la Torre

    Abstract: Generative models (e.g., GANs, diffusion models) learn the underlying data distribution in an unsupervised manner. However, many applications of interest require sampling from a particular region of the output space or sampling evenly over a range of characteristics. For efficient sampling in these scenarios, we propose Generative Visual Prompt (PromptGen), a framework for distributional control o…

    Submitted 17 October, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022