
Showing 1–50 of 283 results for author: Gupta, P

Searching in archive cs.
  1. arXiv:2406.19580  [pdf, other]

    cs.AR cs.LG

    FRED: Flexible REduction-Distribution Interconnect and Communication Implementation for Wafer-Scale Distributed Training of DNN Models

    Authors: Saeed Rashidi, William Won, Sudarshan Srinivasan, Puneet Gupta, Tushar Krishna

    Abstract: Distributed Deep Neural Network (DNN) training is a technique to reduce the training overhead by distributing the training tasks into multiple accelerators, according to a parallelization strategy. However, high-performance compute and interconnects are needed for maximum speed-up and linear scaling of the system. Wafer-scale systems are a promising technology that allows for tightly integrating h…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.19545  [pdf, other]

    cs.CL cs.AI

    Leveraging Machine-Generated Rationales to Facilitate Social Meaning Detection in Conversations

    Authors: Ritam Dutt, Zhen Wu, Kelly Shi, Divyanshu Sheth, Prakhar Gupta, Carolyn Penstein Rose

    Abstract: We present a generalizable classification approach that leverages Large Language Models (LLMs) to facilitate the detection of implicitly encoded social meaning in conversations. We design a multi-faceted prompt to extract a textual explanation of the reasoning that connects visible cues to underlying social meanings. These extracted explanations or rationales serve as augmentations to the conversa…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: To appear at The Proceedings of the Association for Computational Linguistics, 2024

  3. arXiv:2406.14236  [pdf, other]

    quant-ph cs.DC

    NAC-QFL: Noise Aware Clustered Quantum Federated Learning

    Authors: Himanshu Sahu, Hari Prabhat Gupta

    Abstract: Recent advancements in quantum computing, alongside successful deployments of quantum communication, hold promises for revolutionizing mobile networks. While Quantum Machine Learning (QML) presents opportunities, it contends with challenges like noise in quantum devices and scalability. Furthermore, the high cost of quantum communication constrains the practical application of QML in real-world sc…

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2405.17469  [pdf, other]

    cs.LG cs.AI cs.CY cs.PF

    A Dataset for Research on Water Sustainability

    Authors: Pranjol Sen Gupta, Md Rajib Hossen, Pengfei Li, Shaolei Ren, Mohammad A. Islam

    Abstract: Freshwater scarcity is a global problem that requires collective efforts across all industry sectors. Nevertheless, a lack of access to operational water footprint data bars many applications from exploring optimization opportunities hidden within the temporal and spatial variations. To break this barrier into research in water sustainability, we build a dataset for operation direct water usage in…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted by ACM e-Energy 2024

  5. arXiv:2405.13009  [pdf, other]

    cs.CL cs.AI

    METAREFLECTION: Learning Instructions for Language Agents using Past Reflections

    Authors: Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Sherry Shi, Gustavo Soares

    Abstract: Despite the popularity of Large Language Models (LLMs), crafting specific prompts for LLMs to perform particular tasks remains challenging. Users often engage in multiple conversational turns with an LLM-based agent to accomplish their intended task. Recent studies have demonstrated that linguistic feedback, in the form of self-reflections generated by the model, can work as reinforcement during t…

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2405.11651  [pdf, other]

    cs.LG

    Movie Revenue Prediction using Machine Learning Models

    Authors: Vikranth Udandarao, Pratyush Gupta

    Abstract: In the contemporary film industry, accurately predicting a movie's earnings is paramount for maximizing profitability. This project aims to develop a machine learning model for predicting movie earnings based on input features like the movie name, the MPAA rating of the movie, the genre of the movie, the year of release of the movie, the IMDb Rating, the votes by the watchers, the director, the wr…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: for associated code base, see https://github.com/Vikranth3140/Movie-Revenue-Prediction

  7. arXiv:2405.01616  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra, et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu…

    Submitted 2 May, 2024; originally announced May 2024.

  8. arXiv:2404.16556  [pdf, other]

    cs.CV

    Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models

    Authors: Parul Gupta, Munawar Hayat, Abhinav Dhall, Thanh-Toan Do

    Abstract: Few-shot image synthesis entails generating diverse and realistic images of novel categories using only a few example images. While multiple recent efforts in this direction have achieved impressive results, the existing approaches are dependent only upon the few novel samples available at test time in order to generate new images, which restricts the diversity of the generated images. To overcome…

    Submitted 28 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  9. arXiv:2404.05859  [pdf, other]

    cs.CG math.AT

    Box Filtration

    Authors: Enrique Alvarado, Prashant Gupta, Bala Krishnamoorthy

    Abstract: We define a new framework that unifies the filtration and mapper approaches from TDA, and present efficient algorithms to compute it. Termed the box filtration of a PCD, we grow boxes (hyperrectangles) that are not necessarily centered at each point (in place of balls centered at points). We grow the boxes non-uniformly and asymmetrically in different dimensions based on the distribution of points…

    Submitted 11 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: 17 figures

    MSC Class: 55N31; 62R40;

  10. arXiv:2404.01746  [pdf, other]

    cs.RO cs.AI cs.LG

    Towards Scalable & Efficient Interaction-Aware Planning in Autonomous Vehicles using Knowledge Distillation

    Authors: Piyush Gupta, David Isele, Sangjae Bae

    Abstract: Real-world driving involves intricate interactions among vehicles navigating through dense traffic scenarios. Recent research focuses on enhancing the interaction awareness of autonomous vehicles to leverage these interactions in decision-making. These interaction-aware planners rely on neural-network-based prediction models to capture inter-vehicle interactions, aiming to integrate these predicti…

    Submitted 2 April, 2024; originally announced April 2024.

  11. arXiv:2403.08848  [pdf, other]

    eess.IV cs.CV

    FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders

    Authors: Soumen Basu, Mayuna Gupta, Chetan Madan, Pankaj Gupta, Chetan Arora

    Abstract: In recent years, automated Gallbladder Cancer (GBC) detection has gained the attention of researchers. Current state-of-the-art (SOTA) methodologies relying on ultrasound sonography (US) images exhibit limited generalization, emphasizing the need for transformative approaches. We observe that individual US frames may lack sufficient information to capture disease manifestation. This study advocate…

    Submitted 29 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: To Appear at CVPR 2024

  12. arXiv:2402.12566  [pdf, other]

    cs.CL cs.LG

    GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence

    Authors: Kundan Krishna, Sanjana Ramprasad, Prakhar Gupta, Byron C. Wallace, Zachary C. Lipton, Jeffrey P. Bigham

    Abstract: LLMs can generate factually incorrect statements even when provided access to reference documents. Such errors can be dangerous in high-stakes applications (e.g., document-grounded QA for healthcare or finance). We present GenAudit -- a tool intended to assist fact-checking LLM responses for document-grounded tasks. GenAudit suggests edits to the LLM response by revising or removing claims that ar…

    Submitted 16 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Code and models available at https://genaudit.org

  13. arXiv:2401.15906  [pdf, other]

    cs.CR cs.IT stat.AP

    Mean Estimation with User-Level Privacy for Spatio-Temporal IoT Datasets

    Authors: V. Arvind Rameshwar, Anshoo Tandon, Prajjwal Gupta, Aditya Vikram Singh, Novoneel Chakraborty, Abhay Sharma

    Abstract: This paper considers the problem of the private release of sample means of speed values from traffic datasets. Our key contribution is the development of user-level differentially private algorithms that incorporate carefully chosen parameter values to ensure low estimation errors on real-world datasets, while ensuring privacy. We test our algorithms on ITMS (Intelligent Traffic Management System)…

    Submitted 25 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 figures, submitted to the ACM for possible publication

  14. arXiv:2401.14283  [pdf, other]

    stat.ML cs.LG

    Information Leakage Detection through Approximate Bayes-optimal Prediction

    Authors: Pritha Gupta, Marcel Wever, Eyke Hüllermeier

    Abstract: In today's data-driven world, the proliferation of publicly available information intensifies the challenge of information leakage (IL), raising security concerns. IL involves unintentionally exposing secret (sensitive) information to unauthorized parties via systems' observable information. Conventional statistical approaches, which estimate mutual information (MI) between observable and secret i…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Under submission in JMLR

    MSC Class: 94A15; 62H30; 94A60 ACM Class: I.5.1; G.3; E.3

  15. arXiv:2401.01387  [pdf, other]

    cs.CV

    DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition

    Authors: Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do

    Abstract: The task of Visual Relationship Recognition (VRR) aims to identify relationships between two interacting objects in an image and is particularly challenging due to the widely-spread and highly imbalanced distribution of <subject, relation, object> triplets. To overcome the resultant performance bias in existing VRR approaches, we introduce DiffAugment -- a method which first augments the tail clas…

    Submitted 1 March, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  16. arXiv:2312.09626  [pdf]

    cs.HC cs.CY

    Exploring Gender Disparities in Bumble's Match Recommendations

    Authors: Ritvik Aryan Kalra, Pratham Gupta, Ben Varghese, Nimmi Rangaswamy

    Abstract: We study bias and discrimination in the context of Bumble, an online dating platform in India. Drawing on research in AI fairness and inclusion studies we analyze algorithmic bias and their propensity to reproduce bias. We conducted an experiment to identify and address the presence of bias in the matching algorithms Bumble pushes to its users in the form of profiles for potential dates in the rea…

    Submitted 15 December, 2023; originally announced December 2023.

  17. arXiv:2312.07979  [pdf]

    cs.CL cs.LG

    SLJP: Semantic Extraction based Legal Judgment Prediction

    Authors: Prameela Madambakam, Shathanaa Rajmohan, Himangshu Sharma, Tummepalli Anka Chandrahas Purushotham Gupta

    Abstract: Legal Judgment Prediction (LJP) is a judicial assistance system that recommends the legal components such as applicable statutes, prison term and penalty term by analyzing the given input case document. Indian legal system is in the need of technical assistance such as artificial intelligence to solve the crores of pending cases in various courts for years and its being increased day to day. Most o…

    Submitted 13 December, 2023; originally announced December 2023.

  18. arXiv:2312.04613  [pdf, other]

    physics.ed-ph cs.LG

    Testing LLM performance on the Physics GRE: some observations

    Authors: Pranav Gupta

    Abstract: With the recent developments in large language models (LLMs) and their widespread availability through open source models and/or low-cost APIs, several exciting products and applications are emerging, many of which are in the field of STEM educational technology for K-12 and university students. There is a need to evaluate these powerful language models on several benchmarks, in order to understan…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 4 pages

  19. arXiv:2312.02467  [pdf, other]

    cs.RO

    Object Importance Estimation using Counterfactual Reasoning for Intelligent Driving

    Authors: Pranay Gupta, Abhijat Biswas, Henny Admoni, David Held

    Abstract: The ability to identify important objects in a complex and dynamic driving environment is essential for autonomous driving agents to make safe and efficient driving decisions. It also helps assistive driving systems decide when to alert drivers. We tackle object importance estimation in a data-driven fashion and introduce HOIST - Human-annotated Object Importance in Simulated Traffic. HOIST contai…

    Submitted 4 December, 2023; originally announced December 2023.

  20. arXiv:2312.00292  [pdf, other]

    cs.CL

    SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

    Authors: Anku Rani, Dwip Dalal, Shreya Gautam, Pankaj Gupta, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: Deception is the intentional practice of twisting information. It is a nuanced societal practice deeply intertwined with human societal evolution, characterized by a multitude of facets. This research explores the problem of deception through the lens of psychology, employing a framework that categorizes deception into three forms: lies of omission, lies of commission, and lies of influence. The p…

    Submitted 30 November, 2023; originally announced December 2023.

  21. arXiv:2311.18259  [pdf, other]

    cs.CV cs.AI

    Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

    Authors: Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, et al. (76 additional authors not shown)

    Abstract: We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). 740 participants from 13 cities worldwide performed these activities in 123 different natural scene contexts, yielding long-form captures from…

    Submitted 29 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: updated baseline results and dataset statistics to match the released v2 data; added table to appendix comparing stats of Ego-Exo4D alongside other datasets

  22. arXiv:2311.06381  [pdf, other]

    cs.HC

    Optimal Fidelity Selection for Improved Performance in Human-in-the-Loop Queues for Underwater Search

    Authors: Piyush Gupta, Vaibhav Srivastava

    Abstract: In the context of human-supervised autonomy, we study the problem of optimal fidelity selection for a human operator performing an underwater visual search task. Human performance depends on various cognitive factors such as workload and fatigue. We perform human experiments in which participants perform two tasks simultaneously: a primary task, which is subject to evaluation, and a secondary task…

    Submitted 10 November, 2023; originally announced November 2023.

  23. arXiv:2311.03486  [pdf, other]

    cs.HC

    Fostering Human Learning in Sequential Decision-Making: Understanding the Role of Evaluative Feedback

    Authors: Piyush Gupta, Subir Biswas, Vaibhav Srivastava

    Abstract: Cognitive rehabilitation, STEM (science, technology, engineering, and math) skill acquisition, and coaching games such as chess often require tutoring decision-making strategies. The advancement of AI-driven tutoring systems for facilitating human learning requires an understanding of the impact of evaluative feedback on human decision-making and skill development. To this end, we conduct human ex…

    Submitted 4 May, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  24. arXiv:2310.17228  [pdf, other]

    cs.AI cs.CL cs.SE

    TST$^\mathrm{R}$: Target Similarity Tuning Meets the Real World

    Authors: Anirudh Khatry, Sumit Gulwani, Priyanshu Gupta, Vu Le, Ananya Singha, Mukul Singh, Gust Verbruggen

    Abstract: Target similarity tuning (TST) is a method of selecting relevant examples in natural language (NL) to code generation through large language models (LLMs) to improve performance. Its goal is to adapt a sentence embedding model to have the similarity between two NL inputs match the similarity between their associated code outputs. In this paper, we propose different methods to apply and improve TST…

    Submitted 28 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted for EMNLP-Findings, 2023

  25. arXiv:2310.10745  [pdf, other]

    cs.LG math.DS physics.flu-dyn stat.ML

    Mori-Zwanzig latent space Koopman closure for nonlinear autoencoder

    Authors: Priyam Gupta, Peter J. Schmid, Denis Sipp, Taraneh Sayadi, Georgios Rigas

    Abstract: The Koopman operator presents an attractive approach to achieve global linearization of nonlinear systems, making it a valuable method for simplifying the understanding of complex dynamics. While data-driven methodologies have exhibited promise in approximating finite Koopman operators, they grapple with various challenges, such as the judicious selection of observables, dimensionality reduction,…

    Submitted 16 April, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 22 pages, 11 figures

  26. arXiv:2310.07940  [pdf, other]

    cs.LG

    Cost-Driven Hardware-Software Co-Optimization of Machine Learning Pipelines

    Authors: Ravit Sharma, Wojciech Romaszkan, Feiqian Zhu, Puneet Gupta, Ankur Mehta

    Abstract: Researchers have long touted a vision of the future enabled by a proliferation of internet-of-things devices, including smart sensors, homes, and cities. Increasingly, embedding intelligence in such devices involves the use of deep neural networks. However, their storage and processing requirements make them prohibitive for cheap, off-the-shelf platforms. Overcoming those requirements is necessary…

    Submitted 19 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  27. arXiv:2310.05380  [pdf, other]

    cs.IR cs.LG

    Augmented Embeddings for Custom Retrievals

    Authors: Anirudh Khatry, Yasharth Bajpai, Priyanshu Gupta, Sumit Gulwani, Ashish Tiwari

    Abstract: Information retrieval involves selecting artifacts from a corpus that are most relevant to a given search query. The flavor of retrieval typically used in classical applications can be termed as homogeneous and relaxed, where queries and corpus elements are both natural language (NL) utterances (homogeneous) and the goal is to pick most relevant elements from the corpus in the Top-K, where K is la…

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 14 pages

    ACM Class: I.2.6

  28. arXiv:2309.05261  [pdf, other]

    cs.CV

    Gall Bladder Cancer Detection from US Images with Only Image Level Labels

    Authors: Soumen Basu, Ashish Papanai, Mayank Gupta, Pankaj Gupta, Chetan Arora

    Abstract: Automated detection of Gallbladder Cancer (GBC) from Ultrasound (US) images is an important problem, which has drawn increased interest from researchers. However, most of these works use difficult-to-acquire information such as bounding box annotations or additional US videos. In this paper, we focus on GBC detection using only image-level labels. Such annotation is usually available based on the…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted at MICCAI 2023

  29. arXiv:2308.13561  [pdf, other]

    cs.HC cs.CV

    Project Aria: A New Tool for Egocentric Multi-Modal AI Research

    Authors: Jakob Engel, Kiran Somasundaram, Michael Goesele, Albert Sun, Alexander Gamino, Andrew Turner, Arjang Talattof, Arnie Yuan, Bilal Souti, Brighid Meredith, Cheng Peng, Chris Sweeney, Cole Wilson, Dan Barnes, Daniel DeTone, David Caruso, Derek Valleroy, Dinesh Ginjupalli, Duncan Frost, Edward Miller, Elias Mueggler, Evgeniy Oleinik, Fan Zhang, Guruprasad Somasundaram, Gustavo Solaira, et al. (49 additional authors not shown)

    Abstract: Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, mul…

    Submitted 1 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  30. arXiv:2308.13093  [pdf, other]

    cs.CV

    EgoBlur: Responsible Innovation in Aria

    Authors: Nikhil Raina, Guruprasad Somasundaram, Kang Zheng, Sagar Miglani, Steve Saarinen, Jeff Meissner, Mark Schwesinger, Luis Pesqueira, Ishita Prasad, Edward Miller, Prince Gupta, Mingfei Yan, Richard Newcombe, Carl Ren, Omkar M Parkhi

    Abstract: Project Aria pushes the frontiers of Egocentric AI with large-scale real-world data collection using purposely designed glasses with privacy first approach. To protect the privacy of bystanders being recorded by the glasses, our research protocols are designed to ensure recorded video is processed by an AI anonymization model that removes bystander faces and vehicle license plates. Detected face a…

    Submitted 6 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  31. arXiv:2308.11526  [pdf, other]

    cs.CL cs.AI cs.SE

    Learning Representations on Logs for AIOps

    Authors: Pranjal Gupta, Harshit Kumar, Debanjana Kar, Karan Bhukar, Pooja Aggarwal, Prateeti Mohapatra

    Abstract: AI for IT Operations (AIOps) is a powerful platform that Site Reliability Engineers (SREs) use to automate and streamline operational workflows with minimal human intervention. Automated log analysis is a critical task in AIOps as it provides key insights for SREs to identify and address ongoing faults. Tasks such as log format detection, log classification, and log parsing are key components of a…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 11 pages, 2023 IEEE 16th International Conference on Cloud Computing (CLOUD)

  32. arXiv:2308.08577  [pdf, other]

    cs.SD cs.CL cs.HC eess.AS

    AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis

    Authors: Hrishikesh Viswanath, Aneesh Bhattacharya, Pascal Jutras-Dubé, Prerit Gupta, Mridu Prashanth, Yashvardhan Khaitan, Aniket Bera

    Abstract: Affect is an emotional characteristic encompassing valence, arousal, and intensity, and is a crucial attribute for enabling authentic conversations. While existing text-to-speech (TTS) and speech-to-speech systems rely on strength embedding vectors and global style tokens to capture emotions, these models represent emotions as a component of style or represent them in discrete categories. We propo…

    Submitted 16 August, 2023; originally announced August 2023.

  33. arXiv:2308.06069  [pdf, other]

    cs.SE cs.LG cs.LO eess.SY

    Safeguarding Learning-based Control for Smart Energy Systems with Sampling Specifications

    Authors: Chih-Hong Cheng, Venkatesh Prasad Venkataramanan, Pragya Kirti Gupta, Yun-Fei Hsu, Simon Burton

    Abstract: We study challenges using reinforcement learning in controlling energy systems, where apart from performance requirements, one has additional safety requirements such as avoiding blackouts. We detail how these safety requirements in real-time temporal logic can be strengthened via discretization into linear temporal logic (LTL), such that the satisfaction of the LTL formulae implies the satisfacti…

    Submitted 11 August, 2023; originally announced August 2023.

  34. arXiv:2307.09882  [pdf, other]

    cs.LG cs.AI

    Adversarial Likelihood Estimation With One-Way Flows

    Authors: Omri Ben-Dov, Pravir Singh Gupta, Victoria Abrevaya, Michael J. Black, Partha Ghosh

    Abstract: Generative Adversarial Networks (GANs) can produce high-quality samples, but do not provide an estimate of the probability density around the samples. However, it has been noted that maximizing the log-likelihood within an energy-based setting can lead to an adversarial framework where the discriminator provides unnormalized density (often called energy). We further develop this perspective, incor…

    Submitted 2 October, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

  35. arXiv:2307.08652  [pdf, other]

    cs.GR

    Search Me Knot, Render Me Knot: Embedding Search and Differentiable Rendering of Knots in 3D

    Authors: Aalok Gangopadhyay, Paras Gupta, Tarun Sharma, Prajwal Singh, Shanmuganathan Raman

    Abstract: We introduce the problem of knot-based inverse perceptual art. Given multiple target images and their corresponding viewing configurations, the objective is to find a 3D knot-based tubular structure whose appearance resembles the target images when viewed from the specified viewing configurations. To solve this problem, we first design a differentiable rendering algorithm for rendering tubular kno…

    Submitted 19 August, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  36. arXiv:2307.06951   

    cs.AI cs.LG

    AI For Global Climate Cooperation 2023 Competition Proceedings

    Authors: Yoshua Bengio, Prateek Gupta, Lu Li, Soham Phade, Sunil Srinivasa, Andrew Williams, Tianyu Zhang, Yang Zhang, Stephan Zheng

    Abstract: The international community must collaborate to mitigate climate change and sustain economic growth. However, collaboration is hard to achieve, partly because no global authority can ensure compliance with international climate agreements. Combining AI with climate-economic simulations offers a promising solution to design international frameworks, including negotiation protocols and climate agree…

    Submitted 10 July, 2023; originally announced July 2023.

  37. A Dataset of Inertial Measurement Units for Handwritten English Alphabets

    Authors: Hari Prabhat Gupta, Rahul Mishra

    Abstract: This paper presents an end-to-end methodology for collecting datasets to recognize handwritten English alphabets by utilizing Inertial Measurement Units (IMUs) and leveraging the diversity present in the Indian writing style. The IMUs are utilized to capture the dynamic movement patterns associated with handwriting, enabling more accurate recognition of alphabets. The Indian context introduces var…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 10 pages, 12 figures

  38. arXiv:2306.16503  [pdf, other]

    cs.LG cs.AI

    SARC: Soft Actor Retrospective Critic

    Authors: Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence.…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at RLDM 2022

  39. arXiv:2306.04207  [pdf, ps, other]

    cs.DC

    Resource Aware Clustering for Tackling the Heterogeneity of Participants in Federated Learning

    Authors: Rahul Mishra, Hari Prabhat Gupta, Garvit Banga

    Abstract: Federated Learning is a training framework that enables multiple participants to collaboratively train a shared model while preserving data privacy and minimizing communication overhead. The heterogeneity of devices and networking resources of the participants delay the training and aggregation in federated learning. This paper proposes a federated learning approach to manoeuvre the heterogeneity…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 13 pages, 4 figures

  40. arXiv:2305.16606  [pdf, other]

    cs.IR

    Mitigating Exploitation Bias in Learning to Rank with an Uncertainty-aware Empirical Bayes Approach

    Authors: Tao Yang, Cuize Han, Chen Luo, Parth Gupta, Jeff M. Phillips, Qingyao Ai

    Abstract: Ranking is at the core of many artificial intelligence (AI) applications, including search engines, recommender systems, etc. Modern ranking systems are often constructed with learning-to-rank (LTR) models built from user behavior signals. While previous studies have demonstrated the effectiveness of using user behavior signals (e.g., clicks) as both features and labels of LTR algorithms, we argue…

    Submitted 25 May, 2023; originally announced May 2023.

  41. arXiv:2305.14296  [pdf, other]

    cs.CL cs.LG

    USB: A Unified Summarization Benchmark Across Tasks and Domains

    Authors: Kundan Krishna, Prakhar Gupta, Sanjana Ramprasad, Byron C. Wallace, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: While the NLP community has produced numerous summarization benchmarks, none provide the rich annotations required to simultaneously address many important problems related to control and reliability. We introduce a Wikipedia-derived benchmark, complemented by a rich set of crowd-sourced annotations, that supports $8$ interrelated tasks: (i) extractive summarization; (ii) abstractive summarization…

    Submitted 4 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP Findings 2023 Camera Ready

  42. arXiv:2305.14129  [pdf, other]

    cs.SE cs.LG

    GrACE: Generation using Associated Code Edits

    Authors: Priyanshu Gupta, Avishree Khare, Yasharth Bajpai, Saikat Chakraborty, Sumit Gulwani, Aditya Kanade, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

    Abstract: Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large langua…

    Submitted 20 September, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  43. arXiv:2305.05255  [pdf, other]

    cs.HC

    Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit

    Authors: Shreya Ghosh, Zhixi Cai, Parul Gupta, Garima Sharma, Abhinav Dhall, Munawar Hayat, Tom Gedeon

    Abstract: Automatic group emotion recognition plays an important role in understanding complex human-human interaction. This paper introduces Emolysis, a standalone open-source toolkit for real-time multimodal group emotion recognition and visualization. Given any input video, Emolysis processes nearly real-time synchronized multimodal input and maps it to group level emotion, valence and arousal. Addition…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Both Shreya Ghosh and Zhixi Cai contributed equally to this research

  44. arXiv:2304.04125  [pdf, ps, other]

    cs.LG cs.AR

    Training Neural Networks for Execution on Approximate Hardware

    Authors: Tianmu Li, Shurui Li, Puneet Gupta

    Abstract: Approximate computing methods have shown great potential for deep learning. Due to the reduced hardware costs, these methods are especially suitable for inference tasks on battery-operated devices that are constrained by their power budget. However, approximate computing hasn't reached its full potential due to the lack of work on training methods. In this work, we discuss training methods for app…

    Submitted 8 April, 2023; originally announced April 2023.

  45. arXiv:2303.17651  [pdf, other]

    cs.CL cs.AI cs.LG

    Self-Refine: Iterative Refinement with Self-Feedback

    Authors: Aman Madaan, Niket Tandon, Prakhar Gupta, Skyler Hallinan, Luyu Gao, Sarah Wiegreffe, Uri Alon, Nouha Dziri, Shrimai Prabhumoye, Yiming Yang, Shashank Gupta, Bodhisattwa Prasad Majumder, Katherine Hermann, Sean Welleck, Amir Yazdanbakhsh, Peter Clark

    Abstract: Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an initial output using an LLM; then, the same LLM provides feedback for its output and uses it…

    Submitted 25 May, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Code, data, and demo at https://selfrefine.info/

  46. arXiv:2303.16088  [pdf, other]

    physics.comp-ph cond-mat.stat-mech cs.LG

    GNN-Assisted Phase Space Integration with Application to Atomistics

    Authors: Shashank Saxena, Jan-Hendrik Bastek, Miguel Spinola, Prateek Gupta, Dennis M. Kochmann

    Abstract: Overcoming the time scale limitations of atomistics can be achieved by switching from the state-space representation of Molecular Dynamics (MD) to a statistical-mechanics-based representation in phase space, where approximations such as maximum-entropy or Gaussian phase packets (GPP) evolve the atomistic ensemble in a time-coarsened fashion. In practice, this requires the computation of expensive…

    Submitted 20 March, 2023; originally announced March 2023.

  47. arXiv:2303.00344  [pdf, other]

    cs.CL

    Inline Citation Classification using Peripheral Context and Time-evolving Augmentation

    Authors: Priyanshi Gupta, Yash Kumar Atri, Apurva Nagvenkar, Sourish Dasgupta, Tanmoy Chakraborty

    Abstract: Citation plays a pivotal role in determining the associations among research articles. It portrays essential information in indicative, supportive, or contrastive studies. The task of inline citation classification aids in extrapolating these relationships; however, existing studies are still immature and demand further scrutiny. Current datasets and methods used for inline citation classification…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: accepted to PAKDD 2023

  48. arXiv:2302.14635  [pdf, other]

    cs.CL

    H-AES: Towards Automated Essay Scoring for Hindi

    Authors: Shubhankar Singh, Anirudh Pupneja, Shivaansh Mital, Cheril Shah, Manish Bawkar, Lakshman Prasad Gupta, Ajit Kumar, Yaman Kumar, Rushali Gupta, Rajiv Ratn Shah

    Abstract: The use of Natural Language Processing (NLP) for Automated Essay Scoring (AES) has been well explored in the English language, with benchmark models exhibiting performance comparable to human scorers. However, AES in Hindi and other low-resource languages remains unexplored. In this study, we reproduce and compare state-of-the-art methods for AES in the Hindi domain. We employ classical feature-ba…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 Tables, To be published as a part of Proceedings of the 37th AAAI Conference on Artificial Intelligence

  49. arXiv:2302.14601  [pdf, other]

    cs.SE eess.SY

    SAFR-AV: Safety Analysis of Autonomous Vehicles using Real World Data -- An end-to-end solution for real world data driven scenario-based testing for pre-certification of AV stacks

    Authors: Sagar Pathrudkar, Saadhana Venkataraman, Deepika Kanade, Aswin Ajayan, Palash Gupta, Shehzaman Khatib, Vijaya Sarathi Indla, Saikat Mukherjee

    Abstract: One of the major impediments in deployment of Autonomous Driving Systems (ADS) is their safety and reliability. The primary reason for the complexity of testing ADS is that it operates in an open world characterized by its non-deterministic, high-dimensional and non-stationary nature where the actions of other actors in the environment are uncontrollable from the ADS's perspective. This leads to a…

    Submitted 27 February, 2023; originally announced February 2023.

  50. arXiv:2302.00871  [pdf, other]

    cs.CL

    Using In-Context Learning to Improve Dialogue Safety

    Authors: Nicholas Meade, Spandana Gella, Devamanyu Hazarika, Prakhar Gupta, Di Jin, Siva Reddy, Yang Liu, Dilek Hakkani-Tür

    Abstract: While large neural-based conversational models have become increasingly proficient dialogue agents, recent work has highlighted safety issues with these systems. For example, these systems can be goaded into generating toxic content, which often perpetuates social biases or stereotypes. We investigate a retrieval-based method for reducing bias and toxicity in responses from chatbots. It uses in-co…

    Submitted 22 October, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Findings of EMNLP 2023