Skip to main content

Showing 1–50 of 365 results for author: Agarwal, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18893  [pdf, other

    cs.CV

    AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models

    Authors: Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan

    Abstract: We consider the problem of customizing text-to-image diffusion models with user-supplied reference images. Given new prompts, the existing methods can capture the key concept from the reference images but fail to align the generated image with the prompt. In this work, we seek to address this key issue by proposing new methods that can easily be used in conjunction with existing customization meth… ▽ More

    Submitted 27 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 10 pages, 9 figures

  2. arXiv:2406.13473  [pdf, other

    cs.CV

    Snowy Scenes,Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions

    Authors: Shivank Garg, Abhishek Baghel, Amit Agarwal, Durga Toshniwal

    Abstract: With the rise of autonomous vehicles and advanced driver-assistance systems (ADAS), ensuring reliable object detection in all weather conditions is crucial for safety and efficiency. Adverse weather like snow, rain, and fog presents major challenges for current detection systems, often resulting in failures and potential safety risks. This paper introduces a novel framework and pipeline designed t… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.10197  [pdf, other

    cs.CV cs.AI cs.LG

    Crafting Parts for Expressive Object Composition

    Authors: Harsh Rangwani, Aishwarya Agarwal, Kuldeep Kulkarni, R. Venkatesh Babu, Srikrishna Karanam

    Abstract: Text-to-image generation from large generative models like Stable Diffusion, DALLE-2, etc., have become a common base for various tasks due to their superior quality and extensive knowledge bases. As image composition and generation are creative processes the artists need control over various parts of the images being generated. We find that just adding details about parts in the base text prompt… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Project Page Will Be Here: https://rangwani-harsh.github.io/PartCraft

  4. arXiv:2406.06027  [pdf, other

    cs.CL

    HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs

    Authors: Pranoy Panda, Ankush Agarwal, Chaitanya Devaguptapu, Manohar Kaul, Prathosh A P

    Abstract: Given unstructured text, Large Language Models (LLMs) are adept at answering simple (single-hop) questions. However, as the complexity of the questions increase, the performance of LLMs degrade. We believe this is due to the overhead associated with understanding the complex question followed by filtering and aggregating unstructured information in the raw text. Recent methods try to reduce this b… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 in the main track

  5. arXiv:2406.02252  [pdf, other

    cs.DC

    Exploring the Efficiency of Renewable Energy-based Modular Data Centers at Scale

    Authors: **ghan Sun, Zibo Gong, Anup Agarwal, Shadi Noghabi, Ranveer Chandra, Marc Snir, Jian Huang

    Abstract: Modular data centers (MDCs) that can be placed right at the energy farms and powered mostly by renewable energy, are proven to be a flexible and effective approach to lowering the carbon footprint of data centers. However, the main challenge of using renewable energy is the high variability of power produced, which implies large volatility in powering computing resources at MDCs, and degraded appl… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2406.01611  [pdf, other

    cs.IR cs.LG stat.ML

    System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes

    Authors: Arpit Agarwal, Nicolas Usunier, Alessandro Lazaric, Maximilian Nickel

    Abstract: Recommender systems are an important part of the modern human experience whose influence ranges from the food we eat to the news we read. Yet, there is still debate as to what extent recommendation platforms are aligned with the user goals. A core issue fueling this debate is the challenge of inferring a user utility based on engagement signals such as likes, shares, watch time etc., which are the… ▽ More

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: Accepted at FAccT'24

  7. arXiv:2406.00040  [pdf, other

    cs.CL

    Unveiling Themes in Judicial Proceedings: A Cross-Country Study Using Topic Modeling on Legal Documents from India and the UK

    Authors: Krish Didwania, Dr. Durga Toshniwal, Amit Agarwal

    Abstract: Legal documents are indispensable in every country for legal practices and serve as the primary source of information regarding previous cases and employed statutes. In today's world, with an increasing number of judicial cases, it is crucial to systematically categorize past cases into subgroups, which can then be utilized for upcoming cases and practices. Our primary focus in this endeavor was t… ▽ More

    Submitted 30 June, 2024; v1 submitted 27 May, 2024; originally announced June 2024.

  8. arXiv:2405.19316  [pdf, other

    cs.LG cs.CL

    Robust Preference Optimization through Reward Model Distillation

    Authors: Adam Fisch, Jacob Eisenstein, Vicky Zayats, Alekh Agarwal, Ahmad Beirami, Chirag Nagpal, Pete Shaw, Jonathan Berant

    Abstract: Language model (LM) post-training (or alignment) involves maximizing a reward function that is derived from preference annotations. Direct Preference Optimization (DPO) is a popular offline alignment method that trains a policy directly on preference data without the need to train a reward model or apply reinforcement learning. However, typical preference datasets have only a single, or at most a… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  9. arXiv:2405.18621  [pdf, other

    cs.LG stat.ME stat.ML

    Multi-Armed Bandits with Network Interference

    Authors: Abhineet Agarwal, Anish Agarwal, Lorenzo Masoero, Justin Whitehouse

    Abstract: Online experimentation with interference is a common challenge in modern applications such as e-commerce and adaptive clinical trials in medicine. For example, in online marketplaces, the revenue of a good depends on discounts applied to competing goods. Statistical inference with interference is widely studied in the offline setting, but far less is known about how to adaptively assign treatments… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  10. arXiv:2405.16204  [pdf, other

    cs.CV cs.AI cs.GR

    VOODOO XP: Expressive One-Shot Head Reenactment for VR Telepresence

    Authors: Phong Tran, Egor Zakharov, Long-Nhat Ho, Liwen Hu, Adilbek Karmanov, Aviral Agarwal, McLean Goldwhite, Ariana Bermudez Venegas, Anh Tuan Tran, Hao Li

    Abstract: We introduce VOODOO XP: a 3D-aware one-shot head reenactment method that can generate highly expressive facial expressions from any input driver video and a single 2D portrait. Our solution is real-time, view-consistent, and can be instantly used without calibration or fine-tuning. We demonstrate our solution on a monocular video setting and an end-to-end VR telepresence system for two-way communi… ▽ More

    Submitted 28 May, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  11. arXiv:2405.10345  [pdf, other

    q-bio.QM cs.AI cs.LG

    Machine Learning Driven Biomarker Selection for Medical Diagnosis

    Authors: Divyagna Bavikadi, Ayushi Agarwal, Shashank Ganta, Yunro Chung, Lusheng Song, Ji Qiu, Paulo Shakarian

    Abstract: Recent advances in experimental methods have enabled researchers to collect data on thousands of analytes simultaneously. This has led to correlational studies that associated molecular measurements with diseases such as Alzheimer's, Liver, and Gastric Cancer. However, the use of thousands of biomarkers selected from the analytes is not practical for real-world medical diagnosis and is likely unde… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  12. arXiv:2405.07991  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    SPIN: Simultaneous Perception, Interaction and Navigation

    Authors: Shagun Uppal, Ananye Agarwal, Haoyu Xiong, Kenneth Shaw, Deepak Pathak

    Abstract: While there has been remarkable progress recently in the fields of manipulation and locomotion, mobile manipulation remains a long-standing challenge. Compared to locomotion or static manipulation, a mobile system must make a diverse range of long-horizon tasks feasible in unstructured and dynamic environments. While the applications are broad and interesting, there are a plethora of challenges in… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: In CVPR 2024. Website at https://spin-robot.github.io/

  13. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  14. arXiv:2405.01573  [pdf, other

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: A**kya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  15. arXiv:2404.16831  [pdf, other

    cs.CV

    The Third Monocular Depth Estimation Challenge

    Authors: Jaime Spencer, Fabio Tosi, Matteo Poggi, Ripudaman Singh Arora, Chris Russell, Simon Hadfield, Richard Bowden, GuangYuan Zhou, ZhengXin Li, Qiang Rao, Yi** Bao, Xiao Liu, Dohyeong Kim, **seong Kim, Myunghyun Kim, Mykola Lavreniuk, Rui Li, Qing Mao, Jiang Wu, Yu Zhu, **qiu Sun, Yanning Zhang, Suraj Patni, Aradhye Agarwal, Chetan Arora , et al. (16 additional authors not shown)

    Abstract: This paper discusses the results of the third edition of the Monocular Depth Estimation Challenge (MDEC). The challenge focuses on zero-shot generalization to the challenging SYNS-Patches dataset, featuring complex scenes in natural and indoor settings. As with the previous edition, methods can use any form of supervision, i.e. supervised or self-supervised. The challenge received a total of 19 su… ▽ More

    Submitted 27 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: To appear in CVPRW2024

  16. Understanding the Role of Temperature in Diverse Question Generation by GPT-4

    Authors: Arav Agarwal, Karthik Mittal, Aidan Doyle, Pragnya Sridhar, Zipiao Wan, Jacob Arthur Doughty, Jaromir Savelka, Majd Sakr

    Abstract: We conduct a preliminary study of the effect of GPT's temperature parameter on the diversity of GPT4-generated questions. We find that using higher temperature values leads to significantly higher diversity, with different temperatures exposing different types of similarity between generated sets of questions. We also demonstrate that diverse question generation is especially difficult for questio… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  17. arXiv:2404.01049  [pdf, other

    astro-ph.IM cs.LG

    A Novel Sector-Based Algorithm for an Optimized Star-Galaxy Classification

    Authors: Anumanchi Agastya Sai Ram Likhit, Divyansh Tripathi, Akshay Agarwal

    Abstract: This paper introduces a novel sector-based methodology for star-galaxy classification, leveraging the latest Sloan Digital Sky Survey data (SDSS-DR18). By strategically segmenting the sky into sectors aligned with SDSS observational patterns and employing a dedicated convolutional neural network (CNN), we achieve state-of-the-art performance for star galaxy classification. Our preliminary results… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Journal ref: The Second Tiny Papers Track at ICLR 2024

  18. arXiv:2403.19462  [pdf, other

    cs.LG cs.PL

    Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization

    Authors: Teodor V. Marinov, Alekh Agarwal, Mircea Trofin

    Abstract: This work studies a Reinforcement Learning (RL) problem in which we are given a set of trajectories collected with K baseline policies. Each of these policies can be quite suboptimal in isolation, and have strong performance in complementary parts of the state space. The goal is to learn a policy which performs as well as the best combination of baselines on the entire state space. We propose a si… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  19. arXiv:2403.18807  [pdf, other

    cs.CV cs.AI cs.LG

    ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation

    Authors: Suraj Patni, Aradhye Agarwal, Chetan Arora

    Abstract: In the absence of parallax cues, a learning-based single image depth estimation (SIDE) model relies heavily on shading and contextual cues in the image. While this simplicity is attractive, it is necessary to train such models on large and varied datasets, which are difficult to capture. It has been shown that using embeddings from pre-trained foundational models, such as CLIP, improves zero shot… ▽ More

    Submitted 17 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  20. arXiv:2403.12950  [pdf, other

    cs.LG stat.ML

    Optimal and Adaptive Non-Stationary Dueling Bandits Under a Generalized Borda Criterion

    Authors: Joe Suk, Arpit Agarwal

    Abstract: In dueling bandits, the learner receives preference feedback between arms, and the regret of an arm is defined in terms of its suboptimality to a winner arm. The more challenging and practically motivated non-stationary variant of dueling bandits, where preferences change over time, has been the focus of several recent works (Saha and Gupta, 2022; Buening and Saha, 2023; Suk and Agarwal, 2023). Th… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  21. arXiv:2403.08299  [pdf, other

    cs.SE cs.AI

    AutoDev: Automated AI-Driven Development

    Authors: Michele Tufano, Anisha Agarwal, **u Jang, Roshanak Zilouchian Moghaddam, Neel Sundaresan

    Abstract: The landscape of software development has witnessed a paradigm shift with the advent of AI-powered assistants, exemplified by GitHub Copilot. However, existing solutions are not leveraging all the potential capabilities available in an IDE such as building, testing, executing code, git operations, etc. Therefore, they are constrained by their limited capabilities, primarily focusing on suggesting… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  22. arXiv:2403.05931  [pdf

    cs.CL cs.LG

    Thread Detection and Response Generation using Transformers with Prompt Optimisation

    Authors: Kevin Joshua T, Arnav Agarwal, Shriya Sanjay, Yash Sarda, John Sahaya Rani Alex, Saurav Gupta, Sushant Kumar, Vishwanath Kamath

    Abstract: Conversational systems are crucial for human-computer interaction, managing complex dialogues by identifying threads and prioritising responses. This is especially vital in multi-party conversations, where precise identification of threads and strategic response prioritisation ensure efficient dialogue management. To address these challenges an end-to-end model that identifies threads and prioriti… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures, submitted to 2024 IEEE International Conference on Signal Processing and Communications (SPCOM)

    ACM Class: I.2.7; I.2.6

  23. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  24. arXiv:2403.04638  [pdf, other

    cs.RO

    Scalable, Simulation-Guided Compliant Tactile Finger Design

    Authors: Yuxiang Ma, Arpit Agarwal, Sandra Q. Liu, Wenzhen Yuan, Edward H. Adelson

    Abstract: Compliant grippers enable robots to work with humans in unstructured environments. In general, these grippers can improve with tactile sensing to estimate the state of objects around them to precisely manipulate objects. However, co-designing compliant structures with high-resolution tactile sensing is a challenging task. We propose a simulation framework for the end-to-end forward design of GelSi… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Yuxiang Ma, Arpit Agarwal, and Sandra Q. Liu contributed equally to this work. Project video: https://youtu.be/CnTUTA5cfMw . 7 pages, 11 figures, 2024 IEEE International Conference on Soft Robotics (RoboSoft)

  25. arXiv:2402.17447  [pdf, other

    cs.CL cs.AI cs.IR

    Deep Learning Based Named Entity Recognition Models for Recipes

    Authors: Mansi Goel, Ayush Agarwal, Shubham Agrawal, Janak Kapuriya, Akhil Vamshi Konam, Rishabh Gupta, Shrey Rastogi, Niharika, Ganesh Bagler

    Abstract: Food touches our lives through various endeavors, including flavor, nourishment, health, and sustainability. Recipes are cultural capsules transmitted across generations via unstructured text. Automated protocols for recognizing named entities, the building blocks of recipe text, are of immense value for various applications ranging from information extraction to novel recipe generation. Named ent… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages, 6 main figures and 2 in appendices, and 3 main tables; Accepted for publication in LREC-COLING 2024

  26. arXiv:2402.17235  [pdf, other

    cs.LG

    Stochastic Gradient Succeeds for Bandits

    Authors: **cheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvari, Dale Schuurmans

    Abstract: We show that the \emph{stochastic gradient} bandit algorithm converges to a \emph{globally optimal} policy at an $O(1/t)$ rate, even with a \emph{constant} step size. Remarkably, global convergence of the stochastic gradient bandit algorithm has not been previously established, even though it is an old algorithm known to be applicable to bandits. The new result is achieved by establishing two nove… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 39 pages; Correction for a previous version published at ICML 2023 conference

  27. Parallel Approximate Maximum Flows in Near-Linear Work and Polylogarithmic Depth

    Authors: Arpit Agarwal, Sanjeev Khanna, Huan Li, Prathamesh Patil, Chen Wang, Nathan White, Peilin Zhong

    Abstract: We present a parallel algorithm for the $(1-ε)$-approximate maximum flow problem in capacitated, undirected graphs with $n$ vertices and $m$ edges, achieving $O(ε^{-3}\text{polylog} n)$ depth and $O(m ε^{-3} \text{polylog} n)$ work in the PRAM model. Although near-linear time sequential algorithms for this problem have been known for almost a decade, no parallel algorithms that simultaneously achi… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  28. arXiv:2402.14576  [pdf, other

    cs.NI cs.LG eess.SY

    Edge Caching Based on Deep Reinforcement Learning and Transfer Learning

    Authors: Farnaz Niknia, ** Wang, Zixu Wang, Aakash Agarwal, Adib S. Rezaei

    Abstract: This paper addresses the escalating challenge of redundant data transmission in networks. The surge in traffic has strained backhaul links and backbone networks, prompting the exploration of caching solutions at the edge router. Existing work primarily relies on Markov Decision Processes (MDP) for caching issues, assuming fixed-time interval decisions; however, real-world scenarios involve random… ▽ More

    Submitted 29 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  29. arXiv:2402.14261  [pdf, other

    cs.SE cs.AI

    Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming

    Authors: Anisha Agarwal, Aaron Chan, Shubham Chandel, **u Jang, Shaun Miller, Roshanak Zilouchian Moghaddam, Yevhen Mohylevskyy, Neel Sundaresan, Michele Tufano

    Abstract: The integration of Large Language Models (LLMs) into Development Environments (IDEs) has become a focal point in modern software development. LLMs such as OpenAI GPT-3.5/4 and Code Llama offer the potential to significantly augment developer productivity by serving as intelligent, chat-driven programming assistants. However, utilizing LLMs out of the box is unlikely to be optimal for any given sce… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  30. arXiv:2402.14013  [pdf, ps, other

    cs.LG cs.DS

    Misalignment, Learning, and Ranking: Harnessing Users Limited Attention

    Authors: Arpit Agarwal, Rad Niazadeh, Prathamesh Patil

    Abstract: In digital health and EdTech, recommendation systems face a significant challenge: users often choose impulsively, in ways that conflict with the platform's long-term payoffs. This misalignment makes it difficult to effectively learn to rank items, as it may hinder exploration of items with greater long-term payoffs. Our paper tackles this issue by utilizing users' limited attention spans. We prop… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  31. arXiv:2402.13448  [pdf, other

    cs.CL cs.AI cs.LG

    ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance

    Authors: Liwen Sun, Abhineet Agarwal, Aaron Kornblith, Bin Yu, Chenyan Xiong

    Abstract: In the emergency department (ED), patients undergo triage and multiple laboratory tests before diagnosis. This time-consuming process causes ED crowding which impacts patient mortality, medical errors, staff burnout, etc. This work proposes (time) cost-effective diagnostic assistance that leverages artificial intelligence systems to help ED clinicians make efficient and accurate diagnoses. In coll… ▽ More

    Submitted 27 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  32. arXiv:2402.12629  [pdf, other

    cs.MM cs.CY cs.SI

    Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale

    Authors: Anmol Agarwal, Pratyush Priyadarshi, Shiven Sinha, Shrey Gupta, Hitkul Jangra, Kiran Garimella, Ponnurangam Kumaraguru

    Abstract: In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimedia essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scal… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  33. arXiv:2402.11652  [pdf, other

    econ.EM cs.LG stat.ME stat.ML

    Doubly Robust Inference in Causal Latent Factor Models

    Authors: Alberto Abadie, Anish Agarwal, Raaz Dwivedi, Abhin Shah

    Abstract: This article introduces a new estimator of average treatment effects under unobserved confounding in modern data-rich environments featuring large numbers of units and outcomes. The proposed estimator is doubly robust, combining outcome imputation, inverse probability weighting, and a novel cross-fitting procedure for matrix completion. We derive finite-sample and asymptotic guarantees, and show t… ▽ More

    Submitted 15 April, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  34. arXiv:2402.07198  [pdf, other

    cs.LG

    More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

    Authors: Kaiwen Wang, Owen Oertell, Alekh Agarwal, Nathan Kallus, Wen Sun

    Abstract: In this paper, we prove that Distributional Reinforcement Learning (DistRL), which learns the return distribution, can obtain second-order bounds in both online and offline RL in general settings with function approximation. Second-order bounds are instance-dependent bounds that scale with the variance of return, which we prove are tighter than the previously known small-loss bounds of distributio… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  35. arXiv:2401.04056  [pdf, other

    cs.LG

    A Minimaximalist Approach to Reinforcement Learning from Human Feedback

    Authors: Gokul Swamy, Christoph Dann, Rahul Kidambi, Zhiwei Steven Wu, Alekh Agarwal

    Abstract: We present Self-Play Preference Optimization (SPO), an algorithm for reinforcement learning from human feedback. Our approach is minimalist in that it does not require training a reward model nor unstable adversarial training and is therefore rather simple to implement. Our approach is maximalist in that it provably handles non-Markovian, intransitive, and stochastic preferences while being robust… ▽ More

    Submitted 13 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  36. arXiv:2401.01879  [pdf, other

    cs.LG cs.CL cs.IT

    Theoretical guarantees on the best-of-n alignment policy

    Authors: Ahmad Beirami, Alekh Agarwal, Jonathan Berant, Alexander D'Amour, Jacob Eisenstein, Chirag Nagpal, Ananda Theertha Suresh

    Abstract: A simple and effective method for the alignment of generative models is the best-of-$n$ policy, where $n$ samples are drawn from a base policy, and ranked based on a reward function, and the highest ranking one is selected. A commonly used analytical expression in the literature claims that the KL divergence between the best-of-$n$ policy and the base policy is equal to $\log (n) - (n-1)/n.$ We di… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  37. arXiv:2401.01633  [pdf, ps, other

    cs.CC quant-ph

    Quantum Polynomial Hierarchies: Karp-Lipton, error reduction, and lower bounds

    Authors: Avantika Agarwal, Sevag Gharibian, Venkata Koppula, Dorian Rudolph

    Abstract: The Polynomial-Time Hierarchy ($\mathsf{PH}$) is a staple of classical complexity theory, with applications spanning randomized computation to circuit lower bounds to ''quantum advantage'' analyses for near-term quantum computers. Quantumly, however, despite the fact that at least \emph{four} definitions of quantum $\mathsf{PH}$ exist, it has been challenging to prove analogues for these of even b… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  38. arXiv:2312.16307  [pdf, other

    econ.EM cs.GT cs.LG stat.ME

    Incentive-Aware Synthetic Control: Accurate Counterfactual Estimation via Incentivized Exploration

    Authors: Daniel Ngo, Keegan Harris, Anish Agarwal, Vasilis Syrgkanis, Zhiwei Steven Wu

    Abstract: We consider the setting of synthetic control methods (SCMs), a canonical approach used to estimate the treatment effect on the treated in a panel data setting. We shed light on a frequently overlooked but ubiquitous assumption made in SCMs of "overlap": a treated unit can be written as some combination -- typically, convex or linear combination -- of the units that remain under control. We show th… ▽ More

    Submitted 13 February, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  39. arXiv:2312.16079  [pdf, other

    cs.NI eess.SY

    Coexistence assessment and interference mitigation for 5G and Fixed Satellite Stations in C-band in India

    Authors: Avinash Agarwal

    Abstract: In this paper, we present the findings of a study conducted to assess the coexistence of Fifth Generation (5G) wireless networks and Fixed Satellite Station (FSS) receivers in the C-Band (3300-4200 MHz) in India. Through simulations, we evaluate the coexistence feasibility and calculate the minimum separation distances required to mitigate interference, consider-ing factors such as 5G Base Station… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 10 pages, 9 figures

  40. arXiv:2312.15427  [pdf, other

    cs.LG cs.DS

    Semi-Bandit Learning for Monotone Stochastic Optimization

    Authors: Arpit Agarwal, Rohan Ghuge, Viswanath Nagarajan

    Abstract: Stochastic optimization is a widely used approach for optimization under uncertainty, where uncertain input parameters are modeled by random variables. Exact or approximation algorithms have been obtained for several fundamental problems in this area. However, a significant limitation of this approach is that it requires full knowledge of the underlying probability distributions. Can we still get… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  41. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  42. arXiv:2312.09244  [pdf, other

    cs.LG

    Hel** or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

    Authors: Jacob Eisenstein, Chirag Nagpal, Alekh Agarwal, Ahmad Beirami, Alex D'Amour, DJ Dvijotham, Adam Fisch, Katherine Heller, Stephen Pfohl, Deepak Ramachandran, Peter Shaw, Jonathan Berant

    Abstract: Reward models play a key role in aligning language model applications towards human preferences. However, this setup creates an incentive for the language model to exploit errors in the reward model to achieve high estimated reward, a phenomenon often termed \emph{reward hacking}. A natural mitigation is to train an ensemble of reward models, aggregating over model outputs to obtain a more robust… ▽ More

    Submitted 20 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

  43. arXiv:2312.06719  [pdf, other

    cs.CV

    SkyScenes: A Synthetic Dataset for Aerial Scene Understanding

    Authors: Sahil Khose, Anisha Pal, Aayushi Agarwal, Deepanshi, Judy Hoffman, Prithvijit Chattopadhyay

    Abstract: Real-world aerial scene understanding is limited by a lack of datasets that contain densely annotated images curated under a diverse set of conditions. Due to inherent challenges in obtaining such images in controlled real-world settings, we present SkyScenes, a synthetic dataset of densely annotated aerial images captured from Unmanned Aerial Vehicle (UAV) perspectives. We carefully curate SkySce… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  44. arXiv:2312.03173  [pdf, other

    cs.CY cs.AI cs.CL

    A Comparative Study of AI-Generated (GPT-4) and Human-crafted MCQs in Programming Education

    Authors: Jacob Doughty, Zipiao Wan, Anishka Bompelli, Jubahed Qayum, Taozhi Wang, Juran Zhang, Yujia Zheng, Aidan Doyle, Pragnya Sridhar, Arav Agarwal, Christopher Bogart, Eric Keylor, Can Kultur, Jaromir Savelka, Majd Sakr

    Abstract: There is a constant need for educators to develop and maintain effective up-to-date assessments. While there is a growing body of research in computing education on utilizing large language models (LLMs) in generation and engagement with coding exercises, the use of LLMs for generating programming MCQs has not been extensively explored. We analyzed the capability of GPT-4 to produce multiple-choic… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  45. arXiv:2312.02975  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Dexterous Functional Gras**

    Authors: Ananye Agarwal, Shagun Uppal, Kenneth Shaw, Deepak Pathak

    Abstract: While there have been significant strides in dexterous manipulation, most of it is limited to benchmark tasks like in-hand reorientation which are of limited utility in the real world. The main benefit of dexterous hands over two-fingered ones is their ability to pickup tools and other objects (including thin ones) and grasp them firmly to apply force. However, this task requires both a complex un… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: In CoRL 2023. Website at https://dexfunc.github.io/

  46. arXiv:2311.13006  [pdf, ps, other

    cs.DS

    Learning-Augmented Dynamic Submodular Maximization

    Authors: Arpit Agarwal, Eric Balkanski

    Abstract: In dynamic submodular maximization, the goal is to maintain a high-value solution over a sequence of element insertions and deletions with a fast update time. Motivated by large-scale applications and the fact that dynamic data often exhibits patterns, we ask the following question: can predictions be used to accelerate the update time of dynamic submodular maximization algorithms? We consider t… ▽ More

    Submitted 8 March, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  47. arXiv:2311.11919  [pdf, other

    cs.CV

    An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis

    Authors: Aishwarya Agarwal, Srikrishna Karanam, Tripti Shukla, Balaji Vasan Srinivasan

    Abstract: We consider the problem of constraining diffusion model outputs with a user-supplied reference image. Our key objective is to extract multiple attributes (e.g., color, object, layout, style) from this single reference image, and then generate new samples with them. One line of existing work proposes to invert the reference images into a single textual conditioning vector, enabling generation of ne… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  48. arXiv:2311.09612  [pdf, other

    cs.CV cs.CL

    Efficient End-to-End Visual Document Understanding with Rationale Distillation

    Authors: Wang Zhu, Alekh Agarwal, Mandar Joshi, Robin Jia, Jesse Thomason, Kristina Toutanova

    Abstract: Understanding visually situated language requires interpreting complex layouts of textual and visual elements. Pre-processing tools, such as optical character recognition (OCR), can map document image inputs to textual tokens, then large language models (LLMs) can reason over text. However, such methods have high computational and engineering complexity. Can small pretrained image-to-text models a… ▽ More

    Submitted 1 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted by NAACL 2024

  49. arXiv:2311.09518  [pdf, other

    cs.CY

    From GPT-3 to GPT-4: On the Evolving Efficacy of LLMs to Answer Multiple-choice Questions for Programming Classes in Higher Education

    Authors: Jaromir Savelka, Arav Agarwal, Christopher Bogart, Majd Sakr

    Abstract: We explore the evolving efficacy of three generative pre-trained transformer (GPT) models in generating answers for multiple-choice questions (MCQ) from introductory and intermediate Python programming courses in higher education. We focus on the differences in capabilities of the models prior to the release of ChatGPT (Nov '22), at the time of the release, and today (i.e., Aug '23). Recent studie… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2303.08033, arXiv:2306.10073

  50. arXiv:2311.09497  [pdf, other

    cs.DL cs.GT

    Peer Reviews of Peer Reviews: A Randomized Controlled Trial and Other Experiments

    Authors: Alexander Goldberg, Ivan Stelmakh, Kyunghyun Cho, Alice Oh, Alekh Agarwal, Danielle Belgrave, Nihar B. Shah

    Abstract: Is it possible to reliably evaluate the quality of peer reviews? We study this question driven by two primary motivations -- incentivizing high-quality reviewing using assessed quality of reviews and measuring changes to review quality in experiments. We conduct a large scale study at the NeurIPS 2022 conference, a top-tier conference in machine learning, in which we invited (meta)-reviewers and a… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.