Showing 1–50 of 71 results for author: Siddarth

  1. arXiv:2407.08933  [pdf]

    cs.LG

    Machine Learning in High Volume Media Manufacturing

    Authors: Siddarth Reddy Karuka, Abhinav Sunderrajan, Zheng Zheng, Yong Woon Tiean, Ganesh Nagappan, Allan Luk

    Abstract: Errors or failures in a high-volume manufacturing environment can have significant impact that can result in both the loss of time and money. Identifying such failures early has been a top priority for manufacturing industries and various rule-based algorithms have been developed over the years. However, catching these failures is time consuming and such algorithms cannot adapt well to changes in…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2406.07814  [pdf, other]

    cs.AI cs.CL cs.HC

    Collective Constitutional AI: Aligning a Language Model with Public Input

    Authors: Saffron Huang, Divya Siddarth, Liane Lovitt, Thomas I. Liao, Esin Durmus, Alex Tamkin, Deep Ganguli

    Abstract: There is growing consensus that language model (LM) developers should not be the sole deciders of LM behavior, creating a need for methods that enable the broader public to collectively shape the behavior of LM systems that affect them. To address this need, we present Collective Constitutional AI (CCAI): a multi-stage process for sourcing and integrating public input into LMs, from identifying a t…

    Submitted 11 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; K.4.2

    Journal ref: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency. 1395-1417

  3. arXiv:2406.05331  [pdf, other]

    cs.RO

    Autonomous Robotic Assembly: From Part Singulation to Precise Assembly

    Authors: Kei Ota, Devesh K. Jha, Siddarth Jain, Bill Yerazunis, Radu Corcodel, Yash Shukla, Antonia Bronars, Diego Romeres

    Abstract: Imagine a robot that can assemble a functional product from the individual parts presented in any configuration to the robot. Designing such a robotic system is a complex problem which presents several open challenges. To bypass these challenges, the current generation of assembly systems is built with a lot of system integration effort to provide the structure and precision necessary for assembly…

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Under submission

  4. arXiv:2405.20971  [pdf, other]

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/GFNOrg/diffusion-finetuning

  5. arXiv:2403.05513  [pdf, other]

    cs.RO

    A Detection and Filtering Framework for Collaborative Localization

    Authors: Thirumalaesh Ashokkumar, Katherine A Skinner, Siddarth Agarwal, Ankit Vora, Ashutosh Bhown

    Abstract: Increasingly, autonomous vehicles (AVs) are becoming a reality, such as the Advanced Driver Assistance Systems (ADAS) in vehicles that assist drivers in driving and parking functions with vehicles today. The localization problem for AVs relies primarily on multiple sensors, including cameras, LiDARs, and radars. Manufacturing, installing, calibrating, and maintaining these sensors can be very expe…

    Submitted 8 March, 2024; originally announced March 2024.

  6. arXiv:2402.17904  [pdf]

    cs.RO

    4CNet: A Confidence-Aware, Contrastive, Conditional, Consistency Model for Robot Map Prediction in Multi-Robot Environments

    Authors: Aaron Hao Tan, Siddarth Narasimhan, Goldie Nejat

    Abstract: Mobile robots in unknown cluttered environments with irregularly shaped obstacles often face sensing, energy, and communication challenges which directly affect their ability to explore these environments. In this paper, we introduce a novel deep learning method, Confidence-Aware Contrastive Conditional Consistency Model (4CNet), for mobile robot map prediction during resource-limited exploration…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 14 pages, 10 figures

  7. arXiv:2312.08468  [pdf, other]

    cs.AI

    On Diagnostics for Understanding Agent Training Behaviour in Cooperative MARL

    Authors: Wiem Khlifi, Siddarth Singh, Omayma Mahjoub, Ruan de Kock, Abidine Vall, Rihab Gorsane, Arnu Pretorius

    Abstract: Cooperative multi-agent reinforcement learning (MARL) has made substantial strides in addressing the distributed decision-making challenges. However, as multi-agent systems grow in complexity, gaining a comprehensive understanding of their behaviour becomes increasingly challenging. Conventionally, tracking team rewards over time has served as a pragmatic measure to gauge the effectiveness of agen…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 4 pages, AAAI XAI4DRL workshop 2023

    MSC Class: I.2.11; I.2.0; A.0

  8. arXiv:2312.08466  [pdf, other]

    cs.AI

    Efficiently Quantifying Individual Agent Importance in Cooperative MARL

    Authors: Omayma Mahjoub, Ruan de Kock, Siddarth Singh, Wiem Khlifi, Abidine Vall, Kale-ab Tessera, Arnu Pretorius

    Abstract: Measuring the contribution of individual agents is challenging in cooperative multi-agent reinforcement learning (MARL). In cooperative MARL, team performance is typically inferred from a single shared global reward. Arguably, among the best current approaches to effectively measure individual agent contributions is to use Shapley values. However, calculating these values is expensive as the compu…

    Submitted 26 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 8 pages, AAAI XAI4DRL workshop 2023; references updated, figure 8 style updated, typos

    MSC Class: I.2.11; I.2.0; A.0

  9. arXiv:2312.08463  [pdf, other]

    cs.AI

    How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning

    Authors: Siddarth Singh, Omayma Mahjoub, Ruan de Kock, Wiem Khlifi, Abidine Vall, Kale-ab Tessera, Arnu Pretorius

    Abstract: Establishing sound experimental standards and rigour is important in any growing field of research. Deep Multi-Agent Reinforcement Learning (MARL) is one such nascent field. Although exciting progress has been made, MARL has recently come under scrutiny for replicability issues and a lack of standardised evaluation methodology, specifically in the cooperative setting. Although protocols have been…

    Submitted 26 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 6 pages, AAAI XAI4DRL workshop 2023; typos corrected, images updated, page count updated

    MSC Class: I.2.11; I.2.0; A.0

  10. arXiv:2312.06876  [pdf, other]

    cs.RO cs.AI

    Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks

    Authors: Lingfeng Sun, Devesh K. Jha, Chiori Hori, Siddarth Jain, Radu Corcodel, Xinghao Zhu, Masayoshi Tomizuka, Diego Romeres

    Abstract: Designing robotic agents to perform open vocabulary tasks has been the long-standing goal in robotics and AI. Recently, Large Language Models (LLMs) have achieved impressive results in creating robotic agents for performing open vocabulary tasks. However, planning for these tasks in the presence of uncertainties is challenging as it requires "chain-of-thought" reasoning, aggregating inform…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 22 pages, 4 figures

  11. arXiv:2310.11207  [pdf, other]

    cs.CL cs.LG

    Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations

    Authors: Shiyuan Huang, Siddarth Mamidanna, Shreedhar Jangam, Yilun Zhou, Leilani H. Gilpin

    Abstract: Large language models (LLMs) such as ChatGPT have demonstrated superior performance on a variety of natural language processing (NLP) tasks including sentiment analysis, mathematical reasoning and summarization. Furthermore, since these models are instruction-tuned on human conversations to produce "helpful" responses, they can and often will produce explanations along with the response, which we…

    Submitted 17 October, 2023; originally announced October 2023.

  12. arXiv:2310.06751  [pdf, other]

    cs.RO

    EARL: Eye-on-Hand Reinforcement Learner for Dynamic Grasping with Active Pose Estimation

    Authors: Baichuan Huang, Jingjin Yu, Siddarth Jain

    Abstract: In this paper, we explore the dynamic grasping of moving objects through active pose tracking and reinforcement learning for hand-eye coordination systems. Most existing vision-based robotic grasping methods implicitly assume target objects are stationary or moving predictably. Performing grasping of unpredictably moving objects presents a unique set of challenges. For example, a pre-computed robu…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Presented at IROS 2023. Corresponding author: Siddarth Jain

  13. arXiv:2309.06599  [pdf, other]

    cs.LG

    Reasoning with Latent Diffusion in Offline Reinforcement Learning

    Authors: Siddarth Venkatraman, Shivesh Khaitan, Ravi Tej Akella, John Dolan, Jeff Schneider, Glen Berseth

    Abstract: Offline reinforcement learning (RL) holds promise as a means to learn high-reward policies from a static dataset, without the need for further environment interactions. However, a key challenge in offline RL lies in effectively stitching portions of suboptimal trajectories from the static dataset while avoiding extrapolation errors arising due to a lack of support in the dataset. Existing approach…

    Submitted 12 September, 2023; originally announced September 2023.

  14. arXiv:2307.03718  [pdf, other]

    cs.CY cs.AI

    Frontier AI Regulation: Managing Emerging Risks to Public Safety

    Authors: Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

    Abstract: Advanced AI models hold the promise of tremendous benefits for humanity, but society needs to proactively manage the accompanying risks. In this paper, we focus on what we term "frontier AI" models: highly capable foundation models that could possess dangerous capabilities sufficient to pose severe risks to public safety. Frontier AI models pose a distinct regulatory challenge: dangerous capabilit…

    Submitted 7 November, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Update July 11th: - Added missing footnote back in. - Adjusted author order (mistakenly non-alphabetical among the first 6 authors) and adjusted affiliations (Jess Whittlestone's affiliation was mistagged and Gillian Hadfield had SRI added to her affiliations) Updated September 4th: Various typos

  15. arXiv:2306.15644  [pdf, other]

    cs.CL

    Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos

    Authors: Chiori Hori, Puyuan Peng, David Harwath, Xinyu Liu, Kei Ota, Siddarth Jain, Radu Corcodel, Devesh Jha, Diego Romeres, Jonathan Le Roux

    Abstract: To realize human-robot collaboration, robots need to execute actions for new tasks according to human instructions given finite prior knowledge. Human experts can share their knowledge of how to perform a task with a robot through multi-modal instructions in their demonstrations, showing a sequence of short-horizon steps to achieve a long-horizon goal. This paper introduces a method for robot acti…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted to Interspeech 2023

  16. arXiv:2306.09884  [pdf, other]

    cs.LG cs.AI

    Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

    Authors: Clément Bonnet, Daniel Luo, Donal Byrne, Shikha Surana, Sasha Abramowitz, Paul Duckworth, Vincent Coyette, Laurence I. Midgley, Elshadai Tegegn, Tristan Kalloniatis, Omayma Mahjoub, Matthew Macfarlane, Andries P. Smit, Nathan Grinsztajn, Raphael Boige, Cemlyn N. Waters, Mohamed A. Mimouni, Ulrich A. Mbou Sob, Ruan de Kock, Siddarth Singh, Daniel Furelos-Blanco, Victor Le, Arnu Pretorius, Alexandre Laterre

    Abstract: Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to enable their utilization in a wider range of potential real-world applications. Therefore, we present Jumanji, a suite of diverse RL environments speci…

    Submitted 15 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: 9 pages + 21 pages of appendices and references. Published at ICLR 2024

  17. arXiv:2306.08001  [pdf, ps, other]

    cs.LG cs.AI

    A Markovian Formalism for Active Querying

    Authors: Sid Ijju

    Abstract: Active learning algorithms have been an integral part of recent advances in artificial intelligence. However, the research in the field is widely varying and lacks an overall organizing lens. We outline a Markovian formalism for the field of active learning and survey the literature to demonstrate the organizing capability of our proposed formalism. Our formalism takes a partially observable Mark…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: Active Learning, Markov, Inverse Reinforcement Learning, Query

  18. arXiv:2306.07180  [pdf, other]

    cs.LG

    Diffusion Models for Black-Box Optimization

    Authors: Siddarth Krishnamoorthy, Satvik Mehul Mashkaria, Aditya Grover

    Abstract: The goal of offline black-box optimization (BBO) is to optimize an expensive black-box function using a fixed dataset of function evaluations. Prior works consider forward approaches that learn surrogates to the black-box function and inverse approaches that directly map function values to corresponding points in the input domain of the black-box function. These approaches are limited by the quali…

    Submitted 21 August, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: International Conference on Machine Learning 2023

  19. arXiv:2306.04305  [pdf, other]

    cs.GT econ.TH

    Self-Resolving Prediction Markets for Unverifiable Outcomes

    Authors: Siddarth Srinivasan, Ezra Karger, Yiling Chen

    Abstract: Prediction markets elicit and aggregate beliefs by paying agents based on how close their predictions are to a verifiable future outcome. However, outcomes of many important questions are difficult to verify or unverifiable, in that the ground truth may be hard or impossible to access. Examples include questions about causal effects where it is infeasible or unethical to run randomized trials; cro…

    Submitted 7 June, 2023; originally announced June 2023.

  20. arXiv:2306.01654  [pdf, other]

    cs.LG cs.CV stat.ML

    GANs Settle Scores!

    Authors: Siddarth Asokan, Nishanth Shetty, Aadithya Srikanth, Chandra Sekhar Seelamantula

    Abstract: Generative adversarial networks (GANs) comprise a generator, trained to learn the underlying distribution of the desired data, and a discriminator, trained to distinguish real samples from those output by the generator. A majority of GAN literature focuses on understanding the optimality of the discriminator through integral probability metric (IPM) or divergence based analysis. In this paper, we…

    Submitted 2 June, 2023; originally announced June 2023.

  21. arXiv:2306.00785  [pdf, other]

    stat.ML cs.LG

    Data Interpolants -- That's What Discriminators in Higher-order Gradient-regularized GANs Are

    Authors: Siddarth Asokan, Chandra Sekhar Seelamantula

    Abstract: We consider the problem of optimizing the discriminator in generative adversarial networks (GANs) subject to higher-order gradient regularization. We show analytically, via the least-squares (LSGAN) and Wasserstein (WGAN) GAN variants, that the discriminator optimization problem is one of interpolation in $n$-dimensions. The optimal discriminator, derived using variational calculus, turns out to b…

    Submitted 1 June, 2023; originally announced June 2023.

  22. arXiv:2305.15324  [pdf, other]

    cs.AI

    Model evaluation for extreme risks

    Authors: Toby Shevlane, Sebastian Farquhar, Ben Garfinkel, Mary Phuong, Jess Whittlestone, Jade Leung, Daniel Kokotajlo, Nahema Marchal, Markus Anderljung, Noam Kolt, Lewis Ho, Divya Siddarth, Shahar Avin, Will Hawkins, Been Kim, Iason Gabriel, Vijay Bolina, Jack Clark, Yoshua Bengio, Paul Christiano, Allan Dafoe

    Abstract: Current approaches to building general-purpose AI systems tend to produce systems with both beneficial and harmful capabilities. Further progress in AI development could lead to capabilities that pose extreme risks, such as offensive cyber capabilities or strong manipulation skills. We explain why model evaluation is critical for addressing extreme risks. Developers must be able to identify danger…

    Submitted 22 September, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Fixed typos; added citation

    ACM Class: K.4.1

  23. arXiv:2305.14547  [pdf]

    cs.AR cs.ET cs.LG

    Bulk-Switching Memristor-based Compute-In-Memory Module for Deep Neural Network Training

    Authors: Yuting Wu, Qiwen Wang, Ziyu Wang, Xinxin Wang, Buvna Ayyagari, Siddarth Krishnan, Michael Chudzik, Wei D. Lu

    Abstract: The need for deep neural network (DNN) models with higher performance and better functionality leads to the proliferation of very large models. Model training, however, requires intensive computation time and energy. Memristor-based compute-in-memory (CIM) modules can perform vector-matrix multiplication (VMM) in situ and in parallel, and have shown great promise in DNN inference applications. Ho…

    Submitted 23 May, 2023; originally announced May 2023.

    Journal ref: Adv. Mater. 35 (2023) 2305465

  24. arXiv:2305.07613  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Spider GAN: Leveraging Friendly Neighbors to Accelerate GAN Training

    Authors: Siddarth Asokan, Chandra Sekhar Seelamantula

    Abstract: Training generative adversarial networks (GANs) stably is a challenging task. The generator in GANs transforms noise vectors, typically Gaussian distributed, into realistic data such as images. In this paper, we propose a novel approach for training GANs with images as inputs, but without enforcing any pairwise constraints. The intuition is that images are more structured than noise, which the gene…

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: CVPR 2023

  25. arXiv:2305.02554  [pdf, other]

    cs.RO

    Learning Generalizable Pivoting Skills

    Authors: Xiang Zhang, Siddarth Jain, Baichuan Huang, Masayoshi Tomizuka, Diego Romeres

    Abstract: The skill of pivoting an object with a robotic system is challenging for the external forces that act on the system, mainly given by contact interaction. The complexity increases when the same skills are required to generalize across different objects. This paper proposes a framework for learning robust and generalizable pivoting skills, which consists of three steps. First, we learn a pivoting po…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 2023 International Conference on Robotics and Automation (ICRA)

  26. arXiv:2304.00009  [pdf, other]

    cs.AI

    The challenge of redundancy on multi-agent value factorisation

    Authors: Siddarth Singh, Benjamin Rosman

    Abstract: In the field of cooperative multi-agent reinforcement learning (MARL), the standard paradigm is the use of centralised training and decentralised execution where a central critic conditions the policies of the cooperative agents based on a central state. It has been shown that in cases with large numbers of redundant agents these methods become less effective. In a more general case, there is lik…

    Submitted 28 March, 2023; originally announced April 2023.

    Comments: Published at the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023). 2 Pages, 1 Figure

    ACM Class: I.2.11; I.2.0; A.0

  27. arXiv:2303.12642  [pdf]

    cs.AI cs.CY cs.LG

    Democratising AI: Multiple Meanings, Goals, and Methods

    Authors: Elizabeth Seger, Aviv Ovadya, Ben Garfinkel, Divya Siddarth, Allan Dafoe

    Abstract: Numerous parties are calling for the democratisation of AI, but the phrase is used to refer to a variety of goals, the pursuit of which sometimes conflict. This paper identifies four kinds of AI democratisation that are commonly discussed: (1) the democratisation of AI use, (2) the democratisation of AI development, (3) the democratisation of AI profits, and (4) the democratisation of AI governanc…

    Submitted 7 August, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: V2 Changed second author affiliation; added citation to section 5.2; edit to author contribution statement; V3 camera ready version for conference proceedings. Minor content changes in response to reviewer comments

  28. arXiv:2303.11074  [pdf, ps, other]

    cs.CY

    Generative AI and the Digital Commons

    Authors: Saffron Huang, Divya Siddarth

    Abstract: Many generative foundation models (or GFMs) are trained on publicly available data and use public infrastructure, but 1) may degrade the "digital commons" that they depend on, and 2) do not have processes in place to return value captured to data producers and stakeholders. Existing conceptions of data rights and protection (focusing largely on individually-owned data and associated privacy concer…

    Submitted 20 March, 2023; originally announced March 2023.

  29. arXiv:2212.04554  [pdf, other]

    cs.RO

    Task-Directed Exploration in Continuous POMDPs for Robotic Manipulation of Articulated Objects

    Authors: Aidan Curtis, Leslie Kaelbling, Siddarth Jain

    Abstract: Representing and reasoning about uncertainty is crucial for autonomous agents acting in partially observable environments with noisy sensors. Partially observable Markov decision processes (POMDPs) serve as a general framework for representing problems in which uncertainty is an important factor. Online sample-based POMDP methods have emerged as efficient approaches to solving large POMDPs and hav…

    Submitted 8 December, 2022; originally announced December 2022.

  30. arXiv:2212.01434  [pdf, other]

    cs.RO cs.AI

    Generalizable Human-Robot Collaborative Assembly Using Imitation Learning and Force Control

    Authors: Devesh K. Jha, Siddarth Jain, Diego Romeres, William Yerazunis, Daniel Nikovski

    Abstract: Robots have been steadily increasing their presence in our daily lives, where they can work along with humans to provide assistance in various tasks on industry floors, in offices, and in homes. Automated assembly is one of the key applications of robots, and the next generation assembly systems could become much more efficient by creating collaborative human-robot systems. However, although colla…

    Submitted 2 December, 2022; originally announced December 2022.

  31. arXiv:2210.16871  [pdf, other]

    eess.AS cs.SD

    Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models

    Authors: Sathvik Udupa, Siddarth C, Prasanta Kumar Ghosh

    Abstract: In this work, we investigate the effectiveness of pretrained Self-Supervised Learning (SSL) features for learning the mapping for acoustic to articulatory inversion (AAI). Signal processing-based acoustic features such as MFCCs have been predominantly used for the AAI task with deep neural networks. With SSL features working well for various other speech tasks such as speech recognition, emotion c…

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Submitted to ICASSP 2023

  32. arXiv:2209.10485  [pdf, other]

    cs.LG cs.AI cs.GL cs.MA

    Towards a Standardised Performance Evaluation Protocol for Cooperative MARL

    Authors: Rihab Gorsane, Omayma Mahjoub, Ruan de Kock, Roland Dubb, Siddarth Singh, Arnu Pretorius

    Abstract: Multi-agent reinforcement learning (MARL) has emerged as a useful approach to solving decentralised decision-making problems at scale. Research in the field has been growing steadily with many breakthrough algorithms proposed in recent years. In this work, we take a closer look at this rapid development with a focus on evaluation methodologies employed across a large body of research in cooperativ…

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Published at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). Website: see https://sites.google.com/view/marl-standard-protocol . 43 Pages, 21 Figures, 8 Tables

    ACM Class: I.2.11; I.2.0; A.1

  33. arXiv:2209.01320  [pdf, other]

    cs.CV

    Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement

    Authors: Siddarth Ravichandran, Ondřej Texler, Dimitar Dinev, Hyun Jae Kang

    Abstract: Over the last few decades, many aspects of human life have been enhanced with virtual domains, from the advent of digital assistants such as Amazon's Alexa and Apple's Siri to the latest metaverse efforts of the rebranded Meta. These trends underscore the importance of generating photorealistic visual depictions of humans. This has led to the rapid growth of so-called deepfake and talking-head gen…

    Submitted 23 March, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

  34. MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant Systems for Machine Learning

    Authors: Baolin Li, Tirthak Patel, Siddarth Samsi, Vijay Gadepally, Devesh Tiwari

    Abstract: GPU technology has been improving at an expedited pace in terms of size and performance, empowering HPC and AI/ML researchers to advance the scientific discovery process. However, this also leads to inefficient resource usage, as most GPU workloads, including complicated AI/ML models, are not able to utilize the GPU resources to their fullest extent -- encouraging support for GPU multi-tenancy. We…

    Submitted 6 October, 2022; v1 submitted 23 July, 2022; originally announced July 2022.

  35. arXiv:2206.10786  [pdf, other]

    cs.LG cs.AI

    Generative Pretraining for Black-Box Optimization

    Authors: Siddarth Krishnamoorthy, Satvik Mehul Mashkaria, Aditya Grover

    Abstract: Many problems in science and engineering involve optimizing an expensive black-box function over a high-dimensional space. For such black-box optimization (BBO) problems, we typically assume a small budget for online function evaluations, but also often have access to a fixed, offline dataset for pretraining. Prior approaches seek to utilize the offline data to approximate the function or its inve…

    Submitted 21 August, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: International Conference for Machine Learning 2023; NeurIPS Workshop for Foundational Models for Decision Making (Oral) 2022

  36. arXiv:2204.10447  [pdf, other]

    cs.RO

    Design of Adaptive Compliance Controllers for Safe Robotic Assembly

    Authors: Devesh K. Jha, Diego Romeres, Siddarth Jain, William Yerazunis, Daniel Nikovski

    Abstract: Insertion operations are a critical element of most robotic assembly operations, and peg-in-hole (PiH) insertion is one of the most widely studied tasks in the industrial and academic manipulation communities. PiH insertion is in fact an entire class of problems, where the complexity of the problem can depend on the type of misalignment and contact formation during an insertion attempt. In this pap…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: 8 pages, 10 figures

  37. arXiv:2203.15155  [pdf, other]

    cs.RO cs.AI cs.CV

    Learning to Synthesize Volumetric Meshes from Vision-based Tactile Imprints

    Authors: Xinghao Zhu, Siddarth Jain, Masayoshi Tomizuka, Jeroen van Baar

    Abstract: Vision-based tactile sensors typically utilize a deformable elastomer and a camera mounted above to provide high-resolution image observations of contacts. Obtaining accurate volumetric meshes for the deformed elastomer can provide direct contact information and benefit robotic grasping and manipulation. This paper focuses on learning to synthesize the volumetric mesh of the elastomer based on the…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: To appear in the Proceedings of the IEEE International Conference on Robotics and Automation (ICRA 2022), Philadelphia (PA), USA

  38. arXiv:2203.04563  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    MLNav: Learning to Safely Navigate on Martian Terrains

    Authors: Shreyansh Daftry, Neil Abcouwer, Tyler Del Sesto, Siddarth Venkatraman, Jialin Song, Lucas Igel, Amos Byon, Ugo Rosolia, Yisong Yue, Masahiro Ono

    Abstract: We present MLNav, a learning-enhanced path planning framework for safety-critical and resource-limited systems operating in complex environments, such as rovers navigating on Mars. MLNav makes judicious use of machine learning to enhance the efficiency of path planning while fully respecting safety constraints. In particular, the dominant computational cost in such safety-critical settings is runn…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: IEEE Robotics and Automation Letters (RA-L) and ICRA 2022

  39. Unconditionally secure digital signatures implemented in an 8-user quantum network

    Authors: Yoann Pelet, Ittoop Vergheese Puthoor, Natarajan Venkatachalam, Sören Wengerowsky, Martin Lončarić, Sebastian Philipp Neumann, Bo Liu, Željko Samec, Mario Stipčević, Rupert Ursin, Erika Andersson, John G. Rarity, Djeylan Aktas, Siddarth Koduru Joshi

    Abstract: The ability to know and verifiably demonstrate the origins of messages can often be as important as encrypting the message itself. Here we present an experimental demonstration of an unconditionally secure digital signature (USS) protocol implemented for the first time, to the best of our knowledge, on a fully connected quantum network without trusted nodes. Our USS protocol is secure against forg…

    Submitted 10 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Preprint, 9 pages, 7 figures, 1 table

  40. arXiv:2202.03645  [pdf, other]

    cs.LG

    NxtPost: User to Post Recommendations in Facebook Groups

    Authors: Kaushik Rangadurai, Yiqun Liu, Siddarth Malreddy, Xiaoyi Liu, Piyush Maheshwari, Vishwanath Sangale, Fedor Borisyuk

    Abstract: In this paper, we present NxtPost, a deployed user-to-post content-based sequential recommender system for Facebook Groups. Inspired by recent advances in NLP, we have adapted a Transformer-based model to the domain of sequential recommendation. We explore causal masked multi-head attention that optimizes both short and long-term user interests. From a user's past activities validated by defined s…

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: 9 pages

  41. arXiv:2110.04441  [pdf, other]

    cs.AI cs.CL cs.RO

    Natural Language for Human-Robot Collaboration: Problems Beyond Language Grounding

    Authors: Seth Pate, Wei Xu, Ziyi Yang, Maxwell Love, Siddarth Ganguri, Lawson L. S. Wong

    Abstract: To enable robots to instruct humans in collaborations, we identify several aspects of language processing that are not commonly studied in this context. These include location, planning, and generation. We suggest evaluations for each task, offer baselines for simple methods, and close by discussing challenges and opportunities in studying language for collaboration.

    Submitted 8 October, 2021; originally announced October 2021.

    Comments: 5 pages, 2 figures, Presented at AI-HRI symposium as part of AAAI-FSS 2021 (arXiv:2109.10836)

    Report number: AIHRI/2021/38

  42. arXiv:2109.00923  [pdf, other]

    econ.GN cs.GT cs.LG

    Auctions and Peer Prediction for Academic Peer Review

    Authors: Siddarth Srinivasan, Jamie Morgenstern

    Abstract: Peer reviewed publications are considered the gold standard in certifying and disseminating ideas that a research community considers valuable. However, we identify two major drawbacks of the current system: (1) the overwhelming demand for reviewers due to a large volume of submissions, and (2) the lack of incentives for reviewers to participate and expend the necessary effort to provide high-qual…

    Submitted 10 May, 2023; v1 submitted 27 August, 2021; originally announced September 2021.

  43. arXiv:2108.13865  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

    Authors: Anoop Cherian, Goncalo Dias Pais, Siddarth Jain, Tim K. Marks, Alan Sullivan

    Abstract: In this paper, we present InSeGAN, an unsupervised 3D generative adversarial network (GAN) for segmenting (nearly) identical instances of rigid objects in depth images. Using an analysis-by-synthesis approach, we design a novel GAN architecture to synthesize a multiple-instance depth image with independent control over each instance. InSeGAN takes in a set of code vectors (e.g., random noise vecto…

    Submitted 28 January, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted at ICCV 2021. Code & data @ https://www.merl.com/research/license/InSeGAN

  44. arXiv:2105.13515  [pdf, ps, other]

    cs.CY

    Vaccine Credential Technology Principles

    Authors: Divya Siddarth, Vi Hart, Bethan Cantrell, Kristina Yasuda, Josh Mandel, Karen Easterbrook

    Abstract: The historically rapid development of effective COVID-19 vaccines has policymakers facing evergreen public health questions regarding vaccination records and verification. Governments and institutions around the world are already taking action on digital vaccine certificates, including guidance and recommendations from the European Commission, the WHO, and the Biden Administration. These could be…

    Submitted 27 May, 2021; originally announced May 2021.

  45. arXiv:2103.08684  [pdf, other]

    cs.RO

    Robotics During a Pandemic: The 2020 NSF CPS Virtual Challenge -- SoilScope, Mars Edition

    Authors: Darwin Mick, K. Srikar Siddarth, Swastik Nandan, Harish Anand, Stephen A. Rees, Jnaneshwar Das

    Abstract: Remote sample recovery is a rapidly evolving application of Small Unmanned Aircraft Systems (sUAS) for planetary sciences and space exploration. Development of cyber-physical systems (CPS) for autonomous deployment and recovery of sensor probes for sample caching is already in progress with NASA's MARS 2020 mission. To challenge student teams to develop autonomy for sample recovery settings, the 2…

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 7 pages, Submitted to IROS

  46. arXiv:2012.15565  [pdf, other]

    cs.LG cs.CV

    Searching a Raw Video Database using Natural Language Queries

    Authors: Sriram Krishna, Siddarth Vinay, Srinivas K S

    Abstract: The number of videos being produced and consequently stored in databases for video streaming platforms has been increasing exponentially over time. This vast database should be easily index-able to find the requisite clip or video to match the given search specification, preferably in the form of a textual query. This work aims to provide an end-to-end pipeline to search a video database with a vo…

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 6 pages, 12 figures, to appear in the proceedings of the First International Conference on Advances in Electrical, Computing, Communications and Sustainable Technologies (ICAECT 2021)

  47. arXiv:2011.09480  [pdf, other]

    quant-ph cs.CR cs.ET

    Experimental implementation of secure anonymous protocols on an eight-user quantum network

    Authors: Zixin Huang, Siddarth Koduru Joshi, Djeylan Aktas, Cosmo Lupo, Armanda O. Quintavalle, Natarajan Venkatachalam, Sören Wengerowsky, Martin Lončarić, Sebastian Philipp Neumann, Bo Liu, Željko Samec, Laurent Kling, Mario Stipčević, Rupert Ursin, John G. Rarity

    Abstract: Anonymity in networked communication is vital for many privacy-preserving tasks. Secure key distribution alone is insufficient for high-security communications, often knowing who transmits a message to whom and when must also be kept hidden from an adversary. Here we experimentally demonstrate 5 information-theoretically secure anonymity protocols on an 8 user city-wide quantum network using polar…

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: 11 pages, 4 figures, 1 table, experimental work. ZH and SKJ contributed equally to this work and are joint first authors

  48. arXiv:2011.06022  [pdf, other]

    cs.RO cs.LG

    Machine Learning Based Path Planning for Improved Rover Navigation (Pre-Print Version)

    Authors: Neil Abcouwer, Shreyansh Daftry, Siddarth Venkatraman, Tyler del Sesto, Olivier Toupet, Ravi Lanka, Jialin Song, Yisong Yue, Masahiro Ono

    Abstract: Enhanced AutoNav (ENav), the baseline surface navigation software for NASA's Perseverance rover, sorts a list of candidate paths for the rover to traverse, then uses the Approximate Clearance Evaluation (ACE) algorithm to evaluate whether the most highly ranked paths are safe. ACE is crucial for maintaining the safety of the rover, but is computationally expensive. If the most promising candidates…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 9 pages, 5 figures, Pre-Print, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

    ACM Class: I.2.6; I.2.9; I.2.8

  49. arXiv:2010.15639  [pdf, other]

    stat.ML cs.LG

    Teaching a GAN What Not to Learn

    Authors: Siddarth Asokan, Chandra Sekhar Seelamantula

    Abstract: Generative adversarial networks (GANs) were originally envisioned as unsupervised generative models that learn to follow a target distribution. Variants such as conditional GANs, auxiliary-classifier GANs (ACGANs) project GANs on to supervised and semi-supervised learning frameworks by providing labelled data and using multi-class discriminators. In this paper, we approach the supervised GAN probl…

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: Neural Information Processing Systems 2020

  50. arXiv:2010.10653  [pdf, other]

    cs.LG quant-ph

    Quantum Tensor Networks, Stochastic Processes, and Weighted Automata

    Authors: Siddarth Srinivasan, Sandesh Adhikary, Jacob Miller, Guillaume Rabusseau, Byron Boots

    Abstract: Modeling joint probability distributions over sequences has been studied from many perspectives. The physics community developed matrix product states, a tensor-train decomposition for probabilistic modeling, motivated by the need to tractably model many-body systems. But similar models have also been studied in the stochastic processes and weighted automata literature, with little work on how the…

    Submitted 20 October, 2020; originally announced October 2020.