Skip to main content

Showing 101–150 of 717 results for author: Jain, A

.
  1. arXiv:2309.01973  [pdf, other

    cs.LG cs.AI cs.IT stat.ML

    Linear Regression using Heterogeneous Data Batches

    Authors: Ayush Jain, Rajat Sen, Weihao Kong, Abhimanyu Das, Alon Orlitsky

    Abstract: In many learning applications, data are collected from multiple sources, each providing a \emph{batch} of samples that by itself is insufficient to learn its input-output relationship. A common approach assumes that the sources fall in one of several unknown subgroups, each with an unknown input distribution and input-output relationship. We consider one of this setup's most fundamental and import… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  2. arXiv:2309.00511  [pdf, ps, other

    hep-th cond-mat.stat-mech hep-ph nucl-th physics.flu-dyn

    Schwinger-Keldysh effective field theory for stable and causal relativistic hydrodynamics

    Authors: Akash Jain, Pavel Kovtun

    Abstract: We construct stable and causal effective field theories (EFTs) for describing statistical fluctuations in relativistic diffusion and relativistic hydrodynamics. These EFTs are fully non-linear, including couplings to background sources, and enable us to compute n-point time-ordered correlation functions including the effects of statistical fluctuations. The EFTs we construct are inspired by the Ma… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 47+1 pages

  3. arXiv:2308.14920  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Matbench Discovery -- A framework to evaluate machine learning crystal stability predictions

    Authors: Janosh Riebesell, Rhys E. A. Goodall, Philipp Benner, Yuan Chiang, Bowen Deng, Alpha A. Lee, Anubhav Jain, Kristin A. Persson

    Abstract: Matbench Discovery simulates the deployment of machine learning (ML) energy models in a high-throughput search for stable inorganic crystals. We address the disconnect between (i) thermodynamic stability and formation energy and (ii) in-domain vs out-of-distribution performance. Alongside this paper, we publish a Python package to aid with future model submissions and a growing online leaderboard… ▽ More

    Submitted 4 February, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 31 pages, 18 figures, 4 tables

  4. arXiv:2308.10658  [pdf, other

    cs.CV

    Learning Clothing and Pose Invariant 3D Shape Representation for Long-Term Person Re-Identification

    Authors: Feng Liu, Minchul Kim, ZiAng Gu, Anil Jain, Xiaoming Liu

    Abstract: Long-Term Person Re-Identification (LT-ReID) has become increasingly crucial in computer vision and biometrics. In this work, we aim to extend LT-ReID beyond pedestrian recognition to include a wider range of real-world human activities while still accounting for cloth-changing scenarios over large time gaps. This setting poses additional challenges due to the geometric misalignment and appearance… ▽ More

    Submitted 21 September, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: 10 pages, 7 figures, accepted by ICCV 2023

  5. arXiv:2308.08825  [pdf, ps, other

    cs.LG eess.SP

    Controlling Federated Learning for Covertness

    Authors: Adit Jain, Vikram Krishnamurthy

    Abstract: A learner aims to minimize a function $f$ by repeatedly querying a distributed oracle that provides noisy gradient evaluations. At the same time, the learner seeks to hide $\arg\min f$ from a malicious eavesdropper that observes the learner's queries. This paper considers the problem of \textit{covert} or \textit{learner-private} optimization, where the learner has to dynamically choose between le… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  6. arXiv:2308.08638  [pdf, other

    cs.CV cs.CY cs.LG

    Fair GANs through model rebalancing for extremely imbalanced class distributions

    Authors: Anubhav Jain, Nasir Memon, Julian Togelius

    Abstract: Deep generative models require large amounts of training data. This often poses a problem as the collection of datasets can be expensive and difficult, in particular datasets that are representative of the appropriate underlying distribution (e.g. demographic). This introduces biases in datasets which are further propagated in the models. We present an approach to construct an unbiased generative… ▽ More

    Submitted 21 December, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

  7. arXiv:2308.08515  [pdf

    physics.app-ph

    Investigation of Magnesium Silicate as an Effective Gate Dielectric for AlGaN/GaN Metal Oxide High Electron Mobility Transistors (MOSHEMT)

    Authors: Seshasainadh Pudi, Navneet Bhardwaj, Ritam Sarkar, V S Santhosh N Varma Bellamkonda, Umang Singh, Anshul Jain, Swagata Bhunia, Soumyadip Chatterjee, Apurba Laha

    Abstract: In this study, a 6 nm layer of Magnesium Silicate (Mg-Silicate) was deposited on AlGaN/GaN heterostructure by sputtering of multiple stacks of MgO and SiO$_{2}$, followed by rapid thermal annealing in a nitrogen (N$_{2}$) environment. The X-ray photoelectron spectroscopy (XPS) analysis confirmed the stoichiometric Mg-Silicate (MgSiO$_{3}$) after being annealed at a temperature of 850 $^\circ$C for… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

  8. arXiv:2308.07021  [pdf, ps, other

    math.CV

    Weighted Szegő Kernels on Planar Domains

    Authors: Aakanksha Jain, Kaushal Verma

    Abstract: We study properties of weighted Szegő and Garabedian kernels on planar domains. Motivated by the unweighted case as explained in Bell's work, the starting point is a weighted Kerzman-Stein formula that yields boundary smoothness of the weighted Szegő kernel. This provides information on the dependence of the weighted Szegő kernel as a function of the weight. When the weights are close to the const… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 26 pages

    MSC Class: 2020 MSC class. Primary: 30C40; Secondary: 31A99

  9. arXiv:2308.03180  [pdf, ps, other

    nucl-th nucl-ex

    Cluster radioactivity from trans-tin to superheavy region using an improved empirical formula

    Authors: G. Saxena, A. Jain

    Abstract: A simple relation $(aZ_{c} + b)(Z_{d}/Q)^{1/2} + (cZ_{c} + d)$ of estimation of the half-life of cluster emission is further improved for cluster and $α$-decays, separately, by incorporating isospin of parent nucleus as well as angular momentum taken away by the emitted particle. This improved version is not only found robust in producing experimental half-lives belonging to the trans-tin and tran… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: 12 pages, 2 Figures, 5 Tables, Accepted in European Physics Journal A

  10. arXiv:2308.01741  [pdf, other

    cs.CL

    Supply chain emission estimation using large language models

    Authors: Ayush Jain, Manikandan Padmanaban, Jagabondhu Hazra, Shantanu Godbole, Kommy Weldemariam

    Abstract: Large enterprises face a crucial imperative to achieve the Sustainable Development Goals (SDGs), especially goal 13, which focuses on combating climate change and its impacts. To mitigate the effects of climate change, reducing enterprise Scope 3 (supply chain emissions) is vital, as it accounts for more than 90\% of total emission inventories. However, tracking Scope 3 emissions proves challengin… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  11. arXiv:2308.00106  [pdf, other

    cs.DC

    Entropy Maximization in Sparse Matrix by Vector Multiplication ($\max_E SpMV$)

    Authors: Paolo D'Alberto, Abhishek Jain, Ismail Bustany, Henri Fraisse, Mansimran Benipal

    Abstract: The peak performance of any SpMV depends primarily on the available memory bandwidth and its effective use. GPUs, ASICs, and new FPGAs have higher and higher bandwidth; however, for large scale and highly sparse matrices, SpMV is still a hard problem because of its random access pattern and workload imbalance. Here, we show how to turn randomness to our advantage. We propose a matrix permutation p… ▽ More

    Submitted 24 July, 2023; originally announced August 2023.

    Comments: 26 pages

  12. arXiv:2307.14744  [pdf, other

    cs.DC cs.DB

    Wait-Free Updates and Range Search using Uruv

    Authors: Gaurav Bhardwaj, Abhay Jain, Bapi Chatterjee, Sathya Peri

    Abstract: CRUD operations, along with range queries make a highly useful abstract data type (ADT), employed by many dynamic analytics tasks. Despite its wide applications, to our knowledge, no fully wait-free data structure is known to support this ADT. In this paper, we introduce Uruv, a proactive linearizable and practical wait-free concurrent data structure that implements the ADT mentioned above. Stru… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  13. arXiv:2307.12549  [pdf, other

    cs.CY

    Estimating Time to Clear Pendency of Cases in High Courts in India using Linear Regression

    Authors: Kshitiz Verma, Anshu Musaddi, Ansh Mittal, Anshul Jain

    Abstract: Indian Judiciary is suffering from burden of millions of cases that are lying pending in its courts at all the levels. The High Court National Judicial Data Grid (HC-NJDG) indexes all the cases pending in the high courts and publishes the data publicly. In this paper, we analyze the data that we have collected from the HC-NJDG portal on 229 randomly chosen days between August 31, 2017 to March 22,… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 12 pages, 9 figures, JURISIN 2022. arXiv admin note: text overlap with arXiv:2307.10615

  14. arXiv:2307.10231  [pdf

    cs.AI cs.LG

    Automated Knowledge Modeling for Cancer Clinical Practice Guidelines

    Authors: Pralaypati Ta, Bhumika Gupta, Arihant Jain, Sneha Sree C, Arunima Sarkar, Keerthi Ram, Mohanasankar Sivaprakasam

    Abstract: Clinical Practice Guidelines (CPGs) for cancer diseases evolve rapidly due to new evidence generated by active research. Currently, CPGs are primarily published in a document format that is ill-suited for managing this develo** knowledge. A knowledge model of the guidelines document suitable for programmatic interaction is required. This work proposes an automated method for extraction of knowle… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

  15. arXiv:2307.06930  [pdf, other

    cs.CV cs.CL

    mBLIP: Efficient Bootstrap** of Multilingual Vision-LLMs

    Authors: Gregor Geigle, Abhay Jain, Radu Timofte, Goran Glavaš

    Abstract: Modular vision-language models (Vision-LLMs) align pretrained image encoders with (frozen) large language models (LLMs) and post-hoc condition LLMs to `understand' the image input. With the abundance of readily available high-quality English image-text data as well as strong monolingual English LLMs, the research focus has been on English-only Vision-LLMs. Multilingual vision-language models are s… ▽ More

    Submitted 20 June, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: ALVR Workshop 2024

  16. arXiv:2306.17206  [pdf, other

    cs.CV

    FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude

    Authors: Feng Liu, Ryan Ashbaugh, Nicholas Chimitt, Najmul Hassan, Ali Hassani, Ajay Jaiswal, Minchul Kim, Zhiyuan Mao, Christopher Perry, Zhiyuan Ren, Yiyang Su, Pegah Varghaei, Kai Wang, Xingguang Zhang, Stanley Chan, Arun Ross, Humphrey Shi, Zhangyang Wang, Anil Jain, Xiaoming Liu

    Abstract: Whole-body biometric recognition is an important area of research due to its vast applications in law enforcement, border security, and surveillance. This paper presents the end-to-end design, development and evaluation of FarSight, an innovative software system designed for whole-body (fusion of face, gait and body shape) biometric recognition. FarSight accepts videos from elevated platforms and… ▽ More

    Submitted 6 September, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 11 pages, 7 figures, accepted in WACV 2024

  17. arXiv:2306.15917  [pdf, other

    cs.CL cs.IR

    Confidence-Calibrated Ensemble Dense Phrase Retrieval

    Authors: William Yang, Noah Bergam, Arnav Jain, Nima Sheikhoslami

    Abstract: In this paper, we consider the extent to which the transformer-based Dense Passage Retrieval (DPR) algorithm, developed by (Karpukhin et. al. 2020), can be optimized without further pre-training. Our method involves two particular insights: we apply the DPR context encoder at various phrase lengths (e.g. one-sentence versus five-sentence segments), and we take a confidence-calibrated ensemble pred… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  18. arXiv:2306.14808  [pdf, other

    cs.LG

    Maximum State Entropy Exploration using Predecessor and Successor Representations

    Authors: Arnav Kumar Jain, Lucas Lehnert, Irina Rish, Glen Berseth

    Abstract: Animals have a developed ability to explore that aids them in important tasks such as locating food, exploring for shelter, and finding misplaced items. These exploration skills necessarily track where they have been so that they can plan for finding items with relative efficiency. Contemporary exploration algorithms often learn a less efficient exploration strategy because they either condition o… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  19. Decay properties of undetected superheavy nuclei with Z>110

    Authors: A. Jain, P. K. Sharma, S. K. Jain, Dashty T. Akrawy, G. Saxena

    Abstract: A comprehensive study of favoured and unfavoured $α$-decay, cluster decay, weak-decay along with spontaneous fission in undetected superheavy nuclei within the range for proton number 111$\leq$Z$\leq$118 and neutron number 161$\leq$N$\leq$192 is performed. Half-lives for various mentioned decays are estimated with good accuracy on the basis of NUBASE2020 and are found in excellent match with the k… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 24 pages, 6 figures, 5 tables, Accepted in Physica Scripta

  20. arXiv:2306.10717  [pdf, other

    cs.HC

    A neuro-symbolic approach for multimodal reference expression comprehension

    Authors: Aman Jain, Anirudh Reddy Kondapally, Kentaro Yamada, Hitomi Yanaka

    Abstract: Human-Machine Interaction (HMI) systems have gained huge interest in recent years, with reference expression comprehension being one of the main challenges. Traditionally human-machine interaction has been mostly limited to speech and visual modalities. However, to allow for more freedom in interaction, recent works have proposed the integration of additional modalities, such as gestures in HMI sy… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Appeared in the 37th Annual Conference of the Japanese Society for Artificial Intelligence, 2023

  21. ReactGenie: A Development Framework for Complex Multimodal Interactions Using Large Language Models

    Authors: Jackie Junrui Yang, Yingtian Shi, Yuhan Zhang, Karina Li, Daniel Wan Rosli, Anisha Jain, Shuning Zhang, Tianshi Li, James A. Landay, Monica S. Lam

    Abstract: By combining voice and touch interactions, multimodal interfaces can surpass the efficiency of either modality alone. Traditional multimodal frameworks require laborious developer work to support rich multimodal commands where the user's multimodal command involves possibly exponential combinations of actions/function invocations. This paper presents ReactGenie, a programming framework that better… ▽ More

    Submitted 2 May, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  22. arXiv:2306.09247  [pdf, other

    cs.CR cs.AI cs.LG

    ATLAS: Automatically Detecting Discrepancies Between Privacy Policies and Privacy Labels

    Authors: Akshath Jain, David Rodriguez, Jose M. del Alamo, Norman Sadeh

    Abstract: Privacy policies are long, complex documents that end-users seldom read. Privacy labels aim to ameliorate these issues by providing succinct summaries of salient data practices. In December 2020, Apple began requiring that app developers submit privacy labels describing their apps' data practices. Yet, research suggests that app developers often struggle to do so. In this paper, we automatically i… ▽ More

    Submitted 24 May, 2023; originally announced June 2023.

    Comments: 14 pages, 13 figures

  23. arXiv:2306.04597  [pdf, other

    cs.CL cs.LG

    Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions

    Authors: Himanshu Thakur, Atishay Jain, Praneetha Vaddamanu, Paul Pu Liang, Louis-Philippe Morency

    Abstract: Societal biases present in pre-trained large language models are a critical issue as these models have been shown to propagate biases in countless downstream applications, rendering them unfair towards specific groups of people. Since large-scale retraining of these models from scratch is both time and compute-expensive, a variety of approaches have been previously proposed that de-bias a pre-trai… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023 Main Conference

  24. arXiv:2306.02617  [pdf, other

    cs.LG

    Permutation Decision Trees

    Authors: Harikrishnan N B, Arham Jain, Nithin Nagaraj

    Abstract: Decision Tree is a well understood Machine Learning model that is based on minimizing impurities in the internal nodes. The most common impurity measures are Shannon entropy and Gini impurity. These impurity measures are insensitive to the order of training data and hence the final tree obtained is invariant to any permutation of the data. This is a limitation in terms of modeling when there are t… ▽ More

    Submitted 31 May, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 15 pages, 8 figures

  25. arXiv:2306.00942  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Train Offline, Test Online: A Real Robot Learning Benchmark

    Authors: Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta

    Abstract: Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data. We take on these challenges via a new benchmark: Train Offline, Test Online (TOTO). TOTO provides remote users with access to shared robotic hardware for evaluating methods… ▽ More

    Submitted 30 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICRA 2023

  26. arXiv:2306.00272  [pdf, other

    cs.CV

    Accelerated Fingerprint Enhancement: A GPU-Optimized Mixed Architecture Approach

    Authors: André Brasil Vieira Wyzykowski, Anil K. Jain

    Abstract: This document presents a preliminary approach to latent fingerprint enhancement, fundamentally designed around a mixed Unet architecture. It combines the capabilities of the Resnet-101 network and Unet encoder, aiming to form a potentially powerful composite. This combination, enhanced with attention mechanisms and forward skip connections, is intended to optimize the enhancement of ridge and minu… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  27. arXiv:2306.00231  [pdf, other

    cs.CV

    A Universal Latent Fingerprint Enhancer Using Transformers

    Authors: Andre Brasil Vieira Wyzykowski, Anil K. Jain

    Abstract: Forensic science heavily relies on analyzing latent fingerprints, which are crucial for criminal investigations. However, various challenges, such as background noise, overlap** prints, and contamination, make the identification process difficult. Moreover, limited access to real crime scene and laboratory-generated databases hinders the development of efficient recognition algorithms. This stud… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  28. arXiv:2305.18341  [pdf, other

    cs.PL cs.AI cs.LG

    Coarse-Tuning Models of Code with Reinforcement Learning Feedback

    Authors: Abhinav Jain, Chima Adiole, Swarat Chaudhuri, Thomas Reps, Chris Jermaine

    Abstract: Large Language Models (LLMs) pre-trained on code have recently emerged as the dominant approach to program synthesis. However, these models are trained using next-token prediction, which ignores the syntax and semantics of code. We propose RLCF, that further trains a pre-trained LLM via reinforcement learning, using feedback from a grounding function that scores the quality of the code. The ground… ▽ More

    Submitted 23 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 23 pages

  29. arXiv:2305.14343  [pdf, other

    cs.LG cs.AI cs.CV

    Video Prediction Models as Rewards for Reinforcement Learning

    Authors: Alejandro Escontrela, Ademi Adeniji, Wilson Yan, Ajay Jain, Xue Bin Peng, Ken Goldberg, Youngwoon Lee, Danijar Hafner, Pieter Abbeel

    Abstract: Specifying reward signals that allow agents to learn complex behaviors is a long-standing challenge in reinforcement learning. A promising approach is to extract preferences for behaviors from unlabeled videos, which are widely available on the internet. We present Video Prediction Rewards (VIPER), an algorithm that leverages pretrained video prediction models as action-free reward signals for rei… ▽ More

    Submitted 30 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 22 pages, 18 figures, 4 tables. under review

  30. arXiv:2305.07710  [pdf, other

    cs.CV

    Zero-shot racially balanced dataset generation using an existing biased StyleGAN2

    Authors: Anubhav Jain, Nasir Memon, Julian Togelius

    Abstract: Facial recognition systems have made significant strides thanks to data-heavy deep learning models, but these models rely on large privacy-sensitive datasets. Further, many of these datasets lack diversity in terms of ethnicity and demographics, which can lead to biased models that can have serious societal and security implications. To address these issues, we propose a methodology that leverages… ▽ More

    Submitted 18 September, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

  31. arXiv:2305.07602  [pdf, other

    cs.CV

    ViT Unified: Joint Fingerprint Recognition and Presentation Attack Detection

    Authors: Steven A. Grosz, Kanishka P. Wijewardena, Anil K. Jain

    Abstract: A secure fingerprint recognition system must contain both a presentation attack (i.e., spoof) detection and recognition module in order to protect users against unwanted access by malicious users. Traditionally, these tasks would be carried out by two independent systems; however, recent studies have demonstrated the potential to have one unified system architecture in order to reduce the computat… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  32. arXiv:2305.07552  [pdf, other

    cs.CV cs.AI cs.CY

    Dish detection in food platters: A framework for automated diet logging and nutrition management

    Authors: Mansi Goel, Shashank Dargar, Shounak Ghatak, Nidhi Verma, Pratik Chauhan, Anushka Gupta, Nikhila Vishnumolakala, Hareesh Amuru, Ekta Gambhir, Ronak Chhajed, Meenal Jain, Astha Jain, Samiksha Garg, Nitesh Narwade, Nikhilesh Verhwani, Abhuday Tiwari, Kirti Vashishtha, Ganesh Bagler

    Abstract: Diet is central to the epidemic of lifestyle disorders. Accurate and effortless diet logging is one of the significant bottlenecks for effective diet management and calorie restriction. Dish detection from food platters is a challenging problem due to a visually complex food layout. We present an end-to-end computational framework for diet management, from data compilation, annotation, and state-o… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 11 pages, 5 figures, 5 tables. Submitted to the 8th International Conference on Computer Vision & Image Processing (CVIP-2023)

    ACM Class: I.4.9; I.5.4; J.3

  33. arXiv:2305.05161  [pdf, other

    cs.CV

    Child Palm-ID: Contactless Palmprint Recognition for Children

    Authors: Akash Godbole, Steven A. Grosz, Anil K. Jain

    Abstract: Effective distribution of nutritional and healthcare aid for children, particularly infants and toddlers, in some of the least developed and most impoverished countries of the world, is a major problem due to the lack of reliable identification documents. Biometric authentication technology has been investigated to address child recognition in the absence of reliable ID documents. We present a mob… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  34. arXiv:2304.14999  [pdf, other

    cs.CL cs.AI

    Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs

    Authors: George Pu, Anirudh Jain, Jihan Yin, Russell Kaplan

    Abstract: As foundation models continue to exponentially scale in size, efficient methods of adaptation become increasingly critical. Parameter-efficient fine-tuning (PEFT), a recent class of techniques that require only modifying a small percentage of the model parameters, is currently the most popular method for adapting large language models (LLMs). Several PEFT techniques have recently been proposed wit… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Short paper, ICLR '23 Workshop on Understanding Foundation Models

  35. arXiv:2304.14391  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement

    Authors: Nikolaos Gkanatsios, Ayush Jain, Zhou Xian, Yunchu Zhang, Christopher Atkeson, Katerina Fragkiadaki

    Abstract: Language is compositional; an instruction can express multiple relation constraints to hold among objects in a scene that a robot is tasked to rearrange. Our focus in this work is an instructable scene-rearranging framework that generalizes to longer instructions and to spatial concept compositions never seen at training time. We propose to represent language-instructed spatial concepts with energ… ▽ More

    Submitted 23 January, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: First two authors contributed equally | RSS 2023

  36. arXiv:2304.13846  [pdf, other

    physics.app-ph cs.CL cs.IR

    Extracting Structured Seed-Mediated Gold Nanorod Growth Procedures from Literature with GPT-3

    Authors: Nicholas Walker, John Dagdelen, Kevin Cruse, Sanghoon Lee, Samuel Gleason, Alexander Dunn, Gerbrand Ceder, A. Paul Alivisatos, Kristin A. Persson, Anubhav Jain

    Abstract: Although gold nanorods have been the subject of much research, the pathways for controlling their shape and thereby their optical properties remain largely heuristically understood. Although it is apparent that the simultaneous presence of and interaction between various reagents during synthesis control these properties, computational and experimental approaches for exploring the synthesis space… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  37. arXiv:2304.13800  [pdf, other

    cs.CV cs.LG

    Latent Fingerprint Recognition: Fusion of Local and Global Embeddings

    Authors: Steven A. Grosz, Anil K. Jain

    Abstract: One of the most challenging problems in fingerprint recognition continues to be establishing the identity of a suspect associated with partial and smudgy fingerprints left at a crime scene (i.e., latent prints or fingermarks). Despite the success of fixed-length embeddings for rolled and slap fingerprint recognition, the features learned for latent fingerprint matching have mostly been limited to… ▽ More

    Submitted 7 September, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

  38. arXiv:2304.10799  [pdf, other

    math.OC

    A scalable solution for the extended multi-channel facility location problem

    Authors: Etika Agarwal, Karthik S. Gurumoorthy, Ankit Ajit Jain, Shantala Manchenahally

    Abstract: We study the extended version of the non-uniform, capacitated facility location problem with multiple fulfilment channels between the facilities and clients, each with their own channel capacities and service cost. Though the problem has been extensively studied in the literature, all the prior works assume a single channel of fulfilment, and the existing methods based on linear programming, prima… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  39. arXiv:2304.09852  [pdf, ps, other

    hep-th cond-mat.str-el

    Dipole superfluid hydrodynamics

    Authors: Akash Jain, Kristan Jensen, Ruochuan Liu, Eric Mefford

    Abstract: We construct a theory of hydrodynamic transport for systems with conserved dipole moment, U(1) charge, energy, and momentum. These models have been considered in the context of fractons, since their elementary and isolated charges are immobile by symmetry, and have two known translation-invariant gapless phases: a "p-wave dipole superfluid" phase where the dipole symmetry is spontaneously broken a… ▽ More

    Submitted 29 January, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 53 pages plus appendices; we have included a Mathematica notebook to the arXiv submission which computes dispersion relations and response functions; v2: fixed typos

  40. arXiv:2304.08769  [pdf, ps, other

    cs.LG cs.MA

    Cooperative Multi-Agent Reinforcement Learning for Inventory Management

    Authors: Madhav Khirwar, Karthik S. Gurumoorthy, Ankit Ajit Jain, Shantala Manchenahally

    Abstract: With Reinforcement Learning (RL) for inventory management (IM) being a nascent field of research, approaches tend to be limited to simple, linear environments with implementations that are minor modifications of off-the-shelf RL algorithms. Scaling these simplistic environments to a real-world supply chain comes with a few challenges such as: minimizing the computational requirements of the enviro… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 14 pages, 5 figures

  41. arXiv:2304.07060  [pdf, other

    cs.CV

    DCFace: Synthetic Face Generation with Dual Condition Diffusion Model

    Authors: Minchul Kim, Feng Liu, Anil Jain, Xiaoming Liu

    Abstract: Generating synthetic datasets for training face recognition models is challenging because dataset generation entails more than creating high fidelity images. It involves generating multiple images of same subjects under different factors (\textit{e.g.}, variations in pose, illumination, expression, aging and occlusion) which follows the real image conditional distribution. Previous works have stud… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: To appear in CVPR 2023

  42. arXiv:2304.06861  [pdf, other

    cs.CL cs.CY cs.LG

    Evaluation of Social Biases in Recent Large Pre-Trained Models

    Authors: Swapnil Sharma, Nikita Anand, Kranthi Kiran G. V., Alind Jain

    Abstract: Large pre-trained language models are widely used in the community. These models are usually trained on unmoderated and unfiltered data from open sources like the Internet. Due to this, biases that we see in platforms online which are a reflection of those in society are in turn captured and learned by these models. These models are deployed in applications that affect millions of people and their… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 Tables

  43. EEG Cortical Source Feature based Hand Kinematics Decoding using Residual CNN-LSTM Neural Network

    Authors: Anant Jain, Lalan Kumar

    Abstract: Motor kinematics decoding (MKD) using brain signal is essential to develop Brain-computer interface (BCI) system for rehabilitation or prosthesis devices. Surface electroencephalogram (EEG) signal has been widely utilized for MKD. However, kinematic decoding from cortical sources is sparsely explored. In this work, the feasibility of hand kinematics decoding using EEG cortical source signals has b… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Journal ref: 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Sydney, Australia, 2023

  44. arXiv:2303.18240  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?

    Authors: Arjun Majumdar, Karmesh Yadav, Sergio Arnaud, Yecheng Jason Ma, Claire Chen, Sneha Silwal, Aryan Jain, Vincent-Pierre Berges, Pieter Abbeel, Jitendra Malik, Dhruv Batra, Yixin Lin, Oleksandr Maksymets, Aravind Rajeswaran, Franziska Meier

    Abstract: We present the largest and most comprehensive empirical study of pre-trained visual representations (PVRs) or visual 'foundation models' for Embodied AI. First, we curate CortexBench, consisting of 17 different tasks spanning locomotion, navigation, dexterous, and mobile manipulation. Next, we systematically evaluate existing PVRs and find that none are universally dominant. To study the effect of… ▽ More

    Submitted 1 February, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Project website: https://eai-vc.github.io

  45. arXiv:2303.11960  [pdf, other

    cs.HC

    Preparing Unprepared Students For Future Learning

    Authors: Mark Abdelshiheed, Mehak Maniktala, Song Ju, Ayush Jain, Tiffany Barnes, Min Chi

    Abstract: Based on strategy-awareness (knowing which problem-solving strategy to use) and time-awareness (knowing when to use it), students are categorized into Rote (neither type of awareness), Dabbler (strategy-aware only) or Selective (both types of awareness). It was shown that Selective is often significantly more prepared for future learning than Rote and Dabbler (Abdelshiheed et al., 2020). In this w… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

  46. arXiv:2303.08595  [pdf, other

    cs.LG

    Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions

    Authors: Kaiqi Zhao, Animesh Jain, Ming Zhao

    Abstract: Pruning is a promising approach to compress deep learning models in order to deploy them on resource-constrained edge devices. However, many existing pruning solutions are based on unstructured pruning, which yields models that cannot efficiently run on commodity hardware; and they often require users to manually explore and tune the pruning process, which is time-consuming and often leads to sub-… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.10520

  47. Improved quantum error correction with randomized compiling

    Authors: Aditya Jain, Pavithran Iyer, Stephen D. Bartlett, Joseph Emerson

    Abstract: Current hardware for quantum computing suffers from high levels of noise, and so to achieve practical fault-tolerant quantum computing will require powerful and efficient methods to correct for errors in quantum circuits. Here, we explore the role and effectiveness of using noise tailoring techniques to improve the performance of error correcting codes. Noise tailoring methods such as randomized c… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 7 pages + 8 page appendix, 8 figures

    Journal ref: Phys. Rev. Research 5, 033049 (2023)

  48. arXiv:2303.06274  [pdf

    cs.CV cs.LG

    CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

    Authors: Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, **xi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu, Mohammad Yaqub, Marie-Claire Blache, Benoît Piégu, Bertrand Vernay , et al. (64 additional authors not shown)

    Abstract: Nuclear detection, segmentation and morphometric profiling are essential in hel** us further understand the relationship between histology and patient outcome. To drive innovation in this area, we setup a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of repro… ▽ More

    Submitted 14 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  49. arXiv:2303.05662  [pdf

    cond-mat.soft

    Single-Step Synthesis of Shape-Controlled Polymeric Particles using Initiated Chemical Vapor Deposition in Liquid Crystals

    Authors: Apoorva Jain, Soumyamouli Pal, Nicholas L. Abbott, Rong Yang

    Abstract: The ability to synthesize shape-controlled polymer particles will benefit a wide range of applications including targeted drug delivery and metamaterials with reconfigurable structures, but existing synthesis approaches are commonly multistep and limited to a narrow size/shape range. Using a novel single-step synthesis technique, a variety of shapes including nanospheres, hemispherical micro-domes… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 35 pages, 5 figures

  50. arXiv:2303.01598  [pdf, other

    cs.CV cs.LG

    A Meta-Learning Approach to Predicting Performance and Data Requirements

    Authors: Achin Jain, Gurumurthy Swaminathan, Paolo Favaro, Hao Yang, Avinash Ravichandran, Hrayr Harutyunyan, Alessandro Achille, Onkar Dabeer, Bernt Schiele, Ashwin Swaminathan, Stefano Soatto

    Abstract: We propose an approach to estimate the number of samples required for a model to reach a target performance. We find that the power law, the de facto principle to estimate model performance, leads to large error when using a small dataset (e.g., 5 samples per class) for extrapolation. This is because the log-performance error against the log-dataset size follows a nonlinear progression in the few-… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: CVPR 2023