Search | arXiv e-print repository

doi 10.22323/1.444.0858

The cosipy library: COSI's high-level analysis software

Authors: Israel Martinez-Castellanos, Savitri Gallego, Chien-You Huang, Chris Karwin, Carolyn Kierans, Jan Peter Lommler, Saurabh Mittal, Michela Negro, Eliza Neights, Sean N. Pike, Yong Sheng, Thomas Siegert, Hiroki Yoneda, Andreas Zoglauer, John A. Tomsick, Steven E. Boggs, Dieter Hartmann, Marco Ajello, Eric Burns, Chris Fryer, Alexander Lowell, Julien Malzac, Jarred Roberts, Pascal Saint-Hilaire, Albert Shih, et al. (50 additional authors not shown)

Abstract: The Compton Spectrometer and Imager (COSI) is a selected Small Explorer (SMEX) mission launching in 2027. It consists of a large field-of-view Compton telescope that will probe with increased sensitivity the under-explored MeV gamma-ray sky (0.2-5 MeV). We will present the current status of cosipy, a Python library that will perform spectral and polarization fits, image deconvolution, and all high-level analysis tasks required by COSI's broad science goals: uncovering the origin of the Galactic positrons, mapping the sites of Galactic nucleosynthesis, improving our models of the jet and emission mechanism of gamma-ray bursts (GRBs) and active galactic nuclei (AGNs), and detecting and localizing gravitational wave and neutrino sources. The cosipy library builds on the experience gained during the COSI balloon campaigns and will bring the analysis of data in the Compton regime to a modern open-source likelihood-based code, capable of performing coherent joint fits with other instruments using the Multi-Mission Maximum Likelihood framework (3ML). In this contribution, we will also discuss our plans to receive feedback from the community by having yearly software releases accompanied by publicly-available data challenges.

Submitted 22 August, 2023; originally announced August 2023.

Journal ref: Martinez, Israel. The cosipy library: COSI's high-level analysis software. PoS ICRC2023 (2023) 444-858

arXiv:2308.10943 [pdf, other]

VERTICO VII: Environmental quenching caused by suppression of molecular gas content and star formation efficiency in Virgo Cluster galaxies

Authors: Toby Brown, Ian D. Roberts, Mallory Thorp, Sara L. Ellison, Nikki Zabel, Christine D. Wilson, Yannick M. Bahé, Dhruv Bisaria, Alberto D. Bolatto, Alessandro Boselli, Aeree Chung, Luca Cortese, Barbara Catinella, Timothy A. Davis, María J. Jiménez-Donaire, Claudia D. P. Lagos, Bumhyun Lee, Laura C. Parker, Rory Smith, Kristine Spekkens, Adam R. H. Stevens, Vicente Villanueva, Adam B. Watts

Abstract: We study how environment regulates the star formation cycle of 33 Virgo Cluster satellite galaxies on 720 parsec scales. We present the first resolved star-forming main sequence for cluster galaxies, dividing the sample based on their global HI properties and comparing to a control sample of field galaxies. HI-poor cluster galaxies have reduced star formation rate (SFR) surface densities with respect to both HI-normal cluster and field galaxies (0.5 dex), suggesting that mechanisms regulating the global HI content are responsible for quenching local star formation. We demonstrate that the observed quenching in HI-poor galaxies is caused by environmental processes such as ram pressure stripping (RPS) simultaneously reducing molecular gas surface density and star formation efficiency (SFE), compared to regions in HI-normal systems (by 0.38 and 0.22 dex, respectively). We observe systematically elevated SFRs that are driven by increased molecular gas surface densities at fixed stellar mass surface density in the outskirts of early-stage RPS galaxies, while SFE remains unchanged with respect to the field sample. We quantify how RPS and starvation affect the star formation cycle of inner and outer galaxy discs as they are processed by the cluster. We show both are effective quenching mechanisms with the key difference being that RPS acts upon the galaxy outskirts while starvation regulates the star formation cycle throughout the disc, including within the truncation radius. For both processes, the quenching is caused by a simultaneous reduction in molecular gas surface densities and SFE at fixed stellar mass surface density.

Submitted 21 August, 2023; originally announced August 2023.

Comments: 17 pages, 1 table, 5 figures, accepted for publication in ApJ

arXiv:2308.09873 [pdf, other]

Skill Transformer: A Monolithic Policy for Mobile Manipulation

Authors: Xiaoyu Huang, Dhruv Batra, Akshara Rai, Andrew Szot

Abstract: We present Skill Transformer, an approach for solving long-horizon robotic tasks by combining conditional sequence modeling and skill modularity. Conditioned on egocentric and proprioceptive observations of a robot, Skill Transformer is trained end-to-end to predict both a high-level skill (e.g., navigation, picking, placing), and a whole-body low-level action (e.g., base and arm motion), using a transformer architecture and demonstration trajectories that solve the full task. It retains the composability and modularity of the overall task through a skill predictor module while reasoning about low-level actions and avoiding hand-off errors, common in modular approaches. We test Skill Transformer on an embodied rearrangement benchmark and find it performs robust task planning and low-level control in new scenarios, achieving a 2.5x higher success rate than baselines in hard rearrangement problems.

Submitted 18 August, 2023; originally announced August 2023.

arXiv:2308.08031 [pdf, other]

Company Similarity using Large Language Models

Authors: Dimitrios Vamvourellis, Máté Toth, Snigdha Bhagat, Dhruv Desai, Dhagash Mehta, Stefano Pasquali

Abstract: Identifying companies with similar profiles is a core task in finance with a wide range of applications in portfolio construction, asset pricing and risk attribution. When a rigorous definition of similarity is lacking, financial analysts usually resort to 'traditional' industry classifications such as the Global Industry Classification Standard (GICS), which assign a unique category to each company at different levels of granularity. Due to their discrete nature, though, GICS classifications do not allow for ranking companies in terms of similarity. In this paper, we explore the ability of pre-trained and finetuned large language models (LLMs) to learn company embeddings based on the business descriptions reported in SEC filings. We show that we can reproduce GICS classifications using the embeddings as features. We also benchmark these embeddings on various machine learning and financial metrics and conclude that the companies that are similar according to the embeddings are also similar in terms of financial performance metrics including return correlation.

Submitted 15 August, 2023; originally announced August 2023.

Comments: 8 pages, 2 figures, 2 tables

arXiv:2308.07354 [pdf, other]

Addressing the $r_{d}$ Tension using Late-Time Observational Measurements in a Novel Deceleration Parametrization

Authors: Himanshu Chaudhary, Dhruv Arora, Ujjal Debnath, G. Mustafa, S. K. Maurya

Abstract: This paper introduces a novel cosmological model aimed at probing the accelerated expansion of the late Universe through a unique parametrization of the deceleration parameter. We aim to constrain key cosmic parameters by integrating recent measurements of the Hubble parameter obtained from various observational methods, including cosmic chronometers, Type Ia Supernovae, Gamma-Ray Bursts (GRB), Quasars, and baryon acoustic oscillations (BAO) from recent galaxy surveys. With a redshift range spanning (0.106 < z < 2.33) and incorporating the latest Hubble constant measurement from Riess in 2022, our analysis yields optimal fit values for the Hubble parameter $H_{0}$ and sound horizon $r_{d}$. Notably, we uncover an inconsistency in $H_{0}$ values derived from late-time observational measurements, reflecting the well-known $H_{0}$ tension. In terms of $r_{d}$, while there is close agreement between Joint analysis and Joint analysis with R22, discrepancies arise upon gradual inclusion of BAO and BAO with R22 datasets. Our model demonstrates excellent fit to observed data and aligns well with the standard $Λ$CDM paradigm at higher redshifts. However, its most intriguing aspect lies in predicting a super-accelerated expansion in the distant future, in contrast to the de Sitter phase predicted by $Λ$CDM. Additionally, unique behaviors in the jerk parameter hint at novel dynamics beyond traditional cosmological models. Statefinder and $O_{m}$ Diagnostics tests were conducted, and comparison using the Akaike information criterion indicates neither model can be ruled out based on the latest observational measurements. These findings propose our cosmological model as a compelling alternative to $Λ$CDM, offering fresh insights into dark energy's nature and the cosmos' future.

Submitted 12 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

Comments: 10 figures

arXiv:2308.06882 [pdf, other]

Quantifying Outlierness of Funds from their Categories using Supervised Similarity

Authors: Dhruv Desai, Ashmita Dhiman, Tushar Sharma, Deepika Sharma, Dhagash Mehta, Stefano Pasquali

Abstract: Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. Here, we aim to quantify the effect of miscategorization of funds utilizing a machine learning based approach. We formulate the problem of miscategorization of funds as a distance-based outlier detection problem, where the outliers are the data-points that are far from the rest of the data-points in the given feature space. We implement and employ a Random Forest (RF) based method of distance metric learning, and compute the so-called class-wise outlier measures for each data-point to identify outliers in the data. We test our implementation on various publicly available data sets, and then apply it to mutual fund data. We show that there is a strong relationship between the outlier measures of the funds and their future returns and discuss the implications of our findings.

Submitted 13 August, 2023; originally announced August 2023.

Comments: 8 pages, 5 tables, 8 figures

arXiv:2308.05390 [pdf, other]

Product Review Image Ranking for Fashion E-commerce

Authors: Sangeet Jaiswal, Dhruv Patel, Sreekanth Vempati, Konduru Saiswaroop

Abstract: In a fashion e-commerce platform where customers can't physically examine the products on their own, being able to see other customers' text and image reviews of the product is critical while making purchase decisions. Given the high reliance on these reviews, over the years we have observed customers proactively sharing their reviews. With an increase in the coverage of User Generated Content (UGC), there has been a corresponding increase in the number of customer images. It is thus imperative to display the most relevant images on top as it may influence users' online shopping choices and behavior. In this paper, we propose a simple yet effective training procedure for ranking customer images. We created a dataset consisting of Myntra (A Major Indian Fashion e-commerce company) studio posts and highly engaged (upvotes/downvotes) UGC images as our starting point and used selected distortion techniques on the images of the above dataset to bring their quality at par with those of bad UGC images. We train our network to rank bad-quality images lower than high-quality ones. Our proposed method outperforms the baseline models on two metrics, namely correlation coefficient, and accuracy, by substantial margins.

Submitted 10 August, 2023; originally announced August 2023.

Comments: Accepted in Proceedings of ACM SIGIR Workshop on eCommerce (SIGIR eCom'22)

arXiv:2308.03882 [pdf, other]

Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations

Authors: Nirbhay Modhe, Qiaozi Gao, Ashwin Kalyan, Dhruv Batra, Govind Thattai, Gaurav Sukhatme

Abstract: Offline reinforcement learning (RL) methods strike a balance between exploration and exploitation by conservative value estimation -- penalizing values of unseen states and actions. Model-free methods penalize values at all unseen actions, while model-based methods are able to further exploit unseen states via model rollouts. However, such methods are handicapped in their ability to find unseen states far away from the available offline data due to two factors -- (a) very short rollout horizons in models due to cascading model errors, and (b) model rollouts originating solely from states observed in offline data. We relax the second assumption and present a novel unseen state augmentation strategy to allow exploitation of unseen states where the learned model and value estimates generalize. Our strategy finds unseen states by value-informed perturbations of seen states followed by filtering out states with epistemic uncertainty estimates too high (high error) or too low (too similar to seen data). We observe improved performance in several offline RL tasks and find that our augmentation strategy consistently leads to overall lower average dataset Q-value estimates i.e. more conservative Q-value estimates than a baseline.

Submitted 24 September, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

arXiv:2308.03664 [pdf, other]

Two-stage Early Prediction Framework of Remaining Useful Life for Lithium-ion Batteries

Authors: Dhruv Mittal, Hymalai Bello, Bo Zhou, Mayank Shekhar Jha, Sungho Suh, Paul Lukowicz

Abstract: Early prediction of remaining useful life (RUL) is crucial for effective battery management across various industries, ranging from household appliances to large-scale applications. Accurate RUL prediction improves the reliability and maintainability of battery technology. However, existing methods have limitations, including assumptions of data from the same sensors or distribution, foreknowledge of the end of life (EOL), and neglect to determine the first prediction cycle (FPC) to identify the start of the unhealthy stage. This paper proposes a novel method for RUL prediction of Lithium-ion batteries. The proposed framework comprises two stages: determining the FPC using a neural network-based model to divide the degradation data into distinct health states and predicting the degradation pattern after the FPC to estimate the remaining useful life as a percentage. Experimental results demonstrate that the proposed method outperforms conventional approaches in terms of RUL prediction. Furthermore, the proposed method shows promise for real-world scenarios, providing improved accuracy and applicability for battery management.

Submitted 7 August, 2023; originally announced August 2023.

Comments: Accepted at the 49th Annual Conference of the IEEE Industrial Electronics Society (IECON 2023)

arXiv:2308.03504 [pdf, other]

doi 10.1051/0004-6361/202347101

Three-temperature radiation hydrodynamics with PLUTO: Tests and applications to protoplanetary disks

Authors: Dhruv Muley, Julio David Melon Fuksman, Hubert Klahr

Abstract: In circumstellar disks around T Tauri stars, visible and near-infrared stellar irradiation is intercepted by dust at the disk's optical surface and reprocessed into thermal infrared; this subsequently undergoes radiative diffusion through the optically thick bulk of the disk. The gas component -- overwhelmingly dominant by mass, but contributing little to the opacity -- is heated primarily by gas-grain collisions. In hydrodynamical simulations, however, typical models for this heating process (local isothermality, $β$-cooling, two-temperature radiation hydrodynamics) incorporate simplifying assumptions that limit their ranges of validity. To build on these methods, we develop a "three-temperature" numerical scheme, which self-consistently models energy exchange between gas, dust, and radiation, as a part of the PLUTO radiation-hydrodynamics code. With a range of test problems in 0D, 1D, 2D, and 3D, we demonstrate the efficacy of our method, and make the case for its applicability to a wide range of problems in disk physics, including hydrodynamic instabilities and disk-planet interaction.

Submitted 7 August, 2023; originally announced August 2023.

Comments: Accepted to Astronomy and Astrophysics; 16 pages, 11 figures incl. Appendix. Comments and questions welcome

Journal ref: A&A 678, A162 (2023)

arXiv:2308.00119 [pdf, ps, other]

Overtaking Moving Obstacles with Digit: Path Following for Bipedal Robots via Model Predictive Contouring Control

Authors: Kunal S. Narkhede, Dhruv A. Thanki, Abhijeet M. Kulkarni, Ioannis Poulakakis

Abstract: Humanoid robots are expected to navigate in changing environments and perform a variety of tasks. Frequently, these tasks require the robot to make decisions online regarding the speed and precision of following a reference path. For example, a robot may want to decide to temporarily deviate from its path to overtake a slowly moving obstacle that shares the same path and is ahead. In this case, path following performance is compromised in favor of fast path traversal. Available global trajectory tracking approaches typically assume a given -- specified in advance -- time parametrization of the path and seek to minimize the norm of the Cartesian error. As a result, when the robot should be where on the path is fixed and temporary deviations from the path are strongly discouraged. Given a global path, this paper presents a Model Predictive Contouring Control (MPCC) approach to selecting footsteps that maximize path traversal while simultaneously allowing the robot to decide between faithful versus fast path following. The method is evaluated in high-fidelity simulations of the bipedal robot Digit in terms of tracking performance of curved paths under disturbances and is also applied to the case where Digit overtakes a moving obstacle.

Submitted 31 July, 2023; originally announced August 2023.

Comments: Accepted for publication in 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

arXiv:2307.14707 [pdf, other]

An Automata Theoretic Characterization of Weighted First-Order Logic

Authors: Dhruv Nevatia, Benjamin Monmege

Abstract: Since the 1970s with the work of McNaughton, Papert and Schützenberger, a regular language is known to be definable in the first-order logic if and only if its syntactic monoid is aperiodic. This algebraic characterisation of a fundamental logical fragment has been extended in the quantitative case by Droste and Gastin, dealing with polynomially ambiguous weighted automata and a restricted fragment of weighted first-order logic. In the quantitative setting, the full weighted first-order logic (without the restriction that Droste and Gastin use, about the quantifier alternation) is more powerful than weighted automata, and extensions of the automata with two-way navigation, and pebbles or nested capabilities have been introduced to deal with it. In this work, we characterise the fragment of these extended weighted automata that recognise exactly the full weighted first-order logic, under the condition that automata are polynomially ambiguous.

Submitted 27 July, 2023; originally announced July 2023.

arXiv:2307.14623 [pdf, other]

BubbleML: A Multi-Physics Dataset and Benchmarks for Machine Learning

Authors: Sheikh Md Shakeel Hassan, Arthur Feeney, Akash Dhruv, Jihoon Kim, Youngjoon Suh, Jaiyoung Ryu, Yoonjin Won, Aparna Chandramowlishwaran

Abstract: In the field of phase change phenomena, the lack of accessible and diverse datasets suitable for machine learning (ML) training poses a significant challenge. Existing experimental datasets are often restricted, with limited availability and sparse ground truth data, impeding our understanding of these complex multiphysics phenomena. To bridge this gap, we present the BubbleML Dataset (https://github.com/HPCForge/BubbleML), which leverages physics-driven simulations to provide accurate ground truth information for various boiling scenarios, encompassing nucleate pool boiling, flow boiling, and sub-cooled boiling. This extensive dataset covers a wide range of parameters, including varying gravity conditions, flow rates, sub-cooling levels, and wall superheat, comprising 79 simulations. BubbleML is validated against experimental observations and trends, establishing it as an invaluable resource for ML research. Furthermore, we showcase its potential to facilitate exploration of diverse downstream tasks by introducing two benchmarks: (a) optical flow analysis to capture bubble dynamics, and (b) operator networks for learning temperature dynamics. The BubbleML dataset and its benchmarks serve as a catalyst for advancements in ML-driven research on multiphysics phase change phenomena, enabling the development and comparison of state-of-the-art techniques and models.

Submitted 24 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

Comments: Submitted to Neurips Datasets and Benchmarks Track 2023

arXiv:2307.10569 [pdf, ps, other]

Deceptive Alignment Monitoring

Authors: Andres Carranza, Dhruv Pai, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

Abstract: As the capabilities of large machine learning models continue to grow, and as the autonomy afforded to such models continues to expand, the spectre of a new adversary looms: the models themselves. The threat that a model might behave in a seemingly reasonable manner, while secretly and subtly modifying its behavior for ulterior reasons is often referred to as deceptive alignment in the AI Safety & Alignment communities. Consequently, we call this new direction Deceptive Alignment Monitoring. In this work, we identify emerging directions in diverse machine learning subfields that we believe will become increasingly important and intertwined in the near future for deceptive alignment monitoring, and we argue that advances in these fields present both long-term challenges and new research opportunities. We conclude by advocating for greater involvement by the adversarial machine learning community in these emerging directions.

Submitted 25 July, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

Comments: Accepted as BlueSky Oral to 2023 ICML AdvML Workshop

arXiv:2307.10563 [pdf, other]

FACADE: A Framework for Adversarial Circuit Anomaly Detection and Evaluation

Authors: Dhruv Pai, Andres Carranza, Rylan Schaeffer, Arnuv Tandon, Sanmi Koyejo

Abstract: We present FACADE, a novel probabilistic and geometric framework designed for unsupervised mechanistic anomaly detection in deep neural networks. Its primary goal is advancing the understanding and mitigation of adversarial attacks. FACADE aims to generate probabilistic distributions over circuits, which provide critical insights to their contribution to changes in the manifold properties of pseudo-classes, or high-dimensional modes in activation space, yielding a powerful tool for uncovering and combating adversarial attacks. Our approach seeks to improve model robustness, enhance scalable model oversight, and demonstrates promising applications in real-world deployment settings.

Submitted 20 July, 2023; originally announced July 2023.

Comments: Accepted as BlueSky Poster at 2023 ICML AdvML Workshop

arXiv:2307.09974 [pdf, ps, other]

Dynamic factor and VARMA models: equivalent representations, dimension reduction and nonlinear matrix equations

Authors: Shankar Bhamidi, Dhruv Patel, Vladas Pipiras

Abstract: A dynamic factor model with factor series following a VAR$(p)$ model is shown to have a VARMA$(p,p)$ model representation. Reduced-rank structures are identified for the VAR and VMA components of the resulting VARMA model. It is also shown how the VMA component parameters can be computed numerically from the original model parameters via the innovations algorithm, and connections of this approach to non-linear matrix equations are made. Some VAR models related to the resulting VARMA model are also discussed.

Submitted 19 July, 2023; originally announced July 2023.

MSC Class: Primary: 62M10. Secondary: 15A24; 65F45

arXiv:2307.09970 [pdf, other]

Correlation networks, dynamic factor models and community detection

Authors: Shankar Bhamidi, Dhruv Patel, Vladas Pipiras, Guorong Wu

Abstract: A dynamic factor model with a mixture distribution of the loadings is introduced and studied for multivariate, possibly high-dimensional time series. The correlation matrix of the model exhibits a block structure, reminiscent of correlation patterns for many real multivariate time series. A standard $k$-means algorithm on the loadings estimated through principal components is used to cluster component time series into communities with accompanying bounds on the misclustering rate. This is one standard method of community detection applied to correlation matrices viewed as weighted networks. This work puts a mixture model, a dynamic factor model and network community detection in one interconnected framework. Performance of the proposed methodology is illustrated on simulated and real data.

Submitted 19 July, 2023; originally announced July 2023.

MSC Class: Primary: 62M10; 62H30; 05C22. Secondary: 62H20

arXiv:2307.09423 [pdf, other]

Scaling Laws for Imitation Learning in Single-Agent Games

Authors: Jens Tuyls, Dhruv Madeka, Kari Torkkola, Dean Foster, Karthik Narasimhan, Sham Kakade

Abstract: Imitation Learning (IL) is one of the most widely used methods in machine learning. Yet, many works find it is often unable to fully recover the underlying expert behavior, even in constrained environments like single-agent games. However, none of these works deeply investigate the role of scaling up the model and data size. Inspired by recent work in Natural Language Processing (NLP) where "scaling up" has resulted in increasingly more capable LLMs, we investigate whether carefully scaling up model and data size can bring similar improvements in the imitation learning setting for single-agent games. We first demonstrate our findings on a variety of Atari games, and thereafter focus on the extremely challenging game of NetHack. In all games, we find that IL loss and mean return scale smoothly with the compute budget (FLOPs) and are strongly correlated, resulting in power laws for training compute-optimal IL agents. Finally, we forecast and train several NetHack agents with IL and find they outperform prior state-of-the-art by 1.5x in all settings. Our work both demonstrates the scaling behavior of imitation learning in a variety of single-agent games, as well as the viability of scaling up current approaches for increasingly capable agents in NetHack, a game that remains elusively hard for current AI systems.

Submitted 10 March, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

arXiv:2307.08694 [pdf, ps, other]

Ramsey numbers and the Zarankiewicz problem

Authors: David Conlon, Sam Mattheus, Dhruv Mubayi, Jacques Verstraëte

Abstract: Building on recent work of Mattheus and Verstraëte, we establish a general connection between Ramsey numbers of the form $r(F,t)$ for $F$ a fixed graph and a variant of the Zarankiewicz problem asking for the maximum number of 1s in an $m$ by $n$ $0/1$-matrix that does not have any matrix from a fixed finite family $\mathcal{L}(F)$ derived from $F$ as a submatrix. As an application, we give new lower bounds for the Ramsey numbers $r(C_5,t)$ and $r(C_7,t)$, namely, $r(C_5,t) = \tilde{\Omega}(t^{\frac{10}{7}})$ and $r(C_7,t) = \tilde{\Omega}(t^{\frac{5}{4}})$. We also show how the truth of a plausible conjecture about Zarankiewicz numbers would allow an approximate determination of $r(C_{2\ell+1}, t)$ for any fixed integer $\ell \geq 2$.

Submitted 24 April, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

Comments: 9 pages

arXiv:2307.05646 [pdf, other]

Better Handling Coreference Resolution in Aspect Level Sentiment Classification by Fine-Tuning Language Models

Authors: Dhruv Mullick, Bilal Ghanem, Alona Fyshe

Abstract: Customer feedback is invaluable to companies as they refine their products. Monitoring customer feedback can be automated with Aspect Level Sentiment Classification (ALSC) which allows us to analyse specific aspects of the products in reviews. Large Language Models (LLMs) are the heart of many state-of-the-art ALSC solutions, but they perform poorly in some scenarios requiring Coreference Resolution (CR). In this work, we propose a framework to improve an LLM's performance on CR-containing reviews by fine tuning on highly inferential tasks. We show that the performance improvement is likely attributed to the improved model CR ability. We also release a new dataset that focuses on CR in ALSC.

Submitted 11 July, 2023; originally announced July 2023.

Comments: Work done up till December 2022

arXiv:2307.00586 [pdf, other]

ClipSitu: Effectively Leveraging CLIP for Conditional Predictions in Situation Recognition

Authors: Debaditya Roy, Dhruv Verma, Basura Fernando

Abstract: Situation Recognition is the task of generating a structured summary of what is happening in an image using an activity verb and the semantic roles played by actors and objects. In this task, the same activity verb can describe a diverse set of situations as well as the same actor or object category can play a diverse set of semantic roles depending on the situation depicted in the image. Hence a situation recognition model needs to understand the context of the image and the visual-linguistic meaning of semantic roles. Therefore, we leverage the CLIP foundational model that has learned the context of images via language descriptions. We show that deeper-and-wider multi-layer perceptron (MLP) blocks obtain noteworthy results for the situation recognition task by using CLIP image and text embedding features and it even outperforms the state-of-the-art CoFormer, a Transformer-based model, thanks to the external implicit visual-linguistic knowledge encapsulated by CLIP and the expressive power of modern MLP block designs. Motivated by this, we design a cross-attention-based Transformer using CLIP visual tokens that model the relation between textual roles and visual entities. Our cross-attention-based Transformer known as ClipSitu XTF outperforms existing state-of-the-art by a large margin of 14.1% on semantic role labelling (value) for top-1 accuracy using the imSitu dataset. Similarly, our ClipSitu XTF obtains state-of-the-art situation localization performance. We will make the code publicly available.

Submitted 11 September, 2023; v1 submitted 2 July, 2023; originally announced July 2023.

Comments: State-of-the-art results on Grounded Situation Recognition

arXiv:2306.16722 [pdf, other]

Evaluating Paraphrastic Robustness in Textual Entailment Models

Authors: Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha, Benjamin Van Durme, Adam Poliak

Abstract: We present PaRTE, a collection of 1,126 pairs of Recognizing Textual Entailment (RTE) examples to evaluate whether models are robust to paraphrasing. We posit that if RTE models understand language, their predictions should be consistent across inputs that share the same meaning. We use the evaluation set to determine if RTE models' predictions change when examples are paraphrased. In our experiments, contemporary models change their predictions on 8-16% of paraphrased examples, indicating that there is still room for improvement.

Submitted 29 June, 2023; originally announced June 2023.

arXiv:2306.16112 [pdf, other]

doi 10.1038/s44310-024-00006-9

Controlling lasing around Exceptional Points in Coupled Nanolasers

Authors: Anna Fischer, T. V. Raziman, Wai Kit Ng, Jente Clarysse, Jakub Dranczewski, Dhruv Saxena, Stefano Vezzoli, Heinz Schmid, Kirsten Moselund, Riccardo Sapienza

Abstract: Coupled nanolasers are of growing interest for on-chip optical computation and data transmission, which requires an understanding of how lasers interact to form complex systems. The non-Hermitian interaction between two coupled resonators, when excited selectively, can lead to parity-time symmetry, the formation of exceptional points, and subsequently spectral control and increased sensitivity. These investigations have been limited to pump energies close to the lasing threshold, and large or narrow-line lasers. Here, by programmable optical excitation we study two coupled nanolasers significantly above threshold, where mode instability plays an important role. We map the mode evolution around two exceptional points, and observe lasing gaps due to reversed pump dependence which compare well with nonlinear theory. Finally, the coupling can be exploited to control the lasing threshold and wavelength, and for frequency switching around the lasing gap. Controlled and integrated nanolasers constitute a promising platform for future highly sensitive and programmable on-chip laser sources.

Submitted 28 June, 2023; originally announced June 2023.

Comments: 8 pages, 4 figures

arXiv:2306.14846 [pdf, other]

ViNT: A Foundation Model for Visual Navigation

Authors: Dhruv Shah, Ajay Sridhar, Nitish Dashora, Kyle Stachowicz, Kevin Black, Noriaki Hirose, Sergey Levine

Abstract: General-purpose pre-trained models ("foundation models") have enabled practitioners to produce generalizable solutions for individual machine learning problems with datasets that are significantly smaller than those required for learning from scratch. Such models are typically trained on large and diverse datasets with weak supervision, consuming much more training data than is available for any individual downstream application. In this paper, we describe the Visual Navigation Transformer (ViNT), a foundation model that aims to bring the success of general-purpose pre-trained models to vision-based robotic navigation. ViNT is trained with a general goal-reaching objective that can be used with any navigation dataset, and employs a flexible Transformer-based architecture to learn navigational affordances and enable efficient adaptation to a variety of downstream navigational tasks. ViNT is trained on a number of existing navigation datasets, comprising hundreds of hours of robotic navigation from a variety of different robotic platforms, and exhibits positive transfer, outperforming specialist models trained on singular datasets. ViNT can be augmented with diffusion-based subgoal proposals to explore novel environments, and can solve kilometer-scale navigation problems when equipped with long-range heuristics.
ViNT can also be adapted to novel task specifications with a technique inspired by prompt-tuning, where the goal encoder is replaced by an encoding of another task modality (e.g., GPS waypoints or routing commands) embedded into the same space of goal tokens. This flexibility and ability to accommodate a variety of downstream problem domains establishes ViNT as an effective foundation model for mobile robotics. For videos, code, and model checkpoints, see our project page at https://visualnav-transformer.github.io.

Submitted 24 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: Accepted for oral presentation at CoRL 2023

arXiv:2306.14812 [pdf, other]

MOVES: Movable and Moving LiDAR Scene Segmentation in Label-Free settings using Static Reconstruction

Authors: Prashant Kumar, Dhruv Makwana, Onkar Susladkar, Anurag Mittal, Prem Kumar Kalra

Abstract: Accurate static structure reconstruction and segmentation of non-stationary objects is of vital importance for autonomous navigation applications. These applications assume a LiDAR scan to consist of only static structures. In the real world however, LiDAR scans consist of non-stationary dynamic structures - moving and movable objects. Current solutions use segmentation information to isolate and remove moving structures from LiDAR scan. This strategy fails in several important use-cases where segmentation information is not available. In such scenarios, moving objects and objects with high uncertainty in their motion i.e. movable objects, may escape detection. This violates the above assumption. We present MOVES, a novel GAN based adversarial model that segments out moving as well as movable objects in the absence of segmentation information. We achieve this by accurately transforming a dynamic LiDAR scan to its corresponding static scan. This is obtained by replacing dynamic objects and corresponding occlusions with static structures which were occluded by dynamic objects. We leverage corresponding static-dynamic LiDAR pairs.

Submitted 15 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: 35 pages, 8 figures, 6 tables

arXiv:2306.13216 [pdf, other]

Diverse Community Data for Benchmarking Data Privacy Algorithms

Authors: Aniruddha Sen, Christine Task, Dhruv Kapur, Gary Howarth, Karan Bhagat

Abstract: The Collaborative Research Cycle (CRC) is a National Institute of Standards and Technology (NIST) benchmarking program intended to strengthen understanding of tabular data deidentification technologies. Deidentification algorithms are vulnerable to the same bias and privacy issues that impact other data analytics and machine learning applications, and can even amplify those issues by contaminating downstream applications. This paper summarizes four CRC contributions: theoretical work on the relationship between diverse populations and challenges for equitable deidentification; public benchmark data focused on diverse populations and challenging features; a comprehensive open source suite of evaluation metrology for deidentified datasets; and an archive of more than 450 deidentified data samples from a broad range of techniques. The initial set of evaluation results demonstrates the value of these tools for investigations in this field.

Submitted 31 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

Journal ref: https://proceedings.neurips.cc/paper_files/paper/2023/file/a15032f8199511ced4d7a8e2bbb487a5-Paper-Datasets_and_Benchmarks.pdf

arXiv:2306.11565 [pdf, other]

HomeRobot: Open-Vocabulary Mobile Manipulation

Authors: Sriram Yenamandra, Arun Ramachandran, Karmesh Yadav, Austin Wang, Mukul Khanna, Theophile Gervet, Tsung-Yen Yang, Vidhi Jain, Alexander William Clegg, John Turner, Zsolt Kira, Manolis Savva, Angel Chang, Devendra Singh Chaplot, Dhruv Batra, Roozbeh Mottaghi, Yonatan Bisk, Chris Paxton

Abstract: HomeRobot (noun): An affordable compliant robot that navigates homes and manipulates a wide range of objects in order to complete everyday tasks. Open-Vocabulary Mobile Manipulation (OVMM) is the problem of picking any object in any unseen environment, and placing it in a commanded location. This is a foundational challenge for robots to be useful assistants in human environments, because it involves tackling sub-problems from across robotics: perception, language understanding, navigation, and manipulation are all essential to OVMM. In addition, integration of the solutions to these sub-problems poses its own substantial challenges. To drive research in this area, we introduce the HomeRobot OVMM benchmark, where an agent navigates household environments to grasp novel objects and place them on target receptacles. HomeRobot has two components: a simulation component, which uses a large and diverse curated object set in new, high-quality multi-room home environments; and a real-world component, providing a software stack for the low-cost Hello Robot Stretch to encourage replication of real-world experiments across labs. We implement both reinforcement learning and heuristic (model-based) baselines and show evidence of sim-to-real transfer. Our baselines achieve a 20% success rate in the real world; our experiments identify ways future research can improve performance. See videos on our website: https://ovmm.github.io/.

Submitted 10 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

Comments: 37 pages, 22 figures, 8 tables

arXiv:2306.11290 [pdf, other]

Habitat Synthetic Scenes Dataset (HSSD-200): An Analysis of 3D Scene Scale and Realism Tradeoffs for ObjectGoal Navigation

Authors: Mukul Khanna, Yongsen Mao, Hanxiao Jiang, Sanjay Haresh, Brennan Shacklett, Dhruv Batra, Alexander Clegg, Eric Undersander, Angel X. Chang, Manolis Savva

Abstract: We contribute the Habitat Synthetic Scene Dataset, a dataset of 211 high-quality 3D scenes, and use it to test navigation agent generalization to realistic 3D environments. Our dataset represents real interiors and contains a diverse set of 18,656 models of real-world objects. We investigate the impact of synthetic 3D scene dataset scale and realism on the task of training embodied agents to find and navigate to objects (ObjectGoal navigation). By comparing to synthetic 3D scene datasets from prior work, we find that scale helps in generalization, but the benefits quickly saturate, making visual fidelity and correlation to real-world scenes more important. Our experiments show that agents trained on our smaller-scale dataset can match or outperform agents trained on much larger datasets. Surprisingly, we observe that agents trained on just 122 scenes from our dataset outperform agents trained on 10,000 scenes from the ProcTHOR-10K dataset in terms of zero-shot generalization in real-world scanned environments.

Submitted 7 December, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.10174 [pdf, other]

doi 10.1016/j.jcp.2024.113122

A Vortex Damping Outflow Forcing for Multiphase Flows with Sharp Interfacial Jumps

Authors: Akash Dhruv

Abstract: Outflow boundaries play an important role in multiphase fluid dynamics simulations that involve transition between liquid and vapor phases. These flows are dominated by low Weber numbers and a sharp jump in pressure, velocity, and temperature. Inadequate treatment of these jumps at the outlet generates undesirable fluid disturbances that propagate upstream and lead to instabilities within the computational domain. To mitigate these disturbances, we introduce a forcing term that can be applied to incompressible Navier-Stokes equations to enforce stability in the numerical solution. The forcing term acts as a damping mechanism to control vortices that are generated by droplet/bubbles in multiphase flows, and is designed to be a general formulation that can be coupled with a fixed pressure outflow boundary condition to simulate a variety of multiphase flow problems. We demonstrate its applicability to simulate pool and flow boiling problems, where bubble-induced vortices during evaporation and condensation present a challenge at the outflow. Validation and verification cases are chosen to quantify accuracy and stability of the proposed method in comparison to established benchmarks and reference solutions, along with detailed performance analysis for three-dimensional simulations on leadership supercomputing platforms. Computational experiments are performed using Flash-X, which is a composable open-source software instrument designed for multiscale fluid dynamics simulations on heterogeneous architectures.

Submitted 18 May, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

Comments: Preprint Submitted to Elsevier

arXiv:2306.07552 [pdf, other]

Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second

Authors: Vincent-Pierre Berges, Andrew Szot, Devendra Singh Chaplot, Aaron Gokaslan, Roozbeh Mottaghi, Dhruv Batra, Eric Undersander

Abstract: We present Galactic, a large-scale simulation and reinforcement-learning (RL) framework for robotic mobile manipulation in indoor environments. Specifically, a Fetch robot (equipped with a mobile base, 7DoF arm, RGBD camera, egomotion, and onboard sensing) is spawned in a home environment and asked to rearrange objects - by navigating to an object, picking it up, navigating to a target location, and then placing the object at the target location. Galactic is fast. In terms of simulation speed (rendering + physics), Galactic achieves over 421,000 steps-per-second (SPS) on an 8-GPU node, which is 54x faster than Habitat 2.0 (7699 SPS). More importantly, Galactic was designed to optimize the entire rendering + physics + RL interplay since any bottleneck in the interplay slows down training. In terms of simulation+RL speed (rendering + physics + inference + learning), Galactic achieves over 108,000 SPS, which is 88x faster than Habitat 2.0 (1243 SPS). These massive speed-ups not only drastically cut the wall-clock training time of existing experiments, but also unlock an unprecedented scale of new experiments. First, Galactic can train a mobile pick skill to >80% accuracy in under 16 minutes, a 100x speedup compared to the over 24 hours it takes to train the same skill in Habitat 2.0. Second, we use Galactic to perform the largest-scale experiment to date for rearrangement using 5B steps of experience in 46 hours, which is equivalent to 20 years of robot experience.
This scaling results in a single neural network composed of task-agnostic components achieving 85% success in GeometricGoal rearrangement, compared to 0% success reported in Habitat 2.0 for the same approach. The code is available at github.com/facebookresearch/galactic.

Submitted 13 June, 2023; originally announced June 2023.

arXiv:2306.06084 [pdf]

doi 10.1109/M2VIP55626.2022.10041089

Machine Vision Using Cellphone Camera: A Comparison of deep networks for classifying three challenging denominations of Indian Coins

Authors: Keyur D. Joshi, Dhruv Shah, Varshil Shah, Nilay Gandhi, Sanket J. Shah, Sanket B. Shah

Abstract: Indian currency coins come in a variety of denominations. Of all the varieties, Rs.1, Rs.2, and Rs.5 have similar diameters. The majority of the coin styles in market circulation for denominations of Rs.1 and Rs.2 coins are nearly the same except for numerals on their reverse side. If a coin is resting on its obverse side, the correct denomination is not distinguishable by humans. Therefore, it was hypothesized that a digital image of a coin resting on either side could be classified into its correct denomination by training a deep neural network model. The digital images were generated by using cheap cell phone cameras. To find the most suitable deep neural network architecture, four were selected based on the preliminary analysis carried out for comparison. The results confirm that two of the four deep neural network models can classify the correct denomination from either side of a coin with an accuracy of 97%.

Submitted 12 May, 2023; originally announced June 2023.

Comments: 6 Pages, 4 Figures, 6 Tables, Conference paper

arXiv:2306.05649 [pdf, other]

Specifying and Solving Robust Empirical Risk Minimization Problems Using CVXPY

Authors: Eric Luxenberg, Dhruv Malik, Yuanzhi Li, Aarti Singh, Stephen Boyd

Abstract: We consider robust empirical risk minimization (ERM), where model parameters are chosen to minimize the worst-case empirical loss when each data point varies over a given convex uncertainty set. In some simple cases, such problems can be expressed in an analytical form. In general the problem can be made tractable via dualization, which turns a min-max problem into a min-min problem. Dualization requires expertise and is tedious and error-prone. We demonstrate how CVXPY can be used to automate this dualization procedure in a user-friendly manner. Our framework allows practitioners to specify and solve robust ERM problems with a general class of convex losses, capturing many standard regression and classification problems. Users can easily specify any complex uncertainty set that is representable via disciplined convex programming (DCP) constraints.

Submitted 13 June, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

arXiv:2306.04626 [pdf, other]

doi 10.3847/1538-4357/acef1f

Constraining nuclear parameters using Gravitational waves from f-mode Oscillations in Neutron Stars

Authors: Bikram Keshari Pradhan, Dhruv Pathak, Debarati Chatterjee

Abstract: Gravitational waves (GW) emanating from unstable quasi-normal modes in Neutron Stars (NS) could be accessible with the improved sensitivity of the current GW detectors or with the next-generation GW detectors and, therefore, can be employed to study the NS interior. Assuming f-mode excitation in isolated pulsars with typical energy of pulsar glitches and considering potential f-mode GW candidates for A+ (upgraded LIGO detectors operating at 5th observation run design sensitivity) and Einstein Telescope (ET), we demonstrate the inverse problem of NS asteroseismology within a Bayesian formalism to constrain the nuclear parameters and NS Equation of State (EOS). We describe the NS interior within relativistic mean field formalism. Taking the example of glitching pulsars, we find that for a single event in A+ and ET, among the nuclear parameters, the nucleon effective mass ($m^*$) within 90% credible interval (CI) can be restricted within $10\%$ and $5\%$, respectively. At the same time, the incompressibility ($K$) and the slope of the symmetry energy ($L$) are only loosely constrained. Considering multiple (10) events in A+ and ET, all the nuclear parameters are well constrained, especially $m^*$, which can be constrained to 3% and 2% in A+ and ET, respectively. Uncertainty in the observables of a $1.4M_{\odot}$ NS such as radius ($R_{1.4M_{\odot}}$), f-mode frequency ($f_{1.4M_{\odot}}$), damping time ($\tau_{1.4M_{\odot}}$) and a few EOS properties including squared speed of sound ($c_s^2$) are also estimated.

Submitted 6 September, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: Accepted for publication in the Astrophysical Journal (APJ). 17 pages and 9 figures

Report number: LIGO-P2300155

Journal ref: The Astrophysical Journal (APJ), volume 956, number 1, pages 38, year 2023

arXiv:2306.03973 [pdf, other]

Three-Dimensional General-Relativistic Simulations of Neutrino-Driven Winds from Magnetized Proto-Neutron Stars

Authors: Dhruv K. Desai, Daniel M. Siegel, Brian D. Metzger

Abstract: Formed in the aftermath of a core-collapse supernova or neutron star merger, a hot proto-neutron star (PNS) launches an outflow driven by neutrino heating lasting for up to tens of seconds. Though such winds are considered potential sites for the nucleosynthesis of heavy elements via the rapid neutron capture process ($r$-process), previous work has shown that unmagnetized PNS winds fail to achieve the necessary combination of high entropy and/or short dynamical timescale in the seed nucleus formation region. We present three-dimensional general-relativistic magnetohydrodynamical (GRMHD) simulations of PNS winds which include the effects of a dynamically strong ($B \gtrsim 10^{15}$ G) dipole magnetic field. After initializing the magnetic field, the wind quickly develops a helmet-streamer configuration, characterized by outflows along open polar magnetic field lines and a ``closed'' zone of trapped plasma at lower latitudes. Neutrino heating within the closed zone causes the thermal pressure of the trapped material to rise in time compared to the polar outflow regions, ultimately leading to the expulsion of this matter from the closed zone on a timescale of $\sim$60 ms, consistent with the predictions of \citet{Thompson03}. The high entropies of these transient ejecta are still growing at the end of our simulations and are sufficient to enable a successful 2nd-peak $r$-process in at least a modest $\gtrsim 1\%$ of the equatorial wind ejecta.

Submitted 6 June, 2023; originally announced June 2023.

Comments: 22 pages, 16 figures

arXiv:2306.03733 [pdf, other]

A Novel Approach To User Agent String Parsing For Vulnerability Analysis Using Mutli-Headed Attention

Authors: Dhruv Nandakumar, Sathvik Murli, Ankur Khosla, Kevin Choi, Abdul Rahman, Drew Walsh, Scott Riede, Eric Dull, Edward Bowen

Abstract: The increasing reliance on the internet has led to the proliferation of a diverse set of web-browsers and operating systems (OSs) capable of browsing the web. User agent strings (UASs) are a component of web browsing that are transmitted with every Hypertext Transfer Protocol (HTTP) request. They contain information about the client device and software, which is used by web servers for various purposes such as content negotiation and security. However, due to the proliferation of various browsers and devices, parsing UASs is a non-trivial task due to a lack of standardization of UAS formats. Current rules-based approaches are often brittle and can fail when encountering such non-standard formats. In this work, a novel methodology for parsing UASs using Multi-Headed Attention Based transformers is proposed. The proposed methodology exhibits strong performance in parsing a variety of UASs with differing formats. Furthermore, a framework to utilize parsed UASs to estimate the vulnerability scores for large sections of publicly visible IT networks or regions is also discussed. The methodology presented here can also be easily extended or deployed for real-time parsing of logs in enterprise settings.

Submitted 6 June, 2023; originally announced June 2023.

Comments: Accepted to the International Conference on Machine Learning and Cybernetics (ICMLC) 2023

arXiv:2306.01993 [pdf, ps, other]

Provable benefits of score matching

Authors: Chirag Pabbaraju, Dhruv Rohatgi, Anish Sevekari, Holden Lee, Ankur Moitra, Andrej Risteski

Abstract: Score matching is an alternative to maximum likelihood (ML) for estimating a probability distribution parametrized up to a constant of proportionality. By fitting the ''score'' of the distribution, it sidesteps the need to compute this constant of proportionality (which is often intractable). While score matching and variants thereof are popular in practice, precise theoretical understanding of the benefits and tradeoffs with maximum likelihood -- both computational and statistical -- are not well understood. In this work, we give the first example of a natural exponential family of distributions such that the score matching loss is computationally efficient to optimize, and has a comparable statistical efficiency to ML, while the ML loss is intractable to optimize using a gradient-based method. The family consists of exponentials of polynomials of fixed degree, and our result can be viewed as a continuous analogue of recent developments in the discrete setting. Precisely, we show: (1) Designing a zeroth-order or first-order oracle for optimizing the maximum likelihood loss is NP-hard. (2) Maximum likelihood has a statistical efficiency polynomial in the ambient dimension and the radius of the parameters of the family. (3) Minimizing the score matching loss is both computationally and statistically efficient, with complexity polynomial in the ambient dimension.

Submitted 2 June, 2023; originally announced June 2023.

Comments: 25 Pages

arXiv:2306.01874 [pdf, other]

SACSoN: Scalable Autonomous Control for Social Navigation

Authors: Noriaki Hirose, Dhruv Shah, Ajay Sridhar, Sergey Levine

Abstract: Machine learning provides a powerful tool for building socially compliant robotic systems that go beyond simple predictive models of human behavior. By observing and understanding human interactions from past experiences, learning can enable effective social navigation behaviors directly from data. In this paper, our goal is to develop methods for training policies for socially unobtrusive navigation, such that robots can navigate among humans in ways that don't disturb human behavior. We introduce a definition for such behavior based on the counterfactual perturbation of the human: if the robot had not intruded into the space, would the human have acted in the same way? By minimizing this counterfactual perturbation, we can induce robots to behave in ways that do not alter the natural behavior of humans in the shared space. Instantiating this principle requires training policies to minimize their effect on human behavior, and this in turn requires data that allows us to model the behavior of humans in the presence of robots. Therefore, our approach is based on two key contributions. First, we collect a large dataset where an indoor mobile robot interacts with human bystanders. Second, we utilize this dataset to train policies that minimize counterfactual perturbation. We provide supplementary videos and make publicly available the largest-of-its-kind visual navigation dataset on our project page.

Submitted 25 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

Comments: 11 pages, 15 figures, 4 tables

arXiv:2306.00087 [pdf, other]

Adaptive Coordination in Social Embodied Rearrangement

Authors: Andrew Szot, Unnat Jain, Dhruv Batra, Zsolt Kira, Ruta Desai, Akshara Rai

Abstract: We present the task of "Social Rearrangement", consisting of cooperative everyday tasks like setting up the dinner table, tidying a house or unpacking groceries in a simulated multi-agent environment. In Social Rearrangement, two robots coordinate to complete a long-horizon task, using onboard sensing and egocentric observations, and no privileged information about the environment. We study zero-shot coordination (ZSC) in this task, where an agent collaborates with a new partner, emulating a scenario where a robot collaborates with a new human partner. Prior ZSC approaches struggle to generalize in our complex and visually rich setting, and on further analysis, we find that they fail to generate diverse coordination behaviors at training time. To counter this, we propose Behavior Diversity Play (BDP), a novel ZSC approach that encourages diversity through a discriminability objective. Our results demonstrate that BDP learns adaptive agents that can tackle visual coordination, and zero-shot generalize to new partners in unseen environments, achieving 35% higher success and 32% higher efficiency compared to baselines.

Submitted 31 May, 2023; originally announced June 2023.

arXiv:2305.16892 [pdf, other]

Feature Adaptation for Sparse Linear Regression

Authors: Jonathan Kelner, Frederic Koehler, Raghu Meka, Dhruv Rohatgi

Abstract: Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,Σ)$, and we seek an estimator with small excess risk. If the true signal is $t$-sparse, information-theoretically, it is possible to achieve strong recovery guarantees with only $O(t\log n)$ samples. However, computationally efficient algorithms have sample complexity linear in (some variant of) the condition number of $Σ$. Classical algorithms such as the Lasso can require significantly more samples than necessary even if there is only a single sparse approximate dependency among the covariates. We provide a polynomial-time algorithm that, given $Σ$, automatically adapts the Lasso to tolerate a small number of approximate dependencies. In particular, we achieve near-optimal sample complexity for constant sparsity and if $Σ$ has few ``outlier'' eigenvalues. Our algorithm fits into a broader framework of feature adaptation for sparse linear regression with ill-conditioned covariates. With this framework, we additionally provide the first polynomial-factor improvement over brute-force search for constant sparsity $t$ and arbitrary covariance $Σ$.

Submitted 26 May, 2023; originally announced May 2023.

arXiv:2305.15488 [pdf, other]

Foundational Models for Malware Embeddings Using Spatio-Temporal Parallel Convolutional Networks

Authors: Dhruv Nandakumar, Devin Quinn, Elijah Soba, Eunyoung Kim, Christopher Redino, Chris Chan, Kevin Choi, Abdul Rahman, Edward Bowen

Abstract: In today's interconnected digital landscape, the proliferation of malware poses a significant threat to the security and stability of computer networks and systems worldwide. As the complexity of malicious tactics, techniques, and procedures (TTPs) continuously grows to evade detection, so does the need for advanced methods capable of capturing and characterizing malware behavior. The current state of the art in malware classification and detection uses task specific objectives; however, this method fails to generalize to other downstream tasks involving the same malware class. In this paper, the authors introduce a novel method that combines convolutional neural networks, standard graph embedding techniques, and a metric learning objective to extract meaningful information from network flow data and create strong embeddings characterizing malware behavior. These embeddings enable the development of highly accurate, efficient, and generalizable machine learning models for tasks such as malware strain classification, zero day threat detection, and closest attack type attribution as demonstrated in this paper. A shift from task specific objectives to strong embeddings will not only allow rapid iteration of cyber-threat detection models, but also allow different modalities to be introduced in the development of these models.

Submitted 24 May, 2023; originally announced May 2023.

Comments: 10 pages, 6 tables, 2 figures. Preprint, under review

arXiv:2305.14823 [pdf, other]

doi 10.1051/0004-6361/202346599

VERTICO VI: Cold-gas asymmetries in Virgo cluster galaxies

Authors: Ian D. Roberts, Toby Brown, Nikki Zabel, Christine D. Wilson, Aeree Chung, Laura C. Parker, Dhruv Bisaria, Alessandro Boselli, Barbara Catinella, Ryan Chown, Luca Cortese, Timothy A. Davis, Sara Ellison, Maria Jesus Jimenez-Donaire, Bumhyun Lee, Rory Smith, Kristine Spekkens, Adam R. H. Stevens, Mallory Thorp, Vincente Villanueva, Adam B. Watts, Charlotte Welker, Hyein Yoon

Abstract: We analyze cold-gas distributions in Virgo cluster galaxies using resolved CO(2-1) (tracing molecular hydrogen, H2) and HI observations from the Virgo Environment Traced In CO (VERTICO) and the VLA Imaging of Virgo in Atomic Gas (VIVA) surveys. From a theoretical perspective, it is expected that environmental processes in clusters will have a stronger influence on diffuse atomic gas compared to the relatively dense molecular gas component, and that these environmental perturbations can compress the cold interstellar medium in cluster galaxies leading to elevated star formation. In this work we observationally test these predictions for star-forming satellite galaxies within the Virgo cluster. We divide our Virgo galaxy sample into HI-normal, HI-tailed, and HI-truncated classes and show, unsurprisingly, that the HI-tailed galaxies have the largest quantitative HI asymmetries. We also compare to a control sample of non-cluster galaxies and find that Virgo galaxies, on average, have HI asymmetries that are 40 +/- 10 per cent larger than the control. There is less separation between control, HI-normal, HI-tailed, and HI-truncated galaxies in terms of H2 asymmetries, and on average, Virgo galaxies have H2 asymmetries that are only marginally (20 +/- 10 per cent) larger than the control sample. We find a weak correlation between HI and H2 asymmetries over our entire sample, but a stronger correlation for those specific galaxies being strongly impacted by environmental perturbations.
Finally, we divide the discs of the HI-tailed Virgo galaxies into a leading half and trailing half according to the observed tail direction. We find evidence for excess molecular gas mass on the leading halves of the disc. This excess molecular gas on the leading half is accompanied by an excess in star formation rate such that the depletion time is, on average, unchanged.

Submitted 24 May, 2023; originally announced May 2023.

Comments: 15 pages, 8 figures, 1 table, accepted for publication in A&A

Journal ref: A&A 675, A78 (2023)

arXiv:2305.14815 [pdf, other]

Machine Reading Comprehension using Case-based Reasoning

Authors: Dung Thai, Dhruv Agarwal, Mudit Chaudhary, Wenlong Zhao, Rajarshi Das, Manzil Zaheer, Jay-Yoon Lee, Hannaneh Hajishirzi, Andrew McCallum

Abstract: We present an accurate and interpretable method for answer extraction in machine reading comprehension that is reminiscent of case-based reasoning (CBR) from classical AI. Our method (CBR-MRC) builds upon the hypothesis that contextualized answers to similar questions share semantic similarities with each other. Given a test question, CBR-MRC first retrieves a set of similar cases from a nonparametric memory and then predicts an answer by selecting the span in the test context that is most similar to the contextualized representations of answers in the retrieved cases. The semi-parametric nature of our approach allows it to attribute a prediction to the specific set of evidence cases, making it a desirable choice for building reliable and debuggable QA systems. We show that CBR-MRC provides high accuracy comparable with large reader models and outperforms baselines by 11.5 and 8.4 EM on NaturalQuestions and NewsQA, respectively. Further, we demonstrate the ability of CBR-MRC in identifying not just the correct answer tokens but also the span with the most relevant supporting evidence. Lastly, we observe that contexts for certain question types show higher lexical diversity than others and find that CBR-MRC is robust to these variations while performance using fully-parametric methods drops.

Submitted 5 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: 9 pages, 2 figures

arXiv:2305.09857 [pdf, other]

CoEdIT: Text Editing by Task-Specific Instruction Tuning

Authors: Vipul Raheja, Dhruv Kumar, Ryan Koo, Dongyeop Kang

Abstract: We introduce CoEdIT, a state-of-the-art text editing system for writing assistance. CoEdIT takes instructions from the user specifying the attributes of the desired text, such as "Make the sentence simpler" or "Write it in a more neutral style," and outputs the edited text. We present a large language model fine-tuned on a diverse collection of task-specific instructions for text editing (a total of 82K instructions). Our model (1) achieves state-of-the-art performance on various text editing benchmarks, (2) is competitive with publicly available largest-sized LLMs trained on instructions while being nearly 60x smaller, (3) is capable of generalizing to unseen edit instructions, and (4) exhibits abilities to generalize to composite instructions containing different combinations of edit actions. Through extensive qualitative and quantitative analysis, we show that writers prefer the edits suggested by CoEdIT relative to other state-of-the-art text editing models. Our code, data, and models are publicly available at https://github.com/vipulraheja/coedit.

Submitted 23 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

Comments: Accepted to EMNLP 2023 (Findings). 18 pages, 13 tables, 2 figures

ACM Class: I.2.7

arXiv:2305.08082 [pdf, other]

Tidal forces in the Simpson-Visser black-bounce and wormhole spacetimes

Authors: Dhruv Arora, Parth Bambhaniya, Dipanjan Dey, Pankaj S. Joshi

Abstract: The concept of regular black holes has gained attention in recent years, especially in the context of quantum gravity theories. In these theories, the existence of singularities is paradoxical as they represent a breakdown of the laws of physics. Motivated by the recent developments in this area, we study the tidal force effects in one such family of regular geometries described by the Simpson-Visser metric. We find the radial and angular force profiles for a radially in-falling particle in this spacetime and calculate the variation of the geodesic separation vector with the radial coordinate using two different initial conditions. These results are then compared with that of Schwarzschild black hole spacetime. We show that for a regular black hole, both radial and angular tidal forces show a peak outside the horizon and then fall to ultimately switch their behavior from stretching to compression and vice-versa. Also, they are finite at $r=0$ unlike the Schwarzschild spacetime. It is also seen that the angular deviation profile shows an oscillating behavior for a particular initial condition. Our analysis can be used to distinguish between regular black hole, one-way and two-way wormholes and a singular black hole spacetimes.

Submitted 14 May, 2023; originally announced May 2023.

Comments: 12 pages, 17 figures

arXiv:2305.07367 [pdf, ps, other]

S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning

Authors: Rajdeep Dutta, Qincheng Wang, Ankur Singh, Dhruv Kumarjiguda, Li Xiaoli, Senthilnath Jayavelu

Abstract: This paper presents a novel RL algorithm, S-REINFORCE, which is designed to generate interpretable policies for dynamic decision-making tasks. The proposed algorithm leverages two types of function approximators, namely Neural Network (NN) and Symbolic Regressor (SR), to produce numerical and symbolic policies, respectively. The NN component learns to generate a numerical probability distribution over the possible actions using a policy gradient, while the SR component captures the functional form that relates the associated states with the action probabilities. The SR-generated policy expressions are then utilized through importance sampling to improve the rewards received during the learning process. We have tested the proposed S-REINFORCE algorithm on various dynamic decision-making problems with low and high dimensional action spaces, and the results demonstrate its effectiveness and impact in achieving interpretable solutions. By leveraging the strengths of both NN and SR, S-REINFORCE produces policies that are not only well-performing but also easy to interpret, making it an ideal choice for real-world applications where transparency and causality are crucial.

Submitted 12 May, 2023; originally announced May 2023.

Comments: 10 pages, 7 figures

arXiv:2305.07120 [pdf, other]

Geometric Modeling and Physics Simulation Framework for Building a Digital Twin of Extrusion-based Additive Manufacturing

Authors: Dhruv Gamdha, Kumar Saurabh, Baskar Ganapathysubramanian, Adarsh Krishnamurthy

Abstract: Accurate simulation of the printing process is essential for improving print quality, reducing waste, and optimizing the printing parameters of extrusion-based additive manufacturing. Traditional additive manufacturing simulations are very compute-intensive and are not scalable to simulate even moderately-sized geometries. In this paper, we propose a general framework for creating a digital twin of the dynamic printing process by performing physics simulations with the intermediate print geometries. Our framework takes a general extrusion-based additive manufacturing G-code, generates an analysis-suitable voxelized geometry representation from the print schedule, and performs physics-based (transient thermal and phase change) simulations of the printing process. Our approach leverages parallel adaptive octree meshes for both voxelated geometry representation as well as for fast simulations to address real-time predictions. We demonstrate the effectiveness of our method by simulating the printing of complex geometries at high voxel resolutions with both sparse and dense infills. Our results show that this approach scales to high voxel resolutions and can predict the transient heat distribution as the print progresses. This work lays the computational and algorithmic foundations for building real-time digital twins and performing rapid virtual print sequence exploration to improve print quality and further reduce material waste.

Submitted 9 May, 2023; originally announced May 2023.

Comments: 13 pages

arXiv:2305.05118 [pdf, other]

Flame: Simplifying Topology Extension in Federated Learning

Authors: Harshit Daga, Jaemin Shin, Dhruv Garg, Ada Gavrilovska, Myungjin Lee, Ramana Rao Kompella

Abstract: Distributed machine learning approaches, including a broad class of federated learning (FL) techniques, present a number of benefits when deploying machine learning applications over widely distributed infrastructures. The benefits are highly dependent on the details of the underlying machine learning topology, which specifies the functionality executed by the participating nodes, their dependencies and interconnections. Current systems lack the flexibility and extensibility necessary to customize the topology of a machine learning deployment. We present Flame, a new system that provides flexibility of the topology configuration of distributed FL applications around the specifics of a particular deployment context, and is easily extensible to support new FL architectures. Flame achieves this via a new high-level abstraction Topology Abstraction Graphs (TAGs). TAGs decouple the ML application logic from the underlying deployment details, making it possible to specialize the application deployment with reduced development effort. Flame is released as an open source project, and its flexibility and extensibility support a variety of topologies and mechanisms, and can facilitate the development of new FL methodologies.

Submitted 17 January, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

arXiv:2305.02955 [pdf, other]

Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality

Authors: Dhruv Malik, Conor Igoe, Yuanzhi Li, Aarti Singh

Abstract: In recommender system or crowdsourcing applications of online learning, a human's preferences or abilities are often a function of the algorithm's recent actions. Motivated by this, a significant line of work has formalized settings where an action's loss is a function of the number of times that action was recently played in the prior $m$ timesteps, where $m$ corresponds to a bound on human memory capacity. To more faithfully capture decay of human memory with time, we introduce the Weighted Tallying Bandit (WTB), which generalizes this setting by requiring that an action's loss is a function of a \emph{weighted} summation of the number of times that arm was played in the last $m$ timesteps. This WTB setting is intractable without further assumption. So we study it under Repeated Exposure Optimality (REO), a condition motivated by the literature on human physiology, which requires the existence of an action that when repetitively played will eventually yield smaller loss than any other sequence of actions. We study the minimization of the complete policy regret (CPR), which is the strongest notion of regret, in WTB under REO. Since $m$ is typically unknown, we assume we only have access to an upper bound $M$ on $m$. We show that for problems with $K$ actions and horizon $T$, a simple modification of the successive elimination algorithm has $O \left( \sqrt{KT} + (m+M)K \right)$ CPR.
Interestingly, up to an additive (in lieu of multiplicative) factor in $(m+M)K$, this recovers the classical guarantee for the simpler stochastic multi-armed bandit with traditional regret. We additionally show that in our setting, any algorithm will suffer additive CPR of $Ω\left( mK + M \right)$, demonstrating our result is nearly optimal. Our algorithm is computationally efficient, and we experimentally demonstrate its practicality and superiority over natural baselines.

Submitted 4 May, 2023; originally announced May 2023.

Comments: ICML 2023

arXiv:2305.01531 [pdf, ps, other]

Large cliques or co-cliques in hypergraphs with forbidden order-size pairs

Authors: Maria Axenovich, Domagoj Bradač, Lior Gishboliner, Dhruv Mubayi, Lea Weber

Abstract: The well-known Erdős-Hajnal conjecture states that for any graph $F$, there exists $ε>0$ such that every $n$-vertex graph $G$ that contains no induced copy of $F$ has a homogeneous set of size at least $n^ε$. We consider a variant of the Erdős-Hajnal problem for hypergraphs where we forbid a family of hypergraphs described by their orders and sizes. For graphs, we observe that if we forbid induced subgraphs on $m$ vertices and $f$ edges for any positive $m$ and $0\leq f \leq \binom{m}{2}$, then we obtain large homogeneous sets. For triple systems, in the first nontrivial case $m=4$, for every $S \subseteq \{0,1,2,3,4\}$, we give bounds on the minimum size of a homogeneous set in a triple system where the number of edges spanned by every four vertices is not in $S$. In most cases the bounds are essentially tight. We also determine, for all $S$, whether the growth rate is polynomial or polylogarithmic. Some open problems remain.

Submitted 2 May, 2023; originally announced May 2023.

Comments: A preliminary version of this manuscript appeared as arXiv:2303.09578

arXiv:2305.01098 [pdf, other]

IndoorSim-to-OutdoorReal: Learning to Navigate Outdoors without any Outdoor Experience

Authors: Joanne Truong, April Zitkovich, Sonia Chernova, Dhruv Batra, Tingnan Zhang, Jie Tan, Wenhao Yu

Abstract: We present IndoorSim-to-OutdoorReal (I2O), an end-to-end learned visual navigation approach that is trained solely in simulated short-range indoor environments and demonstrates zero-shot sim-to-real transfer to the outdoors for long-range navigation on the Spot robot. Our method uses zero real-world experience (indoor or outdoor), and requires the simulator to model no predominantly-outdoor phenomenon (sloped grounds, sidewalks, etc.). The key to I2O transfer is in providing the robot with additional context of the environment (i.e., a satellite map, a rough sketch of a map by a human, etc.) to guide the robot's navigation in the real-world. The provided context-maps do not need to be accurate or complete -- real-world obstacles (e.g., trees, bushes, pedestrians, etc.) are not drawn on the map, and openings are not aligned with where they are in the real-world. Crucially, these inaccurate context-maps provide a hint to the robot about a route to take to the goal. We find that our method that leverages Context-Maps is able to successfully navigate hundreds of meters in novel environments, avoiding novel obstacles on its path, to a distant goal without a single collision or human intervention. In comparison, policies without the additional context fail completely. Lastly, we test the robustness of the Context-Map policy by adding varying degrees of noise to the map in simulation. We find that the Context-Map policy is surprisingly robust to noise in the provided context-map.
In the presence of significantly inaccurate maps (corrupted with 50% noise, or entirely blank maps), the policy gracefully regresses to the behavior of a policy with no context. Videos are available at https://www.joannetruong.com/projects/i2o.html

Submitted 9 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

Showing 151–200 of 815 results for author: Dhruv