Search | arXiv e-print repository

SCAPE: Learning Stiffness Control from Augmented Position Control Experiences

Authors: Mincheol Kim, Scott Niekum, Ashish D. Deshpande

Abstract: We introduce a sample-efficient method for learning state-dependent stiffness control policies for dexterous manipulation. The ability to control stiffness facilitates safe and reliable manipulation by providing compliance and robustness to uncertainties. Most current reinforcement learning approaches to achieve robotic manipulation have exclusively focused on position control, often due to the di… ▽ More We introduce a sample-efficient method for learning state-dependent stiffness control policies for dexterous manipulation. The ability to control stiffness facilitates safe and reliable manipulation by providing compliance and robustness to uncertainties. Most current reinforcement learning approaches to achieve robotic manipulation have exclusively focused on position control, often due to the difficulty of learning high-dimensional stiffness control policies. This difficulty can be partially mitigated via policy guidance such as imitation learning. However, expert stiffness control demonstrations are often expensive or infeasible to record. Therefore, we present an approach to learn Stiffness Control from Augmented Position control Experiences (SCAPE) that bypasses this difficulty by transforming position control demonstrations into approximate, suboptimal stiffness control demonstrations. Then, the suboptimality of the augmented demonstrations is addressed by using complementary techniques that help the agent safely learn from both the demonstrations and reinforcement learning. By using simulation tools and experiments on a robotic testbed, we show that the proposed approach efficiently learns safe manipulation policies and outperforms learned position control policies and several other baseline learning algorithms. △ Less

Submitted 14 September, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

Comments: Accepted at CoRL 2021

arXiv:2102.02559 [pdf, other]

doi 10.1103/PhysRevB.103.125430

Perturbation of charge density waves in 1T-TiSe$_2$

Authors: Imrankhan Mulani, Umashankar Rajput, Luminita Harnagea, Aparna Deshpande

Abstract: In this study, using low-temperature scanning tunneling microscopy (STM), we focus on understanding the native defects in pristine \textit{1T}-TiSe$_2$ at the atomic scale. We probe how they perturb the charge density waves (CDWs) and lead to local domain formation. These defects influence the correlation length of CDWs. We establish a connection between suppression of CDWs, Ti intercalation, and… ▽ More In this study, using low-temperature scanning tunneling microscopy (STM), we focus on understanding the native defects in pristine \textit{1T}-TiSe$_2$ at the atomic scale. We probe how they perturb the charge density waves (CDWs) and lead to local domain formation. These defects influence the correlation length of CDWs. We establish a connection between suppression of CDWs, Ti intercalation, and show how this supports the exciton condensation model of CDW formation in \textit{1T}-TiSe$_2$. △ Less

Submitted 18 February, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

Journal ref: Phys. Rev. B 103, 125430 (2021)

arXiv:2102.00084 [pdf, other]

A linearized framework and a new benchmark for model selection for fine-tuning

Authors: Aditya Deshpande, Alessandro Achille, Avinash Ravichandran, Hao Li, Luca Zancato, Charless Fowlkes, Rahul Bhotika, Stefano Soatto, Pietro Perona

Abstract: Fine-tuning from a collection of models pre-trained on different domains (a "model zoo") is emerging as a technique to improve test accuracy in the low-data regime. However, model selection, i.e. how to pre-select the right model to fine-tune from a model zoo without performing any training, remains an open topic. We use a linearized framework to approximate fine-tuning, and introduce two new base… ▽ More Fine-tuning from a collection of models pre-trained on different domains (a "model zoo") is emerging as a technique to improve test accuracy in the low-data regime. However, model selection, i.e. how to pre-select the right model to fine-tune from a model zoo without performing any training, remains an open topic. We use a linearized framework to approximate fine-tuning, and introduce two new baselines for model selection -- Label-Gradient and Label-Feature Correlation. Since all model selection algorithms in the literature have been tested on different use-cases and never compared directly, we introduce a new comprehensive benchmark for model selection comprising of: i) A model zoo of single and multi-domain models, and ii) Many target tasks. Our benchmark highlights accuracy gain with model zoo compared to fine-tuning Imagenet models. We show our model selection baseline can select optimal models to fine-tune in few selections and has the highest ranking correlation to fine-tuning accuracy compared to existing algorithms. △ Less

Submitted 29 January, 2021; originally announced February 2021.

Comments: 14 pages

arXiv:2101.12134 [pdf]

doi 10.1109/TNS.2021.3132946

A Comparative Study of Straight-Strip and Zigzag-Interleaved Anode Patterns for MPGD Readouts

Authors: C. Perez-Lara, S. Aune, B. Azmoun, K. Dehmelt, A. Deshpande, W. Fan, P. Garg, T. K. Hemmick, M. Kebbiri, A. Kiselev, I. Mandjavidze, M. L. Purschke, M. Revolle, M. Vandenbroucke, C. Woody

Abstract: Due to their simplicity and versatility of design, straight strip or rectangular pad anode structures are frequently employed with micro-pattern gas detectors to reconstruct high precision space points for various tracking applications. The particle impact point is typically determined by interpolating the charge collected by several neighboring pads. However, to effectively extract the inherent p… ▽ More Due to their simplicity and versatility of design, straight strip or rectangular pad anode structures are frequently employed with micro-pattern gas detectors to reconstruct high precision space points for various tracking applications. The particle impact point is typically determined by interpolating the charge collected by several neighboring pads. However, to effectively extract the inherent positional information, the lateral spacing of the straight pads must be significantly smaller than the extent of the charge cloud. In contrast, highly interleaved anode patterns, such as zigzags, can adequately sample the charge with a pitch comparable to the size of the charge cloud or even larger. This has the considerable advantage of providing the same performance while requiring far fewer instrumented channels. Additionally, the geometric parameters defining such zigzag structures may be tuned to provide a uniform detector response without the need for so-called pad response functions, while simultaneously maintaining excellent position resolution. We have measured the position resolution of a variety of zigzag shaped anode patterns optimized for various MPGDs, including GEM, Micromegas, and micro-RWELL and compared this performance to the same detectors equipped with straight pads of varying pitch. We report on the performance results of each readout structure, evaluated under identical conditions in a test beam. △ Less

Submitted 28 January, 2021; originally announced January 2021.

arXiv:2012.11448 [pdf, other]

The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective

Authors: Naman Goel, Alfonso Amayuelas, Amit Deshpande, Amit Sharma

Abstract: Training datasets for machine learning often have some form of missingness. For example, to learn a model for deciding whom to give a loan, the available training data includes individuals who were given a loan in the past, but not those who were not. This missingness, if ignored, nullifies any fairness guarantee of the training procedure when the model is deployed. Using causal graphs, we charact… ▽ More Training datasets for machine learning often have some form of missingness. For example, to learn a model for deciding whom to give a loan, the available training data includes individuals who were given a loan in the past, but not those who were not. This missingness, if ignored, nullifies any fairness guarantee of the training procedure when the model is deployed. Using causal graphs, we characterize the missingness mechanisms in different real-world scenarios. We show conditions under which various distributions, used in popular fairness algorithms, can or can not be recovered from the training data. Our theoretical results imply that many of these algorithms can not guarantee fairness in practice. Modeling missingness also helps to identify correct design principles for fair algorithms. For example, in multi-stage settings where decisions are made in multiple screening rounds, we use our framework to derive the minimal distributions required to design a fair algorithm. Our proposed algorithm decentralizes the decision-making process and still achieves similar performance to the optimal algorithm that requires centralization and non-recoverable distributions. △ Less

Submitted 21 December, 2020; originally announced December 2020.

Comments: To appear in the Proceedings of AAAI 2021

arXiv:2012.06033 [pdf, ps, other]

Autocatalytic systems and recombination: a reaction network perspective

Authors: Gheorghe Craciun, Abhishek Deshpande, Badal Joshi, Polly Y. Yu

Abstract: Autocatalytic systems are very often incorporated in the "origin of life" models, a connection that has been analyzed in the context of the classical hypercycles introduced by Manfred Eigen. We investigate the dynamics of certain networks called bimolecular autocatalytic systems. In particular, we consider the dynamics corresponding to the relative populations in these networks, and show that they… ▽ More Autocatalytic systems are very often incorporated in the "origin of life" models, a connection that has been analyzed in the context of the classical hypercycles introduced by Manfred Eigen. We investigate the dynamics of certain networks called bimolecular autocatalytic systems. In particular, we consider the dynamics corresponding to the relative populations in these networks, and show that they can be analyzed by studying well-chosen autonomous polynomial dynamical systems. Moreover, we find that one can use results from reaction network theory to prove persistence and permanence of several types of bimolecular autocatalytic systems called autocatalytic recombination networks. △ Less

Submitted 10 December, 2020; originally announced December 2020.

Comments: 24 pages, 6 figures

MSC Class: 37N25; 80A30; 92C45; 92E20; 14M25

arXiv:2012.04672 [pdf, other]

Euclid: Forecasts for $k$-cut $3 \times 2$ Point Statistics

Authors: P. L. Taylor, T. Kitching, V. F. Cardone, A. Ferté, E. M. Huff, F. Bernardeau, J. Rhodes, A. C. Deshpande, I. Tutusaus, A. Pourtsidou, S. Camera, C. Carbone, S. Casas, M. Martinelli, V. Pettorino, Z. Sakr, D. Sapone, V. Yankelevich, N. Auricchio, A. Balestra, C. Bodendorf, D. Bonino, A. Boucaud, E. Branchini, M. Brescia , et al. (70 additional authors not shown)

Abstract: Modelling uncertainties at small scales, i.e. high $k$ in the power spectrum $P(k)$, due to baryonic feedback, nonlinear structure growth and the fact that galaxies are biased tracers poses a significant obstacle to fully leverage the constraining power of the {\it Euclid} wide-field survey. $k$-cut cosmic shear has recently been proposed as a method to optimally remove sensitivity to these scales… ▽ More Modelling uncertainties at small scales, i.e. high $k$ in the power spectrum $P(k)$, due to baryonic feedback, nonlinear structure growth and the fact that galaxies are biased tracers poses a significant obstacle to fully leverage the constraining power of the {\it Euclid} wide-field survey. $k$-cut cosmic shear has recently been proposed as a method to optimally remove sensitivity to these scales while preserving usable information. In this paper we generalise the $k$-cut cosmic shear formalism to $3 \times 2$ point statistics and estimate the loss of information for different $k$-cuts in a $3 \times 2$ point analysis of the {\it Euclid} data. Extending the Fisher matrix analysis of~\citet{blanchard2019euclid}, we assess the degradation in constraining power for different $k$-cuts. We work in the idealised case and assume the galaxy bias is linear, the covariance is Gaussian, while neglecting uncertainties due to photo-z errors and baryonic feedback. We find that taking a $k$-cut at $2.6 \ h \ {\rm Mpc} ^{-1}$ yields a dark energy Figure of Merit (FOM) of 1018. This is comparable to taking a weak lensing cut at $\ell = 5000$ and a galaxy clustering and galaxy-galaxy lensing cut at $\ell = 3000$ in a traditional $3 \times 2$ point analysis. We also find that the fraction of the observed galaxies used in the photometric clustering part of the analysis is one of the main drivers of the FOM. Removing $50 \% \ (90 \%)$ of the clustering galaxies decreases the FOM by $19 \% \ (62 \%)$. Given that the FOM depends so heavily on the fraction of galaxies used in the clustering analysis, extensive efforts should be made to handle the real-world systematics present when extending the analysis beyond the luminous red galaxy (LRG) sample. △ Less

Submitted 20 July, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

Comments: 10 pages, 5 figures. Accepted by the Open Journal of Astrophysics

arXiv:2011.14187 [pdf, other]

doi 10.1103/PhysRevD.103.032007

Transverse momentum dependent forward neutron single spin asymmetries in transversely polarized $p$$+$$p$ collisions at $\sqrt{s}=200$ GeV

Authors: U. A. Acharya, C. Aidala, Y. Akiba, M. Alfred, V. Andrieux, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, N. S. Bandara, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, R. Belmont, A. Berdnikov, Y. Berdnikov, L. Bichon, B. Blankenship, D. S. Blau, J. S. Bok, V. Borisov, M. L. Brooks, J. Bryslawskyj, V. Bumazhnov , et al. (289 additional authors not shown)

Abstract: In 2015, the PHENIX collaboration has measured very forward ($η>6.8$) single-spin asymmetries of inclusive neutrons in transversely polarized proton-proton and proton-nucleus collisions at a center of mass energy of 200 GeV. A previous publication from this data set concentrated on the nuclear dependence of such asymmetries. In this measurement the explicit transverse-momentum dependence of inclus… ▽ More In 2015, the PHENIX collaboration has measured very forward ($η>6.8$) single-spin asymmetries of inclusive neutrons in transversely polarized proton-proton and proton-nucleus collisions at a center of mass energy of 200 GeV. A previous publication from this data set concentrated on the nuclear dependence of such asymmetries. In this measurement the explicit transverse-momentum dependence of inclusive neutron single spin asymmetries for proton-proton collisions is extracted using a bootstrap**-unfolding technique on the transverse momenta. This explicit transverse-momentum dependence will help improve the understanding of the mechanisms that create these asymmetries. △ Less

Submitted 6 February, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

Comments: 314 authors from 66 institutions, 8 pages, 3 figures, 1 table, 2015 data. v2 is version accepted for publication in Physical Review D. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. D 103, 032007 (2021)

arXiv:2011.14170 [pdf, other]

doi 10.1103/PhysRevD.103.052009

Transverse single-spin asymmetries of midrapidity $π^0$ and $η$ mesons in polarized $p$$+$$p$ collisions at $\sqrt{s}=200$ GeV

Authors: U. A. Acharya, C. Aidala, Y. Akiba, M. Alfred, V. Andrieux, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, N. S. Bandara, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, R. Belmont, A. Berdnikov, Y. Berdnikov, L. Bichon, B. Blankenship, D. S. Blau, J. S. Bok, V. Borisov, M. L. Brooks, J. Bryslawskyj, V. Bumazhnov , et al. (289 additional authors not shown)

Abstract: We present a measurement of the transverse single-spin asymmetry for $π^0$ and $η$ mesons in $p^\uparrow$$+$$p$ collisions in the pseudorapidity range $|η|<0.35$ and at a center-of-mass energy of 200 GeV with the PHENIX detector at the Relativistic Heavy Ion Collider. In comparison with previous measurements in this kinematic region, these results have a factor of 3 smaller uncertainties. As hadro… ▽ More We present a measurement of the transverse single-spin asymmetry for $π^0$ and $η$ mesons in $p^\uparrow$$+$$p$ collisions in the pseudorapidity range $|η|<0.35$ and at a center-of-mass energy of 200 GeV with the PHENIX detector at the Relativistic Heavy Ion Collider. In comparison with previous measurements in this kinematic region, these results have a factor of 3 smaller uncertainties. As hadrons, $π^0$ and $η$ mesons are sensitive to both initial- and final-state nonperturbative effects for a mix of parton flavors. Comparisons of the differences in their transverse single-spin asymmetries have the potential to disentangle the possible effects of strangeness, isospin, or mass. These results can constrain the twist-3 trigluon collinear correlation function as well as the gluon Sivers function. △ Less

Submitted 26 February, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

Comments: 314 authors from 66 institutions, 10 pages, 5 figures, 2 tables, 2015 data. v2 is version accepted for publication in Phys. Rev. D. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. D 103, 052009 (2021)

arXiv:2011.09109 [pdf, ps, other]

On Simultaneous Long-Short Stock Trading Controllers with Cross-Coupling

Authors: Atul Deshpande, John A Gubner, B. Ross Barmish

Abstract: The Simultaneous Long-Short(SLS) controller for trading a single stock is known to guarantee positive expected value of the resulting gain-loss function with respect to a large class of stock price dynamics. In the literature, this is known as the Robust Positive Expectation(RPE)property. An obvious way to extend this theory to the trading of two stocks is to trade each one of them using its own i… ▽ More The Simultaneous Long-Short(SLS) controller for trading a single stock is known to guarantee positive expected value of the resulting gain-loss function with respect to a large class of stock price dynamics. In the literature, this is known as the Robust Positive Expectation(RPE)property. An obvious way to extend this theory to the trading of two stocks is to trade each one of them using its own independent SLS controller. Motivated by the fact that such a scheme does not exploit any correlation between the two stocks, we study the case when the relative sign between the drifts of the two stocks is known. The main contributions of this paper are three-fold: First, we put forward a novel architecture in which we cross-couple two SLS controllers for the two-stock case. Second, we derive a closed-form expression for the expected value of the gain-loss function. Third, we use this closed-form expression to prove that the RPE property is guaranteed with respect to a large class of stock-price dynamics. When more information over and above the relative sign is assumed, additional benefits of the new architecture are seen. For example, when bounds or precise values for the means and covariances of the stock returns are included in the model, numerical simulations suggest that our new controller can achieve lower trading risk than a pair of decoupled SLS controllers for the same level of expected trading gain. △ Less

Submitted 18 November, 2020; originally announced November 2020.

Comments: Presented at IFAC World Congress, 2020. Will appear in IFAC-PapersOnline

arXiv:2011.02323 [pdf, other]

Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages

Authors: Kushal Jain, Adwait Deshpande, Kumar Shridhar, Felix Laumann, Ayushman Dash

Abstract: Language models based on the Transformer architecture have achieved state-of-the-art performance on a wide range of NLP tasks such as text classification, question-answering, and token classification. However, this performance is usually tested and reported on high-resource languages, like English, French, Spanish, and German. Indian languages, on the other hand, are underrepresented in such bench… ▽ More Language models based on the Transformer architecture have achieved state-of-the-art performance on a wide range of NLP tasks such as text classification, question-answering, and token classification. However, this performance is usually tested and reported on high-resource languages, like English, French, Spanish, and German. Indian languages, on the other hand, are underrepresented in such benchmarks. Despite some Indian languages being included in training multilingual Transformer models, they have not been the primary focus of such work. In order to evaluate the performance on Indian languages specifically, we analyze these language models through extensive experiments on multiple downstream tasks in Hindi, Bengali, and Telugu language. Here, we compare the efficacy of fine-tuning model parameters of pre-trained models against that of training a language model from scratch. Moreover, we empirically argue against the strict dependency between the dataset size and model performance, but rather encourage task-specific model and method selection. We achieve state-of-the-art performance on Hindi and Bengali languages for text classification task. Finally, we present effective strategies for handling the modeling of Indian languages and we release our model checkpoints for the community : https://huggingface.co/neuralspace-reverie. △ Less

Submitted 4 November, 2020; originally announced November 2020.

Comments: Accepted at ML-RSA @ NeurIPS 2020

arXiv:2011.00781 [pdf, other]

Searching k-Optimal Goals for an Orienteering Problem on a Specialized Graph with Budget Constraints

Authors: Abhinav Sharma, Advait Deshpande, Yanming Wang, Xinyi Xu, Prashan Madumal, Anbin Hou

Abstract: We propose a novel non-randomized anytime orienteering algorithm for finding k-optimal goals that maximize reward on a specialized graph with budget constraints. This specialized graph represents a real-world scenario which is analogous to an orienteering problem of finding k-most optimal goal states. We propose a novel non-randomized anytime orienteering algorithm for finding k-optimal goals that maximize reward on a specialized graph with budget constraints. This specialized graph represents a real-world scenario which is analogous to an orienteering problem of finding k-most optimal goal states. △ Less

Submitted 2 November, 2020; originally announced November 2020.

arXiv:2010.13290 [pdf, other]

On reaction network implementations of neural networks

Authors: David F. Anderson, Badal Joshi, Abhishek Deshpande

Abstract: This paper is concerned with the utilization of deterministically modeled chemical reaction networks for the implementation of (feed-forward) neural networks. We develop a general mathematical framework and prove that the ordinary differential equations (ODEs) associated with certain reaction network implementations of neural networks have desirable properties including (i) existence of unique pos… ▽ More This paper is concerned with the utilization of deterministically modeled chemical reaction networks for the implementation of (feed-forward) neural networks. We develop a general mathematical framework and prove that the ordinary differential equations (ODEs) associated with certain reaction network implementations of neural networks have desirable properties including (i) existence of unique positive fixed points that are smooth in the parameters of the model (necessary for gradient descent), and (ii) fast convergence to the fixed point regardless of initial condition (necessary for efficient implementation). We do so by first making a connection between neural networks and fixed points for systems of ODEs, and then by constructing reaction networks with the correct associated set of ODEs. We demonstrate the theory by constructing a reaction network that implements a neural network with a smoothed ReLU activation function, though we also demonstrate how to generalize the construction to allow for other activation functions (each with the desirable properties listed previously). As there are multiple types of "networks" utilized in this paper, we also give a careful introduction to both reaction networks and neural networks, in order to disambiguate the overlap** vocabulary in the two settings and to clearly highlight the role of each network's properties. △ Less

Submitted 8 March, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

Comments: Small edits

arXiv:2010.07749 [pdf, other]

doi 10.21105/astro.2010.07749

Propagating residual biases in masked cosmic shear power spectra

Authors: T. D. Kitching, A. C. Deshpande, P. L. Taylor

Abstract: In this paper we derive a full expression for the propagation of weak lensing shape measurement biases into cosmic shear power spectra including the effect of missing data. We show using simulations that terms higher than first order in bias parameters can be ignored and the impact of biases can be captured by terms dependent only on the mean of the multiplicative bias field. We identify that the… ▽ More In this paper we derive a full expression for the propagation of weak lensing shape measurement biases into cosmic shear power spectra including the effect of missing data. We show using simulations that terms higher than first order in bias parameters can be ignored and the impact of biases can be captured by terms dependent only on the mean of the multiplicative bias field. We identify that the B-mode power contains information on the multiplicative bias. We find that without priors on the residual multiplicative bias $δm$ and stochastic ellipticity variance $σ_e$ that constraints on the amplitude of the cosmic shear power spectrum are completely degenerate, and that when applying priors the constrained amplitude $A$ is slightly biased low via a classic marginalisation paradox. Using all-sky Gaussian random field simulations we find that the combination of $(1+2δm)A$ is unbiased for a joint EE and BB power spectrum likelihood if the error and mean (precision and accuracy) of the stochastic ellipticity variance is known to better than $σ(σ_e)\leq 0.05$ and $Δσ_e\leq 0.01$, or the multiplicative bias is known to better than $σ(m)\leq 0.07$ and $Δm\leq 0.01$. △ Less

Submitted 14 December, 2020; v1 submitted 15 October, 2020; originally announced October 2020.

Comments: 12 pages, accepted to the Open Journal of Astrophysics, comments welcome

arXiv:2010.06986 [pdf, other]

On the Problem of Underranking in Group-Fair Ranking

Authors: Sruthi Gorantla, Amit Deshpande, Anand Louis

Abstract: Search and recommendation systems, such as search engines, recruiting tools, online marketplaces, news, and social media, output ranked lists of content, products, and sometimes, people. Credit ratings, standardized tests, risk assessments output only a score, but are also used implicitly for ranking. Bias in such ranking systems, especially among the top ranks, can worsen social and economic ineq… ▽ More Search and recommendation systems, such as search engines, recruiting tools, online marketplaces, news, and social media, output ranked lists of content, products, and sometimes, people. Credit ratings, standardized tests, risk assessments output only a score, but are also used implicitly for ranking. Bias in such ranking systems, especially among the top ranks, can worsen social and economic inequalities, polarize opinions, and reinforce stereotypes. On the other hand, a bias correction for minority groups can cause more harm if perceived as favoring group-fair outcomes over meritocracy. In this paper, we formulate the problem of underranking in group-fair rankings, which was not addressed in previous work. Most group-fair ranking algorithms post-process a given ranking and output a group-fair ranking. We define underranking based on how close the group-fair rank of each item is to its original rank, and prove a lower bound on the trade-off achievable for simultaneous underranking and group fairness in ranking. We give a fair ranking algorithm that takes any given ranking and outputs another ranking with simultaneous underranking and group fairness guarantees comparable to the lower bound we prove. Our algorithm works with group fairness constraints for any number of groups. Our experimental results confirm the theoretical trade-off between underranking and group fairness, and also show that our algorithm achieves the best of both when compared to the state-of-the-art baselines. △ Less

Submitted 18 February, 2021; v1 submitted 24 September, 2020; originally announced October 2020.

Comments: 27 pages

arXiv:2010.03831 [pdf, other]

QUIC-EST: A Transmission Scheme to Maximize VoI of Multi-Stream Correlated Data Flows

Authors: Federico Chiariotti, Anay Ajit Deshpande, Marco Giordani, Kostantinos Antonakoglou, Andrea Zanella, Toktam Mahmoodi

Abstract: New advanced applications, such as autonomous driving and haptic communication, require to transmit multi-sensory data and require low latency and high reliability. These applications include. Existing implementations for such services have mostly relied on ad hoc scheduling and send rate adaptation mechanisms, implemented directly by the application and running over UDP. In this work, we propose… ▽ More New advanced applications, such as autonomous driving and haptic communication, require to transmit multi-sensory data and require low latency and high reliability. These applications include. Existing implementations for such services have mostly relied on ad hoc scheduling and send rate adaptation mechanisms, implemented directly by the application and running over UDP. In this work, we propose a transmission scheme that relies on the features of the recently developed QUIC transport protocol, providing reliability where needed, and standardized congestion control, without compromising latency. Furthermore, we propose a scheduler for sensor data transmissions on the transport layer that can exploit the correlations over time and across sensors. This mechanism allows applications to maximize the Value of Information (VoI) of the transmitted data, as we demonstrate through simulations in two realistic application scenarios. △ Less

Submitted 8 October, 2020; originally announced October 2020.

Comments: 7 pages, 6 figures. Submitted for publication to the IEEE

arXiv:2010.02930 [pdf, other]

doi 10.1103/PhysRevX.11.031016

Optimal State Transfer and Entanglement Generation in Power-law Interacting Systems

Authors: Minh C. Tran, Abhinav Deshpande, Andrew Y. Guo, Andrew Lucas, Alexey V. Gorshkov

Abstract: We present an optimal protocol for encoding an unknown qubit state into a multiqubit Greenberger-Horne-Zeilinger-like state and, consequently, transferring quantum information in large systems exhibiting power-law ($1/r^α$) interactions. For all power-law exponents $α$ between $d$ and $2d+1$, where $d$ is the dimension of the system, the protocol yields a polynomial speedup for $α>2d$ and a superp… ▽ More We present an optimal protocol for encoding an unknown qubit state into a multiqubit Greenberger-Horne-Zeilinger-like state and, consequently, transferring quantum information in large systems exhibiting power-law ($1/r^α$) interactions. For all power-law exponents $α$ between $d$ and $2d+1$, where $d$ is the dimension of the system, the protocol yields a polynomial speedup for $α>2d$ and a superpolynomial speedup for $α\leq 2d$, compared to the state of the art. For all $α>d$, the protocol saturates the Lieb-Robinson bounds (up to subpolynomial corrections), thereby establishing the optimality of the protocol and the tightness of the bounds in this regime. The protocol has a wide range of applications, including in quantum sensing, quantum computing, and preparation of topologically ordered states. In addition, the protocol provides a lower bound on the gate count in digital simulations of power-law interacting systems. △ Less

Submitted 1 February, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

Comments: Updated Table I, Additional discussion on a lower bound for the gate count in digital quantum simulation

Journal ref: Phys. Rev. X 11, 031016 (2021)

arXiv:2010.02399 [pdf, other]

Guiding Attention for Self-Supervised Learning with Transformers

Authors: Ameet Deshpande, Karthik Narasimhan

Abstract: In this paper, we propose a simple and effective technique to allow for efficient self-supervised learning with bi-directional Transformers. Our approach is motivated by recent studies demonstrating that self-attention patterns in trained models contain a majority of non-linguistic regularities. We propose a computationally efficient auxiliary loss function to guide attention heads to conform to s… ▽ More In this paper, we propose a simple and effective technique to allow for efficient self-supervised learning with bi-directional Transformers. Our approach is motivated by recent studies demonstrating that self-attention patterns in trained models contain a majority of non-linguistic regularities. We propose a computationally efficient auxiliary loss function to guide attention heads to conform to such patterns. Our method is agnostic to the actual pre-training objective and results in faster convergence of models as well as better performance on downstream tasks compared to the baselines, achieving state of the art results in low-resource settings. Surprisingly, we also find that linguistic properties of attention heads are not necessarily correlated with language modeling performance. △ Less

Submitted 5 October, 2020; originally announced October 2020.

Comments: Accepted to Findings of EMNLP, 2020

arXiv:2010.02316 [pdf, other]

Sentiment Analysis for Reinforcement Learning

Authors: Ameet Deshpande, Eve Fleisig

Abstract: While reinforcement learning (RL) has been successful in natural language processing (NLP) domains such as dialogue generation and text-based games, it typically faces the problem of sparse rewards that leads to slow or no convergence. Traditional methods that use text descriptions to extract only a state representation ignore the feedback inherently present in them. In text-based games, for examp… ▽ More While reinforcement learning (RL) has been successful in natural language processing (NLP) domains such as dialogue generation and text-based games, it typically faces the problem of sparse rewards that leads to slow or no convergence. Traditional methods that use text descriptions to extract only a state representation ignore the feedback inherently present in them. In text-based games, for example, descriptions like "Good Job! You ate the food}" indicate progress, and descriptions like "You entered a new room" indicate exploration. Positive and negative cues like these can be converted to rewards through sentiment analysis. This technique converts the sparse reward problem into a dense one, which is easier to solve. Furthermore, this can enable reinforcement learning without rewards, in which the agent learns entirely from these intrinsic sentiment rewards. This framework is similar to intrinsic motivation, where the environment does not necessarily provide the rewards, but the agent analyzes and realizes them by itself. We find that providing dense rewards in text-based games using sentiment analysis improves performance under some conditions. △ Less

Submitted 5 October, 2020; originally announced October 2020.

Comments: Work in progress

arXiv:2010.00722 [pdf, other]

Evaluating a Generative Adversarial Framework for Information Retrieval

Authors: Ameet Deshpande, Mitesh M. Khapra

Abstract: Recent advances in Generative Adversarial Networks (GANs) have resulted in its widespread applications to multiple domains. A recent model, IRGAN, applies this framework to Information Retrieval (IR) and has gained significant attention over the last few years. In this focused work, we critically analyze multiple components of IRGAN, while providing experimental and theoretical evidence of some of… ▽ More Recent advances in Generative Adversarial Networks (GANs) have resulted in its widespread applications to multiple domains. A recent model, IRGAN, applies this framework to Information Retrieval (IR) and has gained significant attention over the last few years. In this focused work, we critically analyze multiple components of IRGAN, while providing experimental and theoretical evidence of some of its shortcomings. Specifically, we identify issues with the constant baseline term in the policy gradients optimization and show that the generator harms IRGAN's performance. Motivated by our findings, we propose two models influenced by self-contrastive estimation and co-training which outperform IRGAN on two out of the three tasks considered. △ Less

Submitted 1 October, 2020; originally announced October 2020.

arXiv:2009.09154 [pdf, other]

CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes

Authors: Raeid Saqur, Ameet Deshpande

Abstract: The CLEVR dataset has been used extensively in language grounded visual reasoning in Machine Learning (ML) and Natural Language Processing (NLP) domains. We present a graph parser library for CLEVR, that provides functionalities for object-centric attributes and relationships extraction, and construction of structural graph representations for dual modalities. Structural order-invariant representa… ▽ More The CLEVR dataset has been used extensively in language grounded visual reasoning in Machine Learning (ML) and Natural Language Processing (NLP) domains. We present a graph parser library for CLEVR, that provides functionalities for object-centric attributes and relationships extraction, and construction of structural graph representations for dual modalities. Structural order-invariant representations enable geometric learning and can aid in downstream tasks like language grounding to vision, robotics, compositionality, interpretability, and computational grammar construction. We provide three extensible main components - parser, embedder, and visualizer that can be tailored to suit specific learning setups. We also provide out-of-the-box functionality for seamless integration with popular deep graph neural network (GNN) libraries. Additionally, we discuss downstream usage and applications of the library, and how it accelerates research for the NLP research community. △ Less

Submitted 1 October, 2020; v1 submitted 18 September, 2020; originally announced September 2020.

Comments: Accepted at NLP-OSS, EMNLP 2020 (2nd Workshop for Natural Language Processing Open Source Software)

arXiv:2009.01792 [pdf, other]

doi 10.1103/PhysRevD.102.083535

Accessing the high-$\ell$ frontier under the Reduced Shear Approximation with $k$-cut Cosmic Shear

Authors: Anurag C. Deshpande, Peter L. Taylor, Thomas D. Kitching

Abstract: The precision of Stage IV cosmic shear surveys will enable us to probe smaller physical scales than ever before, however, model uncertainties from baryonic physics and non-linear structure formation will become a significant concern. The $k$-cut method -- applying a redshift-dependent $\ell$-cut after making the Bernardeau-Nishimichi-Taruya transform -- can reduce sensitivity to baryonic physics;… ▽ More The precision of Stage IV cosmic shear surveys will enable us to probe smaller physical scales than ever before, however, model uncertainties from baryonic physics and non-linear structure formation will become a significant concern. The $k$-cut method -- applying a redshift-dependent $\ell$-cut after making the Bernardeau-Nishimichi-Taruya transform -- can reduce sensitivity to baryonic physics; allowing Stage IV surveys to include information from increasingly higher $\ell$-modes. Here we address the question of whether it can also mitigate the impact of making the reduced shear approximation; which is also important in the high-$κ$, small-scale regime. The standard procedure for relaxing this approximation requires the repeated evaluation of the convergence bispectrum, and consequently can be prohibitively computationally expensive when included in Monte Carlo analyses. We find that the $k$-cut cosmic shear procedure suppresses the $w_0w_a$CDM cosmological parameter biases expected from the reduced shear approximation for Stage IV experiments, when $\ell$-modes up to $5000$ are probed. The maximum cut required for biases from the reduced shear approximation to be below the threshold of significance is at $k = 5.37 \, h{\rm Mpc}^{-1}$. With this cut, the predicted $1σ$ constraints increase, relative to the case where the correction is directly computed, by less than $10\%$ for all parameters. This represents a significant improvement in constraints compared to the more conservative case where only $\ell$-modes up to 1500 are probed, and no $k$-cut is used. We also repeat this analysis for a hypothetical, comparable kinematic weak lensing survey. The key parts of code used for this analysis are made publicly available. △ Less

Submitted 26 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

Comments: 10 pages, 3 figures. Accepted to Phys. Rev. D. Matches published version. Code available at https://github.com/desh1701/k-cut_reduced_shear

Journal ref: Phys. Rev. D 102, 083535 (2020)

arXiv:2007.11582 [pdf, other]

doi 10.1103/PRXQuantum.3.040327

Importance of the spectral gap in estimating ground-state energies

Authors: Abhinav Deshpande, Alexey V. Gorshkov, Bill Fefferman

Abstract: The field of quantum Hamiltonian complexity lies at the intersection of quantum many-body physics and computational complexity theory, with deep implications to both fields. The main object of study is the LocalHamiltonian problem, which is concerned with estimating the ground-state energy of a local Hamiltonian and is complete for the class QMA, a quantum generalization of the class NP. A major c… ▽ More The field of quantum Hamiltonian complexity lies at the intersection of quantum many-body physics and computational complexity theory, with deep implications to both fields. The main object of study is the LocalHamiltonian problem, which is concerned with estimating the ground-state energy of a local Hamiltonian and is complete for the class QMA, a quantum generalization of the class NP. A major challenge in the field is to understand the complexity of the LocalHamiltonian problem in more physically natural parameter regimes. One crucial parameter in understanding the ground space of any Hamiltonian in many-body physics is the spectral gap, which is the difference between the smallest two eigenvalues. Despite its importance in quantum many-body physics, the role played by the spectral gap in the complexity of the LocalHamiltonian is less well-understood. In this work, we make progress on this question by considering the precise regime, in which one estimates the ground-state energy to within inverse exponential precision. Computing ground-state energies precisely is a task that is important for quantum chemistry and quantum many-body physics. In the setting of inverse-exponential precision, there is a surprising result that the complexity of LocalHamiltonian is magnified from QMA to PSPACE, the class of problems solvable in polynomial space. We clarify the reason behind this boost in complexity. Specifically, we show that the full complexity of the high precision case only comes about when the spectral gap is exponentially small. As a consequence of the proof techniques developed to show our results, we uncover important implications for the representability and circuit complexity of ground states of local Hamiltonians, the theory of uniqueness of quantum witnesses, and techniques for the amplification of quantum witnesses in the presence of postselection. △ Less

Submitted 9 December, 2022; v1 submitted 22 July, 2020; originally announced July 2020.

Comments: 32 pages, 4 figures. Comments welcome. v2: close to published version

Journal ref: PRX Quantum 3, 040327 (2022)

arXiv:2007.07793 [pdf, other]

Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors

Authors: Aditya M. Deshpande, Rumit Kumar, Ali A. Minai, Manish Kumar

Abstract: In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadco… ▽ More In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadcopter (comparatively simple UAV design without thrust vectoring). This approach allows learning a control policy for systems with multiple inputs and multiple outputs. The performance of the learned policy is evaluated by physics-based simulations for the tasks of hovering and way-point navigation. The flight simulations utilize a flight controller based on reinforcement learning without any additional PID components. The results show faster learning with the presented approach as opposed to learning the control policy from scratch for this new UAV design created by modifications in a conventional quadcopter, i.e., the addition of more degrees of freedom (4-actuators in conventional quadcopter to 8-actuators in tilt-rotor quadcopter). We demonstrate the robustness of our learned policy by showing the recovery of the tilt-rotor platform in the simulation from various non-static initial conditions in order to reach a desired state. The developmental policy for the tilt-rotor UAV also showed superior fault tolerance when compared with the policy learned from the scratch. The results show the ability of the presented approach to bootstrap the learned behavior from a simpler system (lower-dimensional action-space) to a more complex robot (comparatively higher-dimensional action-space) and reach better performance faster. △ Less

Submitted 15 July, 2020; originally announced July 2020.

Comments: 10 pages, 8 figures, Accepted in Dynamic Systems and Control Conference (https://event.asme.org/DSCC)

arXiv:2007.04798 [pdf]

doi 10.1103/PhysRevApplied.13.044075

Growth, Properties, and Applications of Pulsed Laser Deposited Nanolaminate Ti3AlC2 Thin Films

Authors: Abhijit Biswas, Arundhati Sengupta, Umashankar Rajput, Sachin Kumar Singh, Vivek Antad, Sk Mujaffar Hossain, Swati Parmar, Dibyata Rout, Aparna Deshpande, Sunil Nair, Satishchandra Ogale

Abstract: Recently, nanolaminated ternary carbides have attracted immense interest due to the concomitant presence of both ceramic and metallic properties. Here, we grow nanolaminate Ti3AlC2 thin films by pulsed laser deposition on c-axis-oriented sapphire substrates and, surprisingly, the films are found to be highly oriented along the (103) axis normal to the film plane, rather than the (000l) orientation… ▽ More Recently, nanolaminated ternary carbides have attracted immense interest due to the concomitant presence of both ceramic and metallic properties. Here, we grow nanolaminate Ti3AlC2 thin films by pulsed laser deposition on c-axis-oriented sapphire substrates and, surprisingly, the films are found to be highly oriented along the (103) axis normal to the film plane, rather than the (000l) orientation. Multiple characterization techniques are employed to explore the structural and chemical quality of these films, the electrical and optical properties, and the device functionalities. The 80-nm thick Ti3AlC2 film is highly conducting at room temperature (resistivity of 50 micro ohm-cm), and a very-low-temperature coefficient of resistivity. The ultrathin (2 nm) Ti3AlC2 film has fairly good optical transparency and high conductivity at room temperature (sheet resistance of 735 ohm). Scanning tunneling microscopy reveals the metallic characteristics (with finite density of states at the Fermi level) at room temperature. The metal-semiconductor junction of the p-type Ti3AlC2 film and n-Si show the expected rectification (diode) characteristics, in contrast to the ohmic contact behavior in the case of Ti3AlC2 on p-Si. A triboelectric-nanogenerator-based touch-sensing device, comprising of the Ti3AlC2 film, shows a very impressive peak-to-peak open-circuit output voltage of 80 V. These observations reveal that pulsed laser deposited Ti3AlC2 thin films have excellent potential for applications in multiple domains, such as bottom electrodes, resistors for high-precision measurements, Schottky diodes, ohmic contacts, fairly transparent ultrathin conductors, and next-generation biomechanical touch sensors for energy harvesting. △ Less

Submitted 9 July, 2020; originally announced July 2020.

Comments: 41 pages, 8 main figures, Published in Physical Review Applied

Journal ref: Phys. Rev. Applied 13, 044075 (2020)

arXiv:2007.00662 [pdf, other]

doi 10.1103/PhysRevResearch.4.L042016

Implementing a Fast Unbounded Quantum Fanout Gate Using Power-Law Interactions

Authors: Andrew Y. Guo, Abhinav Deshpande, Su-Kuan Chu, Zachary Eldredge, Przemyslaw Bienias, Dhruv Devulapalli, Yuan Su, Andrew M. Childs, Alexey V. Gorshkov

Abstract: The standard circuit model for quantum computation presumes the ability to directly perform gates between arbitrary pairs of qubits, which is unlikely to be practical for large-scale experiments. Power-law interactions with strength decaying as $1/r^α$ in the distance $r$ provide an experimentally realizable resource for information processing, whilst still retaining long-range connectivity. We le… ▽ More The standard circuit model for quantum computation presumes the ability to directly perform gates between arbitrary pairs of qubits, which is unlikely to be practical for large-scale experiments. Power-law interactions with strength decaying as $1/r^α$ in the distance $r$ provide an experimentally realizable resource for information processing, whilst still retaining long-range connectivity. We leverage the power of these interactions to implement a fast quantum fanout gate with an arbitrary number of targets. Our implementation allows the quantum Fourier transform (QFT) and Shor's algorithm to be performed on a $D$-dimensional lattice in time logarithmic in the number of qubits for interactions with $α\le D$. As a corollary, we show that power-law systems with $α\le D$ are difficult to simulate classically even for short times, under a standard assumption that factoring is classically intractable. Complementarily, we develop a new technique to give a general lower bound, linear in the size of the system, on the time required to implement the QFT and the fanout gate in systems that are constrained by a linear light cone. This allows us to prove an asymptotically tighter lower bound for long-range systems than is possible with previously available techniques. △ Less

Submitted 1 July, 2020; originally announced July 2020.

Comments: 6 pages, 1 figure

Journal ref: Physical Review Research 4, L042016 (2022)

arXiv:2006.16573 [pdf, ps, other]

Subspace approximation with outliers

Authors: Amit Deshpande, Rameshwar Pratap

Abstract: The subspace approximation problem with outliers, for given $n$ points in $d$ dimensions $x_{1},\ldots, x_{n} \in R^{d}$, an integer $1 \leq k \leq d$, and an outlier parameter $0 \leq α\leq 1$, is to find a $k$-dimensional linear subspace of $R^{d}$ that minimizes the sum of squared distances to its nearest $(1-α)n$ points. More generally, the $\ell_{p}$ subspace approximation problem with outlie… ▽ More The subspace approximation problem with outliers, for given $n$ points in $d$ dimensions $x_{1},\ldots, x_{n} \in R^{d}$, an integer $1 \leq k \leq d$, and an outlier parameter $0 \leq α\leq 1$, is to find a $k$-dimensional linear subspace of $R^{d}$ that minimizes the sum of squared distances to its nearest $(1-α)n$ points. More generally, the $\ell_{p}$ subspace approximation problem with outliers minimizes the sum of $p$-th powers of distances instead of the sum of squared distances. Even the case of robust PCA is non-trivial, and previous work requires additional assumptions on the input. Any multiplicative approximation algorithm for the subspace approximation problem with outliers must solve the robust subspace recovery problem, a special case in which the $(1-α)n$ inliers in the optimal solution are promised to lie exactly on a $k$-dimensional linear subspace. However, robust subspace recovery is Small Set Expansion (SSE)-hard. We show how to extend dimension reduction techniques and bi-criteria approximations based on sampling to the problem of subspace approximation with outliers. To get around the SSE-hardness of robust subspace recovery, we assume that the squared distance error of the optimal $k$-dimensional subspace summed over the optimal $(1-α)n$ inliers is at least $δ$ times its squared-error summed over all $n$ points, for some $0 < δ\leq 1 - α$. With this assumption, we give an efficient algorithm to find a subset of $poly(k/ε) \log(1/δ) \log\log(1/δ)$ points whose span contains a $k$-dimensional subspace that gives a multiplicative $(1+ε)$-approximation to the optimal solution. The running time of our algorithm is linear in $n$ and $d$. Interestingly, our results hold even when the fraction of outliers $α$ is large, as long as the obvious condition $0 < δ\leq 1 - α$ is satisfied. △ Less

Submitted 30 June, 2020; originally announced June 2020.

arXiv:2006.15686 [pdf, other]

Quaternion Feedback Based Autonomous Control of a Quadcopter UAV with Thrust Vectoring Rotors

Authors: Rumit Kumar, Mahathi Bhargavapuri, Aditya M. Deshpande, Siddharth Sridhar, Kelly Cohen, Manish Kumar

Abstract: In this paper, we present an autonomous flight controller for a quadcopter with thrust vectoring capabilities. This UAV falls in the category of multirotors with tilt-motion enabled rotors. Since the vehicle considered is over-actuated in nature, the dynamics and control allocation have to be analysed carefully. Moreover, the possibility of hovering at large attitude maneuvers of this novel vehicl… ▽ More In this paper, we present an autonomous flight controller for a quadcopter with thrust vectoring capabilities. This UAV falls in the category of multirotors with tilt-motion enabled rotors. Since the vehicle considered is over-actuated in nature, the dynamics and control allocation have to be analysed carefully. Moreover, the possibility of hovering at large attitude maneuvers of this novel vehicle requires singularity-free attitude control. Hence, quaternion state feedback is utilized to compute the control commands for the UAV motors while avoiding the gimbal lock condition experienced by Euler angle based controllers. The quaternion implementation also reduces the overall complexity of state estimation due to absence of trigonometric parameters. The quadcopter dynamic model and state space is utilized to design the attitude controller and control allocation for the UAV. The control allocation, in particular, is derived by linearizing the system about hover condition. This mathematical method renders the control allocation more accurate than existing approaches. Lyapunov stability analysis of the attitude controller is shown to prove global stability. The quaternion feedback attitude controller is commanded by an outer position controller loop which generates rotor-tilt and desired quaternions commands for the system. The performance of the UAV is evaluated by numerical simulations for tracking attitude step commands and for following a way-point navigation mission. △ Less

Submitted 28 June, 2020; originally announced June 2020.

Comments: Accepted for publication in American Controls Conference 2020, 6-Pages, 10 figures

arXiv:2006.11604 [pdf, other]

How do SGD hyperparameters in natural training affect adversarial robustness?

Authors: Sandesh Kamath, Amit Deshpande, K V Subrahmanyam

Abstract: Learning rate, batch size and momentum are three important hyperparameters in the SGD algorithm. It is known from the work of Jastrzebski et al. arXiv:1711.04623 that large batch size training of neural networks yields models which do not generalize well. Yao et al. arXiv:1802.08241 observe that large batch training yields models that have poor adversarial robustness. In the same paper, the author… ▽ More Learning rate, batch size and momentum are three important hyperparameters in the SGD algorithm. It is known from the work of Jastrzebski et al. arXiv:1711.04623 that large batch size training of neural networks yields models which do not generalize well. Yao et al. arXiv:1802.08241 observe that large batch training yields models that have poor adversarial robustness. In the same paper, the authors train models with different batch sizes and compute the eigenvalues of the Hessian of loss function. They observe that as the batch size increases, the dominant eigenvalues of the Hessian become larger. They also show that both adversarial training and small-batch training leads to a drop in the dominant eigenvalues of the Hessian or lowering its spectrum. They combine adversarial training and second order information to come up with a new large-batch training algorithm and obtain robust models with good generalization. In this paper, we empirically observe the effect of the SGD hyperparameters on the accuracy and adversarial robustness of networks trained with unperturbed samples. Jastrzebski et al. considered training models with a fixed learning rate to batch size ratio. They observed that higher the ratio, better is the generalization. We observe that networks trained with constant learning rate to batch size ratio, as proposed in Jastrzebski et al., yield models which generalize well and also have almost constant adversarial robustness, independent of the batch size. We observe that momentum is more effective with varying batch sizes and a fixed learning rate than with constant learning rate to batch size ratio based SGD training. △ Less

Submitted 20 June, 2020; originally announced June 2020.

Comments: Preliminary version presented in ICML 2019 Workshop on "Understanding and Improving Generalization in Deep Learning" as "On Adversarial Robustness of Small vs Large Batch Training"

arXiv:2006.08735 [pdf, ps, other]

Minimal invariant regions and minimal globally attracting regions for toric differential inclusions

Authors: Yida Ding, Abhishek Deshpande, Gheorghe Craciun

Abstract: Toric differential inclusions occur as key dynamical systems in the context of the Global Attractor Conjecture. We introduce the notions of minimal invariant regions and minimal globally attracting regions for toric differential inclusions. We describe a procedure for constructing explicitly the minimal invariant and minimal globally attracting regions for two-dimensional toric differential inclus… ▽ More Toric differential inclusions occur as key dynamical systems in the context of the Global Attractor Conjecture. We introduce the notions of minimal invariant regions and minimal globally attracting regions for toric differential inclusions. We describe a procedure for constructing explicitly the minimal invariant and minimal globally attracting regions for two-dimensional toric differential inclusions. In particular, we obtain invariant regions and globally attracting regions for two-dimensional weakly reversible or endotactic dynamical systems (even if they have time-dependent parameters). △ Less

Submitted 15 June, 2020; originally announced June 2020.

Comments: 29 pages, 15 figures

MSC Class: 37N25; 80A30; 92C45; 92E20; 14M25

arXiv:2006.08224 [pdf, other]

Needles in the 'Sheet'stack: Augmented Analytics to get Insights from Spreadsheets

Authors: Medha Atre, Anand Deshpande, Reshma Godse, Pooja Deokar, Sandip Moharir, Dhruva Ray, Akshay Chitlangia, Trupti Phadnis, Yugansh Goyal

Abstract: Business intelligence (BI) tools for database analytics have come a long way and nowadays also provide ready insights or visual query explorations, e.g. QuickInsights by Microsoft Power BI, SpotIQ by ThoughtSpot, Zenvisage, etc. In this demo, we focus on providing insights by examining periodic spreadsheets of different reports (aka views), without prior knowledge of the schema of the database or… ▽ More Business intelligence (BI) tools for database analytics have come a long way and nowadays also provide ready insights or visual query explorations, e.g. QuickInsights by Microsoft Power BI, SpotIQ by ThoughtSpot, Zenvisage, etc. In this demo, we focus on providing insights by examining periodic spreadsheets of different reports (aka views), without prior knowledge of the schema of the database or reports, or data information. Such a solution is targeted at users without the familiarity with the database schema or resources to conduct analytics in the contemporary way. △ Less

Submitted 15 June, 2020; originally announced June 2020.

ACM Class: H.2.8

arXiv:2006.04449 [pdf, other]

On Universalized Adversarial and Invariant Perturbations

Authors: Sandesh Kamath, Amit Deshpande, K V Subrahmanyam

Abstract: Convolutional neural networks or standard CNNs (StdCNNs) are translation-equivariant models that achieve translation invariance when trained on data augmented with sufficient translations. Recent work on equivariant models for a given group of transformations (e.g., rotations) has lead to group-equivariant convolutional neural networks (GCNNs). GCNNs trained on data augmented with sufficient rotat… ▽ More Convolutional neural networks or standard CNNs (StdCNNs) are translation-equivariant models that achieve translation invariance when trained on data augmented with sufficient translations. Recent work on equivariant models for a given group of transformations (e.g., rotations) has lead to group-equivariant convolutional neural networks (GCNNs). GCNNs trained on data augmented with sufficient rotations achieve rotation invariance. Recent work by authors arXiv:2002.11318 studies a trade-off between invariance and robustness to adversarial attacks. In another related work arXiv:2005.08632, given any model and any input-dependent attack that satisfies a certain spectral property, the authors propose a universalization technique called SVD-Universal to produce a universal adversarial perturbation by looking at very few test examples. In this paper, we study the effectiveness of SVD-Universal on GCNNs as they gain rotation invariance through higher degree of training augmentation. We empirically observe that as GCNNs gain rotation invariance through training augmented with larger rotations, the fooling rate of SVD-Universal gets better. To understand this phenomenon, we introduce universal invariant directions and study their relation to the universal adversarial direction produced by SVD-Universal. △ Less

Submitted 8 June, 2020; originally announced June 2020.

Comments: Some part of this work was presented in ICML 2018 Workshop on "Towards learning with limited labels: Equivariance, Invariance,and Beyond" as "Understanding Adversarial Robustness of Symmetric Networks"

arXiv:2005.14686 [pdf, other]

doi 10.1103/PhysRevC.102.064905

Production of $π^0$ and $η$ mesons in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV

Authors: U. Acharya, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, J. Alexander, K. Aoki, N. Apadula, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, X. Bai, B. Bannier, K. N. Barish, S. Bathe, V. Baublis, C. Baumann, S. Baumgart, A. Bazilevsky, M. Beaumier, R. Belmont, A. Berdnikov , et al. (378 additional authors not shown)

Abstract: The PHENIX experiment at the Relativistic Heavy Ion Collider measured $π^0$ and $η$ mesons at midrapidity in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV in a wide transverse momentum range. Measurements were performed in the $π^0(η)\rightarrowγγ$ decay modes. A strong suppression of $π^0$ and $η$ meson production at high transverse momentum was observed in central U$+$U collisions relative to b… ▽ More The PHENIX experiment at the Relativistic Heavy Ion Collider measured $π^0$ and $η$ mesons at midrapidity in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV in a wide transverse momentum range. Measurements were performed in the $π^0(η)\rightarrowγγ$ decay modes. A strong suppression of $π^0$ and $η$ meson production at high transverse momentum was observed in central U$+$U collisions relative to binary scaled $p$$+$$p$ results. Yields of $π^0$ and $η$ mesons measured in U$+$U collisions show similar suppression pattern to the ones measured in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV for similar numbers of participant nucleons. The $η$/$π^0$ ratios do not show dependence on centrality or transverse momentum, and are consistent with previously measured values in hadron-hadron, hadron-nucleus, nucleus-nucleus, and $e^+e^-$ collisions. △ Less

Submitted 13 November, 2020; v1 submitted 29 May, 2020; originally announced May 2020.

Comments: 403 authors from 72 institutions, 13 pages, 6 figures, 7 tables, 2012 data. v2 is version accepted by Physical Review C. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. C 102, 064905 (2020)

arXiv:2005.14276 [pdf, other]

doi 10.1103/PhysRevD.102.092002

Production of $b\bar{b}$ at forward rapidity in $p$+$p$ collisions at $\sqrt{s}=510$ GeV

Authors: U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, M. Alfred, N. Apadula, Y. Aramaki, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont, A. Berdnikov, Y. Berdnikov , et al. (325 additional authors not shown)

Abstract: The cross section of bottom quark-antiquark ($b\bar{b}$) production in $p$+$p$ collisions at $\sqrt{s}=510$ GeV is measured with the PHENIX detector at the Relativistic Heavy Ion Collider. The results are based on the yield of high mass, like-sign muon pairs measured within the PHENIX muon arm acceptance ($1.2<|y|<2.2$). The $b\bar{b}$ signal is extracted from like-sign dimuons by utilizing the un… ▽ More The cross section of bottom quark-antiquark ($b\bar{b}$) production in $p$+$p$ collisions at $\sqrt{s}=510$ GeV is measured with the PHENIX detector at the Relativistic Heavy Ion Collider. The results are based on the yield of high mass, like-sign muon pairs measured within the PHENIX muon arm acceptance ($1.2<|y|<2.2$). The $b\bar{b}$ signal is extracted from like-sign dimuons by utilizing the unique properties of neutral $B$ meson oscillation. We report a differential cross section of $dσ_{b\bar{b}\rightarrow μ^\pmμ^\pm}/dy = 0.16 \pm 0.01~(\mbox{stat}) \pm 0.02~(\mbox{syst}) \pm 0.02~(\mbox{global})$ nb for like-sign muons in the rapidity and $p_T$ ranges $1.2<|y|<2.2$ and $p_T>1$ GeV/$c$, and dimuon mass of 5--10 GeV/$c^2$. The extrapolated total cross section at this energy for $b\bar{b}$ production is $13.1 \pm 0.6~(\mbox{stat}) \pm 1.5~(\mbox{syst}) \pm 2.7~(\mbox{global})~μ$b. The total cross section is compared to a perturbative quantum chromodynamics calculation and is consistent within uncertainties. The azimuthal opening angle between muon pairs from $b\bar{b}$ decays and their $p_T$ distributions are compared to distributions generated using {\sc ps pythia 6}, which includes next-to-leading order processes. The azimuthal correlations and pair $p_T$ distribution are not very well described by {\sc pythia} calculations, but are still consistent within uncertainties. Flavor creation and flavor excitation subprocesses are favored over gluon splitting. △ Less

Submitted 27 October, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

Comments: 360 authors from 69 institutions, 13 pages, 11 figures, 2 tables, 2013 data. v2 is version accepted for publication in Physical Review D. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. D 102, 092002 (2020)

arXiv:2005.14273 [pdf, other]

doi 10.1103/PhysRevD.102.072008

Polarization and cross section of midrapidity J/$ψ$ production in proton-proton collisions at $\sqrt{s}=510$ GeV

Authors: U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, M. Alfred, N. Apadula, Y. Aramaki, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont, A. Berdnikov, Y. Berdnikov , et al. (325 additional authors not shown)

Abstract: The PHENIX experiment has measured the spin alignment for inclusive $J/ψ\rightarrow e^{+}e^{-}$ decays in $p$+$p$ collisions at $\sqrt{s}=510$ GeV at midrapidity. The angular distributions have been measured in three different polarization frames, and the three decay angular coefficients have been extracted in a full two-dimensional analysis. Previously, PHENIX saw large longitudinal net polarizat… ▽ More The PHENIX experiment has measured the spin alignment for inclusive $J/ψ\rightarrow e^{+}e^{-}$ decays in $p$+$p$ collisions at $\sqrt{s}=510$ GeV at midrapidity. The angular distributions have been measured in three different polarization frames, and the three decay angular coefficients have been extracted in a full two-dimensional analysis. Previously, PHENIX saw large longitudinal net polarization at forward rapidity at the same collision energy. This analysis at midrapidity, complementary to the previous PHENIX results, sees no sizable polarization in the measured transverse momentum range of $0.0<p_T<10.0$ GeV/$c$. The results are consistent with a previous one-dimensional analysis at midrapidity at $\sqrt{s}=200$ GeV. The transverse-momentum-dependent cross section for midrapidity $J/ψ$ production has additionally been measured, and after comparison to world data we find a simple logarithmic dependence of the cross section on $\sqrt{s}$. △ Less

Submitted 27 October, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

Comments: 360 authors from 69 institutions, 13 pages, 15 figures, 1 table, 2013 data. v1 is version accepted for publication in Physical Review D. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. D 102, 072008 (2020)

arXiv:2005.14270 [pdf, other]

doi 10.1103/PhysRevC.102.054910

Measurement of jet-medium interactions via direct photon-hadron correlations in Au$+$Au and $d$$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

Authors: U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, H. Al-Bataineh, J. Alexander, H. Al-Ta'ani, A. Angerami, K. Aoki, N. Apadula, Y. Aramaki, H. Asano, E. C. Aschenauer, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, B. Bannier , et al. (553 additional authors not shown)

Abstract: We present direct photon-hadron correlations in 200 GeV/A Au$+$Au, $d$$+$Au and $p$$+$$p$ collisions, for direct photon $p_T$ from 5--12 GeV/$c$, collected by the PHENIX Collaboration in the years from 2006 to 2011. We observe no significant modification of jet fragmentation in $d$$+$Au collisions, indicating that cold nuclear matter effects are small or absent. Hadrons carrying a large fraction o… ▽ More We present direct photon-hadron correlations in 200 GeV/A Au$+$Au, $d$$+$Au and $p$$+$$p$ collisions, for direct photon $p_T$ from 5--12 GeV/$c$, collected by the PHENIX Collaboration in the years from 2006 to 2011. We observe no significant modification of jet fragmentation in $d$$+$Au collisions, indicating that cold nuclear matter effects are small or absent. Hadrons carrying a large fraction of the quark's momentum are suppressed in Au$+$Au compared to $p$$+$$p$ and $d$$+$Au. As the momentum fraction decreases, the yield of hadrons in Au$+$Au increases to an excess over the yield in $p$$+$$p$ collisions. The excess is at large angles and at low hadron $p_T$ and is most pronounced for hadrons associated with lower momentum direct photons. Comparison to theoretical calculations suggests that the hadron excess arises from medium response to energy deposited by jets. △ Less

Submitted 19 November, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

Comments: 578 authors from 80 institutions, 11 pages, 7 figures, data from 2007, 2008, 2010, and 2011. v2 is version accepted for publication in Physical Review C. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. C 102, 054910 (2020)

arXiv:2005.10840 [pdf, other]

doi 10.1103/PRXQuantum.2.030350

Complexity of Fermionic Dissipative Interactions and Applications to Quantum Computing

Authors: Oles Shtanko, Abhinav Deshpande, Paul S. Julienne, Alexey V. Gorshkov

Abstract: Interactions between particles are usually a resource for quantum computing, making quantum many-body systems intractable by any known classical algorithm. In contrast, noise is typically considered as being inimical to quantum many-body correlations, ultimately leading the system to a classically tractable state. This work shows that noise represented by two-body processes, such as pair loss, pla… ▽ More Interactions between particles are usually a resource for quantum computing, making quantum many-body systems intractable by any known classical algorithm. In contrast, noise is typically considered as being inimical to quantum many-body correlations, ultimately leading the system to a classically tractable state. This work shows that noise represented by two-body processes, such as pair loss, plays the same role as many-body interactions and makes otherwise classically simulable systems universal for quantum computing. We analyze such processes in detail and establish a complexity transition between simulable and nonsimulable systems as a function of a tuning parameter. We determine important classes of simulable and nonsimulable two-body dissipation. Finally, we show how using resonant dissipation in cold atoms can enhance the performance of two-qubit gates. △ Less

Submitted 17 September, 2021; v1 submitted 21 May, 2020; originally announced May 2020.

Comments: 20 pages + 5 figures

Journal ref: PRX Quantum 2, 030350 (2021)

arXiv:2005.08632 [pdf, other]

Universalization of any adversarial attack using very few test examples

Authors: Sandesh Kamath, Amit Deshpande, K V Subrahmanyam, Vineeth N Balasubramanian

Abstract: Deep learning models are known to be vulnerable not only to input-dependent adversarial attacks but also to input-agnostic or universal adversarial attacks. Dezfooli et al. \cite{Dezfooli17,Dezfooli17anal} construct universal adversarial attack on a given model by looking at a large number of training data points and the geometry of the decision boundary near them. Subsequent work \cite{Khrulkov18… ▽ More Deep learning models are known to be vulnerable not only to input-dependent adversarial attacks but also to input-agnostic or universal adversarial attacks. Dezfooli et al. \cite{Dezfooli17,Dezfooli17anal} construct universal adversarial attack on a given model by looking at a large number of training data points and the geometry of the decision boundary near them. Subsequent work \cite{Khrulkov18} constructs universal attack by looking only at test examples and intermediate layers of the given model. In this paper, we propose a simple universalization technique to take any input-dependent adversarial attack and construct a universal attack by only looking at very few adversarial test examples. We do not require details of the given model and have negligible computational overhead for universalization. We theoretically justify our universalization technique by a spectral property common to many input-dependent adversarial perturbations, e.g., gradients, Fast Gradient Sign Method (FGSM) and DeepFool. Using matrix concentration inequalities and spectral perturbation bounds, we show that the top singular vector of input-dependent adversarial directions on a small test sample gives an effective and simple universal adversarial attack. For VGG16 and VGG19 models trained on ImageNet, our simple universalization of Gradient, FGSM, and DeepFool perturbations using a test sample of 64 images gives fooling rates comparable to state-of-the-art universal attacks \cite{Dezfooli17,Khrulkov18} for reasonable norms of perturbation. Code available at https://github.com/ksandeshk/svd-uap . △ Less

Submitted 28 October, 2022; v1 submitted 18 May, 2020; originally announced May 2020.

Comments: Appeared in ACM CODS-COMAD 2022 (Research Track)

arXiv:2005.06037 [pdf, other]

doi 10.1016/j.promfg.2020.05.141

Computer Vision Toolkit for Non-invasive Monitoring of Factory Floor Artifacts

Authors: Aditya M. Deshpande, Anil Kumar Telikicherla, Vinay Jakkali, David A. Wickelhaus, Manish Kumar, Sam Anand

Abstract: Digitization has led to smart, connected technologies be an integral part of businesses, governments and communities. For manufacturing digitization, there has been active research and development with a focus on Cloud Manufacturing (CM) and the Industrial Internet of Things (IIoT). This work presents a computer vision toolkit (CV Toolkit) for non-invasive digitization of the factory floor in line… ▽ More Digitization has led to smart, connected technologies be an integral part of businesses, governments and communities. For manufacturing digitization, there has been active research and development with a focus on Cloud Manufacturing (CM) and the Industrial Internet of Things (IIoT). This work presents a computer vision toolkit (CV Toolkit) for non-invasive digitization of the factory floor in line with Industry 4.0 requirements for factory data collection. Currently, technical challenges persist towards digitization of legacy systems due to the limitation for changes in their design and sensors. This novel toolkit is developed to facilitate easy integration of legacy production machinery and factory floor artifacts with the digital and smart manufacturing environment with no requirement of any physical changes in the machines. The system developed is modular, and allows real-time monitoring of production machinery. Modularity aspect allows the incorporation of new software applications in the current framework of CV Toolkit. To allow connectivity of this toolkit with manufacturing floors in a simple, deployable and cost-effective manner, the toolkit is integrated with a known manufacturing data standard, MTConnect, to "translate" the digital inputs into data streams that can be read by commercial status tracking and reporting software solutions. The proposed toolkit is demonstrated using a mock-panel environment developed in house at the University of Cincinnati to highlight its usability. △ Less

Submitted 12 May, 2020; originally announced May 2020.

Comments: Accepted for publication in 48th SME North American Manufacturing Research Conference (NAMRC48)

Journal ref: Procedia Manufacturing 48 (2020) 1020-1028

arXiv:2005.05815 [pdf, other]

doi 10.1016/j.promfg.2020.05.146

One-Shot Recognition of Manufacturing Defects in Steel Surfaces

Authors: Aditya M. Deshpande, Ali A. Minai, Manish Kumar

Abstract: Quality control is an essential process in manufacturing to make the product defect-free as well as to meet customer needs. The automation of this process is important to maintain high quality along with the high manufacturing throughput. With recent developments in deep learning and computer vision technologies, it has become possible to detect various features from the images with near-human acc… ▽ More Quality control is an essential process in manufacturing to make the product defect-free as well as to meet customer needs. The automation of this process is important to maintain high quality along with the high manufacturing throughput. With recent developments in deep learning and computer vision technologies, it has become possible to detect various features from the images with near-human accuracy. However, many of these approaches are data intensive. Training and deployment of such a system on manufacturing floors may become expensive and time-consuming. The need for large amounts of training data is one of the limitations of the applicability of these approaches in real-world manufacturing systems. In this work, we propose the application of a Siamese convolutional neural network to do one-shot recognition for such a task. Our results demonstrate how one-shot learning can be used in quality control of steel by identification of defects on the steel surface. This method can significantly reduce the requirements of training data and can also be run in real-time. △ Less

Submitted 12 May, 2020; originally announced May 2020.

Comments: Accepted for publication in NAMRC 48

Journal ref: Procedia Manufacturing 48 (2020) 1064-1071

arXiv:2004.12920 [pdf, other]

Flight Control of Sliding Arm Quadcopter with Dynamic Structural Parameters

Authors: Rumit Kumar, Aditya M. Deshpande, James Z. Wells, Manish Kumar

Abstract: The conceptual design and flight controller of a novel kind of quadcopter are presented. This design is capable of morphing the shape of the UAV during flight to achieve position and attitude control. We consider a dynamic center of gravity (CoG) which causes continuous variation in a moment of inertia (MoI) parameters of the UAV in this design. These dynamic structural parameters play a vital rol… ▽ More The conceptual design and flight controller of a novel kind of quadcopter are presented. This design is capable of morphing the shape of the UAV during flight to achieve position and attitude control. We consider a dynamic center of gravity (CoG) which causes continuous variation in a moment of inertia (MoI) parameters of the UAV in this design. These dynamic structural parameters play a vital role in the stability and control of the system. The length of quadcopter arms is a variable parameter, and it is actuated using attitude feedback-based control law. The MoI parameters are computed in real-time and incorporated in the equations of motion of the system. The UAV utilizes the angular motion of propellers and variable quadcopter arm lengths for position and navigation control. The movement space of the CoG is a design parameter and it is bounded by actuator limitations and stability requirements of the system. A detailed information on equations of motion, flight controller design and possible applications of this system are provided. Further, the proposed shape-changing UAV system is evaluated by comparative numerical simulations for way point navigation mission and complex trajectory tracking. △ Less

Submitted 27 April, 2020; originally announced April 2020.

Comments: 6 Pages

arXiv:2004.02681 [pdf, ps, other]

doi 10.1103/PhysRevD.102.032001

Measurement of charged pion double spin asymmetries at midrapidity in longitudinally polarized $p$$+$$p$ collisions at $\sqrt{s}=510$ GeV

Authors: U. A. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, M. Alfred, N. Apadula, Y. Aramaki, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont, A. Berdnikov, Y. Berdnikov , et al. (335 additional authors not shown)

Abstract: The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the longitudinal double spin asymmetries, $A_{LL}$, for charged pions at midrapidity ($|η|<0.35$) in longitudinally polarized $p$$+$$p$ collisions at $\sqrt{s}=510$ GeV. These measurements are sensitive to the gluon spin contribution to the total spin of the proton in the parton momentum fraction $x$ range between 0.04 and 0… ▽ More The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the longitudinal double spin asymmetries, $A_{LL}$, for charged pions at midrapidity ($|η|<0.35$) in longitudinally polarized $p$$+$$p$ collisions at $\sqrt{s}=510$ GeV. These measurements are sensitive to the gluon spin contribution to the total spin of the proton in the parton momentum fraction $x$ range between 0.04 and 0.09. One can infer the sign of the gluon polarization from the ordering of pion asymmetries with charge alone. The asymmetries are found to be consistent with global quantum-chromodynamics fits of deep-inelastic scattering and data at $\sqrt{s}=200$ GeV, which show a nonzero positive contribution of gluon spin to the proton spin. △ Less

Submitted 31 July, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

Comments: 360 authors, 8 pages, 6 figures, 1 table, 2013 data. v2 is version accepted for publication in Physical Review D. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. D 102, 032001 (2020)

arXiv:2004.01666 [pdf, other]

doi 10.1103/PhysRevD.101.103531

Post-Limber Weak Lensing Bispectrum, Reduced Shear Correction, and Magnification Bias Correction

Authors: Anurag C. Deshpande, Thomas D. Kitching

Abstract: The significant increase in precision that will be achieved by Stage IV cosmic shear surveys means that several currently used theoretical approximations may cease to be valid. An additional layer of complexity arises from the fact that many of these approximations are interdependent; the procedure to correct for one involves making another. Two such approximations that must be relaxed for upcomin… ▽ More The significant increase in precision that will be achieved by Stage IV cosmic shear surveys means that several currently used theoretical approximations may cease to be valid. An additional layer of complexity arises from the fact that many of these approximations are interdependent; the procedure to correct for one involves making another. Two such approximations that must be relaxed for upcoming experiments are the reduced shear approximation and the effect of neglecting magnification bias. Accomplishing this involves the calculation of the convergence bispectrum; typically subject to the Limber approximation. In this work, we compute the post-Limber convergence bispectrum, and the post-Limber reduced shear and magnification bias corrections to the angular power spectrum for a Euclid-like survey. We find that the Limber approximation significantly overestimates the bispectrum when any side of the bispectrum triangle, $\ell_i<60$. However, the resulting changes in the reduced shear and magnification bias corrections are well below the sample variance for $\ell\leq5000$. We also compute a worst-case scenario for the additional biases on $w_0w_a$CDM cosmological parameters that result from the difference between the post-Limber and Limber approximated forms of the corrections. These further demonstrate that the reduced shear and magnification bias corrections can safely be treated under the Limber approximation for upcoming surveys. △ Less

Submitted 27 May, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

Comments: 12 pages, 4 figures. Accepted by Phys. Rev. D. Matches published version

Journal ref: Phys. Rev. D 101, 103531 (2020)

arXiv:2002.11318 [pdf, other]

Can we have it all? On the Trade-off between Spatial and Adversarial Robustness of Neural Networks

Authors: Sandesh Kamath, Amit Deshpande, K V Subrahmanyam, Vineeth N Balasubramanian

Abstract: (Non-)robustness of neural networks to small, adversarial pixel-wise perturbations, and as more recently shown, to even random spatial transformations (e.g., translations, rotations) entreats both theoretical and empirical understanding. Spatial robustness to random translations and rotations is commonly attained via equivariant models (e.g., StdCNNs, GCNNs) and training augmentation, whereas adve… ▽ More (Non-)robustness of neural networks to small, adversarial pixel-wise perturbations, and as more recently shown, to even random spatial transformations (e.g., translations, rotations) entreats both theoretical and empirical understanding. Spatial robustness to random translations and rotations is commonly attained via equivariant models (e.g., StdCNNs, GCNNs) and training augmentation, whereas adversarial robustness is typically achieved by adversarial training. In this paper, we prove a quantitative trade-off between spatial and adversarial robustness in a simple statistical setting. We complement this empirically by showing that: (a) as the spatial robustness of equivariant models improves by training augmentation with progressively larger transformations, their adversarial robustness worsens progressively, and (b) as the state-of-the-art robust models are adversarially trained with progressively larger pixel-wise perturbations, their spatial robustness drops progressively. Towards achieving pareto-optimality in this trade-off, we propose a method based on curriculum learning that trains gradually on more difficult perturbations (both spatial and adversarial) to improve spatial and adversarial robustness simultaneously. △ Less

Submitted 10 November, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

Comments: Accepted NeurIPS 2021. Preliminary version consisting early experimental results was presented in ICML 2018 Workshop on "Towards learning with limited labels: Equivariance, Invariance,and Beyond" as "Understanding Adversarial Robustness of Symmetric Networks"

arXiv:2002.11208 [pdf]

Automatic Segmentation, Feature Extraction and Comparison of Healthy and Stroke Cerebral Vasculature

Authors: Aditi Deshpande, Nima Jamilpour, Bin Jiang, Chelsea Kidwell, Max Wintermark, Kaveh Laksari

Abstract: Accurate segmentation of cerebral vasculature and a quantitative assessment of cerebrovascular morphology is critical to various diagnostic and therapeutic purposes and is pertinent to studying brain health and disease. However, this is still a challenging task due to the complexity of the vascular imaging data. We propose an automated method for cerebral vascular segmentation without the need of… ▽ More Accurate segmentation of cerebral vasculature and a quantitative assessment of cerebrovascular morphology is critical to various diagnostic and therapeutic purposes and is pertinent to studying brain health and disease. However, this is still a challenging task due to the complexity of the vascular imaging data. We propose an automated method for cerebral vascular segmentation without the need of any manual intervention as well as a method to skeletonize the binary volume to extract vascular geometric features which can characterize vessel structure. We combine a probabilistic vessel-enhancing filtering with an active-contour technique to segment magnetic resonance and computed tomography angiograms (MRA and CTA) and subsequently extract the vessel centerlines and diameters to calculate the geometrical properties of the vasculature. Our method was validated using a 3D phantom of the Circle-of-Willis region with 84% mean Dice Similarity and 85% mean Pearson Correlation with minimal modified Hausdorff distance error. We applied this method to a dataset of healthy subjects and stroke patients and present a quantitative comparison between them. We found significant differences in the geometric features including total length (2.88 +/- 0.38 m for healthy and 2.20 +/- 0.67 m for stroke), volume (40.18 +/- 25.55 ml for healthy and 34.43 +/- 21.83 ml for stroke), tortuosity (3.24 +/- 0.88 rad/cm for healthy and 5.80 +/- 0.92 rad/cm for stroke) and fractality (box dimension 1.36 +/- 0.28 for healthy vs. 1.69 +/- 0.20 for stroke). This technique can be applied on any imaging modality and can be used in the future to automatically obtain the 3D segmented vasculature for diagnosis and treatment planning of Stroke and other cerebrovascular diseases (CVD) in the clinic and also to study the morphological changes caused by various CVD. △ Less

Submitted 25 February, 2020; originally announced February 2020.

Comments: 14 pages, 4 figures, 4 tables

arXiv:2001.11509 [pdf, other]

doi 10.1103/PhysRevX.10.031009

Hierarchy of linear light cones with long-range interactions

Authors: Minh C. Tran, Chi-Fang Chen, Adam Ehrenberg, Andrew Y. Guo, Abhinav Deshpande, Yifan Hong, Zhe-Xuan Gong, Alexey V. Gorshkov, Andrew Lucas

Abstract: In quantum many-body systems with local interactions, quantum information and entanglement cannot spread outside of a linear light cone, which expands at an emergent velocity analogous to the speed of light. Local operations at sufficiently separated spacetime points approximately commute -- given a many-body state,… ▽ More In quantum many-body systems with local interactions, quantum information and entanglement cannot spread outside of a linear light cone, which expands at an emergent velocity analogous to the speed of light. Local operations at sufficiently separated spacetime points approximately commute -- given a many-body state, $\mathcal{O}_x(t) \mathcal{O}_y |ψ\rangle \approx \mathcal{O}_y\mathcal{O}_x(t) |ψ\rangle$ with arbitrarily small errors -- so long as $|x-y|\gtrsim vt$, where $v$ is finite. Yet most non-relativistic physical systems realized in nature have long-range interactions: two degrees of freedom separated by a distance $r$ interact with potential energy $V(r) \propto 1/r^α$. In systems with long-range interactions, we rigorously establish a hierarchy of linear light cones: at the same $α$, some quantum information processing tasks are constrained by a linear light cone while others are not. In one spatial dimension, this linear light cone exists for every many-body state when $α>3$ (Lieb-Robinson light cone); for a typical state chosen uniformly at random from the Hilbert space when $α>\frac{5}{2}$ (Frobenius light cone); for every state of a non-interacting system when $α>2$ (free light cone). These bounds apply to time-dependent systems and are optimal up to subalgebraic improvements. Our theorems regarding the Lieb-Robinson and free light cones -- and their tightness -- also generalize to arbitrary dimensions. We discuss the implications of our bounds on the growth of connected correlators and of topological order, the clustering of correlations in gapped systems, and the digital simulation of systems with long-range interactions. In addition, we show that universal quantum state transfer, as well as many-body quantum chaos, are bounded by the Frobenius light cone, and therefore are poorly constrained by all Lieb-Robinson bounds. △ Less

Submitted 18 July, 2022; v1 submitted 30 January, 2020; originally announced January 2020.

Comments: 36 pages; 6 figures; v2: revised and expanded introduction, a few extra results; v3: minor revisions. v4: corrections in Section 7

Journal ref: Phys. Rev. X 10, 031009 (2020)

arXiv:2001.08791 [pdf, other]

Machine learning based co-creative design framework

Authors: Brian Quanz, Wei Sun, Ajay Deshpande, Dhruv Shah, Jae-eun Park

Abstract: We propose a flexible, co-creative framework bringing together multiple machine learning techniques to assist human users to efficiently produce effective creative designs. We demonstrate its potential with a perfume bottle design case study, including human evaluation and quantitative and qualitative analyses. We propose a flexible, co-creative framework bringing together multiple machine learning techniques to assist human users to efficiently produce effective creative designs. We demonstrate its potential with a perfume bottle design case study, including human evaluation and quantitative and qualitative analyses. △ Less

Submitted 23 January, 2020; originally announced January 2020.

Comments: Thirty-third Conference on Neural Information Processing Systems (NeurIPS) 2019 Workshop on Machine Learning for Creativity and Design, December 14th, 2019, Vancouver, Canada (https://neurips2019creativity.github.io/)

ACM Class: I.2.6; I.2.1; D.2.2; I.5.4; I.3.6; H.1.2; H.5.0; J.5

arXiv:1912.13424 [pdf, other]

doi 10.1103/PhysRevD.101.052006

$J/ψ$ and $ψ(2S)$ production at forward rapidity in $p$+$p$ collisions at $\sqrt{s}=510$ GeV

Authors: U. A. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, M. Alfred, N. Apadula, Y. Aramaki, H. Asano, E. T. Atomssa, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont, A. Berdnikov, Y. Berdnikov , et al. (335 additional authors not shown)

Abstract: The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the differential cross section, mean transverse momentum, mean transverse momentum squared of inclusive $J/ψ$ and cross-section ratio of $ψ(2S)$ to $J/ψ$ at forward rapidity in \pp collisions at \sqrts = 510 GeV via the dimuon decay channel. Comparison is made to inclusive $J/ψ$ cross sections measured at \sqrts = 200 GeV an… ▽ More The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the differential cross section, mean transverse momentum, mean transverse momentum squared of inclusive $J/ψ$ and cross-section ratio of $ψ(2S)$ to $J/ψ$ at forward rapidity in \pp collisions at \sqrts = 510 GeV via the dimuon decay channel. Comparison is made to inclusive $J/ψ$ cross sections measured at \sqrts = 200 GeV and 2.76--13 TeV. The result is also compared to leading-order nonrelativistic QCD calculations coupled to a color-glass-condensate description of the low-$x$ gluons in the proton at low transverse momentum ($p_T$) and to next-to-leading order nonrelativistic QCD calculations for the rest of the $p_T$ range. These calculations overestimate the data at low $p_T$. While consistent with the data within uncertainties above $\approx3$ GeV/$c$, the calculations are systematically below the data. The total cross section times the branching ratio is BR $dσ^{J/ψ}_{pp}/dy (1.2<|y|<2.2, 0<p_T<10~\mbox{GeV/$c$}) =$ 54.3 $\pm$ 0.5 (stat) $\pm$ 5.5 (syst) nb. △ Less

Submitted 19 February, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

Comments: 361 authors from 71 institutions, 13 pages, 4 tables, 11 figures, 2013 data. v2 is version accepted for publication in Physical Review D. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. D 101, 052006 (2020)

arXiv:1912.07326 [pdf, other]

doi 10.1051/0004-6361/201937323

Euclid: The reduced shear approximation and magnification bias for Stage IV cosmic shear experiments

Authors: A. C. Deshpande, T. D. Kitching, V. F. Cardone, P. L. Taylor, S. Casas, S. Camera, C. Carbone, M. Kilbinger, V. Pettorino, Z. Sakr, D. Sapone, I. Tutusaus, N. Auricchio, C. Bodendorf, D. Bonino, M. Brescia, V. Capobianco, J. Carretero, M. Castellano, S. Cavuoti, R. Cledassou, G. Congedo, L. Conversi, L. Corcione, M. Cropper , et al. (47 additional authors not shown)

Abstract: Stage IV weak lensing experiments will offer more than an order of magnitude leap in precision. We must therefore ensure that our analyses remain accurate in this new era. Accordingly, previously ignored systematic effects must be addressed. In this work, we evaluate the impact of the reduced shear approximation and magnification bias, on the information obtained from the angular power spectrum. T… ▽ More Stage IV weak lensing experiments will offer more than an order of magnitude leap in precision. We must therefore ensure that our analyses remain accurate in this new era. Accordingly, previously ignored systematic effects must be addressed. In this work, we evaluate the impact of the reduced shear approximation and magnification bias, on the information obtained from the angular power spectrum. To first-order, the statistics of reduced shear, a combination of shear and convergence, are taken to be equal to those of shear. However, this approximation can induce a bias in the cosmological parameters that can no longer be neglected. A separate bias arises from the statistics of shear being altered by the preferential selection of galaxies and the dilution of their surface densities, in high-magnification regions. The corrections for these systematic effects take similar forms, allowing them to be treated together. We calculated the impact of neglecting these effects on the cosmological parameters that would be determined from Euclid, using cosmic shear tomography. To do so, we employed the Fisher matrix formalism, and included the impact of the super-sample covariance. We also demonstrate how the reduced shear correction can be calculated using a lognormal field forward modelling approach. These effects cause significant biases in Omega_m, sigma_8, n_s, Omega_DE, w_0, and w_a of -0.53 sigma, 0.43 sigma, -0.34 sigma, 1.36 sigma, -0.68 sigma, and 1.21 sigma, respectively. We then show that these lensing biases interact with another systematic: the intrinsic alignment of galaxies. Accordingly, we develop the formalism for an intrinsic alignment-enhanced lensing bias correction. Applying this to Euclid, we find that the additional terms introduced by this correction are sub-dominant. △ Less

Submitted 1 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

Comments: 16 pages, 6 figures, submitted to Astronomy & Astrophysics on 16/12/2019, accepted on 04/03/2020. SSC Fisher procedure corrected

Journal ref: A&A 636, A95 (2020)

arXiv:1910.14487 [pdf, other]

doi 10.1103/PhysRevC.102.014902

Measurement of $J/ψ$ at forward and backward rapidity in $p$+$p$, $p$$+A$l, $p$$+A$u, and $^3$He+Au collisions at $\sqrt{s_{_{NN}}}=200~{\rm GeV}$

Authors: U. A. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, M. Alfred, V. Andrieux, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont, A. Berdnikov, Y. Berdnikov, D. S. Blau, M. Boer, J. S. Bok , et al. (337 additional authors not shown)

Abstract: Charmonium is a valuable probe in heavy-ion collisions to study the properties of the quark gluon plasma, and is also an interesting probe in small collision systems to study cold nuclear matter effects, which are also present in large collision systems. With the recent observations of collective behavior of produced particles in small system collisions, measurements of the modification of charmon… ▽ More Charmonium is a valuable probe in heavy-ion collisions to study the properties of the quark gluon plasma, and is also an interesting probe in small collision systems to study cold nuclear matter effects, which are also present in large collision systems. With the recent observations of collective behavior of produced particles in small system collisions, measurements of the modification of charmonium in small systems have become increasingly relevant. We present the results of $J/ψ$ measurements at forward and backward rapidity in various small collision systems, $p$$+$$p$, $p$$+$Al, $p$$+$Au and $^3$He$+$Au, at $\sqrt{s_{_{NN}}}$=200 GeV. The results are presented in the form of the observable $R_{AB}$, the nuclear modification factor, a measure of the ratio of the $J/ψ$ invariant yield compared to the scaled yield in $p$$+$$p$ collisions. We examine the rapidity, transverse momentum, and collision centrality dependence of nuclear effects on $J/ψ$ production with different projectile sizes $p$ and $^3$He, and different target sizes Al and Au. The modification is found to be strongly dependent on the target size, but to be very similar for $p$$+$Au and $^{3}$He$+$Au. However, for 0%--20% central collisions at backward rapidity, the modification for $^{3}$He$+$Au is found to be smaller than that for $p$$+$Au, with a mean fit to the ratio of $0.89\pm0.03$(stat)${\pm}0.08$(syst), possibly indicating final state effects due to the larger projectile size. △ Less

Submitted 12 July, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

Comments: 362 authors, 68 institutions, 23 pages, 28 figures, 3 tables, 2014 and 2015 data. v3 is version accepted for publication in Phys. Rev. C. Plain text data tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

Journal ref: Phys. Rev. C 102, 014902 (2020)

Showing 151–200 of 553 results for author: Deshpande, A