-
SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Authors:
Mincheol Kim,
Scott Niekum,
Ashish D. Deshpande
Abstract:
We introduce a sample-efficient method for learning state-dependent stiffness control policies for dexterous manipulation. The ability to control stiffness facilitates safe and reliable manipulation by providing compliance and robustness to uncertainties. Most current reinforcement learning approaches to achieve robotic manipulation have exclusively focused on position control, often due to the di…
▽ More
We introduce a sample-efficient method for learning state-dependent stiffness control policies for dexterous manipulation. The ability to control stiffness facilitates safe and reliable manipulation by providing compliance and robustness to uncertainties. Most current reinforcement learning approaches to achieve robotic manipulation have exclusively focused on position control, often due to the difficulty of learning high-dimensional stiffness control policies. This difficulty can be partially mitigated via policy guidance such as imitation learning. However, expert stiffness control demonstrations are often expensive or infeasible to record. Therefore, we present an approach to learn Stiffness Control from Augmented Position control Experiences (SCAPE) that bypasses this difficulty by transforming position control demonstrations into approximate, suboptimal stiffness control demonstrations. Then, the suboptimality of the augmented demonstrations is addressed by using complementary techniques that help the agent safely learn from both the demonstrations and reinforcement learning. By using simulation tools and experiments on a robotic testbed, we show that the proposed approach efficiently learns safe manipulation policies and outperforms learned position control policies and several other baseline learning algorithms.
△ Less
Submitted 14 September, 2021; v1 submitted 16 February, 2021;
originally announced February 2021.
-
Perturbation of charge density waves in 1T-TiSe$_2$
Authors:
Imrankhan Mulani,
Umashankar Rajput,
Luminita Harnagea,
Aparna Deshpande
Abstract:
In this study, using low-temperature scanning tunneling microscopy (STM), we focus on understanding the native defects in pristine \textit{1T}-TiSe$_2$ at the atomic scale. We probe how they perturb the charge density waves (CDWs) and lead to local domain formation. These defects influence the correlation length of CDWs. We establish a connection between suppression of CDWs, Ti intercalation, and…
▽ More
In this study, using low-temperature scanning tunneling microscopy (STM), we focus on understanding the native defects in pristine \textit{1T}-TiSe$_2$ at the atomic scale. We probe how they perturb the charge density waves (CDWs) and lead to local domain formation. These defects influence the correlation length of CDWs. We establish a connection between suppression of CDWs, Ti intercalation, and show how this supports the exciton condensation model of CDW formation in \textit{1T}-TiSe$_2$.
△ Less
Submitted 18 February, 2021; v1 submitted 4 February, 2021;
originally announced February 2021.
-
A linearized framework and a new benchmark for model selection for fine-tuning
Authors:
Aditya Deshpande,
Alessandro Achille,
Avinash Ravichandran,
Hao Li,
Luca Zancato,
Charless Fowlkes,
Rahul Bhotika,
Stefano Soatto,
Pietro Perona
Abstract:
Fine-tuning from a collection of models pre-trained on different domains (a "model zoo") is emerging as a technique to improve test accuracy in the low-data regime. However, model selection, i.e. how to pre-select the right model to fine-tune from a model zoo without performing any training, remains an open topic. We use a linearized framework to approximate fine-tuning, and introduce two new base…
▽ More
Fine-tuning from a collection of models pre-trained on different domains (a "model zoo") is emerging as a technique to improve test accuracy in the low-data regime. However, model selection, i.e. how to pre-select the right model to fine-tune from a model zoo without performing any training, remains an open topic. We use a linearized framework to approximate fine-tuning, and introduce two new baselines for model selection -- Label-Gradient and Label-Feature Correlation. Since all model selection algorithms in the literature have been tested on different use-cases and never compared directly, we introduce a new comprehensive benchmark for model selection comprising of: i) A model zoo of single and multi-domain models, and ii) Many target tasks. Our benchmark highlights accuracy gain with model zoo compared to fine-tuning Imagenet models. We show our model selection baseline can select optimal models to fine-tune in few selections and has the highest ranking correlation to fine-tuning accuracy compared to existing algorithms.
△ Less
Submitted 29 January, 2021;
originally announced February 2021.
-
A Comparative Study of Straight-Strip and Zigzag-Interleaved Anode Patterns for MPGD Readouts
Authors:
C. Perez-Lara,
S. Aune,
B. Azmoun,
K. Dehmelt,
A. Deshpande,
W. Fan,
P. Garg,
T. K. Hemmick,
M. Kebbiri,
A. Kiselev,
I. Mandjavidze,
M. L. Purschke,
M. Revolle,
M. Vandenbroucke,
C. Woody
Abstract:
Due to their simplicity and versatility of design, straight strip or rectangular pad anode structures are frequently employed with micro-pattern gas detectors to reconstruct high precision space points for various tracking applications. The particle impact point is typically determined by interpolating the charge collected by several neighboring pads. However, to effectively extract the inherent p…
▽ More
Due to their simplicity and versatility of design, straight strip or rectangular pad anode structures are frequently employed with micro-pattern gas detectors to reconstruct high precision space points for various tracking applications. The particle impact point is typically determined by interpolating the charge collected by several neighboring pads. However, to effectively extract the inherent positional information, the lateral spacing of the straight pads must be significantly smaller than the extent of the charge cloud. In contrast, highly interleaved anode patterns, such as zigzags, can adequately sample the charge with a pitch comparable to the size of the charge cloud or even larger. This has the considerable advantage of providing the same performance while requiring far fewer instrumented channels. Additionally, the geometric parameters defining such zigzag structures may be tuned to provide a uniform detector response without the need for so-called pad response functions, while simultaneously maintaining excellent position resolution. We have measured the position resolution of a variety of zigzag shaped anode patterns optimized for various MPGDs, including GEM, Micromegas, and micro-RWELL and compared this performance to the same detectors equipped with straight pads of varying pitch. We report on the performance results of each readout structure, evaluated under identical conditions in a test beam.
△ Less
Submitted 28 January, 2021;
originally announced January 2021.
-
The Importance of Modeling Data Missingness in Algorithmic Fairness: A Causal Perspective
Authors:
Naman Goel,
Alfonso Amayuelas,
Amit Deshpande,
Amit Sharma
Abstract:
Training datasets for machine learning often have some form of missingness. For example, to learn a model for deciding whom to give a loan, the available training data includes individuals who were given a loan in the past, but not those who were not. This missingness, if ignored, nullifies any fairness guarantee of the training procedure when the model is deployed. Using causal graphs, we charact…
▽ More
Training datasets for machine learning often have some form of missingness. For example, to learn a model for deciding whom to give a loan, the available training data includes individuals who were given a loan in the past, but not those who were not. This missingness, if ignored, nullifies any fairness guarantee of the training procedure when the model is deployed. Using causal graphs, we characterize the missingness mechanisms in different real-world scenarios. We show conditions under which various distributions, used in popular fairness algorithms, can or can not be recovered from the training data. Our theoretical results imply that many of these algorithms can not guarantee fairness in practice. Modeling missingness also helps to identify correct design principles for fair algorithms. For example, in multi-stage settings where decisions are made in multiple screening rounds, we use our framework to derive the minimal distributions required to design a fair algorithm. Our proposed algorithm decentralizes the decision-making process and still achieves similar performance to the optimal algorithm that requires centralization and non-recoverable distributions.
△ Less
Submitted 21 December, 2020;
originally announced December 2020.
-
Autocatalytic systems and recombination: a reaction network perspective
Authors:
Gheorghe Craciun,
Abhishek Deshpande,
Badal Joshi,
Polly Y. Yu
Abstract:
Autocatalytic systems are very often incorporated in the "origin of life" models, a connection that has been analyzed in the context of the classical hypercycles introduced by Manfred Eigen. We investigate the dynamics of certain networks called bimolecular autocatalytic systems. In particular, we consider the dynamics corresponding to the relative populations in these networks, and show that they…
▽ More
Autocatalytic systems are very often incorporated in the "origin of life" models, a connection that has been analyzed in the context of the classical hypercycles introduced by Manfred Eigen. We investigate the dynamics of certain networks called bimolecular autocatalytic systems. In particular, we consider the dynamics corresponding to the relative populations in these networks, and show that they can be analyzed by studying well-chosen autonomous polynomial dynamical systems. Moreover, we find that one can use results from reaction network theory to prove persistence and permanence of several types of bimolecular autocatalytic systems called autocatalytic recombination networks.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.
-
Euclid: Forecasts for $k$-cut $3 \times 2$ Point Statistics
Authors:
P. L. Taylor,
T. Kitching,
V. F. Cardone,
A. Ferté,
E. M. Huff,
F. Bernardeau,
J. Rhodes,
A. C. Deshpande,
I. Tutusaus,
A. Pourtsidou,
S. Camera,
C. Carbone,
S. Casas,
M. Martinelli,
V. Pettorino,
Z. Sakr,
D. Sapone,
V. Yankelevich,
N. Auricchio,
A. Balestra,
C. Bodendorf,
D. Bonino,
A. Boucaud,
E. Branchini,
M. Brescia
, et al. (70 additional authors not shown)
Abstract:
Modelling uncertainties at small scales, i.e. high $k$ in the power spectrum $P(k)$, due to baryonic feedback, nonlinear structure growth and the fact that galaxies are biased tracers poses a significant obstacle to fully leverage the constraining power of the {\it Euclid} wide-field survey. $k$-cut cosmic shear has recently been proposed as a method to optimally remove sensitivity to these scales…
▽ More
Modelling uncertainties at small scales, i.e. high $k$ in the power spectrum $P(k)$, due to baryonic feedback, nonlinear structure growth and the fact that galaxies are biased tracers poses a significant obstacle to fully leverage the constraining power of the {\it Euclid} wide-field survey. $k$-cut cosmic shear has recently been proposed as a method to optimally remove sensitivity to these scales while preserving usable information. In this paper we generalise the $k$-cut cosmic shear formalism to $3 \times 2$ point statistics and estimate the loss of information for different $k$-cuts in a $3 \times 2$ point analysis of the {\it Euclid} data. Extending the Fisher matrix analysis of~\citet{blanchard2019euclid}, we assess the degradation in constraining power for different $k$-cuts. We work in the idealised case and assume the galaxy bias is linear, the covariance is Gaussian, while neglecting uncertainties due to photo-z errors and baryonic feedback. We find that taking a $k$-cut at $2.6 \ h \ {\rm Mpc} ^{-1}$ yields a dark energy Figure of Merit (FOM) of 1018. This is comparable to taking a weak lensing cut at $\ell = 5000$ and a galaxy clustering and galaxy-galaxy lensing cut at $\ell = 3000$ in a traditional $3 \times 2$ point analysis. We also find that the fraction of the observed galaxies used in the photometric clustering part of the analysis is one of the main drivers of the FOM. Removing $50 \% \ (90 \%)$ of the clustering galaxies decreases the FOM by $19 \% \ (62 \%)$. Given that the FOM depends so heavily on the fraction of galaxies used in the clustering analysis, extensive efforts should be made to handle the real-world systematics present when extending the analysis beyond the luminous red galaxy (LRG) sample.
△ Less
Submitted 20 July, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Transverse momentum dependent forward neutron single spin asymmetries in transversely polarized $p$$+$$p$ collisions at $\sqrt{s}=200$ GeV
Authors:
U. A. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj,
V. Bumazhnov
, et al. (289 additional authors not shown)
Abstract:
In 2015, the PHENIX collaboration has measured very forward ($η>6.8$) single-spin asymmetries of inclusive neutrons in transversely polarized proton-proton and proton-nucleus collisions at a center of mass energy of 200 GeV. A previous publication from this data set concentrated on the nuclear dependence of such asymmetries. In this measurement the explicit transverse-momentum dependence of inclus…
▽ More
In 2015, the PHENIX collaboration has measured very forward ($η>6.8$) single-spin asymmetries of inclusive neutrons in transversely polarized proton-proton and proton-nucleus collisions at a center of mass energy of 200 GeV. A previous publication from this data set concentrated on the nuclear dependence of such asymmetries. In this measurement the explicit transverse-momentum dependence of inclusive neutron single spin asymmetries for proton-proton collisions is extracted using a bootstrap**-unfolding technique on the transverse momenta. This explicit transverse-momentum dependence will help improve the understanding of the mechanisms that create these asymmetries.
△ Less
Submitted 6 February, 2021; v1 submitted 28 November, 2020;
originally announced November 2020.
-
Transverse single-spin asymmetries of midrapidity $π^0$ and $η$ mesons in polarized $p$$+$$p$ collisions at $\sqrt{s}=200$ GeV
Authors:
U. A. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj,
V. Bumazhnov
, et al. (289 additional authors not shown)
Abstract:
We present a measurement of the transverse single-spin asymmetry for $π^0$ and $η$ mesons in $p^\uparrow$$+$$p$ collisions in the pseudorapidity range $|η|<0.35$ and at a center-of-mass energy of 200 GeV with the PHENIX detector at the Relativistic Heavy Ion Collider. In comparison with previous measurements in this kinematic region, these results have a factor of 3 smaller uncertainties. As hadro…
▽ More
We present a measurement of the transverse single-spin asymmetry for $π^0$ and $η$ mesons in $p^\uparrow$$+$$p$ collisions in the pseudorapidity range $|η|<0.35$ and at a center-of-mass energy of 200 GeV with the PHENIX detector at the Relativistic Heavy Ion Collider. In comparison with previous measurements in this kinematic region, these results have a factor of 3 smaller uncertainties. As hadrons, $π^0$ and $η$ mesons are sensitive to both initial- and final-state nonperturbative effects for a mix of parton flavors. Comparisons of the differences in their transverse single-spin asymmetries have the potential to disentangle the possible effects of strangeness, isospin, or mass. These results can constrain the twist-3 trigluon collinear correlation function as well as the gluon Sivers function.
△ Less
Submitted 26 February, 2021; v1 submitted 28 November, 2020;
originally announced November 2020.
-
On Simultaneous Long-Short Stock Trading Controllers with Cross-Coupling
Authors:
Atul Deshpande,
John A Gubner,
B. Ross Barmish
Abstract:
The Simultaneous Long-Short(SLS) controller for trading a single stock is known to guarantee positive expected value of the resulting gain-loss function with respect to a large class of stock price dynamics. In the literature, this is known as the Robust Positive Expectation(RPE)property. An obvious way to extend this theory to the trading of two stocks is to trade each one of them using its own i…
▽ More
The Simultaneous Long-Short(SLS) controller for trading a single stock is known to guarantee positive expected value of the resulting gain-loss function with respect to a large class of stock price dynamics. In the literature, this is known as the Robust Positive Expectation(RPE)property. An obvious way to extend this theory to the trading of two stocks is to trade each one of them using its own independent SLS controller. Motivated by the fact that such a scheme does not exploit any correlation between the two stocks, we study the case when the relative sign between the drifts of the two stocks is known. The main contributions of this paper are three-fold: First, we put forward a novel architecture in which we cross-couple two SLS controllers for the two-stock case. Second, we derive a closed-form expression for the expected value of the gain-loss function. Third, we use this closed-form expression to prove that the RPE property is guaranteed with respect to a large class of stock-price dynamics. When more information over and above the relative sign is assumed, additional benefits of the new architecture are seen. For example, when bounds or precise values for the means and covariances of the stock returns are included in the model, numerical simulations suggest that our new controller can achieve lower trading risk than a pair of decoupled SLS controllers for the same level of expected trading gain.
△ Less
Submitted 18 November, 2020;
originally announced November 2020.
-
Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages
Authors:
Kushal Jain,
Adwait Deshpande,
Kumar Shridhar,
Felix Laumann,
Ayushman Dash
Abstract:
Language models based on the Transformer architecture have achieved state-of-the-art performance on a wide range of NLP tasks such as text classification, question-answering, and token classification. However, this performance is usually tested and reported on high-resource languages, like English, French, Spanish, and German. Indian languages, on the other hand, are underrepresented in such bench…
▽ More
Language models based on the Transformer architecture have achieved state-of-the-art performance on a wide range of NLP tasks such as text classification, question-answering, and token classification. However, this performance is usually tested and reported on high-resource languages, like English, French, Spanish, and German. Indian languages, on the other hand, are underrepresented in such benchmarks. Despite some Indian languages being included in training multilingual Transformer models, they have not been the primary focus of such work. In order to evaluate the performance on Indian languages specifically, we analyze these language models through extensive experiments on multiple downstream tasks in Hindi, Bengali, and Telugu language. Here, we compare the efficacy of fine-tuning model parameters of pre-trained models against that of training a language model from scratch. Moreover, we empirically argue against the strict dependency between the dataset size and model performance, but rather encourage task-specific model and method selection. We achieve state-of-the-art performance on Hindi and Bengali languages for text classification task. Finally, we present effective strategies for handling the modeling of Indian languages and we release our model checkpoints for the community : https://huggingface.co/neuralspace-reverie.
△ Less
Submitted 4 November, 2020;
originally announced November 2020.
-
Searching k-Optimal Goals for an Orienteering Problem on a Specialized Graph with Budget Constraints
Authors:
Abhinav Sharma,
Advait Deshpande,
Yanming Wang,
Xinyi Xu,
Prashan Madumal,
Anbin Hou
Abstract:
We propose a novel non-randomized anytime orienteering algorithm for finding k-optimal goals that maximize reward on a specialized graph with budget constraints. This specialized graph represents a real-world scenario which is analogous to an orienteering problem of finding k-most optimal goal states.
We propose a novel non-randomized anytime orienteering algorithm for finding k-optimal goals that maximize reward on a specialized graph with budget constraints. This specialized graph represents a real-world scenario which is analogous to an orienteering problem of finding k-most optimal goal states.
△ Less
Submitted 2 November, 2020;
originally announced November 2020.
-
On reaction network implementations of neural networks
Authors:
David F. Anderson,
Badal Joshi,
Abhishek Deshpande
Abstract:
This paper is concerned with the utilization of deterministically modeled chemical reaction networks for the implementation of (feed-forward) neural networks. We develop a general mathematical framework and prove that the ordinary differential equations (ODEs) associated with certain reaction network implementations of neural networks have desirable properties including (i) existence of unique pos…
▽ More
This paper is concerned with the utilization of deterministically modeled chemical reaction networks for the implementation of (feed-forward) neural networks. We develop a general mathematical framework and prove that the ordinary differential equations (ODEs) associated with certain reaction network implementations of neural networks have desirable properties including (i) existence of unique positive fixed points that are smooth in the parameters of the model (necessary for gradient descent), and (ii) fast convergence to the fixed point regardless of initial condition (necessary for efficient implementation). We do so by first making a connection between neural networks and fixed points for systems of ODEs, and then by constructing reaction networks with the correct associated set of ODEs. We demonstrate the theory by constructing a reaction network that implements a neural network with a smoothed ReLU activation function, though we also demonstrate how to generalize the construction to allow for other activation functions (each with the desirable properties listed previously). As there are multiple types of "networks" utilized in this paper, we also give a careful introduction to both reaction networks and neural networks, in order to disambiguate the overlap** vocabulary in the two settings and to clearly highlight the role of each network's properties.
△ Less
Submitted 8 March, 2021; v1 submitted 25 October, 2020;
originally announced October 2020.
-
Propagating residual biases in masked cosmic shear power spectra
Authors:
T. D. Kitching,
A. C. Deshpande,
P. L. Taylor
Abstract:
In this paper we derive a full expression for the propagation of weak lensing shape measurement biases into cosmic shear power spectra including the effect of missing data. We show using simulations that terms higher than first order in bias parameters can be ignored and the impact of biases can be captured by terms dependent only on the mean of the multiplicative bias field. We identify that the…
▽ More
In this paper we derive a full expression for the propagation of weak lensing shape measurement biases into cosmic shear power spectra including the effect of missing data. We show using simulations that terms higher than first order in bias parameters can be ignored and the impact of biases can be captured by terms dependent only on the mean of the multiplicative bias field. We identify that the B-mode power contains information on the multiplicative bias. We find that without priors on the residual multiplicative bias $δm$ and stochastic ellipticity variance $σ_e$ that constraints on the amplitude of the cosmic shear power spectrum are completely degenerate, and that when applying priors the constrained amplitude $A$ is slightly biased low via a classic marginalisation paradox. Using all-sky Gaussian random field simulations we find that the combination of $(1+2δm)A$ is unbiased for a joint EE and BB power spectrum likelihood if the error and mean (precision and accuracy) of the stochastic ellipticity variance is known to better than $σ(σ_e)\leq 0.05$ and $Δσ_e\leq 0.01$, or the multiplicative bias is known to better than $σ(m)\leq 0.07$ and $Δm\leq 0.01$.
△ Less
Submitted 14 December, 2020; v1 submitted 15 October, 2020;
originally announced October 2020.
-
On the Problem of Underranking in Group-Fair Ranking
Authors:
Sruthi Gorantla,
Amit Deshpande,
Anand Louis
Abstract:
Search and recommendation systems, such as search engines, recruiting tools, online marketplaces, news, and social media, output ranked lists of content, products, and sometimes, people. Credit ratings, standardized tests, risk assessments output only a score, but are also used implicitly for ranking. Bias in such ranking systems, especially among the top ranks, can worsen social and economic ineq…
▽ More
Search and recommendation systems, such as search engines, recruiting tools, online marketplaces, news, and social media, output ranked lists of content, products, and sometimes, people. Credit ratings, standardized tests, risk assessments output only a score, but are also used implicitly for ranking. Bias in such ranking systems, especially among the top ranks, can worsen social and economic inequalities, polarize opinions, and reinforce stereotypes. On the other hand, a bias correction for minority groups can cause more harm if perceived as favoring group-fair outcomes over meritocracy. In this paper, we formulate the problem of underranking in group-fair rankings, which was not addressed in previous work. Most group-fair ranking algorithms post-process a given ranking and output a group-fair ranking. We define underranking based on how close the group-fair rank of each item is to its original rank, and prove a lower bound on the trade-off achievable for simultaneous underranking and group fairness in ranking. We give a fair ranking algorithm that takes any given ranking and outputs another ranking with simultaneous underranking and group fairness guarantees comparable to the lower bound we prove. Our algorithm works with group fairness constraints for any number of groups. Our experimental results confirm the theoretical trade-off between underranking and group fairness, and also show that our algorithm achieves the best of both when compared to the state-of-the-art baselines.
△ Less
Submitted 18 February, 2021; v1 submitted 24 September, 2020;
originally announced October 2020.
-
QUIC-EST: A Transmission Scheme to Maximize VoI of Multi-Stream Correlated Data Flows
Authors:
Federico Chiariotti,
Anay Ajit Deshpande,
Marco Giordani,
Kostantinos Antonakoglou,
Andrea Zanella,
Toktam Mahmoodi
Abstract:
New advanced applications, such as autonomous driving and haptic communication, require to transmit multi-sensory data and require low latency and high reliability. These applications include. Existing implementations for such services have mostly relied on ad hoc scheduling and send rate adaptation mechanisms, implemented directly by the application and running over UDP. In this work, we propose…
▽ More
New advanced applications, such as autonomous driving and haptic communication, require to transmit multi-sensory data and require low latency and high reliability. These applications include. Existing implementations for such services have mostly relied on ad hoc scheduling and send rate adaptation mechanisms, implemented directly by the application and running over UDP. In this work, we propose a transmission scheme that relies on the features of the recently developed QUIC transport protocol, providing reliability where needed, and standardized congestion control, without compromising latency. Furthermore, we propose a scheduler for sensor data transmissions on the transport layer that can exploit the correlations over time and across sensors. This mechanism allows applications to maximize the Value of Information (VoI) of the transmitted data, as we demonstrate through simulations in two realistic application scenarios.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Optimal State Transfer and Entanglement Generation in Power-law Interacting Systems
Authors:
Minh C. Tran,
Abhinav Deshpande,
Andrew Y. Guo,
Andrew Lucas,
Alexey V. Gorshkov
Abstract:
We present an optimal protocol for encoding an unknown qubit state into a multiqubit Greenberger-Horne-Zeilinger-like state and, consequently, transferring quantum information in large systems exhibiting power-law ($1/r^α$) interactions. For all power-law exponents $α$ between $d$ and $2d+1$, where $d$ is the dimension of the system, the protocol yields a polynomial speedup for $α>2d$ and a superp…
▽ More
We present an optimal protocol for encoding an unknown qubit state into a multiqubit Greenberger-Horne-Zeilinger-like state and, consequently, transferring quantum information in large systems exhibiting power-law ($1/r^α$) interactions. For all power-law exponents $α$ between $d$ and $2d+1$, where $d$ is the dimension of the system, the protocol yields a polynomial speedup for $α>2d$ and a superpolynomial speedup for $α\leq 2d$, compared to the state of the art. For all $α>d$, the protocol saturates the Lieb-Robinson bounds (up to subpolynomial corrections), thereby establishing the optimality of the protocol and the tightness of the bounds in this regime. The protocol has a wide range of applications, including in quantum sensing, quantum computing, and preparation of topologically ordered states. In addition, the protocol provides a lower bound on the gate count in digital simulations of power-law interacting systems.
△ Less
Submitted 1 February, 2021; v1 submitted 6 October, 2020;
originally announced October 2020.
-
Guiding Attention for Self-Supervised Learning with Transformers
Authors:
Ameet Deshpande,
Karthik Narasimhan
Abstract:
In this paper, we propose a simple and effective technique to allow for efficient self-supervised learning with bi-directional Transformers. Our approach is motivated by recent studies demonstrating that self-attention patterns in trained models contain a majority of non-linguistic regularities. We propose a computationally efficient auxiliary loss function to guide attention heads to conform to s…
▽ More
In this paper, we propose a simple and effective technique to allow for efficient self-supervised learning with bi-directional Transformers. Our approach is motivated by recent studies demonstrating that self-attention patterns in trained models contain a majority of non-linguistic regularities. We propose a computationally efficient auxiliary loss function to guide attention heads to conform to such patterns. Our method is agnostic to the actual pre-training objective and results in faster convergence of models as well as better performance on downstream tasks compared to the baselines, achieving state of the art results in low-resource settings. Surprisingly, we also find that linguistic properties of attention heads are not necessarily correlated with language modeling performance.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Sentiment Analysis for Reinforcement Learning
Authors:
Ameet Deshpande,
Eve Fleisig
Abstract:
While reinforcement learning (RL) has been successful in natural language processing (NLP) domains such as dialogue generation and text-based games, it typically faces the problem of sparse rewards that leads to slow or no convergence. Traditional methods that use text descriptions to extract only a state representation ignore the feedback inherently present in them. In text-based games, for examp…
▽ More
While reinforcement learning (RL) has been successful in natural language processing (NLP) domains such as dialogue generation and text-based games, it typically faces the problem of sparse rewards that leads to slow or no convergence. Traditional methods that use text descriptions to extract only a state representation ignore the feedback inherently present in them. In text-based games, for example, descriptions like "Good Job! You ate the food}" indicate progress, and descriptions like "You entered a new room" indicate exploration. Positive and negative cues like these can be converted to rewards through sentiment analysis. This technique converts the sparse reward problem into a dense one, which is easier to solve. Furthermore, this can enable reinforcement learning without rewards, in which the agent learns entirely from these intrinsic sentiment rewards. This framework is similar to intrinsic motivation, where the environment does not necessarily provide the rewards, but the agent analyzes and realizes them by itself. We find that providing dense rewards in text-based games using sentiment analysis improves performance under some conditions.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Evaluating a Generative Adversarial Framework for Information Retrieval
Authors:
Ameet Deshpande,
Mitesh M. Khapra
Abstract:
Recent advances in Generative Adversarial Networks (GANs) have resulted in its widespread applications to multiple domains. A recent model, IRGAN, applies this framework to Information Retrieval (IR) and has gained significant attention over the last few years. In this focused work, we critically analyze multiple components of IRGAN, while providing experimental and theoretical evidence of some of…
▽ More
Recent advances in Generative Adversarial Networks (GANs) have resulted in its widespread applications to multiple domains. A recent model, IRGAN, applies this framework to Information Retrieval (IR) and has gained significant attention over the last few years. In this focused work, we critically analyze multiple components of IRGAN, while providing experimental and theoretical evidence of some of its shortcomings. Specifically, we identify issues with the constant baseline term in the policy gradients optimization and show that the generator harms IRGAN's performance. Motivated by our findings, we propose two models influenced by self-contrastive estimation and co-training which outperform IRGAN on two out of the three tasks considered.
△ Less
Submitted 1 October, 2020;
originally announced October 2020.
-
CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes
Authors:
Raeid Saqur,
Ameet Deshpande
Abstract:
The CLEVR dataset has been used extensively in language grounded visual reasoning in Machine Learning (ML) and Natural Language Processing (NLP) domains. We present a graph parser library for CLEVR, that provides functionalities for object-centric attributes and relationships extraction, and construction of structural graph representations for dual modalities. Structural order-invariant representa…
▽ More
The CLEVR dataset has been used extensively in language grounded visual reasoning in Machine Learning (ML) and Natural Language Processing (NLP) domains. We present a graph parser library for CLEVR, that provides functionalities for object-centric attributes and relationships extraction, and construction of structural graph representations for dual modalities. Structural order-invariant representations enable geometric learning and can aid in downstream tasks like language grounding to vision, robotics, compositionality, interpretability, and computational grammar construction. We provide three extensible main components - parser, embedder, and visualizer that can be tailored to suit specific learning setups. We also provide out-of-the-box functionality for seamless integration with popular deep graph neural network (GNN) libraries. Additionally, we discuss downstream usage and applications of the library, and how it accelerates research for the NLP research community.
△ Less
Submitted 1 October, 2020; v1 submitted 18 September, 2020;
originally announced September 2020.
-
Accessing the high-$\ell$ frontier under the Reduced Shear Approximation with $k$-cut Cosmic Shear
Authors:
Anurag C. Deshpande,
Peter L. Taylor,
Thomas D. Kitching
Abstract:
The precision of Stage IV cosmic shear surveys will enable us to probe smaller physical scales than ever before, however, model uncertainties from baryonic physics and non-linear structure formation will become a significant concern. The $k$-cut method -- applying a redshift-dependent $\ell$-cut after making the Bernardeau-Nishimichi-Taruya transform -- can reduce sensitivity to baryonic physics;…
▽ More
The precision of Stage IV cosmic shear surveys will enable us to probe smaller physical scales than ever before, however, model uncertainties from baryonic physics and non-linear structure formation will become a significant concern. The $k$-cut method -- applying a redshift-dependent $\ell$-cut after making the Bernardeau-Nishimichi-Taruya transform -- can reduce sensitivity to baryonic physics; allowing Stage IV surveys to include information from increasingly higher $\ell$-modes. Here we address the question of whether it can also mitigate the impact of making the reduced shear approximation; which is also important in the high-$κ$, small-scale regime. The standard procedure for relaxing this approximation requires the repeated evaluation of the convergence bispectrum, and consequently can be prohibitively computationally expensive when included in Monte Carlo analyses. We find that the $k$-cut cosmic shear procedure suppresses the $w_0w_a$CDM cosmological parameter biases expected from the reduced shear approximation for Stage IV experiments, when $\ell$-modes up to $5000$ are probed. The maximum cut required for biases from the reduced shear approximation to be below the threshold of significance is at $k = 5.37 \, h{\rm Mpc}^{-1}$. With this cut, the predicted $1σ$ constraints increase, relative to the case where the correction is directly computed, by less than $10\%$ for all parameters. This represents a significant improvement in constraints compared to the more conservative case where only $\ell$-modes up to 1500 are probed, and no $k$-cut is used. We also repeat this analysis for a hypothetical, comparable kinematic weak lensing survey. The key parts of code used for this analysis are made publicly available.
△ Less
Submitted 26 October, 2020; v1 submitted 3 September, 2020;
originally announced September 2020.
-
Importance of the spectral gap in estimating ground-state energies
Authors:
Abhinav Deshpande,
Alexey V. Gorshkov,
Bill Fefferman
Abstract:
The field of quantum Hamiltonian complexity lies at the intersection of quantum many-body physics and computational complexity theory, with deep implications to both fields. The main object of study is the LocalHamiltonian problem, which is concerned with estimating the ground-state energy of a local Hamiltonian and is complete for the class QMA, a quantum generalization of the class NP. A major c…
▽ More
The field of quantum Hamiltonian complexity lies at the intersection of quantum many-body physics and computational complexity theory, with deep implications to both fields. The main object of study is the LocalHamiltonian problem, which is concerned with estimating the ground-state energy of a local Hamiltonian and is complete for the class QMA, a quantum generalization of the class NP. A major challenge in the field is to understand the complexity of the LocalHamiltonian problem in more physically natural parameter regimes. One crucial parameter in understanding the ground space of any Hamiltonian in many-body physics is the spectral gap, which is the difference between the smallest two eigenvalues. Despite its importance in quantum many-body physics, the role played by the spectral gap in the complexity of the LocalHamiltonian is less well-understood. In this work, we make progress on this question by considering the precise regime, in which one estimates the ground-state energy to within inverse exponential precision. Computing ground-state energies precisely is a task that is important for quantum chemistry and quantum many-body physics.
In the setting of inverse-exponential precision, there is a surprising result that the complexity of LocalHamiltonian is magnified from QMA to PSPACE, the class of problems solvable in polynomial space. We clarify the reason behind this boost in complexity. Specifically, we show that the full complexity of the high precision case only comes about when the spectral gap is exponentially small. As a consequence of the proof techniques developed to show our results, we uncover important implications for the representability and circuit complexity of ground states of local Hamiltonians, the theory of uniqueness of quantum witnesses, and techniques for the amplification of quantum witnesses in the presence of postselection.
△ Less
Submitted 9 December, 2022; v1 submitted 22 July, 2020;
originally announced July 2020.
-
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors
Authors:
Aditya M. Deshpande,
Rumit Kumar,
Ali A. Minai,
Manish Kumar
Abstract:
In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadco…
▽ More
In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadcopter (comparatively simple UAV design without thrust vectoring). This approach allows learning a control policy for systems with multiple inputs and multiple outputs. The performance of the learned policy is evaluated by physics-based simulations for the tasks of hovering and way-point navigation. The flight simulations utilize a flight controller based on reinforcement learning without any additional PID components. The results show faster learning with the presented approach as opposed to learning the control policy from scratch for this new UAV design created by modifications in a conventional quadcopter, i.e., the addition of more degrees of freedom (4-actuators in conventional quadcopter to 8-actuators in tilt-rotor quadcopter). We demonstrate the robustness of our learned policy by showing the recovery of the tilt-rotor platform in the simulation from various non-static initial conditions in order to reach a desired state. The developmental policy for the tilt-rotor UAV also showed superior fault tolerance when compared with the policy learned from the scratch. The results show the ability of the presented approach to bootstrap the learned behavior from a simpler system (lower-dimensional action-space) to a more complex robot (comparatively higher-dimensional action-space) and reach better performance faster.
△ Less
Submitted 15 July, 2020;
originally announced July 2020.
-
Growth, Properties, and Applications of Pulsed Laser Deposited Nanolaminate Ti3AlC2 Thin Films
Authors:
Abhijit Biswas,
Arundhati Sengupta,
Umashankar Rajput,
Sachin Kumar Singh,
Vivek Antad,
Sk Mujaffar Hossain,
Swati Parmar,
Dibyata Rout,
Aparna Deshpande,
Sunil Nair,
Satishchandra Ogale
Abstract:
Recently, nanolaminated ternary carbides have attracted immense interest due to the concomitant presence of both ceramic and metallic properties. Here, we grow nanolaminate Ti3AlC2 thin films by pulsed laser deposition on c-axis-oriented sapphire substrates and, surprisingly, the films are found to be highly oriented along the (103) axis normal to the film plane, rather than the (000l) orientation…
▽ More
Recently, nanolaminated ternary carbides have attracted immense interest due to the concomitant presence of both ceramic and metallic properties. Here, we grow nanolaminate Ti3AlC2 thin films by pulsed laser deposition on c-axis-oriented sapphire substrates and, surprisingly, the films are found to be highly oriented along the (103) axis normal to the film plane, rather than the (000l) orientation. Multiple characterization techniques are employed to explore the structural and chemical quality of these films, the electrical and optical properties, and the device functionalities. The 80-nm thick Ti3AlC2 film is highly conducting at room temperature (resistivity of 50 micro ohm-cm), and a very-low-temperature coefficient of resistivity. The ultrathin (2 nm) Ti3AlC2 film has fairly good optical transparency and high conductivity at room temperature (sheet resistance of 735 ohm). Scanning tunneling microscopy reveals the metallic characteristics (with finite density of states at the Fermi level) at room temperature. The metal-semiconductor junction of the p-type Ti3AlC2 film and n-Si show the expected rectification (diode) characteristics, in contrast to the ohmic contact behavior in the case of Ti3AlC2 on p-Si. A triboelectric-nanogenerator-based touch-sensing device, comprising of the Ti3AlC2 film, shows a very impressive peak-to-peak open-circuit output voltage of 80 V. These observations reveal that pulsed laser deposited Ti3AlC2 thin films have excellent potential for applications in multiple domains, such as bottom electrodes, resistors for high-precision measurements, Schottky diodes, ohmic contacts, fairly transparent ultrathin conductors, and next-generation biomechanical touch sensors for energy harvesting.
△ Less
Submitted 9 July, 2020;
originally announced July 2020.
-
Implementing a Fast Unbounded Quantum Fanout Gate Using Power-Law Interactions
Authors:
Andrew Y. Guo,
Abhinav Deshpande,
Su-Kuan Chu,
Zachary Eldredge,
Przemyslaw Bienias,
Dhruv Devulapalli,
Yuan Su,
Andrew M. Childs,
Alexey V. Gorshkov
Abstract:
The standard circuit model for quantum computation presumes the ability to directly perform gates between arbitrary pairs of qubits, which is unlikely to be practical for large-scale experiments. Power-law interactions with strength decaying as $1/r^α$ in the distance $r$ provide an experimentally realizable resource for information processing, whilst still retaining long-range connectivity. We le…
▽ More
The standard circuit model for quantum computation presumes the ability to directly perform gates between arbitrary pairs of qubits, which is unlikely to be practical for large-scale experiments. Power-law interactions with strength decaying as $1/r^α$ in the distance $r$ provide an experimentally realizable resource for information processing, whilst still retaining long-range connectivity. We leverage the power of these interactions to implement a fast quantum fanout gate with an arbitrary number of targets. Our implementation allows the quantum Fourier transform (QFT) and Shor's algorithm to be performed on a $D$-dimensional lattice in time logarithmic in the number of qubits for interactions with $α\le D$. As a corollary, we show that power-law systems with $α\le D$ are difficult to simulate classically even for short times, under a standard assumption that factoring is classically intractable. Complementarily, we develop a new technique to give a general lower bound, linear in the size of the system, on the time required to implement the QFT and the fanout gate in systems that are constrained by a linear light cone. This allows us to prove an asymptotically tighter lower bound for long-range systems than is possible with previously available techniques.
△ Less
Submitted 1 July, 2020;
originally announced July 2020.
-
Subspace approximation with outliers
Authors:
Amit Deshpande,
Rameshwar Pratap
Abstract:
The subspace approximation problem with outliers, for given $n$ points in $d$ dimensions $x_{1},\ldots, x_{n} \in R^{d}$, an integer $1 \leq k \leq d$, and an outlier parameter $0 \leq α\leq 1$, is to find a $k$-dimensional linear subspace of $R^{d}$ that minimizes the sum of squared distances to its nearest $(1-α)n$ points. More generally, the $\ell_{p}$ subspace approximation problem with outlie…
▽ More
The subspace approximation problem with outliers, for given $n$ points in $d$ dimensions $x_{1},\ldots, x_{n} \in R^{d}$, an integer $1 \leq k \leq d$, and an outlier parameter $0 \leq α\leq 1$, is to find a $k$-dimensional linear subspace of $R^{d}$ that minimizes the sum of squared distances to its nearest $(1-α)n$ points. More generally, the $\ell_{p}$ subspace approximation problem with outliers minimizes the sum of $p$-th powers of distances instead of the sum of squared distances. Even the case of robust PCA is non-trivial, and previous work requires additional assumptions on the input. Any multiplicative approximation algorithm for the subspace approximation problem with outliers must solve the robust subspace recovery problem, a special case in which the $(1-α)n$ inliers in the optimal solution are promised to lie exactly on a $k$-dimensional linear subspace. However, robust subspace recovery is Small Set Expansion (SSE)-hard.
We show how to extend dimension reduction techniques and bi-criteria approximations based on sampling to the problem of subspace approximation with outliers. To get around the SSE-hardness of robust subspace recovery, we assume that the squared distance error of the optimal $k$-dimensional subspace summed over the optimal $(1-α)n$ inliers is at least $δ$ times its squared-error summed over all $n$ points, for some $0 < δ\leq 1 - α$. With this assumption, we give an efficient algorithm to find a subset of $poly(k/ε) \log(1/δ) \log\log(1/δ)$ points whose span contains a $k$-dimensional subspace that gives a multiplicative $(1+ε)$-approximation to the optimal solution. The running time of our algorithm is linear in $n$ and $d$. Interestingly, our results hold even when the fraction of outliers $α$ is large, as long as the obvious condition $0 < δ\leq 1 - α$ is satisfied.
△ Less
Submitted 30 June, 2020;
originally announced June 2020.
-
Quaternion Feedback Based Autonomous Control of a Quadcopter UAV with Thrust Vectoring Rotors
Authors:
Rumit Kumar,
Mahathi Bhargavapuri,
Aditya M. Deshpande,
Siddharth Sridhar,
Kelly Cohen,
Manish Kumar
Abstract:
In this paper, we present an autonomous flight controller for a quadcopter with thrust vectoring capabilities. This UAV falls in the category of multirotors with tilt-motion enabled rotors. Since the vehicle considered is over-actuated in nature, the dynamics and control allocation have to be analysed carefully. Moreover, the possibility of hovering at large attitude maneuvers of this novel vehicl…
▽ More
In this paper, we present an autonomous flight controller for a quadcopter with thrust vectoring capabilities. This UAV falls in the category of multirotors with tilt-motion enabled rotors. Since the vehicle considered is over-actuated in nature, the dynamics and control allocation have to be analysed carefully. Moreover, the possibility of hovering at large attitude maneuvers of this novel vehicle requires singularity-free attitude control. Hence, quaternion state feedback is utilized to compute the control commands for the UAV motors while avoiding the gimbal lock condition experienced by Euler angle based controllers. The quaternion implementation also reduces the overall complexity of state estimation due to absence of trigonometric parameters. The quadcopter dynamic model and state space is utilized to design the attitude controller and control allocation for the UAV. The control allocation, in particular, is derived by linearizing the system about hover condition. This mathematical method renders the control allocation more accurate than existing approaches. Lyapunov stability analysis of the attitude controller is shown to prove global stability. The quaternion feedback attitude controller is commanded by an outer position controller loop which generates rotor-tilt and desired quaternions commands for the system. The performance of the UAV is evaluated by numerical simulations for tracking attitude step commands and for following a way-point navigation mission.
△ Less
Submitted 28 June, 2020;
originally announced June 2020.
-
How do SGD hyperparameters in natural training affect adversarial robustness?
Authors:
Sandesh Kamath,
Amit Deshpande,
K V Subrahmanyam
Abstract:
Learning rate, batch size and momentum are three important hyperparameters in the SGD algorithm. It is known from the work of Jastrzebski et al. arXiv:1711.04623 that large batch size training of neural networks yields models which do not generalize well. Yao et al. arXiv:1802.08241 observe that large batch training yields models that have poor adversarial robustness. In the same paper, the author…
▽ More
Learning rate, batch size and momentum are three important hyperparameters in the SGD algorithm. It is known from the work of Jastrzebski et al. arXiv:1711.04623 that large batch size training of neural networks yields models which do not generalize well. Yao et al. arXiv:1802.08241 observe that large batch training yields models that have poor adversarial robustness. In the same paper, the authors train models with different batch sizes and compute the eigenvalues of the Hessian of loss function. They observe that as the batch size increases, the dominant eigenvalues of the Hessian become larger. They also show that both adversarial training and small-batch training leads to a drop in the dominant eigenvalues of the Hessian or lowering its spectrum. They combine adversarial training and second order information to come up with a new large-batch training algorithm and obtain robust models with good generalization. In this paper, we empirically observe the effect of the SGD hyperparameters on the accuracy and adversarial robustness of networks trained with unperturbed samples. Jastrzebski et al. considered training models with a fixed learning rate to batch size ratio. They observed that higher the ratio, better is the generalization. We observe that networks trained with constant learning rate to batch size ratio, as proposed in Jastrzebski et al., yield models which generalize well and also have almost constant adversarial robustness, independent of the batch size. We observe that momentum is more effective with varying batch sizes and a fixed learning rate than with constant learning rate to batch size ratio based SGD training.
△ Less
Submitted 20 June, 2020;
originally announced June 2020.
-
Minimal invariant regions and minimal globally attracting regions for toric differential inclusions
Authors:
Yida Ding,
Abhishek Deshpande,
Gheorghe Craciun
Abstract:
Toric differential inclusions occur as key dynamical systems in the context of the Global Attractor Conjecture. We introduce the notions of minimal invariant regions and minimal globally attracting regions for toric differential inclusions. We describe a procedure for constructing explicitly the minimal invariant and minimal globally attracting regions for two-dimensional toric differential inclus…
▽ More
Toric differential inclusions occur as key dynamical systems in the context of the Global Attractor Conjecture. We introduce the notions of minimal invariant regions and minimal globally attracting regions for toric differential inclusions. We describe a procedure for constructing explicitly the minimal invariant and minimal globally attracting regions for two-dimensional toric differential inclusions. In particular, we obtain invariant regions and globally attracting regions for two-dimensional weakly reversible or endotactic dynamical systems (even if they have time-dependent parameters).
△ Less
Submitted 15 June, 2020;
originally announced June 2020.
-
Needles in the 'Sheet'stack: Augmented Analytics to get Insights from Spreadsheets
Authors:
Medha Atre,
Anand Deshpande,
Reshma Godse,
Pooja Deokar,
Sandip Moharir,
Dhruva Ray,
Akshay Chitlangia,
Trupti Phadnis,
Yugansh Goyal
Abstract:
Business intelligence (BI) tools for database analytics have come a long way and nowadays also provide ready insights or visual query explorations, e.g. QuickInsights by Microsoft Power BI, SpotIQ by ThoughtSpot, Zenvisage, etc. In this demo, we focus on providing insights by examining periodic spreadsheets of different reports (aka views), without prior knowledge of the schema of the database or…
▽ More
Business intelligence (BI) tools for database analytics have come a long way and nowadays also provide ready insights or visual query explorations, e.g. QuickInsights by Microsoft Power BI, SpotIQ by ThoughtSpot, Zenvisage, etc. In this demo, we focus on providing insights by examining periodic spreadsheets of different reports (aka views), without prior knowledge of the schema of the database or reports, or data information. Such a solution is targeted at users without the familiarity with the database schema or resources to conduct analytics in the contemporary way.
△ Less
Submitted 15 June, 2020;
originally announced June 2020.
-
On Universalized Adversarial and Invariant Perturbations
Authors:
Sandesh Kamath,
Amit Deshpande,
K V Subrahmanyam
Abstract:
Convolutional neural networks or standard CNNs (StdCNNs) are translation-equivariant models that achieve translation invariance when trained on data augmented with sufficient translations. Recent work on equivariant models for a given group of transformations (e.g., rotations) has lead to group-equivariant convolutional neural networks (GCNNs). GCNNs trained on data augmented with sufficient rotat…
▽ More
Convolutional neural networks or standard CNNs (StdCNNs) are translation-equivariant models that achieve translation invariance when trained on data augmented with sufficient translations. Recent work on equivariant models for a given group of transformations (e.g., rotations) has lead to group-equivariant convolutional neural networks (GCNNs). GCNNs trained on data augmented with sufficient rotations achieve rotation invariance. Recent work by authors arXiv:2002.11318 studies a trade-off between invariance and robustness to adversarial attacks. In another related work arXiv:2005.08632, given any model and any input-dependent attack that satisfies a certain spectral property, the authors propose a universalization technique called SVD-Universal to produce a universal adversarial perturbation by looking at very few test examples. In this paper, we study the effectiveness of SVD-Universal on GCNNs as they gain rotation invariance through higher degree of training augmentation. We empirically observe that as GCNNs gain rotation invariance through training augmented with larger rotations, the fooling rate of SVD-Universal gets better. To understand this phenomenon, we introduce universal invariant directions and study their relation to the universal adversarial direction produced by SVD-Universal.
△ Less
Submitted 8 June, 2020;
originally announced June 2020.
-
Production of $π^0$ and $η$ mesons in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV
Authors:
U. Acharya,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
B. Bannier,
K. N. Barish,
S. Bathe,
V. Baublis,
C. Baumann,
S. Baumgart,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov
, et al. (378 additional authors not shown)
Abstract:
The PHENIX experiment at the Relativistic Heavy Ion Collider measured $π^0$ and $η$ mesons at midrapidity in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV in a wide transverse momentum range. Measurements were performed in the $π^0(η)\rightarrowγγ$ decay modes. A strong suppression of $π^0$ and $η$ meson production at high transverse momentum was observed in central U$+$U collisions relative to b…
▽ More
The PHENIX experiment at the Relativistic Heavy Ion Collider measured $π^0$ and $η$ mesons at midrapidity in U$+$U collisions at $\sqrt{s_{_{NN}}}=192$ GeV in a wide transverse momentum range. Measurements were performed in the $π^0(η)\rightarrowγγ$ decay modes. A strong suppression of $π^0$ and $η$ meson production at high transverse momentum was observed in central U$+$U collisions relative to binary scaled $p$$+$$p$ results. Yields of $π^0$ and $η$ mesons measured in U$+$U collisions show similar suppression pattern to the ones measured in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV for similar numbers of participant nucleons. The $η$/$π^0$ ratios do not show dependence on centrality or transverse momentum, and are consistent with previously measured values in hadron-hadron, hadron-nucleus, nucleus-nucleus, and $e^+e^-$ collisions.
△ Less
Submitted 13 November, 2020; v1 submitted 29 May, 2020;
originally announced May 2020.
-
Production of $b\bar{b}$ at forward rapidity in $p$+$p$ collisions at $\sqrt{s}=510$ GeV
Authors:
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
M. Alfred,
N. Apadula,
Y. Aramaki,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov
, et al. (325 additional authors not shown)
Abstract:
The cross section of bottom quark-antiquark ($b\bar{b}$) production in $p$+$p$ collisions at $\sqrt{s}=510$ GeV is measured with the PHENIX detector at the Relativistic Heavy Ion Collider. The results are based on the yield of high mass, like-sign muon pairs measured within the PHENIX muon arm acceptance ($1.2<|y|<2.2$). The $b\bar{b}$ signal is extracted from like-sign dimuons by utilizing the un…
▽ More
The cross section of bottom quark-antiquark ($b\bar{b}$) production in $p$+$p$ collisions at $\sqrt{s}=510$ GeV is measured with the PHENIX detector at the Relativistic Heavy Ion Collider. The results are based on the yield of high mass, like-sign muon pairs measured within the PHENIX muon arm acceptance ($1.2<|y|<2.2$). The $b\bar{b}$ signal is extracted from like-sign dimuons by utilizing the unique properties of neutral $B$ meson oscillation. We report a differential cross section of $dσ_{b\bar{b}\rightarrow μ^\pmμ^\pm}/dy = 0.16 \pm 0.01~(\mbox{stat}) \pm 0.02~(\mbox{syst}) \pm 0.02~(\mbox{global})$ nb for like-sign muons in the rapidity and $p_T$ ranges $1.2<|y|<2.2$ and $p_T>1$ GeV/$c$, and dimuon mass of 5--10 GeV/$c^2$. The extrapolated total cross section at this energy for $b\bar{b}$ production is $13.1 \pm 0.6~(\mbox{stat}) \pm 1.5~(\mbox{syst}) \pm 2.7~(\mbox{global})~μ$b. The total cross section is compared to a perturbative quantum chromodynamics calculation and is consistent within uncertainties. The azimuthal opening angle between muon pairs from $b\bar{b}$ decays and their $p_T$ distributions are compared to distributions generated using {\sc ps pythia 6}, which includes next-to-leading order processes. The azimuthal correlations and pair $p_T$ distribution are not very well described by {\sc pythia} calculations, but are still consistent within uncertainties. Flavor creation and flavor excitation subprocesses are favored over gluon splitting.
△ Less
Submitted 27 October, 2020; v1 submitted 28 May, 2020;
originally announced May 2020.
-
Polarization and cross section of midrapidity J/$ψ$ production in proton-proton collisions at $\sqrt{s}=510$ GeV
Authors:
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
M. Alfred,
N. Apadula,
Y. Aramaki,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov
, et al. (325 additional authors not shown)
Abstract:
The PHENIX experiment has measured the spin alignment for inclusive $J/ψ\rightarrow e^{+}e^{-}$ decays in $p$+$p$ collisions at $\sqrt{s}=510$ GeV at midrapidity. The angular distributions have been measured in three different polarization frames, and the three decay angular coefficients have been extracted in a full two-dimensional analysis. Previously, PHENIX saw large longitudinal net polarizat…
▽ More
The PHENIX experiment has measured the spin alignment for inclusive $J/ψ\rightarrow e^{+}e^{-}$ decays in $p$+$p$ collisions at $\sqrt{s}=510$ GeV at midrapidity. The angular distributions have been measured in three different polarization frames, and the three decay angular coefficients have been extracted in a full two-dimensional analysis. Previously, PHENIX saw large longitudinal net polarization at forward rapidity at the same collision energy. This analysis at midrapidity, complementary to the previous PHENIX results, sees no sizable polarization in the measured transverse momentum range of $0.0<p_T<10.0$ GeV/$c$. The results are consistent with a previous one-dimensional analysis at midrapidity at $\sqrt{s}=200$ GeV. The transverse-momentum-dependent cross section for midrapidity $J/ψ$ production has additionally been measured, and after comparison to world data we find a simple logarithmic dependence of the cross section on $\sqrt{s}$.
△ Less
Submitted 27 October, 2020; v1 submitted 28 May, 2020;
originally announced May 2020.
-
Measurement of jet-medium interactions via direct photon-hadron correlations in Au$+$Au and $d$$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
U. Acharya,
A. Adare,
S. Afanasiev,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Bataineh,
J. Alexander,
H. Al-Ta'ani,
A. Angerami,
K. Aoki,
N. Apadula,
Y. Aramaki,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
R. Averbeck,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
G. Baksay,
L. Baksay,
B. Bannier
, et al. (553 additional authors not shown)
Abstract:
We present direct photon-hadron correlations in 200 GeV/A Au$+$Au, $d$$+$Au and $p$$+$$p$ collisions, for direct photon $p_T$ from 5--12 GeV/$c$, collected by the PHENIX Collaboration in the years from 2006 to 2011. We observe no significant modification of jet fragmentation in $d$$+$Au collisions, indicating that cold nuclear matter effects are small or absent. Hadrons carrying a large fraction o…
▽ More
We present direct photon-hadron correlations in 200 GeV/A Au$+$Au, $d$$+$Au and $p$$+$$p$ collisions, for direct photon $p_T$ from 5--12 GeV/$c$, collected by the PHENIX Collaboration in the years from 2006 to 2011. We observe no significant modification of jet fragmentation in $d$$+$Au collisions, indicating that cold nuclear matter effects are small or absent. Hadrons carrying a large fraction of the quark's momentum are suppressed in Au$+$Au compared to $p$$+$$p$ and $d$$+$Au. As the momentum fraction decreases, the yield of hadrons in Au$+$Au increases to an excess over the yield in $p$$+$$p$ collisions. The excess is at large angles and at low hadron $p_T$ and is most pronounced for hadrons associated with lower momentum direct photons. Comparison to theoretical calculations suggests that the hadron excess arises from medium response to energy deposited by jets.
△ Less
Submitted 19 November, 2020; v1 submitted 28 May, 2020;
originally announced May 2020.
-
Complexity of Fermionic Dissipative Interactions and Applications to Quantum Computing
Authors:
Oles Shtanko,
Abhinav Deshpande,
Paul S. Julienne,
Alexey V. Gorshkov
Abstract:
Interactions between particles are usually a resource for quantum computing, making quantum many-body systems intractable by any known classical algorithm. In contrast, noise is typically considered as being inimical to quantum many-body correlations, ultimately leading the system to a classically tractable state. This work shows that noise represented by two-body processes, such as pair loss, pla…
▽ More
Interactions between particles are usually a resource for quantum computing, making quantum many-body systems intractable by any known classical algorithm. In contrast, noise is typically considered as being inimical to quantum many-body correlations, ultimately leading the system to a classically tractable state. This work shows that noise represented by two-body processes, such as pair loss, plays the same role as many-body interactions and makes otherwise classically simulable systems universal for quantum computing. We analyze such processes in detail and establish a complexity transition between simulable and nonsimulable systems as a function of a tuning parameter. We determine important classes of simulable and nonsimulable two-body dissipation. Finally, we show how using resonant dissipation in cold atoms can enhance the performance of two-qubit gates.
△ Less
Submitted 17 September, 2021; v1 submitted 21 May, 2020;
originally announced May 2020.
-
Universalization of any adversarial attack using very few test examples
Authors:
Sandesh Kamath,
Amit Deshpande,
K V Subrahmanyam,
Vineeth N Balasubramanian
Abstract:
Deep learning models are known to be vulnerable not only to input-dependent adversarial attacks but also to input-agnostic or universal adversarial attacks. Dezfooli et al. \cite{Dezfooli17,Dezfooli17anal} construct universal adversarial attack on a given model by looking at a large number of training data points and the geometry of the decision boundary near them. Subsequent work \cite{Khrulkov18…
▽ More
Deep learning models are known to be vulnerable not only to input-dependent adversarial attacks but also to input-agnostic or universal adversarial attacks. Dezfooli et al. \cite{Dezfooli17,Dezfooli17anal} construct universal adversarial attack on a given model by looking at a large number of training data points and the geometry of the decision boundary near them. Subsequent work \cite{Khrulkov18} constructs universal attack by looking only at test examples and intermediate layers of the given model. In this paper, we propose a simple universalization technique to take any input-dependent adversarial attack and construct a universal attack by only looking at very few adversarial test examples. We do not require details of the given model and have negligible computational overhead for universalization. We theoretically justify our universalization technique by a spectral property common to many input-dependent adversarial perturbations, e.g., gradients, Fast Gradient Sign Method (FGSM) and DeepFool. Using matrix concentration inequalities and spectral perturbation bounds, we show that the top singular vector of input-dependent adversarial directions on a small test sample gives an effective and simple universal adversarial attack. For VGG16 and VGG19 models trained on ImageNet, our simple universalization of Gradient, FGSM, and DeepFool perturbations using a test sample of 64 images gives fooling rates comparable to state-of-the-art universal attacks \cite{Dezfooli17,Khrulkov18} for reasonable norms of perturbation. Code available at https://github.com/ksandeshk/svd-uap .
△ Less
Submitted 28 October, 2022; v1 submitted 18 May, 2020;
originally announced May 2020.
-
Computer Vision Toolkit for Non-invasive Monitoring of Factory Floor Artifacts
Authors:
Aditya M. Deshpande,
Anil Kumar Telikicherla,
Vinay Jakkali,
David A. Wickelhaus,
Manish Kumar,
Sam Anand
Abstract:
Digitization has led to smart, connected technologies be an integral part of businesses, governments and communities. For manufacturing digitization, there has been active research and development with a focus on Cloud Manufacturing (CM) and the Industrial Internet of Things (IIoT). This work presents a computer vision toolkit (CV Toolkit) for non-invasive digitization of the factory floor in line…
▽ More
Digitization has led to smart, connected technologies be an integral part of businesses, governments and communities. For manufacturing digitization, there has been active research and development with a focus on Cloud Manufacturing (CM) and the Industrial Internet of Things (IIoT). This work presents a computer vision toolkit (CV Toolkit) for non-invasive digitization of the factory floor in line with Industry 4.0 requirements for factory data collection. Currently, technical challenges persist towards digitization of legacy systems due to the limitation for changes in their design and sensors. This novel toolkit is developed to facilitate easy integration of legacy production machinery and factory floor artifacts with the digital and smart manufacturing environment with no requirement of any physical changes in the machines. The system developed is modular, and allows real-time monitoring of production machinery. Modularity aspect allows the incorporation of new software applications in the current framework of CV Toolkit. To allow connectivity of this toolkit with manufacturing floors in a simple, deployable and cost-effective manner, the toolkit is integrated with a known manufacturing data standard, MTConnect, to "translate" the digital inputs into data streams that can be read by commercial status tracking and reporting software solutions. The proposed toolkit is demonstrated using a mock-panel environment developed in house at the University of Cincinnati to highlight its usability.
△ Less
Submitted 12 May, 2020;
originally announced May 2020.
-
One-Shot Recognition of Manufacturing Defects in Steel Surfaces
Authors:
Aditya M. Deshpande,
Ali A. Minai,
Manish Kumar
Abstract:
Quality control is an essential process in manufacturing to make the product defect-free as well as to meet customer needs. The automation of this process is important to maintain high quality along with the high manufacturing throughput. With recent developments in deep learning and computer vision technologies, it has become possible to detect various features from the images with near-human acc…
▽ More
Quality control is an essential process in manufacturing to make the product defect-free as well as to meet customer needs. The automation of this process is important to maintain high quality along with the high manufacturing throughput. With recent developments in deep learning and computer vision technologies, it has become possible to detect various features from the images with near-human accuracy. However, many of these approaches are data intensive. Training and deployment of such a system on manufacturing floors may become expensive and time-consuming. The need for large amounts of training data is one of the limitations of the applicability of these approaches in real-world manufacturing systems. In this work, we propose the application of a Siamese convolutional neural network to do one-shot recognition for such a task. Our results demonstrate how one-shot learning can be used in quality control of steel by identification of defects on the steel surface. This method can significantly reduce the requirements of training data and can also be run in real-time.
△ Less
Submitted 12 May, 2020;
originally announced May 2020.
-
Flight Control of Sliding Arm Quadcopter with Dynamic Structural Parameters
Authors:
Rumit Kumar,
Aditya M. Deshpande,
James Z. Wells,
Manish Kumar
Abstract:
The conceptual design and flight controller of a novel kind of quadcopter are presented. This design is capable of morphing the shape of the UAV during flight to achieve position and attitude control. We consider a dynamic center of gravity (CoG) which causes continuous variation in a moment of inertia (MoI) parameters of the UAV in this design. These dynamic structural parameters play a vital rol…
▽ More
The conceptual design and flight controller of a novel kind of quadcopter are presented. This design is capable of morphing the shape of the UAV during flight to achieve position and attitude control. We consider a dynamic center of gravity (CoG) which causes continuous variation in a moment of inertia (MoI) parameters of the UAV in this design. These dynamic structural parameters play a vital role in the stability and control of the system. The length of quadcopter arms is a variable parameter, and it is actuated using attitude feedback-based control law. The MoI parameters are computed in real-time and incorporated in the equations of motion of the system. The UAV utilizes the angular motion of propellers and variable quadcopter arm lengths for position and navigation control. The movement space of the CoG is a design parameter and it is bounded by actuator limitations and stability requirements of the system. A detailed information on equations of motion, flight controller design and possible applications of this system are provided. Further, the proposed shape-changing UAV system is evaluated by comparative numerical simulations for way point navigation mission and complex trajectory tracking.
△ Less
Submitted 27 April, 2020;
originally announced April 2020.
-
Measurement of charged pion double spin asymmetries at midrapidity in longitudinally polarized $p$$+$$p$ collisions at $\sqrt{s}=510$ GeV
Authors:
U. A. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
M. Alfred,
N. Apadula,
Y. Aramaki,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov
, et al. (335 additional authors not shown)
Abstract:
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the longitudinal double spin asymmetries, $A_{LL}$, for charged pions at midrapidity ($|η|<0.35$) in longitudinally polarized $p$$+$$p$ collisions at $\sqrt{s}=510$ GeV. These measurements are sensitive to the gluon spin contribution to the total spin of the proton in the parton momentum fraction $x$ range between 0.04 and 0…
▽ More
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the longitudinal double spin asymmetries, $A_{LL}$, for charged pions at midrapidity ($|η|<0.35$) in longitudinally polarized $p$$+$$p$ collisions at $\sqrt{s}=510$ GeV. These measurements are sensitive to the gluon spin contribution to the total spin of the proton in the parton momentum fraction $x$ range between 0.04 and 0.09. One can infer the sign of the gluon polarization from the ordering of pion asymmetries with charge alone. The asymmetries are found to be consistent with global quantum-chromodynamics fits of deep-inelastic scattering and data at $\sqrt{s}=200$ GeV, which show a nonzero positive contribution of gluon spin to the proton spin.
△ Less
Submitted 31 July, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Post-Limber Weak Lensing Bispectrum, Reduced Shear Correction, and Magnification Bias Correction
Authors:
Anurag C. Deshpande,
Thomas D. Kitching
Abstract:
The significant increase in precision that will be achieved by Stage IV cosmic shear surveys means that several currently used theoretical approximations may cease to be valid. An additional layer of complexity arises from the fact that many of these approximations are interdependent; the procedure to correct for one involves making another. Two such approximations that must be relaxed for upcomin…
▽ More
The significant increase in precision that will be achieved by Stage IV cosmic shear surveys means that several currently used theoretical approximations may cease to be valid. An additional layer of complexity arises from the fact that many of these approximations are interdependent; the procedure to correct for one involves making another. Two such approximations that must be relaxed for upcoming experiments are the reduced shear approximation and the effect of neglecting magnification bias. Accomplishing this involves the calculation of the convergence bispectrum; typically subject to the Limber approximation. In this work, we compute the post-Limber convergence bispectrum, and the post-Limber reduced shear and magnification bias corrections to the angular power spectrum for a Euclid-like survey. We find that the Limber approximation significantly overestimates the bispectrum when any side of the bispectrum triangle, $\ell_i<60$. However, the resulting changes in the reduced shear and magnification bias corrections are well below the sample variance for $\ell\leq5000$. We also compute a worst-case scenario for the additional biases on $w_0w_a$CDM cosmological parameters that result from the difference between the post-Limber and Limber approximated forms of the corrections. These further demonstrate that the reduced shear and magnification bias corrections can safely be treated under the Limber approximation for upcoming surveys.
△ Less
Submitted 27 May, 2020; v1 submitted 3 April, 2020;
originally announced April 2020.
-
Can we have it all? On the Trade-off between Spatial and Adversarial Robustness of Neural Networks
Authors:
Sandesh Kamath,
Amit Deshpande,
K V Subrahmanyam,
Vineeth N Balasubramanian
Abstract:
(Non-)robustness of neural networks to small, adversarial pixel-wise perturbations, and as more recently shown, to even random spatial transformations (e.g., translations, rotations) entreats both theoretical and empirical understanding. Spatial robustness to random translations and rotations is commonly attained via equivariant models (e.g., StdCNNs, GCNNs) and training augmentation, whereas adve…
▽ More
(Non-)robustness of neural networks to small, adversarial pixel-wise perturbations, and as more recently shown, to even random spatial transformations (e.g., translations, rotations) entreats both theoretical and empirical understanding. Spatial robustness to random translations and rotations is commonly attained via equivariant models (e.g., StdCNNs, GCNNs) and training augmentation, whereas adversarial robustness is typically achieved by adversarial training. In this paper, we prove a quantitative trade-off between spatial and adversarial robustness in a simple statistical setting. We complement this empirically by showing that: (a) as the spatial robustness of equivariant models improves by training augmentation with progressively larger transformations, their adversarial robustness worsens progressively, and (b) as the state-of-the-art robust models are adversarially trained with progressively larger pixel-wise perturbations, their spatial robustness drops progressively. Towards achieving pareto-optimality in this trade-off, we propose a method based on curriculum learning that trains gradually on more difficult perturbations (both spatial and adversarial) to improve spatial and adversarial robustness simultaneously.
△ Less
Submitted 10 November, 2021; v1 submitted 26 February, 2020;
originally announced February 2020.
-
Automatic Segmentation, Feature Extraction and Comparison of Healthy and Stroke Cerebral Vasculature
Authors:
Aditi Deshpande,
Nima Jamilpour,
Bin Jiang,
Chelsea Kidwell,
Max Wintermark,
Kaveh Laksari
Abstract:
Accurate segmentation of cerebral vasculature and a quantitative assessment of cerebrovascular morphology is critical to various diagnostic and therapeutic purposes and is pertinent to studying brain health and disease. However, this is still a challenging task due to the complexity of the vascular imaging data. We propose an automated method for cerebral vascular segmentation without the need of…
▽ More
Accurate segmentation of cerebral vasculature and a quantitative assessment of cerebrovascular morphology is critical to various diagnostic and therapeutic purposes and is pertinent to studying brain health and disease. However, this is still a challenging task due to the complexity of the vascular imaging data. We propose an automated method for cerebral vascular segmentation without the need of any manual intervention as well as a method to skeletonize the binary volume to extract vascular geometric features which can characterize vessel structure. We combine a probabilistic vessel-enhancing filtering with an active-contour technique to segment magnetic resonance and computed tomography angiograms (MRA and CTA) and subsequently extract the vessel centerlines and diameters to calculate the geometrical properties of the vasculature. Our method was validated using a 3D phantom of the Circle-of-Willis region with 84% mean Dice Similarity and 85% mean Pearson Correlation with minimal modified Hausdorff distance error. We applied this method to a dataset of healthy subjects and stroke patients and present a quantitative comparison between them. We found significant differences in the geometric features including total length (2.88 +/- 0.38 m for healthy and 2.20 +/- 0.67 m for stroke), volume (40.18 +/- 25.55 ml for healthy and 34.43 +/- 21.83 ml for stroke), tortuosity (3.24 +/- 0.88 rad/cm for healthy and 5.80 +/- 0.92 rad/cm for stroke) and fractality (box dimension 1.36 +/- 0.28 for healthy vs. 1.69 +/- 0.20 for stroke). This technique can be applied on any imaging modality and can be used in the future to automatically obtain the 3D segmented vasculature for diagnosis and treatment planning of Stroke and other cerebrovascular diseases (CVD) in the clinic and also to study the morphological changes caused by various CVD.
△ Less
Submitted 25 February, 2020;
originally announced February 2020.
-
Hierarchy of linear light cones with long-range interactions
Authors:
Minh C. Tran,
Chi-Fang Chen,
Adam Ehrenberg,
Andrew Y. Guo,
Abhinav Deshpande,
Yifan Hong,
Zhe-Xuan Gong,
Alexey V. Gorshkov,
Andrew Lucas
Abstract:
In quantum many-body systems with local interactions, quantum information and entanglement cannot spread outside of a linear light cone, which expands at an emergent velocity analogous to the speed of light. Local operations at sufficiently separated spacetime points approximately commute -- given a many-body state,…
▽ More
In quantum many-body systems with local interactions, quantum information and entanglement cannot spread outside of a linear light cone, which expands at an emergent velocity analogous to the speed of light. Local operations at sufficiently separated spacetime points approximately commute -- given a many-body state, $\mathcal{O}_x(t) \mathcal{O}_y |ψ\rangle \approx \mathcal{O}_y\mathcal{O}_x(t) |ψ\rangle$ with arbitrarily small errors -- so long as $|x-y|\gtrsim vt$, where $v$ is finite. Yet most non-relativistic physical systems realized in nature have long-range interactions: two degrees of freedom separated by a distance $r$ interact with potential energy $V(r) \propto 1/r^α$. In systems with long-range interactions, we rigorously establish a hierarchy of linear light cones: at the same $α$, some quantum information processing tasks are constrained by a linear light cone while others are not. In one spatial dimension, this linear light cone exists for every many-body state when $α>3$ (Lieb-Robinson light cone); for a typical state chosen uniformly at random from the Hilbert space when $α>\frac{5}{2}$ (Frobenius light cone); for every state of a non-interacting system when $α>2$ (free light cone). These bounds apply to time-dependent systems and are optimal up to subalgebraic improvements. Our theorems regarding the Lieb-Robinson and free light cones -- and their tightness -- also generalize to arbitrary dimensions. We discuss the implications of our bounds on the growth of connected correlators and of topological order, the clustering of correlations in gapped systems, and the digital simulation of systems with long-range interactions. In addition, we show that universal quantum state transfer, as well as many-body quantum chaos, are bounded by the Frobenius light cone, and therefore are poorly constrained by all Lieb-Robinson bounds.
△ Less
Submitted 18 July, 2022; v1 submitted 30 January, 2020;
originally announced January 2020.
-
Machine learning based co-creative design framework
Authors:
Brian Quanz,
Wei Sun,
Ajay Deshpande,
Dhruv Shah,
Jae-eun Park
Abstract:
We propose a flexible, co-creative framework bringing together multiple machine learning techniques to assist human users to efficiently produce effective creative designs. We demonstrate its potential with a perfume bottle design case study, including human evaluation and quantitative and qualitative analyses.
We propose a flexible, co-creative framework bringing together multiple machine learning techniques to assist human users to efficiently produce effective creative designs. We demonstrate its potential with a perfume bottle design case study, including human evaluation and quantitative and qualitative analyses.
△ Less
Submitted 23 January, 2020;
originally announced January 2020.
-
$J/ψ$ and $ψ(2S)$ production at forward rapidity in $p$+$p$ collisions at $\sqrt{s}=510$ GeV
Authors:
U. A. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
M. Alfred,
N. Apadula,
Y. Aramaki,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov
, et al. (335 additional authors not shown)
Abstract:
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the differential cross section, mean transverse momentum, mean transverse momentum squared of inclusive $J/ψ$ and cross-section ratio of $ψ(2S)$ to $J/ψ$ at forward rapidity in \pp collisions at \sqrts = 510 GeV via the dimuon decay channel. Comparison is made to inclusive $J/ψ$ cross sections measured at \sqrts = 200 GeV an…
▽ More
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured the differential cross section, mean transverse momentum, mean transverse momentum squared of inclusive $J/ψ$ and cross-section ratio of $ψ(2S)$ to $J/ψ$ at forward rapidity in \pp collisions at \sqrts = 510 GeV via the dimuon decay channel. Comparison is made to inclusive $J/ψ$ cross sections measured at \sqrts = 200 GeV and 2.76--13 TeV. The result is also compared to leading-order nonrelativistic QCD calculations coupled to a color-glass-condensate description of the low-$x$ gluons in the proton at low transverse momentum ($p_T$) and to next-to-leading order nonrelativistic QCD calculations for the rest of the $p_T$ range. These calculations overestimate the data at low $p_T$. While consistent with the data within uncertainties above $\approx3$ GeV/$c$, the calculations are systematically below the data. The total cross section times the branching ratio is BR $dσ^{J/ψ}_{pp}/dy (1.2<|y|<2.2, 0<p_T<10~\mbox{GeV/$c$}) =$ 54.3 $\pm$ 0.5 (stat) $\pm$ 5.5 (syst) nb.
△ Less
Submitted 19 February, 2020; v1 submitted 31 December, 2019;
originally announced December 2019.
-
Euclid: The reduced shear approximation and magnification bias for Stage IV cosmic shear experiments
Authors:
A. C. Deshpande,
T. D. Kitching,
V. F. Cardone,
P. L. Taylor,
S. Casas,
S. Camera,
C. Carbone,
M. Kilbinger,
V. Pettorino,
Z. Sakr,
D. Sapone,
I. Tutusaus,
N. Auricchio,
C. Bodendorf,
D. Bonino,
M. Brescia,
V. Capobianco,
J. Carretero,
M. Castellano,
S. Cavuoti,
R. Cledassou,
G. Congedo,
L. Conversi,
L. Corcione,
M. Cropper
, et al. (47 additional authors not shown)
Abstract:
Stage IV weak lensing experiments will offer more than an order of magnitude leap in precision. We must therefore ensure that our analyses remain accurate in this new era. Accordingly, previously ignored systematic effects must be addressed. In this work, we evaluate the impact of the reduced shear approximation and magnification bias, on the information obtained from the angular power spectrum. T…
▽ More
Stage IV weak lensing experiments will offer more than an order of magnitude leap in precision. We must therefore ensure that our analyses remain accurate in this new era. Accordingly, previously ignored systematic effects must be addressed. In this work, we evaluate the impact of the reduced shear approximation and magnification bias, on the information obtained from the angular power spectrum. To first-order, the statistics of reduced shear, a combination of shear and convergence, are taken to be equal to those of shear. However, this approximation can induce a bias in the cosmological parameters that can no longer be neglected. A separate bias arises from the statistics of shear being altered by the preferential selection of galaxies and the dilution of their surface densities, in high-magnification regions. The corrections for these systematic effects take similar forms, allowing them to be treated together. We calculated the impact of neglecting these effects on the cosmological parameters that would be determined from Euclid, using cosmic shear tomography. To do so, we employed the Fisher matrix formalism, and included the impact of the super-sample covariance. We also demonstrate how the reduced shear correction can be calculated using a lognormal field forward modelling approach. These effects cause significant biases in Omega_m, sigma_8, n_s, Omega_DE, w_0, and w_a of -0.53 sigma, 0.43 sigma, -0.34 sigma, 1.36 sigma, -0.68 sigma, and 1.21 sigma, respectively. We then show that these lensing biases interact with another systematic: the intrinsic alignment of galaxies. Accordingly, we develop the formalism for an intrinsic alignment-enhanced lensing bias correction. Applying this to Euclid, we find that the additional terms introduced by this correction are sub-dominant.
△ Less
Submitted 1 April, 2020; v1 submitted 16 December, 2019;
originally announced December 2019.
-
Measurement of $J/ψ$ at forward and backward rapidity in $p$+$p$, $p$$+A$l, $p$$+A$u, and $^3$He+Au collisions at $\sqrt{s_{_{NN}}}=200~{\rm GeV}$
Authors:
U. A. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
D. S. Blau,
M. Boer,
J. S. Bok
, et al. (337 additional authors not shown)
Abstract:
Charmonium is a valuable probe in heavy-ion collisions to study the properties of the quark gluon plasma, and is also an interesting probe in small collision systems to study cold nuclear matter effects, which are also present in large collision systems. With the recent observations of collective behavior of produced particles in small system collisions, measurements of the modification of charmon…
▽ More
Charmonium is a valuable probe in heavy-ion collisions to study the properties of the quark gluon plasma, and is also an interesting probe in small collision systems to study cold nuclear matter effects, which are also present in large collision systems. With the recent observations of collective behavior of produced particles in small system collisions, measurements of the modification of charmonium in small systems have become increasingly relevant. We present the results of $J/ψ$ measurements at forward and backward rapidity in various small collision systems, $p$$+$$p$, $p$$+$Al, $p$$+$Au and $^3$He$+$Au, at $\sqrt{s_{_{NN}}}$=200 GeV. The results are presented in the form of the observable $R_{AB}$, the nuclear modification factor, a measure of the ratio of the $J/ψ$ invariant yield compared to the scaled yield in $p$$+$$p$ collisions. We examine the rapidity, transverse momentum, and collision centrality dependence of nuclear effects on $J/ψ$ production with different projectile sizes $p$ and $^3$He, and different target sizes Al and Au. The modification is found to be strongly dependent on the target size, but to be very similar for $p$$+$Au and $^{3}$He$+$Au. However, for 0%--20% central collisions at backward rapidity, the modification for $^{3}$He$+$Au is found to be smaller than that for $p$$+$Au, with a mean fit to the ratio of $0.89\pm0.03$(stat)${\pm}0.08$(syst), possibly indicating final state effects due to the larger projectile size.
△ Less
Submitted 12 July, 2020; v1 submitted 31 October, 2019;
originally announced October 2019.