-
Disentangling the dominant drivers of gravity wave variability in the Martian thermosphere
Authors:
N. V. Rao,
V. Leelavathi,
Ch. Yaswanth,
S. V. B. Rao
Abstract:
In this study, we extracted the amplitudes of the gravity waves (GWs)from the neutral densities measured in situ by the neutral gas and ion mass spectrometer aboard the Mars atmosphere and volatile evolution mission. The spatial and temporal variabilities of the GWs show that solar activity (the F10.7 cm solar flux corrected for a heliocentric distance of 1.66 AU), solar insolation, and the lower…
▽ More
In this study, we extracted the amplitudes of the gravity waves (GWs)from the neutral densities measured in situ by the neutral gas and ion mass spectrometer aboard the Mars atmosphere and volatile evolution mission. The spatial and temporal variabilities of the GWs show that solar activity (the F10.7 cm solar flux corrected for a heliocentric distance of 1.66 AU), solar insolation, and the lower atmospheric dust are the dominant drivers of the GW variability in the thermosphere. We developed a methodology in which a linear regression analysis has been used to disentangle the complex variabilities of the GWs. The three dominant drivers could account for most of the variability in the GW amplitudes. Variability caused by the sources of GWs and the effects of winds and the global circulation in the mesosphere and lower thermosphere are the other factors that could not be addressed. The results of the present study show that for every 100 sfu increase in the solar activity, the GW amplitudes in the thermosphere decrease by ~9%. Solar insolation drives the diurnal, seasonal and latitudinal variations of ~9%, ~4% and ~6%, respectively. Using the historical data of the dust opacity and solar activity, we estimated the GW amplitudes of the Martian thermosphere from MY 24 to MY 35. The GW amplitudes were significantly reduced during the maximum of solar cycle 23 and were highest in the solar minimum. The global dust storms of MY 25, 28, and 34 lead to significant enhancements in the GW amplitudes.
△ Less
Submitted 15 August, 2022;
originally announced August 2022.
-
Fractional Brownian Motion: Local Modulus of Continuity with Refined Almost Sure Upper Bound and First Exit Time from One-sided Barrier
Authors:
Qidi Peng,
Nan Rao
Abstract:
Based on an optimal rate wavelet series representation, we derive a local modulus of continuity result with a refined almost sure upper bound for fractional Brownian motion. \sloppy The obtained upper bound of the small fractional Brownian increments is of order $\mathcal O_{a.s.}\big(|h|^H\sqrt{\log\log |h|^{-1}}\big)$ as $|h|\to0$, and an upper bound of its $p$th moment is provided, for any…
▽ More
Based on an optimal rate wavelet series representation, we derive a local modulus of continuity result with a refined almost sure upper bound for fractional Brownian motion. \sloppy The obtained upper bound of the small fractional Brownian increments is of order $\mathcal O_{a.s.}\big(|h|^H\sqrt{\log\log |h|^{-1}}\big)$ as $|h|\to0$, and an upper bound of its $p$th moment is provided, for any $p>0$. This result fills the gap of the law of iterated logarithm for fractional Brownian motion, where the moments' information of the random multiplier in the upper bound is missing. With this enhanced upper bound and some new results on the distribution of the maximum of fractional Brownian motion, we obtain a new and refined asymptotic estimate of the upper-tail probability for a fractional Brownian motion to first exit from a positive-valued barrier over time $T$, as $T\to+\infty$.
△ Less
Submitted 18 October, 2023; v1 submitted 20 July, 2022;
originally announced July 2022.
-
An Input-Output Feedback Linearization based Exponentially Stable Controller for Multi-UAV Payload Transport
Authors:
Nishanth Rao,
Suresh Sundaram
Abstract:
In this paper, an exponentially stable trajectory tracking controller is proposed for multi-UAV payload transport. The multi-UAV payload system has a 2-DOF magnetic spherical joint between the UAVs and the vertical rigid links of the payload frame, so the UAVs can roll or pitch freely. These vertical links are rigidly attached to the payload and cannot move. An input-output feedback linearized mod…
▽ More
In this paper, an exponentially stable trajectory tracking controller is proposed for multi-UAV payload transport. The multi-UAV payload system has a 2-DOF magnetic spherical joint between the UAVs and the vertical rigid links of the payload frame, so the UAVs can roll or pitch freely. These vertical links are rigidly attached to the payload and cannot move. An input-output feedback linearized model is derived for the complete payload-UAV system along with thrust vectoring control for trajectory tracking of the payload. The theoretical analysis on tracking control laws shows that control law is exponentially stable, thus guaranteeing safe transportation along the desired trajectory. To validate the performance of the proposed control law, the results for a numerical simulation as well as a high-fidelity Gazebo real-time simulation are presented. Next, the robustness of the proposed controller is analyzed against two practical situations: External disturbance on the payload and payload mass uncertainty. The results clearly indicate that the proposed controller is robust and computationally efficient while achieving exponentially stable trajectory tracking.
△ Less
Submitted 10 July, 2022;
originally announced July 2022.
-
Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets
Authors:
Dimitrios C. Gklezakos,
Rishi Jha,
Rajesh P. N. Rao
Abstract:
Inspired by Gibson's notion of object affordances in human vision, we ask the question: how can an agent learn to predict an entire action policy for a novel object or environment given only a single glimpse? To tackle this problem, we introduce the concept of Universal Policy Functions (UPFs) which are state-to-action map**s that generalize not only to new goals but most importantly to novel, u…
▽ More
Inspired by Gibson's notion of object affordances in human vision, we ask the question: how can an agent learn to predict an entire action policy for a novel object or environment given only a single glimpse? To tackle this problem, we introduce the concept of Universal Policy Functions (UPFs) which are state-to-action map**s that generalize not only to new goals but most importantly to novel, unseen environments. Specifically, we consider the problem of efficiently learning such policies for agents with limited computational and communication capacity, constraints that are frequently encountered in edge devices. We propose the Hyper-Universal Policy Approximator (HUPA), a hypernetwork-based model to generate small task- and environment-conditional policy networks from a single image, with good generalization properties. Our results show that HUPAs significantly outperform an embedding-based alternative for generated policies that are size-constrained. Although this work is restricted to a simple map-based navigation task, future work includes applying the principles behind HUPAs to learning more general affordances for objects and environments.
△ Less
Submitted 7 July, 2022;
originally announced July 2022.
-
Text Enriched Sparse Hyperbolic Graph Convolutional Networks
Authors:
Nurendra Choudhary,
Nikhil Rao,
Karthik Subbian,
Chandan K. Reddy
Abstract:
Heterogeneous networks, which connect informative nodes containing text with different edge types, are routinely used to store and process information in various real-world applications. Graph Neural Networks (GNNs) and their hyperbolic variants provide a promising approach to encode such networks in a low-dimensional latent space through neighborhood aggregation and hierarchical feature extractio…
▽ More
Heterogeneous networks, which connect informative nodes containing text with different edge types, are routinely used to store and process information in various real-world applications. Graph Neural Networks (GNNs) and their hyperbolic variants provide a promising approach to encode such networks in a low-dimensional latent space through neighborhood aggregation and hierarchical feature extraction, respectively. However, these approaches typically ignore metapath structures and the available semantic information. Furthermore, these approaches are sensitive to the noise present in the training data. To tackle these limitations, in this paper, we propose Text Enriched Sparse Hyperbolic Graph Convolution Network (TESH-GCN) to capture the graph's metapath structures using semantic signals and further improve prediction in large heterogeneous graphs. In TESH-GCN, we extract semantic node information, which successively acts as a connection signal to extract relevant nodes' local neighborhood and graph-level metapath features from the sparse adjacency tensor in a reformulated hyperbolic graph convolution layer. These extracted features in conjunction with semantic features from the language model (for robustness) are used for the final downstream task. Experiments on various heterogeneous graph datasets show that our model outperforms the current state-of-the-art approaches by a large margin on the task of link prediction. We also report a reduction in both the training time and model parameters compared to the existing hyperbolic approaches through a reformulated hyperbolic graph convolution. Furthermore, we illustrate the robustness of our model by experimenting with different levels of simulated noise in both the graph structure and text, and also, present a mechanism to explain TESH-GCN's prediction by analyzing the extracted metapaths.
△ Less
Submitted 7 July, 2022; v1 submitted 5 July, 2022;
originally announced July 2022.
-
Tangle of Spin Double Helices in the Honeycomb Kitaev-$Γ$ Model
Authors:
Jheng-Wei Li,
Nihal Rao,
Jan von Delft,
Lode Pollet,
Ke Liu
Abstract:
We investigate the ground-state nature of the honeycomb Kitaev-$Γ$ model in the material-relevant parameter regime through a combination of classical and quantum simulations. The classical model is imprinted with a tangle of highly structured spin double helices. This helix tangle exhibits $18$ inequivalent helical axes and features a spontaneous periodicity anisotropy and a ${\rm sgn}(Γ)$-determi…
▽ More
We investigate the ground-state nature of the honeycomb Kitaev-$Γ$ model in the material-relevant parameter regime through a combination of classical and quantum simulations. The classical model is imprinted with a tangle of highly structured spin double helices. This helix tangle exhibits $18$ inequivalent helical axes and features a spontaneous periodicity anisotropy and a ${\rm sgn}(Γ)$-determined chirality pattern. Infinite PEPS simulations with clusters up to $36$ sites identify hallmarks of this many-body order in the quantum spin-$1/2$ model. Our findings provide a fresh perspective of the Kitaev-$Γ$ model and enrich the physics of Kitaev magnetism.
△ Less
Submitted 24 May, 2023; v1 submitted 17 June, 2022;
originally announced June 2022.
-
Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies
Authors:
Ares Fisher,
Rajesh P. N. Rao
Abstract:
Human vision involves parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using capsule networks, reference frames and active predictive coding, but a generative model formulation has been lacking. We introduce Recursive Neural Programs (RNPs),…
▽ More
Human vision involves parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using capsule networks, reference frames and active predictive coding, but a generative model formulation has been lacking. We introduce Recursive Neural Programs (RNPs), which, to our knowledge, is the first neural generative model to address the part-whole hierarchy learning problem. RNPs model images as hierarchical trees of probabilistic sensory-motor programs that recursively reuse learned sensory-motor primitives to model an image within different reference frames, forming recursive image grammars. We express RNPs as structured variational autoencoders (sVAEs) for inference and sampling, and demonstrate parts-based parsing, sampling and one-shot transfer learning for MNIST, Omniglot and Fashion-MNIST datasets, demonstrating the model's expressive power. Our results show that RNPs provide an intuitive and explainable way of composing objects and scenes, allowing rich compositionality and intuitive interpretations of objects in terms of part-whole hierarchies.
△ Less
Submitted 25 June, 2022; v1 submitted 16 June, 2022;
originally announced June 2022.
-
Shop** Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search
Authors:
Chandan K. Reddy,
Lluís Màrquez,
Fran Valero,
Nikhil Rao,
Hugo Zaragoza,
Sambaran Bandyopadhyay,
Arnab Biswas,
Anlu Xing,
Karthik Subbian
Abstract:
Improving the quality of search results can significantly enhance users experience and engagement with search engines. In spite of several recent advancements in the fields of machine learning and data mining, correctly classifying items for a particular user search query has been a long-standing challenge, which still has a large room for improvement. This paper introduces the "Shop** Queries D…
▽ More
Improving the quality of search results can significantly enhance users experience and engagement with search engines. In spite of several recent advancements in the fields of machine learning and data mining, correctly classifying items for a particular user search query has been a long-standing challenge, which still has a large room for improvement. This paper introduces the "Shop** Queries Dataset", a large dataset of difficult Amazon search queries and results, publicly released with the aim of fostering research in improving the quality of search results. The dataset contains around 130 thousand unique queries and 2.6 million manually labeled (query,product) relevance judgements. The dataset is multilingual with queries in English, Japanese, and Spanish. The Shop** Queries Dataset is being used in one of the KDDCup'22 challenges. In this paper, we describe the dataset and present three evaluation tasks along with baseline results: (i) ranking the results list, (ii) classifying product results into relevance categories, and (iii) identifying substitute products for a given query. We anticipate that this data will become the gold standard for future research in the topic of product search.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.
-
Analysis of Learner Independent Variables for Estimating Assessment Items Difficulty Level
Authors:
Shilpi Banerjee,
N. J. Rao
Abstract:
The quality of assessment determines the quality of learning, and is characterized by validity, reliability and difficulty. Mastery of learning is generally represented by the difficulty levels of assessment items. A very large number of variables are identified in the literature to measure the difficulty level. These variables, which are not completely independent of one another, are categorized…
▽ More
The quality of assessment determines the quality of learning, and is characterized by validity, reliability and difficulty. Mastery of learning is generally represented by the difficulty levels of assessment items. A very large number of variables are identified in the literature to measure the difficulty level. These variables, which are not completely independent of one another, are categorized into learner dependent, learner independent, generic, non-generic and score based. This research proposes a model for predicting the difficulty level of assessment items in engineering courses using learner independent and generic variables. An ordinal regression model is developed for predicting the difficulty level, and uses six variables including three stimuli variables (item presentation, usage of technical notations and number of resources), two content related variables (number of concepts and procedures) and one task variable (number of conditions). Experimental results from three engineering courses provide around 80% accuracy in classification of items using the proposed model.
△ Less
Submitted 9 June, 2022;
originally announced June 2022.
-
Learning Backward Compatible Embeddings
Authors:
Weihua Hu,
Rajas Bansal,
Kaidi Cao,
Nikhil Rao,
Karthik Subbian,
Jure Leskovec
Abstract:
Embeddings, low-dimensional vector representation of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks (e.g., product recommendation). The produced embeddings are then widely consumed by consumer teams to solve their unintended tasks (e.g., fraud detection). However…
▽ More
Embeddings, low-dimensional vector representation of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks (e.g., product recommendation). The produced embeddings are then widely consumed by consumer teams to solve their unintended tasks (e.g., fraud detection). However, as the embedding model gets updated and retrained to improve performance on the intended task, the newly-generated embeddings are no longer compatible with the existing consumer models. This means that historical versions of the embeddings can never be retired or all consumer teams have to retrain their models to make them compatible with the latest version of the embeddings, both of which are extremely costly in practice. Here we study the problem of embedding version updates and their backward compatibility. We formalize the problem where the goal is for the embedding team to keep updating the embedding version, while the consumer teams do not have to retrain their models. We develop a solution based on learning backward compatible embeddings, which allows the embedding model version to be updated frequently, while also allowing the latest version of the embedding to be quickly transformed into any backward compatible historical version of it, so that consumer teams do not have to retrain their models. Under our framework, we explore six methods and systematically evaluate them on a real-world recommender system application. We show that the best method, which we call BC-Aligner, maintains backward compatibility with existing unintended tasks even after multiple model version updates. Simultaneously, BC-Aligner achieves the intended task performance similar to the embedding model that is solely optimized for the intended task.
△ Less
Submitted 7 June, 2022;
originally announced June 2022.
-
An efficient Deep Spatio-Temporal Context Aware decision Network (DST-CAN) for Predictive Manoeuvre Planning
Authors:
Jayabrata Chowdhury,
Suresh Sundaram,
Nishant Rao,
Narasimhan Sundararajan
Abstract:
To ensure the safety and efficiency of its maneuvers, an Autonomous Vehicle (AV) should anticipate the future intentions of surrounding vehicles using its sensor information. If an AV can predict its surrounding vehicles' future trajectories, it can make safe and efficient manoeuvre decisions. In this paper, we present such a Deep Spatio-Temporal Context-Aware decision Network (DST-CAN) model for…
▽ More
To ensure the safety and efficiency of its maneuvers, an Autonomous Vehicle (AV) should anticipate the future intentions of surrounding vehicles using its sensor information. If an AV can predict its surrounding vehicles' future trajectories, it can make safe and efficient manoeuvre decisions. In this paper, we present such a Deep Spatio-Temporal Context-Aware decision Network (DST-CAN) model for predictive manoeuvre planning of AVs. A memory neuron network is used to predict future trajectories of its surrounding vehicles. The driving environment's spatio-temporal information (past, present, and predicted future trajectories) are embedded into a context-aware grid. The proposed DST-CAN model employs these context-aware grids as inputs to a convolutional neural network to understand the spatial relationships between the vehicles and determine a safe and efficient manoeuvre decision. The DST-CAN model also uses information of human driving behavior on a highway. Performance evaluation of DST-CAN has been carried out using two publicly available NGSIM US-101 and I-80 datasets. Also, rule-based ground truth decisions have been compared with those generated by DST-CAN. The results clearly show that DST-CAN can make much better decisions with 3-sec of predicted trajectories of neighboring vehicles compared to currently existing methods that do not use this prediction.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
Globular Clusters UVIT Legacy Survey (GlobULeS) I. FUV-optical Color-Magnitude Diagrams for Eight Globular Clusters
Authors:
Snehalata Sahu,
Annapurni Subramaniam,
Gaurav Singh,
Ramakant Yadav,
Aldo R. Valcarce,
Samyaday Choudhury,
Sharmila Rani,
Deepthi S. Prabhu,
Chul Chung,
Patrick Côté,
Nathan Leigh,
Aaron M. Geller,
Sourav Chatterjee,
N. Kameswara Rao,
Avrajit Bandyopadhyay,
Michael Shara,
Emanuele Dalessandro,
Gajendra Pandey,
Joesph E. Postma,
John Hutchings,
Mirko Simunovic,
Peter B. Stetson,
Sivarani Thirupathi,
Thomas Puzia,
Young-Jong Sohn
Abstract:
We present the first results of eight Globular Clusters (GCs) from the AstroSat/UVIT Legacy Survey program GlobULeS based on the observations carried out in two FUV filters (F148W and F169M). The FUV-optical and FUV-FUV color-magnitude diagrams (CMDs) of GCs with the proper motion membership were constructed by combining the UVIT data with HST UV Globular Cluster Survey (HUGS) data for inner regio…
▽ More
We present the first results of eight Globular Clusters (GCs) from the AstroSat/UVIT Legacy Survey program GlobULeS based on the observations carried out in two FUV filters (F148W and F169M). The FUV-optical and FUV-FUV color-magnitude diagrams (CMDs) of GCs with the proper motion membership were constructed by combining the UVIT data with HST UV Globular Cluster Survey (HUGS) data for inner regions and Gaia Early Data Release (EDR3) for regions outside the HST's field. We detect sources as faint as F148W $\sim$ 23.5~mag which are classified based on their locations in CMDs by overlaying stellar evolutionary models. The CMDs of 8 GCs are combined with the previous UVIT studies of 3 GCs to create stacked FUV-optical CMDs to highlight the features/peculiarities found in the different evolutionary sequences. The FUV (F148W) detected stellar populations of 11 GCs comprises 2,816 Horizontal Branch (HB) stars (190 Extreme HB candidates), 46 post-HB (pHB), 221 Blue Straggler Stars (BSS), and 107 White Dwarf (WD) candidates. We note that the blue HB color extension obtained from F148W$-$G color and the number of FUV detected EHB candidates are strongly correlated with the maximum internal Helium (He) variation within each GC, suggesting that the FUV-optical plane is the most sensitive to He abundance variations in the HB. We discuss the potential science cases that will be addressed using these catalogues including HB morphologies, BSSs, pHB, and, WD stars.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
Optimal resource allocation for flexible-grid entanglement distribution networks
Authors:
J. Alnas,
M. Alshowkan,
N. S. V. Rao,
N. A. Peters,
J. M. Lukens
Abstract:
We use a genetic algorithm (GA) as a design aid for determining the optimal provisioning of entangled photon spectrum in flex-grid quantum networks with arbitrary numbers of channels and users. After introducing a general model for entanglement distribution based on frequency-polarization hyperentangled biphotons, we derive upper bounds on fidelity and entangled bit rate for networks comprising on…
▽ More
We use a genetic algorithm (GA) as a design aid for determining the optimal provisioning of entangled photon spectrum in flex-grid quantum networks with arbitrary numbers of channels and users. After introducing a general model for entanglement distribution based on frequency-polarization hyperentangled biphotons, we derive upper bounds on fidelity and entangled bit rate for networks comprising one-to-one user connections. Simple conditions based on user detector quality and link efficiencies are found that determine whether entanglement is possible. We successfully apply a GA to find optimal resource allocations in four different representative network scenarios and validate features of our model experimentally in a quantum local area network in deployed fiber. Our results show promise for the rapid design of large-scale entanglement distribution networks.
△ Less
Submitted 13 April, 2022;
originally announced April 2022.
-
Comments on Comments: Where Code Review and Documentation Meet
Authors:
Nikitha Rao,
Jason Tsay,
Martin Hirzel,
Vincent J. Hellendoorn
Abstract:
A central function of code review is to increase understanding; hel** reviewers understand a code change aids in knowledge transfer and finding bugs. Comments in code largely serve a similar purpose, hel** future readers understand the program. It is thus natural to study what happens when these two forms of understanding collide. We ask: what documentation-related comments do reviewers make a…
▽ More
A central function of code review is to increase understanding; hel** reviewers understand a code change aids in knowledge transfer and finding bugs. Comments in code largely serve a similar purpose, hel** future readers understand the program. It is thus natural to study what happens when these two forms of understanding collide. We ask: what documentation-related comments do reviewers make and how do they affect understanding of the contribution? We analyze ca.700K review comments on 2,000 (Java and Python) GitHub projects, and propose several filters to identify which comments are likely to be either in response to a change in documentation and/or call for such a change. We identify 65K such cases. We next develop a taxonomy of the reviewer intents behind such "comments on comments". We find that achieving a shared understanding of the code is key: reviewer comments most often focused on clarification, followed by pointing out issues to fix, such as typos and outdated comments. Curiously, clarifying comments were frequently suggested (often verbatim) by the reviewer, indicating a desire to persist their understanding acquired during code review. We conclude with a discussion of implications of our comments-on-comments dataset for research on improving code review, including the potential benefits for automating code review.
△ Less
Submitted 31 March, 2022;
originally announced April 2022.
-
Quantum Networks for High Energy Physics
Authors:
Andrei Derevianko,
Eden Figueroa,
Julián MartÍnez-Rincón,
Inder Monga,
Andrei Nomerotski,
Cristián H. Peña,
Nicholas A. Peters,
Raphael Pooser,
Nageswara Rao,
Anze Slosar,
Panagiotis Spentzouris,
Maria Spiropulu,
Paul Stankus,
Wenji Wu,
Si Xie
Abstract:
Quantum networks of quantum objects promise to be exponentially more powerful than the objects considered independently. To live up to this promise will require the development of error mitigation and correction strategies to preserve quantum information as it is initialized, stored, transported, utilized, and measured. The quantum information could be encoded in discrete variables such as qubits,…
▽ More
Quantum networks of quantum objects promise to be exponentially more powerful than the objects considered independently. To live up to this promise will require the development of error mitigation and correction strategies to preserve quantum information as it is initialized, stored, transported, utilized, and measured. The quantum information could be encoded in discrete variables such as qubits, in continuous variables, or anything in-between. Quantum computational networks promise to enable simulation of physical phenomena of interest to the HEP community. Quantum sensor networks promise new measurement capability to test for new physics and improve upon existing measurements of fundamental constants. Such networks could exist at multiple scales from the nano-scale to a global-scale quantum network.
△ Less
Submitted 31 March, 2022;
originally announced March 2022.
-
Quantum counterfactuality with identical particles
Authors:
Vinod N. Rao,
Anindita Banerjee,
R. Srikanth
Abstract:
Quantum self-interference enables the counterfactual transmission of information, whereby the transmitted bits involve no particles traveling through the channel. In this work, we show how counterfactuality can be realized even when the self interference is replaced by interference between identical particles. Interestingly, the facet of indistinguishability called forth here is associated with fi…
▽ More
Quantum self-interference enables the counterfactual transmission of information, whereby the transmitted bits involve no particles traveling through the channel. In this work, we show how counterfactuality can be realized even when the self interference is replaced by interference between identical particles. Interestingly, the facet of indistinguishability called forth here is associated with first-order coherence, and is different from the usual notion of indistinguishability associated with the (anti-)commutation relations of mode operators. From an experimental perspective, the simplest implementation of the proposed idea can be realized by slight modifications to existing protocols for differential-phase-shift quantum key distribution or interaction-free measurement.
△ Less
Submitted 17 October, 2023; v1 submitted 21 March, 2022;
originally announced March 2022.
-
Task-Agnostic Graph Explanations
Authors:
Yaochen Xie,
Sumeet Katariya,
Xianfeng Tang,
Edward Huang,
Nikhil Rao,
Karthik Subbian,
Shuiwang Ji
Abstract:
Graph Neural Networks (GNNs) have emerged as powerful tools to encode graph-structured data. Due to their broad applications, there is an increasing need to develop tools to explain how GNNs make decisions given graph-structured data. Existing learning-based GNN explanation approaches are task-specific in training and hence suffer from crucial drawbacks. Specifically, they are incapable of produci…
▽ More
Graph Neural Networks (GNNs) have emerged as powerful tools to encode graph-structured data. Due to their broad applications, there is an increasing need to develop tools to explain how GNNs make decisions given graph-structured data. Existing learning-based GNN explanation approaches are task-specific in training and hence suffer from crucial drawbacks. Specifically, they are incapable of producing explanations for a multitask prediction model with a single explainer. They are also unable to provide explanations in cases where the GNN is trained in a self-supervised manner, and the resulting representations are used in future downstream tasks. To address these limitations, we propose a Task-Agnostic GNN Explainer (TAGE) that is independent of downstream models and trained under self-supervision with no knowledge of downstream tasks. TAGE enables the explanation of GNN embedding models with unseen downstream tasks and allows efficient explanation of multitask models. Our extensive experiments show that TAGE can significantly speed up the explanation efficiency by using the same model to explain predictions for multiple downstream tasks while achieving explanation quality as good as or even better than current state-of-the-art GNN explanation approaches. Our code is pubicly available as part of the DIG library at https://github.com/divelab/DIG/tree/main/dig/xgraph/TAGE/.
△ Less
Submitted 23 September, 2022; v1 submitted 16 February, 2022;
originally announced February 2022.
-
Integrated Decision Control Approach for Cooperative Safety-Critical Payload Transport in a Cluttered Environment
Authors:
Nishanth Rao,
Suresh Sundaram
Abstract:
In this paper, the problem of coordinated transportation of heavy payload by a team of UAVs in a cluttered environment is addressed. The payload is modeled as a rigid body and is assumed to track a pre-computed global flight trajectory from a start point to a goal point. Due to the presence of local dynamic obstacles in the environment, the UAVs must ensure that there is no collision between the p…
▽ More
In this paper, the problem of coordinated transportation of heavy payload by a team of UAVs in a cluttered environment is addressed. The payload is modeled as a rigid body and is assumed to track a pre-computed global flight trajectory from a start point to a goal point. Due to the presence of local dynamic obstacles in the environment, the UAVs must ensure that there is no collision between the payload and these obstacles while ensuring that the payload oscillations are kept minimum. An Integrated Decision Controller (IDC) is proposed, that integrates the optimal tracking control law given by a centralized Model Predictive Controller with safety-critical constraints provided by the Exponential Control Barrier Functions. The entire payload-UAV system is enclosed by a safe convex hull boundary, and the IDC ensures that no obstacle enters this boundary. To evaluate the performance of the IDC, the results for a numerical simulation as well as a high-fidelity Gazebo simulation are presented. An ablation study is conducted to analyze the robustness of the proposed IDC against practical dubieties like noisy state values, relative obstacle safety margin, and payload mass uncertainty. The results clearly show that the IDC achieves both trajectory tracking and obstacle avoidance successfully while restricting the payload oscillations within a safe limit.
△ Less
Submitted 31 January, 2022;
originally announced January 2022.
-
Active Predictive Coding Networks: A Neural Solution to the Problem of Learning Reference Frames and Part-Whole Hierarchies
Authors:
Dimitrios C. Gklezakos,
Rajesh P. N. Rao
Abstract:
We introduce Active Predictive Coding Networks (APCNs), a new class of neural networks that solve a major problem posed by Hinton and others in the fields of artificial intelligence and brain modeling: how can neural networks learn intrinsic reference frames for objects and parse visual scenes into part-whole hierarchies by dynamically allocating nodes in a parse tree? APCNs address this problem b…
▽ More
We introduce Active Predictive Coding Networks (APCNs), a new class of neural networks that solve a major problem posed by Hinton and others in the fields of artificial intelligence and brain modeling: how can neural networks learn intrinsic reference frames for objects and parse visual scenes into part-whole hierarchies by dynamically allocating nodes in a parse tree? APCNs address this problem by using a novel combination of ideas: (1) hypernetworks are used for dynamically generating recurrent neural networks that predict parts and their locations within intrinsic reference frames conditioned on higher object-level embedding vectors, and (2) reinforcement learning is used in conjunction with backpropagation for end-to-end learning of model parameters. The APCN architecture lends itself naturally to multi-level hierarchical learning and is closely related to predictive coding models of cortical function. Using the MNIST, Fashion-MNIST and Omniglot datasets, we demonstrate that APCNs can (a) learn to parse images into part-whole hierarchies, (b) learn compositional representations, and (c) transfer their knowledge to unseen classes of objects. With their ability to dynamically generate parse trees with part locations for objects, APCNs offer a new framework for explainable AI that leverages advances in deep learning while retaining interpretability and compositionality.
△ Less
Submitted 14 January, 2022;
originally announced January 2022.
-
Predictive Coding Theories of Cortical Function
Authors:
Linxing Preston Jiang,
Rajesh P. N. Rao
Abstract:
Predictive coding is a unifying framework for understanding perception, action and neocortical organization. In predictive coding, different areas of the neocortex implement a hierarchical generative model of the world that is learned from sensory inputs. Cortical circuits are hypothesized to perform Bayesian inference based on this generative model. Specifically, the Rao-Ballard hierarchical pred…
▽ More
Predictive coding is a unifying framework for understanding perception, action and neocortical organization. In predictive coding, different areas of the neocortex implement a hierarchical generative model of the world that is learned from sensory inputs. Cortical circuits are hypothesized to perform Bayesian inference based on this generative model. Specifically, the Rao-Ballard hierarchical predictive coding model assumes that the top-down feedback connections from higher to lower order cortical areas convey predictions of lower-level activities. The bottom-up, feedforward connections in turn convey the errors between top-down predictions and actual activities. These errors are used to correct current estimates of the state of the world and generate new predictions. Through the objective of minimizing prediction errors, predictive coding provides a functional explanation for a wide range of neural responses and many aspects of brain organization.
△ Less
Submitted 18 May, 2023; v1 submitted 18 December, 2021;
originally announced December 2021.
-
On The Effect Of Coding Artifacts On Acoustic Scene Classification
Authors:
Nagashree K. S. Rao,
Nils Peters
Abstract:
Previous DCASE challenges contributed to an increase in the performance of acoustic scene classification systems. State-of-the-art classifiers demand significant processing capabilities and memory which is challenging for resource-constrained mobile or IoT edge devices. Thus, it is more likely to deploy these models on more powerful hardware and classify audio recordings previously uploaded (or st…
▽ More
Previous DCASE challenges contributed to an increase in the performance of acoustic scene classification systems. State-of-the-art classifiers demand significant processing capabilities and memory which is challenging for resource-constrained mobile or IoT edge devices. Thus, it is more likely to deploy these models on more powerful hardware and classify audio recordings previously uploaded (or streamed) from low-power edge devices. In such scenario, the edge device may apply perceptual audio coding to reduce the transmission data rate. This paper explores the effect of perceptual audio coding on the classification performance using a DCASE 2020 challenge contribution [1]. We found that classification accuracy can degrade by up to 57% compared to classifying original (uncompressed) audio. We further demonstrate how lossy audio compression techniques during model training can improve classification accuracy of compressed audio signals even for audio codecs and codec bitrates not included in the training process.
△ Less
Submitted 9 December, 2021;
originally announced December 2021.
-
Direct observation of Jahn-Teller critical dynamics at a charge-order Verwey transition
Authors:
Vinícius Pascotto Gastaldo,
Mala N. Rao,
Alexey Bosak,
Matteo d'Astuto,
Andrea Prodi,
Marine Verseils,
Yannick Klein,
Christophe Bellin,
Luigi Paolasini,
Adilson J. A. de Oliveira,
Edmondo Gilioli,
Samrath Lal Chaplot,
Andrea Gauzzi
Abstract:
By means of diffuse and inelastic x-ray scattering (DS,IXS), we probe directly the charge-ordering (CO) dynamics in the Verwey system (NaMn$_3$)Mn$_4$O$_{12}$, where a peculiar quadruple perovskite structure with no oxygen disorder stabilizes a nearly full Mn$^{3+}$/Mn$^{4+}$ static charge order at $T_{\rm CO}$=175 K concomitant to a commensurate structural modulation with propagation vector…
▽ More
By means of diffuse and inelastic x-ray scattering (DS,IXS), we probe directly the charge-ordering (CO) dynamics in the Verwey system (NaMn$_3$)Mn$_4$O$_{12}$, where a peculiar quadruple perovskite structure with no oxygen disorder stabilizes a nearly full Mn$^{3+}$/Mn$^{4+}$ static charge order at $T_{\rm CO}$=175 K concomitant to a commensurate structural modulation with propagation vector ${\bf q}_{\rm CO}=(\frac{1}{2},\frac{1}{2},0)$. At $T_{\rm CO}$, the IXS spectra unveil a softening of a 35.3 meV phonon at ${\bf q}_{\rm CO}$. Lattice dynamical calculations enable us to attribute this soft phonon to a A$_g$ mode whose polarization matches the Jahn-Teller-like distortion pattern of the structural modulation. This result demonstrates that the Jahn-Teller instability is the driving force of the CO Verwey transition in (NaMn$_3$)Mn$_4$O$_{12}$, thus elucidating a long-standing controversy regarding the mechanism of this transition observed in other mixed-valence systems like magnetite.
△ Less
Submitted 8 December, 2021;
originally announced December 2021.
-
THz Band Channel Measurements and Statistical Modeling for Urban Microcellular Environments
Authors:
Naveed A. Abbasi,
Jorge Gomez-Ponce,
Revanth Kondaveti,
Ashish Kumar,
Eshan Bhagat,
Rakesh N S Rao,
Shadi Abu-Surra,
Gary Xu,
Charlie Zhang,
Andreas F. Molisch
Abstract:
The THz band (0.1-10 THz) has attracted considerable attention for next-generation wireless communications, due to the large amount of available bandwidth that may be key to meet the rapidly increasing data rate requirements. Before deploying a system in this band, a detailed wireless channel analysis is required as the basis for proper design and testing of system implementations. One of the most…
▽ More
The THz band (0.1-10 THz) has attracted considerable attention for next-generation wireless communications, due to the large amount of available bandwidth that may be key to meet the rapidly increasing data rate requirements. Before deploying a system in this band, a detailed wireless channel analysis is required as the basis for proper design and testing of system implementations. One of the most important deployment scenarios of this band is the outdoor microcellular environment, where the Transmitter (Tx) and the Receiver (Rx) have a significant height difference (typically $ \ge 10$ m). In this paper, we present double-directional (i.e., directionally resolved at both link ends) channel measurements in such a microcellular scenario encompassing street canyons and an open square. Measurements are done for a 1 GHz bandwidth between 145-146 GHz and an antenna beamwidth of 13 degree; distances between Tx and Rx are up to 85 m and the Tx is at a height of 11.5 m from the ground. The measurements are analyzed to estimate path loss, shadowing, delay spread, angular spread, and multipath component (MPC) power distribution. These results allow the development of more realistic and detailed THz channel models and system performance assessment.
△ Less
Submitted 3 December, 2021;
originally announced December 2021.
-
Photonic and electronic state interactions in BaTiO3 based Optical Microcavity
Authors:
Jitendra Nath Acharyya,
R. B. Gangineni,
D. Narayana Rao,
G. Vijaya Prakash
Abstract:
The photonic modes mediated absorption dynamics at femtosecond time scales along with the control of spontaneous emission tunability are investigated all-dielectric optical microcavity having BaTiO3 (BTO) as defect layer. The cavity-enhanced transient absorption reveals the dominant excited state absorption (ESA) of both photonic and electronic modes due to strong third-order optical nonlinearity…
▽ More
The photonic modes mediated absorption dynamics at femtosecond time scales along with the control of spontaneous emission tunability are investigated all-dielectric optical microcavity having BaTiO3 (BTO) as defect layer. The cavity-enhanced transient absorption reveals the dominant excited state absorption (ESA) of both photonic and electronic modes due to strong third-order optical nonlinearity influence. Photoluminescence of BTO is found to be guided and tuned by the photonic cavity mode. Such active photonic structures can be envisaged as a potential candidate in nonlinear optics and photonic device applications.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.
-
Advanced Architectures for High-Performance Quantum Networking
Authors:
Muneer Alshowkan,
Philip G. Evans,
Brian P. Williams,
Nageswara S. V. Rao,
Claire E. Marvinney,
Yun-Yi Pai,
Benjamin J. Lawrie,
Nicholas A. Peters,
Joseph M. Lukens
Abstract:
As practical quantum networks prepare to serve an ever-expanding number of nodes, there has grown a need for advanced auxiliary classical systems that support the quantum protocols and maintain compatibility with the existing fiber-optic infrastructure. We propose and demonstrate a quantum local area network design that addresses current deployment limitations in timing and security in a scalable…
▽ More
As practical quantum networks prepare to serve an ever-expanding number of nodes, there has grown a need for advanced auxiliary classical systems that support the quantum protocols and maintain compatibility with the existing fiber-optic infrastructure. We propose and demonstrate a quantum local area network design that addresses current deployment limitations in timing and security in a scalable fashion using commercial off-the-shelf components. We employ White Rabbit switches to synchronize three remote nodes with ultra-low timing jitter, significantly increasing the fidelities of the distributed entangled states over previous work with Global Positioning System clocks. Second, using a parallel quantum key distribution channel, we secure the classical communications needed for instrument control and data management. In this way, the conventional network which manages our entanglement network is secured using keys generated via an underlying quantum key distribution layer, preserving the integrity of the supporting systems and the relevant data in a future-proof fashion.
△ Less
Submitted 30 November, 2021;
originally announced November 2021.
-
Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods
Authors:
Wenqing Zheng,
Edward W Huang,
Nikhil Rao,
Sumeet Katariya,
Zhangyang Wang,
Karthik Subbian
Abstract:
Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification, regression, and recommendation tasks. GNNs work well when rich and high-quality connections are available. However, their effectiveness is often jeopardized in many real-world graphs in which node degrees have power-law distributions. The extreme case of this situation, where a node may have no neighbor…
▽ More
Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification, regression, and recommendation tasks. GNNs work well when rich and high-quality connections are available. However, their effectiveness is often jeopardized in many real-world graphs in which node degrees have power-law distributions. The extreme case of this situation, where a node may have no neighbors, is called Strict Cold Start (SCS). SCS forces the prediction to rely completely on the node's own features. We propose Cold Brew, a teacher-student distillation approach to address the SCS and noisy-neighbor challenges for GNNs. We also introduce feature contribution ratio (FCR), a metric to quantify the behavior of inductive GNNs to solve SCS. We experimentally show that FCR disentangles the contributions of different graph data components and helps select the best architecture for SCS generalization. We further demonstrate the superior performance of Cold Brew on several public benchmark and proprietary e-commerce datasets, where many nodes have either very few or noisy connections. Our source code is available at https://github.com/amazon-research/gnn-tail-generalization.
△ Less
Submitted 13 March, 2022; v1 submitted 8 November, 2021;
originally announced November 2021.
-
Lessons Learned on the Interface between Quantum and Conventional Networking
Authors:
Muneer Alshowkan,
Nageswara S. V. Rao,
Joseph C. Chapman,
Brian P. Williams,
Philip G. Evans,
Raphael C. Pooser,
Joseph M. Lukens,
Nicholas A. Peters
Abstract:
The future Quantum Internet is expected to be based on a hybrid architecture with core quantum transport capabilities complemented by conventional networking.Practical and foundational considerations indicate the need for conventional control and data planes that (i) utilize extensive existing telecommunications fiber infrastructure, and (ii) provide parallel conventional data channels needed for…
▽ More
The future Quantum Internet is expected to be based on a hybrid architecture with core quantum transport capabilities complemented by conventional networking.Practical and foundational considerations indicate the need for conventional control and data planes that (i) utilize extensive existing telecommunications fiber infrastructure, and (ii) provide parallel conventional data channels needed for quantum networking protocols. We propose a quantum-conventional network (QCN) harness to implement a new architecture to meet these requirements. The QCN control plane carries the control and management traffic, whereas its data plane handles the conventional and quantum data communications. We established a local area QCN connecting three quantum laboratories over dedicated fiber and conventional network connections. We describe considerations and tradeoffs for layering QCN functionalities, informed by our recent quantum entanglement distribution experiments conducted over this network.
△ Less
Submitted 3 November, 2021;
originally announced November 2021.
-
Cluster-and-Conquer: A Framework For Time-Series Forecasting
Authors:
Reese Pathak,
Rajat Sen,
Nikhil Rao,
N. Benjamin Erichson,
Michael I. Jordan,
Inderjit S. Dhillon
Abstract:
We propose a three-stage framework for forecasting high-dimensional time-series data. Our method first estimates parameters for each univariate time series. Next, we use these parameters to cluster the time series. These clusters can be viewed as multivariate time series, for which we then compute parameters. The forecasted values of a single time series can depend on the history of other time ser…
▽ More
We propose a three-stage framework for forecasting high-dimensional time-series data. Our method first estimates parameters for each univariate time series. Next, we use these parameters to cluster the time series. These clusters can be viewed as multivariate time series, for which we then compute parameters. The forecasted values of a single time series can depend on the history of other time series in the same cluster, accounting for intra-cluster similarity while minimizing potential noise in predictions by ignoring inter-cluster effects. Our framework -- which we refer to as "cluster-and-conquer" -- is highly general, allowing for any time-series forecasting and clustering method to be used in each step. It is computationally efficient and embarrassingly parallel. We motivate our framework with a theoretical analysis in an idealized mixed linear regression setting, where we provide guarantees on the quality of the estimates. We accompany these guarantees with experimental results that demonstrate the advantages of our framework: when instantiated with simple linear autoregressive models, we are able to achieve state-of-the-art results on several benchmark datasets, sometimes outperforming deep-learning-based approaches.
△ Less
Submitted 26 October, 2021;
originally announced October 2021.
-
Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs
Authors:
Nurendra Choudhary,
Nikhil Rao,
Sumeet Katariya,
Karthik Subbian,
Chandan K. Reddy
Abstract:
Logical reasoning over Knowledge Graphs (KGs) is a fundamental technique that can provide efficient querying mechanism over large and incomplete databases. Current approaches employ spatial geometries such as boxes to learn query representations that encompass the answer entities and model the logical operations of projection and intersection. However, their geometry is restrictive and leads to no…
▽ More
Logical reasoning over Knowledge Graphs (KGs) is a fundamental technique that can provide efficient querying mechanism over large and incomplete databases. Current approaches employ spatial geometries such as boxes to learn query representations that encompass the answer entities and model the logical operations of projection and intersection. However, their geometry is restrictive and leads to non-smooth strict boundaries, which further results in ambiguous answer entities. Furthermore, previous works propose transformation tricks to handle unions which results in non-closure and, thus, cannot be chained in a stream. In this paper, we propose a Probabilistic Entity Representation Model (PERM) to encode entities as a Multivariate Gaussian density with mean and covariance parameters to capture its semantic position and smooth decision boundary, respectively. Additionally, we also define the closed logical operations of projection, intersection, and union that can be aggregated using an end-to-end objective function. On the logical query reasoning problem, we demonstrate that the proposed PERM significantly outperforms the state-of-the-art methods on various public benchmark KG datasets on standard evaluation metrics. We also evaluate PERM's competence on a COVID-19 drug-repurposing case study and show that our proposed work is able to recommend drugs with substantially better F1 than current methods. Finally, we demonstrate the working of our PERM's query answering process through a low-dimensional visualization of the Gaussian representations.
△ Less
Submitted 30 October, 2021; v1 submitted 26 October, 2021;
originally announced October 2021.
-
TorchEsegeta: Framework for Interpretability and Explainability of Image-based Deep Learning Models
Authors:
Soumick Chatterjee,
Arnab Das,
Chirag Mandal,
Budhaditya Mukhopadhyay,
Manish Vipinraj,
Aniruddh Shukla,
Rajatha Nagaraja Rao,
Chompunuch Sarasaen,
Oliver Speck,
Andreas Nürnberger
Abstract:
Clinicians are often very sceptical about applying automatic image processing approaches, especially deep learning based methods, in practice. One main reason for this is the black-box nature of these approaches and the inherent problem of missing insights of the automatically derived decisions. In order to increase trust in these methods, this paper presents approaches that help to interpret and…
▽ More
Clinicians are often very sceptical about applying automatic image processing approaches, especially deep learning based methods, in practice. One main reason for this is the black-box nature of these approaches and the inherent problem of missing insights of the automatically derived decisions. In order to increase trust in these methods, this paper presents approaches that help to interpret and explain the results of deep learning algorithms by depicting the anatomical areas which influence the decision of the algorithm most. Moreover, this research presents a unified framework, TorchEsegeta, for applying various interpretability and explainability techniques for deep learning models and generate visual interpretations and explanations for clinicians to corroborate their clinical findings. In addition, this will aid in gaining confidence in such methods. The framework builds on existing interpretability and explainability techniques that are currently focusing on classification models, extending them to segmentation tasks. In addition, these methods have been adapted to 3D models for volumetric analysis. The proposed framework provides methods to quantitatively compare visual explanations using infidelity and sensitivity metrics. This framework can be used by data scientists to perform post-hoc interpretations and explanations of their models, develop more explainable tools and present the findings to clinicians to increase their faith in such models. The proposed framework was evaluated based on a use case scenario of vessel segmentation models trained on Time-of-fight (TOF) Magnetic Resonance Angiogram (MRA) images of the human brain. Quantitative and qualitative results of a comparative study of different models and interpretability methods are presented. Furthermore, this paper provides an extensive overview of several existing interpretability and explainability methods.
△ Less
Submitted 7 February, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.
-
AstroSat study of the globular cluster NGC 2298: probable evolutionary scenarios of hot HB stars
Authors:
Sharmila Rani,
Gajendra Pandey,
Annapurni Subramaniam,
Chul Chung,
Snehalata Sahu,
N. Kameswara Rao
Abstract:
We present the far-UV (FUV) photometry of images acquired with UVIT on AstroSat to probe the horizontal branch (HB) population of the Galactic globular cluster NGC 2298. UV-optical color-magnitude diagrams (CMDs) are constructed for member stars in combination with HST UV Globular Cluster Survey (HUGS) data for the central region and Gaia and ground-based photometric data for the outer region. Blu…
▽ More
We present the far-UV (FUV) photometry of images acquired with UVIT on AstroSat to probe the horizontal branch (HB) population of the Galactic globular cluster NGC 2298. UV-optical color-magnitude diagrams (CMDs) are constructed for member stars in combination with HST UV Globular Cluster Survey (HUGS) data for the central region and Gaia and ground-based photometric data for the outer region. Blue HB (BHB) sequence with a spread and four hot HB stars are detected in all FUV-optical CMDs and are compared with theoretical updated BaSTI isochrones and synthetic HB models with a range in helium abundance, suggesting that the hot HB stars are helium enhanced when compared to the BHB. The estimated effective temperature, radius, and luminosity of HB stars, using best SED fits, were compared with various HB models. BHB stars span a temperature range from 7,500-12,250 K. The three hot HB stars have 35,000-40,000 K, whereas one star has around 100,000K. We suggest the following evolutionary scenarios: two stars are likely to be the progeny of extreme HB (EHB) stars formed through an early hot-flasher scenario; one is likely to be an EHB star with probable helium enrichment, the hottest HB star is about to enter the WD cooling phase, could have evolved from BHB phase. Nevertheless, these are interesting spectroscopic targets to understand the late stages of evolution.
△ Less
Submitted 9 October, 2021;
originally announced October 2021.
-
Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes
Authors:
Satpreet Harcharan Singh,
Floris van Breugel,
Rajesh P. N. Rao,
Bingni Wen Brunton
Abstract:
Tracking a turbulent plume to locate its source is a complex control problem because it requires multi-sensory integration and must be robust to intermittent odors, changing wind direction, and variable plume statistics. This task is routinely performed by flying insects, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior have been studied in detail…
▽ More
Tracking a turbulent plume to locate its source is a complex control problem because it requires multi-sensory integration and must be robust to intermittent odors, changing wind direction, and variable plume statistics. This task is routinely performed by flying insects, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior have been studied in detail in many experimental studies. Here, we take a complementary in silico approach, using artificial agents trained with reinforcement learning to develop an integrated understanding of the behaviors and neural computations that support plume tracking. Specifically, we use deep reinforcement learning (DRL) to train recurrent neural network (RNN) agents to locate the source of simulated turbulent plumes. Interestingly, the agents' emergent behaviors resemble those of flying insects, and the RNNs learn to represent task-relevant variables, such as head direction and time since last odor encounter. Our analyses suggest an intriguing experimentally testable hypothesis for tracking plumes in changing wind direction -- that agents follow local plume shape rather than the current wind direction. While reflexive short-memory behaviors are sufficient for tracking plumes in constant wind, longer timescales of memory are essential for tracking plumes that switch direction. At the level of neural dynamics, the RNNs' population activity is low-dimensional and organized into distinct dynamical structures, with some correspondence to behavioral modules. Our in silico approach provides key intuitions for turbulent plume tracking strategies and motivates future targeted experimental and theoretical developments.
△ Less
Submitted 17 December, 2021; v1 submitted 25 September, 2021;
originally announced September 2021.
-
Scalable Feature Selection for (Multitask) Gradient Boosted Trees
Authors:
Cuize Han,
Nikhil Rao,
Daria Sorokina,
Karthik Subbian
Abstract:
Gradient Boosted Decision Trees (GBDTs) are widely used for building ranking and relevance models in search and recommendation. Considerations such as latency and interpretability dictate the use of as few features as possible to train these models. Feature selection in GBDT models typically involves heuristically ranking the features by importance and selecting the top few, or by performing a ful…
▽ More
Gradient Boosted Decision Trees (GBDTs) are widely used for building ranking and relevance models in search and recommendation. Considerations such as latency and interpretability dictate the use of as few features as possible to train these models. Feature selection in GBDT models typically involves heuristically ranking the features by importance and selecting the top few, or by performing a full backward feature elimination routine. On-the-fly feature selection methods proposed previously scale suboptimally with the number of features, which can be daunting in high dimensional settings. We develop a scalable forward feature selection variant for GBDT, via a novel group testing procedure that works well in high dimensions, and enjoys favorable theoretical performance and computational guarantees. We show via extensive experiments on both public and proprietary datasets that the proposed method offers significant speedups in training time, while being as competitive as existing GBDT methods in terms of model performance metrics. We also extend the method to the multitask setting, allowing the practitioner to select common features across tasks, as well as selecting task-specific features.
△ Less
Submitted 4 September, 2021;
originally announced September 2021.
-
Maximizing and Satisficing in Multi-armed Bandits with Graph Information
Authors:
Parth K. Thaker,
Mohit Malu,
Nikhil Rao,
Gautam Dasarathy
Abstract:
Pure exploration in multi-armed bandits has emerged as an important framework for modeling decision-making and search under uncertainty. In modern applications, however, one is often faced with a tremendously large number of options. Even obtaining one observation per option may be too costly rendering traditional pure exploration algorithms ineffective. Fortunately, one often has access to simila…
▽ More
Pure exploration in multi-armed bandits has emerged as an important framework for modeling decision-making and search under uncertainty. In modern applications, however, one is often faced with a tremendously large number of options. Even obtaining one observation per option may be too costly rendering traditional pure exploration algorithms ineffective. Fortunately, one often has access to similar relationships amongst the options that can be leveraged. In this paper, we consider the pure exploration problem in stochastic multi-armed bandits where the similarities between the arms are captured by a graph and the rewards may be represented as a smooth signal on this graph. In particular, we consider the problem of finding the arm with the maximum reward (i.e., the maximizing problem) or one with a sufficiently high reward (i.e., the satisficing problem) under this model. We propose novel algorithms \textbf{\algoname{}} (GRaph-based UcB) and $ζ$-\textbf{\algoname{}} for these problems and provide a theoretical characterization of their performance which specifically elicits the benefit of the graph side information. We also prove a lower bound on the data requirement, showing a large class of problems where these algorithms are near-optimal. We complement our theory with experimental results that show the benefit of capitalizing on such side information.
△ Less
Submitted 20 November, 2022; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Measurement of the cosmic ray helium energy spectrum from 70 GeV to 80 TeV with the DAMPE space mission
Authors:
F. Alemanno,
Q. An,
P. Azzarello,
F. C. T. Barbato,
P. Bernardini,
X. J. Bi,
M. S. Cai,
E. Catanzani,
J. Chang,
D. Y. Chen,
J. L. Chen,
Z. F. Chen,
M. Y. Cui,
T. S. Cui,
Y. X. Cui,
H. T. Dai,
A. D'Amone,
A. De Benedittis,
I. De Mitri,
F. de Palma,
M. Deliyergiyev,
M. Di Santo,
T. K. Dong,
Z. X. Dong,
G. Donvito
, et al. (120 additional authors not shown)
Abstract:
The measurement of the energy spectrum of cosmic ray helium nuclei from 70 GeV to 80 TeV using 4.5 years of data recorded by the DArk Matter Particle Explorer (DAMPE) is reported in this work. A hardening of the spectrum is observed at an energy of about 1.3 TeV, similar to previous observations. In addition, a spectral softening at about 34 TeV is revealed for the first time with large statistics…
▽ More
The measurement of the energy spectrum of cosmic ray helium nuclei from 70 GeV to 80 TeV using 4.5 years of data recorded by the DArk Matter Particle Explorer (DAMPE) is reported in this work. A hardening of the spectrum is observed at an energy of about 1.3 TeV, similar to previous observations. In addition, a spectral softening at about 34 TeV is revealed for the first time with large statistics and well controlled systematic uncertainties, with an overall significance of $4.3σ$. The DAMPE spectral measurements of both cosmic protons and helium nuclei suggest a particle charge dependent softening energy, although with current uncertainties a dependence on the number of nucleons cannot be ruled out.
△ Less
Submitted 21 May, 2021; v1 submitted 19 May, 2021;
originally announced May 2021.
-
A First Look: Towards Explainable TextVQA Models via Visual and Textual Explanations
Authors:
Varun Nagaraj Rao,
Xingjian Zhen,
Karen Hovsepian,
Mingwei Shen
Abstract:
Explainable deep learning models are advantageous in many situations. Prior work mostly provide unimodal explanations through post-hoc approaches not part of the original system design. Explanation mechanisms also ignore useful textual information present in images. In this paper, we propose MTXNet, an end-to-end trainable multimodal architecture to generate multimodal explanations, which focuses…
▽ More
Explainable deep learning models are advantageous in many situations. Prior work mostly provide unimodal explanations through post-hoc approaches not part of the original system design. Explanation mechanisms also ignore useful textual information present in images. In this paper, we propose MTXNet, an end-to-end trainable multimodal architecture to generate multimodal explanations, which focuses on the text in the image. We curate a novel dataset TextVQA-X, containing ground truth visual and multi-reference textual explanations that can be leveraged during both training and evaluation. We then quantitatively show that training with multimodal explanations complements model performance and surpasses unimodal baselines by up to 7% in CIDEr scores and 2% in IoU. More importantly, we demonstrate that the multimodal explanations are consistent with human interpretations, help justify the models' decision, and provide useful insights to help diagnose an incorrect prediction. Finally, we describe a real-world e-commerce application for using the generated multimodal explanations.
△ Less
Submitted 28 April, 2021;
originally announced May 2021.
-
Variable selection for longitudinal survey data
Authors:
Laura Dumitrescu,
Wei Qian,
J. N. K. Rao
Abstract:
In this article we propose a new variable selection method for analyzing data collected from longitudinal sample surveys. The procedure is based on the survey-weighted quadratic inference function, which was recently introduced as an alternative to the survey-weighted generalized estimating function. Under the joint model-design framework, we introduce the penalized survey-weighted quadratic infer…
▽ More
In this article we propose a new variable selection method for analyzing data collected from longitudinal sample surveys. The procedure is based on the survey-weighted quadratic inference function, which was recently introduced as an alternative to the survey-weighted generalized estimating function. Under the joint model-design framework, we introduce the penalized survey-weighted quadratic inference estimator and obtain sufficient conditions for the existence, weak consistency, sparsity and asymptotic normality. To illustrate the finite sample performance of the model selection procedure, we include a limited simulation study.
△ Less
Submitted 2 May, 2021;
originally announced May 2021.
-
Vec2GC -- A Graph Based Clustering Method for Text Representations
Authors:
Rajesh N Rao,
Manojit Chakraborty
Abstract:
NLP pipelines with limited or no labeled data, rely on unsupervised methods for document processing. Unsupervised approaches typically depend on clustering of terms or documents. In this paper, we introduce a novel clustering algorithm, Vec2GC (Vector to Graph Communities), an end-to-end pipeline to cluster terms or documents for any given text corpus. Our method uses community detection on a weig…
▽ More
NLP pipelines with limited or no labeled data, rely on unsupervised methods for document processing. Unsupervised approaches typically depend on clustering of terms or documents. In this paper, we introduce a novel clustering algorithm, Vec2GC (Vector to Graph Communities), an end-to-end pipeline to cluster terms or documents for any given text corpus. Our method uses community detection on a weighted graph of the terms or documents, created using text representation learning. Vec2GC clustering algorithm is a density based approach, that supports hierarchical clustering as well.
△ Less
Submitted 12 April, 2023; v1 submitted 15 April, 2021;
originally announced April 2021.
-
SEMIE: SEMantically Infused Embeddings with Enhanced Interpretability for Domain-specific Small Corpus
Authors:
Rishabh Gupta,
Rajesh N Rao
Abstract:
Word embeddings are a basic building block of modern NLP pipelines. Efforts have been made to learn rich, efficient, and interpretable embeddings for large generic datasets available in the public domain. However, these embeddings have limited applicability for small corpora from specific domains such as automotive, manufacturing, maintenance and support, etc. In this work, we present a comprehens…
▽ More
Word embeddings are a basic building block of modern NLP pipelines. Efforts have been made to learn rich, efficient, and interpretable embeddings for large generic datasets available in the public domain. However, these embeddings have limited applicability for small corpora from specific domains such as automotive, manufacturing, maintenance and support, etc. In this work, we present a comprehensive notion of interpretability for word embeddings and propose a novel method to generate highly interpretable and efficient embeddings for a domain-specific small corpus. We report the evaluation results of our resulting word embeddings and demonstrate their novel features for enhanced interpretability.
△ Less
Submitted 21 March, 2021;
originally announced March 2021.
-
A Reconfigurable Quantum Local Area Network Over Deployed Fiber
Authors:
Muneer Alshowkan,
Brian P. Williams,
Philip G. Evans,
Nageswara S. V. Rao,
Emma M. Simmerman,
Hsuan-Hao Lu,
Navin B. Lingaraju,
Andrew M. Weiner,
Claire E. Marvinney,
Yun-Yi Pai,
Benjamin J. Lawrie,
Nicholas A. Peters,
Joseph M. Lukens
Abstract:
Practical quantum networking architectures are crucial for scaling the connection of quantum resources. Yet quantum network testbeds have thus far underutilized the full capabilities of modern lightwave communications, such as flexible-grid bandwidth allocation. In this work, we implement flex-grid entanglement distribution in a deployed network for the first time, connecting nodes in three distin…
▽ More
Practical quantum networking architectures are crucial for scaling the connection of quantum resources. Yet quantum network testbeds have thus far underutilized the full capabilities of modern lightwave communications, such as flexible-grid bandwidth allocation. In this work, we implement flex-grid entanglement distribution in a deployed network for the first time, connecting nodes in three distinct campus buildings time-synchronized via the Global Positioning System (GPS). We quantify the quality of the distributed polarization entanglement via log-negativity, which offers a generic metric of link performance in entangled bits per second. After demonstrating successful entanglement distribution for two allocations of our eight dynamically reconfigurable channels, we demonstrate remote state preparation -- the first realization on deployed fiber -- showcasing one possible quantum protocol enabled by the distributed entanglement network. Our results realize an advanced paradigm for managing entanglement resources in quantum networks of ever-increasing complexity and service demands.
△ Less
Submitted 26 February, 2021;
originally announced February 2021.
-
ShuffleUNet: Super resolution of diffusion-weighted MRIs using deep learning
Authors:
Soumick Chatterjee,
Alessandro Sciarra,
Max Dünnwald,
Raghava Vinaykanth Mushunuri,
Ranadheer Podishetti,
Rajatha Nagaraja Rao,
Geetha Doddapaneni Gopinath,
Steffen Oeltze-Jafra,
Oliver Speck,
Andreas Nürnberger
Abstract:
Diffusion-weighted magnetic resonance imaging (DW-MRI) can be used to characterise the microstructure of the nervous tissue, e.g. to delineate brain white matter connections in a non-invasive manner via fibre tracking. Magnetic Resonance Imaging (MRI) in high spatial resolution would play an important role in visualising such fibre tracts in a superior manner. However, obtaining an image of such r…
▽ More
Diffusion-weighted magnetic resonance imaging (DW-MRI) can be used to characterise the microstructure of the nervous tissue, e.g. to delineate brain white matter connections in a non-invasive manner via fibre tracking. Magnetic Resonance Imaging (MRI) in high spatial resolution would play an important role in visualising such fibre tracts in a superior manner. However, obtaining an image of such resolution comes at the expense of longer scan time. Longer scan time can be associated with the increase of motion artefacts, due to the patient's psychological and physical conditions. Single Image Super-Resolution (SISR), a technique aimed to obtain high-resolution (HR) details from one single low-resolution (LR) input image, achieved with Deep Learning, is the focus of this study. Compared to interpolation techniques or sparse-coding algorithms, deep learning extracts prior knowledge from big datasets and produces superior MRI images from the low-resolution counterparts. In this research, a deep learning based super-resolution technique is proposed and has been applied for DW-MRI. Images from the IXI dataset have been used as the ground-truth and were artificially downsampled to simulate the low-resolution images. The proposed method has shown statistically significant improvement over the baselines and achieved an SSIM of $0.913\pm0.045$.
△ Less
Submitted 25 February, 2021;
originally announced February 2021.
-
Spatio-Temporal Look-Ahead Trajectory Prediction using Memory Neural Network
Authors:
Nishanth Rao,
Suresh Sundaram
Abstract:
Prognostication of vehicle trajectories in unknown environments is intrinsically a challenging and difficult problem to solve. The behavior of such vehicles is highly influenced by surrounding traffic, road conditions, and rogue participants present in the environment. Moreover, the presence of pedestrians, traffic lights, stop signs, etc., makes it much harder to infer the behavior of various tra…
▽ More
Prognostication of vehicle trajectories in unknown environments is intrinsically a challenging and difficult problem to solve. The behavior of such vehicles is highly influenced by surrounding traffic, road conditions, and rogue participants present in the environment. Moreover, the presence of pedestrians, traffic lights, stop signs, etc., makes it much harder to infer the behavior of various traffic agents. This paper attempts to solve the problem of Spatio-temporal look-ahead trajectory prediction using a novel recurrent neural network called the Memory Neuron Network. The Memory Neuron Network (MNN) attempts to capture the input-output relationship between the past positions and the future positions of the traffic agents. The proposed model is computationally less intensive and has a simple architecture as compared to other deep learning models that utilize LSTMs and GRUs. It is then evaluated on the publicly available NGSIM dataset and its performance is compared with several state-of-art algorithms. Additionally, the performance is also evaluated on a custom synthetic dataset generated from the CARLA simulator. It is seen that the proposed model outperforms the existing state-of-art algorithms. Finally, the model is integrated with the CARLA simulator to test its robustness in real-time traffic scenarios.
△ Less
Submitted 24 February, 2021;
originally announced February 2021.
-
Ultrafast nonlinear pulse propagation dynamics in metal-dielectric periodic photonic architectures
Authors:
Jitendra Nath Acharyya,
Akhilesh Kumar Mishra,
D. Narayana Rao,
Ajit Kumar,
G. Vijaya Prakash
Abstract:
One-dimensional (1D) metal-dielectric (MD) periodic structures take advantage of large refractive index contrast between metal and dielectrics to invoke extremely high nonlinear ultrafast responses of metal. These structures are also special due to their extremely high laser damage threshold. The Bragg like 1D MD structure (Ag/SiO2)4 enables strong optical field confinement with much enhanced nonl…
▽ More
One-dimensional (1D) metal-dielectric (MD) periodic structures take advantage of large refractive index contrast between metal and dielectrics to invoke extremely high nonlinear ultrafast responses of metal. These structures are also special due to their extremely high laser damage threshold. The Bragg like 1D MD structure (Ag/SiO2)4 enables strong optical field confinement with much enhanced nonlinear features as compared to simple metal or single (Ag/SiO2)1 structure. In the present work, the ultrafast nonlinear optical responses of the above structures are investigated via femtosecond broadband optical pump-probe technique. The enhanced nonlinear optical absorption is of reverse saturation of absorption (RSA) nature, resulted due to free-carrier absorption (FCA) and excited-state absorption (ESA) processes. The spectral nonlinearities are closely related to the pump-induced modification of the metal's dielectric functions, which are qualitatively visualized by transfer matrix and two-temperature models. The ultrafast temporal evolution of nonlinear absorption clearly demonstrated enhanced optical nonlinearity, disentangled by the electron-electron and electron-phonon dynamic interactions at picosecond time scales. A phenomenological pulse propagation model is employed that incorporates the experimentally obtained nonlinear absorption coefficients and different nonlinear effects exhibited by the system. Nonlinearity plays a crucial role in controlling the ultrafast pulse propagation and could open a new window for many nonlinear device applications. The findings of these new optical materials could possibly pave the way for promising applications in ultrafast photonics.
△ Less
Submitted 15 September, 2021; v1 submitted 17 February, 2021;
originally announced February 2021.
-
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Authors:
Sebastian Gehrmann,
Tosin Adewumi,
Karmanya Aggarwal,
Pawan Sasanka Ammanamanchi,
Aremu Anuoluwapo,
Antoine Bosselut,
Khyathi Raghavi Chandu,
Miruna Clinciu,
Dipanjan Das,
Kaustubh D. Dhole,
Wanyu Du,
Esin Durmus,
Ondřej Dušek,
Chris Emezue,
Varun Gangal,
Cristina Garbacea,
Tatsunori Hashimoto,
Yufang Hou,
Yacine Jernite,
Harsh Jhamtani,
Yangfeng Ji,
Shailza Jolly,
Mihir Kale,
Dhruv Kumar,
Faisal Ladhak
, et al. (31 additional authors not shown)
Abstract:
We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it…
▽ More
We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent anglo-centric corpora with well-established, but flawed, metrics. This disconnect makes it challenging to identify the limitations of current models and opportunities for progress. Addressing this limitation, GEM provides an environment in which models can easily be applied to a wide set of tasks and in which evaluation strategies can be tested. Regular updates to the benchmark will help NLG research become more multilingual and evolve the challenge alongside models. This paper serves as the description of the data for which we are organizing a shared task at our ACL 2021 Workshop and to which we invite the entire NLG community to participate.
△ Less
Submitted 1 April, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
Machine-Learned Phase Diagrams of Generalized Kitaev Honeycomb Magnets
Authors:
Nihal Rao,
Ke Liu,
Marc Machaczek,
Lode Pollet
Abstract:
We use a recently developed interpretable and unsupervised machine-learning method, the tensorial kernel support vector machine (TK-SVM), to investigate the low-temperature classical phase diagram of a generalized Heisenberg-Kitaev-$Γ$ ($J$-$K$-$Γ$) model on a honeycomb lattice. Aside from reproducing phases reported by previous quantum and classical studies, our machine finds a hitherto missed ne…
▽ More
We use a recently developed interpretable and unsupervised machine-learning method, the tensorial kernel support vector machine (TK-SVM), to investigate the low-temperature classical phase diagram of a generalized Heisenberg-Kitaev-$Γ$ ($J$-$K$-$Γ$) model on a honeycomb lattice. Aside from reproducing phases reported by previous quantum and classical studies, our machine finds a hitherto missed nested zigzag-stripy order and establishes the robustness of a recently identified modulated $S_3 \times Z_3$ phase, which emerges through the competition between the Kitaev and $Γ$ spin liquids, against Heisenberg interactions. The results imply that, in the restricted parameter space spanned by the three primary exchange interactions -- $J$, $K$, and $Γ$, the representative Kitaev material $α$-${\rm RuCl}_3$ lies close to the boundaries of several phases, including a simple ferromagnet, the unconventional $S_3 \times Z_3$ and nested zigzag-stripy magnets. A zigzag order is stabilized by a finite $Γ^{\prime}$ and/or $J_3$ term, whereas the four magnetic orders may compete in particular if $Γ^{\prime}$ is anti-ferromagnetic.
△ Less
Submitted 12 September, 2021; v1 submitted 1 February, 2021;
originally announced February 2021.
-
Nigraha: Machine-learning based pipeline to identify and evaluate planet candidates from TESS
Authors:
Sriram Rao,
Ashish Mahabal,
Niyanth Rao,
Cauligi Raghavendra
Abstract:
The Transiting Exoplanet Survey Satellite (TESS) has now been operational for a little over two years, covering the Northern and the Southern hemispheres once. The TESS team processes the downlinked data using the Science Processing Operations Center pipeline and Quick Look pipeline to generate alerts for follow-up. Combined with other efforts from the community, over two thousand planet candidate…
▽ More
The Transiting Exoplanet Survey Satellite (TESS) has now been operational for a little over two years, covering the Northern and the Southern hemispheres once. The TESS team processes the downlinked data using the Science Processing Operations Center pipeline and Quick Look pipeline to generate alerts for follow-up. Combined with other efforts from the community, over two thousand planet candidates have been found of which tens have been confirmed as planets. We present our pipeline, Nigraha, that is complementary to these approaches. Nigraha uses a combination of transit finding, supervised machine learning, and detailed vetting to identify with high confidence a few planet candidates that were missed by prior searches. In particular, we identify high signal to noise ratio (SNR) shallow transits that may represent more Earth-like planets. In the spirit of open data exploration we provide details of our pipeline, release our supervised machine learning model and code as open source, and make public the 38 candidates we have found in seven sectors. The model can easily be run on other sectors as is. As part of future work we outline ways to increase the yield by strengthening some of the steps where we have been conservative and discarded objects for lack of a datum or two.
△ Less
Submitted 22 February, 2021; v1 submitted 22 January, 2021;
originally announced January 2021.
-
SoftNER: Mining Knowledge Graphs From Cloud Incidents
Authors:
Manish Shetty,
Chetan Bansal,
Sumit Kumar,
Nikitha Rao,
Nachiappan Nagappan
Abstract:
The move from boxed products to services and the widespread adoption of cloud computing has had a huge impact on the software development life cycle and DevOps processes. Particularly, incident management has become critical for develo** and operating large-scale services. Prior work on incident management has heavily focused on the challenges with incident triaging and de-duplication. In this w…
▽ More
The move from boxed products to services and the widespread adoption of cloud computing has had a huge impact on the software development life cycle and DevOps processes. Particularly, incident management has become critical for develo** and operating large-scale services. Prior work on incident management has heavily focused on the challenges with incident triaging and de-duplication. In this work, we address the fundamental problem of structured knowledge extraction from service incidents. We have built SoftNER, a framework for mining Knowledge Graphs from incident reports. First, we build a novel multi-task learning based BiLSTM-CRF model which leverages not just the semantic context but also the data-types for extracting factual information in the form of named entities. Next, we present an approach to mine relations between the named entities for automatically constructing knowledge graphs. We have deployed SoftNER at Microsoft, a major cloud service provider and have evaluated it on more than 2 months of cloud incidents. We show that the unsupervised machine learning pipeline has a high precision of 0.96. Our multi-task learning based deep learning model also outperforms the state-of-the-art NER models. Lastly, using the knowledge extracted by SoftNER, we are able to build accurate models for applications such as incident triaging and recommending entities based on their relevance to incident titles.
△ Less
Submitted 23 June, 2021; v1 submitted 14 January, 2021;
originally announced January 2021.
-
VFSIE -- Development and Testing Framework for Federated Science Instruments
Authors:
Anees Al-Najjar,
Nageswara S. V. Rao,
Neena Imam,
Thomas Naughton,
Seth Hitefield,
Lawrence Sorrillo,
James Kohl,
Wael Elwasif,
Jean-Christophe Bilheux,
Hassina Bilheux,
Swen Boehm,
Jason Kincl
Abstract:
Recent developments in softwarization of networked infrastructures combined with containerization of computing workflows promise unprecedented compute anywhere and everywhere capabilities for federations of edge and remote computing systems and science instruments. The development and testing of software stacks that implement these capabilities over physical production federations, however, is not…
▽ More
Recent developments in softwarization of networked infrastructures combined with containerization of computing workflows promise unprecedented compute anywhere and everywhere capabilities for federations of edge and remote computing systems and science instruments. The development and testing of software stacks that implement these capabilities over physical production federations, however, is not very practical nor cost-effective. In response, we develop a digital twin of the physical infrastructure, called the Virtual Federated Science Instrument Environment (VFSIE). This framework emulates the federation using containers and hosts connected over an emulated network, and supports the development and testing of federation stacks and workflows. We illustrate its use in a case study involving Jupiter Notebook computations and instrument control.
△ Less
Submitted 2 February, 2021; v1 submitted 6 January, 2021;
originally announced January 2021.
-
In orbit performance of UVIT over the 5 years
Authors:
S. K. Ghosh,
P. Joseph,
A. Kumar,
J. Postma,
C. S. Stalin,
A. Subramaniam,
S. N. Tandon,
I. V. Barve,
A. Devaraj,
K. George,
V. Girish,
J. B. Hutchings,
P. U. Kamath,
S. Kathiravan,
J. P. Lancelot,
D. Leahy,
P. K. Mahesh,
R. Mohan,
S. Nagabhushana,
A. K. Pati,
N. Kameswara Rao,
K. Sankarasubramanian,
P. Sreekumar,
S. Sriram
Abstract:
Over the last 5 years, UVIT has completed observations of more than 500 proposals with ~ 800 unique pointings. In addition, regular planned monitoring observations have been made and from their analysis various key parameters related to in orbit performance of UVIT have been quantified. The sensitivities of the UV channels have remained steady indicating no effect of potential molecular contaminat…
▽ More
Over the last 5 years, UVIT has completed observations of more than 500 proposals with ~ 800 unique pointings. In addition, regular planned monitoring observations have been made and from their analysis various key parameters related to in orbit performance of UVIT have been quantified. The sensitivities of the UV channels have remained steady indicating no effect of potential molecular contamination confirming the adequacy of all the protocols implemented for avoiding contamination. The quality of the PSF through the years confirms adequacy of thermal control measures. The early calibrations obtained during the Performance Verification (PV) phase have been further revised for more subtle effects. These include flat fields and detector distortions with greater precision. The operations of UVIT have also evolved through in orbit experience, e.g. tweaking of operational sequencing, protocol for recovery from bright object detection (BOD) shutdowns, parameters for BOD thresholds, etc. Finally, some effects of charged particle hits on electronics led to opimised strategy for regular resetting. The Near-UV channel was lost in one of such operations. All the above in-orbit experiences are presented here.
△ Less
Submitted 25 December, 2020;
originally announced December 2020.
-
Self-Supervised Hyperboloid Representations from Logical Queries over Knowledge Graphs
Authors:
Nurendra Choudhary,
Nikhil Rao,
Sumeet Katariya,
Karthik Subbian,
Chandan K. Reddy
Abstract:
Knowledge Graphs (KGs) are ubiquitous structures for information storagein several real-world applications such as web search, e-commerce, social networks, and biology. Querying KGs remains a foundational and challenging problem due to their size and complexity. Promising approaches to tackle this problem include embedding the KG units (e.g., entities and relations) in a Euclidean space such that…
▽ More
Knowledge Graphs (KGs) are ubiquitous structures for information storagein several real-world applications such as web search, e-commerce, social networks, and biology. Querying KGs remains a foundational and challenging problem due to their size and complexity. Promising approaches to tackle this problem include embedding the KG units (e.g., entities and relations) in a Euclidean space such that the query embedding contains the information relevant to its results. These approaches, however, fail to capture the hierarchical nature and semantic information of the entities present in the graph. Additionally, most of these approaches only utilize multi-hop queries (that can be modeled by simple translation operations) to learn embeddings and ignore more complex operations such as intersection and union of simpler queries. To tackle such complex operations, in this paper, we formulate KG representation learning as a self-supervised logical query reasoning problem that utilizes translation, intersection and union queries over KGs. We propose Hyperboloid Embeddings (HypE), a novel self-supervised dynamic reasoning framework, that utilizes positive first-order existential queries on a KG to learn representations of its entities and relations as hyperboloids in a Poincaré ball. HypE models the positive first-order queries as geometrical translation, intersection, and union. For the problem of KG reasoning in real-world datasets, the proposed HypE model significantly outperforms the state-of-the art results. We also apply HypE to an anomaly detection task on a popular e-commerce website product taxonomy as well as hierarchically organized web articles and demonstrate significant performance improvements compared to existing baseline methods. Finally, we also visualize the learned HypE embeddings in a Poincaré ball to clearly interpret and comprehend the representation space.
△ Less
Submitted 12 May, 2021; v1 submitted 23 December, 2020;
originally announced December 2020.