-
Transition region response to Quiet Sun Ellerman Bombs
Authors:
Aditi Bhatnagar,
Luc Rouppe van der Voort,
Jayant Joshi
Abstract:
Quiet Sun Ellerman Bombs (QSEBs) are key indicators of small-scale photospheric magnetic reconnection events. Recent high-resolution observations have shown that they are ubiquitous and that large numbers of QSEBs can be found in the quiet Sun. We aim to understand the impact of QSEBs on the upper solar atmosphere by analysing their spatial and temporal relationship with the UV brightenings observ…
▽ More
Quiet Sun Ellerman Bombs (QSEBs) are key indicators of small-scale photospheric magnetic reconnection events. Recent high-resolution observations have shown that they are ubiquitous and that large numbers of QSEBs can be found in the quiet Sun. We aim to understand the impact of QSEBs on the upper solar atmosphere by analysing their spatial and temporal relationship with the UV brightenings observed in transition region diagnostics. We analyse high-resolution H-beta observations from the Swedish 1-m Solar Telescope and utilise k-means clustering to detect 1423 QSEBs in a 51 min time series. We use coordinated and co-aligned observations from the Interface Region Imaging Spectrograph (IRIS) to search for corresponding signatures in the 1400 A slit-jaw image (SJI) channel and in the Si IV 1394 A and Mg II 2798.8 A triplet spectral lines. We identify UV brightenings from SJI 1400 using a threshold of 5$σ$ above the median background. We focused on 453 long-lived QSEBs ($>1$ min) and found 67 cases of UV brightenings from SJI 1400 occurring near the QSEBs, both temporally and spatially. Temporal analysis of these events indicates that QSEBs start before UV brightenings in 57 % of cases, while UV brightenings lead in 36 % of instances. The majority of the UV brightenings occur within 1000 km from the QSEBs in the direction of the solar limb. We also identify 21 QSEBs covered by the IRIS slit, with 4 of them showing emissions in both or one of the Si IV 1394 A and Mg II 2798.8 A triplet lines, at distances within 500 km from the QSEBs in the limb direction. We conclude that a small fraction (15 %) of the long-lived QSEBs contribute to localized heating observable in transition region diagnostics, indicating a minimal role in the global heating of the upper solar atmosphere.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Towards Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion
Authors:
Alex Morehead,
Jeffrey Ruffolo,
Aadyot Bhatnagar,
Ali Madani
Abstract:
Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a genera…
▽ More
Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a generative model that jointly designs sequences and structures of nucleic acid and protein complexes, independently or in complex, using joint SE(3)-discrete diffusion noise. Such a model has important implications for emerging areas of macromolecular design including structure-based transcription factor design and design of noncoding RNA sequences. We demonstrate the utility of MMDiff through a rigorous new design benchmark for macromolecular complex generation that we introduce in this work. Our results demonstrate that MMDiff is able to successfully generate micro-RNA and single-stranded DNA molecules while being modestly capable of joint modeling DNA and RNA molecules in interaction with multi-chain protein complexes. Source code: https://github.com/Profluent-Internships/MMDiff.
△ Less
Submitted 21 December, 2023;
originally announced January 2024.
-
Do Socialization Restrictions Prevent Restaurants from Becoming Covid Hotspots?
Authors:
Aviral Bhatnagar,
Himanshu Kharkwal,
Jaideep Srivastava
Abstract:
Simulation models for infection spread can help understand what factors play a major role in infection spread. Health agencies like the Center for Disease Control (CDC) can accordingly mandate effective guidelines to curb the spread. We built an infection spread model to simulate disease propagation through airborne transmission to study the impact of restaurant operational policies on the Covid-1…
▽ More
Simulation models for infection spread can help understand what factors play a major role in infection spread. Health agencies like the Center for Disease Control (CDC) can accordingly mandate effective guidelines to curb the spread. We built an infection spread model to simulate disease propagation through airborne transmission to study the impact of restaurant operational policies on the Covid-19 infections. We use the Wells-Riley model to measure the expected value of new infections in a given time-frame in a particular location. For the purpose of this study, we have restricted our analysis to bars and restaurants in the Minneapolis-St. Paul region. Our model helps identify disease hotspots within the Twin Cities and proves that stay-at-home orders were effective during the recent lockdown, and the people typically followed the social distancing guidelines. To arrive at this conclusion, we performed significance testing by considering specific hypothetical scenarios. At the end of the study, we discuss the reasoning behind the hotspots, and make suggestions that could help avoid them.
△ Less
Submitted 8 December, 2023;
originally announced December 2023.
-
Improved Online Conformal Prediction via Strongly Adaptive Online Learning
Authors:
Aadyot Bhatnagar,
Huan Wang,
Caiming Xiong,
Yu Bai
Abstract:
We study the problem of uncertainty quantification via prediction sets, in an online setting where the data distribution may vary arbitrarily over time. Recent work develops online conformal prediction techniques that leverage regret minimization algorithms from the online learning literature to learn prediction sets with approximately valid coverage and small regret. However, standard regret mini…
▽ More
We study the problem of uncertainty quantification via prediction sets, in an online setting where the data distribution may vary arbitrarily over time. Recent work develops online conformal prediction techniques that leverage regret minimization algorithms from the online learning literature to learn prediction sets with approximately valid coverage and small regret. However, standard regret minimization could be insufficient for handling changing environments, where performance guarantees may be desired not only over the full time horizon but also in all (sub-)intervals of time. We develop new online conformal prediction methods that minimize the strongly adaptive regret, which measures the worst-case regret over all intervals of a fixed length. We prove that our methods achieve near-optimal strongly adaptive regret for all interval lengths simultaneously, and approximately valid coverage. Experiments show that our methods consistently obtain better coverage and smaller prediction sets than existing methods on real-world tasks, such as time series forecasting and image classification under distribution shift.
△ Less
Submitted 15 February, 2023;
originally announced February 2023.
-
Metaheuristic for Hub-Spoke Facility Location Problem: Application to Indian E-commerce Industry
Authors:
Aakash Sachdeva,
Bhupinder Singh,
Rahul Prasad,
Nakshatra Goel,
Ronit Mondal,
Jatin Munjal,
Abhishek Bhatnagar,
Manjeet Dahiya
Abstract:
Indian e-commerce industry has evolved over the last decade and is expected to grow over the next few years. The focus has now shifted to turnaround time (TAT) due to the emergence of many third-party logistics providers and higher customer expectations. The key consideration for delivery providers is to balance their overall operating costs while meeting the promised TAT to their customers. E-com…
▽ More
Indian e-commerce industry has evolved over the last decade and is expected to grow over the next few years. The focus has now shifted to turnaround time (TAT) due to the emergence of many third-party logistics providers and higher customer expectations. The key consideration for delivery providers is to balance their overall operating costs while meeting the promised TAT to their customers. E-commerce delivery partners operate through a network of facilities whose strategic locations help to run the operations efficiently. In this work, we identify the locations of hubs throughout the country and their corresponding map** with the distribution centers. The objective is to minimize the total network costs with TAT adherence. We use Genetic Algorithm and leverage business constraints to reduce the solution search space and hence the solution time. The results indicate an improvement of 9.73% in TAT compliance compared with the current scenario.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Child PalmID: Contactless Palmprint Recognition
Authors:
Anil K. Jain,
Akash Godbole,
Anjoo Bhatnagar,
Prem Sewak Sudhish
Abstract:
Develo** and least developed countries face the dire challenge of ensuring that each child in their country receives required doses of vaccination, adequate nutrition and proper medication. International agencies such as UNICEF, WHO and WFP, among other organizations, strive to find innovative solutions to determine which child has received the benefits and which have not. Biometric recognition…
▽ More
Develo** and least developed countries face the dire challenge of ensuring that each child in their country receives required doses of vaccination, adequate nutrition and proper medication. International agencies such as UNICEF, WHO and WFP, among other organizations, strive to find innovative solutions to determine which child has received the benefits and which have not. Biometric recognition systems have been sought out to help solve this problem. To that end, this report establishes a baseline accuracy of a commercial contactless palmprint recognition system that may be deployed for recognizing children in the age group of one to five years old. On a database of contactless palmprint images of one thousand unique palms from 500 children, we establish SOTA authentication accuracy of 90.85% @ FAR of 0.01%, rank-1 identification accuracy of 99.0% (closed set), and FPIR=0.01 @ FNIR=0.3 for open-set identification using PalmMobile SDK from Armatura.
△ Less
Submitted 14 December, 2022;
originally announced December 2022.
-
"We Need a Woman in Music": Exploring Wikipedia's Values on Article Priority
Authors:
Mo Houtti,
Isaac Johnson,
Joel Cepeda,
Soumya Khandelwal,
Aviral Bhatnagar,
Loren Terveen
Abstract:
Wikipedia -- like most peer production communities -- suffers from a basic problem: the amount of work that needs to be done (articles to be created and improved) exceeds the available resources (editor effort). Recommender systems have been deployed to address this problem, but they have tended to recommend work tasks that match individuals' personal interests, ignoring more global community valu…
▽ More
Wikipedia -- like most peer production communities -- suffers from a basic problem: the amount of work that needs to be done (articles to be created and improved) exceeds the available resources (editor effort). Recommender systems have been deployed to address this problem, but they have tended to recommend work tasks that match individuals' personal interests, ignoring more global community values. In English Wikipedia, discussion about Vital articles constitutes a proxy for community values about the types of articles that are most important, and should therefore be prioritized for improvement. We first analyzed these discussions, finding that an article's priority is considered a function of 1) its inherent importance and 2) its effects on Wikipedia's global composition. One important example of the second consideration is balance, including along the dimensions of gender and geography. We then conducted a quantitative analysis evaluating how four different article prioritization methods -- two from prior research -- would affect Wikipedia's overall balance on these two dimensions; we found significant differences among the methods. We discuss the implications of our results, including particularly how they can guide the design of recommender systems that take into account community values, not just individuals' interests.
△ Less
Submitted 17 August, 2022;
originally announced August 2022.
-
Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations
Authors:
Qingyu Chen,
Alexis Allot,
Robert Leaman,
Rezarta Islamaj Doğan,
**gcheng Du,
Li Fang,
Kai Wang,
Shuo Xu,
Yuefu Zhang,
Parsa Bagherzadeh,
Sabine Bergler,
Aakash Bhatnagar,
Nidhir Bhavsar,
Yung-Chun Chang,
Sheng-Jie Lin,
Wentai Tang,
Hongtong Zhang,
Ilija Tavchioski,
Senja Pollak,
Shubo Tian,
**feng Zhang,
Yulia Otmakhova,
Antonio Jimeno Yepes,
Hang Dong,
Honghan Wu
, et al. (14 additional authors not shown)
Abstract:
The COVID-19 pandemic has been severely impacting global society since December 2019. Massive research has been undertaken to understand the characteristics of the virus and design vaccines and drugs. The related findings have been reported in biomedical literature at a rate of about 10,000 articles on COVID-19 per month. Such rapid growth significantly challenges manual curation and interpretatio…
▽ More
The COVID-19 pandemic has been severely impacting global society since December 2019. Massive research has been undertaken to understand the characteristics of the virus and design vaccines and drugs. The related findings have been reported in biomedical literature at a rate of about 10,000 articles on COVID-19 per month. Such rapid growth significantly challenges manual curation and interpretation. For instance, LitCovid is a literature database of COVID-19-related articles in PubMed, which has accumulated more than 200,000 articles with millions of accesses each month by users worldwide. One primary curation task is to assign up to eight topics (e.g., Diagnosis and Treatment) to the articles in LitCovid. Despite the continuing advances in biomedical text mining methods, few have been dedicated to topic annotations in COVID-19 literature. To close the gap, we organized the BioCreative LitCovid track to call for a community effort to tackle automated topic annotation for COVID-19 literature. The BioCreative LitCovid dataset, consisting of over 30,000 articles with manually reviewed topics, was created for training and testing. It is one of the largest multilabel classification datasets in biomedical scientific literature. 19 teams worldwide participated and made 80 submissions in total. Most teams used hybrid systems based on transformers. The highest performing submissions achieved 0.8875, 0.9181, and 0.9394 for macro F1-score, micro F1-score, and instance-based F1-score, respectively. The level of participation and results demonstrate a successful track and help close the gap between dataset curation and method development. The dataset is publicly available via https://ftp.ncbi.nlm.nih.gov/pub/lu/LitCovid/biocreative/ for benchmarking and further development.
△ Less
Submitted 3 June, 2022; v1 submitted 20 April, 2022;
originally announced April 2022.
-
HLDC: Hindi Legal Documents Corpus
Authors:
Arnav Kapoor,
Mudit Dhawan,
Anmol Goel,
T. H. Arjun,
Akshala Bhatnagar,
Vibhu Agrawal,
Amul Agrawal,
Arnab Bhattacharya,
Ponnurangam Kumaraguru,
Ashutosh Modi
Abstract:
Many populous countries including India are burdened with a considerable backlog of legal cases. Development of automated systems that could process legal documents and augment legal practitioners can mitigate this. However, there is a dearth of high-quality corpora that is needed to develop such data-driven systems. The problem gets even more pronounced in the case of low resource languages such…
▽ More
Many populous countries including India are burdened with a considerable backlog of legal cases. Development of automated systems that could process legal documents and augment legal practitioners can mitigate this. However, there is a dearth of high-quality corpora that is needed to develop such data-driven systems. The problem gets even more pronounced in the case of low resource languages such as Hindi. In this resource paper, we introduce the Hindi Legal Documents Corpus (HLDC), a corpus of more than 900K legal documents in Hindi. Documents are cleaned and structured to enable the development of downstream applications. Further, as a use-case for the corpus, we introduce the task of bail prediction. We experiment with a battery of models and propose a Multi-Task Learning (MTL) based model for the same. MTL models use summarization as an auxiliary task along with bail prediction as the main task. Experiments with different models are indicative of the need for further research in this area. We release the corpus and model implementation code with this paper: https://github.com/Exploration-Lab/HLDC
△ Less
Submitted 24 May, 2024; v1 submitted 2 April, 2022;
originally announced April 2022.
-
Quantum effects in an expanded Black-Scholes model
Authors:
Anantya Bhatnagar,
Dimitri D. Vvedensky
Abstract:
The limitations of the classical Black-Scholes model are examined by comparing calculated and actual historical prices of European call options on stocks from several sectors of the S&P 500. Persistent differences between the two prices point to an expanded model proposed by Segal and Segal (1998) in which information not simultaneously observable or actionable with public information can be repre…
▽ More
The limitations of the classical Black-Scholes model are examined by comparing calculated and actual historical prices of European call options on stocks from several sectors of the S&P 500. Persistent differences between the two prices point to an expanded model proposed by Segal and Segal (1998) in which information not simultaneously observable or actionable with public information can be represented by an additional pseudo-Wiener process. A real linear combination of the original and added processes leads to a commutation relation analogous to that between a boson field and its canonical momentum in quantum field theory. The resulting pricing formula for a European call option replaces the classical volatility with the norm of a complex quantity, whose imaginary part is shown to compensate for the disparity between prices obtained from the classical Black-Scholes model and actual prices of the test call options. This provides market evidence for the influence of a non-classical process on the price of a security based on non-commuting operators.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
HashSet -- A Dataset For Hashtag Segmentation
Authors:
Prashant Kodali,
Akshala Bhatnagar,
Naman Ahuja,
Manish Shrivastava,
Ponnurangam Kumaraguru
Abstract:
Hashtag segmentation is the task of breaking a hashtag into its constituent tokens. Hashtags often encode the essence of user-generated posts, along with information like topic and sentiment, which are useful in downstream tasks. Hashtags prioritize brevity and are written in unique ways -- transliterating and mixing languages, spelling variations, creative named entities. Benchmark datasets used…
▽ More
Hashtag segmentation is the task of breaking a hashtag into its constituent tokens. Hashtags often encode the essence of user-generated posts, along with information like topic and sentiment, which are useful in downstream tasks. Hashtags prioritize brevity and are written in unique ways -- transliterating and mixing languages, spelling variations, creative named entities. Benchmark datasets used for the hashtag segmentation task -- STAN, BOUN -- are small in size and extracted from a single set of tweets. However, datasets should reflect the variations in writing styles of hashtags and also account for domain and language specificity, failing which the results will misrepresent model performance. We argue that model performance should be assessed on a wider variety of hashtags, and datasets should be carefully curated. To this end, we propose HashSet, a dataset comprising of: a) 1.9k manually annotated dataset; b) 3.3M loosely supervised dataset. HashSet dataset is sampled from a different set of tweets when compared to existing datasets and provides an alternate distribution of hashtags to build and validate hashtag segmentation models. We show that the performance of SOTA models for Hashtag Segmentation drops substantially on proposed dataset, indicating that the proposed dataset provides an alternate set of hashtags to train and assess models.
△ Less
Submitted 17 January, 2022;
originally announced January 2022.
-
Momentum Contrastive Autoencoder: Using Contrastive Learning for Latent Space Distribution Matching in WAE
Authors:
Devansh Arpit,
Aadyot Bhatnagar,
Huan Wang,
Caiming Xiong
Abstract:
Wasserstein autoencoder (WAE) shows that matching two distributions is equivalent to minimizing a simple autoencoder (AE) loss under the constraint that the latent space of this AE matches a pre-specified prior distribution. This latent space distribution matching is a core component of WAE, and a challenging task. In this paper, we propose to use the contrastive learning framework that has been s…
▽ More
Wasserstein autoencoder (WAE) shows that matching two distributions is equivalent to minimizing a simple autoencoder (AE) loss under the constraint that the latent space of this AE matches a pre-specified prior distribution. This latent space distribution matching is a core component of WAE, and a challenging task. In this paper, we propose to use the contrastive learning framework that has been shown to be effective for self-supervised representation learning, as a means to resolve this problem. We do so by exploiting the fact that contrastive learning objectives optimize the latent space distribution to be uniform over the unit hyper-sphere, which can be easily sampled from. We show that using the contrastive learning framework to optimize the WAE loss achieves faster convergence and more stable optimization compared with existing popular algorithms for WAE. This is also reflected in the FID scores on CelebA and CIFAR-10 datasets, and the realistic generated image quality on the CelebA-HQ dataset.
△ Less
Submitted 15 February, 2023; v1 submitted 19 October, 2021;
originally announced October 2021.
-
Inertial Particles in Superfluid Turbulence: Coflow and Counterflow
Authors:
Sanjay Shukla,
Akhilesh Kumar Verma,
Vishwanath Shukla,
Akshay Bhatnagar,
Rahul Pandit
Abstract:
We use pseudospectral direct numerical simulations (DNSs) to solve the three-dimensional (3D) Hall-Vinen-Bekharevich-Khalatnikov (HVBK) model of superfluid Helium. We then explore the statistical properties of inertial particles, in both coflow and counterflow superfluid turbulence (ST) in the 3D HVBK system; particle motion is governed by a generalization of the Maxey-Riley-Gatignol equations. We…
▽ More
We use pseudospectral direct numerical simulations (DNSs) to solve the three-dimensional (3D) Hall-Vinen-Bekharevich-Khalatnikov (HVBK) model of superfluid Helium. We then explore the statistical properties of inertial particles, in both coflow and counterflow superfluid turbulence (ST) in the 3D HVBK system; particle motion is governed by a generalization of the Maxey-Riley-Gatignol equations. We first characterize the anisotropy of counterflow ST by showing that there exist large vortical columns. The light particles show confined motion as they are attracted towards these columns and they form large clusters; by contrast, heavy particles are expelled from these vortical regions. We characterise the statistics of such inertial particles in 3D HVBK ST: (1) The mean angle $Θ(τ)$, between particle positions, separated by the time lag $τ$, exhibits two different scaling regions in (a) dissipation and (b) inertial ranges, for different values of the parameters in our model; in particular, the value of $Θ(τ)$, at large $τ$, depends on the magnitude of ${\bf U}_{ns}$. (2) The irreversibility of 3D HVBK turbulence is quantified by computing the statistics of energy increments for inertial particles. (3) The probability distribution function (PDF) of energy increments is of direct relevance to recent experimental studies of irreversibility in superfluid turbulence; we find, in agreement with these experiments, that, for counterflow ST, the skewness of this PDF is less pronounced than its counterparts for coflow ST or for classical-fluid turbulence.
△ Less
Submitted 7 January, 2023; v1 submitted 19 October, 2021;
originally announced October 2021.
-
Rate of formation of caustics in heavy particles advected by turbulence
Authors:
Akshay Bhatnagar,
Vikash Pandey,
Prasad Perlekar,
Dhrubaditya Mitra
Abstract:
The rate of collision and the relative velocities of the colliding particles in turbulent flows is a crucial part of several natural phenomena, e.g., rain formation in warm clouds and planetesimal formation in a protoplanetary disks. The particles are often modeled as passive, but heavy and inertial. Within this model, large relative velocities emerge due to formation of singularities (caustics) o…
▽ More
The rate of collision and the relative velocities of the colliding particles in turbulent flows is a crucial part of several natural phenomena, e.g., rain formation in warm clouds and planetesimal formation in a protoplanetary disks. The particles are often modeled as passive, but heavy and inertial. Within this model, large relative velocities emerge due to formation of singularities (caustics) of in the gradient matrix of the velocities of the particles. Using extensive direct numerical simulations of heavy particles in both two (direct and inverse cascade) and three dimensional turbulent flows we calculate the rate of formation of caustics, $J$ as a function of the Stokes number (${\rm St}$).The best approximation to our data is $J \sim \exp(-C/{\rm St})$, in the limit ${\rm St} \to 0 $ where $C$ is a non-universal constant.
△ Less
Submitted 6 October, 2021;
originally announced October 2021.
-
Coagulation drives turbulence in binary fluid mixtures
Authors:
Akshay Bhatnagar,
Prasad Perlekar,
Dhrubaditya Mitra
Abstract:
We use direct numerical simulations and scaling arguments to study coarsening in binary fluid mixtures with a conserved order parameter in the droplet-spinodal regime -- the volume fraction of the droplets is neither too small nor symmetric -- for small diffusivity and viscosity. Coagulation of droplets drives a turbulent flow that eventually decays. We uncover a novel coarsening mechanism, driven…
▽ More
We use direct numerical simulations and scaling arguments to study coarsening in binary fluid mixtures with a conserved order parameter in the droplet-spinodal regime -- the volume fraction of the droplets is neither too small nor symmetric -- for small diffusivity and viscosity. Coagulation of droplets drives a turbulent flow that eventually decays. We uncover a novel coarsening mechanism, driven by turbulence where the characteristic length scale of the flow is different from the characteristic length scale of droplets, giving rise to a domain growth law of $t^{1/2}$, where $t$ is time. At intermediate times, both the flow and the droplets form self-similar structures: the structure factor $S(q) \sim q^{-2}$ and the kinetic energy spectra $E(q) \sim q^{-5/3}$ for an intermediate range of $q$, the wavenumber.
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
Merlion: A Machine Learning Library for Time Series
Authors:
Aadyot Bhatnagar,
Paul Kassianik,
Chenghao Liu,
Tian Lan,
Wenzhuo Yang,
Rowan Cassius,
Doyen Sahoo,
Devansh Arpit,
Sri Subramanian,
Gerald Woo,
Amrita Saha,
Arun Kumar Jagota,
Gokulakrishnan Gopalakrishnan,
Manpreet Singh,
K C Krithika,
Sukumar Maddineni,
Daeki Cho,
Bo Zong,
Yingbo Zhou,
Caiming Xiong,
Silvio Savarese,
Steven Hoi,
Huan Wang
Abstract:
We introduce Merlion, an open-source machine learning library for time series. It features a unified interface for many commonly used models and datasets for anomaly detection and forecasting on both univariate and multivariate time series, along with standard pre/post-processing layers. It has several modules to improve ease-of-use, including visualization, anomaly score calibration to improve in…
▽ More
We introduce Merlion, an open-source machine learning library for time series. It features a unified interface for many commonly used models and datasets for anomaly detection and forecasting on both univariate and multivariate time series, along with standard pre/post-processing layers. It has several modules to improve ease-of-use, including visualization, anomaly score calibration to improve interpetability, AutoML for hyperparameter tuning and model selection, and model ensembling. Merlion also provides a unique evaluation framework that simulates the live deployment and re-training of a model in production. This library aims to provide engineers and researchers a one-stop solution to rapidly develop models for their specific time series needs and benchmark them across multiple time series datasets. In this technical report, we highlight Merlion's architecture and major functionalities, and we report benchmark numbers across different baseline models and ensembles.
△ Less
Submitted 19 September, 2021;
originally announced September 2021.
-
Local Calibration: Metrics and Recalibration
Authors:
Rachel Luo,
Aadyot Bhatnagar,
Yu Bai,
Shengjia Zhao,
Huan Wang,
Caiming Xiong,
Silvio Savarese,
Stefano Ermon,
Edward Schmerling,
Marco Pavone
Abstract:
Probabilistic classifiers output confidence scores along with their predictions, and these confidence scores should be calibrated, i.e., they should reflect the reliability of the prediction. Confidence scores that minimize standard metrics such as the expected calibration error (ECE) accurately measure the reliability on average across the entire population. However, it is in general impossible t…
▽ More
Probabilistic classifiers output confidence scores along with their predictions, and these confidence scores should be calibrated, i.e., they should reflect the reliability of the prediction. Confidence scores that minimize standard metrics such as the expected calibration error (ECE) accurately measure the reliability on average across the entire population. However, it is in general impossible to measure the reliability of an individual prediction. In this work, we propose the local calibration error (LCE) to span the gap between average and individual reliability. For each individual prediction, the LCE measures the average reliability of a set of similar predictions, where similarity is quantified by a kernel function on a pretrained feature space and by a binning scheme over predicted model confidences. We show theoretically that the LCE can be estimated sample-efficiently from data, and empirically find that it reveals miscalibration modes that are more fine-grained than the ECE can detect. Our key result is a novel local recalibration method LoRe, to improve confidence scores for individual predictions and decrease the LCE. Experimentally, we show that our recalibration method produces more accurate confidence scores, which improves downstream fairness and decision making on classification tasks with both image and tabular data.
△ Less
Submitted 18 August, 2022; v1 submitted 22 February, 2021;
originally announced February 2021.
-
Paths to caustic formation in turbulent aerosols
Authors:
Jan Meibohm,
Vikash Pandey,
Akshay Bhatnagar,
Kristian Gustavsson,
Dhrubaditya Mitra,
Prasad Perlekar,
B. Mehlig
Abstract:
The dynamics of small, yet heavy, identical particles in turbulence exhibits singularities, called caustics, that lead to large fluctuations in the spatial particle-number density, and in collision velocities. For large particle, inertia the fluid velocity at the particle position is essentially a white-noise signal and caustic formation is analogous to Kramers escape. Here we show that caustic fo…
▽ More
The dynamics of small, yet heavy, identical particles in turbulence exhibits singularities, called caustics, that lead to large fluctuations in the spatial particle-number density, and in collision velocities. For large particle, inertia the fluid velocity at the particle position is essentially a white-noise signal and caustic formation is analogous to Kramers escape. Here we show that caustic formation at small particle inertia is different. Caustics tend to form in the vicinity of particle trajectories that experience a specific history of fluid-velocity gradients, characterised by low vorticity and a violent strain exceeding a large threshold. We develop a theory that explains our findings in terms of an optimal path to caustic formation that is approached in the small inertia limit.
△ Less
Submitted 15 June, 2021; v1 submitted 15 December, 2020;
originally announced December 2020.
-
Holography of pp-waves in conformal gravity
Authors:
A. Bhatnagar,
I. Lovrekovic
Abstract:
We consider holography of two pp-wave metrics in conformal gravity, their one point functions, and asymptotic symmetries. One of the metrics is a generalization of the standard pp-waves in Einstein gravity to conformal gravity. The holography of this metric shows that within conformal gravity one can have realised solution which has non-vanishing partially massless response (PMR) tensor even for v…
▽ More
We consider holography of two pp-wave metrics in conformal gravity, their one point functions, and asymptotic symmetries. One of the metrics is a generalization of the standard pp-waves in Einstein gravity to conformal gravity. The holography of this metric shows that within conformal gravity one can have realised solution which has non-vanishing partially massless response (PMR) tensor even for vanishing subleading term in the Fefferman-Graham expansion (i.e. Neumann boundary conditions), and vice-versa.
△ Less
Submitted 11 November, 2020;
originally announced November 2020.
-
Infant-ID: Fingerprints for Global Good
Authors:
Joshua J. Engelsma,
Debayan Deb,
Kai Cao,
Anjoo Bhatnagar,
Prem S. Sudhish,
Anil K. Jain
Abstract:
In many of the least developed and develo** countries, a multitude of infants continue to suffer and die from vaccine-preventable diseases and malnutrition. Lamentably, the lack of official identification documentation makes it exceedingly difficult to track which infants have been vaccinated and which infants have received nutritional supplements. Answering these questions could prevent this in…
▽ More
In many of the least developed and develo** countries, a multitude of infants continue to suffer and die from vaccine-preventable diseases and malnutrition. Lamentably, the lack of official identification documentation makes it exceedingly difficult to track which infants have been vaccinated and which infants have received nutritional supplements. Answering these questions could prevent this infant suffering and premature death around the world. To that end, we propose Infant-Prints, an end-to-end, low-cost, infant fingerprint recognition system. Infant-Prints is comprised of our (i) custom built, compact, low-cost (85 USD), high-resolution (1,900 ppi), ergonomic fingerprint reader, and (ii) high-resolution infant fingerprint matcher. To evaluate the efficacy of Infant-Prints, we collected a longitudinal infant fingerprint database captured in 4 different sessions over a 12-month time span (December 2018 to January 2020), from 315 infants at the Saran Ashram Hospital, a charitable hospital in Dayalbagh, Agra, India. Our experimental results demonstrate, for the first time, that Infant-Prints can deliver accurate and reliable recognition (over time) of infants enrolled between the ages of 2-3 months, in time for effective delivery of vaccinations, healthcare, and nutritional supplements (TAR=95.2% @ FAR = 1.0% for infants aged 8-16 weeks at enrollment and authenticated 3 months later).
△ Less
Submitted 7 October, 2020;
originally announced October 2020.
-
Leggett-Garg tests for macrorealism: interference experiments and the simple harmonic oscillator
Authors:
J. J. Halliwell,
A. Bhatnagar,
E. Ireland,
H. Nadeem,
V. Wimalaweera
Abstract:
Leggett-Garg (LG) tests for macrorealism were originally designed to explore quantum coherence on the macroscopic scale. Interference experiments and systems modelled by harmonic oscillators provide useful examples of situations in which macroscopicity has been approached experimentally and are readily turned into LG tests for a single dichotomic variable Q. Applying this approach to the double-sl…
▽ More
Leggett-Garg (LG) tests for macrorealism were originally designed to explore quantum coherence on the macroscopic scale. Interference experiments and systems modelled by harmonic oscillators provide useful examples of situations in which macroscopicity has been approached experimentally and are readily turned into LG tests for a single dichotomic variable Q. Applying this approach to the double-slit experiment in which a non-invasive measurement at the slits is included, we exhibit LG violations. We find that these violations are always accompanied by destructive interference. The converse is not true in general and we find that there are non-trivial regimes in which there is destructive interference but the two-time LG inequalities are satisfied which implies that it is in fact often possible to assign (indirectly determined) probabilities for the interferometer paths. Similar features have been observed in recent work involving a LG analysis of a Mach-Zehnder interferometer and we compare with those results. We extend the analysis to the triple-slit experiment again finding LG violations, and we also exhibit examples of some surprising relationships between LG inequalities and NSIT conditions that do not exist for dichotomic variables. For the simple harmonic oscillator, we find an analytically tractable example showing a two-time LG violation with a gaussian initial state, echoing in simpler form recent results of Bose et al (Phys. Rev. Lett. 120, 210402 (2018)).
△ Less
Submitted 21 March, 2021; v1 submitted 8 September, 2020;
originally announced September 2020.
-
An investigation of phone-based subword units for end-to-end speech recognition
Authors:
Weiran Wang,
Guangsen Wang,
Aadyot Bhatnagar,
Yingbo Zhou,
Caiming Xiong,
Richard Socher
Abstract:
Phones and their context-dependent variants have been the standard modeling units for conventional speech recognition systems, while characters and subwords have demonstrated their effectiveness for end-to-end recognition systems. We investigate the use of phone-based subwords, in particular, byte pair encoder (BPE), as modeling units for end-to-end speech recognition. In addition, we also develop…
▽ More
Phones and their context-dependent variants have been the standard modeling units for conventional speech recognition systems, while characters and subwords have demonstrated their effectiveness for end-to-end recognition systems. We investigate the use of phone-based subwords, in particular, byte pair encoder (BPE), as modeling units for end-to-end speech recognition. In addition, we also developed multi-level language model-based decoding algorithms based on a pronunciation dictionary. Besides the use of the lexicon, which is easily available, our system avoids the need of additional expert knowledge or processing steps from conventional systems. Experimental results show that phone-based BPEs tend to yield more accurate recognition systems than the character-based counterpart. In addition, further improvement can be obtained with a novel one-pass joint beam search decoder, which efficiently combines phone- and character-based BPE systems. For Switchboard, our phone-based BPE system achieves 6.8\%/14.4\% word error rate (WER) on the Switchboard/CallHome portion of the test set while joint decoding achieves 6.3\%/13.3\% WER. On Fisher + Switchboard, joint decoding leads to 4.9\%/9.5\% WER, setting new milestones for telephony speech recognition.
△ Less
Submitted 21 June, 2021; v1 submitted 8 April, 2020;
originally announced April 2020.
-
Statistics of relative velocity for particles settling under gravity in a turbulent flow
Authors:
Akshay Bhatnagar
Abstract:
We study the joint probability distributions of separation, $R$, and radial component of the relative velocity, $V_{\rm R}$, of particles settling under gravity in a turbulent flow. We also obtain the moments of these distributions and analyze their anisotropy using spherical harmonics. We find that the qualitative nature of the joint distributions remains the same as no gravity case. Distribution…
▽ More
We study the joint probability distributions of separation, $R$, and radial component of the relative velocity, $V_{\rm R}$, of particles settling under gravity in a turbulent flow. We also obtain the moments of these distributions and analyze their anisotropy using spherical harmonics. We find that the qualitative nature of the joint distributions remains the same as no gravity case. Distributions of $V_{\rm R}$ for fixed values of $R$ show a power-law dependence on $V_{\rm R}$ for a range of $V_{\rm R}$, exponent of the power-law depends on the gravity. Effects of gravity are also manifested in the following ways: (a) moments of the distributions are anisotropic; the degree of anisotropy depends on particle's Stokes number, but does not depend on $R$ for small values of $R$. (b) mean velocity of collision between two particles is decreased for particles having equal Stokes numbers but increased for particles having different Stokes numbers. For the later, collision velocity is set by the difference in their settling velocities.
△ Less
Submitted 8 January, 2020;
originally announced January 2020.
-
The spreading of viruses by airborne aerosols: lessons from a first-passage-time problem for tracers in turbulent flows
Authors:
Akhilesh Kumar Verma,
Akshay Bhatnagar,
Dhrubaditya Mitra,
Rahul Pandit
Abstract:
We study the spreading of viruses, such as SARS-CoV-2, by airborne aerosols, via a new first-passage-time problem for Lagrangian tracers that are advected by a turbulent flow: By direct numerical simulations of the three-dimensional (3D) incompressible, Navier-Stokes equation, we obtain the time $t_R$ at which a tracer, initially at the origin of a sphere of radius $R$, crosses the surface of the…
▽ More
We study the spreading of viruses, such as SARS-CoV-2, by airborne aerosols, via a new first-passage-time problem for Lagrangian tracers that are advected by a turbulent flow: By direct numerical simulations of the three-dimensional (3D) incompressible, Navier-Stokes equation, we obtain the time $t_R$ at which a tracer, initially at the origin of a sphere of radius $R$, crosses the surface of the sphere \textit{for the first time}. We obtain the probability distribution function $\mathcal{P}(R,t_R)$ and show that it displays two qualitatively different behaviors: (a) for $R \ll L_{\rm I}$, $\mathcal{P}(R,t_R)$ has a power-law tail $\sim t_R^{-α}$, with the exponent $α= 4$ and $L_{\rm I}$ the integral scale of the turbulent flow; (b) for $l_{\rm I} \lesssim R $, the tail of $\mathcal{P}(R,t_R)$ decays exponentially. We develop models that allow us to obtain these asymptotic behaviors analytically. We show how to use $\mathcal{P}(R,t_R)$ to develop social-distancing guidelines for the mitigation of the spreading of airborne aerosols with viruses such as SARS-CoV-2.
△ Less
Submitted 18 June, 2020; v1 submitted 5 January, 2020;
originally announced January 2020.
-
Lagrangian Irreversibility and Eulerian Dissipation in Fully-Developed Turbulence
Authors:
Jason R. Picardo,
Akshay Bhatnagar,
Samriddhi Sankar Ray
Abstract:
We revisit the issue of Lagrangian irreversibility in the context of recent results [Xu, et al., PNAS, 111, 7558 (2014)] on flight-crash events in turbulent flows and show how extreme events in the Eulerian dissipation statistics are related to the statistics of power-fluctuations for tracer trajectories. Surprisingly, we find that particle trajectories in intense dissipation zones are dominated b…
▽ More
We revisit the issue of Lagrangian irreversibility in the context of recent results [Xu, et al., PNAS, 111, 7558 (2014)] on flight-crash events in turbulent flows and show how extreme events in the Eulerian dissipation statistics are related to the statistics of power-fluctuations for tracer trajectories. Surprisingly, we find that particle trajectories in intense dissipation zones are dominated by energy gains sharper than energy losses, contrary to flight-crashes, through a pressure-gradient driven take-off phenomenon. Our conclusions are rationalised by analysing data from simulations of three-dimensional intermittent turbulence, as well as from non-intermittent decimated flows. Lagrangian irreversibility is found to persist even in the latter case, wherein fluctuations of the dissipation rate are shown to be relatively mild and to follow probability distribution functions with exponential tails.
△ Less
Submitted 14 April, 2020; v1 submitted 28 August, 2019;
originally announced August 2019.
-
Voting Rules that are Unbiased but not Transitive-Symmetric
Authors:
Aadyot Bhatnagar
Abstract:
We explore the relation between two natural symmetry properties of voting rules. The first is transitive-symmetry -- the property of invariance to a transitive permutation group -- while the second is the "unbiased" property of every voter having the same influence for all i.i.d. probability measures. We show that these properties are distinct by two constructions -- one probabilistic, one explici…
▽ More
We explore the relation between two natural symmetry properties of voting rules. The first is transitive-symmetry -- the property of invariance to a transitive permutation group -- while the second is the "unbiased" property of every voter having the same influence for all i.i.d. probability measures. We show that these properties are distinct by two constructions -- one probabilistic, one explicit -- of rules that are unbiased but not transitive-symmetric.
△ Less
Submitted 2 July, 2019;
originally announced July 2019.
-
A Consideration of Cosmic Evolution from the Points of View of the Inflationary and Cyclic Theories
Authors:
Anantya Bhatnagar
Abstract:
This study reviews the advances made in inflationary theory, especially regarding the seeming disparity between inflation energy and dark energy, and their significance to cosmic evolution as a whole. I attempt to connect the two sources of expansion and thereby enhance the predictive capacity of the consensus model. I also attempt to contrast its strengths and weaknesses with those of the cyclic…
▽ More
This study reviews the advances made in inflationary theory, especially regarding the seeming disparity between inflation energy and dark energy, and their significance to cosmic evolution as a whole. I attempt to connect the two sources of expansion and thereby enhance the predictive capacity of the consensus model. I also attempt to contrast its strengths and weaknesses with those of the cyclic theory developed by Turok and Steinhardt, particularly where cosmic evolution over larger time scales is concerned. Furthermore, I endeavor to provide a physical meaning to the existence and workings of dark energy as well as dark matter in the cyclic model (as compared to their simply being extra parameters in the inflation picture), so that each observed component of the universe is given significance.
△ Less
Submitted 13 December, 2018;
originally announced December 2018.
-
Fine-grained Apparel Classification and Retrieval without rich annotations
Authors:
Aniket Bhatnagar,
Sanchit Aggarwal
Abstract:
The ability to correctly classify and retrieve apparel images has a variety of applications important to e-commerce, online advertising and internet search. In this work, we propose a robust framework for fine-grained apparel classification, in-shop and cross-domain retrieval which eliminates the requirement of rich annotations like bounding boxes and human-joints or clothing landmarks, and traini…
▽ More
The ability to correctly classify and retrieve apparel images has a variety of applications important to e-commerce, online advertising and internet search. In this work, we propose a robust framework for fine-grained apparel classification, in-shop and cross-domain retrieval which eliminates the requirement of rich annotations like bounding boxes and human-joints or clothing landmarks, and training of bounding box/ key-landmark detector for the same. Factors such as subtle appearance differences, variations in human poses, different shooting angles, apparel deformations, and self-occlusion add to the challenges in classification and retrieval of apparel items. Cross-domain retrieval is even harder due to the presence of large variation between online shop** images, usually taken in ideal lighting, pose, positive angle and clean background as compared with street photos captured by users in complicated conditions with poor lighting and cluttered scenes. Our framework uses compact bilinear CNN with tensor sketch algorithm to generate embeddings that capture local pairwise feature interactions in a translationally invariant manner. For apparel classification, we pass the feature embeddings through a softmax classifier, while, the in-shop and cross-domain retrieval pipelines use a triplet-loss based optimization approach, such that squared Euclidean distance between embeddings measures the dissimilarity between the images. Unlike previous works that relied on bounding box, key clothing landmarks or human joint detectors to assist the final deep classifier, proposed framework can be trained directly on the provided category labels or generated triplets for triplet loss optimization. Lastly, Experimental results on the DeepFashion fine-grained categorization, and in-shop and consumer-to-shop retrieval datasets provide a comparative analysis with previous work performed in the domain.
△ Less
Submitted 6 November, 2018;
originally announced November 2018.
-
Clustering and dynamic decoupling of dust grains in turbulent molecular clouds
Authors:
Lars Mattsson,
Akshay Bhatnagar,
Fred A. Gent,
Beatriz Villarroel
Abstract:
We present high resolution ($1024^3$) simulations of super-/hyper-sonic isothermal hydrodynamic turbulence inside an interstellar molecular cloud (resolving scales of typically 20 -- 100 AU), including a multi-disperse population of dust grains, i.e., a range of grain sizes is considered. Due to inertia, large grains (typical radius $a \gtrsim 1.0\,μ$m) will decouple from the gas flow, while small…
▽ More
We present high resolution ($1024^3$) simulations of super-/hyper-sonic isothermal hydrodynamic turbulence inside an interstellar molecular cloud (resolving scales of typically 20 -- 100 AU), including a multi-disperse population of dust grains, i.e., a range of grain sizes is considered. Due to inertia, large grains (typical radius $a \gtrsim 1.0\,μ$m) will decouple from the gas flow, while small grains ($a\lesssim 0.1\,μ$m) will tend to better trace the motions of the gas. We note that simulations with purely solenoidal forcing show somewhat more pronounced decoupling and less clustering compared to simulations with purely compressive forcing. Overall, small and large grains tend to cluster, while intermediate-size grains show essentially a random isotropic distribution. As a consequence of increased clustering, the grain-grain interaction rate is locally elevated; but since small and large grains are often not spatially correlated, it is unclear what effect this clustering would have on the coagulation rate. Due to spatial separation of dust and gas, a diffuse upper limit to the grain sizes obtained by condensational growth is also expected, since large (decoupled) grains are not necessarily located where the growth species in the molecular gas is.
△ Less
Submitted 2 November, 2018;
originally announced November 2018.
-
Relative velocities in bi-disperse turbulent aerosols: simulations and theory
Authors:
Akshay Bhatnagar,
K. Gustavsson,
B. Mehlig,
Dhrubaditya Mitra
Abstract:
We perform direct numerical simulations of a bi-disperse suspension of heavy spherical particles in forced, homogeneous, and isotropic three-dimensional turbulence. We compute the joint distribution of relative particle distances and longitudinal relative velocities between particles of different sizes, and compare the results with recent theoretical predictions [Meibohm et al. Phys. Rev. E 96 (20…
▽ More
We perform direct numerical simulations of a bi-disperse suspension of heavy spherical particles in forced, homogeneous, and isotropic three-dimensional turbulence. We compute the joint distribution of relative particle distances and longitudinal relative velocities between particles of different sizes, and compare the results with recent theoretical predictions [Meibohm et al. Phys. Rev. E 96 (2017) 061102] for the shape of this distribution. We also compute the moments of relative velocities as a function of particle separation, and compare with the theoretical predictions. We observe good agreement.
△ Less
Submitted 20 December, 2018; v1 submitted 27 September, 2018;
originally announced September 2018.
-
Learning to Search via Retrospective Imitation
Authors:
Jialin Song,
Ravi Lanka,
Albert Zhao,
Aadyot Bhatnagar,
Yisong Yue,
Masahiro Ono
Abstract:
We study the problem of learning a good search policy for combinatorial search spaces. We propose retrospective imitation learning, which, after initial training by an expert, improves itself by learning from \textit{retrospective inspections} of its own roll-outs. That is, when the policy eventually reaches a feasible solution in a combinatorial search tree after making mistakes and backtracks, i…
▽ More
We study the problem of learning a good search policy for combinatorial search spaces. We propose retrospective imitation learning, which, after initial training by an expert, improves itself by learning from \textit{retrospective inspections} of its own roll-outs. That is, when the policy eventually reaches a feasible solution in a combinatorial search tree after making mistakes and backtracks, it retrospectively constructs an improved search trace to the solution by removing backtracks, which is then used to further train the policy. A key feature of our approach is that it can iteratively scale up, or transfer, to larger problem sizes than those solved by the initial expert demonstrations, thus dramatically expanding its applicability beyond that of conventional imitation learning. We showcase the effectiveness of our approach on a range of tasks, including synthetic maze solving and combinatorial problems expressed as integer programs.
△ Less
Submitted 23 June, 2019; v1 submitted 3 April, 2018;
originally announced April 2018.
-
Risk Factors Associated with Mortality in Game of Thrones: A Longitudinal Cohort Study
Authors:
Suveen Angraal,
Ambika Bhatnagar,
Suraj Verma,
Sukhman Shergill,
Aakriti Gupta,
Rohan Khera
Abstract:
Objective: To assess mortality, and identify the risk factors associated with mortality in Game of Thrones (GoT). Design and Setting: A longitudinal cohort study in the fictional kingdom of Westeros and Essos. Participants: All the characters appearing in the GoT since airing of its first episode with screen time of greater than or equal to 5 minutes. Main Outcome Measures: All-cause mortality. Mu…
▽ More
Objective: To assess mortality, and identify the risk factors associated with mortality in Game of Thrones (GoT). Design and Setting: A longitudinal cohort study in the fictional kingdom of Westeros and Essos. Participants: All the characters appearing in the GoT since airing of its first episode with screen time of greater than or equal to 5 minutes. Main Outcome Measures: All-cause mortality. Multivariate Cox proportional hazard model was used to assess the risk factors associated with mortality, represented by hazard ratios, with episodes as the unit of time. Results: Of the 132 characters, followed up for a median time of 32 episodes, a total 89 (67.4%) characters died; with external invasive injury as the most common cause of death, attributing to 42.4% of the total deaths. Age (in decades) was a significant risk factor for death [HR, 1.24 (95% CI, 1.08-1.43), P=0.0001]. Although statistically non-significant, allegiance to house Targaryen [HR, 1.10 (95% CI, 0.32-3.77)] was associated with a higher risk for mortality per episode than house Stark. Characters residing in South were less likely to die than characters residing in North [HR, 0.58 (95% CI, 0.29-1.16), P=0.12]. Advisors showed a lower risk of mortality than the members of houses, with some statistical significance [HR, 0.39 (95% CI, 0.14-1.08), P=0.07]. Conclusions: There is a high mortality rate among the characters in GoT. Residing in the North and being a member of a house is very dangerous in GoT. Allegiance to house Stark trended to be safer than house Targaryen.
△ Less
Submitted 1 August, 2018; v1 submitted 12 February, 2018;
originally announced February 2018.
-
Heavy inertial particles in turbulent flows gain energy slowly but lose it rapidly
Authors:
Akshay Bhatnagar,
Anupam Gupta,
Dhrubaditya Mitra,
Rahul Pandit
Abstract:
We present an extensive numerical study of the time irreversibility of the dynamics of heavy inertial particles in three-dimensional, statistically homogeneous and isotropic turbulent flows. We show that the probability density function (PDF) of the increment, $W(τ)$, of a particle's energy over a time-scale $τ$ is non-Gaussian, and skewed towards negative values. This implies that, on average, pa…
▽ More
We present an extensive numerical study of the time irreversibility of the dynamics of heavy inertial particles in three-dimensional, statistically homogeneous and isotropic turbulent flows. We show that the probability density function (PDF) of the increment, $W(τ)$, of a particle's energy over a time-scale $τ$ is non-Gaussian, and skewed towards negative values. This implies that, on average, particles gain energy over a period of time that is longer than the duration over which they lose energy. We call this $\textit{slow gain}$ and $\textit{fast loss}$. We find that the third moment of $W(τ)$ scales as $τ^3$, for small values of $τ$. We show that the PDF of power-input $p$ is negatively skewed too; we use this skewness ${\rm Ir}$ as a measure of the time-irreversibility and we demonstrate that it increases sharply with the Stokes number ${\rm St}$, for small ${\rm St}$; this increase slows down at ${\rm St} \simeq 1$. Furthermore, we obtain the PDFs of $t^+$ and $t^-$, the times over which $p$ has, respectively, positive or negative signs, i.e., the particle gains or loses energy. We obtain from these PDFs a direct and natural quantification of the the slow-gain and fast-loss of the particles, because these PDFs possess exponential tails, whence we infer the characteristic loss and gain times $t_{\rm loss}$ and $t_{\rm gain}$, respectively; and we obtain $t_{\rm loss} < t_{\rm gain}$, for all the cases we have considered. Finally, we show that the slow-gain in energy of the particles is equally likely in vortical or strain-dominated regions of the flow; in contrast, the fast-loss of energy occurs with greater probability in the latter than in the former.
△ Less
Submitted 19 November, 2017;
originally announced November 2017.
-
Statistics of the relative velocity of particles in turbulent flows : monodisperse particles
Authors:
Akshay Bhatnagar,
K. Gustavsson,
Dhrubaditya Mitra
Abstract:
We use direct numerical simulations to calculate the joint probability density function of the relative distance $R$ and relative radial velocity component $V_R$ for a pair of heavy inertial particles suspended in homogeneous and isotropic turbulent flows. At small scales the distribution is scale invariant, with a scaling exponent that is related to the particle-particle correlation dimension in…
▽ More
We use direct numerical simulations to calculate the joint probability density function of the relative distance $R$ and relative radial velocity component $V_R$ for a pair of heavy inertial particles suspended in homogeneous and isotropic turbulent flows. At small scales the distribution is scale invariant, with a scaling exponent that is related to the particle-particle correlation dimension in phase space, $D_2$. It was argued [1, 2] that the scale invariant part of the distribution has two asymptotic regimes: (1) $|V_R| \ll R$ where the distribution depends solely on $R$; and (2) $|V_R| \gg R$ where the distribution is a function of $|V_R|$ alone. The probability distributions in these two regimes are matched along a straight line $|V_R| = z^\ast R$. Our simulations confirm that this is indeed correct. We further obtain $D_2$ and $z^\ast$ as a function of the Stokes number, ${\rm St}$. The former depends non-monotonically on ${\rm St}$ with a minimum at about ${\rm St} \approx 0.7$ and the latter has only a weak dependence on ${\rm St}$.
△ Less
Submitted 13 October, 2017;
originally announced October 2017.
-
Lagrangian Statistics for Navier-Stokes Turbulence under Fourier-mode reduction: Fractal and Homogeneous Decimations
Authors:
Michele Buzzicotti,
Akshay Bhatnagar,
Luca Biferale,
Alessandra S. Lanotte,
Samriddhi Sankar Ray
Abstract:
We study small-scale and high-frequency turbulent fluctuations in three-dimensional flows under Fourier-mode reduction. The Navier-Stokes equations are evolved on a restricted set of modes, obtained as a projection on a fractal or homogeneous Fourier set. We find a strong sensitivity (reduction) of the high-frequency variability of the Lagrangian velocity fluctuations on the degree of mode decimat…
▽ More
We study small-scale and high-frequency turbulent fluctuations in three-dimensional flows under Fourier-mode reduction. The Navier-Stokes equations are evolved on a restricted set of modes, obtained as a projection on a fractal or homogeneous Fourier set. We find a strong sensitivity (reduction) of the high-frequency variability of the Lagrangian velocity fluctuations on the degree of mode decimation, similarly to what is already reported for Eulerian statistics. This is quantified by a tendency towards a quasi-Gaussian statistics, i.e., to a reduction of intermittency, at all scales and frequencies. This can be attributed to a strong depletion of vortex filaments and of the vortex stretching mechanism. Nevertheless, we found that Eulerian and Lagrangian ensembles are still connected by a dimensional bridge-relation which is independent of the degree of Fourier-mode decimation.
△ Less
Submitted 2 January, 2017;
originally announced January 2017.
-
Map** the Microscale Origins of MRI Contrast with Subcellular NV Diamond Magnetometry
Authors:
Hunter C. Davis,
Pradeep Ramesh,
Aadyot Bhatnagar,
Audrey Lee-Gosselin,
John F. Barry,
David R. Glenn,
Ronald L. Walsworth,
Mikhail G. Shapiro
Abstract:
Magnetic resonance imaging (MRI) is a widely used biomedical imaging modality that derives much of its contrast from microscale magnetic field gradients in biological tissues. However, the connection between these sub-voxel field patterns and MRI contrast has not been studied experimentally. Here, we describe a new method to map subcellular magnetic fields in mammalian cells and tissues using nitr…
▽ More
Magnetic resonance imaging (MRI) is a widely used biomedical imaging modality that derives much of its contrast from microscale magnetic field gradients in biological tissues. However, the connection between these sub-voxel field patterns and MRI contrast has not been studied experimentally. Here, we describe a new method to map subcellular magnetic fields in mammalian cells and tissues using nitrogen vacancy diamond magnetometry and connect these maps to voxel-scale MRI contrast, providing insights for in vivo imaging and contrast agent design.
△ Less
Submitted 6 October, 2016;
originally announced October 2016.
-
How long do particles spend in vortical regions in turbulent flows?
Authors:
Akshay Bhatnagar,
Anupam Gupta,
Dhrubaditya Mitra,
Rahul Pandit,
Prasad Perlekar
Abstract:
We obtain the probability distribution functions (PDFs) of the time that a Lagrangian tracer or a heavy inertial particle spends in vortical or strain-dominated regions of a turbulent flow, by carrying out direct numerical simulation (DNS) of such particles advected by statistically steady, homogeneous and isotropic turbulence in the forced, three-dimensional, incompressible Navier-Stokes equation…
▽ More
We obtain the probability distribution functions (PDFs) of the time that a Lagrangian tracer or a heavy inertial particle spends in vortical or strain-dominated regions of a turbulent flow, by carrying out direct numerical simulation (DNS) of such particles advected by statistically steady, homogeneous and isotropic turbulence in the forced, three-dimensional, incompressible Navier-Stokes equation. We use the two invariants, $Q$ and $R$, of the velocity-gradient tensor to distinguish between vortical and strain-dominated regions of the flow and partition the $Q-R$ plane into four different regions depending on the topology of the flow; out of these four regions two correspond to vorticity-dominated regions of the flow and two correspond to strain-dominated ones. We obtain $Q$ and $R$ along the trajectories of tracers and heavy inertial particles and find out the time $\mathrm{t_{pers}}$ for which they remain in one of the four regions of the $Q-R$ plane. We find that the PDFs of $\mathrm{t_{pers}}$ display exponentially decaying tails for all four regions for tracers and heavy inertial particles. From these PDFs we extract characteristic times scales, which help us to quantify the time that such particles spend in vortical or strain-dominated regions of the flow.
△ Less
Submitted 8 September, 2016;
originally announced September 2016.
-
Toward Early and Order-of-Magnitude Cascade Prediction in Social Networks
Authors:
Ruocheng Guo,
Elham Shaabani,
Abhinav Bhatnagar,
Paulo Shakarian
Abstract:
When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to viral proportions - where viral can be defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this cl…
▽ More
When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to viral proportions - where viral can be defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this classification problem. In this paper, we devise a suite of measurements based on structural diversity - the variety of social contexts (communities) in which individuals partaking in a given cascade engage. We demonstrate these measures are able to distinguish viral from non-viral cascades, despite the severe imbalance of the data for this problem. Further, we leverage these measurements as features in a classification approach, successfully predicting microblogs that grow from 50 to 500 reposts with precision of 0.69 and recall of 0.52 for the viral class - despite this class comprising under 2% of samples. This significantly outperforms our baseline approach as well as the current state-of-the-art. We also show this approach also performs well for identifying if cascades observed for 60 minutes will grow to 500 reposts as well as demonstrate how we can tradeoff between precision and recall.
△ Less
Submitted 8 August, 2016;
originally announced August 2016.
-
Toward Order-of-Magnitude Cascade Prediction
Authors:
Ruocheng Guo,
Elham Shaabani,
Abhinav Bhatnagar,
Paulo Shakarian
Abstract:
When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to "viral" proportions -- where "viral" is defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this c…
▽ More
When a piece of information (microblog, photograph, video, link, etc.) starts to spread in a social network, an important question arises: will it spread to "viral" proportions -- where "viral" is defined as an order-of-magnitude increase. However, several previous studies have established that cascade size and frequency are related through a power-law - which leads to a severe imbalance in this classification problem. In this paper, we devise a suite of measurements based on "structural diversity" -- the variety of social contexts (communities) in which individuals partaking in a given cascade engage. We demonstrate these measures are able to distinguish viral from non-viral cascades, despite the severe imbalance of the data for this problem. Further, we leverage these measurements as features in a classification approach, successfully predicting microblogs that grow from 50 to 500 reposts with precision of 0.69 and recall of 0.52 for the viral class - despite this class comprising under 2\% of samples. This significantly outperforms our baseline approach as well as the current state-of-the-art. Our work also demonstrates how we can tradeoff between precision and recall.
△ Less
Submitted 13 August, 2015;
originally announced August 2015.
-
Biometrics for Child Vaccination and Welfare: Persistence of Fingerprint Recognition for Infants and Toddlers
Authors:
Anil K. Jain,
Sunpreet S. Arora,
Lacey Best-Rowden,
Kai Cao,
Prem Sewak Sudhish,
Anjoo Bhatnagar
Abstract:
With a number of emerging applications requiring biometric recognition of children (e.g., tracking child vaccination schedules, identifying missing children and preventing newborn baby swaps in hospitals), investigating the temporal stability of biometric recognition accuracy for children is important. The persistence of recognition accuracy of three of the most commonly used biometric traits (fin…
▽ More
With a number of emerging applications requiring biometric recognition of children (e.g., tracking child vaccination schedules, identifying missing children and preventing newborn baby swaps in hospitals), investigating the temporal stability of biometric recognition accuracy for children is important. The persistence of recognition accuracy of three of the most commonly used biometric traits (fingerprints, face and iris) has been investigated for adults. However, persistence of biometric recognition accuracy has not been studied systematically for children in the age group of 0-4 years. Given that very young children are often uncooperative and do not comprehend or follow instructions, in our opinion, among all biometric modalities, fingerprints are the most viable for recognizing children. This is primarily because it is easier to capture fingerprints of young children compared to other biometric traits, e.g., iris, where a child needs to stare directly towards the camera to initiate iris capture. In this report, we detail our initiative to investigate the persistence of fingerprint recognition for children in the age group of 0-4 years. Based on preliminary results obtained for the data collected in the first phase of our study, use of fingerprints for recognition of 0-4 year-old children appears promising.
△ Less
Submitted 17 April, 2015;
originally announced April 2015.
-
Universal Statistical Properties of Inertial-particle Trajectories in Three-dimensional, Homogeneous, Isotropic, Fluid Turbulence
Authors:
Akshay Bhatnagar,
Anupam Gupta,
Dhrubaditya Mitra,
Prasad Perlekar,
Rahul Pandit
Abstract:
We uncover universal statistical properties of the trajectories of heavy inertial particles in three-dimensional, statistically steady, homogeneous, and isotropic turbulent flows by extensive direct numerical simulations. We show that the probability distribution functions (PDFs) $P(φ)$, of the angle $φ$ between the Eulerian velocity ${\bf u}$ and the particle velocity ${\bf v}$, at this point and…
▽ More
We uncover universal statistical properties of the trajectories of heavy inertial particles in three-dimensional, statistically steady, homogeneous, and isotropic turbulent flows by extensive direct numerical simulations. We show that the probability distribution functions (PDFs) $P(φ)$, of the angle $φ$ between the Eulerian velocity ${\bf u}$ and the particle velocity ${\bf v}$, at this point and time, shows a power-law region in which $P(φ) \sim φ^{-γ}$, with a new universal exponent $γ\simeq 4$. Furthermore, the PDFs of the trajectory curvature $κ$ and modulus $θ$ of the torsion $\vartheta$ have power-law tails that scale, respectively, as $P(κ) \sim κ^{-h_κ}$, as $κ\to \infty$, and $P(θ) \sim θ^{-h_θ}$, as $θ\to \infty$, with exponents $h_κ\simeq 2.5$ and $h_θ\simeq 3$ that are universal to the extent that they do not depend on the Stokes number ${\rm St}$ (given our error bars). We also show that $γ$, $h_κ$ and $h_θ$ can be obtained by using simple stochastic models. We characterize the complexity of heavy-particle trajectories by the number $N_{\rm I}(t,{\rm St})$ of points (up until time $t$) at which $\vartheta$ changes sign. We show that $n_{\rm I}({\rm St}) \equiv \lim_{t\to\infty} \frac{N_{\rm I}(t,{\rm St})}{t} \sim {\rm St}^{-Δ}$, with $Δ\simeq 0.4$ a universal exponent.
△ Less
Submitted 8 December, 2014;
originally announced December 2014.
-
Hydrogen storage in carbon derived from solid endosperm of coconut
Authors:
Viney Dixit,
Ashish Bhatnagar,
R. R. Shahi,
T. P. Yadav,
O. N. Srivastava
Abstract:
Carbons are being widely investigated as hydrogen storage material owing to their light weight, fast hydrogen adsorption kinetics and cost effectiveness. However, these materials suffer from low hydrogen storage capacity, particularly at room temperature. The aim of the present study is to develop carbon-based material from natural bio-precursor which shows at least moderate hydrogen storage at ro…
▽ More
Carbons are being widely investigated as hydrogen storage material owing to their light weight, fast hydrogen adsorption kinetics and cost effectiveness. However, these materials suffer from low hydrogen storage capacity, particularly at room temperature. The aim of the present study is to develop carbon-based material from natural bio-precursor which shows at least moderate hydrogen storage at room temperature. For this purpose, hydrogenation characteristics of carbon derived from solid endosperm of coconut is studied. Solid endosperm and its carbonized version have a native grown distribution of light elements like K (from KCl), Na, Mg, etc. The hydrogen storage measurements reveal that the as synthesized materials have good hydrogen adsorption and desorption capacity with fast kinetics. The synthesized material adsorbs 2.30 wt. % at room temperature and 8.00 wt.% at liquid nitrogen temperature under 70 atm pressure. The present work is likely to initiate research on native light metal bearing carbon derived from natural precursors.
△ Less
Submitted 25 September, 2014;
originally announced September 2014.
-
The Moduli Space of Totally Marked Degree Two Rational Maps
Authors:
Anupam Bhatnagar
Abstract:
A rational map $φ: \mathbb{P}^1 \to \mathbb{P}^1$ along with an ordered list of fixed and critical points is called a totally marked rational map. The space of totally marked degree two rational maps, $Rat^{tm}_2$ can be parametrized by an affine open subset of $(\mathbb{P}^1)^5$. We consider the natural action of $SL_2$ on $Rat^{tm}_2$ induced from the action of $SL_2$ on $(\mathbb{P}^1)^5$ and p…
▽ More
A rational map $φ: \mathbb{P}^1 \to \mathbb{P}^1$ along with an ordered list of fixed and critical points is called a totally marked rational map. The space of totally marked degree two rational maps, $Rat^{tm}_2$ can be parametrized by an affine open subset of $(\mathbb{P}^1)^5$. We consider the natural action of $SL_2$ on $Rat^{tm}_2$ induced from the action of $SL_2$ on $(\mathbb{P}^1)^5$ and prove that the quotient space $Rat^{tm}_2/SL_2$ exists as a scheme. The quotient is isomorphic to a Del Pezzo surface with the isomorphism being defined over $\mathbb{Z}[1/2]$.
△ Less
Submitted 17 August, 2014;
originally announced August 2014.
-
Isotriviality and the space of morphisms on projective varieties
Authors:
Anupam Bhatnagar,
Alon Levy
Abstract:
Let $K=k(C)$ be the function field of a smooth projective curve $C$ over an infinite field $k$, let $X$ be a projective variety over $k$. We prove two results. First, we show with some conditions that a $K$-morphism $φ: X_K \to X_K$ of degree at least two is isotrivial if and only if $φ$ has potential good reduction at all places $v$ of $K$. Second, let $(X,φ), (Y,ψ)$ be dynamical systems where…
▽ More
Let $K=k(C)$ be the function field of a smooth projective curve $C$ over an infinite field $k$, let $X$ be a projective variety over $k$. We prove two results. First, we show with some conditions that a $K$-morphism $φ: X_K \to X_K$ of degree at least two is isotrivial if and only if $φ$ has potential good reduction at all places $v$ of $K$. Second, let $(X,φ), (Y,ψ)$ be dynamical systems where $X,Y$ are defined over $k$ and $g:X_{K} \to Y_{K}$ a dominant $K$-morphism, such that $g \circ φ= ψ\circ g$. We show under certain conditions that if $φ$ is defined over $k$, then $ψ$ is defined over $k$.
△ Less
Submitted 17 November, 2013; v1 submitted 26 March, 2013;
originally announced March 2013.
-
An Approach Of Substitution Method Based On ASCII Codes In Encryption Technique
Authors:
Avinash Sharma,
Anurag Bhatnagar,
Nikhar Tak,
Anuradha Sharma,
Jitendra Avasthi,
Prerna sharma
Abstract:
In poly alphabetic substitution the plain texts letters are enciphered differently according to their position. The name poly alphabetic suggests that there are more than one key so we have used two keys combination instead of just one, in order to produce the cipher text. We can also use three or more keys to make the enciphering process more complicated. In this paper have produced ASCII Codes o…
▽ More
In poly alphabetic substitution the plain texts letters are enciphered differently according to their position. The name poly alphabetic suggests that there are more than one key so we have used two keys combination instead of just one, in order to produce the cipher text. We can also use three or more keys to make the enciphering process more complicated. In this paper have produced ASCII Codes of the plain text and then we have reversed it said reverse ASCII Codes and then we have generated two keys K1 is generated by addition of reverse ASCII Codes and K2 is generated by addition of ASCII Codes. Then these K1 and K2 Keys are alternatively applied on Reverse ASCII codes in order to produce cipher text. On the Destination hand Deciphering is used to produce the plain text again. Our technique generates random cipher text for the same plain text and this is the major advantage of our technique.
△ Less
Submitted 18 February, 2013;
originally announced February 2013.
-
Projective varieties covered by isotrivial families
Authors:
Anupam Bhatnagar
Abstract:
Let X,Y be projective schemes over a discrete valuation ring R, where Y is generically smooth and g:X \to Y a surjective R-morphism such that g_*\mathcal{O}_X = \mathcal{O}_Y. We show that if the family X \to Spec(R) is isotrivial, then the generic fiber of the family Y \to Spec(R) is isotrivial.
Let X,Y be projective schemes over a discrete valuation ring R, where Y is generically smooth and g:X \to Y a surjective R-morphism such that g_*\mathcal{O}_X = \mathcal{O}_Y. We show that if the family X \to Spec(R) is isotrivial, then the generic fiber of the family Y \to Spec(R) is isotrivial.
△ Less
Submitted 24 January, 2013; v1 submitted 25 October, 2011;
originally announced October 2011.
-
Very ample polarized self maps extend to projective space
Authors:
Anupam Bhatnagar,
Lucien Szpiro
Abstract:
Let $X$ be a projective variety defined over an infinite field, equipped with a line bundle $L$, giving an embedding of $X$ into $\mb{P}^m$ and let $φ: X \to X$ be a morphism such that $φ^*L \cong L^{\otimes q}, q\geq 2$. Then there exists an integer $r>0$ extending $φ^r$ to $\mb{P}^m$.
Let $X$ be a projective variety defined over an infinite field, equipped with a line bundle $L$, giving an embedding of $X$ into $\mb{P}^m$ and let $φ: X \to X$ be a morphism such that $φ^*L \cong L^{\otimes q}, q\geq 2$. Then there exists an integer $r>0$ extending $φ^r$ to $\mb{P}^m$.
△ Less
Submitted 7 December, 2011; v1 submitted 13 October, 2010;
originally announced October 2010.
-
Variation of solar irradiance and mode frequencies during Maunder minimum
Authors:
A. Bhatnagar,
Kiran Jain,
S. C. Tripathy
Abstract:
Using the sunspot numbers reported during the Maunder minimum and the empirical relations between the mode frequencies and solar activity indices, the variations in the total solar irradiance and 10.7 cm radio flux for the period 1645 to 1715 is estimated. We find that the total solar irradiance and radio flux during the Maunder minimum decreased by 0.19% and 52% respectively, as compared to the…
▽ More
Using the sunspot numbers reported during the Maunder minimum and the empirical relations between the mode frequencies and solar activity indices, the variations in the total solar irradiance and 10.7 cm radio flux for the period 1645 to 1715 is estimated. We find that the total solar irradiance and radio flux during the Maunder minimum decreased by 0.19% and 52% respectively, as compared to the values for solar cycle 22.
△ Less
Submitted 3 January, 2002;
originally announced January 2002.
-
How good are the predictions for oscillation frequencies
Authors:
Kiran Jain,
S. C. Tripathy,
A. Bhatnagar
Abstract:
We have used available intermediate degree p-mode frequencies for the solar cycle 23 to check the validity of previously derived empirical relations for frequency shifts (Jain et al.: 2000, Solar Phys., 192, 487). We find that the calculated and observed frequency shifts during the rising phase of the cycle 23 are in good agreement. The observed frequency shift from minimum to maximum of this cy…
▽ More
We have used available intermediate degree p-mode frequencies for the solar cycle 23 to check the validity of previously derived empirical relations for frequency shifts (Jain et al.: 2000, Solar Phys., 192, 487). We find that the calculated and observed frequency shifts during the rising phase of the cycle 23 are in good agreement. The observed frequency shift from minimum to maximum of this cycle as calculated from MDI frequency data sets is 251 $\pm$ 7 nHz and from GONG data is 238 $\pm$ 11 nHz. These values are in close agreement with the empirically predicted value of 271 $\pm$ 22 nHz.
△ Less
Submitted 3 January, 2002;
originally announced January 2002.
-
On the solar rotation rate in the upper convection zone
Authors:
Kiran Jain,
S. C. Tripathy,
A. Bhatnagar
Abstract:
We present results on variation in rotation rate in the upper convection zone using data from GONG and MDI/SOHO covering a period of more than four years. We find that the first few odd-order splitting coefficients vary systematically with the solar cycle. The rotation rate near the solar surface calculated from analytical methods agrees well with that of inversion techniques. The residual rotat…
▽ More
We present results on variation in rotation rate in the upper convection zone using data from GONG and MDI/SOHO covering a period of more than four years. We find that the first few odd-order splitting coefficients vary systematically with the solar cycle. The rotation rate near the solar surface calculated from analytical methods agrees well with that of inversion techniques. The residual rotation rate in the outer layers seem to be correlated with the solar activity.
△ Less
Submitted 12 March, 2001;
originally announced March 2001.