-
BOrg: A Brain Organoid-Based Mitosis Dataset for Automatic Analysis of Brain Diseases
Authors:
Muhammad Awais,
Mehaboobathunnisa Sahul Hameed,
Bidisha Bhattacharya,
Orly Reiner,
Rao Muhammad Anwer
Abstract:
Recent advances have enabled the study of human brain development using brain organoids derived from stem cells. Quantifying cellular processes like mitosis in these organoids offers insights into neurodevelopmental disorders, but the manual analysis is time-consuming, and existing datasets lack specific details for brain organoid studies. We introduce BOrg, a dataset designed to study mitotic events in the embryonic development of the brain using confocal microscopy images of brain organoids. BOrg utilizes an efficient annotation pipeline with sparse point annotations and techniques that minimize expert effort, overcoming limitations of standard deep learning approaches on sparse data. We adapt and benchmark state-of-the-art object detection and cell counting models on BOrg for detecting and analyzing mitotic cells across prophase, metaphase, anaphase, and telophase stages. Our results demonstrate these adapted models significantly improve mitosis analysis efficiency and accuracy for brain organoid research compared to existing methods. BOrg facilitates the development of automated tools to quantify statistics like mitosis rates, aiding mechanistic studies of neurodevelopmental processes and disorders. Data and code are available at https://github.com/awaisrauf/borg.
Submitted 27 June, 2024;
originally announced June 2024.
-
Infinite dimensional dynamical maps
Authors:
Bihalan Bhattacharya,
Uwe Franz,
Saikat Patra,
Ritabrata Sengupta
Abstract:
Completely positive trace preserving maps are widely used in quantum information theory. These are mostly studied using the master equation perspective. A central part in this theory is to study whether a given system of dynamical maps $\{Λ_t: t \ge 0\}$ is Markovian or non-Markovian. We study the problem when the underlying Hilbert space is infinite dimensional. We construct a sufficient condition for checking P (resp. CP) divisibility of dynamical maps. We construct several examples where the underlying Hilbert space may not be finite dimensional. We also give special emphasis to Gaussian dynamical maps and obtain a version of our result for them.
Submitted 27 June, 2024;
originally announced June 2024.
-
Flux-density stability and temporal changes in spectra of millisecond pulsars using GMRT
Authors:
Rahul Sharan,
Bhaswati Bhattacharyaa,
Sangita Kumari,
Jayanta Roy,
Ankita Ghosh
Abstract:
This paper presents an investigation of spectral properties of 10 millisecond pulsars (MSPs) discovered by the uGMRT, observed from 2017-2023 using band 3 (300-500 MHz) and 4 (550-750 MHz) of uGMRT. For these MSPs, we have reported a range of spectral indices from ~0 to -4.8, while averaging the full observing band and all the observing epochs. For every MSP, we calculated the mean flux densities across 7-8 sub-bands each with approximately 25 MHz bandwidth spanning band 3 and band 4. We computed their modulation indices as well as average and maximum-to-median flux densities within each subband. Using a temporal variation of flux density we calculated the refractive scintillation time scales and estimated structure function with time lag for 8 MSPs in the sample. We note a significant temporal evolution of the in-band spectra, classified into three categories based on the nature of the best-fit power-law spectra, having single positive spectral indices, multiple broken power law, and single negative spectral indices. Additionally, indications of low-frequency turnover and a temporal variation of the turnover frequency (to the extent that turnover was observed for some of the epochs while not seen for the rest) were noted for all the MSPs. To the best of our knowledge, this is the first systematic investigation probing temporal changes in the MSP spectra as well as in turnover frequency. Future exploration with dense monitoring combined with modeling of spectra can provide vital insight into the intrinsic emission properties of the MSPs and ISM properties.
Submitted 6 June, 2024;
originally announced June 2024.
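The two quantities named in the abstract above, the spectral index and the modulation index, can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes a single power law $S(\nu) = S_0 \nu^{\alpha}$ fitted by ordinary least squares in log-log space, and the common definition of the modulation index as the standard deviation of the flux density divided by its mean. The function names and sample values are hypothetical.

```python
import math

def fit_spectral_index(freqs_mhz, fluxes):
    """Least-squares fit of log10(S) = log10(S0) + alpha * log10(nu).

    Returns (alpha, S0), where S0 is the extrapolated flux density at 1 MHz.
    Assumes a single unbroken power law, which is only one of the spectral
    shapes the abstract mentions.
    """
    xs = [math.log10(f) for f in freqs_mhz]
    ys = [math.log10(s) for s in fluxes]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    alpha = sxy / sxx
    s0 = 10 ** (mean_y - alpha * mean_x)
    return alpha, s0

def modulation_index(fluxes):
    """Standard deviation over mean of flux densities across epochs (assumed definition)."""
    n = len(fluxes)
    mean = sum(fluxes) / n
    variance = sum((s - mean) ** 2 for s in fluxes) / n
    return math.sqrt(variance) / mean

# Hypothetical sub-band centre frequencies spanning 300-750 MHz.
sub_bands = [325, 375, 425, 475, 575, 650, 725]
synthetic = [10.0 * (f / 400.0) ** -1.5 for f in sub_bands]
alpha, s0 = fit_spectral_index(sub_bands, synthetic)
```

On noise-free synthetic data the fit recovers the input index; with real measurements one would weight the fit by the flux-density uncertainties, which this sketch omits.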
-
Quasielastic Lepton-Nucleus Scattering and the Correlated Fermi Gas Model
Authors:
Bhubanjyoti Bhattacharya,
Sam Carey,
Erez O. Cohen,
Gil Paz
Abstract:
The neutrino research program in the coming decades will require improved precision. A major source of uncertainty is the interaction of neutrinos with nuclei that serve as targets for such experiments. Broadly speaking, this interaction often depends, e.g., for charge-current quasi-elastic scattering, on the combination of "nucleon physics", expressed by form factors, and "nuclear physics", expressed by a nuclear model. It is important to get a good handle on both. We present a fully analytic implementation of the Correlated Fermi Gas Model for electron-nucleus and charge-current quasi-elastic neutrino-nucleus scattering. The implementation is used to compare separately form factors and nuclear model effects for both electron-carbon and neutrino-carbon scattering data.
Submitted 8 May, 2024;
originally announced May 2024.
-
Higher-Order Graphon Theory: Fluctuations, Degeneracies, and Inference
Authors:
Anirban Chatterjee,
Soham Dan,
Bhaswar B. Bhattacharya
Abstract:
Exchangeable random graphs, which include some of the most widely studied network models, have emerged as the mainstay of statistical network analysis in recent years. Graphons, which are the central objects in graph limit theory, provide a natural way to sample exchangeable random graphs. It is well known that network moments (motif/subgraph counts) identify a graphon (up to an isomorphism), hence, understanding the sampling distribution of subgraph counts in random graphs sampled from a graphon is pivotal for nonparametric network inference. In this paper, we derive the joint asymptotic distribution of any finite collection of network moments in random graphs sampled from a graphon, that includes both the non-degenerate case (where the distribution is Gaussian) as well as the degenerate case (where the distribution has both Gaussian or non-Gaussian components). This provides the higher-order fluctuation theory for subgraph counts in the graphon model. We also develop a novel multiplier bootstrap for graphons that consistently approximates the limiting distribution of the network moments (both in the Gaussian and non-Gaussian regimes). Using this and a procedure for testing degeneracy, we construct joint confidence sets for any finite collection of motif densities. This provides a general framework for statistical inference based on network moments in the graphon model. To illustrate the broad scope of our results we also consider the problem of detecting global structure (that is, testing whether the graphon is a constant function) based on small subgraphs. We propose a consistent test for this problem, invoking celebrated results on quasi-random graphs, and derive its limiting distribution both under the null and the alternative.
Submitted 21 April, 2024;
originally announced April 2024.
-
A class of Schwarz qubit maps with diagonal unitary and orthogonal symmetries
Authors:
Dariusz Chruściński,
Bihalan Bhattacharya
Abstract:
A class of unital qubit maps displaying diagonal unitary and orthogonal symmetries is analyzed. Such maps have already found many applications in quantum information theory. We provide a complete characterization of this class of maps, showing an intricate relation between positivity, the operator Schwarz inequality, and complete positivity. Finally, it is shown how to generalize the entire picture beyond the unital case (so-called generalized Schwarz maps). Interestingly, the first example of a Schwarz but not completely positive map, found by Choi, belongs to our class. As a case study we provide a full characterization of Pauli maps. Our analysis leads to a generalization of the seminal Fujiwara-Algoet conditions for Pauli quantum channels.
Submitted 16 April, 2024;
originally announced April 2024.
-
Moderate Deviation and Berry-Esseen Bounds in the $p$-Spin Curie-Weiss Model
Authors:
Somabha Mukherjee,
Tianyu Liu,
Bhaswar B. Bhattacharya
Abstract:
Limit theorems for the magnetization in the $p$-spin Curie-Weiss model, for $p \geq 3$, have been derived recently by Mukherjee et al. (2021). In this paper, we strengthen these results by proving Cramér-type moderate deviation theorems and Berry-Esseen bounds for the magnetization (suitably centered and scaled). In particular, we show that the rate of convergence is $O(N^{-\frac{1}{2}})$ when the magnetization has asymptotically Gaussian fluctuations, and it is $O(N^{-\frac{1}{4}})$ when the fluctuations are non-Gaussian. As an application, we derive a Berry-Esseen bound for the maximum pseudolikelihood estimate of the inverse temperature in the $p$-spin Curie-Weiss model with no external field, for all points in the parameter space where consistent estimation is possible.
Submitted 21 March, 2024;
originally announced March 2024.
-
Growth Rate of the Number of Empty Triangles in the Plane
Authors:
Bhaswar B. Bhattacharya,
Sandip Das,
Sk Samim Islam,
Saumya Sen
Abstract:
Given a set $P$ of $n$ points in the plane, in general position, denote by $N_Δ(P)$ the number of empty triangles with vertices in $P$. In this paper we investigate by how much $N_Δ(P)$ changes if a point $x$ is removed from $P$. By constructing a graph $G_P(x)$ based on the arrangement of the empty triangles incident on $x$, we transform this geometric problem to the problem of counting triangles in the graph $G_P(x)$. We study properties of the graph $G_P(x)$ and, in particular, show that it is kite-free. This relates the growth rate of the number of empty triangles to the famous Ruzsa-Szemerédi problem.
Submitted 12 February, 2024;
originally announced February 2024.
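The quantity $N_Δ(P)$ from the abstract above, and its change when a point $x$ is removed, can be made concrete with a brute-force sketch. This is only an illustration of the definitions, not the graph construction $G_P(x)$ used in the paper: it assumes points in general position (no three collinear) given as coordinate tuples, and the function names are hypothetical. The running time is $O(n^4)$, so it is suitable only for small examples.

```python
from itertools import combinations

def orient(a, b, c):
    """Twice the signed area of triangle abc (positive if counter-clockwise)."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])

def strictly_inside(p, a, b, c):
    """True if p lies strictly inside triangle abc."""
    d1, d2, d3 = orient(a, b, p), orient(b, c, p), orient(c, a, p)
    return (d1 > 0 and d2 > 0 and d3 > 0) or (d1 < 0 and d2 < 0 and d3 < 0)

def count_empty_triangles(points):
    """Number of triangles with vertices in `points` containing no other point."""
    count = 0
    for a, b, c in combinations(points, 3):
        others = (p for p in points if p not in (a, b, c))
        if not any(strictly_inside(p, a, b, c) for p in others):
            count += 1
    return count

def change_on_removal(points, x):
    """N(P) - N(P minus x): how much the empty-triangle count changes when x is removed."""
    remaining = [p for p in points if p != x]
    return count_empty_triangles(points) - count_empty_triangles(remaining)

# A triangle with one interior point: the three small triangles are empty,
# the outer triangle is not.
P = [(0, 0), (6, 0), (0, 6), (1, 1)]
```

Note that removing a point can also create empty triangles among the remaining points (here the outer triangle becomes empty), which is why the change is computed as a difference of two full counts.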
-
Anomalies in Hadronic $B$ Decays
Authors:
Raphaël Berthiaume,
Bhubanjyoti Bhattacharya,
Rida Boumris,
Alexandre Jean,
Suman Kumbhakar,
David London
Abstract:
In this paper, we perform fits to $B \to PP$ decays, where $B = \{B^0, B^+, B_s^0\}$ and the pseudoscalar $P = \{π, K\}$, under the assumption of flavor SU(3) symmetry [SU(3)$_F$]. Although the fits to $ΔS=0$ or $ΔS=1$ decays individually are good, the combined fit is very poor: there is a $3.6σ$ disagreement with the SM. One can remove this discrepancy by adding SU(3)$_F$-breaking effects, but 1000\% SU(3)$_F$ breaking is required. The above results are rigorous, group-theoretically -- no dynamical assumptions have been made. When one adds an assumption motivated by QCD factorization, the discrepancy with the SM grows to $4.4σ$.
Submitted 11 December, 2023; v1 submitted 29 November, 2023;
originally announced November 2023.
-
An anthropomorphic continuum robotic neck actuated by SMA spring-based multipennate muscle architecture
Authors:
Ratnangshu Das,
Yashaswi Sinha,
Anirudha Bhattacharjee,
Bishakh Bhattacharya
Abstract:
This work presents a novel Shape Memory Alloy spring actuated continuum robotic neck that derives inspiration from pennate muscle architecture. The proposed design has 2DOF, and experimental studies reveal that the designed joint can replicate the human head's anthropomorphic range of motion. We enumerate the analytical modelling for SMA actuators and the kinematic model of the proposed design configuration. A series of experiments were conducted to assess the performance of the anthropomorphic neck by measuring the range of motion with varying input currents. Furthermore, the experiments were conducted to validate the analytical model of the SMA Multiphysics and the continuum backbone. The existing humanoid necks have been powered by conventional actuators that have relatively low energy efficiency and are prone to wear. The current research envisages the application of nonconventional actuators, such as SMA springs with a specific geometric configuration, that yield a high power-to-weight ratio and deliver smooth motion for continuum robots, as demonstrated in this work.
Submitted 7 September, 2023;
originally announced September 2023.
-
Charmless $B\to PPP$ Decays: the Fully-Antisymmetric Final State
Authors:
Bhubanjyoti Bhattacharya,
Mirjam Fines-Neuschild,
Andrea Houck,
Maxime Imbeault,
Alexandre Jean,
David London
Abstract:
Under flavor $SU(3)$ symmetry (SU(3)$_F$), the final-state particles in $B\to PPP$ decays ($P$ is a pseudoscalar meson) are treated as identical, and the $PPP$ must be in a fully-symmetric (FS) state, a fully-antisymmetric (FA) state, or in one of four mixed states. In this paper, we present the formalism for the FA states. We write the amplitudes for the 22 $B\to PPP$ decays that can be in an FA state in terms of both SU(3)$_F$ reduced matrix elements and diagrams. This shows the equivalence of diagrams and SU(3)$_F$. We also give 15 relations among the amplitudes in the SU(3)$_F$ limit, as well as the additional four that appear when the diagrams $E$/$A$/$PA$ are neglected. We present sets of $B \to PPP$ decays that can be used to extract $γ$ using the FA amplitudes. The value(s) of $γ$ found in this way can be compared with the value(s) found using the FS states.
Submitted 4 January, 2024; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Assistive Chatbots for healthcare: a succinct review
Authors:
Basabdatta Sen Bhattacharya,
Vibhav Sinai Pissurlenkar
Abstract:
Artificial Intelligence (AI) for supporting healthcare services has never been more necessitated than by the recent global pandemic. Here, we review the state-of-the-art in AI-enabled Chatbots in healthcare proposed during the last 10 years (2013-2023). The focus on AI-enabled technology is because of its potential for enhancing the quality of human-machine interaction via Chatbots, reducing dependence on human-human interaction and saving man-hours. Our review indicates that there are a handful of (commercial) Chatbots that are being used for patient support, while there are others (non-commercial) that are in the clinical trial phases. However, there is a lack of trust in this technology regarding patient safety and data protection, as well as a lack of wider awareness of its benefits among the healthcare workers and professionals. Also, patients have expressed dissatisfaction with the Natural Language Processing (NLP) skills of the Chatbots in comparison to humans. Notwithstanding the recent introduction of ChatGPT that has raised the bar for the NLP technology, this Chatbot cannot be trusted with patient safety and medical ethics without thorough and rigorous checks to serve in the `narrow' domain of assistive healthcare. Our review suggests that to enable deployment and integration of AI-enabled Chatbots in public health services, the need of the hour is: to build technology that is simple and safe to use; to build confidence in the technology among: (a) the medical community by focussed training and development; (b) the patients and wider community through outreach.
Submitted 8 August, 2023;
originally announced August 2023.
-
Degree Heterogeneity in Higher-Order Networks: Inference in the Hypergraph $\boldsymbolβ$-Model
Authors:
Sagnik Nandy,
Bhaswar B. Bhattacharya
Abstract:
The $\boldsymbolβ$-model for random graphs is commonly used for representing pairwise interactions in a network with degree heterogeneity. Going beyond pairwise interactions, Stasi et al. (2014) introduced the hypergraph $\boldsymbolβ$-model for capturing degree heterogeneity in networks with higher-order (multi-way) interactions. In this paper we initiate the rigorous study of the hypergraph $\boldsymbolβ$-model with multiple layers, which allows for hyperedges of different sizes across the layers. To begin with, we derive the rates of convergence of the maximum likelihood (ML) estimate and establish their minimax rate optimality. We also derive the limiting distribution of the ML estimate and construct asymptotically valid confidence intervals for the model parameters. Next, we consider the goodness-of-fit problem in the hypergraph $\boldsymbolβ$-model. Specifically, we establish the asymptotic normality of the likelihood ratio (LR) test under the null hypothesis, derive its detection threshold, and also its limiting power at the threshold. Interestingly, the detection threshold of the LR test turns out to be minimax optimal, that is, all tests are asymptotically powerless below this threshold. The theoretical results are further validated in numerical experiments. In addition to developing the theoretical framework for estimation and inference for hypergraph $\boldsymbolβ$-models, the above results fill a number of gaps in the graph $\boldsymbolβ$-model literature, such as the minimax optimality of the ML estimates and the non-null properties of the LR test, which, to the best of our knowledge, have not been studied before.
Submitted 5 June, 2024; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Bootstrapped Edge Count Tests for Nonparametric Two-Sample Inference Under Heterogeneity
Authors:
Trambak Banerjee,
Bhaswar B. Bhattacharya,
Gourab Mukherjee
Abstract:
Nonparametric two-sample testing is a classical problem in inferential statistics. While modern two-sample tests, such as the edge count test and its variants, can handle multivariate and non-Euclidean data, contemporary gargantuan datasets often exhibit heterogeneity due to the presence of latent subpopulations. Direct application of these tests, without regulating for such heterogeneity, may lead to incorrect statistical decisions. We develop a new nonparametric testing procedure that accurately detects differences between the two samples in the presence of unknown heterogeneity in the data generation process. Our framework handles this latent heterogeneity through a composite null that entertains the possibility that the two samples arise from a mixture distribution with identical component distributions but with possibly different mixing weights. In this regime, we study the asymptotic behavior of the weighted edge count test statistic and show that it can be effectively re-calibrated to detect arbitrary deviations from the composite null. For practical implementation we propose a Bootstrapped Weighted Edge Count test which involves a bootstrap-based calibration procedure that can be easily implemented across a wide range of heterogeneous regimes. A comprehensive simulation study and an application to detecting aberrant user behaviors in online games demonstrate the excellent non-asymptotic performance of the proposed test.
Submitted 26 April, 2023;
originally announced April 2023.
-
A Subquadratic Time Algorithm for the Weighted $k$-Center Problem on Cactus Graphs
Authors:
Binay Bhattacharya,
Sandip Das,
Subhadeep Ranjan Dev
Abstract:
The weighted $k$-center problem in graphs is a classical facility location problem where we place $k$ centers on the graph, which minimize the maximum weighted distance of a vertex to its nearest center. We study this problem when the underlying graph is a cactus with $n$ vertices and present an $O(n \log^2 n)$ time algorithm for the same. This time complexity improves upon the $O(n^2)$ time algorithm by Ben-Moshe et al. [TCS 2007], which is the current state-of-the-art.
Submitted 30 March, 2023;
originally announced March 2023.
-
A nested hierarchy of second order upper bounds on system failure probability
Authors:
Sourangshu Ghosh,
Baidurya Bhattacharya
Abstract:
For a coherent, binary system made up of binary elements, the exact failure probability requires knowledge of statistical dependence of all orders among the minimal cut sets. Since dependence among the cut sets beyond the second order is generally difficult to obtain, second order bounds on system failure probability have practical value. The upper bound is conservative by definition and can be adopted in reliability based decision making. In this paper we propose a new hierarchy of m-level second order upper bounds, Bm : the well-known Kounias-Vanmarcke-Hunter-Ditlevsen (KVHD) bound - the current standard for upper bounds using second order joint probabilities - turns out to be the weakest member of this family (m = 1). We prove that Bm is non-increasing with level m in every ordering of the cut sets, and derive conditions under which Bm+1 is strictly less than Bm for any m and any ordering. We also derive conditions under which the optimal level m bound is strictly less than the optimal level m + 1 bound, and show that this improvement asymptotically achieves a probability of 1 as long as the second order joint probabilities are only constrained by the pair of corresponding first order probabilities. Numerical examples show that our second order upper bounds can yield tighter values than previously achieved and in every case exhibit considerably less scatter across the entire n! orderings of the cut sets compared to KVHD bounds. Our results therefore may lead to more efficient identification of the optimal upper bound when coupled with existing linear programming and tree search based approaches.
Submitted 12 February, 2023;
originally announced March 2023.
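For orientation, the classical second order upper bound that the abstract above calls KVHD can be sketched as follows. This is the well-known form $P(\cup_i E_i) \le P_1 + \sum_{i=2}^{n} \left[ P_i - \max_{j<i} P_{ij} \right]$ for a given ordering of the cut sets; it is not the paper's new $B_m$ hierarchy, whose definition is not given in the abstract. The function names and the numerical values are hypothetical.

```python
from itertools import permutations

def kvhd_upper_bound(p, p2, order=None):
    """Second order upper bound on the union probability for one ordering.

    p[i]     : probability of cut set i failing
    p2[i][j] : joint probability of cut sets i and j failing (symmetric)
    order    : sequence of indices; defaults to 0, 1, ..., n-1
    """
    n = len(p)
    order = list(range(n)) if order is None else list(order)
    bound = p[order[0]]
    for pos in range(1, n):
        i = order[pos]
        # Subtract the largest pairwise overlap with any earlier cut set.
        largest_overlap = max(p2[i][j] for j in order[:pos])
        bound += p[i] - largest_overlap
    return min(bound, 1.0)

def best_kvhd_over_orderings(p, p2):
    """Smallest bound over all n! orderings (brute force, small n only)."""
    return min(kvhd_upper_bound(p, p2, o) for o in permutations(range(len(p))))

# Hypothetical first and second order probabilities for three cut sets.
p = [0.10, 0.20, 0.15]
p2 = [
    [0.10, 0.05, 0.02],
    [0.05, 0.20, 0.08],
    [0.02, 0.08, 0.15],
]
```

With these values the bound is 0.32 for the natural ordering and 0.35 for the ordering (0, 2, 1), which illustrates the ordering dependence, and hence the scatter across orderings, that the abstract refers to.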
-
Boosting the Power of Kernel Two-Sample Tests
Authors:
Anirban Chatterjee,
Bhaswar B. Bhattacharya
Abstract:
The kernel two-sample test based on the maximum mean discrepancy (MMD) is one of the most popular methods for detecting differences between two distributions over general metric spaces. In this paper we propose a method to boost the power of the kernel test by combining MMD estimates over multiple kernels using their Mahalanobis distance. We derive the asymptotic null distribution of the proposed test statistic and use a multiplier bootstrap approach to efficiently compute the rejection region. The resulting test is universally consistent and, since it is obtained by aggregating over a collection of kernels/bandwidths, is more powerful in detecting a wide range of alternatives in finite samples. We also derive the distribution of the test statistic for both fixed and local contiguous alternatives. The latter, in particular, implies that the proposed test is statistically efficient, that is, it has non-trivial asymptotic (Pitman) efficiency. Extensive numerical experiments are performed on both synthetic and real-world datasets to illustrate the efficacy of the proposed method over single kernel tests. Our asymptotic results rely on deriving the joint distribution of MMD estimates using the framework of multiple stochastic integrals, which is more broadly useful, specifically, in understanding the efficiency properties of recently proposed adaptive MMD tests based on kernel aggregation.
Submitted 21 February, 2023;
originally announced February 2023.
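The building block of the approach described above, a vector of MMD estimates over several kernels, can be sketched in plain Python. This is a minimal illustration under stated assumptions: it uses the standard unbiased estimate of the squared MMD with Gaussian kernels of different bandwidths, and it stops before the Mahalanobis aggregation and multiplier bootstrap, which depend on a covariance estimate the abstract does not spell out. All names and data are hypothetical.

```python
import math

def gaussian_kernel(x, y, bandwidth):
    """Gaussian kernel exp(-||x - y||^2 / (2 h^2)) on coordinate tuples."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))

def mmd2_unbiased(xs, ys, bandwidth):
    """Unbiased estimate of the squared MMD between samples xs and ys."""
    m, n = len(xs), len(ys)
    kxx = sum(gaussian_kernel(xs[i], xs[j], bandwidth)
              for i in range(m) for j in range(m) if i != j)
    kyy = sum(gaussian_kernel(ys[i], ys[j], bandwidth)
              for i in range(n) for j in range(n) if i != j)
    kxy = sum(gaussian_kernel(x, y, bandwidth) for x in xs for y in ys)
    return kxx / (m * (m - 1)) + kyy / (n * (n - 1)) - 2.0 * kxy / (m * n)

def mmd2_vector(xs, ys, bandwidths):
    """One squared-MMD estimate per bandwidth; the input to any aggregation step."""
    return [mmd2_unbiased(xs, ys, h) for h in bandwidths]

# Hypothetical one-dimensional samples stored as 1-tuples.
xs = [(0.0,), (0.1,), (0.2,)]
ys_far = [(5.0,), (5.1,), (5.2,)]
ys_near = [(0.05,), (0.15,), (0.25,)]
estimates = mmd2_vector(xs, ys_far, [0.5, 1.0, 2.0])
```

Because the estimate is unbiased, it can be slightly negative when the two samples are close, as with `ys_near` here; that is expected behaviour and not a bug.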
-
Distribution-free joint independence testing and robust independent component analysis using optimal transport
Authors:
Ziang Niu,
Bhaswar B. Bhattacharya
Abstract:
In this paper we study the problem of measuring and testing joint independence for a collection of multivariate random variables. Using the emerging theory of optimal transport (OT) based multivariate ranks, we propose a distribution-free test for multivariate joint independence. Towards this we introduce the notion of rank joint distance covariance (RJdCov), the higher-order rank analogue of the celebrated distance covariance measure, that captures the dependencies among all the subsets of the variables. The RJdCov can be easily estimated from the data without any moment assumptions and the associated test for joint independence is universally consistent. We can calibrate the test without any knowledge of the (unknown) marginal distributions (due to the distribution-free property), both asymptotically and in finite samples. In addition to being distribution-free and universally consistent, the proposed test is also statistically efficient, that is, it has non-trivial asymptotic (Pitman) efficiency. We demonstrate this by computing the limiting local power of the test for both mixture alternatives and joint Konijn alternatives. We also use the RJdCov measure to develop a method for independent component analysis (ICA) that is easy to implement and robust to outliers and contamination. Extensive simulations are performed to illustrate the efficacy of the proposed test in comparison to other existing methods. Finally, we apply the proposed test to learn the higher-order dependence structure among different US industries based on stock prices.
Submitted 30 November, 2022; v1 submitted 28 November, 2022;
originally announced November 2022.
-
A U-spin Puzzle in $B$ Decays
Authors:
Bhubanjyoti Bhattacharya,
Suman Kumbhakar,
David London,
Nicolas Payot
Abstract:
We impose U spin symmetry ($SU(2)_{\rm Uspin}$) on the Hamiltonian for $B$ decays. As expected, we find the equality of amplitudes related by the exchange $d \leftrightarrow s$. We also find that the amplitudes for the $ΔS=0$ processes $B^0 \to π^+π^-$, $B_s^0\toπ^+ K^-$ and $B^0\to K^+ K^-$ form a U-spin triangle relation. The amplitudes for $B_s^0\to K^+ K^-$, $B^0\toπ^- K^+$ and $B_s^0\toπ^+π^-$ form a similar $ΔS=1$ triangle relation. And these two triangles are related to one another by $d \leftrightarrow s$. We perform fits to the observables for these six decays. If perfect U spin is assumed, the fit is very poor. If U-spin-breaking contributions are added, we find many scenarios that can explain the data. However, in all cases, 100\% U-spin breaking is required, considerably larger than the naive expectation of $\sim 20\%$. This is the U-spin puzzle; it may be strongly hinting at the presence of new physics.
Submitted 7 January, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
A Framework for Automated Correctness Checking of Biochemical Protocol Realizations on Digital Microfluidic Biochips
Authors:
Sukanta Bhattacharjee,
Ansuman Banerjee,
Krishnendu Chakrabarty,
Bhargab B. Bhattacharya
Abstract:
Recent advances in digital microfluidic (DMF) technologies offer a promising platform for a wide variety of biochemical applications, such as DNA analysis, automated drug discovery, and toxicity monitoring. For on-chip implementation of complex bioassays, automated synthesis tools have been developed to meet the design challenges. Currently, the synthesis tools pass through a number of complex design steps to realize a given biochemical protocol on a target DMF architecture. Thus, design errors can arise during the synthesis steps. Before deploying a DMF biochip on a safety critical system, it is necessary to ensure that the desired biochemical protocol has been correctly implemented, i.e., the synthesized output (actuation sequences for the biochip) is free from any design or realization errors. We propose a symbolic constraint-based analysis framework for checking the correctness of a synthesized biochemical protocol with respect to the original design specification. The verification scheme based on this framework can detect several post-synthesis fluidic violations and realization errors in 2D-array based or pin-constrained biochips as well as in cyberphysical systems. It further generates diagnostic feedback for error localization. We present experimental results on the polymerase chain reaction (PCR) and in-vitro multiplexed bioassays to demonstrate the proposed verification approach.
Submitted 9 November, 2022;
originally announced November 2022.
-
Detecting entanglement harnessing Lindblad structure
Authors:
Vaibhav Chimalgi,
Bihalan Bhattacharya,
Suchetana Goswami,
Samyadeb Bhattacharya
Abstract:
The problem of entanglement detection is a long-standing problem in quantum information theory. One of the primary procedures for detecting entanglement is to find suitable positive but not completely positive maps. Here we try to give a generic prescription to construct a positive map that can be useful for such scenarios. We study a class of positive maps arising from Lindblad structures. We show that two famous positive maps, viz. transposition and the Choi map, can be obtained as special cases of a class of positive maps having Lindblad structure. Generalizing the transposition map to a one-parameter family, we use it to detect genuine multipartite entanglement. Finally, motivated by the negativity of entanglement, we define a similar measure for genuine multipartite entanglement.
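As a minimal illustration of how a positive but not completely positive map detects entanglement, the sketch below applies the transposition map to one half of a two-qubit state (the partial transpose) and evaluates the result on a test vector. This is the textbook partial-transpose criterion only, not the Lindblad-structured family of maps from the abstract; the helper names `partial_transpose`, `expectation`, and `outer` are hypothetical and chosen for this example.

```python
from math import sqrt

def partial_transpose(rho, d_a, d_b):
    """Apply the transposition map to subsystem B of a (d_a*d_b)-dimensional matrix."""
    n = d_a * d_b
    out = [[0.0] * n for _ in range(n)]
    for a in range(d_a):
        for b in range(d_b):
            for a2 in range(d_a):
                for b2 in range(d_b):
                    # Entry <a b| rho |a2 b2> moves to position <a b2| . |a2 b>.
                    out[a * d_b + b2][a2 * d_b + b] = rho[a * d_b + b][a2 * d_b + b2]
    return out

def expectation(op, vec):
    """Return <vec| op |vec> for a real matrix and a real vector."""
    n = len(vec)
    return sum(vec[i] * op[i][j] * vec[j] for i in range(n) for j in range(n))

def outer(vec):
    """Return the projector |vec><vec| for a real vector."""
    return [[x * y for y in vec] for x in vec]

s = 1 / sqrt(2)
bell = outer([s, 0.0, 0.0, s])          # (|00> + |11>)/sqrt(2), entangled
product = outer([1.0, 0.0, 0.0, 0.0])   # |00>, separable
singlet = [0.0, s, -s, 0.0]             # test vector (|01> - |10>)/sqrt(2)

# A negative expectation value shows the partially transposed operator is not
# positive semidefinite, which certifies entanglement of the input state.
bell_value = expectation(partial_transpose(bell, 2, 2), singlet)
product_value = expectation(partial_transpose(product, 2, 2), singlet)
print(bell_value, product_value)
```

The test vector is fixed by hand here; in general one would look at the smallest eigenvalue of the partially transposed matrix instead.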
Submitted 31 October, 2022;
originally announced October 2022.
-
Measurement incompatibility and quantum advantage in communication
Authors:
Debashis Saha,
Debarshi Das,
Arun Kumar Das,
Bihalan Bhattacharya,
A. S. Majumdar
Abstract:
Measurement incompatibility stipulates the existence of quantum measurements that cannot be carried out simultaneously on single systems. We show that the set of input-output probabilities obtained from d-dimensional classical systems assisted with shared randomness is the same as the set obtained from d-dimensional quantum strategies restricted to compatible measurements with shared randomness in any communication scenario. Thus, measurement incompatibility is necessary for quantum advantage in communication, and any quantum advantage (with or without shared randomness) in communication acts as a witness to the incompatibility of the measurements at the receiver's end in a semi-device-independent way. We introduce a class of communication tasks - a general version of random access codes - to witness incompatibility of an arbitrary number of quantum measurements with arbitrary outcomes acting on d-dimensional systems, and provide generic upper bounds on the success metric of these tasks for compatible measurements. We identify all sets of three incompatible rank-one projective qubit measurements that random access codes can witness. Finally, we present the generic relationship between different sets of probability distributions - classical, quantum with or without shared randomness, and quantum restricted to compatible measurements with or without shared randomness - produced in communication scenarios.
Submitted 12 June, 2023; v1 submitted 29 September, 2022;
originally announced September 2022.
-
A Prufer-Sequence Based Representation of Large Graphs for Structural Encoding of Logic Networks
Authors:
Manjari Pradhan,
Bhargab B. Bhattacharya
Abstract:
The pervasiveness of graphs in today's real-life systems is quite evident, where the system either explicitly exists as a graph or can be readily modelled as one. Such a graphical structure is thus a storehouse of rich information. This has various implications depending on whether we are interested in a node or the graph as a whole. In this paper, we are primarily concerned with the latter, that is, the inference that the structure of the graph influences the property of the real-life system it represents. A model of such structural influence would be useful in inferring useful properties of complex and large systems, like VLSI circuits, through their structural properties. However, before we can apply some machine learning (ML) based technique to model such a relationship, an effective representation of the graph is imperative. In this paper, we propose a graph representation which is lossless, linear-sized in terms of the number of vertices, and gives a 1-D representation of the graph. Our representation is based on Prufer encoding for trees. Moreover, our method is based on a novel technique, called $\mathcal{GT}$-enhancement, whereby we first transform the graph such that it can be represented by a singular tree. The encoding also provides scope to include additional graph properties and improve the interpretability of the code.
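For readers unfamiliar with the Prufer encoding that the abstract builds on, here is a minimal sketch of the classical version for labeled trees. It assumes vertices are labeled 0..n-1 and covers trees only; the $\mathcal{GT}$-enhancement step that turns a general graph into a tree is not reproduced. The function names `prufer_encode` and `prufer_decode` are hypothetical.

```python
import heapq

def prufer_encode(n, edges):
    """Return the Prufer sequence (length n-2) of a labeled tree on vertices 0..n-1."""
    adj = {v: set() for v in range(n)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    leaves = [v for v in range(n) if len(adj[v]) == 1]
    heapq.heapify(leaves)
    seq = []
    for _ in range(n - 2):
        leaf = heapq.heappop(leaves)      # smallest-labeled leaf
        (neighbor,) = adj[leaf]           # a leaf has exactly one neighbor
        seq.append(neighbor)
        adj[neighbor].discard(leaf)
        adj[leaf].clear()
        if len(adj[neighbor]) == 1:       # neighbor has become a leaf
            heapq.heappush(leaves, neighbor)
    return seq

def prufer_decode(seq):
    """Rebuild the edge list of the tree from its Prufer sequence."""
    n = len(seq) + 2
    degree = [1] * n
    for v in seq:
        degree[v] += 1
    leaves = [v for v in range(n) if degree[v] == 1]
    heapq.heapify(leaves)
    edges = []
    for v in seq:
        leaf = heapq.heappop(leaves)
        edges.append((leaf, v))
        degree[v] -= 1
        if degree[v] == 1:
            heapq.heappush(leaves, v)
    # Exactly two leaves remain; they form the last edge.
    edges.append((heapq.heappop(leaves), heapq.heappop(leaves)))
    return edges

tree = [(0, 3), (1, 3), (2, 3), (3, 4), (4, 5)]
code = prufer_encode(6, tree)
rebuilt = prufer_decode(code)
print(code, rebuilt)
```

The round trip shows the two properties highlighted in the abstract for the tree case: the code is lossless and its length is linear in the number of vertices.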
Submitted 4 September, 2022;
originally announced September 2022.
-
Implications for the $ΔA_{FB}$ anomaly in ${\bar B}^0\to D^{*+}\ell^- {\barν}$ using a new Monte Carlo Event Generator
Authors:
Bhubanjyoti Bhattacharya,
Thomas E. Browder,
Quinn Campagna,
Alakabha Datta,
Shawn Dubey,
Lopamudra Mukherjee,
Alexei Sibidanov
Abstract:
Recent experimental results in $B$ physics from Belle, BaBar and LHCb suggest new physics (NP) in the weak $b\to c$ charged-current and the $b\to s$ neutral-current processes. Here we focus on the charged-current case and specifically on the decay modes $\bar{B}^0\to D^{*+}\ell^- \barν$ with $\ell = e$ and $μ$. The world averages of the ratios $R_D$ and $R_D^{*}$ currently differ from the Standard Model (SM) predictions by $3.4σ$ while recently a new anomaly has been observed in the forward-backward asymmetry measurement, $A_{FB}$, in $ \bar{B}^0\to D^{*+}μ^- \barν$ decay. It is found that $ΔA_{FB} = A_{FB}(B\to D^{*} μν) - A_{FB} (B\to D^{*} e ν)$ is around $4.1σ$ away from the SM prediction in an analysis of 2019 Belle data. In this work we explore possible solutions to the $ΔA_{FB}$ anomaly and point out correlated NP signals in other angular observables. These correlations between angular observables must be present in the case of beyond the Standard Model physics. We stress the importance of $Δ$ type observables that are obtained by taking the difference of the observable for the muon and the electron mode. These quantities cancel form factor uncertainties in the SM and allow for clean tests of NP. These intriguing results also suggest an urgent need for improved simulation and analysis techniques in $\bar{B}^0\to D^{*+}\ell^- \barν$ decays. Here we also describe a new Monte Carlo Event-generator tool based on EVTGEN that we developed to allow simulation of the NP signatures in $\bar{B}^0\to D^{*+}\ell^- ν$, which arise due to the interference between the SM and NP amplitudes. We then discuss prospects for improved observables sensitive to NP couplings with 1, 5, 50, and 250 ab$^{-1}$ of Belle II data, which seem to be ideally suited for this class of measurements.
Submitted 21 December, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Improved Upper Bound on Independent Domination Number for Hypercubes
Authors:
Debabani Chowdhury,
Debesh K. Das,
Bhargab B. Bhattacharya
Abstract:
We revisit the problem of determining the independent domination number in hypercubes for which the known upper bound is still not tight for general dimensions. We present here a constructive method to build an independent dominating set $S_n$ for the $n$-dimensional hypercube $Q_n$, where $n=2p+1$, $p$ being a positive integer $\ge 1$, provided an independent dominating set $S_p$ for the $p$-dimensional hypercube $Q_p$, is known. The procedure also computes the minimum independent dominating set for all $n=2^k-1$, $k>1$. Finally, we establish that the independent domination number $α_n\leq 3 \times 2^{n-k-2}$ for $7\times 2^{k-2}-1\leq n<2^{k+1}-1$, $k>1$. This is an improved upper bound for this range as compared to earlier work.
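To make the definition concrete, the following sketch checks whether a vertex set is an independent dominating set of $Q_n$, with vertices encoded as n-bit integers. It is a brute-force checker only, not the constructive method of the abstract. As a sanity check for the $n=2^k-1$ case it uses the standard Hamming code on 7 bits, a well-known perfect code with $2^{7-3}=16$ codewords; the names `is_independent_dominating` and `syndrome` are hypothetical.

```python
def neighbors(v, n):
    """Vertices of Q_n adjacent to v (flip exactly one of the n bits)."""
    return [v ^ (1 << i) for i in range(n)]

def is_independent_dominating(candidate, n):
    """Check both properties by brute force over all 2**n vertices."""
    s = set(candidate)
    # Independent: no two chosen vertices are adjacent.
    independent = all(u not in s for v in s for u in neighbors(v, n))
    # Dominating: every vertex is chosen or has a chosen neighbor.
    dominating = all(
        v in s or any(u in s for u in neighbors(v, n)) for v in range(1 << n)
    )
    return independent and dominating

def syndrome(v, n):
    """XOR of the 1-based positions of the set bits (Hamming parity check)."""
    result = 0
    for i in range(n):
        if (v >> i) & 1:
            result ^= i + 1
    return result

q3_set = [0b000, 0b111]                                   # n = 3 = 2**2 - 1
hamming7 = [v for v in range(1 << 7) if syndrome(v, 7) == 0]  # n = 7 = 2**3 - 1

print(is_independent_dominating(q3_set, 3))
print(len(hamming7), is_independent_dominating(hamming7, 7))
```

The brute-force check costs on the order of $n\,2^n$ operations, so it is only practical for small dimensions.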
Submitted 13 May, 2022;
originally announced May 2022.
-
An Efficient Algorithm for the Proximity Connected Two Center Problem
Authors:
Binay Bhattacharya,
Amirhossein Mozafari,
Thomas C. Shermer
Abstract:
Given a set $P$ of $n$ points in the plane, the $k$-center problem is to find $k$ congruent disks of minimum possible radius such that their union covers all the points in $P$. The $2$-center problem is a special case of the $k$-center problem that has been extensively studied in the recent past \cite{CAHN,HT,SH}. In this paper, we consider a generalized version of the $2$-center problem called \textit{proximity connected} $2$-center (PCTC) problem. In this problem, we are also given a parameter $δ\geq 0$ and we have the additional constraint that the distance between the centers of the disks should be at most $δ$. Note that when $δ=0$, the PCTC problem is reduced to the $1$-center (minimum enclosing disk) problem and when $δ$ tends to infinity, it is reduced to the $2$-center problem. The PCTC problem first appeared in the context of wireless networks in 1992 \cite{ACN0}, but obtaining a nontrivial deterministic algorithm for the problem remained open. In this paper, we resolve this open problem by providing a deterministic $O(n^2\log n)$ time algorithm for the problem.
Submitted 19 April, 2022;
originally announced April 2022.
-
Enhancing HEP research in predominantly undergraduate institutions and community colleges
Authors:
Matt Bellis,
Bhubanjyoti Bhattacharya,
David DeMuth,
Julie Hogan,
Kathrine Laureto,
Sudhir Malik,
Ben Pearson
Abstract:
The long-term success of HEP lies in expanding inclusiveness beyond national labs and academic research institutions to a vast community of predominantly undergraduate institutions (PUI) and community colleges (CC). Institutions such as PUIs and CCs offer an early starting point in the pipeline that can mitigate issues of lack of diversity and underrepresented participation of different groups in HEP. However, there are many underlying systemic, structural, and cultural challenges that need to be addressed collectively. Experimental collaborations are largely populated by national labs and research-focused academic institutions (non-PUIs). The faculty at PUIs and CCs have a high teaching load that is detrimental to their research participation. In addition, there is a lack of guidance, access, and tough competition for securing research funding. The students also suffer from a lack of research infrastructure and technical equipment that can only be found at national labs and larger universities. There are existing successful efforts to enhance the HEP research experience of students and faculty members. This paper discusses ways to leverage these to provide more research opportunities and establish a sustainable national program targeting specifically the issues faced by communities at PUIs and CCs. The need for research mentoring and skill building for faculty members is also laid out. The changes discussed in this paper would make a direct impact on the current spectrum of challenges.
Submitted 1 April, 2022; v1 submitted 21 March, 2022;
originally announced March 2022.
-
A new tool to search for physics beyond the Standard Model in ${\bar B}\to D^{*+}\ell^- {\barν}$
Authors:
Bhubanjyoti Bhattacharya,
Thomas Browder,
Quinn Campagna,
Alakabha Datta,
Shawn Dubey,
Lopamudra Mukherjee,
Alexei Sibidanov
Abstract:
Recent experimental results in $B$ physics from Belle, BaBar and LHCb suggest new physics (NP) in the weak $b\to c$ charged-current and the $b\to s$ neutral-current processes. Here we focus on the charged-current case and specifically on the decay modes $B\to D^{*+}\ell^- \barν$ with $\ell = e, μ,$ and $τ$. The world averages of the ratios $R_D$ and $R_D^{*}$ currently differ from the Standard Model (SM) by $3.4σ$ while $ΔA_{FB} = A_{FB}(B\to D^{*} μν) - A_{FB} (B\to D^{*} e ν)$ is found to be $4.1σ$ away from the SM prediction in an analysis of 2019 Belle data. These intriguing results suggest an urgent need for improved simulation and analysis techniques in $B\to D^{*+}\ell^- \barν$ decays. Here we describe a Monte Carlo Event-generator tool based on EVTGEN developed to allow simulation of the NP signatures in $B\to D^*\ell^- ν$, which arise due to the interference between the SM and NP amplitudes. As a demonstration of the proposed approach, we exhibit some examples of NP couplings that are consistent with current data and could explain the $ΔA_{FB}$ anomaly in $B\to D^*\ell^- ν$ while remaining consistent with other constraints. We show that the $Δ$-type observables such as $ΔA_{FB}$ and $ΔS_5$ eliminate most QCD uncertainties from form factors and allow for clean measurements of NP. We introduce correlated observables that improve the sensitivity to NP. We discuss prospects for improved observables sensitive to NP couplings with the expected 50 ab$^{-1}$ of Belle II data, which seems to be ideally suited for this class of measurements.
Submitted 6 October, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Asymptotic Distribution of Random Quadratic Forms
Authors:
Bhaswar B. Bhattacharya,
Sayan Das,
Somabha Mukherjee,
Sumit Mukherjee
Abstract:
In this paper we characterize all distributional limits of the random quadratic form $T_n =\sum_{1\le u< v\le n} a_{u, v} X_u X_v$, where $((a_{u, v}))_{1\le u,v\le n}$ is a $\{0, 1\}$-valued symmetric matrix with zeros on the diagonal and $X_1, X_2, \ldots, X_n$ are i.i.d. mean $0$ variance $1$ random variables with common distribution function $F$. In particular, we show that any distributional limit of $S_n:=T_n/\sqrt{\mathrm{Var}[T_n]}$ can be expressed as the sum of three independent components: a Gaussian, a (possibly) infinite weighted sum of independent centered chi-squares, and a Gaussian mixture with a random variance. As a consequence, we prove a fourth moment theorem for the asymptotic normality of $S_n$, which applies even when $F$ does not have finite fourth moment. More formally, we show that $S_n$ converges to $N(0, 1)$ if and only if the fourth moment of $S_n$ (appropriately truncated when $F$ does not have finite fourth moment) converges to 3 (the fourth moment of the standard normal distribution).
Submitted 5 March, 2022;
originally announced March 2022.
-
Revisiting the sub-pulse drifting phenomenon in PSR J1822-2256: Drift Modes, Sparks, and Emission Heights
Authors:
Parul Janagal,
Manoneeta Chakraborty,
N. D. Ramesh Bhat,
Bhaswati Bhattacharya,
Samuel J. McSweeney
Abstract:
Sub-pulse drifting in pulsar radio emission is considered to be one of the most promising phenomena for uncovering the underlying physical processes. Here we present a detailed study of such a phenomenon in observations of PSR J1822$-$2256, made using the upgraded Giant Metrewave Radio Telescope (uGMRT). Observations were made simultaneously using the Band 3 (300-500 MHz) and Band 4 (550-750 MHz) receivers of the uGMRT. The pulsar is known to exhibit subpulse drifting, mode changing, and nulling. Our observations reveal four distinct sub-pulse drifting modes of emission (A, B, C, and D) for this pulsar, with the drift periodicities of 17.9 $P_1$, 5.8 $P_1$, 8 $P_1$, and 14.1 $P_1$, respectively (where $P_1$ is the pulsar rotation period), two of which exhibit some new features that were not reported in the previous studies. We also investigate the possible spark configuration, characterised by the number of sparks ($n$) in the carousel patterns of these four drift modes, and our analysis suggests two representative solutions for the number of sparks for a carousel rotation period, $P_4$, which lies in the range of $13$ to $16$. The large frequency coverage of our data (300-750 MHz) is also leveraged to explore the frequency dependence of single-pulse characteristics of the pulsar emission, particularly the frequency-dependent subpulse behaviour and the emission heights for the observed drift modes. Our analysis suggests a clear modal dependence of inferred emission heights. We discuss the implications for the pulsar emission mechanism and its relation to the proposed spark configuration.
Submitted 13 November, 2021;
originally announced November 2021.
-
High Dimensional Logistic Regression Under Network Dependence
Authors:
Somabha Mukherjee,
Ziang Niu,
Sagnik Halder,
Bhaswar B. Bhattacharya,
George Michailidis
Abstract:
Logistic regression is one of the most fundamental methods for modeling the probability of a binary outcome based on a collection of covariates. However, the classical formulation of logistic regression relies on the independent sampling assumption, which is often violated when the outcomes interact through an underlying network structure. This necessitates the development of models that can simultaneously handle both the network peer-effect (arising from neighborhood interactions) and the effect of high-dimensional covariates. In this paper, we develop a framework for incorporating such dependencies in a high-dimensional logistic regression model by introducing a quadratic interaction term, as in the Ising model, designed to capture pairwise interactions from the underlying network. The resulting model can also be viewed as an Ising model, where the node-dependent external fields linearly encode the high-dimensional covariates. We propose a penalized maximum pseudo-likelihood method for estimating the network peer-effect and the effect of the covariates, which, in addition to handling the high-dimensionality of the parameters, conveniently avoids the computational intractability of the maximum likelihood approach. Consequently, our method is computationally efficient and, under various standard regularity conditions, our estimate attains the classical high-dimensional rate of consistency. In particular, our results imply that even under network dependence it is possible to consistently estimate the model parameters at the same rate as in classical logistic regression, when the true parameter is sparse and the underlying network is not too dense. As a consequence of the general results, we derive the rates of consistency for various natural network models. We also develop an efficient algorithm for computing the estimates and validate our theoretical results in numerical experiments.
Submitted 9 September, 2022; v1 submitted 7 October, 2021;
originally announced October 2021.
-
Sparse Uniformity Testing
Authors:
Bhaswar B. Bhattacharya,
Rajarshi Mukherjee
Abstract:
In this paper we consider the uniformity testing problem for high-dimensional discrete distributions (multinomials) under sparse alternatives. More precisely, we derive sharp detection thresholds for testing, based on $n$ samples, whether a discrete distribution supported on $d$ elements differs from the uniform distribution only in $s$ (out of the $d$) coordinates and is $\varepsilon$-far (in total variation distance) from uniformity. Our results reveal various interesting phase transitions which depend on the interplay of the sample size $n$ and the signal strength $\varepsilon$ with the dimension $d$ and the sparsity level $s$. For instance, if the sample size is less than a threshold (which depends on $d$ and $s$), then all tests are asymptotically powerless, irrespective of the magnitude of the signal strength. On the other hand, if the sample size is above the threshold, then the detection boundary undergoes a further phase transition depending on the signal strength. Here, a $χ^2$-type test attains the detection boundary in the dense regime, whereas in the sparse regime a Bonferroni correction of two maximum-type tests and a version of the Higher Criticism test is optimal up to sharp constants. These results combined provide a complete description of the phase diagram for the sparse uniformity testing problem across all regimes of the parameters $n$, $d$, and $s$. One of the challenges in dealing with multinomials is that the parameters are always constrained to lie in the simplex. This results in the aforementioned two-layered phase transition, a new phenomenon which does not arise in classical high-dimensional sparse testing problems.
Submitted 16 February, 2022; v1 submitted 21 September, 2021;
originally announced September 2021.
-
Sparse Distributed Memory using Spiking Neural Networks on Nengo
Authors:
Rohan Deepak Ajwani,
Arshika Lalan,
Basabdatta Sen Bhattacharya,
Joy Bose
Abstract:
We present a Spiking Neural Network (SNN) based Sparse Distributed Memory (SDM) implemented on the Nengo framework. We have based our work on previous work by Furber et al. (2004), implementing SDM using N-of-M codes. As an integral part of the SDM design, we have implemented Correlation Matrix Memory (CMM) using SNN on Nengo. Our SNN implementation uses Leaky Integrate and Fire (LIF) spiking neuron models on Nengo. Our objective is to understand how well SNN-based SDMs perform in comparison to conventional SDMs. Towards this, we have simulated both conventional and SNN-based SDM and CMM on Nengo. We observe that SNN-based models perform similarly to the conventional ones. In order to evaluate the performance of different SNNs, we repeated the experiment using Adaptive-LIF, Spiking Rectified Linear Unit, and Izhikevich models and obtained similar results. We conclude that it is indeed feasible to develop some types of associative memories using spiking neurons whose memory capacity and other features are similar to those obtained without SNNs. Finally, we have implemented an application where MNIST images, encoded with N-of-M codes, are associated with their labels and stored in the SNN-based SDM.
Submitted 3 December, 2021; v1 submitted 7 September, 2021;
originally announced September 2021.
-
Non-Markovianity and entanglement detection
Authors:
Sourav Chanduka,
Bihalan Bhattacharya,
Rounak Mundra,
Samyadeb Bhattacharya,
Indranil Chakrabarty
Abstract:
We have established a novel method to detect non-Markovian indivisible quantum channels using structural physical approximation. We have shown that this method can be used to detect eternal non-Markovian operations. We have further established that, by harnessing eternal non-Markovianity, we can devise a protocol to detect quantum entanglement.
Submitted 6 September, 2021;
originally announced September 2021.
-
Flavor $SU(3)$ in Cabibbo-favored $D$-meson decays
Authors:
Bhubanjyoti Bhattacharya,
Alakabha Datta,
Alexey A. Petrov,
John Waite
Abstract:
A model-independent description of nonleptonic decays of charmed mesons is a challenging task due to large nonperturbative effects of strong interactions on the transition amplitudes. We discuss the equivalence of two different flavor-$SU(3)$-based descriptions of Cabibbo-favored nonleptonic decays of charmed mesons to two-pseudoscalar final states including the $η$ and $η^\prime$ mesons.
Submitted 28 September, 2021; v1 submitted 28 July, 2021;
originally announced July 2021.
-
Sample Preparation Meets Farey Sequence: A New Design Technique for Free-Flowing Microfluidic Networks
Authors:
Tapalina Banerjee,
Sudip Poddar,
Tsung-Yi Ho,
Bhargab B. Bhattacharya
Abstract:
The design of microfluidic biochips has posed new challenges for the EDA community due to the availability of various flow-based architectures and the need to cater to diverse applications such as sample preparation, personalized medicine, point-of-care diagnostics, and drug design. The ongoing Covid-19 pandemic has increased the demand for low-cost diagnostic lab-on-chips manifold. Sample preparation (dilution or mixing of biochemical fluids) is an indispensable step of any biochemical experiment including sensitive detection and successful assay execution downstream. Although for valve-based microfluidic biochips various design automation tools are currently available, they are expensive and prone to various manufacturing and operational defects. Additionally, many problems are left open in the domain of free-flowing biochips, where only a single layer of flow-channels is used for fluid-flow devoid of any kind of control layer/valves. In this work, we present a methodology for designing a free-flowing biochip that is capable of performing fluid dilution according to user requirements. The proposed algorithm for sample preparation utilizes the Farey-sequence arithmetic of fractions that are used to represent the concentration factor of the target fluid. We also present the detailed layout design of a free-flowing microfluidic architecture that emulates the dilution algorithm. The network is simulated using COMSOL multi-physics software accounting for relevant hydrodynamic parameters. Experiments on various test-cases support the efficacy of the proposed design in terms of accuracy, convergence time, reactant cost, and simplicity of the fluidic network compared to prior art.
Submitted 26 July, 2021;
originally announced July 2021.
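The abstract above represents target concentration factors as fractions and relies on Farey-sequence arithmetic. The sketch below only illustrates that arithmetic: it generates the Farey sequence of a given order and picks the member closest to a target concentration. How the paper maps such fractions onto mixing steps or channel layouts is not stated in the abstract, so `nearest_farey` and its use as a concentration approximation are assumptions made for illustration.

```python
from fractions import Fraction


def farey_sequence(n):
    """Return the Farey sequence of order n: all reduced fractions in [0, 1]
    with denominator at most n, in increasing order."""
    a, b, c, d = 0, 1, 1, n
    seq = [Fraction(a, b)]
    while c <= n:
        # Standard next-term recurrence for consecutive Farey fractions.
        k = (n + b) // d
        a, b, c, d = c, d, k * c - a, k * d - b
        seq.append(Fraction(a, b))
    return seq


def nearest_farey(target, n):
    """Hypothetical helper: the order-n Farey fraction closest to a target
    concentration factor given as a Fraction in [0, 1]."""
    return min(farey_sequence(n), key=lambda f: abs(f - target))


seq = farey_sequence(5)
approx = nearest_farey(Fraction(3, 10), 5)
```

Consecutive Farey fractions a/b < c/d always satisfy bc - ad = 1, which is the property that makes the sequence convenient for stepping between neighbouring concentration values.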
-
Foveal-pit inspired filtering of DVS spike response
Authors:
Shriya T. P. Gupta,
Pablo Linares-Serrano,
Basabdatta Sen Bhattacharya,
Teresa Serrano-Gotarredona
Abstract:
In this paper, we present results of processing Dynamic Vision Sensor (DVS) recordings of visual patterns with a retinal model based on foveal-pit inspired Difference of Gaussian (DoG) filters. A DVS sensor was stimulated with a varying number of vertical white and black bars of different spatial frequencies moving horizontally at a constant velocity. The output spikes generated by the DVS sensor were applied as input to a set of DoG filters inspired by the receptive field structure of the primate visual pathway. In particular, these filters mimic the receptive fields of the midget and parasol ganglion cells (spiking neurons of the retina) that sub-serve the photo-receptors of the foveal-pit. The features extracted with the foveal-pit model are used for further classification using a spiking convolutional neural network trained with a backpropagation variant adapted for spiking neural networks.
Submitted 29 May, 2021;
originally announced May 2021.
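For readers unfamiliar with Difference of Gaussian filtering, the following is a minimal one-dimensional sketch of a centre-surround DoG kernel. The radius and the two standard deviations are arbitrary illustrative values, not the midget or parasol cell parameters used in the paper, and the helper names are hypothetical.

```python
import math


def gaussian_kernel(radius, sigma):
    """Discrete 1-D Gaussian on [-radius, radius], normalised to sum to 1."""
    weights = [math.exp(-(x * x) / (2.0 * sigma * sigma))
               for x in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def dog_kernel(radius, sigma_centre, sigma_surround):
    """Centre-surround kernel: narrow Gaussian minus wide Gaussian."""
    centre = gaussian_kernel(radius, sigma_centre)
    surround = gaussian_kernel(radius, sigma_surround)
    return [c - s for c, s in zip(centre, surround)]


# Illustrative parameters only (assumed, not taken from the paper).
kernel = dog_kernel(radius=6, sigma_centre=1.0, sigma_surround=3.0)
```

Because both Gaussians are normalised, the kernel sums to zero and so responds to local contrast rather than to uniform illumination.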
-
Implementing a foveal-pit inspired filter in a Spiking Convolutional Neural Network: a preliminary study
Authors:
Shriya T. P. Gupta,
Basabdatta Sen Bhattacharya
Abstract:
We have presented a Spiking Convolutional Neural Network (SCNN) that incorporates retinal foveal-pit inspired Difference of Gaussian filters and rank-order encoding. The model is trained using a variant of the backpropagation algorithm adapted to work with spiking neurons, as implemented in the Nengo library. We have evaluated the performance of our model on two publicly available datasets: one for a digit recognition task, and the other for a vehicle recognition task. The network has achieved up to 90% accuracy, where loss is calculated using the cross-entropy function. This is an improvement over the roughly 57% accuracy obtained with the alternate approach of performing the classification without any kind of neural filtering. Overall, our proof-of-concept study indicates that introducing biologically plausible filtering in existing SCNN architecture will work well with noisy input images such as those in our vehicle recognition task. Based on our results, we plan to enhance our SCNN by integrating lateral inhibition-based redundancy reduction prior to rank-ordering, which will further improve the classification accuracy of the network.
Submitted 29 May, 2021;
originally announced May 2021.
-
Optimizing the location of vaccination sites to stop a zoonotic epidemic
Authors:
Ricardo Castillo-Neyra,
Bhaswar Bhattacharya,
Aris Saxena,
Brinkley Raynor,
Elvis Diaz,
Gian Franco Condori,
Maria Rieders,
Michael Z. Levy
Abstract:
The mainstay of canine rabies control is fixed point mass dog vaccination campaigns (MDVC). However, in some regions, ideal vaccination coverage in dogs is not obtained due to low participation in the MDVC. Travel distance to the vaccination sites has been identified as an important barrier to participation. We aim to increase MDVC participation by optimally placing fixed point vaccination locations to minimize walking distance to the nearest vaccination location. We quantified participation probability based on walking distance to the nearest vaccination point using a Poisson regression model. The regression was fit with survey data collected from 2016-2019. We then used a computational recursive interchange technique to solve the facility location problem to find a set of optimal placements of fixed point vaccination locations. Finally, we compared predicted participation of optimally placed vaccination sites to historical participation data from surveys collected from 2016-2019. We identified the p-median algorithm to solve the facility location problem as ideal for fixed point vaccination placement. We found a predicted increase in MDVC participation if vaccination locations are placed optimally. We also found a more even vaccination coverage with optimized vaccination sites; however, the workload in some optimized locations increased significantly. We developed a data-driven computational algorithm to combat an ongoing rabies epidemic by optimally using limited resources to maximize vaccination coverage. The main positive effects we expect if this algorithm is to be implemented would be increased overall vaccination coverage and increased spatial evenness of coverage. A potential negative effect could be the presence of long waiting lines as participation increases.
Submitted 25 May, 2021;
originally announced May 2021.
-
Phase Entrainment by Periodic Stimuli In Silico: A Quantitative Study
Authors:
Swapna Sasi,
Basabdatta Sen Bhattacharya
Abstract:
We present a quantitative study of phase entrainment by periodic visual stimuli in a biologically inspired neural network. The objective is to understand the neuronal population dynamics that underlie phase entrainment of brain oscillations by external stimuli, which is used for therapeutic treatment in neurological disorders, for example in Parkinsonian tremor. Yet, the neuronal dynamics underpinning such entrainment are not fully understood. Rhythmic sensory stimulation is one way of studying phase synchronization in the brain. A recent experimental study has reported phase entrainment of brain oscillations during steady state visually evoked potentials (SSVEP), which are scalp electroencephalogram signals corresponding to periodic stimuli. We have simulated SSVEP-like signals corresponding to periodic pulse input to our in silico model. We have used phase locking values, normalised Shannon entropy and conditional probability as synchronisation indices to show phase synchrony in the neuronal populations. Our experiment demonstrates that the phase synchronisation disappears with jitter in the input inter-pulse intervals, and this would not be the case if the output signal were to be the superposition of the responses to the different input signals. Thus, the phase synchronisation implies entrainment of the network response by the periodic input. Overall, our study shows the plausibility of using biologically inspired in silico models, validated by experimental works, to understand and make testable predictions on brain entrainment as a therapeutic treatment in specific neurological disorders.
Submitted 15 November, 2021; v1 submitted 22 May, 2021;
originally announced May 2021.
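Of the synchronisation indices listed in the abstract above, the phase locking value has a compact standard definition: the magnitude of the mean unit phasor of the phase difference between two signals. The sketch below assumes the instantaneous phases are already available as lists of radians; how the paper extracts phases from the simulated signals is not stated in the abstract, so that step is left out.

```python
import cmath
import math


def phase_locking_value(phases_a, phases_b):
    """Phase locking value of two equal-length phase series (radians).

    Returns a number in [0, 1]: 1 for a constant phase difference,
    close to 0 when the phase difference is spread evenly over the circle.
    """
    if len(phases_a) != len(phases_b) or not phases_a:
        raise ValueError("phase series must be non-empty and of equal length")
    total = sum(cmath.exp(1j * (a - b)) for a, b in zip(phases_a, phases_b))
    return abs(total) / len(phases_a)


n = 100
ramp = [0.1 * k for k in range(n)]
lagged = [x - 0.7 for x in ramp]                    # constant lag
spread = [2 * math.pi * k / n for k in range(n)]    # evenly spread differences
locked = phase_locking_value(ramp, lagged)
unlocked = phase_locking_value(spread, [0.0] * n)
```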
-
Fluctuations of Subgraph Counts in Graphon Based Random Graphs
Authors:
Bhaswar B. Bhattacharya,
Anirban Chatterjee,
Svante Janson
Abstract:
Given a graphon $W$ and a finite simple graph $H$, with vertex set $V(H)$, denote by $X_n(H, W)$ the number of copies of $H$ in a $W$-random graph on $n$ vertices. The asymptotic distribution of $X_n(H, W)$ was recently obtained by Hladký, Pelekis, and Šileikis (2021) in the case where $H$ is a clique. In this paper, we extend this result to any fixed graph $H$. Towards this we introduce a notion of $H$-regularity of graphons and show that if the graphon $W$ is not $H$-regular, then $X_n(H, W)$ has Gaussian fluctuations with scaling $n^{|V(H)|-\frac{1}{2}}$. On the other hand, if $W$ is $H$-regular, then the fluctuations are of order $n^{|V(H)|-1}$ and the limiting distribution of $X_n(H, W)$ can have both Gaussian and non-Gaussian components, where the non-Gaussian component is a (possibly) infinite weighted sum of centered chi-squared random variables with the weights determined by the spectral properties of a graphon derived from $W$. Our proofs use the asymptotic theory of generalized $U$-statistics developed by Janson and Nowicki (1991). We also investigate the structure of $H$-regular graphons for which either the Gaussian or the non-Gaussian component of the limiting distribution (but not both) is degenerate. Interestingly, there are also $H$-regular graphons $W$ for which both the Gaussian and the non-Gaussian components are degenerate, that is, $X_n(H, W)$ has a degenerate limit even under the scaling $n^{|V(H)|-1}$. We give an example of this degeneracy with $H=K_{1, 3}$ (the 3-star) and also establish non-degeneracy in a few examples. This naturally leads to interesting open questions on higher-order degeneracies.
Submitted 17 January, 2022; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Bayesian Optimisation for a Biologically Inspired Population Neural Network
Authors:
Mahak Kothari,
Swapna Sasi,
Jun Chen,
Elham Zareian,
Basabdatta Sen Bhattacharya
Abstract:
We have used Bayesian Optimisation (BO) to find hyper-parameters in an existing biologically plausible population neural network. The 8-dimensional optimal hyper-parameter combination should be such that the network dynamics simulate the resting state alpha rhythm (8 - 13 Hz rhythms in brain signals). Each combination of these eight hyper-parameters constitutes a 'datapoint' in the parameter space. The best combination of these parameters leads to the neural network's output power spectral peak being constrained within the alpha band. Further, constraints were introduced to the BO algorithm based on qualitative observation of the network output time series, so that high amplitude pseudo-periodic oscillations are removed. Upon successful implementation for the alpha band, we further optimised the network to oscillate within the theta (4 - 8 Hz) and beta (13 - 30 Hz) bands. The changing rhythms in the model can now be studied using the identified optimal hyper-parameters for the respective frequency bands. We have previously tuned parameters in the existing neural network by the trial-and-error approach; however, due to time and computational constraints, we could not vary more than three parameters at once. The approach detailed here allows an automatic hyper-parameter search, producing reliable parameter sets for the network.
Submitted 13 April, 2021;
originally announced April 2021.
-
Axion-like particles resolve the $B \to πK$ and g-2 anomalies
Authors:
Bhubanjyoti Bhattacharya,
Alakabha Datta,
Danny Marfatia,
Soumitra Nandi,
John Waite
Abstract:
We offer a new solution to an old puzzle in the penguin-dominated $B\toπK$ decays. The puzzle is the inconsistency among the measurements of the branching ratios and CP asymmetries of the four $B\toπK$ decays: $B^+ \to π^+ K^0$, $B^+\to π^0 K^+$, $B_d^0\toπ^- K^+$, $B_d^0\toπ^0 K^0$. We solve the $B\toπK$ puzzle by considering the effect of an axion-like particle (ALP) that mixes with the $π^0$ and has mass close to the $π^0$ mass. We show that the ALP can also explain the anomalies in the electron and muon anomalous magnetic moments.
Submitted 12 August, 2021; v1 submitted 8 April, 2021;
originally announced April 2021.
-
Efficiency Lower Bounds for Distribution-Free Hotelling-Type Two-Sample Tests Based on Optimal Transport
Authors:
Nabarun Deb,
Bhaswar B. Bhattacharya,
Bodhisattva Sen
Abstract:
The Wilcoxon rank-sum test is one of the most popular distribution-free procedures for testing the equality of two univariate probability distributions. One of the main reasons for its popularity can be attributed to the remarkable result of Hodges and Lehmann (1956), which shows that the asymptotic relative efficiency of Wilcoxon's test with respect to Student's $t$-test, under location alternatives, never falls below 0.864, despite the former being exactly distribution-free for all sample sizes. Even more striking is the result of Chernoff and Savage (1958), which shows that the efficiency of a Gaussian score transformed Wilcoxon's test, against the $t$-test, is lower bounded by 1. In this paper we study the two-sample problem in the multivariate setting and propose distribution-free analogues of the Hotelling $T^2$ test (the natural multidimensional counterpart of Student's $t$-test) based on optimal transport and obtain extensions of the above celebrated results over various natural families of multivariate distributions. Our proposed tests are consistent against a general class of alternatives and satisfy Hodges-Lehmann and Chernoff-Savage-type efficiency lower bounds, despite being entirely agnostic to the underlying data generating mechanism. In particular, a collection of our proposed tests suffers no loss in asymptotic efficiency when compared to Hotelling $T^2$. To the best of our knowledge, this is the first collection of multivariate, nonparametric, exactly distribution-free tests that provably achieve such attractive efficiency lower bounds. We also demonstrate the broader scope of our methods in optimal transport based nonparametric inference by constructing exactly distribution-free multivariate tests for mutual independence, which suffer no loss in asymptotic efficiency against the classical Wilks' likelihood ratio test, under Konijn alternatives.
Submitted 18 August, 2021; v1 submitted 5 April, 2021;
originally announced April 2021.
-
Quantifying Synchronization in a Biologically Inspired Neural Network
Authors:
Pranav Mahajan,
Advait Rane,
Swapna Sasi,
Basabdatta Sen Bhattacharya
Abstract:
We present a collated set of algorithms to obtain objective measures of synchronisation in brain time-series data. The algorithms are implemented in MATLAB; we refer to our collated set of 'tools' as SyncBox. Our motivation for SyncBox is to understand the underlying dynamics in an existing population neural network, of the kind commonly referred to as neural mass models, that mimics Local Field Potentials of the visual thalamic tissue. Specifically, we aim to measure the phase synchronisation objectively in the model response to periodic stimuli; this is to mimic the condition of Steady-state-visually-evoked-potentials (SSVEP), which are scalp Electroencephalograph (EEG) signals corresponding to periodic stimuli. We showcase the use of SyncBox on our existing neural mass model of the visual thalamus. Following our successful testing of SyncBox, it is currently being used for further research on understanding the underlying dynamics in enhanced neural networks of the visual pathway.
Submitted 10 December, 2020;
originally announced December 2020.
-
Motif Estimation via Subgraph Sampling: The Fourth Moment Phenomenon
Authors:
Bhaswar B. Bhattacharya,
Sayan Das,
Sumit Mukherjee
Abstract:
Network sampling is an indispensable tool for understanding features of large complex networks where it is practically impossible to search over the entire graph. In this paper, we develop a framework for statistical inference for counting network motifs, such as edges, triangles, and wedges, in the widely used subgraph sampling model, where each vertex is sampled independently, and the subgraph induced by the sampled vertices is observed. We derive necessary and sufficient conditions for the consistency and the asymptotic normality of the natural Horvitz-Thompson (HT) estimator, which can be used for constructing confidence intervals and hypothesis testing for the motif counts based on the sampled graph. In particular, we show that the asymptotic normality of the HT estimator exhibits an interesting fourth-moment phenomenon, which asserts that the HT estimator (appropriately centered and rescaled) converges in distribution to the standard normal whenever its fourth moment converges to 3 (the fourth moment of the standard normal distribution). As a consequence, we derive the exact thresholds for consistency and asymptotic normality of the HT estimator in various natural graph ensembles, such as sparse graphs with bounded degree, Erdős-Rényi random graphs, random regular graphs, and dense graphons.
Submitted 5 November, 2020;
originally announced November 2020.
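To make the subgraph sampling model in the abstract above concrete, the sketch below computes a Horvitz-Thompson style estimate of the edge count. It assumes every vertex is kept independently with the same known probability p, so that an edge survives in the induced subgraph with probability p squared and the observed edge count is rescaled by 1/p**2. The function names and the toy graph are made up for the example; the paper's general framework for other motifs is not reproduced here.

```python
import random


def sample_vertices(vertices, p, rng):
    """Keep each vertex independently with probability p."""
    return {v for v in vertices if rng.random() < p}


def ht_edge_estimate(edges, sampled, p):
    """Horvitz-Thompson estimate of the number of edges.

    Counts edges whose two endpoints were both sampled (the induced
    subgraph) and divides by the inclusion probability p**2.
    """
    observed = sum(1 for u, v in edges if u in sampled and v in sampled)
    return observed / (p ** 2)


# Toy graph: a triangle on vertices 0, 1, 2.
triangle = [(0, 1), (1, 2), (0, 2)]
full = ht_edge_estimate(triangle, {0, 1, 2}, 1.0)
partial = ht_edge_estimate(triangle, {0, 1}, 0.5)

# One random draw, with a fixed seed chosen only for repeatability.
rng = random.Random(0)
draw = sample_vertices(range(3), 0.5, rng)
estimate = ht_edge_estimate(triangle, draw, 0.5)
```

For a motif on r vertices the same idea would rescale the induced count by p to the power of minus r, for example r = 3 for triangles.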
-
Generating and detecting bound entanglement in two-qutrits using a family of indecomposable positive maps
Authors:
Bihalan Bhattacharya,
Suchetana Goswami,
Rounak Mundra,
Nirman Ganguly,
Indranil Chakrabarty,
Samyadeb Bhattacharya,
A. S. Majumdar
Abstract:
The problem of bound entanglement detection is a challenging aspect of quantum information theory for higher dimensional systems. Here, we propose an indecomposable positive map for two-qutrit systems, which is shown to generate a class of positive partial transposed (PPT) states. A corresponding witness operator is constructed and shown to be weakly optimal and locally implementable. Further, we perform a structural physical approximation of the indecomposable map to make it a completely positive one, and find a new PPT entangled state which is not detectable by certain other well-known entanglement detection criteria.
Submitted 18 September, 2020; v1 submitted 29 August, 2020;
originally announced August 2020.
-
Estimation in Tensor Ising Models
Authors:
Somabha Mukherjee,
Jaesung Son,
Bhaswar B. Bhattacharya
Abstract:
The $p$-tensor Ising model is a one-parameter discrete exponential family for modeling dependent binary data, where the sufficient statistic is a multi-linear form of degree $p \geq 2$. This is a natural generalization of the matrix Ising model, that provides a convenient mathematical framework for capturing higher-order dependencies in complex relational data. In this paper, we consider the problem of estimating the natural parameter of the $p$-tensor Ising model given a single sample from the distribution on $N$ nodes. Our estimate is based on the maximum pseudo-likelihood (MPL) method, which provides a computationally efficient algorithm for estimating the parameter that avoids computing the intractable partition function. We derive general conditions under which the MPL estimate is $\sqrt N$-consistent, that is, it converges to the true parameter at rate $1/\sqrt N$. In particular, we show the $\sqrt N$-consistency of the MPL estimate in the $p$-spin Sherrington-Kirkpatrick (SK) model, spin systems on general $p$-uniform hypergraphs, and Ising models on the hypergraph stochastic block model (HSBM). In fact, for the HSBM we pin down the exact location of the phase transition threshold, which is determined by the positivity of a certain mean-field variational problem, such that above this threshold the MPL estimate is $\sqrt N$-consistent, while below the threshold no estimator is consistent. Finally, we derive the precise fluctuations of the MPL estimate in the special case of the $p$-tensor Curie-Weiss model. An interesting consequence of our results is that the MPL estimate in the Curie-Weiss model saturates the Cramer-Rao lower bound at all points above the estimation threshold, that is, the MPL estimate incurs no loss in asymptotic efficiency, even though it is obtained by minimizing only an approximation of the true likelihood function for computational tractability.
Submitted 28 August, 2020;
originally announced August 2020.
-
Parameter Estimation for Undirected Graphical Models with Hard Constraints
Authors:
Bhaswar B. Bhattacharya,
Kavita Ramanan
Abstract:
The hardcore model on a graph $G$ with parameter $λ>0$ is a probability measure on the collection of all independent sets of $G$, that assigns to each independent set $I$ a probability proportional to $λ^{|I|}$. In this paper we consider the problem of estimating the parameter $λ$ given a single sample from the hardcore model on a graph $G$. To bypass the computational intractability of the maximum likelihood method, we use the maximum pseudo-likelihood (MPL) estimator, which for the hardcore model has a surprisingly simple closed form expression. We show that for any sequence of graphs $\{G_N\}_{N\geq 1}$, where $G_N$ is a graph on $N$ vertices, the MPL estimate of $λ$ is $\sqrt N$-consistent, whenever the graph sequence has uniformly bounded average degree. We then derive sufficient conditions under which the MPL estimate of the activity parameters is $\sqrt N$-consistent given a single sample from a general $H$-coloring model, in which restrictions between adjacent colors are encoded by a constraint graph $H$. We verify the sufficient conditions for models where there is at least one unconstrained color as long as the graph sequence has uniformly bounded average degree. This applies to many $H$-coloring examples such as the Widom-Rowlinson and multi-state hard-core models. On the other hand, for the $q$-coloring model, which falls outside this class, we show that consistent estimation may be impossible even for graphs with bounded average degree. Nevertheless, we show that the MPL estimate is $\sqrt N$-consistent in the $q$-coloring model when $\{G_N\}_{N\geq 1}$ has bounded average double neighborhood. The presence of hard constraints, as opposed to soft constraints, leads to new challenges, and our proofs entail applications of the method of exchangeable pairs as well as combinatorial arguments that employ the probabilistic method.
Submitted 23 June, 2021; v1 submitted 22 August, 2020;
originally announced August 2020.
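The abstract above notes that the maximum pseudo-likelihood estimator has a simple closed form for the hardcore model but does not state it. The sketch below uses an expression obtained by working through the standard pseudo-likelihood definition, and should be read as an assumption rather than as the paper's formula. Conditional on its neighbours, a vertex with an occupied neighbour must be empty, and a vertex with no occupied neighbour is occupied with probability $λ/(1+λ)$. Maximising the resulting product gives the estimate |I| divided by the number of unoccupied vertices that have no occupied neighbour.

```python
import math


def hardcore_mpl(adjacency, independent_set):
    """Pseudo-likelihood estimate of the hardcore activity parameter.

    adjacency: dict mapping each vertex to an iterable of its neighbours.
    independent_set: the single observed sample, as a collection of vertices.
    Returns math.inf when no unoccupied vertex is free of occupied neighbours.
    """
    occupied = set(independent_set)
    for v in occupied:
        if occupied & set(adjacency[v]):
            raise ValueError("sample is not an independent set")
    # Unoccupied vertices that could have been occupied given their neighbours.
    free_unoccupied = sum(
        1 for v in adjacency
        if v not in occupied and not (occupied & set(adjacency[v]))
    )
    if free_unoccupied == 0:
        return math.inf
    return len(occupied) / free_unoccupied


# Toy example: path graph 0-1-2-3-4-5 with vertex 0 occupied.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}
estimate = hardcore_mpl(path, {0})
```

In the toy example vertex 1 is blocked by vertex 0, leaving four free unoccupied vertices, so the estimate is 1/4.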
-
Phase Transitions of the Maximum Likelihood Estimates in the $p$-Spin Curie-Weiss Model
Authors:
Somabha Mukherjee,
Jaesung Son,
Bhaswar B. Bhattacharya
Abstract:
In this paper we consider the problem of parameter estimation in the $p$-spin Curie-Weiss model, for $p \geq 3$. We provide a complete description of the limiting properties of the maximum likelihood (ML) estimates of the inverse temperature and the magnetic field given a single realization from the $p$-spin Curie-Weiss model, complementing the well-known results in the 2-spin case (Comets and Gidas (1991)). Our results unearth various new phase transitions and surprising limit theorems, such as the existence of a 'critical' curve in the parameter space, where the limiting distribution of the ML estimates is a mixture with both continuous and discrete components. The number of mixture components is either two or three, depending on, among other things, the sign of one of the parameters and the parity of $p$. Another interesting revelation is the existence of certain 'special' points in the parameter space where the ML estimates exhibit a superefficiency phenomenon, converging to a non-Gaussian limiting distribution at rate $N^{\frac{3}{4}}$. Using these results we can obtain asymptotically valid confidence intervals for the inverse temperature and the magnetic field at all points in the parameter space where consistent estimation is possible.
Submitted 24 August, 2022; v1 submitted 7 May, 2020;
originally announced May 2020.