-
Minimax And Adaptive Transfer Learning for Nonparametric Classification under Distributed Differential Privacy Constraints
Authors:
Arnab Auddy,
T. Tony Cai,
Abhinav Chakraborty
Abstract:
This paper considers minimax and adaptive transfer learning for nonparametric classification under the posterior drift model with distributed differential privacy constraints. Our study is conducted within a heterogeneous framework, encompassing diverse sample sizes, varying privacy parameters, and data heterogeneity across different servers. We first establish the minimax misclassification rate,…
▽ More
This paper considers minimax and adaptive transfer learning for nonparametric classification under the posterior drift model with distributed differential privacy constraints. Our study is conducted within a heterogeneous framework, encompassing diverse sample sizes, varying privacy parameters, and data heterogeneity across different servers. We first establish the minimax misclassification rate, precisely characterizing the effects of privacy constraints, source samples, and target samples on classification accuracy. The results reveal interesting phase transition phenomena and highlight the intricate trade-offs between preserving privacy and achieving classification accuracy. We then develop a data-driven adaptive classifier that achieves the optimal rate within a logarithmic factor across a large collection of parameter spaces while satisfying the same set of differential privacy constraints. Simulation studies and real-world data applications further elucidate the theoretical analysis with numerical results.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Optimal Federated Learning for Nonparametric Regression with Heterogeneous Distributed Differential Privacy Constraints
Authors:
T. Tony Cai,
Abhinav Chakraborty,
Lasse Vuursteen
Abstract:
This paper studies federated learning for nonparametric regression in the context of distributed samples across different servers, each adhering to distinct differential privacy constraints. The setting we consider is heterogeneous, encompassing both varying sample sizes and differential privacy constraints across servers. Within this framework, both global and pointwise estimation are considered,…
▽ More
This paper studies federated learning for nonparametric regression in the context of distributed samples across different servers, each adhering to distinct differential privacy constraints. The setting we consider is heterogeneous, encompassing both varying sample sizes and differential privacy constraints across servers. Within this framework, both global and pointwise estimation are considered, and optimal rates of convergence over the Besov spaces are established.
Distributed privacy-preserving estimators are proposed and their risk properties are investigated. Matching minimax lower bounds, up to a logarithmic factor, are established for both global and pointwise estimation. Together, these findings shed light on the tradeoff between statistical accuracy and privacy preservation. In particular, we characterize the compromise not only in terms of the privacy budget but also concerning the loss incurred by distributing data within the privacy framework as a whole. This insight captures the folklore wisdom that it is easier to retain privacy in larger samples, and explores the differences between pointwise and global estimation under distributed privacy constraints.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Federated Nonparametric Hypothesis Testing with Differential Privacy Constraints: Optimal Rates and Adaptive Tests
Authors:
T. Tony Cai,
Abhinav Chakraborty,
Lasse Vuursteen
Abstract:
Federated learning has attracted significant recent attention due to its applicability across a wide range of settings where data is collected and analyzed across disparate locations. In this paper, we study federated nonparametric goodness-of-fit testing in the white-noise-with-drift model under distributed differential privacy (DP) constraints.
We first establish matching lower and upper bound…
▽ More
Federated learning has attracted significant recent attention due to its applicability across a wide range of settings where data is collected and analyzed across disparate locations. In this paper, we study federated nonparametric goodness-of-fit testing in the white-noise-with-drift model under distributed differential privacy (DP) constraints.
We first establish matching lower and upper bounds, up to a logarithmic factor, on the minimax separation rate. This optimal rate serves as a benchmark for the difficulty of the testing problem, factoring in model characteristics such as the number of observations, noise level, and regularity of the signal class, along with the strictness of the $(ε,δ)$-DP requirement. The results demonstrate interesting and novel phase transition phenomena. Furthermore, the results reveal an interesting phenomenon that distributed one-shot protocols with access to shared randomness outperform those without access to shared randomness. We also construct a data-driven testing procedure that possesses the ability to adapt to an unknown regularity parameter over a large collection of function classes with minimal additional cost, all while maintaining adherence to the same set of DP constraints.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
PriME: Privacy-aware Membership profile Estimation in networks
Authors:
Abhinav Chakraborty,
Sayak Chatterjee,
Sagnik Nandy
Abstract:
This paper presents a novel approach to estimating community membership probabilities for network vertices generated by the Degree Corrected Mixed Membership Stochastic Block Model while preserving individual edge privacy. Operating within the $\varepsilon$-edge local differential privacy framework, we introduce an optimal private algorithm based on a symmetric edge flip mechanism and spectral clu…
▽ More
This paper presents a novel approach to estimating community membership probabilities for network vertices generated by the Degree Corrected Mixed Membership Stochastic Block Model while preserving individual edge privacy. Operating within the $\varepsilon$-edge local differential privacy framework, we introduce an optimal private algorithm based on a symmetric edge flip mechanism and spectral clustering for accurate estimation of vertex community memberships. We conduct a comprehensive analysis of the estimation risk and establish the optimality of our procedure by providing matching lower bounds to the minimax risk under privacy constraints. To validate our approach, we demonstrate its performance through numerical simulations and its practical application to real-world data. This work represents a significant step forward in balancing accurate community membership estimation with stringent privacy preservation in network data analysis.
△ Less
Submitted 4 June, 2024;
originally announced June 2024.
-
Early detection of disease outbreaks and non-outbreaks using incidence data
Authors:
Shan Gao,
Amit K. Chakraborty,
Russell Greiner,
Mark A. Lewis,
Hao Wang
Abstract:
Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a…
▽ More
Forecasting the occurrence and absence of novel disease outbreaks is essential for disease management. Here, we develop a general model, with no real-world training data, that accurately forecasts outbreaks and non-outbreaks. We propose a novel framework, using a feature-based time series classification method to forecast outbreaks and non-outbreaks. We tested our methods on synthetic data from a Susceptible-Infected-Recovered model for slowly changing, noisy disease dynamics. Outbreak sequences give a transcritical bifurcation within a specified future time window, whereas non-outbreak (null bifurcation) sequences do not. We identified incipient differences in time series of infectives leading to future outbreaks and non-outbreaks. These differences are reflected in 22 statistical features and 5 early warning signal indicators. Classifier performance, given by the area under the receiver-operating curve, ranged from 0.99 for large expanding windows of training data to 0.7 for small rolling windows. Real-world performances of classifiers were tested on two empirical datasets, COVID-19 data from Singapore and SARS data from Hong Kong, with two classifiers exhibiting high accuracy. In summary, we showed that there are statistical features that distinguish outbreak and non-outbreak sequences long before outbreaks occur. We could detect these differences in synthetic and real-world data sets, well before potential outbreaks occur.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Popov Mirror-Prox for solving Variational Inequalities
Authors:
Abhishek Chakraborty,
Angelia Nedić
Abstract:
We consider the mirror-prox algorithm for solving monotone Variational Inequality (VI) problems. As the mirror-prox algorithm is not practically implementable, except in special instances of VIs (such as affine VIs), we consider its implementation with Popov method updates. We provide convergence rate analysis of our proposed method for a monotone VI with a Lipschitz continuous map**. We establi…
▽ More
We consider the mirror-prox algorithm for solving monotone Variational Inequality (VI) problems. As the mirror-prox algorithm is not practically implementable, except in special instances of VIs (such as affine VIs), we consider its implementation with Popov method updates. We provide convergence rate analysis of our proposed method for a monotone VI with a Lipschitz continuous map**. We establish a convergence rate of $O(1/t)$, in terms of the number $t$ of iterations, for the dual gap function. Simulations on a two player matrix game corroborate our findings.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Random Methods for Variational Inequalities
Authors:
Abhishek Chakraborty,
Angelia Nedić
Abstract:
This paper considers a variational inequality (VI) problem arising from a game among multiple agents, where each agent aims to minimize its own cost function subject to its constrained set represented as the intersection of a (possibly infinite) number of convex functional level sets. A direct projection-based approach or Lagrangian-based techniques for such a problem can be computationally expens…
▽ More
This paper considers a variational inequality (VI) problem arising from a game among multiple agents, where each agent aims to minimize its own cost function subject to its constrained set represented as the intersection of a (possibly infinite) number of convex functional level sets. A direct projection-based approach or Lagrangian-based techniques for such a problem can be computationally expensive if not impossible to implement. To deal with the problem, we consider randomized methods that avoid the projection step on the whole constraint set by employing random feasibility updates. In particular, we propose and analyze such random methods for solving VIs based on the projection method, Korpelevich method, and Popov method. We establish the almost sure convergence of the methods and, also, provide their convergence rate guarantees. We illustrate the performance of the methods in simulations for two-agent games.
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
PrIsing: Privacy-Preserving Peer Effect Estimation via Ising Model
Authors:
Abhinav Chakraborty,
Anirban Chatterjee,
Abhinandan Dalal
Abstract:
The Ising model, originally developed as a spin-glass model for ferromagnetic elements, has gained popularity as a network-based model for capturing dependencies in agents' outputs. Its increasing adoption in healthcare and the social sciences has raised privacy concerns regarding the confidentiality of agents' responses. In this paper, we present a novel $(\varepsilon,δ)$-differentially private a…
▽ More
The Ising model, originally developed as a spin-glass model for ferromagnetic elements, has gained popularity as a network-based model for capturing dependencies in agents' outputs. Its increasing adoption in healthcare and the social sciences has raised privacy concerns regarding the confidentiality of agents' responses. In this paper, we present a novel $(\varepsilon,δ)$-differentially private algorithm specifically designed to protect the privacy of individual agents' outcomes. Our algorithm allows for precise estimation of the natural parameter using a single network through an objective perturbation technique. Furthermore, we establish regret bounds for this algorithm and assess its performance on synthetic datasets and two real-world networks: one involving HIV status in a social network and the other concerning the political leaning of online blogs.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
An Anisotropic $hp$-Adaptation Framework for Ultraweak Discontinuous Petrov-Galerkin Formulations
Authors:
Ankit Chakraborty,
Stefan Henneking,
Leszek Demkowicz
Abstract:
In this article, we present a three-dimensional anisotropic $hp$-mesh refinement strategy for ultraweak discontinuous Petrov--Galerkin (DPG) formulations with optimal test functions. The refinement strategy utilizes the built-in residual-based error estimator accompanying the DPG discretization. The refinement strategy is a two-step process: (a) use the built-in error estimator to mark and isotrop…
▽ More
In this article, we present a three-dimensional anisotropic $hp$-mesh refinement strategy for ultraweak discontinuous Petrov--Galerkin (DPG) formulations with optimal test functions. The refinement strategy utilizes the built-in residual-based error estimator accompanying the DPG discretization. The refinement strategy is a two-step process: (a) use the built-in error estimator to mark and isotropically $hp$-refine elements of the (coarse) mesh to generate a finer mesh; (b) use the reference solution on the finer mesh to compute optimal $h$- and $p$-refinements of the selected elements in the coarse mesh. The process is repeated with coarse and fine mesh being generated in every adaptation cycle, until a prescribed error tolerance is achieved. We demonstrate the performance of the proposed refinement strategy using several numerical examples on hexahedral meshes.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
RANS-PINN based Simulation Surrogates for Predicting Turbulent Flows
Authors:
Shinjan Ghosh,
Amit Chakraborty,
Georgia Olympia Brikis,
Biswadip Dey
Abstract:
Physics-informed neural networks (PINNs) provide a framework to build surrogate models for dynamical systems governed by differential equations. During the learning process, PINNs incorporate a physics-based regularization term within the loss function to enhance generalization performance. Since simulating dynamics controlled by partial differential equations (PDEs) can be computationally expensi…
▽ More
Physics-informed neural networks (PINNs) provide a framework to build surrogate models for dynamical systems governed by differential equations. During the learning process, PINNs incorporate a physics-based regularization term within the loss function to enhance generalization performance. Since simulating dynamics controlled by partial differential equations (PDEs) can be computationally expensive, PINNs have gained popularity in learning parametric surrogates for fluid flow problems governed by Navier-Stokes equations. In this work, we introduce RANS-PINN, a modified PINN framework, to predict flow fields (i.e., velocity and pressure) in high Reynolds number turbulent flow regimes. To account for the additional complexity introduced by turbulence, RANS-PINN employs a 2-equation eddy viscosity model based on a Reynolds-averaged Navier-Stokes (RANS) formulation. Furthermore, we adopt a novel training approach that ensures effective initialization and balance among the various components of the loss function. The effectiveness of the RANS-PINN framework is then demonstrated using a parametric PINN.
△ Less
Submitted 11 August, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
On bipartite circle graphs and Khovanov homology
Authors:
Apratim Chakraborty
Abstract:
We prove that independence complex of a bipartite circle graph is homotopy equivalent to a wedge of spheres, resolving a conjecture posed by Przytycki and Silvero. As a corollary, we obtain that extreme Khovanov spectrum, $\mathcal{X}_{j_{extreme}}$ is homotopy equivalent to a wedge of spheres. In particular, the extreme Khovanov homology has no torsion.
We prove that independence complex of a bipartite circle graph is homotopy equivalent to a wedge of spheres, resolving a conjecture posed by Przytycki and Silvero. As a corollary, we obtain that extreme Khovanov spectrum, $\mathcal{X}_{j_{extreme}}$ is homotopy equivalent to a wedge of spheres. In particular, the extreme Khovanov homology has no torsion.
△ Less
Submitted 21 March, 2023; v1 submitted 5 February, 2023;
originally announced February 2023.
-
Reconciling model-X and doubly robust approaches to conditional independence testing
Authors:
Ziang Niu,
Abhinav Chakraborty,
Oliver Dukes,
Eugene Katsevich
Abstract:
Model-X approaches to testing conditional independence between a predictor and an outcome variable given a vector of covariates usually assume exact knowledge of the conditional distribution of the predictor given the covariates. Nevertheless, model-X methodologies are often deployed with this conditional distribution learned in sample. We investigate the consequences of this choice through the le…
▽ More
Model-X approaches to testing conditional independence between a predictor and an outcome variable given a vector of covariates usually assume exact knowledge of the conditional distribution of the predictor given the covariates. Nevertheless, model-X methodologies are often deployed with this conditional distribution learned in sample. We investigate the consequences of this choice through the lens of the distilled conditional randomization test (dCRT). We find that Type-I error control is still possible, but only if the mean of the outcome variable given the covariates is estimated well enough. This demonstrates that the dCRT is doubly robust, and motivates a comparison to the generalized covariance measure (GCM) test, another doubly robust conditional independence test. We prove that these two tests are asymptotically equivalent, and show that the GCM test is optimal against (generalized) partially linear alternatives by leveraging semiparametric efficiency theory. In an extensive simulation study, we compare the dCRT to the GCM test. These two tests have broadly similar Type-I error and power, though dCRT can have somewhat better Type-I error control but somewhat worse power in small samples or when the response is discrete. We also find that post-lasso based test statistics (as compared to lasso based statistics) can dramatically improve Type-I error control for both methods.
△ Less
Submitted 8 February, 2023; v1 submitted 26 November, 2022;
originally announced November 2022.
-
On Elser's conjecture and the topology of $U$-nucleus complex
Authors:
Apratim Chakraborty,
Anupam Mondal,
Sajal Mukherjee,
Kuldeep Saha
Abstract:
Dorpalen-Barry et al. proved Elser's conjecture about sign of Elser's number by interpreting them as certain sums of reduced Euler characteristics of an abstract simplicial complex known as $U$-nucleus complex. We prove a conjecture posed by them regarding the homology of $U$-nucleus complex.
Dorpalen-Barry et al. proved Elser's conjecture about sign of Elser's number by interpreting them as certain sums of reduced Euler characteristics of an abstract simplicial complex known as $U$-nucleus complex. We prove a conjecture posed by them regarding the homology of $U$-nucleus complex.
△ Less
Submitted 14 March, 2023; v1 submitted 23 March, 2022;
originally announced March 2022.
-
Multigoal-oriented dual-weighted-residual error estimation using deep neural networks
Authors:
Ayan Chakraborty,
Thomas Wick,
Xiaoying Zhuang,
Timon Rabczuk
Abstract:
Deep learning has shown successful application in visual recognition and certain artificial intelligence tasks. Deep learning is also considered as a powerful tool with high flexibility to approximate functions. In the present work, functions with desired properties are devised to approximate the solutions of PDEs. Our approach is based on a posteriori error estimation in which the adjoint problem…
▽ More
Deep learning has shown successful application in visual recognition and certain artificial intelligence tasks. Deep learning is also considered as a powerful tool with high flexibility to approximate functions. In the present work, functions with desired properties are devised to approximate the solutions of PDEs. Our approach is based on a posteriori error estimation in which the adjoint problem is solved for the error localization to formulate an error estimator within the framework of neural network. An efficient and easy to implement algorithm is developed to obtain a posteriori error estimate for multiple goal functionals by employing the dual-weighted residual approach, which is followed by the computation of both primal and adjoint solutions using the neural network. The present study shows that such a data-driven model based learning has superior approximation of quantities of interest even with relatively less training data. The novel algorithmic developments are substantiated with numerical test examples. The advantages of using deep neural network over the shallow neural network are demonstrated and the convergence enhancing techniques are also presented
△ Less
Submitted 22 December, 2021; v1 submitted 21 December, 2021;
originally announced December 2021.
-
Some new symmetric structures in Ramsey theory
Authors:
Aninda Chakraborty,
Sayan Goswami
Abstract:
In this article, we will investigate several new configurations in Ramsey Theory, using the $\ostar_{l,k}$-operation on the set of integers, recently introduced in \cite{key-4}. This operation is useful to study symmetric structures in the set of integers, such as monochromatic configurations of the form $\left\{ x,y,x+y+xy\right\} $ as one of its simplest case. In \cite{key-4}, the author has stu…
▽ More
In this article, we will investigate several new configurations in Ramsey Theory, using the $\ostar_{l,k}$-operation on the set of integers, recently introduced in \cite{key-4}. This operation is useful to study symmetric structures in the set of integers, such as monochromatic configurations of the form $\left\{ x,y,x+y+xy\right\} $ as one of its simplest case. In \cite{key-4}, the author has studied more general symmetric structures. It has been shown that the Hindman's Theorem, van der Waerden's Theorem, Deuber's Theorem have their own symmetric versions. In this article we will explore several new structures, including polynomial versions of these symmetric structures and some of its variants. As a result, we get several new symmetric polynomial configurations as well as new linear symmetric patterns. In the final section, we will also introduce two new operations on the set of non-negative integers $\mathbb{N}$, to obtain further new configurations.
△ Less
Submitted 28 July, 2021;
originally announced July 2021.
-
An abstract formulation of image partition regularity
Authors:
Aninda Chakraborty,
Sayan Goswami
Abstract:
Inspired by the paper [1] of V. Bergelson, John H.Johnson Jr., J. Moreira, we formulate an abstract version of image partition regularity. To establish the result we have used a variant of first entry condition and for infinite case we contained our work to Milliken-Taylor system.
Inspired by the paper [1] of V. Bergelson, John H.Johnson Jr., J. Moreira, we formulate an abstract version of image partition regularity. To establish the result we have used a variant of first entry condition and for infinite case we contained our work to Milliken-Taylor system.
△ Less
Submitted 1 February, 2021; v1 submitted 20 January, 2021;
originally announced January 2021.
-
Cabling Legendrian and transverse knots
Authors:
Apratim Chakraborty,
John B. Etnyre,
Hyunki Min
Abstract:
In this paper we will show how to classify Legendrian and transverse knots in the knot type of "sufficiently positive" cables of a knot in terms of the classification of the underlying knot. We will also completely explain the phenomena of "Legendrian large" cables. These are Legendrian representatives of cables that have Thurston-Bennequin invariant larger that the framing coming from the cabling…
▽ More
In this paper we will show how to classify Legendrian and transverse knots in the knot type of "sufficiently positive" cables of a knot in terms of the classification of the underlying knot. We will also completely explain the phenomena of "Legendrian large" cables. These are Legendrian representatives of cables that have Thurston-Bennequin invariant larger that the framing coming from the cabling torus. Such examples have only recently, and unexpectedly, been found. We will also give criteria that determines the classification of Legendrian and transverse knots the the knot type of negative cables.
△ Less
Submitted 21 October, 2021; v1 submitted 22 December, 2020;
originally announced December 2020.
-
An analogue to infinitery Hales-Jewett theorem
Authors:
Aninda Chakraborty,
Sayan Goswami
Abstract:
In a recent work, N. Hindman, D. Strauss and L. Zamboni have shown that the Hales-Jewett theorem can be combined with a sufficiently well behaved homomorphisms. In this paper we will show that those combined extensions can be made if we replace the alphabet by an increasing sequence of alphabets, infact it holds for some Ramsey theoretic small sets. To obtained this we achieved some interesting co…
▽ More
In a recent work, N. Hindman, D. Strauss and L. Zamboni have shown that the Hales-Jewett theorem can be combined with a sufficiently well behaved homomorphisms. In this paper we will show that those combined extensions can be made if we replace the alphabet by an increasing sequence of alphabets, infact it holds for some Ramsey theoretic small sets. To obtained this we achieved some interesting configurations.
△ Less
Submitted 5 December, 2020;
originally announced December 2020.
-
Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data
Authors:
Yaofeng Desmond Zhong,
Biswadip Dey,
Amit Chakraborty
Abstract:
The last few years have witnessed an increased interest in incorporating physics-informed inductive bias in deep learning frameworks. In particular, a growing volume of literature has been exploring ways to enforce energy conservation while using neural networks for learning dynamics from observed time-series data. In this work, we survey ten recently proposed energy-conserving neural network mode…
▽ More
The last few years have witnessed an increased interest in incorporating physics-informed inductive bias in deep learning frameworks. In particular, a growing volume of literature has been exploring ways to enforce energy conservation while using neural networks for learning dynamics from observed time-series data. In this work, we survey ten recently proposed energy-conserving neural network models, including HNN, LNN, DeLaN, SymODEN, CHNN, CLNN and their variants. We provide a compact derivation of the theory behind these models and explain their similarities and differences. Their performance are compared in 4 physical systems. We point out the possibility of leveraging some of these energy-conserving models to design energy-based controllers.
△ Less
Submitted 28 April, 2023; v1 submitted 3 December, 2020;
originally announced December 2020.
-
Sparse Representations of Positive Functions via First and Second-Order Pseudo-Mirror Descent
Authors:
Abhishek Chakraborty,
Ketan Rajawat,
Alec Koppel
Abstract:
We consider expected risk minimization problems when the range of the estimator is required to be nonnegative, motivated by the settings of maximum likelihood estimation (MLE) and trajectory optimization. To facilitate nonlinear interpolation, we hypothesize that the search space is a Reproducing Kernel Hilbert Space (RKHS). We develop first and second-order variants of stochastic mirror descent e…
▽ More
We consider expected risk minimization problems when the range of the estimator is required to be nonnegative, motivated by the settings of maximum likelihood estimation (MLE) and trajectory optimization. To facilitate nonlinear interpolation, we hypothesize that the search space is a Reproducing Kernel Hilbert Space (RKHS). We develop first and second-order variants of stochastic mirror descent employing (i) \emph{pseudo-gradients} and (ii) complexity-reducing projections. Compressive projection in the first-order scheme is executed via kernel orthogonal matching pursuit (KOMP), which overcomes the fact that the vanilla RKHS parameterization grows unbounded with the iteration index in the stochastic setting. Moreover, pseudo-gradients are needed when gradient estimates for cost are only computable up to some numerical error, which arise in, e.g., integral approximations. Under constant step-size and compression budget, we establish tradeoffs between the radius of convergence of the expected sub-optimality and the projection budget parameter, as well as non-asymptotic bounds on the model complexity. To refine the solution's precision, we develop a second-order extension which employs recursively averaged pseudo-gradient outer-products to approximate the Hessian inverse, whose convergence in mean is established under an additional eigenvalue decay condition on the Hessian of the optimal RKHS element, which is unique to this work. Experiments demonstrate favorable performance on inhomogeneous Poisson Process intensity estimation in practice.
△ Less
Submitted 3 May, 2022; v1 submitted 13 November, 2020;
originally announced November 2020.
-
On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension
Authors:
Udari Madhushani,
Biswadip Dey,
Naomi Ehrich Leonard,
Amit Chakraborty
Abstract:
Value function based reinforcement learning (RL) algorithms, for example, $Q$-learning, learn optimal policies from datasets of actions, rewards, and state transitions. However, when the underlying state transition dynamics are stochastic and evolve on a high-dimensional space, generating independent and identically distributed (IID) data samples for creating these datasets poses a significant cha…
▽ More
Value function based reinforcement learning (RL) algorithms, for example, $Q$-learning, learn optimal policies from datasets of actions, rewards, and state transitions. However, when the underlying state transition dynamics are stochastic and evolve on a high-dimensional space, generating independent and identically distributed (IID) data samples for creating these datasets poses a significant challenge due to the intractability of the associated normalizing integral. In these scenarios, Hamiltonian Monte Carlo (HMC) sampling offers a computationally tractable way to generate data for training RL algorithms. In this paper, we introduce a framework, called \textit{Hamiltonian $Q$-Learning}, that demonstrates, both theoretically and empirically, that $Q$ values can be learned from a dataset generated by HMC samples of actions, rewards, and state transitions. Furthermore, to exploit the underlying low-rank structure of the $Q$ function, Hamiltonian $Q$-Learning uses a matrix completion algorithm for reconstructing the updated $Q$ function from $Q$ value updates over a much smaller subset of state-action pairs. Thus, by providing an efficient way to apply $Q$-learning in stochastic, high-dimensional settings, the proposed approach broadens the scope of RL algorithms for real-world applications.
△ Less
Submitted 28 March, 2022; v1 submitted 11 November, 2020;
originally announced November 2020.
-
High dimensional PCA: a new model selection criterion
Authors:
Abhinav Chakraborty,
Soumendu Sundar Mukherjee,
Arijit Chakrabarti
Abstract:
Given a random sample from a multivariate population, estimating the number of large eigenvalues of the population covariance matrix is an important problem in Statistics with wide applications in many areas. In the context of Principal Component Analysis (PCA), the linear combinations of the original variables having the largest amounts of variation are determined by this number. In this paper, w…
▽ More
Given a random sample from a multivariate population, estimating the number of large eigenvalues of the population covariance matrix is an important problem in Statistics with wide applications in many areas. In the context of Principal Component Analysis (PCA), the linear combinations of the original variables having the largest amounts of variation are determined by this number. In this paper, we study the high dimensional asymptotic regime where the number of variables grows at the same rate as the number of observations, and use the spiked covariance model proposed in Johnstone (2001), under which the problem reduces to model selection. Our focus is on the Akaike Information Criterion (AIC) which is known to be strongly consistent from the work of Bai et al. (2018). However, Bai et al. (2018) requires a certain "gap condition" ensuring the dominant eigenvalues to be above a threshold strictly larger than the BBP threshold (Baik et al. (2005), both quantities depending on the limiting ratio of the number of variables and observations. It is well-known that, below the BBP threshold, a spiked covariance structure becomes indistinguishable from one with no spikes. Thus the strong consistency of AIC requires some extra signal strength.
In this paper, we investigate whether consistency continues to hold even if the "gap" is made smaller. We show that strong consistency under arbitrarily small gap is achievable if we alter the penalty term of AIC suitably depending on the target gap. Furthermore, another intuitive alteration of the penalty can indeed make the gap exactly zero, although we can only achieve weak consistency in this case. We compare the two newly-proposed estimators with other existing estimators in the literature via extensive simulation studies, and show, by suitably calibrating our proposals, that a significant improvement in terms of mean-squared error is achievable.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.
-
Combined Algebraic Properties in Gaussian and Quaternion Ring
Authors:
Aninda Chakraborty
Abstract:
It is known that for an IP^{*} set A in (\mathbb{N},+) and a sequence \left\langle x_{n}\right\rangle _{n=1}^{\infty} in \mathbb{N}, there exists a sum subsystem \left\langle y_{n}\right\rangle _{n=1}^{\infty} of \left\langle x_{n}\right\rangle _{n=1}^{\infty} such that FS\left(\left\langle y_{n}\right\rangle _{n=1}^{\infty}\right)\cup FP\left(\left\langle y_{n}\right\rangle _{n=1}^{\infty}\right)…
▽ More
It is known that for an IP^{*} set A in (\mathbb{N},+) and a sequence \left\langle x_{n}\right\rangle _{n=1}^{\infty} in \mathbb{N}, there exists a sum subsystem \left\langle y_{n}\right\rangle _{n=1}^{\infty} of \left\langle x_{n}\right\rangle _{n=1}^{\infty} such that FS\left(\left\langle y_{n}\right\rangle _{n=1}^{\infty}\right)\cup FP\left(\left\langle y_{n}\right\rangle _{n=1}^{\infty}\right)\subseteq A. Similar types of results have also been proved for central^{*} sets and C^{*}-sets where the sequences have been considered from the class of minimal sequences and almost minimal sequences. In this present work, our aim to establish the similar type of results for the ring of Gaussian integers and the ring of integer quaternions.
△ Less
Submitted 18 October, 2020;
originally announced October 2020.
-
Abundance of Matrices In Gaussian Integers
Authors:
Aninda Chakraborty
Abstract:
In [HLS], N. Hindman, I. Leader and D. Strauss proved the abundance for a matrix with rational entries. In this paper we proved it for the ring of Gaussian integers. We showed the result when the matrix is taken with entries from \mathbb{Q}\left[i\right]. The main obstacle is in the field of complex numbers, no linear order relation exists. We overcome that in a tactful way.
In [HLS], N. Hindman, I. Leader and D. Strauss proved the abundance for a matrix with rational entries. In this paper we proved it for the ring of Gaussian integers. We showed the result when the matrix is taken with entries from \mathbb{Q}\left[i\right]. The main obstacle is in the field of complex numbers, no linear order relation exists. We overcome that in a tactful way.
△ Less
Submitted 18 October, 2020;
originally announced October 2020.
-
Hales-Jewett type configurations in small sets
Authors:
Aninda Chakraborty,
Sayan Goswami
Abstract:
In a recent work, N. Hindman, D. Strauss and L. Zamboni have shown that the Hales-Jewett theorem can be combined with a sufficiently well behaved homomorphisms. Their work was completely algebraic in nature, where they have used the algebra of Stone-Cech compactification of discrete semigroup. They have proved the existence of those configurations in piecewise syndetic sets, which is a Ramsey theo…
▽ More
In a recent work, N. Hindman, D. Strauss and L. Zamboni have shown that the Hales-Jewett theorem can be combined with a sufficiently well behaved homomorphisms. Their work was completely algebraic in nature, where they have used the algebra of Stone-Cech compactification of discrete semigroup. They have proved the existence of those configurations in piecewise syndetic sets, which is a Ramsey theoretic rich set. In our work we will show those forms are still present in very small but Ramsey theoretic sets, (like J-set, C-set) and our proof is purely elementary in nature.
△ Less
Submitted 2 December, 2021; v1 submitted 1 September, 2020;
originally announced September 2020.
-
Polynomial central set theorem near zero
Authors:
Aninda Chakraborty,
Sayan Goswami
Abstract:
N. Hindman and I. Leader introduced the set of ultrafilters 0+ on (0,1) and characterize smallest ideal of (0+,+) and proved the Central Set Theorem near zero. Recently Polynomial Central Set Theorem has been proved by V. Bergelson, J. H. Johnson Jr. and J. Moreira. In this article, we will prove Polynomial Central Set Theorem near zero.
N. Hindman and I. Leader introduced the set of ultrafilters 0+ on (0,1) and characterize smallest ideal of (0+,+) and proved the Central Set Theorem near zero. Recently Polynomial Central Set Theorem has been proved by V. Bergelson, J. H. Johnson Jr. and J. Moreira. In this article, we will prove Polynomial Central Set Theorem near zero.
△ Less
Submitted 18 December, 2019;
originally announced December 2019.
-
Synthesis of Feedback Controller for Nonlinear Control Systems with Optimal Region of Attraction
Authors:
Ayan Chakraborty,
Indranil Saha
Abstract:
We propose a framework for synthesizing a feedback control policy that maximizes the region of attraction (ROA) of a closed-loop nonlinear dynamical system. Our synthesis technique relies on stochastic optimization, which involves computation of an objective function capturing the ROA for a feedback control law. We employ a machine learning technique based on deep neural network to estimate the RO…
▽ More
We propose a framework for synthesizing a feedback control policy that maximizes the region of attraction (ROA) of a closed-loop nonlinear dynamical system. Our synthesis technique relies on stochastic optimization, which involves computation of an objective function capturing the ROA for a feedback control law. We employ a machine learning technique based on deep neural network to estimate the ROA for a given feedback controller. Overall, our technique is capable of synthesizing a controller co-optimizing traditional control objectives like LQR cost together with ROA. We demonstrate the efficacy of our technique through exhaustive experiments carried out on various nonlinear systems.
△ Less
Submitted 26 April, 2020; v1 submitted 10 November, 2019;
originally announced November 2019.
-
Invariants of annular links, cobordisms and transverse links from combinatorial link Floer complex
Authors:
Apratim Chakraborty
Abstract:
We define an annular concordance invariant and study its properties. When specialized to braids, this invariant gives bounds on band rank. We introduce a modified chain complex to reformulate the invariant. Then, by focusing on a special case, we give a refinement of the transverse invariant $\hatθ$. We also study the relationship of this invariant with transverse and braid monodromy properties.
We define an annular concordance invariant and study its properties. When specialized to braids, this invariant gives bounds on band rank. We introduce a modified chain complex to reformulate the invariant. Then, by focusing on a special case, we give a refinement of the transverse invariant $\hatθ$. We also study the relationship of this invariant with transverse and braid monodromy properties.
△ Less
Submitted 16 October, 2019;
originally announced October 2019.
-
$IP^\star$ set in product space of countable adequate commutative partial semigroups
Authors:
Aninda Chakraborty
Abstract:
A partial semigroup is a set with restricted binary operation. In this work we will extend a result due to V. Bergelson and N. Hindman concerning the rich structure presented in the product space of semigroups to partial semigroup. An $IP^{\star}$ set in a semigroup is a set that intersect every set of the form $\left\{ FS(x_{n})_{n=1}^{\infty}:x_{n}\in S\right\} $. V. Bergelson and N. Hindman pro…
▽ More
A partial semigroup is a set with restricted binary operation. In this work we will extend a result due to V. Bergelson and N. Hindman concerning the rich structure presented in the product space of semigroups to partial semigroup. An $IP^{\star}$ set in a semigroup is a set that intersect every set of the form $\left\{ FS(x_{n})_{n=1}^{\infty}:x_{n}\in S\right\} $. V. Bergelson and N. Hindman proved that if $S_{1},S_{2},\ldots,S_{l}$ are finite collection of commutative semigroup, then under certain condition, an $IP^{\star}$ set in $S_{1}\times S_{2}\times\ldots\times S_{l}$ contains cartesian products of arbitrarily large finite substructures of the form $FS\left(x_{1,n}\right)_{n=1}^{\infty}\times FS\left(x_{2,n}\right)_{n=1}^{\infty}\times\ldots\times FS\left(x_{l,n}\right)_{n=1}^{\infty}$. In this work we will extend this result to countable adequate commutative partial semigroup.
△ Less
Submitted 22 September, 2019;
originally announced September 2019.
-
On Statistical Properties of A Veracity Scoring Method for Spatial Data
Authors:
Arnab Chakraborty,
Soumendra N. Lahiri
Abstract:
Measuring veracity or reliability of noisy data is of utmost importance, especially in the scenarios where the information are gathered through automated systems. In a recent paper, Chakraborty et. al. (2019) have introduced a veracity scoring technique for geostatistical data. The authors have used a high-quality `reference' data to measure the veracity of the varying-quality observations and inc…
▽ More
Measuring veracity or reliability of noisy data is of utmost importance, especially in the scenarios where the information are gathered through automated systems. In a recent paper, Chakraborty et. al. (2019) have introduced a veracity scoring technique for geostatistical data. The authors have used a high-quality `reference' data to measure the veracity of the varying-quality observations and incorporated the veracity scores in their analysis of mobile-sensor generated noisy weather data to generate efficient predictions of the ambient temperature process. In this paper, we consider the scenario when no reference data is available and hence, the veracity scores (referred as VS) are defined based on `local' summaries of the observations. We develop a VS-based estimation method for parameters of a spatial regression model. Under a non-stationary noise structure and fairly general assumptions on the underlying spatial process, we show that the VS-based estimators of the regression parameters are consistent. Moreover, we establish the advantage of the VS-based estimators as compared to the ordinary least squares (OLS) estimator by analyzing their asymptotic mean squared errors. We illustrate the merits of the VS-based technique through simulations and apply the methodology to a real data set on mass percentages of ash in coal seams in Pennsylvania.
△ Less
Submitted 20 June, 2019;
originally announced June 2019.
-
Sets with arithmetic progressions are abundant
Authors:
Aninda Chakraborty,
Sayan Goswami
Abstract:
Furstenberg, Glasscock, Bergelson, Beiglboeck have been studied abundance in arithmatic progression on various large sets like piecewise syndetic, central, thick, etc. but also there are so many sets in which abundance in progression is still unsettled like J-sets, C-sets, D-sets etc. But all of these sets have a common property that they contains arbitrary length of arithmatic progressions. These…
▽ More
Furstenberg, Glasscock, Bergelson, Beiglboeck have been studied abundance in arithmatic progression on various large sets like piecewise syndetic, central, thick, etc. but also there are so many sets in which abundance in progression is still unsettled like J-sets, C-sets, D-sets etc. But all of these sets have a common property that they contains arbitrary length of arithmatic progressions. These type of sets are called sets of A.P. rich, we have given an elementary proof of abundance of those sets.
△ Less
Submitted 5 May, 2019;
originally announced May 2019.
-
Richness of arithmetic progression in commutative semigroup
Authors:
Aninda Chakraborty,
Sayan Goswami
Abstract:
Furstenberg and Glasner proved that for an arbitrary k in N, any piecewise syndetic set contains k term arithmetic progressions and such collection is also piecewise syndetic in Z: They used algebraic structure of beta N. The above result was extended for arbitrary semigroups by Bergelson and Hindman, again using the structure of Stone-Cech compactification of general semigroup. Beiglboeck provide…
▽ More
Furstenberg and Glasner proved that for an arbitrary k in N, any piecewise syndetic set contains k term arithmetic progressions and such collection is also piecewise syndetic in Z: They used algebraic structure of beta N. The above result was extended for arbitrary semigroups by Bergelson and Hindman, again using the structure of Stone-Cech compactification of general semigroup. Beiglboeck provided an elementary proof of the above result and asked whether the combinatorial argument in his proof can be enhanced in a way which makes it applicable to a more abstract setting. In a recent work the second author of this paper and S.Jana provided an affirmative answer to Beiglboeck's question for countable commutative semigroup. In this work we will extend the result of Beiglboeck in different type of settings.
△ Less
Submitted 22 April, 2019;
originally announced April 2019.
-
Transverse and Legendrian invariants of cables in combinatorial link Floer homology
Authors:
Apratim Chakraborty
Abstract:
We study the Ozsváth-Szabó-Thurston transverse invariant in combinatorial link Floer homology for certain transverse cables $\mathscr{L}_{p,q}$ of transverse link $L$ in $S^3$. Transverse cables $\mathscr{L}_{p,q}$ are constructed from the grid diagram of $L$. The main result is $\hatθ(\mathscr{L}_{p,q})=0$ if and only if $\hatθ(L)=0$ for $\frac{q}{p}$ sufficiently large. We also prove a similar r…
▽ More
We study the Ozsváth-Szabó-Thurston transverse invariant in combinatorial link Floer homology for certain transverse cables $\mathscr{L}_{p,q}$ of transverse link $L$ in $S^3$. Transverse cables $\mathscr{L}_{p,q}$ are constructed from the grid diagram of $L$. The main result is $\hatθ(\mathscr{L}_{p,q})=0$ if and only if $\hatθ(L)=0$ for $\frac{q}{p}$ sufficiently large. We also prove a similar result for invariants of Legendrian knots. Our proof uses an inclusion map $i$ of certain grid complexes associated to $L$ and $L_{p,q}$. We use these results to generate many infinite families of examples of Legendrian and transversely non-simple topological link types.
△ Less
Submitted 1 October, 2021; v1 submitted 28 March, 2019;
originally announced March 2019.
-
Estimation of recurrence for nilpotent group action
Authors:
Aninda Chakraborty,
Dibyendu De,
Sayan Goswami
Abstract:
We estimate size of recurrence of an action of a nilpotent group by homeomorphisms of a compact space for polynomial map**s into a nilpotent group form the partial semigroup $(\mathcal{P}_{f}(\mathbb{N}),\uplus)$. To do this we have used algebraic structure of the Stone-Čech copactification partial semigroup and that of the given nilpotent group.
We estimate size of recurrence of an action of a nilpotent group by homeomorphisms of a compact space for polynomial map**s into a nilpotent group form the partial semigroup $(\mathcal{P}_{f}(\mathbb{N}),\uplus)$. To do this we have used algebraic structure of the Stone-Čech copactification partial semigroup and that of the given nilpotent group.
△ Less
Submitted 11 September, 2018;
originally announced October 2018.
-
Non uniform weighted extended B-Spline finite element analysis of non linear elliptic partial differential equations
Authors:
B. V. Rathish Kumar,
Ayan Chakraborty
Abstract:
We propose a non uniform web spline based finite element analysis for elliptic partial differential equation with the gradient type nonlinearity in their principal coefficients like p-laplacian equation and Quasi-Newtonian fluid flow equations. We discuss the well-posednes of the problems and also derive the apriori error estimates for the proposed finite element analysis and obtain convergence ra…
▽ More
We propose a non uniform web spline based finite element analysis for elliptic partial differential equation with the gradient type nonlinearity in their principal coefficients like p-laplacian equation and Quasi-Newtonian fluid flow equations. We discuss the well-posednes of the problems and also derive the apriori error estimates for the proposed finite element analysis and obtain convergence rate of $\mathcal{O}(h^α)$ for $α> 0$.
△ Less
Submitted 30 June, 2018;
originally announced July 2018.
-
Web spline error estimation of non-cooperative elliptic equations for population dynamics
Authors:
Ayan Chakraborty,
B. V. Rathish Kumar
Abstract:
We analyze the error of the WEB-S finite element method applied to elliptic systems with non-cooperative dominant coupling,with a mixed Dirichlet/Neumann/Robin boundary condition. This problem is strongly related to a posteriori error estimates, giving computable bounds for computational errors and detecting zones in the solution domain where such errors are too large and certain mesh refinements…
▽ More
We analyze the error of the WEB-S finite element method applied to elliptic systems with non-cooperative dominant coupling,with a mixed Dirichlet/Neumann/Robin boundary condition. This problem is strongly related to a posteriori error estimates, giving computable bounds for computational errors and detecting zones in the solution domain where such errors are too large and certain mesh refinements should be performed. These results are based on an extensive regularity analysis of the interface problems of concern.Finally, the error analysis is illustrated by numerical experiments.
△ Less
Submitted 24 June, 2018;
originally announced June 2018.
-
Weighted Extended B-Spline Finite Element Analysis of a coupled system of general Elliptic equations
Authors:
Ayan Chakraborty,
BV. Rathish Kumar
Abstract:
In this study we establish the existence and uniqueness of the solution of a coupled system of general elliptic equations with anisotropic diffusion , non-uniform advection and variably influencing reaction terms on Lipschitz continuous domain $Ω\subset \mathbb{R}^m $ (m$\geq$1) with a Dirichlet boundary. Later we consider the finite element (FE) approximation of the coupled equations in a meshles…
▽ More
In this study we establish the existence and uniqueness of the solution of a coupled system of general elliptic equations with anisotropic diffusion , non-uniform advection and variably influencing reaction terms on Lipschitz continuous domain $Ω\subset \mathbb{R}^m $ (m$\geq$1) with a Dirichlet boundary. Later we consider the finite element (FE) approximation of the coupled equations in a meshless framework based on weighted extended B-Spine functions (WEBS).The a priori error estimates corresponding to the finite element analysis are derived to establish the convergence of the corresponding FE scheme and the numerical methodology has been tested on few examples.
△ Less
Submitted 24 June, 2018;
originally announced June 2018.
-
A Nonparametric Ensemble Binary Classifier and its Statistical Properties
Authors:
Tanujit Chakraborty,
Ashis Kumar Chakraborty,
C. A. Murthy
Abstract:
In this work, we propose an ensemble of classification trees (CT) and artificial neural networks (ANN). Several statistical properties including universal consistency and upper bound of an important parameter of the proposed classifier are shown. Numerical evidence is also provided using various real life data sets to assess the performance of the model. Our proposed nonparametric ensemble classif…
▽ More
In this work, we propose an ensemble of classification trees (CT) and artificial neural networks (ANN). Several statistical properties including universal consistency and upper bound of an important parameter of the proposed classifier are shown. Numerical evidence is also provided using various real life data sets to assess the performance of the model. Our proposed nonparametric ensemble classifier doesn't suffer from the `curse of dimensionality' and can be used in a wide variety of feature selection cum classification problems. Performance of the proposed model is quite better when compared to many other state-of-the-art models used for similar situations.
△ Less
Submitted 18 September, 2018; v1 submitted 29 April, 2018;
originally announced April 2018.
-
Enlarging Maurer-Cartan form via Kronecker product and construction of Coupled Integrable systems by Nilpotent, Hadamard, Idempotent and K-idempotent matrix
Authors:
Arindam Chakraborty
Abstract:
Coupled nonlinear integrable systems are generated from usual zero curvature equation. The relevant Maurer-Cartan forms are constructed by combining suitably chosen matrices (nilpotent, Hadamard, idempotent and k-idempotent) and Lie algebraic elements via Kronecker product. In each case a closure type property among the matrices chosen is found to be playing a key role to produce both the coupling…
▽ More
Coupled nonlinear integrable systems are generated from usual zero curvature equation. The relevant Maurer-Cartan forms are constructed by combining suitably chosen matrices (nilpotent, Hadamard, idempotent and k-idempotent) and Lie algebraic elements via Kronecker product. In each case a closure type property among the matrices chosen is found to be playing a key role to produce both the coupling and nonlinearity present in the system of equations obtained. The method is highly flexible and can be used to construct general systems containing 'p' number of equations. It is also shown that these new equations can be written in the Hamiltonian form (with a preassigned symplectic operator) with the trace identity introduced by Tu. Since the Lax operator is known one can obtain the hereditary operators signifying the complete integrability. Various properties of Kronecker product are found to be useful in our construction.
△ Less
Submitted 22 September, 2017;
originally announced September 2017.
-
Congestion Barcodes: Exploring the Topology of Urban Congestion Using Persistent Homology
Authors:
Yu Wu,
Gabriel Shindnes,
Vaibhav Karve,
Derrek Yager,
Daniel B. Work,
Arnab Chakraborty,
Richard B. Sowers
Abstract:
This work presents a new method to quantify connectivity in transportation networks. Inspired by the field of topological data analysis, we propose a novel approach to explore the robustness of road network connectivity in the presence of congestion on the roadway. The robustness of the pattern is summarized in a congestion barcode, which can be constructed directly from traffic datasets commonly…
▽ More
This work presents a new method to quantify connectivity in transportation networks. Inspired by the field of topological data analysis, we propose a novel approach to explore the robustness of road network connectivity in the presence of congestion on the roadway. The robustness of the pattern is summarized in a congestion barcode, which can be constructed directly from traffic datasets commonly used for navigation. As an initial demonstration, we illustrate the main technique on a publicly available traffic dataset in a neighborhood in New York City.
△ Less
Submitted 19 July, 2017;
originally announced July 2017.
-
The Nearest Hermitian Inverse Eigenvalue Problem Solution with Respect to the 2-Norm
Authors:
Marcel Padilla,
Benedikt Kolbe,
Aniruddha Chakraborty
Abstract:
Assume that the eigenvalues of a finite hermitian linear operator have been deduced accurately but the linear operator itself could not be determined with precision. Given a set of eigenvalues $λ$ and a hermitian matrix $M$, this paper will explain, with proofs, how to find a hermitian matrix $A$ with the desired eigenvalues $λ$ that is as close as possible to the given operator $M$ according to t…
▽ More
Assume that the eigenvalues of a finite hermitian linear operator have been deduced accurately but the linear operator itself could not be determined with precision. Given a set of eigenvalues $λ$ and a hermitian matrix $M$, this paper will explain, with proofs, how to find a hermitian matrix $A$ with the desired eigenvalues $λ$ that is as close as possible to the given operator $M$ according to the operator 2-norm metric. Furthermore the effects of this solution are put to a test using random matrices and grayscale images which evidently show the smoothing property of eigenvalue corrections.
△ Less
Submitted 2 March, 2017;
originally announced March 2017.
-
Bayesian sparse multiple regression for simultaneous rank reduction and variable selection
Authors:
Antik Chakraborty,
Anirban Bhattacharya,
Bani K. Mallick
Abstract:
We develop a Bayesian methodology aimed at simultaneously estimating low-rank and row-sparse matrices in a high-dimensional multiple-response linear regression model. We consider a carefully devised shrinkage prior on the matrix of regression coefficients which obviates the need to specify a prior on the rank, and shrinks the regression matrix towards low-rank and row-sparse structures. We provide…
▽ More
We develop a Bayesian methodology aimed at simultaneously estimating low-rank and row-sparse matrices in a high-dimensional multiple-response linear regression model. We consider a carefully devised shrinkage prior on the matrix of regression coefficients which obviates the need to specify a prior on the rank, and shrinks the regression matrix towards low-rank and row-sparse structures. We provide theoretical support to the proposed methodology by proving minimax optimality of the posterior mean under the prediction risk in ultra-high dimensional settings where the number of predictors can grow sub-exponentially relative to the sample size. A one-step post-processing scheme induced by group lasso penalties on the rows of the estimated coefficient matrix is proposed for variable selection, with default choices of tuning parameters. We additionally provide an estimate of the rank using a novel optimization function achieving dimension reduction in the covariate space. We exhibit the performance of the proposed methodology in an extensive simulation study and a real data example.
△ Less
Submitted 8 April, 2019; v1 submitted 2 December, 2016;
originally announced December 2016.
-
Hybrid Regularisation of Functional Linear Models
Authors:
Anirvan Chakraborty,
Victor M. Panaretos
Abstract:
We consider the problem of estimating the slope function in a functional regression with a scalar response and a functional covariate. This central problem of functional data analysis is well known to be ill-posed, thus requiring a regularised estimation procedure. The two most commonly used approaches are based on spectral truncation or Tikhonov regularisation of the empirical covariance operator…
▽ More
We consider the problem of estimating the slope function in a functional regression with a scalar response and a functional covariate. This central problem of functional data analysis is well known to be ill-posed, thus requiring a regularised estimation procedure. The two most commonly used approaches are based on spectral truncation or Tikhonov regularisation of the empirical covariance operator. In principle, Tikhonov regularisation is the more canonical choice. Compared to spectral truncation, it is robust to eigenvalue ties, while it attains the optimal minimax rate of convergence in the mean squared sense, and not just in a concentration probability sense. In this paper, we show that, surprisingly, one can strictly improve upon the performance of the Tikhonov estimator in finite samples by means of a linear estimator, while retaining its stability and asymptotic properties by combining it with a form of spectral truncation. Specifically, we construct an estimator that additively decomposes the functional covariate by projecting it onto two orthogonal subspaces defined via functional PCA; it then applies Tikhonov regularisation to the one component, while leaving the other component unregularised. We prove that when the covariate is Gaussian, this hybrid estimator uniformly improves upon the MSE of the Tikhonov estimator in a non-asymptotic sense, effectively rendering it inadmissible. This domination is shown to also persist under discrete observation of the covariate function. The hybrid estimator is linear, straightforward to construct in practice, and with no computational overhead relative to the standard regularisation methods. By means of simulation, it is shown to furnish sizeable gains even for modest sample sizes.
△ Less
Submitted 4 October, 2016;
originally announced October 2016.
-
Proximal gradient method for huberized support vector machine
Authors:
Yangyang Xu,
Ioannis Akrotirianakis,
Amit Chakraborty
Abstract:
The Support Vector Machine (SVM) has been used in a wide variety of classification problems. The original SVM uses the hinge loss function, which is non-differentiable and makes the problem difficult to solve in particular for regularized SVMs, such as with $\ell_1$-regularization. This paper considers the Huberized SVM (HSVM), which uses a differentiable approximation of the hinge loss function.…
▽ More
The Support Vector Machine (SVM) has been used in a wide variety of classification problems. The original SVM uses the hinge loss function, which is non-differentiable and makes the problem difficult to solve in particular for regularized SVMs, such as with $\ell_1$-regularization. This paper considers the Huberized SVM (HSVM), which uses a differentiable approximation of the hinge loss function. We first explore the use of the Proximal Gradient (PG) method to solving binary-class HSVM (B-HSVM) and then generalize it to multi-class HSVM (M-HSVM). Under strong convexity assumptions, we show that our algorithm converges linearly. In addition, we give a finite convergence result about the support of the solution, based on which we further accelerate the algorithm by a two-stage method. We present extensive numerical experiments on both synthetic and real datasets which demonstrate the superiority of our methods over some state-of-the-art methods for both binary- and multi-class SVMs.
△ Less
Submitted 30 November, 2015;
originally announced November 2015.
-
Alternating direction method of multipliers for regularized multiclass support vector machines
Authors:
Yangyang Xu,
Ioannis Akrotirianakis,
Amit Chakraborty
Abstract:
The support vector machine (SVM) was originally designed for binary classifications. A lot of effort has been put to generalize the binary SVM to multiclass SVM (MSVM) which are more complex problems. Initially, MSVMs were solved by considering their dual formulations which are quadratic programs and can be solved by standard second-order methods. However, the duals of MSVMs with regularizers are…
▽ More
The support vector machine (SVM) was originally designed for binary classifications. A lot of effort has been put to generalize the binary SVM to multiclass SVM (MSVM) which are more complex problems. Initially, MSVMs were solved by considering their dual formulations which are quadratic programs and can be solved by standard second-order methods. However, the duals of MSVMs with regularizers are usually more difficult to formulate and computationally very expensive to solve. This paper focuses on several regularized MSVMs and extends the alternating direction method of multiplier (ADMM) to these MSVMs. Using a splitting technique, all considered MSVMs are written as two-block convex programs, for which the ADMM has global convergence guarantees. Numerical experiments on synthetic and real data demonstrate the high efficiency and accuracy of our algorithms.
△ Less
Submitted 29 November, 2015;
originally announced November 2015.
-
Tests for high dimensional data based on means, spatial signs and spatial ranks
Authors:
Anirvan Chakraborty,
Probal Chaudhuri
Abstract:
Tests based on sample mean vectors and sample spatial signs have been studied in the recent literature for high dimensional data with the dimension larger than the sample size. For suitable sequences of alternatives, we show that the powers of the mean based tests and the tests based on spatial signs and ranks tend to be same as the data dimension grows to infinity for any sample size, when the co…
▽ More
Tests based on sample mean vectors and sample spatial signs have been studied in the recent literature for high dimensional data with the dimension larger than the sample size. For suitable sequences of alternatives, we show that the powers of the mean based tests and the tests based on spatial signs and ranks tend to be same as the data dimension grows to infinity for any sample size, when the coordinate variables satisfy appropriate mixing conditions. Further, their limiting powers do not depend on the heaviness of the tails of the distributions. This is in striking contrast to the asymptotic results obtained in the classical multivariate setup. On the other hand, we show that in the presence of stronger dependence among the coordinate variables, the spatial sign and rank based tests for high dimensional data can be asymptotically more powerful than the mean based tests if in addition to the data dimension, the sample size also grows to infinity. The sizes of some mean based tests for high dimensional data studied in the recent literature are observed to be significantly different from their nominal levels. This is due to the inadequacy of the asymptotic approximations used for the distributions of those test statistics. However, our asymptotic approximations for the tests based on spatial signs and ranks are observed to work well when the tests are applied on a variety of simulated and real datasets.
△ Less
Submitted 21 May, 2015;
originally announced May 2015.
-
The deepest point for distributions in infinite dimensional spaces
Authors:
Anirvan Chakraborty,
Probal Chaudhuri
Abstract:
Identification of the center of a data cloud is one of the basic problems in statistics. One popular choice for such a center is the median, and several versions of median in finite dimensional spaces have been studied in the literature. In particular, medians based on different notions of data depth have been extensively studied by many researchers, who defined median as the point, where the dept…
▽ More
Identification of the center of a data cloud is one of the basic problems in statistics. One popular choice for such a center is the median, and several versions of median in finite dimensional spaces have been studied in the literature. In particular, medians based on different notions of data depth have been extensively studied by many researchers, who defined median as the point, where the depth function attains its maximum value. In other words, the median is the deepest point in the sample space according to that definition. In this paper, we investigate the deepest point for probability distributions in infinite dimensional spaces. We show that for some well-known depth functions like the band depth and the half-region depth in function spaces, there may not be any meaningful deepest point for many well-known and commonly used probability models. On the other hand, certain modified versions of those depth functions as well as the spatial depth function, which can be defined in any Hilbert space, lead to some useful notions of the deepest point with nice geometric and statistical properties. The empirical versions of those deepest points can be conveniently computed for functional data, and we demonstrate this using some simulated and real data sets.
△ Less
Submitted 12 February, 2014;
originally announced February 2014.
-
ROOT: Energy Efficient Routing through Optimized Tree in Sensor Networks
Authors:
Kaushik Chakraborty,
Ayon Chakraborty,
Swarup Kumar Mitra,
Mrinal Kanti Naskar
Abstract:
This paper has been withdrawn by the author due to a crucial sign error in equation 1
This paper has been withdrawn by the author due to a crucial sign error in equation 1
△ Less
Submitted 26 June, 2011; v1 submitted 10 May, 2011;
originally announced May 2011.
-
Energy Efficient Routing in Wireless Sensor Networks: A Genetic Approach
Authors:
Ayon Chakraborty,
Swarup Kumar Mitra,
Mrinal Kanti Naskar
Abstract:
This paper has been withdrawn by the author due to a crucial sign error in equation 1
This paper has been withdrawn by the author due to a crucial sign error in equation 1
△ Less
Submitted 26 June, 2011; v1 submitted 10 May, 2011;
originally announced May 2011.