Showing 1–32 of 32 results for author: Banerjee, T

  1. arXiv:2401.15519  [pdf, other]

    eess.SP stat.ME

    Large Deviation Analysis of Score-based Hypothesis Testing

    Authors: Enmao Diao, Taposh Banerjee, Vahid Tarokh

    Abstract: Score-based statistical models play an important role in modern machine learning, statistics, and signal processing. For hypothesis testing, a score-based hypothesis test is proposed in \cite{wu2022score}. We analyze the performance of this score-based hypothesis testing procedure and derive upper bounds on the probabilities of its Type I and II errors. We prove that the exponents of our error bou…

    Submitted 3 February, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

  2. arXiv:2310.09673  [pdf, other]

    stat.ME eess.SP math.ST

    Robust Quickest Change Detection in Non-Stationary Processes

    Authors: Yingze Hou, Yousef Oleyaeimotlagh, Rahul Mishra, Hoda Bidkhori, Taposh Banerjee

    Abstract: Optimal algorithms are developed for robust detection of changes in non-stationary processes. These are processes in which the distribution of the data after change varies with time. The decision-maker does not have access to precise information on the post-change distribution. It is shown that if the post-change non-stationary family has a distribution that is least favorable in a well-defined se…

    Submitted 14 October, 2023; originally announced October 2023.

  3. arXiv:2308.11026  [pdf, other]

    stat.ME

    Harnessing The Collective Wisdom: Fusion Learning Using Decision Sequences From Diverse Sources

    Authors: Trambak Banerjee, Bowen Gang, Jianliang He

    Abstract: Learning from the collective wisdom of crowds enhances the transparency of scientific findings by incorporating diverse perspectives into the decision-making process. Synthesizing such collective wisdom is related to the statistical notion of fusion learning from multiple data sources or studies. However, fusing inferences from diverse sources is challenging since cross-source heterogeneity and po…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 29 pages and 10 figures. Under review at a journal

  4. arXiv:2308.05883  [pdf, other]

    stat.ME stat.ML

    Empirical Bayes Estimation with Side Information: A Nonparametric Integrative Tweedie Approach

    Authors: Jiajun Luo, Trambak Banerjee, Gourab Mukherjee, Wenguang Sun

    Abstract: We investigate the problem of compound estimation of normal means while accounting for the presence of side information. Leveraging the empirical Bayes framework, we develop a nonparametric integrative Tweedie (NIT) approach that incorporates structural knowledge encoded in multivariate auxiliary data to enhance the precision of compound estimation. Our approach employs convex optimization tools t…

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: The paper is based on Chapter 2 of Jiajun Luo's PhD thesis, which is archived at https://digitallibrary.usc.edu/asset-management/2A3BF1OVJ47TH

  5. arXiv:2306.07362  [pdf, other]

    stat.ME

    Large-Scale Multiple Testing of Composite Null Hypotheses Under Heteroskedasticity

    Authors: Bowen Gang, Trambak Banerjee

    Abstract: Heteroskedasticity poses several methodological challenges in designing valid and powerful procedures for simultaneous testing of composite null hypotheses. In particular, the conventional practice of standardizing or re-scaling heteroskedastic test statistics in this setting may severely affect the power of the underlying multiple testing procedure. Additionally, when the inferential parameter of…

    Submitted 12 June, 2023; originally announced June 2023.

  6. arXiv:2306.05091  [pdf, other]

    stat.ME eess.SP

    Robust Quickest Change Detection for Unnormalized Models

    Authors: Suya Wu, Enmao Diao, Taposh Banerjee, Jie Ding, Vahid Tarokh

    Abstract: Detecting an abrupt and persistent change in the underlying distribution of online data streams is an important problem in many applications. This paper proposes a new robust score-based algorithm called RSCUSUM, which can be applied to unnormalized models and addresses the issue of unknown post-change distributions. RSCUSUM replaces the Kullback-Leibler divergence with the Fisher divergence betwe…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted for the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023). arXiv admin note: text overlap with arXiv:2302.00250

  7. arXiv:2304.13848  [pdf, ps, other]

    stat.ME stat.CO stat.ML

    Bootstrapped Edge Count Tests for Nonparametric Two-Sample Inference Under Heterogeneity

    Authors: Trambak Banerjee, Bhaswar B. Bhattacharya, Gourab Mukherjee

    Abstract: Nonparametric two-sample testing is a classical problem in inferential statistics. While modern two-sample tests, such as the edge count test and its variants, can handle multivariate and non-Euclidean data, contemporary gargantuan datasets often exhibit heterogeneity due to the presence of latent subpopulations. Direct application of these tests, without regulating for such heterogeneity, may lea…

    Submitted 26 April, 2023; originally announced April 2023.

  8. arXiv:2303.02826  [pdf, other]

    stat.ME eess.SP math.ST

    Quickest Change Detection in Statistically Periodic Processes with Unknown Post-Change Distribution

    Authors: Yousef Oleyaeimotlagh, Taposh Banerjee, Ahmad Taha, Eugene John

    Abstract: Algorithms are developed for the quickest detection of a change in statistically periodic processes. These are processes in which the statistical properties are nonstationary but repeat after a fixed time interval. It is assumed that the pre-change law is known to the decision maker but the post-change law is unknown. In this framework, three families of problems are studied: robust quickest chang…

    Submitted 5 March, 2023; originally announced March 2023.

  9. arXiv:2302.00250  [pdf, other]

    stat.ML cs.LG

    Quickest Change Detection for Unnormalized Statistical Models

    Authors: Suya Wu, Enmao Diao, Taposh Banerjee, Jie Ding, Vahid Tarokh

    Abstract: Classical quickest change detection algorithms require modeling pre-change and post-change distributions. Such an approach may not be feasible for various machine learning models because of the complexity of computing the explicit distributions. Additionally, these methods may suffer from a lack of robustness to model mismatch and noise. This paper develops a new variant of the classical Cumulativ…

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: A version of this paper has been accepted by the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  10. arXiv:2208.03908  [pdf, other]

    stat.AP

    Do financial regulators act in the public's interest? A Bayesian latent class estimation framework for assessing regulatory responses to banking crises

    Authors: Padma Sharma, Trambak Banerjee

    Abstract: When banks fail amidst financial crises, the public criticizes regulators for bailing out or liquidating specific banks, especially the ones that gain attention due to their size or dominance. A comprehensive assessment of regulators, however, requires examining all their decisions, and not just specific ones, against the regulator's dual objective of preserving financial stability while discourag…

    Submitted 8 August, 2022; originally announced August 2022.

  11. arXiv:2201.07026  [pdf, other]

    cs.LG econ.EM stat.ML

    Socioeconomic disparities and COVID-19: the causal connections

    Authors: Tannista Banerjee, Ayan Paul, Vishak Srikanth, Inga Strümke

    Abstract: The analysis of causation is a challenging task that can be approached in various ways. With the increasing use of machine learning based models in computational socioeconomics, explaining these models while taking causal connections into account is a necessity. In this work, we advocate the use of an explanatory framework from cooperative game theory augmented with $do$ calculus, namely causal Sh…

    Submitted 18 January, 2022; originally announced January 2022.

  12. arXiv:2003.02937  [pdf, other]

    stat.ME

    A Nearest-Neighbor Based Nonparametric Test for Viral Remodeling in Heterogeneous Single-Cell Proteomic Data

    Authors: Trambak Banerjee, Bhaswar B. Bhattacharya, Gourab Mukherjee

    Abstract: An important problem in contemporary immunology studies based on single-cell protein expression data is to determine whether cellular expressions are remodeled post infection by a pathogen. One natural approach for detecting such changes is to use non-parametric two-sample statistical tests. However, in single-cell studies, direct application of these tests is often inadequate because single-cell…

    Submitted 24 June, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: Final version

  13. arXiv:2002.12586  [pdf, other]

    stat.ME

    Nonparametric Empirical Bayes Estimation on Heterogeneous Data

    Authors: Trambak Banerjee, Luella J. Fu, Gareth M. James, Gourab Mukherjee, Wenguang Sun

    Abstract: The simultaneous estimation of many parameters based on data collected from corresponding studies is a key research problem that has received renewed attention in the high-dimensional setting. Many practical situations involve heterogeneous data where heterogeneity is captured by a nuisance parameter. Effectively pooling information across samples while correctly accounting for heterogeneity prese…

    Submitted 14 August, 2023; v1 submitted 28 February, 2020; originally announced February 2020.

    Comments: Citations corrected and a new author added. No change in content!

    MSC Class: 62G08; 62G05; 62G20 ACM Class: G.3

  14. arXiv:1912.10127  [pdf, other]

    eess.IV cs.LG q-bio.QM stat.ML

    A Generalizable Method for Automated Quality Control of Functional Neuroimaging Datasets

    Authors: Matthew Kollada, Qingzhu Gao, Monika S Mellem, Tathagata Banerjee, William J Martin

    Abstract: Over the last twenty-five years, advances in the collection and analysis of fMRI data have enabled new insights into the brain basis of human health and disease. Individual behavioral variation can now be visualized at a neural level as patterns of connectivity among brain regions. Functional brain imaging is enhancing our understanding of clinical psychiatric disorders by revealing ties between r…

    Submitted 20 December, 2019; originally announced December 2019.

  15. arXiv:1911.01212  [pdf, other]

    cs.CL cs.LG stat.ML

    Scrambled Translation Problem: A Problem of Denoising UNMT

    Authors: Tamali Banerjee, Rudra Murthy V, Pushpak Bhattacharyya

    Abstract: In this paper, we identify an interesting kind of error in the output of Unsupervised Neural Machine Translation (UNMT) systems like Undreamt. We refer to this error type as the Scrambled Translation problem. We observe that UNMT models which use word shuffle noise (as in the case of Undreamt) can generate correct words, but fail to stitch them together to form phrases…

    Submitted 17 June, 2021; v1 submitted 30 October, 2019; originally announced November 2019.

    Comments: Accepted by MT Summit 2021

  16. arXiv:1910.08997  [pdf, other]

    math.ST stat.ME

    A General Framework for Empirical Bayes Estimation in Discrete Linear Exponential Family

    Authors: Trambak Banerjee, Qiang Liu, Gourab Mukherjee, Wenguang Sun

    Abstract: We develop a Nonparametric Empirical Bayes (NEB) framework for compound estimation in the discrete linear exponential family, which includes a wide class of discrete distributions frequently arising from modern big data applications. We propose to directly estimate the Bayes shrinkage factor in the generalized Robbins' formula via solving a scalable convex program, which is carefully developed bas…

    Submitted 20 October, 2019; originally announced October 2019.

  17. arXiv:1901.11440  [pdf]

    cs.LG stat.ML

    Toward Sensor-based Sleep Monitoring with Electrodermal Activity Measures

    Authors: William Romine, Tanvi Banerjee, Garrett Goodman

    Abstract: We use self-report and electrodermal activity (EDA) wearable sensor data from 77 nights of sleep on six participants to test the efficacy of EDA data for sleep monitoring. We used factor analysis to find latent factors in the EDA data, and causal model search to find the most probable graphical model accounting for self-reported sleep efficiency (SE), sleep quality (SQ), and the latent EDA factors…

    Submitted 31 January, 2019; originally announced January 2019.

    Comments: 9 pages, 1 figure, 1 table, journal

    Journal ref: Sensors 2019, 19(6), 1417

  18. arXiv:1811.11930  [pdf, other]

    stat.ME

    Adaptive Sparse Estimation with Side Information

    Authors: Trambak Banerjee, Gourab Mukherjee, Wenguang Sun

    Abstract: The article considers the problem of estimating a high-dimensional sparse parameter in the presence of side information that encodes the sparsity structure. We develop a general framework that involves first using an auxiliary sequence to capture the side information, and then incorporating the auxiliary sequence in inference to reduce the estimation risk. The proposed method, which carries out ad…

    Submitted 17 October, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: final version

  19. arXiv:1809.00358  [pdf, other]

    eess.SP q-bio.NC stat.AP stat.ME

    Sequential Detection of Regime Changes in Neural Data

    Authors: Taposh Banerjee, Stephen Allsop, Kay M. Tye, Demba Ba, Vahid Tarokh

    Abstract: The problem of detecting changes in firing patterns in neural data is studied. The problem is formulated as a quickest change detection problem. Important algorithms from the literature are reviewed. A new algorithmic technique is discussed to detect deviations from learned baseline behavior. The algorithms studied can be applied to both spike and local field potential data. The algorithms are app…

    Submitted 2 September, 2018; originally announced September 2018.

  20. arXiv:1807.06945  [pdf, other]

    eess.SP cs.LG stat.ME stat.ML

    Cyclostationary Statistical Models and Algorithms for Anomaly Detection Using Multi-Modal Data

    Authors: Taposh Banerjee, Gene Whipps, Prudhvi Gurram, Vahid Tarokh

    Abstract: A framework is proposed to detect anomalies in multi-modal data. A deep neural network-based object detector is employed to extract counts of objects and sub-events from the data. A cyclostationary model is proposed to model regular patterns of behavior in the count sequences. The anomaly detection problem is formulated as a problem of detecting deviations from learned cyclostationary behavior. Se…

    Submitted 2 July, 2018; originally announced July 2018.

  21. arXiv:1803.08947  [pdf, other]

    stat.AP cs.IT

    Sequential Event Detection Using Multimodal Data in Nonstationary Environments

    Authors: Taposh Banerjee, Gene Whipps, Prudhvi Gurram, Vahid Tarokh

    Abstract: The problem of sequential detection of anomalies in multimodal data is considered. The objective is to observe physical sensor data from CCTV cameras, and social media data from Twitter and Instagram to detect anomalous behaviors or events. Data from each modality is transformed to discrete time count data by using an artificial neural network to obtain counts of objects in CCTV images and by coun…

    Submitted 23 March, 2018; originally announced March 2018.

  22. Early hospital mortality prediction using vital signals

    Authors: Reza Sadeghi, Tanvi Banerjee, William Romine

    Abstract: Early hospital mortality prediction is critical as intensivists strive to make efficient medical decisions about the severely ill patients staying in intensive care units. As a result, various methods have been developed to address this problem based on clinical records. However, some of the laboratory test results are time-consuming and need to be processed. In this paper, we propose a novel meth…

    Submitted 9 February, 2019; v1 submitted 17 March, 2018; originally announced March 2018.

    Comments: 11 pages, 5 figures, preprint of accepted paper in IEEE&ACM CHASE 2018 and published in Smart Health journal

    Journal ref: Smart Health 9-10 (2018) 265-274

  23. arXiv:1710.10279  [pdf, other]

    stat.ME stat.AP

    Wavelet Shrinkage and Thresholding based Robust Classification for Brain Computer Interface

    Authors: Taposh Banerjee, John Choi, Bijan Pesaran, Demba Ba, Vahid Tarokh

    Abstract: A macaque monkey is trained to perform two different kinds of tasks, memory aided and visually aided. In each task, the monkey saccades to eight possible target locations. A classifier is proposed for direction decoding and task decoding based on local field potentials (LFP) collected from the prefrontal cortex. The LFP time-series data is modeled in a nonparametric regression framework, as a func…

    Submitted 27 November, 2017; v1 submitted 27 October, 2017; originally announced October 2017.

  24. arXiv:1710.01821  [pdf, other]

    stat.ME cs.IT

    Classification of Local Field Potentials using Gaussian Sequence Model

    Authors: Taposh Banerjee, John Choi, Bijan Pesaran, Demba Ba, Vahid Tarokh

    Abstract: A problem of classification of local field potentials (LFPs), recorded from the prefrontal cortex of a macaque monkey, is considered. An adult macaque monkey is trained to perform a memory-based saccade. The objective is to decode the eye movement goals from the LFP collected during a memory period. The LFP classification problem is modeled as that of classification of smooth functions embedded in…

    Submitted 27 November, 2017; v1 submitted 4 October, 2017; originally announced October 2017.

  25. arXiv:1702.08000  [pdf, other]

    stat.ML cs.LG

    Kiefer Wolfowitz Algorithm is Asymptotically Optimal for a Class of Non-Stationary Bandit Problems

    Authors: Rahul Singh, Taposh Banerjee

    Abstract: We consider the problem of designing an allocation rule or an "online learning algorithm" for a class of bandit problems in which the set of control actions available at each time $s$ is a convex, compact subset of $\mathbb{R}^d$. Upon choosing an action $x$ at time $s$, the algorithm obtains a noisy value of the unknown and time-varying function $f_s$ evaluated at $x$. The "regret" of an algorith…

    Submitted 8 March, 2017; v1 submitted 26 February, 2017; originally announced February 2017.

  26. arXiv:1701.02857  [pdf, other]

    stat.ME

    Feature Screening in Large Scale Cluster Analysis

    Authors: Trambak Banerjee, Gourab Mukherjee, Peter Radchenko

    Abstract: We propose a novel methodology for feature screening in clustering massive datasets, in which both the number of features and the number of observations can potentially be very large. Taking advantage of a fusion penalization based convex clustering criterion, we propose a very fast screening procedure that efficiently discards non-informative features by first computing a clustering score corresp…

    Submitted 4 October, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

    Comments: final version

  27. arXiv:1609.06757  [pdf, other]

    stat.AP eess.SY math.ST

    Quickest Change Detection Approach to Optimal Control in Markov Decision Processes with Model Changes

    Authors: Taposh Banerjee, Miao Liu, Jonathan P. How

    Abstract: Optimal control in non-stationary Markov decision processes (MDP) is a challenging problem. The aim in such a control problem is to maximize the long-term discounted reward when the transition dynamics or the reward function can change over time. When prior knowledge of change statistics is available, the standard Bayesian approach to this problem is to reformulate it as a partially observable M…

    Submitted 1 March, 2017; v1 submitted 21 September, 2016; originally announced September 2016.

    Comments: In Proceedings of American Control Conference 2017, 7 pages

  28. arXiv:1506.06199  [pdf, other]

    math.ST cs.IT stat.ME

    Non-parametric Quickest Change Detection for Large Scale Random Matrices

    Authors: Taposh Banerjee, Hamed Firouzi, Alfred O. Hero III

    Abstract: The problem of quickest detection of a change in the distribution of an $n\times p$ random matrix based on a sequence of observations having a single unknown change point is considered. The forms of the pre- and post-change distributions of the rows of the matrices are assumed to belong to the family of elliptically contoured densities with sparse dispersion matrices but are otherwise unknown. We p…

    Submitted 19 June, 2015; originally announced June 2015.

    Comments: Proc. of ISIT, Hong Kong, 2015

  29. arXiv:1210.5552  [pdf, other]

    math.ST cs.IT math.OC math.PR stat.AP

    Quickest Change Detection

    Authors: Venugopal V. Veeravalli, Taposh Banerjee

    Abstract: The problem of detecting changes in the statistical properties of a stochastic system and time series arises in various branches of science and engineering. It has a wide spectrum of important applications ranging from machine monitoring to biomedical signal processing. In all of these applications the observations being monitored undergo a change in distribution in response to a change or anomaly…

    Submitted 19 October, 2012; originally announced October 2012.

  30. arXiv:0908.1407  [pdf, ps, other]

    cs.IT cs.PF stat.AP

    Generalized Analysis of a Distributed Energy Efficient Algorithm for Change Detection

    Authors: Taposh Banerjee, Vinod Sharma

    Abstract: An energy efficient distributed Change Detection scheme based on Page's CUSUM algorithm was presented in \cite{icassp}. In this paper we consider a nonparametric version of this algorithm. In the algorithm in \cite{icassp}, each sensor runs CUSUM and transmits only when the CUSUM is above some threshold. The transmissions from the sensors are fused at the physical layer. The channel is modeled a…

    Submitted 10 August, 2009; originally announced August 2009.

    Comments: Accepted as a short paper in Proc. of the 12th ACM International Symposium on Modeling, Analysis and Simulation of Wireless and Mobile Systems (MSWiM), Tenerife, Canary Islands, Spain, Oct 26-30, 2009. Also visit: http://www.ece.iisc.ernet.in/~vinod/

    ACM Class: G.3; H.3.4

  31. Optimal factorial designs for cDNA microarray experiments

    Authors: Tathagata Banerjee, Rahul Mukerjee

    Abstract: We consider cDNA microarray experiments when the cell populations have a factorial structure, and investigate the problem of their optimal designing under a baseline parametrization where the objects of interest differ from those under the more common orthogonal parametrization. First, analytical results are given for the $2\times 2$ factorial. Since practical applications often involve a more c…

    Submitted 27 March, 2008; originally announced March 2008.

    Comments: Published in the Annals of Applied Statistics (http://www.imstat.org/aoas/) at http://dx.doi.org/10.1214/07-AOAS144 by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS144

    Journal ref: Annals of Applied Statistics 2007, Vol. 2, No. 1, 366-385

  32. A Conversation with Shoutir Kishore Chatterjee

    Authors: Tathagata Banerjee, Rahul Mukerjee

    Abstract: Shoutir Kishore Chatterjee was born in Ranchi, a small hill station in India, on November 6, 1934. He received his B.Sc. in statistics from the Presidency College, Calcutta, in 1954, and M.Sc. and Ph.D. degrees in statistics from the University of Calcutta in 1956 and 1962, respectively. He was appointed a lecturer in the Department of Statistics, University of Calcutta, in 1960 and was a member…

    Submitted 25 October, 2007; originally announced October 2007.

    Comments: Published in Statistical Science (http://www.imstat.org/sts/) at http://dx.doi.org/10.1214/088342306000000565 by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS219

    Journal ref: Statistical Science 2007, Vol. 22, No. 2, 279-290