-
Atypical behaviors of a tagged particle in asymmetric simple exclusion
Authors:
Sunder Sethuraman,
S. R. S. Varadhan
Abstract:
Consider the asymmetric nearest-neighbor exclusion process (ASEP) on ${\mathbb Z}$ with single particle drift $γ>0$, starting from a Bernoulli product invariant measure $ν_ρ$ with density $ρ$. It is known that the position $X_{N}$ of a tagged particle, say initially at the origin, at time $N$ satisfies an a.s. law of large numbers $\frac{1}{N}X_N \rightarrow γ(1-ρ)$ as $N\uparrow\infty$.
In this…
▽ More
Consider the asymmetric nearest-neighbor exclusion process (ASEP) on ${\mathbb Z}$ with single particle drift $γ>0$, starting from a Bernoulli product invariant measure $ν_ρ$ with density $ρ$. It is known that the position $X_{N}$ of a tagged particle, say initially at the origin, at time $N$ satisfies an a.s. law of large numbers $\frac{1}{N}X_N \rightarrow γ(1-ρ)$ as $N\uparrow\infty$.
In this context, we study the `typical' behavior of the tagged particle and `bulk' density evolution subject to `atypical' events $\{X_N\geq AN\}$ or $\{X_N\leq AN\}$ for $A\neq γ(1-ρ)$. We detail different structures, depending on whether $A<0$, $0\leq A< γ(1-ρ)$, $γ(1-ρ)<A< γ$, or $A\geq γ$, under which these atypical events are achieved, and compute associated large deviation costs. Among our results is an `upper tail' large deviation principle in scale $N$ for $\frac{1}{N}X_N$.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Effective mass of the Fröhlich Polaron and the Landau-Pekar-Spohn conjecture
Authors:
Rodrigo Bazaes,
Chiranjib Mukherjee,
Mark Sellke,
S. R. S. Varadhan
Abstract:
We prove that there is a constant $\overline C\in (0,\infty)$ such that the effective mass $m(α)$ of the Fröhlich Polaron satisfies $m(α) \geq \overline C α^4$, which is sharp according to a long-standing prediction of Landau-Pekar [19] from 1948 and of Spohn [35] from 1987. The method of proof, which demonstrates how the $α^4$ divergence rate of $m(α)$ appears in a natural way, is based on analyz…
▽ More
We prove that there is a constant $\overline C\in (0,\infty)$ such that the effective mass $m(α)$ of the Fröhlich Polaron satisfies $m(α) \geq \overline C α^4$, which is sharp according to a long-standing prediction of Landau-Pekar [19] from 1948 and of Spohn [35] from 1987. The method of proof, which demonstrates how the $α^4$ divergence rate of $m(α)$ appears in a natural way, is based on analyzing the Gaussian representation of the Polaron measure and that of the associated tilted Poisson point process developed in [25], together with an explicit identification of local interval process in the strong coupling limit $α\to\infty$ in terms of functionals of the {\it Pekar variational formula}.}
△ Less
Submitted 27 February, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
Semi-Parametric Sensitivity Analysis for Trials with Irregular and Informative Assessment Times
Authors:
Bonnie B. Smith,
Yu**g Gao,
Shu Yang,
Ravi Varadhan,
Andrea J. Apter,
Daniel O. Scharfstein
Abstract:
Many trials are designed to collect outcomes at or around pre-specified times after randomization. In practice, there can be substantial variability in the times when participants are actually assessed. Such irregular assessment times pose a challenge to learning the effect of treatment since not all participants have outcome assessments at the times of interest. Furthermore, observed outcome valu…
▽ More
Many trials are designed to collect outcomes at or around pre-specified times after randomization. In practice, there can be substantial variability in the times when participants are actually assessed. Such irregular assessment times pose a challenge to learning the effect of treatment since not all participants have outcome assessments at the times of interest. Furthermore, observed outcome values may not be representative of all participants' outcomes at a given time. This problem, known as informative assessment times, can arise if participants tend to have assessments when their outcomes are better (or worse) than at other times, or if participants with better outcomes tend to have more (or fewer) assessments. Methods have been developed that account for some types of informative assessment; however, since these methods rely on untestable assumptions, sensitivity analyses are needed. We develop a sensitivity analysis methodology by extending existing weighting methods. Our method accounts for the possibility that participants with worse outcomes at a given time are more (or less) likely than other participants to have an assessment at that time, even after controlling for variables observed earlier in the study. We apply our method to a randomized trial of low-income individuals with uncontrolled asthma. We illustrate implementation of our influence-function based estimation procedure in detail, and we derive the large-sample distribution of our estimator and evaluate its finite-sample performance.
△ Less
Submitted 5 November, 2023; v1 submitted 25 April, 2022;
originally announced April 2022.
-
Network Security Modeling using NetFlow Data: Detecting Botnet attacks in IP Traffic
Authors:
Ganesh Subramaniam,
Huan Chen,
Ravi Varadhan,
Robert Archibald
Abstract:
Cybersecurity, security monitoring of malicious events in IP traffic, is an important field largely unexplored by statisticians. Computer scientists have made significant contributions in this area using statistical anomaly detection and other supervised learning methods to detect specific malicious events. In this research, we investigate the detection of botnet command and control (C&C) hosts in…
▽ More
Cybersecurity, security monitoring of malicious events in IP traffic, is an important field largely unexplored by statisticians. Computer scientists have made significant contributions in this area using statistical anomaly detection and other supervised learning methods to detect specific malicious events. In this research, we investigate the detection of botnet command and control (C&C) hosts in massive IP traffic. We use the NetFlow data, the industry standard for monitoring of IP traffic for exploratory analysis and extracting new features. Using statistical as well as deep learning models, we develop a statistical intrusion detection system (SIDS) to predict traffic traces identified with malicious attacks. Employing interpretative machine learning techniques, botnet traffic signatures are derived. These models successfully detected botnet C&C hosts and compromised devices. The results were validated by matching predictions to existing blacklists of published malicious IP addresses.
△ Less
Submitted 19 August, 2021;
originally announced August 2021.
-
Improved Small Domain Estimation via Compromise Regression Weights
Authors:
Nicholas C. Henderson,
Ravi Varadhan,
Thomas A. Louis
Abstract:
Shrinkage estimates of small domain parameters typically utilize a combination of a noisy "direct" estimate that only uses data from a specific small domain and a more stable regression estimate. When the regression model is misspecified, estimation performance for the noisier domains can suffer due to substantial shrinkage towards a poorly estimated regression surface. In this paper, we introduce…
▽ More
Shrinkage estimates of small domain parameters typically utilize a combination of a noisy "direct" estimate that only uses data from a specific small domain and a more stable regression estimate. When the regression model is misspecified, estimation performance for the noisier domains can suffer due to substantial shrinkage towards a poorly estimated regression surface. In this paper, we introduce a new class of robust, empirically-driven regression weights that target estimation of the small domain means under potential misspecification of the global regression model. Our regression weights are a convex combination of the model-based weights associated with the best linear unbiased predictor (BLUP) and those associated with the observed best predictor (OBP). The compromise parameter in this convex combination is found by minimizing a novel, unbiased estimate of the mean-squared prediction error for the small domain means, and we label the associated small domain estimates the "compromise best predictor", or CBP. Using a data-adaptive mixture for the regression weights enables the CBP to possess the robustness of the OBP while retaining the main advantages of the EBLUP whenever the regression model is correct. We demonstrate the use of the CBP in an application estimating gait speed in older adults.
△ Less
Submitted 10 January, 2022; v1 submitted 28 June, 2020;
originally announced June 2020.
-
Brand vs. Generic: Addressing Non-Adherence, Secular Trends, and Non-Overlap
Authors:
Lamar Hunt III,
Irene B. Murimi,
Jodi B. Segal,
Marissa J. Seamans,
Daniel O. Scharfstein,
Ravi Varadhan
Abstract:
While generic drugs offer a cost-effective alternative to brand name drugs, regulators need a method to assess therapeutic equivalence in a post market setting. We develop such a method in the context of assessing the therapeutic equivalence of immediate release (IM) venlafaxine, based on a large insurance claims dataset provided by OptumLabs\textsuperscript{\textregistered}. To properly address t…
▽ More
While generic drugs offer a cost-effective alternative to brand name drugs, regulators need a method to assess therapeutic equivalence in a post market setting. We develop such a method in the context of assessing the therapeutic equivalence of immediate release (IM) venlafaxine, based on a large insurance claims dataset provided by OptumLabs\textsuperscript{\textregistered}. To properly address this question, our methodology must deal with issues of non-adherence, secular trends in health outcomes, and lack of treatment overlap due to sharp uptake of the generic once it becomes available. We define, identify (under assumptions) and estimate (using G-computation) a causal effect for a time-to-event outcome by extending regression discontinuity to survival curves. We do not find evidence for a lack of therapeutic equivalence of brand and generic IM venlafaxine.
△ Less
Submitted 7 July, 2019;
originally announced July 2019.
-
Identification of the Polaron measure in strong coupling and the Pekar variational formula
Authors:
Chiranjib Mukherjee,
S. R. S. Varadhan
Abstract:
The path measure corresponding to the Fröhlich Polaron appearing in quantum statistical mechanics is defined as the tilted measure $$ \widehat{\mathbb P}_{\varepsilon,T}= \frac{1}{Z(\varepsilon,T)}\exp\bigg(\frac{1}{2}\int_{-T}^T\int_{-T}^T \frac{\varepsilon\mathrm e^{-\varepsilon |t-s|}}{|ω(t)-ω(s)|} \mathrm d s \,\mathrm d t\bigg)\mathrm d\mathbb P. $$ Here $\varepsilon>0$ is the Kac parameter (…
▽ More
The path measure corresponding to the Fröhlich Polaron appearing in quantum statistical mechanics is defined as the tilted measure $$ \widehat{\mathbb P}_{\varepsilon,T}= \frac{1}{Z(\varepsilon,T)}\exp\bigg(\frac{1}{2}\int_{-T}^T\int_{-T}^T \frac{\varepsilon\mathrm e^{-\varepsilon |t-s|}}{|ω(t)-ω(s)|} \mathrm d s \,\mathrm d t\bigg)\mathrm d\mathbb P. $$ Here $\varepsilon>0$ is the Kac parameter (or the inverse-coupling), and $\mathbb P$ is the law of $3d$ Brownian increments. In [13] it was shown that the (thermodynamic) limit $\lim_{T\to\infty}\widehat{\mathbb P}_{\varepsilon,T}=\widehat{\mathbb P}_\varepsilon$ exists as a process with stationary increments and this limit was identified explicitly as a mixture of Gaussian processes. In the present article, the strong coupling limit or the vanishing Kac parameter limit $\lim_{\varepsilon\to 0} \widehat{\mathbb P}_\varepsilon$ is investigated. It is shown that this limit exists and coincides with the increments of the Pekar process, which is a stationary diffusion process with generator $\frac 12 Δ+ (\nablaψ/ψ)\cdot \nabla$, where $ψ$ is the unique (modulo shifts) maximizer of the Pekar variational problem $$ g_0=\sup_{\|ψ\|_2=1} \Big\{\int_{\mathbb R^3}\int_{\mathbb R^3}\,ψ^2(x) ψ^2(y)|x-y|^{-1}\mathrm d x\mathrm d y -\frac 12\|\nabla ψ\|_2^2\Big\}. $$ As shown in [12,6,1], the Pekar process is itself approximated by the limiting "mean-field Polaron measures", and thus, the present identification of the strong coupling Polaron is a rigorous justification of the "mean-field approximation" (on the level of path measures) conjectured by Spohn in [15]. This approximation in the vanishing Kac limit ($\varepsilon\to 0$) is also shown to hold for a general class of Kac-Interaction of the form $H(t,x)=\varepsilon \mathrm e^{-\varepsilon|t|} V(x)$ where $V$ is any continuous function vanishing at infinity.
△ Less
Submitted 26 July, 2021; v1 submitted 17 December, 2018;
originally announced December 2018.
-
SQUAREM: An R Package for Off-the-Shelf Acceleration of EM, MM and Other EM-like Monotone Algorithms
Authors:
Yu Du,
Ravi Varadhan
Abstract:
We discuss R package SQUAREM for accelerating iterative algorithms which exhibit slow, monotone convergence. These include the well-known expectation-maximization algorithm, majorize-minimize (MM), and other EM-like algorithms such as expectation conditional maximization, and generalized EM algorithms. We demonstrate the simplicity, generality, and power of SQUAREM through a wide array of applicat…
▽ More
We discuss R package SQUAREM for accelerating iterative algorithms which exhibit slow, monotone convergence. These include the well-known expectation-maximization algorithm, majorize-minimize (MM), and other EM-like algorithms such as expectation conditional maximization, and generalized EM algorithms. We demonstrate the simplicity, generality, and power of SQUAREM through a wide array of applications of EM/MM problems, including binary Poisson mixture, factor analysis, interval censoring, genetics admixture, and logistic regression maximum likelihood estimation (an MM problem). We show that SQUAREM is easy to apply, and can accelerate any fixed-point, smooth, contraction map** with linear convergence rate. Squared iterative scheme (Squarem) algorithm provides significant speed-up of EM-like algorithms. The margin of the advantage for Squarem is especially huge for high-dimensional problems or when EM step is relatively time-consuming to evaluate. Squarem can be used off-the-shelf since there is no need for the user to tweak any control parameters to optimize performance. Given its remarkable ease of use, Squarem may be considered as a default accelerator for slowly converging EM-like algorithms. All the comparisons of CPU computing time in the paper are made on a quad-core 2.3 GHz Intel Core i7 Mac computer. R Package SQUAREM can be downloaded at https://cran.r-project.org/web/packages/SQUAREM/index.html.
△ Less
Submitted 31 October, 2018; v1 submitted 25 October, 2018;
originally announced October 2018.
-
Bayesian Bivariate Subgroup Analysis for Risk-Benefit Evaluation
Authors:
Nicholas C. Henderson,
Ravi Varadhan
Abstract:
Subgroup analysis is a frequently used tool for evaluating heterogeneity of treatment effect and heterogeneity in treatment harm across observed baseline patient characteristics. While treatment efficacy and adverse event measures are often reported separately for each subgroup, analyzing their within-subgroup joint distribution is critical for better informed patient decision-making. In this pape…
▽ More
Subgroup analysis is a frequently used tool for evaluating heterogeneity of treatment effect and heterogeneity in treatment harm across observed baseline patient characteristics. While treatment efficacy and adverse event measures are often reported separately for each subgroup, analyzing their within-subgroup joint distribution is critical for better informed patient decision-making. In this paper, we describe Bayesian models for performing a subgroup analysis to compare the joint occurrence of a primary endpoint and an adverse event between two treatment arms. Our approaches emphasize estimation of heterogeneity in this joint distribution across subgroups, and our approaches directly accommodate subgroups with small numbers of observed primary and adverse event combinations. In addition, we describe several ways in which our models may be used to generate interpretable summary measures of benefit-risk tradeoffs for each subgroup. The methods described here are illustrated throughout using a large cardiovascular trial (N = 9,361) investigating the efficacy of an intervention for reducing systolic blood pressure to a lower-than-usual target.
△ Less
Submitted 11 August, 2018;
originally announced August 2018.
-
Strong coupling limit of the Polaron measure and the Pekar process
Authors:
Chiranjib Mukherjee,
S. R. S. Varadhan
Abstract:
The {\it{Polaron measure}} is defined as the transformed path measure
$$\widehat{\mathbb P}_{ε,T}= Z_{ε,T}^{-1}\,\, \exp\bigg\{\frac{1}{2}\int_{-T}^T\int_{-T}^T\frac{ε\e^{-ε|t-s|}}{|ω(t)-ω(s)|} \,\d s \,\d t\bigg\}\d\mathbb P$$ with respect to the law $\mathbb P$ of three dimensional Brownian increments on a finite interval $[-T,T]$, and $ Z_{ε,T}$ is the partition function with $ε>0$ being a co…
▽ More
The {\it{Polaron measure}} is defined as the transformed path measure
$$\widehat{\mathbb P}_{ε,T}= Z_{ε,T}^{-1}\,\, \exp\bigg\{\frac{1}{2}\int_{-T}^T\int_{-T}^T\frac{ε\e^{-ε|t-s|}}{|ω(t)-ω(s)|} \,\d s \,\d t\bigg\}\d\mathbb P$$ with respect to the law $\mathbb P$ of three dimensional Brownian increments on a finite interval $[-T,T]$, and $ Z_{ε,T}$ is the partition function with $ε>0$ being a constant. The logarithmic asymptotic behavior of the partition function $Z_{ε,T}$ was analyzed in \cite{DV83} showing that $$ g_0=\lim_{ε\to 0}\bigg[\lim_{T\to\infty}\frac{\log Z_{\eps,T}}{2T}\bigg]=\sup_{\heap{ψ\in H^1(\R^3)}{\|ψ\|_2=1}} \bigg\{\int_{\mathbb R^3}\int_{\mathbb R^3}\d x\d y\,\frac {ψ^2(x) ψ^2(y)}{|x-y|} -\frac 12\big\|\nabla ψ\big\|_2^2\bigg\}. $$ In \cite{MV18} we analyzed the actual path measures and showed that the limit ${\widehat {\mathbb P}}_{\eps}=\lim_{T\to\infty}\widehat{\mathbb P}_{\eps,T}$ exists and identified this limit explicitly, and as a corollary, we also deduced the central limit theorem for
$(2T)^{-1/2}(ω(T)-ω(-T))$ under $\widehat{\mathbb P}_{\eps,T}$ and obtained an expression for the limiting variance $σ^2(\eps)$.
In the present article, we investigate the {\it{strong coupling limit}} $\lim_{\eps\to 0} \lim_{T\to\infty} \widehat{\mathbb P}_{\eps,T}=\lim_{\eps\to 0} \widehat {\mathbb P}_\eps$ and show that this limit coincides with the increments of the stationary Pekar process with generator
$$
\frac 12 Δ+ \bigg(\frac{\nablaψ}ψ\bigg)\cdot \nabla
$$ for any maximizer $ψ$ of the free enrgy $g_0$. The Pekar process was also earlier identified in \cite{MV14}, \cite{KM15} and \cite{BKM15} as the limiting object of the {\it{mean-field Polaron}} measures.}
△ Less
Submitted 18 June, 2018;
originally announced June 2018.
-
Damped Anderson acceleration with restarts and monotonicity control for accelerating EM and EM-like algorithms
Authors:
Nicholas C. Henderson,
Ravi Varadhan
Abstract:
The expectation-maximization (EM) algorithm is a well-known iterative method for computing maximum likelihood estimates from incomplete data. Despite its numerous advantages, a main drawback of the EM algorithm is its frequently observed slow convergence which often hinders the application of EM algorithms in high-dimensional problems or in other complex settings.To address the need for more rapid…
▽ More
The expectation-maximization (EM) algorithm is a well-known iterative method for computing maximum likelihood estimates from incomplete data. Despite its numerous advantages, a main drawback of the EM algorithm is its frequently observed slow convergence which often hinders the application of EM algorithms in high-dimensional problems or in other complex settings.To address the need for more rapidly convergent EM algorithms, we describe a new class of acceleration schemes that build on the Anderson acceleration technique for speeding fixed-point iterations. Our approach is effective at greatly accelerating the convergence of EM algorithms and is automatically scalable to high dimensional settings. Through the introduction of periodic algorithm restarts and a dam** factor, our acceleration scheme provides faster and more robust convergence when compared to un-modified Anderson acceleration while also improving global convergence. Crucially, our method works as an "off-the-shelf" method in that it may be directly used to accelerate any EM algorithm without relying on the use of any model-specific features or insights. Through a series of simulation studies involving five representative problems, we show that our algorithm is substantially faster than the existing state-of-art acceleration schemes.
△ Less
Submitted 11 August, 2018; v1 submitted 18 March, 2018;
originally announced March 2018.
-
Identification of the Polaron measure I: Fixed coupling regime and the central limit theorem for large times
Authors:
Chiranjib Mukherjee,
S. R. S. Varadhan
Abstract:
We consider the Fröhlich model of the Polaron whose path integral formulation leads to the transformed path measure
$$
\widehat{\mathbb P}_{α,T}(\mathrm dω)= Z_{α,T}^{-1}\,\, \exp\bigg\{\fracα{2}\int_{-T}^T\int_{-T}^T\frac{e^{-|t-s|}}{|ω(t)-ω(s)|} \, d s \, d t\bigg\}\,\mathbb P(\mathrm dω)
$$
with respect to $\mathbb P$ which governs the law of the increments of the three dimensional Brow…
▽ More
We consider the Fröhlich model of the Polaron whose path integral formulation leads to the transformed path measure
$$
\widehat{\mathbb P}_{α,T}(\mathrm dω)= Z_{α,T}^{-1}\,\, \exp\bigg\{\fracα{2}\int_{-T}^T\int_{-T}^T\frac{e^{-|t-s|}}{|ω(t)-ω(s)|} \, d s \, d t\bigg\}\,\mathbb P(\mathrm dω)
$$
with respect to $\mathbb P$ which governs the law of the increments of the three dimensional Brownian motion on a finite interval $[-T,T]$, and $ Z_{α,T}$ is the partition function or the normalizing constant and $α>0$ is a constant. The Polaron measure reflects a self attractive interaction. According to a conjecture of Pekar that was proved in [DV83]
$$
g_0=\lim_{α\to\infty}\frac{1}{α^2}\bigg[\lim_{T\to\infty}\frac{\log Z_{α,T}}{2T}\bigg]
$$
exists and has a variational formula. In this article we show that for any $α>0$, the infinite-volume limit $\widehat{\mathbb P}_α=\lim_{T\to\infty}\widehat{\mathbb P}_{α,T}$ exists which is also identified explicitly. As a corollary, we deduce the central limit theorem (for any $α>0$ and as $T\to\infty$) for the distribution of $\frac{ω(T)-ω(-T)}{\sqrt{2T}}$ both under the finite-volume Polaron measure $\widehat{\mathbb P}_{α,T}$ and its infinite-volume counterpart $\widehat{\mathbb P}_α$, and obtain an expression for the limiting variance.
△ Less
Submitted 5 September, 2021; v1 submitted 15 February, 2018;
originally announced February 2018.
-
Individualized Treatment Effects with Censored Data via Fully Nonparametric Bayesian Accelerated Failure Time Models
Authors:
Nicholas C. Henderson,
Thomas A. Louis,
Gary L. Rosner,
Ravi Varadhan
Abstract:
Individuals often respond differently to identical treatments, and characterizing such variability in treatment response is an important aim in the practice of personalized medicine. In this article, we describe a non-parametric accelerated failure time model that can be used to analyze heterogeneous treatment effects (HTE) when patient outcomes are time-to-event. By utilizing Bayesian additive re…
▽ More
Individuals often respond differently to identical treatments, and characterizing such variability in treatment response is an important aim in the practice of personalized medicine. In this article, we describe a non-parametric accelerated failure time model that can be used to analyze heterogeneous treatment effects (HTE) when patient outcomes are time-to-event. By utilizing Bayesian additive regression trees and a mean-constrained Dirichlet process mixture model, our approach offers a flexible model for the regression function while placing few restrictions on the baseline hazard. Our non-parametric method leads to natural estimates of individual treatment effect and has the flexibility to address many major goals of HTE assessment. Moreover, our method requires little user input in terms of tuning parameter selection or subgroup specification. We illustrate the merits of our proposed approach with a detailed analysis of two large clinical trials for the prevention and treatment of congestive heart failure using an angiotensin-converting enzyme inhibitor. The analysis revealed considerable evidence for the presence of HTE in both trials as demonstrated by substantial estimated variation in treatment effect and by high proportions of patients exhibiting strong evidence of having treatment effects which differ from the overall treatment effect.
△ Less
Submitted 20 June, 2017;
originally announced June 2017.
-
Tails of Polynomials of Random Variables and Stable Limits for Nonconventional Sums
Authors:
Yuri Kifer,
S. R. S. Varadhan
Abstract:
We obtain decay rates of probabilities of tails of polynomials in several independent random variables with heavy tails and derive stable limit theorems for nonconventional sums of such polynomials
We obtain decay rates of probabilities of tails of polynomials in several independent random variables with heavy tails and derive stable limit theorems for nonconventional sums of such polynomials
△ Less
Submitted 18 April, 2016;
originally announced April 2016.
-
Tails of polynomials of random variables and stable limits for nonconventional sums
Authors:
Yuri Kifer,
S. R. S. Varadhan
Abstract:
We obtain first decay rates of probabilities of tails of multivariate polynomials built on independent random variables with heavy tails. Then we derive stable limit theorems for nonconventional sums of the form $\sum_{Nt\geq n\geq 1}F(X(q_1(n)),...,X(q_\ell(n)))$ where $F$ is a polynomial, $1\leq q_1(n)<\cdots <q_\ell(n)$ are integer valued increasing functions satisfying certain conditions and…
▽ More
We obtain first decay rates of probabilities of tails of multivariate polynomials built on independent random variables with heavy tails. Then we derive stable limit theorems for nonconventional sums of the form $\sum_{Nt\geq n\geq 1}F(X(q_1(n)),...,X(q_\ell(n)))$ where $F$ is a polynomial, $1\leq q_1(n)<\cdots <q_\ell(n)$ are integer valued increasing functions satisfying certain conditions and $X(n),\, n\geq 0$ is a sequence of independent random variables with heavy tails.
△ Less
Submitted 4 September, 2015; v1 submitted 16 April, 2015;
originally announced April 2015.
-
Fluctuations of the Self-Normalized Sum in the Curie-Weiss Model of SOC
Authors:
Matthias Gorny,
S. R. S. Varadhan
Abstract:
We extend the main theorem of arXiv:1301.6911 about the fluctuations in the Curie-Weiss model of SOC. We present a short proof using the Hubbard-Stratonovich transformation with the self-normalized sum of the random variables.
We extend the main theorem of arXiv:1301.6911 about the fluctuations in the Curie-Weiss model of SOC. We present a short proof using the Hubbard-Stratonovich transformation with the self-normalized sum of the random variables.
△ Less
Submitted 15 March, 2015;
originally announced March 2015.
-
Brownian Occupation Measures, Compactness and Large Deviations
Authors:
Chiranjib Mukherjee,
S. R. S. Varadhan
Abstract:
In proving large deviation estimates, the lower bound for open sets and upper bound for compact sets are essentially local estimates. On the other hand, the upper bound for closed sets is global and compactness of space or an exponential tightness estimate is needed to establish it. In dealing with the occupation measure $L_t(A)=\frac{1}{t}\int_0^t{\1}_A(W_s) \d s$ of the $d$ dimensional Brownian…
▽ More
In proving large deviation estimates, the lower bound for open sets and upper bound for compact sets are essentially local estimates. On the other hand, the upper bound for closed sets is global and compactness of space or an exponential tightness estimate is needed to establish it. In dealing with the occupation measure $L_t(A)=\frac{1}{t}\int_0^t{\1}_A(W_s) \d s$ of the $d$ dimensional Brownian motion, which is not positive recurrent, there is no possibility of exponential tightness. The space of probability distributions $\mathcal {M}_1(\R^d)$ can be compactified by replacing the usual topology of weak convergence by the vague toplogy, where the space is treated as the dual of continuous functions with compact support. This is essentially the one point compactification of $\R^d$ by adding a point at $\infty$ that results in the compactification of $\mathcal M_1(\R^d)$ by allowing some mass to escape to the point at $\infty$. If one were to use only test functions that are continuous and vanish at $\infty$ then the compactification results in the space of sub-probability distributions $\mathcal {M}_{\le 1}(\R^d)$ by ignoring the mass at $\infty$.
The main drawback of this compactification is that it ignores the underlying translation invariance. More explicitly, we may be interested in the space of equivalence classes of orbits $\widetilde{\mathcal M}_1=\widetilde{\mathcal M}_1(\R^d)$ under the action of the translation group $\R^d$ on $\mathcal M_1(\R^d)$. There are problems for which it is natural to compactify this space of orbits. We will provide such a compactification, prove a large deviation principle there and give an application to a relevant problem.
△ Less
Submitted 16 October, 2015; v1 submitted 21 April, 2014;
originally announced April 2014.
-
Large deviations for diffusions interacting through their ranks
Authors:
Amir Dembo,
Mykhaylo Shkolnikov,
S. R. Srinivasa Varadhan,
Ofer Zeitouni
Abstract:
We prove a Large Deviations Principle (LDP) for systems of diffusions (particles) interacting through their ranks, when the number of particles tends to infinity. We show that the limiting particle density is given by the unique solution of the approriate McKean-Vlasov equation and that the corresponding cumulative distribution function evolves according to the porous medium equation with convecti…
▽ More
We prove a Large Deviations Principle (LDP) for systems of diffusions (particles) interacting through their ranks, when the number of particles tends to infinity. We show that the limiting particle density is given by the unique solution of the approriate McKean-Vlasov equation and that the corresponding cumulative distribution function evolves according to the porous medium equation with convection. The large deviations rate function is provided in explicit form. This is the first instance of a LDP for interacting diffusions, where the interaction occurs both through the drift and the diffusion coefficients and where the rate function can be given explicitly. In the course of the proof, we obtain new regularity results for a certain tilted version of the porous medium equation.
△ Less
Submitted 4 April, 2017; v1 submitted 22 November, 2012;
originally announced November 2012.
-
Nonconventional Large Deviations Theorems
Authors:
Yuri Kifer,
S. R. S. Varadhan
Abstract:
We obtain large deviations theorems for nonconventional sums with underlying process being a Markov process satisfying the Doeblin condition or a dynamical system such as subshift of finite type or hyperbolic or expanding transformation.
We obtain large deviations theorems for nonconventional sums with underlying process being a Markov process satisfying the Doeblin condition or a dynamical system such as subshift of finite type or hyperbolic or expanding transformation.
△ Less
Submitted 6 February, 2013; v1 submitted 1 June, 2012;
originally announced June 2012.
-
Large Deviations for Random Matrices
Authors:
Sourav Chatterjee,
S. R. S. Varadhan
Abstract:
We prove a large deviation result for a random symmetric n x n matrix with independent identically distributed entries to have a few eigenvalues of size n. If the spectrum S survives when the matrix is rescaled by a factor of n, it can only be the eigenvalues of a Hilbert-Schmidt kernel k(x,y) on [0,1] x [0,1]. The rate function for k is $I(k)=1/2\int h(k(x,y) dxdy$ where h is the Cramer rate func…
▽ More
We prove a large deviation result for a random symmetric n x n matrix with independent identically distributed entries to have a few eigenvalues of size n. If the spectrum S survives when the matrix is rescaled by a factor of n, it can only be the eigenvalues of a Hilbert-Schmidt kernel k(x,y) on [0,1] x [0,1]. The rate function for k is $I(k)=1/2\int h(k(x,y) dxdy$ where h is the Cramer rate function for the common distribution of the entries that is assumed to have a tail decaying faster than any Gaussian. The large deviation for S is then obtained by contraction.
△ Less
Submitted 18 April, 2013; v1 submitted 22 June, 2011;
originally announced June 2011.
-
Large deviations for the current and tagged particle in 1D nearest-neighbor symmetric simple exclusion
Authors:
Sunder Sethuraman,
S. R. S. Varadhan
Abstract:
Laws of large numbers, starting from certain nonequilibrium measures, have been shown for the integrated current across a bond, and a tagged particle in one-dimensional symmetric nearest-neighbor simple exclusion [Ann. Inst. Henri Poincare Probab. Stat. 42 (2006) 567-577]. In this article, we prove corresponding large deviation principles and evaluate the rate functions, showing different growth b…
▽ More
Laws of large numbers, starting from certain nonequilibrium measures, have been shown for the integrated current across a bond, and a tagged particle in one-dimensional symmetric nearest-neighbor simple exclusion [Ann. Inst. Henri Poincare Probab. Stat. 42 (2006) 567-577]. In this article, we prove corresponding large deviation principles and evaluate the rate functions, showing different growth behaviors near and far from their zeroes which connect with results in [J. Stat. Phys. 136 (2009) 1-15].
△ Less
Submitted 27 May, 2013; v1 submitted 7 January, 2011;
originally announced January 2011.
-
Nonconventional limit theorems in discrete and continuous time via martingales
Authors:
Yuri Kifer,
S. R. S. Varadhan
Abstract:
We obtain functional central limit theorems for both discrete time expressions of the form $1/\sqrt{N}\sum_{n=1}^{[Nt]}(F(X(q_1(n)),\ldots, X(q_{\ell}(n)))-\bar{F})$ and similar expressions in the continuous time where the sum is replaced by an integral. Here $X(n),n\geq0$ is a sufficiently fast mixing vector process with some moment conditions and stationarity properties, $F$ is a continuous func…
▽ More
We obtain functional central limit theorems for both discrete time expressions of the form $1/\sqrt{N}\sum_{n=1}^{[Nt]}(F(X(q_1(n)),\ldots, X(q_{\ell}(n)))-\bar{F})$ and similar expressions in the continuous time where the sum is replaced by an integral. Here $X(n),n\geq0$ is a sufficiently fast mixing vector process with some moment conditions and stationarity properties, $F$ is a continuous function with polynomial growth and certain regularity properties, $\bar{F}=\int F\,d(μ\times\cdots\timesμ)$, $μ$ is the distribution of $X(0)$ and $q_i(n)=in$ for $i\le k\leq\ell$ while for $i>k$ they are positive functions taking on integer values on integers with some growth conditions which are satisfied, for instance, when $q_i$'s are polynomials of increasing degrees. These results decisively generalize [Probab. Theory Related Fields 148 (2010) 71-106], whose method was only applicable to the case $k=2$ under substantially more restrictive moment and mixing conditions and which could not be extended to convergence of processes and to the corresponding continuous time case. As in [Probab. Theory Related Fields 148 (2010) 71-106], our results hold true when $X_i(n)=T^nf_i$, where $T$ is a mixing subshift of finite type, a hyperbolic diffeomorphism or an expanding transformation taken with a Gibbs invariant measure, as well as in the case when $X_i(n)=f_i({Υ}_n)$, where ${Υ}_n$ is a Markov chain satisfying the Doeblin condition considered as a stationary process with respect to its invariant measure. Moreover, our relaxed mixing conditions yield applications to other types of dynamical systems and Markov processes, for instance, where a spectral gap can be established. The continuous time version holds true when, for instance, $X_i(t)=f_i(ξ_t)$, where $ξ_t$ is a nondegenerate continuous time Markov chain with a finite state space or a nondegenerate diffusion on a compact manifold. A partial motivation for such limit theorems is due to a series of papers dealing with nonconventional ergodic averages.
△ Less
Submitted 25 February, 2014; v1 submitted 10 December, 2010;
originally announced December 2010.
-
The large deviation principle for the Erdős-Rényi random graph
Authors:
Sourav Chatterjee,
S. R. S. Varadhan
Abstract:
What does an Erdos-Renyi graph look like when a rare event happens? This paper answers this question when p is fixed and n tends to infinity by establishing a large deviation principle under an appropriate topology. The formulation and proof of the main result uses the recent development of the theory of graph limits by Lovasz and coauthors and Szemeredi's regularity lemma from graph theory. As a…
▽ More
What does an Erdos-Renyi graph look like when a rare event happens? This paper answers this question when p is fixed and n tends to infinity by establishing a large deviation principle under an appropriate topology. The formulation and proof of the main result uses the recent development of the theory of graph limits by Lovasz and coauthors and Szemeredi's regularity lemma from graph theory. As a basic application of the general principle, we work out large deviations for the number of triangles in G(n,p). Surprisingly, even this simple example yields an interesting double phase transition.
△ Less
Submitted 3 April, 2011; v1 submitted 11 August, 2010;
originally announced August 2010.
-
Special invited paper. Large deviations
Authors:
S. R. S. Varadhan
Abstract:
This paper is based on Wald Lectures given at the annual meeting of the IMS in Minneapolis during August 2005. It is a survey of the theory of large deviations.
This paper is based on Wald Lectures given at the annual meeting of the IMS in Minneapolis during August 2005. It is a survey of the theory of large deviations.
△ Less
Submitted 15 April, 2008;
originally announced April 2008.
-
Random walks in a random environment
Authors:
S R S Varadhan
Abstract:
Random walks as well as diffusions in random media are considered. Methods are developed that allow one to establish large deviation results for both the `quenched' and the `averaged' case.
Random walks as well as diffusions in random media are considered. Methods are developed that allow one to establish large deviation results for both the `quenched' and the `averaged' case.
△ Less
Submitted 5 March, 2005;
originally announced March 2005.
-
A martingale proof of Dobrushin's theorem for non-homogeneous Markov chains
Authors:
Sunder Sethuraman,
S. R. S. Varadhan
Abstract:
In 1956, Dobrushin proved a definitive central limit theorem for non-homogeneous Markov chains. In this note, a shorter and different proof elucidating more the assumptions is given through martingale approximation.
In 1956, Dobrushin proved a definitive central limit theorem for non-homogeneous Markov chains. In this note, a shorter and different proof elucidating more the assumptions is given through martingale approximation.
△ Less
Submitted 12 April, 2004;
originally announced April 2004.