-
Monte Carlo methods on compact complex manifolds using Bergman kernels
Authors:
Thibaut Lemoine,
Rémi Bardenet
Abstract:
In this paper, we propose a new randomized method for numerical integration on a compact complex manifold with respect to a continuous volume form. Taking for quadrature nodes a suitable determinantal point process, we build an unbiased Monte Carlo estimator of the integral of any Lipschitz function, and show that the estimator satisfies a central limit theorem, with a faster rate than under independent sampling. In particular, seeing a complex manifold of dimension $d$ as a real manifold of dimension $d_{\mathbb{R}}=2d$, the mean squared error for $N$ quadrature nodes decays as $N^{-1-2/d_{\mathbb{R}}}$; this is faster than previous DPP-based quadratures and reaches the optimal worst-case rate investigated by [Bakhvalov 1965] in Euclidean spaces. The determinantal point process we use is characterized by its kernel, which is the Bergman kernel of a holomorphic Hermitian line bundle, and we strongly build upon the work of Berman that led to the central limit theorem in [Berman, 2018]. We provide numerical illustrations for the Riemann sphere.
Submitted 15 May, 2024;
originally announced May 2024.
-
On the Number of Steps of CyclePopping in Weakly Inconsistent U(1)-Connection Graphs
Authors:
Michaël Fanuel,
Rémi Bardenet
Abstract:
A U(1)-connection graph $G$ is a graph in which each oriented edge is endowed with a unit complex number, the latter being conjugated under orientation flip. We consider cycle-rooted spanning forests (CRSFs), a particular kind of spanning subgraphs of $G$ that have recently found computational applications as randomized spectral sparsifiers. In this context, CRSFs are drawn from a determinantal measure. Under a condition on the connection, Kassel and Kenyon gave an elegant algorithm, named CyclePopping, to sample from this distribution. The algorithm is an extension of the celebrated algorithm of Wilson that uses a loop-erased random walk to sample uniform spanning trees. In this paper, we give an alternative, elementary proof of correctness of CyclePopping for CRSF sampling; we fill the gaps of a proof sketch by Kassel, who was himself inspired by Marchal's proof of the correctness of Wilson's original algorithm. One benefit of the full proof à la Marchal is that we obtain a concise expression for the law of the number of steps to complete the sampling procedure, shedding light on practical situations where the algorithm is expected to run fast. Furthermore, we show how to extend the proof to more general distributions over CRSFs, which are not determinantal. The correctness of CyclePopping is known even in the non-determinantal case from the work of Kassel and Kenyon, so our merit is only to provide an alternate proof. One interest of this alternate proof is again to provide the distribution of the time complexity of the algorithm, in terms of a Poisson point process on the graph loops, or equivalently as a Poisson process on pyramids of cycles, a combinatorial notion introduced by Viennot. Finally, we strive to make the connections to loop measures and combinatorial structures as explicit as possible, to provide a reference for future extensions of the algorithm and its analysis.
Submitted 7 June, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
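For readers unfamiliar with the algorithm of Wilson mentioned in this abstract, the following is a minimal sketch of the original version only: it samples a uniform spanning tree with loop-erased random walks and does not implement the CyclePopping extension to cycle-rooted spanning forests. The function name `wilson_spanning_tree`, the adjacency-dictionary input format, and the `rng` argument are illustrative assumptions, not taken from the paper.

```python
import random


def wilson_spanning_tree(adjacency, root, rng=None):
    """Sample a uniform spanning tree of a connected undirected graph.

    adjacency: dict mapping each node to the list of its neighbours
               (hypothetical input format chosen for this sketch).
    Returns a dict mapping every non-root node to its parent in the tree.
    """
    rng = rng or random.Random()
    in_tree = {root}
    parent = {}
    for start in adjacency:
        # Random walk from `start` until it hits the current tree.
        # Overwriting successor[node] at each revisit erases loops implicitly.
        successor = {}
        node = start
        while node not in in_tree:
            successor[node] = rng.choice(adjacency[node])
            node = successor[node]
        # Retrace the loop-erased path and attach it to the tree.
        node = start
        while node not in in_tree:
            in_tree.add(node)
            parent[node] = successor[node]
            node = successor[node]
    return parent


# Usage on a 4-cycle: the result always has 3 edges pointing towards the root.
graph = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
tree = wilson_spanning_tree(graph, root=0, rng=random.Random(0))
```

The number of random-walk steps taken by the two inner loops is the kind of quantity whose law the paper studies for CyclePopping.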
-
Point Processes and spatial statistics in time-frequency analysis
Authors:
Barbara Pascal,
Rémi Bardenet
Abstract:
A finite-energy signal is represented by a square-integrable, complex-valued function $t\mapsto s(t)$ of a real variable $t$, interpreted as time. Similarly, a noisy signal is represented by a random process. Time-frequency analysis, a subfield of signal processing, amounts to describing the temporal evolution of the frequency content of a signal. Loosely speaking, if $s$ is the audio recording of a musical piece, time-frequency analysis somehow consists in writing the musical score of the piece. Mathematically, the operation is performed through a transform $\mathcal{V}$, mapping $s \in L^2(\mathbb{R})$ onto a complex-valued function $\mathcal{V}s \in L^2(\mathbb{R}^2)$ of time $t$ and angular frequency $ω$. The squared modulus $(t, ω) \mapsto \vert\mathcal{V}s(t,ω)\vert^2$ of the time-frequency representation is known as the spectrogram of $s$; in the musical score analogy, a peaked spectrogram at $(t_0,ω_0)$ corresponds to a musical note at angular frequency $ω_0$ localized at time $t_0$. More generally, the intuition is that upper level sets of the spectrogram contain relevant information about the original signal. Hence, many signal processing algorithms revolve around identifying maxima of the spectrogram. In contrast, zeros of the spectrogram indicate perfect silence, that is, a time at which a particular frequency is absent. Assimilating $\mathbb{R}^2$ to $\mathbb{C}$ through $z = ω+ \mathrm{i}t$, this chapter focuses on time-frequency transforms $\mathcal{V}$ that map signals to analytic functions. The zeros of the spectrogram of a noisy signal are then the zeros of a random analytic function, hence forming a Point Process in $\mathbb{C}$. This chapter is devoted to the study of these Point Processes, to their links with zeros of Gaussian Analytic Functions, and to designing signal detection and denoising algorithms using spatial statistics.
Submitted 29 February, 2024;
originally announced February 2024.
-
Monte Carlo with kernel-based Gibbs measures: Guarantees for probabilistic herding
Authors:
Martin Rouault,
Rémi Bardenet,
Mylène Maïda
Abstract:
Kernel herding belongs to a family of deterministic quadratures that seek to minimize the worst-case integration error over a reproducing kernel Hilbert space (RKHS). In spite of strong experimental support, it has proved difficult to prove that this worst-case error decreases at a faster rate than the standard square root of the number of quadrature nodes, at least in the usual case where the RKHS is infinite-dimensional. In this theoretical paper, we study a joint probability distribution over quadrature nodes, whose support tends to minimize the same worst-case error as kernel herding. We prove that it does outperform i.i.d. Monte Carlo, in the sense of coming with a tighter concentration inequality on the worst-case integration error. While not improving the rate yet, this demonstrates that the mathematical tools of the study of Gibbs measures can help understand to what extent kernel herding and its variants improve on computationally cheaper methods. Moreover, we provide early experimental evidence that a faster rate of convergence, though not worst-case, is likely.
Submitted 18 February, 2024;
originally announced February 2024.
-
Benchmarking multi-component signal processing methods in the time-frequency plane
Authors:
Juan M. Miramont,
Rémi Bardenet,
Pierre Chainais,
Francois Auger
Abstract:
Signal processing in the time-frequency plane has a long history and remains a field of methodological innovation. For instance, detection and denoising based on the zeros of the spectrogram have been proposed since 2015, contrasting with a long history of focusing on larger values of the spectrogram. Yet, unlike neighboring fields like optimization and machine learning, time-frequency signal processing lacks widely-adopted benchmarking tools. In this work, we contribute an open-source, Python-based toolbox termed MCSM-Benchs for benchmarking multi-component signal analysis methods, and we demonstrate our toolbox on three time-frequency benchmarks. First, we compare different methods for signal detection based on the zeros of the spectrogram, including unexplored variations of previously proposed detection tests. Second, we compare zero-based denoising methods to both classical and novel methods based on large values and ridges of the spectrogram. Finally, we compare the denoising performance of these methods against typical spectrogram thresholding strategies, in terms of post-processing artifacts commonly referred to as musical noise. At a low level, the obtained results provide new insight on the assessed approaches, and in particular research directions to further develop zero-based methods. At a higher level, our benchmarks exemplify the benefits of using a public, collaborative, common framework for benchmarking.
Submitted 13 February, 2024;
originally announced February 2024.
-
Signal reconstruction using determinantal sampling
Authors:
Ayoub Belhadji,
Rémi Bardenet,
Pierre Chainais
Abstract:
We study the approximation of a square-integrable function from a finite number of evaluations on a random set of nodes according to a well-chosen distribution. This is particularly relevant when the function is assumed to belong to a reproducing kernel Hilbert space (RKHS). This work proposes to combine several natural finite-dimensional approximations based on two possible probability distributions of nodes. These distributions are related to determinantal point processes, and use the kernel of the RKHS to favor RKHS-adapted regularity in the random design. While previous work on determinantal sampling relied on the RKHS norm, we prove mean-square guarantees in $L^2$ norm. We show that determinantal point processes and mixtures thereof can yield fast convergence rates. Our results also shed light on how the rate changes as more smoothness is assumed, a phenomenon known as superconvergence. Besides, determinantal sampling generalizes i.i.d. sampling from the Christoffel function which is standard in the literature. More importantly, determinantal sampling guarantees the so-called instance optimality property for a smaller number of function evaluations than i.i.d. sampling.
Submitted 13 October, 2023;
originally announced October 2023.
-
Repelled point processes with application to numerical integration
Authors:
Diala Hawat,
Rémi Bardenet,
Raphaël Lachièze-Rey
Abstract:
Linear statistics of point processes yield Monte Carlo estimators of integrals. While the simplest approach relies on a homogeneous Poisson point process, more regularly spread point processes, such as scrambled low-discrepancy sequences or determinantal point processes, can yield Monte Carlo estimators with fast-decaying mean square error. Following the intuition that more regular configurations result in lower integration error, we introduce the repulsion operator, which reduces clustering by slightly pushing the points of a configuration away from each other. Our main theoretical result is that applying the repulsion operator to a homogeneous Poisson point process yields an unbiased Monte Carlo estimator with lower variance than under the original point process. On the computational side, the evaluation of our estimator is only quadratic in the number of integrand evaluations and can be easily parallelized without any communication across tasks. We illustrate our variance reduction result with numerical experiments and compare it to popular Monte Carlo methods. Finally, we numerically investigate a few open questions on the repulsion operator. In particular, the experiments suggest that the variance reduction also holds when the operator is applied to other motion-invariant point processes.
Submitted 9 August, 2023;
originally announced August 2023.
-
On sampling determinantal and Pfaffian point processes on a quantum computer
Authors:
Rémi Bardenet,
Michaël Fanuel,
Alexandre Feller
Abstract:
DPPs were introduced by Macchi as a model in quantum optics in the 1970s. Since then, they have been widely used as models and subsampling tools in statistics and computer science. Most applications require sampling from a DPP, and given their quantum origin, it is natural to wonder whether sampling a DPP on a quantum computer is easier than on a classical one. We focus here on DPPs over a finite state space, which are distributions over the subsets of $\{1,\dots,N\}$ parametrized by an $N\times N$ Hermitian kernel matrix. Vanilla sampling consists in two steps, of respective costs $\mathcal{O}(N^3)$ and $\mathcal{O}(Nr^2)$ operations on a classical computer, where $r$ is the rank of the kernel matrix. A large first part of the current paper consists in explaining why the state-of-the-art in quantum simulation of fermionic systems already yields quantum DPP sampling algorithms. We then modify existing quantum circuits, and discuss their insertion in a full DPP sampling pipeline that starts from practical kernel specifications. The bottom line is that, with $P$ (classical) parallel processors, we can divide the preprocessing cost by $P$ and build a quantum circuit with $\mathcal{O}(Nr)$ gates that sample a given DPP, with depth varying from $\mathcal{O}(N)$ to $\mathcal{O}(r\log N)$ depending on qubit-communication constraints on the target machine. We also connect existing work on the simulation of superconductors to Pfaffian point processes, which generalize DPPs and would be a natural addition to the machine learner's toolbox. In particular, we describe "projective" Pfaffian point processes, the cardinality of which has constant parity, almost surely. Finally, the circuits are empirically validated on a classical simulator and on 5-qubit IBM machines.
Submitted 22 November, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
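To make the phrase "parametrized by a kernel matrix" concrete, here is a small sketch under the assumption that the matrix is the correlation (marginal) kernel $K$ of the DPP, so that the probability that a subset $S$ is contained in the sample equals the determinant of $K$ restricted to $S$. The helper names `det` and `inclusion_probability` and the example kernel are illustrative choices; this computes probabilities only and is not one of the classical or quantum samplers discussed in the paper.

```python
def det(matrix):
    """Determinant by Gaussian elimination with partial pivoting (pure Python)."""
    a = [list(row) for row in matrix]
    n = len(a)
    result = 1.0  # determinant of the empty matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            return 0.0  # numerically singular
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result  # a row swap flips the sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return result


def inclusion_probability(kernel, subset):
    """P(subset is contained in the sample) = det(K restricted to subset)."""
    idx = list(subset)
    return det([[kernel[i][j] for j in idx] for i in idx])


# Example kernel (an assumption for illustration): the orthogonal projection
# onto span{e_0, (e_1 + e_2) / sqrt(2)}, a rank-2 projection kernel.
K = [[1.0, 0.0, 0.0],
     [0.0, 0.5, 0.5],
     [0.0, 0.5, 0.5]]
p_single = inclusion_probability(K, [1])    # item 1 appears half of the time
p_pair = inclusion_probability(K, [1, 2])   # items 1 and 2 never co-occur
```

In this example the determinant for the pair {1, 2} vanishes, which is the repulsion between similar items that makes DPPs useful as subsampling tools.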
-
Smoothing complex-valued signals on Graphs with Monte-Carlo
Authors:
Hugo Jaquard,
Michaël Fanuel,
Pierre-Olivier Amblard,
Rémi Bardenet,
Simon Barthelmé,
Nicolas Tremblay
Abstract:
We introduce new smoothing estimators for complex signals on graphs, based on a recently studied Determinantal Point Process (DPP). These estimators are built from subsets of edges and nodes drawn according to this DPP, making up trees and unicycles, i.e., connected components containing exactly one cycle. We provide a Julia implementation of these estimators and study their performance when applied to a ranking problem.
Submitted 28 February, 2023; v1 submitted 15 October, 2022;
originally announced October 2022.
-
From point processes to quantum optics and back
Authors:
Rémi Bardenet,
Alexandre Feller,
Jérémie Bouttier,
Pascal Degiovanni,
Adrien Hardy,
Adam Rançon,
Benjamin Roussel,
Grégory Schehr,
Christoph I. Westbrook
Abstract:
Some fifty years ago, in her seminal PhD thesis, Odile Macchi introduced permanental and determinantal point processes. Her initial motivation was to provide models for the set of detection times in fundamental bosonic or fermionic optical experiments, respectively. After two rather quiet decades, these point processes have quickly become standard examples of point processes with nontrivial, yet tractable, correlation structures. In particular, determinantal point processes have been since the 1990s a technical workhorse in random matrix theory and combinatorics, and a standard model for repulsive point patterns in machine learning and spatial statistics since the 2010s. Meanwhile, our ability to experimentally probe the correlations between detection events in bosonic and fermionic optics has progressed tremendously. In Part I of this survey, we provide a modern introduction to the concepts in Macchi's thesis and their physical motivation, under the combined eye of mathematicians, physicists, and signal processers. Our objective is to provide a shared basis of knowledge for later cross-disciplinary work on point processes in quantum optics, and reconnect with the physical roots of permanental and determinantal point processes.
Submitted 11 October, 2022;
originally announced October 2022.
-
Sparsification of the regularized magnetic Laplacian with multi-type spanning forests
Authors:
Michaël Fanuel,
Rémi Bardenet
Abstract:
In this paper, we consider a ${\rm U}(1)$-connection graph, that is, a graph where each oriented edge is endowed with a unit modulus complex number that is conjugated under orientation flip. A natural replacement for the combinatorial Laplacian is then the magnetic Laplacian, a Hermitian matrix that includes information about the graph's connection. Magnetic Laplacians appear, e.g., in the problem of angular synchronization. In the context of large and dense graphs, we study here sparsifiers of the magnetic Laplacian $Δ$, i.e., spectral approximations based on subgraphs with few edges. Our approach relies on sampling multi-type spanning forests (MTSFs) using a custom determinantal point process, a probability distribution over edges that favours diversity. In a word, an MTSF is a spanning subgraph whose connected components are either trees or cycle-rooted trees. The latter partially capture the angular inconsistencies of the connection graph, and thus provide a way to compress the information contained in the connection. Interestingly, when the connection graph has weakly inconsistent cycles, samples from the determinantal point process under consideration can be obtained à la Wilson, using a random walk with cycle popping. We provide statistical guarantees for a choice of natural estimators of the connection Laplacian, and investigate two practical applications of our sparsifiers: ranking with angular synchronization and graph-based semi-supervised learning. From a statistical perspective, a side result of this paper of independent interest is a matrix Chernoff bound with intrinsic dimension, which allows considering the influence of a regularization -- of the form $Δ+ q \mathbb{I}$ with $q>0$ -- on sparsification guarantees.
Submitted 20 March, 2024; v1 submitted 31 August, 2022;
originally announced August 2022.
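As an illustration of the object being sparsified, the sketch below assembles a magnetic Laplacian for an unweighted graph under one common sign convention, which is an assumption here and may differ from the paper's: the diagonal holds the degrees and the entry $(u,v)$ is $-\mathrm{e}^{\mathrm{i}\theta_{uv}}$, with $\theta_{vu} = -\theta_{uv}$. The function name `magnetic_laplacian` and the edge-dictionary input are hypothetical.

```python
import cmath


def magnetic_laplacian(n_nodes, oriented_edges):
    """Build the magnetic Laplacian as a list of lists of complex numbers.

    oriented_edges: dict {(u, v): theta_uv}, listing each edge in one
    orientation only; the reverse orientation carries the conjugate phase.
    """
    lap = [[0j] * n_nodes for _ in range(n_nodes)]
    for (u, v), theta in oriented_edges.items():
        phase = cmath.exp(1j * theta)
        lap[u][u] += 1                   # unit edge weight adds to both degrees
        lap[v][v] += 1
        lap[u][v] -= phase               # entry for orientation u -> v
        lap[v][u] -= phase.conjugate()   # conjugated under orientation flip
    return lap


# A triangle whose phases do not sum to zero around the cycle,
# i.e. an inconsistent cycle in the terminology of the abstract.
triangle = {(0, 1): 0.3, (1, 2): 0.3, (2, 0): 0.3}
L = magnetic_laplacian(3, triangle)
```

With all angles set to zero, this construction reduces to the usual combinatorial Laplacian, whose rows sum to zero.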
-
On estimating the structure factor of a point process, with applications to hyperuniformity
Authors:
Diala Hawat,
Guillaume Gautier,
Rémi Bardenet,
Raphaël Lachièze-Rey
Abstract:
Hyperuniformity is the study of stationary point processes with a sub-Poisson variance in a large window. In other words, counting the points of a hyperuniform point process that fall in a given large region yields a small-variance Monte Carlo estimation of the volume. Hyperuniform point processes have received a lot of attention in statistical physics, both for the investigation of natural organized structures and the synthesis of materials. Unfortunately, rigorously proving that a point process is hyperuniform is usually difficult. A common practice in statistical physics and chemistry is to use a few samples to estimate a spectral measure called the structure factor. Its decay around zero provides a diagnostic of hyperuniformity. Different applied fields use however different estimators, and important algorithmic choices proceed from each field's lore. This paper provides a systematic survey and derivation of known or otherwise natural estimators of the structure factor. We also leverage the consistency of these estimators to contribute the first asymptotically valid statistical test of hyperuniformity. We benchmark all estimators and hyperuniformity diagnostics on a set of examples. In an effort to make investigations of the structure factor and hyperuniformity systematic and reproducible, we further provide the Python toolbox structure_factor, containing all the estimators and tools that we discuss.
Submitted 2 March, 2023; v1 submitted 16 March, 2022;
originally announced March 2022.
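One classical estimator of the structure factor is the scattering intensity, $\frac{1}{N}\big\vert\sum_{j=1}^{N} \mathrm{e}^{-\mathrm{i}\langle k, x_j\rangle}\big\vert^2$ for points $x_1,\dots,x_N$ and wavevector $k$. The sketch below evaluates it with the standard library only; the function name `scattering_intensity` is a hypothetical choice and this is not the API of the structure_factor toolbox mentioned in the abstract.

```python
import cmath
import math


def scattering_intensity(points, wavevector):
    """Return (1/N) * |sum_j exp(-i <k, x_j>)|^2 for a list of points.

    points: list of coordinate tuples; wavevector: tuple of the same dimension.
    """
    total = sum(
        cmath.exp(-1j * sum(k * x for k, x in zip(wavevector, point)))
        for point in points
    )
    return abs(total) ** 2 / len(points)


# Ten points of the integer lattice in dimension 1.
lattice = [(float(j),) for j in range(10)]
peak = scattering_intensity(lattice, (2 * math.pi,))  # all phases align
dip = scattering_intensity(lattice, (math.pi,))       # phases cancel in pairs
```

For the lattice, the phases align at $k = 2\pi$ and cancel at $k = \pi$, which illustrates why the behaviour of such estimators near zero is used as a diagnostic of hyperuniformity.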
-
A covariant, discrete time-frequency representation tailored for zero-based signal detection
Authors:
Barbara Pascal,
Rémi Bardenet
Abstract:
Recent work in time-frequency analysis proposed to switch the focus from the maxima of the spectrogram toward its zeros, which, for signals corrupted by Gaussian noise, form a random point pattern with a very stable structure leveraged by modern spatial statistics tools to perform component disentanglement and signal detection. The major bottlenecks of this approach are the discretization of the Short-Time Fourier Transform and the boundedness of the time-frequency observation window deteriorating the estimation of summary statistics of the zeros, on which signal processing procedures rely. To circumvent these limitations, we introduce the Kravchuk transform, a generalized time-frequency representation suited to discrete signals, providing a covariant and numerically tractable counterpart to a recently proposed discrete transform, with a compact phase space, particularly amenable to spatial statistics. Interesting properties of the Kravchuk transform are demonstrated, among which covariance under the action of SO(3) and invertibility. We further show that the point process of the zeros of the Kravchuk transform of white Gaussian noise coincides with that of the spherical Gaussian Analytic Function, implying its invariance under isometries of the sphere. Elaborating on this theorem, we develop a procedure for signal detection based on the spatial statistics of the zeros of the Kravchuk spectrogram, whose statistical power is assessed by intensive numerical simulations, and compares favorably to state-of-the-art zeros-based detection procedures. Furthermore, it appears to be particularly robust to both low signal-to-noise ratio and small number of samples.
Submitted 6 February, 2023; v1 submitted 8 February, 2022;
originally announced February 2022.
-
Determinantal point processes based on orthogonal polynomials for sampling minibatches in SGD
Authors:
Remi Bardenet,
Subhro Ghosh,
Meixia Lin
Abstract:
Stochastic gradient descent (SGD) is a cornerstone of machine learning. When the number N of data items is large, SGD relies on constructing an unbiased estimator of the gradient of the empirical risk using a small subset of the original dataset, called a minibatch. Default minibatch construction involves uniformly sampling a subset of the desired size, but alternatives have been explored for variance reduction. In particular, experimental evidence suggests drawing minibatches from determinantal point processes (DPPs), distributions over minibatches that favour diversity among selected items. However, like in recent work on DPPs for coresets, providing a systematic and principled understanding of how and why DPPs help has been difficult. In this work, we contribute an orthogonal polynomial-based DPP paradigm for minibatch sampling in SGD. Our approach leverages the specific data distribution at hand, which endows it with greater sensitivity and power over existing data-agnostic methods. We substantiate our method via a detailed theoretical analysis of its convergence properties, interweaving between the discrete data set and the underlying continuous domain. In particular, we show how specific DPPs and a string of controlled approximations can lead to gradient estimators with a variance that decays faster with the batchsize than under uniform sampling. Coupled with existing finite-time guarantees for SGD on convex objectives, this entails that DPP minibatches lead to a smaller bound on the mean square approximation error than uniform minibatches. Moreover, our estimators are amenable to a recent algorithm that directly samples linear statistics of DPPs (i.e., the gradient estimator) without sampling the underlying DPP (i.e., the minibatch), thereby reducing computational overhead. We provide detailed synthetic as well as real data experiments to substantiate our theoretical claims.
Submitted 11 December, 2021;
originally announced December 2021.
-
Nonparametric estimation of continuous DPPs with kernel methods
Authors:
Michaël Fanuel,
Rémi Bardenet
Abstract:
Determinantal point processes (DPPs) are statistical models for repulsive point patterns. Both sampling and inference are tractable for DPPs, a rare feature among models with negative dependence that explains their popularity in machine learning and spatial statistics. Parametric and nonparametric inference methods have been proposed in the finite case, i.e. when the point patterns live in a finite ground set. In the continuous case, only parametric methods have been investigated, while nonparametric maximum likelihood for DPPs -- an optimization problem over trace-class operators -- has remained an open question. In this paper, we show that a restricted version of this maximum likelihood (MLE) problem falls within the scope of a recent representer theorem for nonnegative functions in an RKHS. This leads to a finite-dimensional problem, with strong statistical ties to the original MLE. Moreover, we propose, analyze, and demonstrate a fixed point algorithm to solve this finite-dimensional problem. Finally, we also provide a controlled estimate of the correlation kernel of the DPP, thus providing more interpretability.
Submitted 27 November, 2021; v1 submitted 27 June, 2021;
originally announced June 2021.
-
On proportional volume sampling for experimental design in general spaces
Authors:
Arnaud Poinas,
Rémi Bardenet
Abstract:
Optimal design for linear regression is a fundamental task in statistics. For finite design spaces, recent progress has shown that random designs drawn using proportional volume sampling (PVS) lead to approximation guarantees for A-optimal design. PVS strikes the balance between design nodes that jointly fill the design space, while marginally staying in regions of high mass under the solution of a relaxed convex version of the original problem. In this paper, we examine some of the statistical implications of a new variant of PVS for (possibly Bayesian) optimal design. Using point process machinery, we treat the case of a generic Polish design space. We show that not only are the A-optimality approximation guarantees preserved, but we obtain similar guarantees for D-optimal design that tighten recent results. Moreover, we show that PVS can be sampled in polynomial time. Unfortunately, in spite of its elegance and tractability, we demonstrate on a simple example that the practical implications of general PVS are likely limited. In the second part of the paper, we focus on applications and investigate the use of PVS as a subroutine for stochastic search heuristics. We demonstrate that PVS is a robust addition to the practitioner's toolbox, especially when the regression functions are nonstandard and the design space, while low-dimensional, has a complicated shape (e.g., nonlinear boundaries, several connected components).
Submitted 1 February, 2021; v1 submitted 9 November, 2020;
originally announced November 2020.
-
Learning from DPPs via Sampling: Beyond HKPV and symmetry
Authors:
Rémi Bardenet,
Subhroshekhar Ghosh
Abstract:
Determinantal point processes (DPPs) have become a significant tool for recommendation systems, feature selection, or summary extraction, harnessing the intrinsic ability of these probabilistic models to facilitate sample diversity. The ability to sample from DPPs is paramount to the empirical investigation of these models. Most exact samplers are variants of a spectral meta-algorithm due to Hough, Krishnapur, Peres and Virág (henceforth HKPV), which is in general time and resource intensive. For DPPs with symmetric kernels, scalable HKPV samplers have been proposed that either first downsample the ground set of items, or force the kernel to be low-rank, using e.g. Nyström-type decompositions.
In the present work, we contribute a radically different approach than HKPV. Exploiting the fact that many statistical and learning objectives can be effectively accomplished by only sampling certain key observables of a DPP (so-called linear statistics), we invoke an expression for the Laplace transform of such an observable as a single determinant, which holds in complete generality. Combining traditional low-rank approximation techniques with Laplace inversion algorithms from numerical analysis, we show how to directly approximate the distribution function of a linear statistic of a DPP. This distribution function can then be used in hypothesis testing or to actually sample the linear statistic, as per requirement. Our approach is scalable and applies to very general DPPs, beyond traditional symmetric kernels.
Submitted 8 July, 2020;
originally announced July 2020.
-
Fast sampling from $β$-ensembles
Authors:
Guillaume Gautier,
Rémi Bardenet,
Michal Valko
Abstract:
We study sampling algorithms for $β$-ensembles with time complexity less than cubic in the cardinality of the ensemble. Following Dumitriu & Edelman (2002), we see the ensemble as the eigenvalues of a random tridiagonal matrix, namely a random Jacobi matrix. First, we provide a unifying and elementary treatment of the tridiagonal models associated to the three classical Hermite, Laguerre and Jacobi ensembles. For this purpose, we use simple changes of variables between successive reparametrizations of the coefficients defining the tridiagonal matrix. Second, we derive an approximate sampler for the simulation of $β$-ensembles, and illustrate how fast it can be for polynomial potentials. This method combines a Gibbs sampler on Jacobi matrices and the diagonalization of these matrices. In practice, even for large ensembles, only a few Gibbs passes suffice for the marginal distribution of the eigenvalues to fit the expected theoretical distribution. When the conditionals in the Gibbs sampler can be simulated exactly, the same fast empirical convergence is observed for the fluctuations of the largest eigenvalue. Our experimental results support a conjecture by Krishnapur et al. (2016), that the Gibbs chain on Jacobi matrices of size $N$ mixes in $\mathcal{O}(\log(N))$.
Submitted 4 March, 2020;
originally announced March 2020.
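As a rough illustration of the tridiagonal viewpoint mentioned in this abstract, the sketch below draws the Hermite model attributed to Dumitriu & Edelman (2002) and extracts its eigenvalues by Sturm-sequence bisection. It is not the approximate Gibbs sampler of the paper. The function names and the normalization (eigenvalue density proportional to $\prod_{i<j}|\lambda_i-\lambda_j|^{β}\exp(-\sum_i \lambda_i^2/2)$) are assumptions made for this example.

```python
import math
import random


def hermite_beta_jacobi(n, beta, rng):
    """Draw the diagonal and off-diagonal of the Hermite tridiagonal model.

    Assumed convention: diagonal entries are N(0, 1) and the k-th
    off-diagonal entry is chi_{(n-k)*beta} / sqrt(2), for k = 1..n-1.
    """
    diag = [rng.gauss(0.0, 1.0) for _ in range(n)]
    # A chi-square variable with m degrees of freedom is Gamma(shape=m/2, scale=2).
    off = [
        math.sqrt(rng.gammavariate((n - k) * beta / 2.0, 2.0) / 2.0)
        for k in range(1, n)
    ]
    return diag, off


def count_eigenvalues_below(diag, off, x):
    """Sturm count: number of eigenvalues of the tridiagonal matrix below x."""
    count = 0
    q = 1.0
    for i, a in enumerate(diag):
        b2 = off[i - 1] ** 2 if i > 0 else 0.0
        q = a - x - b2 / q
        if q == 0.0:
            q = -1e-12  # nudge away from an exact zero pivot
        if q < 0.0:
            count += 1
    return count


def tridiagonal_eigenvalues(diag, off, iterations=100):
    """Return all eigenvalues in increasing order, one bisection per eigenvalue."""
    n = len(diag)
    # Gershgorin discs give an interval that contains the whole spectrum.
    padded = [0.0] + [abs(b) for b in off] + [0.0]
    radius = [padded[i] + padded[i + 1] for i in range(n)]
    lower = min(a - r for a, r in zip(diag, radius))
    upper = max(a + r for a, r in zip(diag, radius))
    eigenvalues = []
    for k in range(n):
        lo, hi = lower, upper
        for _ in range(iterations):
            mid = 0.5 * (lo + hi)
            if count_eigenvalues_below(diag, off, mid) >= k + 1:
                hi = mid
            else:
                lo = mid
        eigenvalues.append(0.5 * (lo + hi))
    return eigenvalues


if __name__ == "__main__":
    rng = random.Random(0)
    diag, off = hermite_beta_jacobi(8, 2.0, rng)
    print(tridiagonal_eigenvalues(diag, off))
```

Drawing the matrix costs a number of random variates linear in $N$; the bisection used here is only a dependency-free stand-in for a dedicated symmetric tridiagonal eigensolver.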
-
Kernel interpolation with continuous volume sampling
Authors:
Ayoub Belhadji,
Rémi Bardenet,
Pierre Chainais
Abstract:
A fundamental task in kernel methods is to pick nodes and weights, so as to approximate a given function from an RKHS by the weighted sum of kernel translates located at the nodes. This is the crux of kernel density estimation, kernel quadrature, or interpolation from discrete samples. Furthermore, RKHSs offer a convenient mathematical and computational framework. We introduce and analyse continuous volume sampling (VS), the continuous counterpart -- for choosing node locations -- of a discrete distribution introduced in (Deshpande & Vempala, 2006). Our contribution is theoretical: we prove almost optimal bounds for interpolation and quadrature under VS. While similar bounds already exist for some specific RKHSs using ad-hoc node constructions, VS offers bounds that apply to any Mercer kernel and depend on the spectrum of the associated integration operator. We emphasize that, unlike previous randomized approaches that rely on regularized leverage scores or determinantal point processes, evaluating the pdf of VS only requires pointwise evaluations of the kernel. VS is thus naturally amenable to MCMC samplers.
Submitted 22 February, 2020;
originally announced February 2020.
-
Kernel quadrature with DPPs
Authors:
Ayoub Belhadji,
Rémi Bardenet,
Pierre Chainais
Abstract:
We study quadrature rules for functions from an RKHS, using nodes sampled from a determinantal point process (DPP). DPPs are parametrized by a kernel, and we use a truncated and saturated version of the RKHS kernel. This link between the two kernels, along with DPP machinery, leads to relatively tight bounds on the quadrature error, that depends on the spectrum of the RKHS kernel. Finally, we experimentally compare DPPs to existing kernel-based quadratures such as herding, Bayesian quadrature, or leverage score sampling. Numerical results confirm the interest of DPPs, and even suggest faster rates than our bounds in particular cases.
Submitted 31 December, 2019; v1 submitted 18 June, 2019;
originally announced June 2019.
-
A determinantal point process for column subset selection
Authors:
Ayoub Belhadji,
Rémi Bardenet,
Pierre Chainais
Abstract:
Dimensionality reduction is a first step of many machine learning pipelines. Two popular approaches are principal component analysis, which projects onto a small number of well chosen but non-interpretable directions, and feature selection, which selects a small number of the original features. Feature selection can be abstracted as a numerical linear algebra problem called the column subset selection problem (CSSP). CSSP corresponds to selecting the best subset of columns of a matrix $X \in \mathbb{R}^{N \times d}$, where \emph{best} is often meant in the sense of minimizing the approximation error, i.e., the norm of the residual after projection of $X$ onto the space spanned by the selected columns. Such an optimization over subsets of $\{1,\dots,d\}$ is usually impractical. One workaround that has been vastly explored is to resort to polynomial-cost, random subset selection algorithms that favor small values of this approximation error. We propose such a randomized algorithm, based on sampling from a projection determinantal point process (DPP), a repulsive distribution over a fixed number $k$ of indices $\{1,\dots,d\}$ that favors diversity among the selected columns. We give bounds on the ratio of the expected approximation error for this DPP over the optimal error of PCA. These bounds improve over the state-of-the-art bounds of \emph{volume sampling} when some realistic structural assumptions are satisfied for $X$. Numerical experiments suggest that our bounds are tight, and that our algorithms have comparable performance with the \emph{double phase} algorithm, often considered to be the practical state-of-the-art. Column subset selection with DPPs thus inherits the best of both worlds: good empirical performance and tight error bounds.
Submitted 23 December, 2018;
originally announced December 2018.
-
DPPy: Sampling DPPs with Python
Authors:
Guillaume Gautier,
Guillermo Polito,
Rémi Bardenet,
Michal Valko
Abstract:
Determinantal point processes (DPPs) are specific probability distributions over clouds of points that are used as models and computational tools across physics, probability, statistics, and more recently machine learning. Sampling from DPPs is a challenge and therefore we present DPPy, a Python toolbox that gathers known exact and approximate sampling algorithms for both finite and continuous DPPs. The project is hosted on GitHub and equipped with an extensive documentation.
Submitted 12 August, 2019; v1 submitted 19 September, 2018;
originally announced September 2018.
-
Time-frequency transforms of white noises and Gaussian analytic functions
Authors:
Rémi Bardenet,
Adrien Hardy
Abstract:
A family of Gaussian analytic functions (GAFs) has recently been linked to the Gabor transform of white Gaussian noise [Bardenet et al., 2017]. This answered pioneering work by Flandrin [2015], who observed that the zeros of the Gabor transform of white noise had a very regular distribution and proposed filtering algorithms based on the zeros of a spectrogram. The mathematical link with GAFs provides a wealth of probabilistic results to inform the design of such signal processing procedures. In this paper, we study in a systematic way the link between GAFs and a class of time-frequency transforms of Gaussian white noises on Hilbert spaces of signals. Our main observation is a conceptual correspondence between pairs (transform, GAF) and generating functions for classical orthogonal polynomials. This correspondence covers some classical time-frequency transforms, such as the Gabor transform and the Daubechies-Paul analytic wavelet transform. It also unveils new windowed discrete Fourier transforms, which map white noises to fundamental GAFs. All these transforms may thus be of interest to the research program `filtering with zeros'. We also identify the GAF whose zeros are the extrema of the Gabor transform of the white noise and derive their first intensity. Moreover, we discuss important subtleties in defining a white noise and its transform on infinite dimensional Hilbert spaces. Finally, we provide quantitative estimates concerning the finite-dimensional approximations of these white noises, which is of practical interest when it comes to implementing signal processing algorithms based on GAFs.
Submitted 23 July, 2019; v1 submitted 30 July, 2018;
originally announced July 2018.
-
On the zeros of the spectrogram of white noise
Authors:
Rémi Bardenet,
Julien Flamant,
Pierre Chainais
Abstract:
In a recent paper, Flandrin [2015] has proposed filtering based on the zeros of a spectrogram, using the short-time Fourier transform and a Gaussian window. His results are based on empirical observations on the distribution of the zeros of the spectrogram of white Gaussian noise. These zeros tend to be uniformly spread over the time-frequency plane, and not to clutter. Our contributions are threefold: we rigorously define the zeros of the spectrogram of continuous white Gaussian noise, we explicitly characterize their statistical distribution, and we investigate the computational and statistical underpinnings of the practical implementation of signal detection based on the statistics of spectrogram zeros. In particular, we stress that the zeros of spectrograms of white Gaussian noise correspond to zeros of Gaussian analytic functions, a topic of recent independent mathematical interest [Hough et al., 2009].
Submitted 31 July, 2017;
originally announced August 2017.
-
Zonotope hit-and-run for efficient sampling from projection DPPs
Authors:
Guillaume Gautier,
Rémi Bardenet,
Michal Valko
Abstract:
Determinantal point processes (DPPs) are distributions over sets of items that model diversity using kernels. Their applications in machine learning include summary extraction and recommendation systems. Yet, the cost of sampling from a DPP is prohibitive in large-scale applications, which has triggered an effort towards efficient approximate samplers. We build a novel MCMC sampler that combines ideas from combinatorial geometry, linear programming, and Monte Carlo methods to sample from DPPs with a fixed sample cardinality, also called projection DPPs. Our sampler leverages the ability of the hit-and-run MCMC kernel to efficiently move across convex bodies. Previous theoretical results yield a fast mixing time of our chain when targeting a distribution that is close to a projection DPP, but not a DPP in general. Our empirical results demonstrate that this extends to sampling projection DPPs, i.e., our sampler is more sample-efficient than previous approaches which in turn translates to faster convergence when dealing with costly-to-evaluate functions, such as summary extraction in our experiments.
Submitted 15 June, 2017; v1 submitted 30 May, 2017;
originally announced May 2017.
-
Monte Carlo with Determinantal Point Processes
Authors:
Rémi Bardenet,
Adrien Hardy
Abstract:
We show that repulsive random variables can yield Monte Carlo methods with faster convergence rates than the typical $N^{-1/2}$, where $N$ is the number of integrand evaluations. More precisely, we propose stochastic numerical quadratures involving determinantal point processes associated with multivariate orthogonal polynomials, and we obtain root mean square errors that decrease as $N^{-(1+1/d)/2}$, where $d$ is the dimension of the ambient space. First, we prove a central limit theorem (CLT) for the linear statistics of a class of determinantal point processes, when the reference measure is a product measure supported on a hypercube, which satisfies the Nevai-class regularity condition, a result which may be of independent interest. Next, we introduce a Monte Carlo method based on these determinantal point processes, and prove a CLT with explicit limiting variance for the quadrature error, when the reference measure satisfies a stronger regularity condition. As a corollary, by taking a specific reference measure and using a construction similar to importance sampling, we obtain a general Monte Carlo method, which applies to any measure with continuously derivable density. Loosely speaking, our method can be interpreted as a stochastic counterpart to Gaussian quadrature, which, at the price of some convergence rate, is easily generalizable to any dimension and has a more explicit error term.
Submitted 15 June, 2019; v1 submitted 2 May, 2016;
originally announced May 2016.
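To make the rate comparison in this abstract concrete, the following sketch evaluates the two decay rates, $N^{-1/2}$ for independent sampling and $N^{-(1+1/d)/2}$ for the DPP-based quadrature, with all multiplicative constants ignored. The function names are hypothetical, and the numbers say nothing about actual errors on a given integrand.

```python
def iid_rmse_rate(n):
    """Root mean square error rate of standard Monte Carlo, up to a constant."""
    return n ** -0.5


def dpp_rmse_rate(n, d):
    """Rate N^{-(1 + 1/d)/2} quoted in the abstract, up to a constant."""
    return n ** (-(1.0 + 1.0 / d) / 2.0)


if __name__ == "__main__":
    n = 10 ** 4
    for d in (1, 2, 5, 20):
        # The gain over independent sampling is N^{-1/(2d)}; it shrinks as d grows.
        gain = dpp_rmse_rate(n, d) / iid_rmse_rate(n)
        print(d, iid_rmse_rate(n), dpp_rmse_rate(n, d), gain)
```

The ratio of the two rates is $N^{-1/(2d)}$, so the improvement is largest in low dimension and fades as $d$ increases.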
-
Inference for determinantal point processes without spectral knowledge
Authors:
Rémi Bardenet,
Michalis K. Titsias
Abstract:
Determinantal point processes (DPPs) are point process models that naturally encode diversity between the points of a given realization, through a positive definite kernel $K$. DPPs possess desirable properties, such as exact sampling or analyticity of the moments, but learning the parameters of kernel $K$ through likelihood-based inference is not straightforward. First, the kernel that appears in the likelihood is not $K$, but another kernel $L$ related to $K$ through an often intractable spectral decomposition. This issue is typically bypassed in machine learning by directly parametrizing the kernel $L$, at the price of some interpretability of the model parameters. We follow this approach here. Second, the likelihood has an intractable normalizing constant, which takes the form of a large determinant in the case of a DPP over a finite set of objects, and the form of a Fredholm determinant in the case of a DPP over a continuous domain. Our main contribution is to derive bounds on the likelihood of a DPP, both for finite and continuous domains. Unlike previous work, our bounds are cheap to evaluate since they do not rely on approximating the spectrum of a large matrix or an operator. Through usual arguments, these bounds thus yield cheap variational inference and moderately expensive exact Markov chain Monte Carlo inference methods for DPPs.
Submitted 4 July, 2015;
originally announced July 2015.
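For readers unfamiliar with the kernel $L$ and the determinantal normalizing constant mentioned in this abstract, here is a toy sketch on a ground set of two items. It uses the standard finite-DPP formula $P(X = A) = \det(L_A)/\det(I + L)$ and does not implement the likelihood bounds of the paper. The function names and the example matrix are assumptions chosen for illustration.

```python
def det2(m):
    """Determinant of a 2x2 matrix given as nested lists."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def l_ensemble_probabilities(L):
    """Probability of each subset of {0, 1} under the DPP with likelihood kernel L."""
    identity_plus_L = [
        [1.0 + L[0][0], L[0][1]],
        [L[1][0], 1.0 + L[1][1]],
    ]
    z = det2(identity_plus_L)  # normalizing constant det(I + L)
    return {
        frozenset(): 1.0 / z,  # determinant of the empty minor is 1
        frozenset({0}): L[0][0] / z,
        frozenset({1}): L[1][1] / z,
        frozenset({0, 1}): det2(L) / z,
    }


if __name__ == "__main__":
    # Symmetric positive definite kernel; the off-diagonal term encodes similarity.
    L = [[1.0, 0.8], [0.8, 1.0]]
    probs = l_ensemble_probabilities(L)
    p0 = probs[frozenset({0})] + probs[frozenset({0, 1})]
    p1 = probs[frozenset({1})] + probs[frozenset({0, 1})]
    # Repulsion: both items appear together less often than under independence.
    print(probs[frozenset({0, 1})], p0 * p1)
```

With two items the determinant is trivial to compute; the difficulty discussed in the abstract arises when $I + L$ is a large matrix or an operator on a continuous domain.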
-
On Markov chain Monte Carlo methods for tall data
Authors:
Rémi Bardenet,
Arnaud Doucet,
Chris Holmes
Abstract:
Markov chain Monte Carlo methods are often deemed too computationally intensive to be of any practical use for big data applications, and in particular for inference on datasets containing a large number $n$ of individual data points, also known as tall datasets. In scenarios where data are assumed independent, various approaches to scale up the Metropolis-Hastings algorithm in a Bayesian inference context have been recently proposed in machine learning and computational statistics. These approaches can be grouped into two categories: divide-and-conquer approaches and subsampling-based algorithms. The aims of this article are as follows. First, we present a comprehensive review of the existing literature, commenting on the underlying assumptions and theoretical guarantees of each method. Second, by leveraging our understanding of these limitations, we propose an original subsampling-based approach which samples from a distribution provably close to the posterior distribution of interest, yet can require less than $O(n)$ data point likelihood evaluations at each iteration for certain statistical models in favourable scenarios. Finally, we have only been able so far to propose subsampling-based methods which display good performance in scenarios where the Bernstein-von Mises approximation of the target posterior distribution is excellent. It remains an open challenge to develop such methods in scenarios where the Bernstein-von Mises approximation is poor.
Submitted 11 May, 2015;
originally announced May 2015.
-
Highlights from the Pierre Auger Observatory
Authors:
Antoine Letessier-Selvon,
A. Aab,
P. Abreu,
M. Aglietta,
M. Ahlers,
E. J. Ahn,
I. F. M. Albuquerque,
I. Allekotte,
J. Allen,
P. Allison,
A. Almela,
J. Alvarez Castillo,
J. Alvarez-Muniz,
R. Alves Batista,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
C. Aramo,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave
, et al. (472 additional authors not shown)
Abstract:
The Pierre Auger Observatory is the world's largest cosmic ray observatory. Our current exposure reaches nearly 40,000 km$^2$ str and provides us with an unprecedented quality data set. The performance and stability of the detectors and their enhancements are described. Data analyses have led to a number of major breakthroughs. Among these we discuss the energy spectrum and the searches for large-scale anisotropies. We present analyses of our X$_{max}$ data and show how it can be interpreted in terms of mass composition. We also describe some new analyses that extract mass sensitive parameters from the 100% duty cycle SD data. A coherent interpretation of all these recent results opens new directions. The consequences regarding the cosmic ray composition and the properties of UHECR sources are briefly discussed.
Submitted 19 October, 2013; v1 submitted 17 October, 2013;
originally announced October 2013.
-
Pierre Auger Observatory and Telescope Array: Joint Contributions to the 33rd International Cosmic Ray Conference (ICRC 2013)
Authors:
The Telescope Array,
Pierre Auger Collaborations,
T. Abu-Zayyad,
M. Allen,
R. Anderson,
R. Azuma,
E. Barcikowski,
J. W. Belz,
D. R. Bergman,
S. A. Blake,
R. Cady,
M. J. Chae,
B. G. Cheon,
J. Chiba,
M. Chikawa,
W. R. Cho,
T. Fujii,
M. Fukushima,
K. Goto,
W. Hanlon,
Y. Hayashi,
N. Hayashida,
K. Hibino,
K. Honda
, et al. (598 additional authors not shown)
Abstract:
Joint contributions of the Pierre Auger and Telescope Array Collaborations to the 33rd International Cosmic Ray Conference, Rio de Janeiro, Brazil, July 2013: cross-calibration of the fluorescence telescopes, large scale anisotropies and mass composition.
Submitted 2 October, 2013;
originally announced October 2013.
-
Concentration inequalities for sampling without replacement
Authors:
Rémi Bardenet,
Odalric-Ambrym Maillard
Abstract:
Concentration inequalities quantify the deviation of a random variable from a fixed value. In spite of numerous applications, such as opinion surveys or ecological counting procedures, few concentration results are known for the setting of sampling without replacement from a finite population. Until now, the best general concentration inequality has been a Hoeffding inequality due to Serfling [Ann. Statist. 2 (1974) 39-48]. In this paper, we first improve on the fundamental result of Serfling [Ann. Statist. 2 (1974) 39-48], and further extend it to obtain a Bernstein concentration bound for sampling without replacement. We then derive an empirical version of our bound that does not require the variance to be known to the user.
Submitted 27 July, 2015; v1 submitted 16 September, 2013;
originally announced September 2013.
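As a point of reference for this abstract, the sketch below computes the confidence radius given by the classical Hoeffding-Serfling inequality, in the form in which it is commonly stated: for $n$ draws without replacement from a population of size $N$ with values in an interval of width $b-a$, the sample mean exceeds the population mean by more than $(b-a)\sqrt{(1-(n-1)/N)\log(1/\delta)/(2n)}$ with probability at most $\delta$. This is the baseline that the paper improves on, not the paper's own bound; the function names are hypothetical.

```python
import math


def hoeffding_radius(n, delta, value_range=1.0):
    """One-sided Hoeffding radius, valid for sampling with replacement."""
    return value_range * math.sqrt(math.log(1.0 / delta) / (2.0 * n))


def hoeffding_serfling_radius(n, population_size, delta, value_range=1.0):
    """One-sided radius from the classical Serfling bound (assumed form).

    The factor 1 - (n - 1) / N accounts for sampling without replacement.
    """
    if not 1 <= n <= population_size:
        raise ValueError("n must satisfy 1 <= n <= population_size")
    factor = 1.0 - (n - 1) / population_size
    return value_range * math.sqrt(factor * math.log(1.0 / delta) / (2.0 * n))


if __name__ == "__main__":
    N = 1000
    for n in (1, 100, 500, 1000):
        print(n, hoeffding_radius(n, 0.05), hoeffding_serfling_radius(n, N, 0.05))
```

The two radii coincide at $n = 1$ and the Serfling radius becomes markedly smaller once $n$ is a sizeable fraction of $N$.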
-
The Pierre Auger Observatory: Contributions to the 33rd International Cosmic Ray Conference (ICRC 2013)
Authors:
The Pierre Auger Collaboration,
Alexander Aab,
Pedro Abreu,
Marco Aglietta,
Markus Ahlers,
Eun-Joo Ahn,
Ivone Albuquerque,
Ingomar Allekotte,
Jeff Allen,
Patrick Allison,
Alejandro Almela,
Jesus Alvarez Castillo,
Jaime Alvarez-Muñiz,
Rafael Alves Batista,
Michelangelo Ambrosio,
Amin Aminaei,
Luis Anchordoqui,
Sofia Andringa,
Tome Antičić,
Carla Aramo,
Fernando Arqueros,
Hernán Gonzalo Asorey,
Pedro Assis,
Julien Aublin,
Maximo Ave
, et al. (473 additional authors not shown)
Abstract:
Contributions of the Pierre Auger Collaboration to the 33rd International Cosmic Ray Conference, Rio de Janeiro, Brazil, July 2013
Submitted 18 July, 2013;
originally announced July 2013.
-
Adaptive MCMC with online relabeling
Authors:
Rémi Bardenet,
Olivier Cappé,
Gersende Fort,
Balázs Kégl
Abstract:
When targeting a distribution that is artificially invariant under some permutations, Markov chain Monte Carlo (MCMC) algorithms face the label-switching problem, rendering marginal inference particularly cumbersome. Such a situation arises, for example, in the Bayesian analysis of finite mixture models. Adaptive MCMC algorithms such as adaptive Metropolis (AM), which self-calibrates its proposal distribution using an online estimate of the covariance matrix of the target, are no exception. To address the label-switching issue, relabeling algorithms associate a permutation to each MCMC sample, trying to obtain reasonable marginals. In the case of adaptive Metropolis (Bernoulli 7 (2001) 223-242), an online relabeling strategy is required. This paper is devoted to the AMOR algorithm, a provably consistent variant of AM that can cope with the label-switching problem. The idea is to nest relabeling steps within the MCMC algorithm based on the estimation of a single covariance matrix that is used both for adapting the covariance of the proposal distribution in the Metropolis algorithm step and for online relabeling. We compare the behavior of AMOR to similar relabeling methods. In the case of compactly supported target distributions, we prove a strong law of large numbers for AMOR and its ergodicity. These are the first results on the consistency of an online relabeling algorithm to our knowledge. The proof underlines latent relations between relabeling and vector quantization.
Submitted 27 July, 2015; v1 submitted 9 October, 2012;
originally announced October 2012.
-
Antennas for the Detection of Radio Emission Pulses from Cosmic-Ray induced Air Showers at the Pierre Auger Observatory
Authors:
P. Abreu,
M. Aglietta,
M. Ahlers,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
A. Almela,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
R. Alves Batista,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave
, et al. (490 additional authors not shown)
Abstract:
The Pierre Auger Observatory is exploring the potential of the radio detection technique to study extensive air showers induced by ultra-high energy cosmic rays. The Auger Engineering Radio Array (AERA) addresses both technological and scientific aspects of the radio technique. A first phase of AERA has been operating since September 2010 with detector stations observing radio signals at frequencies between 30 and 80 MHz. In this paper we present comparative studies to identify and optimize the antenna design for the final configuration of AERA consisting of 160 individual radio detector stations. The transient nature of the air shower signal requires a detailed description of the antenna sensor. As the ultra-wideband reception of pulses is not widely discussed in antenna literature, we review the relevant antenna characteristics and enhance theoretical considerations towards the impulse response of antennas including polarization effects and multiple signal reflections. On the basis of the vector effective length we study the transient response characteristics of three candidate antennas in the time domain. Observing the variation of the continuous galactic background intensity we rank the antennas with respect to the noise level added to the galactic signal.
Submitted 17 September, 2012;
originally announced September 2012.
-
The Rapid Atmospheric Monitoring System of the Pierre Auger Observatory
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
M. Ahlers,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
A. Almela,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
R. Alves Batista,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin
, et al. (486 additional authors not shown)
Abstract:
The Pierre Auger Observatory is a facility built to detect air showers produced by cosmic rays above $10^{17}$ eV. During clear nights with a low illuminated moon fraction, the UV fluorescence light produced by air showers is recorded by optical telescopes at the Observatory. To correct the observations for variations in atmospheric conditions, atmospheric monitoring is performed at regular intervals ranging from several minutes (for cloud identification) to several hours (for aerosol conditions) to several days (for vertical profiles of temperature, pressure, and humidity). In 2009, the monitoring program was upgraded to allow for additional targeted measurements of atmospheric conditions shortly after the detection of air showers of special interest, e.g., showers produced by very high-energy cosmic rays or showers with atypical longitudinal profiles. The former events are of particular importance for the determination of the energy scale of the Observatory, and the latter are characteristic of unusual air shower physics or exotic primary particle types. The purpose of targeted (or "rapid") monitoring is to improve the resolution of the atmospheric measurements for such events. In this paper, we report on the implementation of the rapid monitoring program and its current status. The rapid monitoring data have been analyzed and applied to the reconstruction of air showers of high interest, and indicate that the air fluorescence measurements affected by clouds and aerosols are effectively corrected using measurements from the regular atmospheric monitoring program. We find that the rapid monitoring program has potential for supporting dedicated physics analyses beyond the standard event reconstruction.
Submitted 4 August, 2012;
originally announced August 2012.
-
A search for ultra-high energy neutrinos in highly inclined events at the Pierre Auger Observatory
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
M. Ahlers,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
A. Almela,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Anticic,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave
, et al. (475 additional authors not shown)
Abstract:
The Surface Detector of the Pierre Auger Observatory is sensitive to neutrinos of all flavours above 0.1 EeV. These interact through charged and neutral currents in the atmosphere giving rise to extensive air showers. When interacting deeply in the atmosphere at nearly horizontal incidence, neutrinos can be distinguished from regular hadronic cosmic rays by the broad time structure of their shower signals in the water-Cherenkov detectors. In this paper we present for the first time an analysis based on down-going neutrinos. We describe the search procedure, the possible sources of background, the method to compute the exposure and the associated systematic uncertainties. No candidate neutrinos have been found in data collected from 1 January 2004 to 31 May 2010. Assuming an $E^{-2}$ differential energy spectrum, the limit on the single-flavour neutrino flux is $E^2\,dN/dE < 1.74\times 10^{-7}$ GeV cm$^{-2}$ s$^{-1}$ sr$^{-1}$ at 90% C.L. in the energy range $1\times 10^{17}$ eV $< E < 1\times 10^{20}$ eV.
Submitted 7 February, 2012;
originally announced February 2012.
-
Description of Atmospheric Conditions at the Pierre Auger Observatory using the Global Data Assimilation System (GDAS)
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
M. Ahlers,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
A. Almela,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave
, et al. (477 additional authors not shown)
Abstract:
Atmospheric conditions at the site of a cosmic ray observatory must be known for reconstructing observed extensive air showers. The Global Data Assimilation System (GDAS) is a global atmospheric model predicated on meteorological measurements and numerical weather predictions. GDAS provides altitude-dependent profiles of the main state variables of the atmosphere like temperature, pressure, and humidity. The original data and their application to the air shower reconstruction of the Pierre Auger Observatory are described. By comparisons with radiosonde and weather station measurements obtained on-site in Malargüe and averaged monthly models, the utility of the GDAS data is shown.
Submitted 24 January, 2012; v1 submitted 11 January, 2012;
originally announced January 2012.
-
The effect of the geomagnetic field on cosmic ray energy estimates and large scale anisotropy searches on data from the Pierre Auger Observatory
Authors:
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier,
G. Avila
, et al. (473 additional authors not shown)
Abstract:
We present a comprehensive study of the influence of the geomagnetic field on the energy estimation of extensive air showers with a zenith angle smaller than $60^\circ$, detected at the Pierre Auger Observatory. The geomagnetic field induces an azimuthal modulation of the estimated energy of cosmic rays up to the ~2% level at large zenith angles. We present a method to account for this modulation of the reconstructed energy. We analyse the effect of the modulation on large scale anisotropy searches in the arrival direction distributions of cosmic rays. At a given energy, the geomagnetic effect is shown to induce a pseudo-dipolar pattern at the percent level in the declination distribution that needs to be accounted for.
Submitted 30 November, 2011;
originally announced November 2011.
-
The Lateral Trigger Probability function for the Ultra-High Energy Cosmic Ray Showers detected by the Pierre Auger Observatory
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (473 additional authors not shown)
Abstract:
In this paper we introduce the concept of Lateral Trigger Probability (LTP) function, i.e., the probability for an extensive air shower (EAS) to trigger an individual detector of a ground based array as a function of distance to the shower axis, taking into account energy, mass and direction of the primary cosmic ray. We apply this concept to the surface array of the Pierre Auger Observatory consisting of a 1.5 km spaced grid of about 1600 water Cherenkov stations. Using Monte Carlo simulations of ultra-high energy showers the LTP functions are derived for energies in the range between $10^{17}$ and $10^{19}$ eV and zenith angles up to $65^\circ$. A parametrization combining a step function with an exponential is found to reproduce them very well in the considered range of energies and zenith angles. The LTP functions can also be obtained from data using events simultaneously observed by the fluorescence and the surface detector of the Pierre Auger Observatory (hybrid events). We validate the Monte Carlo results by showing that LTP functions from data are in good agreement with simulations.
Submitted 28 November, 2011;
originally announced November 2011.
-
Search for signatures of magnetically-induced alignment in the arrival directions measured by the Pierre Auger Observatory
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (474 additional authors not shown)
Abstract:
We present the results of an analysis of data recorded at the Pierre Auger Observatory in which we search for groups of directionally-aligned events (or `multiplets') which exhibit a correlation between arrival direction and the inverse of the energy. These signatures are expected from sets of events coming from the same source after having been deflected by intervening coherent magnetic fields. The observation of several events from the same source would open the possibility to accurately reconstruct the position of the source and also measure the integral of the component of the magnetic field orthogonal to the trajectory of the cosmic rays. We describe the largest multiplets found and compute the probability that they appeared by chance from an isotropic distribution. We find no statistically significant evidence for the presence of multiplets arising from magnetic deflections in the present data.
Submitted 10 November, 2011;
originally announced November 2011.
-
The Pierre Auger Observatory I: The Cosmic Ray Energy Spectrum and Related Measurements
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (471 additional authors not shown)
Abstract:
Studies of the cosmic ray energy spectrum at the highest energies with the Pierre Auger Observatory
Submitted 24 July, 2011;
originally announced July 2011.
-
The Pierre Auger Observatory V: Enhancements
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (471 additional authors not shown)
Abstract:
Ongoing and planned enhancements of the Pierre Auger Observatory
Submitted 24 July, 2011;
originally announced July 2011.
-
The Pierre Auger Observatory IV: Operation and Monitoring
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (471 additional authors not shown)
Abstract:
Technical reports on operations and monitoring of the Pierre Auger Observatory
Submitted 24 July, 2011;
originally announced July 2011.
-
The Pierre Auger Observatory III: Other Astrophysical Observations
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (471 additional authors not shown)
Abstract:
Astrophysical observations of ultra-high-energy cosmic rays with the Pierre Auger Observatory
Submitted 24 July, 2011;
originally announced July 2011.
-
The Pierre Auger Observatory II: Studies of Cosmic Ray Composition and Hadronic Interaction models
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (471 additional authors not shown)
Abstract:
Studies of the composition of the highest energy cosmic rays with the Pierre Auger Observatory, including examination of hadronic physics effects on the structure of extensive air showers.
Submitted 24 July, 2011;
originally announced July 2011.
-
Anisotropy and chemical composition of ultra-high energy cosmic rays using arrival directions measured by the Pierre Auger Observatory
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier
, et al. (468 additional authors not shown)
Abstract:
The Pierre Auger Collaboration has reported evidence for anisotropy in the distribution of arrival directions of the cosmic rays with energies $E>E_{th}=5.5\times 10^{19}$ eV. These show a correlation with the distribution of nearby extragalactic objects, including an apparent excess around the direction of Centaurus A. If the particles responsible for these excesses at $E>E_{th}$ are heavy nuclei with charge $Z$, the proton component of the sources should lead to excesses in the same regions at energies $E/Z$. We here report the lack of anisotropies in these directions at energies above $E_{th}/Z$ (for illustrative values of $Z=6,\ 13,\ 26$). If the anisotropies above $E_{th}$ are due to nuclei with charge $Z$, and under reasonable assumptions about the acceleration process, these observations imply stringent constraints on the allowed proton fraction at the lower energies.
Submitted 4 July, 2011; v1 submitted 15 June, 2011;
originally announced June 2011.
-
Search for First Harmonic Modulation in the Right Ascension Distribution of Cosmic Rays Detected at the Pierre Auger Observatory
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier,
G. Avila
, et al. (444 additional authors not shown)
Abstract:
We present the results of searches for dipolar-type anisotropies in different energy ranges above $2.5\times 10^{17}$ eV with the surface detector array of the Pierre Auger Observatory, reporting on both the phase and the amplitude measurements of the first harmonic modulation in the right-ascension distribution. Upper limits on the amplitudes are obtained, which provide the most stringent bounds at present, being below 2% at 99% C.L. for EeV energies. We also compare our results to those of previous experiments as well as with some theoretical expectations.
Submitted 14 March, 2011;
originally announced March 2011.
-
Advanced functionality for radio analysis in the Offline software framework of the Pierre Auger Observatory
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
I. F. M. Albuquerque,
D. Allard,
I. Allekotte,
J. Allen,
P. Allison,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
C. Aramo,
E. Arganda,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier,
G. Avila
, et al. (446 additional authors not shown)
Abstract:
The advent of the Auger Engineering Radio Array (AERA) necessitates the development of a powerful framework for the analysis of radio measurements of cosmic ray air showers. As AERA performs "radio-hybrid" measurements of air shower radio emission in coincidence with the surface particle detectors and fluorescence telescopes of the Pierre Auger Observatory, the radio analysis functionality had to be incorporated in the existing hybrid analysis solutions for fluorescence and surface detector data. This goal has been achieved in a natural way by extending the existing Auger Offline software framework with radio functionality. In this article, we lay out the design, highlights and features of the radio extension implemented in the Auger Offline framework. Its functionality has achieved a high degree of sophistication and offers advanced features such as vectorial reconstruction of the electric field, advanced signal processing algorithms, a transparent and efficient handling of FFTs, a very detailed simulation of detector effects, and the read-in of multiple data formats including data from various radio simulation codes. The source code of this radio functionality can be made available to interested parties on request.
Submitted 3 February, 2011; v1 submitted 24 January, 2011;
originally announced January 2011.
-
Update on the correlation of the highest energy cosmic rays with nearby extragalactic matter
Authors:
The Pierre Auger Collaboration,
P. Abreu,
M. Aglietta,
E. J. Ahn,
D. Allard,
I. Allekotte,
J. Allen,
J. Alvarez Castillo,
J. Alvarez-Muñiz,
M. Ambrosio,
A. Aminaei,
L. Anchordoqui,
S. Andringa,
T. Antičić,
A. Anzalone,
C. Aramo,
E. Arganda,
K. Arisaka,
F. Arqueros,
H. Asorey,
P. Assis,
J. Aublin,
M. Ave,
M. Avenier,
G. Avila
, et al. (450 additional authors not shown)
Abstract:
Data collected by the Pierre Auger Observatory through 31 August 2007 showed evidence for anisotropy in the arrival directions of cosmic rays above the Greisen-Zatsepin-Kuz'min energy threshold, $6\times 10^{19}$ eV. The anisotropy was measured by the fraction of arrival directions that are less than $3.1^\circ$ from the position of an active galactic nucleus within 75 Mpc (using the Véron-Cetty and Véron $12^{\rm th}$ catalog). An updated measurement of this fraction is reported here using the arrival directions of cosmic rays recorded above the same energy threshold through 31 December 2009. The number of arrival directions has increased from 27 to 69, allowing a more precise measurement. The correlating fraction is $(38^{+7}_{-6})\%$, compared with $21\%$ expected for isotropic cosmic rays. This is down from the early estimate of $(69^{+11}_{-13})\%$. The enlarged set of arrival directions is also examined in relation to other populations of nearby extragalactic objects: galaxies in the 2 Microns All Sky Survey and active galactic nuclei detected in hard X-rays by the Swift Burst Alert Telescope. A celestial region around the position of the radiogalaxy Cen A has the largest excess of arrival directions relative to isotropic expectations. The 2-point autocorrelation function is shown for the enlarged set of arrival directions and compared to the isotropic expectation.
Submitted 29 September, 2010; v1 submitted 9 September, 2010;
originally announced September 2010.