-
Altermagnetism imaged and controlled down to the nanoscale
Authors:
O. J. Amin,
A. Dal Din,
E. Golias,
Y. Niu,
A. Zakharov,
S. C. Fromage,
C. J. B. Fields,
S. L. Heywood,
R. B. Cousins,
J. Krempasky,
J. H. Dil,
D. Kriegner,
B. Kiraly,
R. P. Campion,
A. W. Rushforth,
K. W. Edmonds,
S. S. Dhesi,
L. Šmejkal,
T. Jungwirth,
P. Wadley
Abstract:
Nanoscale detection and control of the magnetic order underpins a broad spectrum of fundamental research and practical device applications. The key principle involved is the breaking of time-reversal ($\cal{T}$) symmetry, which in ferromagnets is generated by an internal magnetization. However, the presence of a net-magnetization also imposes severe limitations on compatibility with other prominent phases ranging from superconductors to topological insulators, as well as on spintronic device scalability. Recently, altermagnetism has been proposed as a solution to this restriction, since it shares the enabling $\cal{T}$-symmetry breaking characteristic of ferromagnetism, combined with the antiferromagnetic-like vanishing net-magnetization. To date, altermagnetic ordering has been inferred from spatially averaged probes. Here, we demonstrate nanoscale imaging and control of altermagnetic ordering ranging from nanoscale vortices to domain walls to microscale single-domain states in MnTe. We combine the $\cal{T}$-symmetry breaking sensitivity of X-ray magnetic circular dichroism with magnetic linear dichroism and photoemission electron microscopy, to achieve detailed imaging of the local altermagnetic ordering vector. A rich variety of spin configurations can be imposed using microstructure patterning or thermal cycling in magnetic fields. The demonstrated detection and control of altermagnetism paves the way for future research ranging from ultra-scalable digital and neuromorphic spintronic devices, to the interplay of altermagnetism with non-dissipative superconducting or topological phases.
Submitted 3 May, 2024;
originally announced May 2024.
-
PHYSTAT Informal Review: Marginalizing versus Profiling of Nuisance Parameters
Authors:
Robert D. Cousins,
Larry Wasserman
Abstract:
This is a writeup, with some elaboration, of the talks by the two authors (a physicist and a statistician) at the first PHYSTAT Informal review on January 24, 2024. We discuss Bayesian and frequentist approaches to dealing with nuisance parameters, in particular, integrated versus profiled likelihood methods. In regular models, with finitely many parameters and large sample sizes, the two approaches are asymptotically equivalent. But, outside this setting, the two methods can lead to different tests and confidence intervals. Assessing which approach is better generally requires comparing the power of the tests or the length of the confidence intervals. This analysis has to be conducted on a case-by-case basis. In the extreme case where the number of nuisance parameters is very large, possibly infinite, neither approach may be useful. Part I provides an informal history of usage in high energy particle physics, including a simple illustrative example. Part II includes an overview of some more recently developed methods in the statistics literature, including methods applicable when the use of the likelihood function is problematic.
Submitted 26 April, 2024;
originally announced April 2024.
-
Hyperpixels: Pixel Filter Arrays of Multivariate Optical Elements for Optimized Spectral Imaging
Authors:
Calum Williams,
Richard Cousins,
Christopher J. Mellor,
Sarah E. Bohndiek,
George S. D. Gordon
Abstract:
We introduce the concept of `hyperpixels' in which each element of a pixel filter array (suitable for CMOS image sensor integration) has a spectral transmission tailored to a target spectral component expected in application-specific scenes. These are analogous to arrays of multivariate optical elements that could be used for sensing specific analytes. Spectral tailoring is achieved by engineering the heights of multiple sub-pixel Fabry-Perot resonators that cover each pixel area. We first present a design approach for hyperpixels, based on a matched filter concept and, as an exemplar, design a set of 4 hyperpixels tailored to optimally discriminate between 4 spectral reflectance targets. Next, we fabricate repeating 2x2 pixel filter arrays of these designs, alongside repeating 2x2 arrays of optimal bandpass filters, and perform both spectral and imaging characterization. Experimentally measured hyperpixel transmission spectra show a 2.4x reduction in unmixing matrix condition number (p=0.031) compared to the optimal bandpass set. Imaging experiments using the filter arrays with a monochrome sensor achieve a 3.47x reduction in unmixing matrix condition number (p=0.020) compared to the optimal bandpass set. This demonstrates the utility of the hyperpixel approach and shows its superiority even over the optimal bandpass case. We expect that with further improvements in design and fabrication processes increased performance may be obtained. Because the hyperpixels are straightforward to customize and fabricate, and can be placed atop monochrome sensors, this approach is highly versatile and could be adapted to a wide range of real-time imaging applications which are limited by low SNR, including micro-endoscopy, capsule endoscopy, industrial inspection and machine vision.
Submitted 25 March, 2024;
originally announced March 2024.
-
Single- and multi-layer micro-scale diffractive lens fabrication for fiber imaging probes with versatile depth-of-field
Authors:
Fei He,
Rafael Fuentes-Dominguez,
Richard Cousins,
Christopher J. Mellor,
Jennifer K. Barton,
George S. D. Gordon
Abstract:
Hair-thin optical fiber endoscopes have opened up new paradigms for advanced imaging applications in vivo. In certain applications, such as optical coherence tomography (OCT), light-shaping structures may be required on fiber facets to generate needle-like Bessel beams with large depth-of-field, while in others shorter depths of field with high lateral resolutions are preferable. In this paper, we demonstrate a novel method to fabricate light-shaping structures on optical fibres, achieved via bonding encapsulated planar diffractive lenses onto fiber facets. Diffractive metallic structures have the advantages of being simple to design, fabricate and transfer, and our encapsulation approach is scalable to multi-layer stacks. As a demonstration, we design and transfer a Fresnel zone plate and a diffractive axicon onto fiber facets, and show that the latter device generates a needle-like Bessel beam with 350 μm focal depth. We also evaluate the imaging performance of both devices and show that the axicon fiber is able to maintain focussed images of a USAF resolution target over a 150 μm distance. Finally, we fabricate a two-layer stack of Fresnel zone plates on a fiber and characterise the modified beam profile and demonstrate good imaging performance. We anticipate our fabrication approach could enable multi-functional complex optical structures (e.g. using plasmonics, polarization control) to be integrated onto fibers for ultra-thin advanced imaging and sensing.
Submitted 25 January, 2024;
originally announced January 2024.
-
Hele-Shaw flow of a nematic liquid crystal
Authors:
Joseph R. L. Cousins,
Nigel J. Mottram,
Stephen K. Wilson
Abstract:
Motivated by the variety of applications in which nematic Hele-Shaw flow occurs, a theoretical model for Hele-Shaw flow of a nematic liquid crystal is formulated and analysed. We derive the thin-film Ericksen-Leslie equations that govern nematic Hele-Shaw flow, and consider two important limiting cases in which we can make significant analytical progress. Firstly, we consider the leading-order problem in the limiting case in which elasticity effects dominate viscous effects, and find that the nematic liquid crystal anchoring on the plates leads to a fixed director field and an anisotropic patterned viscosity that can be used to guide the flow of the nematic. Secondly, we consider the leading-order problem in the opposite limiting case in which viscous effects dominate elasticity effects, and find that the flow is identical to that of an isotropic fluid and the behaviour of the director is determined by the flow. As an example of the insight which can be gained by using the present approach, we then consider the flow of nematic according to a simple model for the squeezing stage of the One Drop Filling method, an important method for the manufacture of Liquid Crystal Displays, in these two limiting cases.
Submitted 23 January, 2024;
originally announced January 2024.
-
Weak-anchoring effects in a thin pinned ridge of nematic liquid crystal
Authors:
Joseph R. L. Cousins,
Akhshay S. Bhadwal,
Lindsey T. Corson,
Brian R. Duffy,
Ian C. Sage,
Carl V. Brown,
Nigel J. Mottram,
Stephen K. Wilson
Abstract:
A theoretical investigation of weak-anchoring effects in a thin two-dimensional pinned static ridge of nematic liquid crystal resting on a flat solid substrate in an atmosphere of passive gas is performed. Specifically, we solve a reduced version of the general system of governing equations recently derived by Cousins et al. [Proc. Roy. Soc. A, 478(2259):20210849, 2022] valid for a symmetric thin ridge under the one-constant approximation of the Frank--Oseen bulk elastic energy with pinned contact lines to determine the shape of the ridge and the behaviour of the director within it. Numerical investigations covering a wide range of parameter values indicate that the energetically-preferred solutions can be classified in terms of the Jenkins--Barratt--Barbero--Barberi critical thickness into five qualitatively different types of solution. In particular, the theoretical results suggest that anchoring breaking occurs close to the contact lines. The theoretical predictions are supported by the results of physical experiments for a ridge of the nematic 4'-pentyl-4-biphenylcarbonitrile (5CB). In particular, these experiments show that the homeotropic anchoring at the gas--nematic interface is broken close to the contact lines by the stronger rubbed planar anchoring at the nematic--substrate interface. A comparison between the experimental values of and the theoretical predictions for the effective refractive index of the ridge gives a first estimate of the anchoring strength of an interface between air and 5CB to be $(9.80\pm1.12)\times10^{-6}\,{\rm N m}^{-1}$ at a temperature of $(22\pm1.5)^\circ$C.
Submitted 30 March, 2023;
originally announced March 2023.
-
Young and Young--Laplace equations for a static ridge of nematic liquid crystal, and transitions between equilibrium states
Authors:
Joseph R. L. Cousins,
Brian R. Duffy,
Stephen K. Wilson,
Nigel J. Mottram
Abstract:
Motivated by the need for greater understanding of systems that involve interfaces between a nematic liquid crystal, a solid substrate, and a passive gas that includes nematic--substrate--gas three-phase contact lines, we analyse a two-dimensional static ridge of nematic resting on a solid substrate in an atmosphere of passive gas. Specifically, we obtain the first complete theoretical description for this system, including nematic Young and Young--Laplace equations, and then, under the assumption that anchoring breaking occurs in regions adjacent to the contact lines, we use the nematic Young equations to determine the continuous and discontinuous transitions that occur between the equilibrium states of complete wetting, partial wetting, and complete dewetting. In particular, in addition to continuous transitions analogous to those that occur in the classical case of an isotropic liquid, we find a variety of discontinuous transitions, as well as contact-angle hysteresis, and regions of parameter space in which there exist multiple partial wetting states that do not occur in the classical case.
Submitted 15 November, 2021;
originally announced November 2021.
-
What is the likelihood function, and how is it used in particle physics?
Authors:
Robert D. Cousins
Abstract:
Likelihood functions are ubiquitous in data analyses at the LHC and elsewhere in particle physics. Partly because "probability" and "likelihood" are virtual synonyms in everyday English, but crucially distinct in data analysis, there is great potential for confusion. Furthermore, each of various approaches to statistical inference (likelihoodist, Neyman-Pearson, Bayesian) uses the likelihood function in different ways. This note is intended to provide a brief introduction at the advanced undergraduate or beginning graduate student level, citing a few papers giving examples and containing numerous pointers to the vast literature on likelihood. The Likelihood Principle (routinely violated in particle physics analyses) is mentioned as an unresolved issue in the philosophical foundations of statistics.
Submitted 1 October, 2020;
originally announced October 2020.
-
Connections between statistical practice in elementary particle physics and the severity concept as discussed in Mayo's Statistical Inference as Severe Testing
Authors:
Robert D. Cousins
Abstract:
For many years, philosopher-of-statistics Deborah Mayo has been advocating the concept of severe testing as a key part of hypothesis testing. Her recent book, Statistical Inference as Severe Testing, is a comprehensive exposition of her arguments in the context of a historical study of many threads of statistical inference, both frequentist and Bayesian. Her foundational point of view is called error statistics, emphasizing frequentist evaluation of the errors called Type I and Type II in the Neyman-Pearson theory of frequentist hypothesis testing. Since the field of elementary particle physics (also known as high energy physics) has strong traditions in frequentist inference, one might expect that something like the severity concept was independently developed in the field. Indeed, I find that, at least operationally (numerically), we high-energy physicists have long interpreted data in ways that map directly onto severity. Whether or not we subscribe to Mayo's philosophical interpretations of severity is a more complicated story that I do not address here.
Submitted 13 January, 2021; v1 submitted 22 February, 2020;
originally announced February 2020.
-
Many perspectives on Deborah Mayo's "Statistical Inference as Severe Testing: How to Get Beyond the Statistics Wars"
Authors:
Andrew Gelman,
Brian Haig,
Christian Hennig,
Art Owen,
Robert Cousins,
Stan Young,
Christian Robert,
Corey Yanofsky,
E. J. Wagenmakers,
Ron Kenett,
Daniel Lakeland
Abstract:
The new book by philosopher Deborah Mayo is relevant to data science for topical reasons, as she takes various controversial positions regarding hypothesis testing and statistical practice, and also as an entry point to thinking about the philosophy of statistics. The present article is a slightly expanded version of a series of informal reviews and comments on Mayo's book. We hope this discussion will introduce people to Mayo's ideas along with other perspectives on the topics she addresses.
Submitted 29 May, 2019; v1 submitted 21 May, 2019;
originally announced May 2019.
-
Comment on "Optimal prior for Bayesian inference in a constrained parameter space" by S. Hannestad and T. Tram, arXiv:1710.08899
Authors:
Robert D. Cousins
Abstract:
The Jeffreys prior for a constrained part of a parameter space is the same as that for the unconstrained space, contrary to the assertions of Hannestad and Tram.
Submitted 20 February, 2019;
originally announced February 2019.
-
Lectures on Statistics in Theory: Prelude to Statistics in Practice
Authors:
Robert D. Cousins
Abstract:
This is a writeup of lectures on "statistics" that have evolved from the initial version for the 2009 Hadron Collider Physics Summer School at CERN to versions for other venues and, most recently, for the African School of Fundamental Physics and Applications in 2024. The emphasis is on foundations, using simple examples to illustrate the points that are still debated in the professional statistics literature. The three main approaches to interval estimation (Neyman confidence, Bayesian, likelihood ratio) are discussed and compared in detail, with and without nuisance parameters. Hypothesis testing is discussed mainly from the frequentist point of view, with pointers to the Bayesian literature. Various foundational issues are emphasized, including the conditionality principle and the likelihood principle.
Submitted 26 June, 2024; v1 submitted 16 July, 2018;
originally announced July 2018.
-
Should unfolded histograms be used to test hypotheses?
Authors:
Robert D. Cousins,
Samuel J. May,
Yipeng Sun
Abstract:
In many analyses in high energy physics, attempts are made to remove the effects of detector smearing in data by techniques referred to as "unfolding" histograms, thus obtaining estimates of the true values of histogram bin contents. Such unfolded histograms are then compared to theoretical predictions, either to judge the goodness of fit of a theory, or to compare the abilities of two or more theories to describe the data. When doing this, even informally, one is testing hypotheses. However, a more fundamentally sound way to test hypotheses is to smear the theoretical predictions by simulating detector response and then comparing to the data without unfolding; this is also frequently done in high energy physics, particularly in searches for new physics. One can thus ask: to what extent does hypothesis testing after unfolding data materially reproduce the results obtained from testing by smearing theoretical predictions? We argue that this "bottom-line-test" of unfolding methods should be studied more commonly, in addition to common practices of examining variance and bias of estimates of the true contents of histogram bins. We illustrate bottom-line-tests in a simple toy problem with two hypotheses.
Submitted 24 July, 2016;
originally announced July 2016.
-
Observation of the rare $B^0_s\to\mu^+\mu^-$ decay from the combined analysis of CMS and LHCb data
Authors:
The CMS and LHCb Collaborations,
V. Khachatryan,
A. M. Sirunyan,
A. Tumasyan,
W. Adam,
T. Bergauer,
M. Dragicevic,
J. Erö,
M. Friedl,
R. Frühwirth,
V. M. Ghete,
C. Hartl,
N. Hörmann,
J. Hrubec,
M. Jeitler,
W. Kiesenhofer,
V. Knünz,
M. Krammer,
I. Krätschmer,
D. Liko,
I. Mikulec,
D. Rabady,
B. Rahbaran
, et al. (2807 additional authors not shown)
Abstract:
A joint measurement is presented of the branching fractions $B^0_s\to\mu^+\mu^-$ and $B^0\to\mu^+\mu^-$ in proton-proton collisions at the LHC by the CMS and LHCb experiments. The data samples were collected in 2011 at a centre-of-mass energy of 7 TeV, and in 2012 at 8 TeV. The combined analysis produces the first observation of the $B^0_s\to\mu^+\mu^-$ decay, with a statistical significance exceeding six standard deviations, and the best measurement of its branching fraction so far. Furthermore, evidence for the $B^0\to\mu^+\mu^-$ decay is obtained with a statistical significance of three standard deviations. The branching fraction measurements are statistically compatible with SM predictions and impose stringent constraints on several theories beyond the SM.
Submitted 17 August, 2015; v1 submitted 17 November, 2014;
originally announced November 2014.
-
The Jeffreys-Lindley Paradox and Discovery Criteria in High Energy Physics
Authors:
Robert D. Cousins
Abstract:
The Jeffreys-Lindley paradox displays how the use of a p-value (or number of standard deviations z) in a frequentist hypothesis test can lead to an inference that is radically different from that of a Bayesian hypothesis test in the form advocated by Harold Jeffreys in the 1930s and common today. The setting is the test of a well-specified null hypothesis (such as the Standard Model of elementary particle physics, possibly with "nuisance parameters") versus a composite alternative (such as the Standard Model plus a new force of nature of unknown strength). The p-value, as well as the ratio of the likelihood under the null hypothesis to the maximized likelihood under the alternative, can strongly disfavor the null hypothesis, while the Bayesian posterior probability for the null hypothesis can be arbitrarily large. The academic statistics literature contains many impassioned comments on this paradox, yet there is no consensus either on its relevance to scientific communication or on its correct resolution. The paradox is quite relevant to frontier research in high energy physics. This paper is an attempt to explain the situation to both physicists and statisticians, in the hope that further progress can be made.
Submitted 23 August, 2014; v1 submitted 14 October, 2013;
originally announced October 2013.
-
Nonlinear modal coupling in a high-stress doubly-clamped nanomechanical resonator
Authors:
K. J. Lulla,
R. B. Cousins,
A. Venkatesan,
M. J. Patton,
A. D. Armour,
C. J. Mellor,
J. R. Owers-Bradley
Abstract:
We present results from a study of the nonlinear intermodal coupling between different flexural vibrational modes of a single high-stress, doubly-clamped silicon nitride nanomechanical beam. The measurements were carried out at 100 mK and the beam was actuated using the magnetomotive technique. We observed the nonlinear behavior of the modes individually and also measured the coupling between them by driving the beam at multiple frequencies. We demonstrate that the different modes of the resonator are coupled to each other by the displacement induced tension in the beam, which also leads to the well known Duffing nonlinearity in doubly-clamped beams.
Submitted 19 April, 2012;
originally announced April 2012.
-
Negatively Biased Relevant Subsets Induced by the Most-Powerful One-Sided Upper Confidence Limits for a Bounded Physical Parameter
Authors:
Robert D. Cousins
Abstract:
Suppose an observable x is the measured value (negative or non-negative) of a true mean mu (physically non-negative) in an experiment with a Gaussian resolution function with known fixed rms deviation s. The most powerful one-sided upper confidence limit at 95% C.L. is UL = x+1.64s, which I refer to as the "original diagonal line". Perceived problems in HEP with small or non-physical upper limits for x<0 historically led, for example, to substitution of max(0,x) for x, and eventually to abandonment in the Particle Data Group's Review of Particle Physics of this diagonal line relationship between UL and x. Recently Cowan, Cranmer, Gross, and Vitells (CCGV) have advocated a concept of "power constraint" that when applied to this problem yields variants of diagonal line, including UL = max(-1,x)+1.64s. Thus it is timely to consider again what is problematic about the original diagonal line, and whether or not modifications cure these defects. In a 2002 Comment, statistician Leon Jay Gleser pointed to the literature on recognizable and relevant subsets. For upper limits given by the original diagonal line, the sample space for x has recognizable relevant subsets in which the quoted 95% C.L. is known to be negatively biased (anti-conservative) by a finite amount for all values of mu. This issue is at the heart of a dispute between Jerzy Neyman and Sir Ronald Fisher over fifty years ago, the crux of which is the relevance of pre-data coverage probabilities when making post-data inferences. The literature describes illuminating connections to Bayesian statistics as well. Methods such as that advocated by CCGV have 100% unconditional coverage for certain values of mu and hence formally evade the traditional criteria for negatively biased relevant subsets; I argue that concerns remain. Comparison with frequentist intervals advocated by Feldman and Cousins also sheds light on the issues.
Submitted 9 September, 2011;
originally announced September 2011.
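The three upper-limit prescriptions quoted in the abstract above can be compared numerically. What follows is a minimal Python sketch, not the author's code: the function names are hypothetical, and it assumes that the floor of -1 in the power-constrained variant is expressed in units of s (the abstract writes the formulas without explicit units).

```python
def ul_diagonal(x, s=1.0, z=1.64):
    """Original diagonal line: UL = x + 1.64 s."""
    return x + z * s


def ul_max_zero(x, s=1.0, z=1.64):
    """Historical fix: substitute max(0, x) for x."""
    return max(0.0, x) + z * s


def ul_power_constrained(x, s=1.0, z=1.64, floor=-1.0):
    """Power-constrained variant: UL = max(-1, x) + 1.64 s.

    Assumption: `floor` is measured in units of s.
    """
    return max(floor * s, x) + z * s


if __name__ == "__main__":
    # Compare the prescriptions for a few measured values, in units of s.
    for x in (-2.0, -1.0, 0.0, 1.0):
        print(x, ul_diagonal(x), ul_max_zero(x), ul_power_constrained(x))
```

For a measurement two standard deviations below zero, the original diagonal line returns a negative (non-physical) limit, while the two modified prescriptions return positive limits; for positive x all three agree.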
-
Frequentist Evaluation of Intervals Estimated for a Binomial Parameter and for the Ratio of Poisson Means
Authors:
Robert D. Cousins,
Kathryn E. Hymes,
Jordan Tucker
Abstract:
Confidence intervals for a binomial parameter or for the ratio of Poisson means are commonly desired in high energy physics (HEP) applications such as measuring a detection efficiency or branching ratio. Due to the discreteness of the data, in both of these problems the frequentist coverage probability unfortunately depends on the unknown parameter. Trade-offs among desiderata have led to numerous sets of intervals in the statistics literature, while in HEP one typically encounters only the classic intervals of Clopper-Pearson (central intervals with no undercoverage but substantial over-coverage) or a few approximate methods which perform rather poorly. If strict coverage is relaxed, some sort of averaging is needed to compare intervals. In most of the statistics literature, this averaging is over different values of the unknown parameter, which is conceptually problematic from the frequentist point of view in which the unknown parameter is typically fixed. In contrast, we perform an (unconditional) {\it average over observed data} in the ratio-of-Poisson-means problem. If strict conditional coverage is desired, we recommend Clopper-Pearson intervals and intervals from inverting the likelihood ratio test (for central and non-central intervals, respectively). Lancaster's mid-$P$ modification to either provides excellent unconditional average coverage in the ratio-of-Poisson-means problem.
Submitted 14 November, 2009; v1 submitted 24 May, 2009;
originally announced May 2009.
-
Comment on "Bayesian Analysis of Pentaquark Signals from CLAS Data", with Response to the Reply by Ireland and Protopopescu
Authors:
Robert D. Cousins
Abstract:
The CLAS Collaboration has published an analysis using Bayesian model selection. My Comment criticizing their use of arbitrary prior probability density functions, and a Reply by D.G. Ireland and D. Protopopescu, have now been published as well. This paper responds to the Reply and discusses the issues in more detail, with particular emphasis on the problems of priors in Bayesian model selection.
Submitted 23 August, 2009; v1 submitted 8 July, 2008;
originally announced July 2008.
-
Annotated Bibliography of Some Papers on Combining Significances or p-values
Authors:
Robert D. Cousins
Abstract:
A question that comes up repeatedly is how to combine the results of two experiments if all that is known is that one experiment had an n-sigma effect and another experiment had an m-sigma effect. This question is not well-posed: depending on what additional assumptions are made, the preferred answer is different. The note lists some of the more prominent papers on the topic, with some brief comments and excerpts.
Submitted 20 December, 2008; v1 submitted 15 May, 2007;
originally announced May 2007.
-
Evaluation of three methods for calculating statistical significance when incorporating a systematic uncertainty into a test of the background-only hypothesis for a Poisson process
Authors:
Robert D. Cousins,
James T. Linnemann,
Jordan Tucker
Abstract:
Hypothesis tests for the presence of new sources of Poisson counts amidst background processes are frequently performed in high energy physics (HEP), gamma ray astronomy (GRA), and other branches of science. While there are conceptual issues already when the mean rate of background is precisely known, the issues are even more difficult when the mean background rate has non-negligible uncertainty. After describing a variety of methods to be found in the HEP and GRA literature, we consider in detail three classes of algorithms and evaluate them over a wide range of parameter space, by the criterion of how closely the ensemble-average Type I error rate (rejection of the background-only hypothesis when it is true) matches the nominal significance level given by the algorithm. We recommend wider use of an algorithm firmly grounded in frequentist tests of the ratio of Poisson means, although for very low counts the over-coverage can be severe due to the effect of discreteness. We extend the studies of Cranmer, who found that a popular Bayesian-frequentist hybrid can undercover severely when taken to high Z values. We also examine the profile likelihood method, which has long been used in GRA and HEP; it provides an excellent approximation in much of the parameter space, as previously studied by Rolke and collaborators.
Submitted 20 November, 2008; v1 submitted 19 February, 2007;
originally announced February 2007.
-
Application of Conditioning to the Gaussian-with-Boundary Problem in the Unified Approach to Confidence Intervals
Authors:
Robert D. Cousins
Abstract:
Roe and Woodroofe (RW) have suggested that certain conditional probabilities be incorporated into the "unified approach" for constructing confidence intervals, previously described by Feldman and Cousins (FC). RW illustrated this conditioning technique using one of the two prototype problems in the FC paper, that of Poisson processes with background. The main effect was on the upper curve in the confidence belt. In this paper, we attempt to apply this style of conditioning to the other prototype problem, that of Gaussian errors with a bounded physical region. We find that the lower curve on the confidence belt is also moved significantly, in an undesirable manner.
Submitted 15 January, 2000;
originally announced January 2000.
-
Kalman Filter Track Fits and Track Breakpoint Analysis
Authors:
Pierre Astier,
Alessandro Cardini,
Robert D. Cousins,
Antoine Letessier-Selvon,
Boris A. Popov,
Tatiana Vinogradova
Abstract:
We give an overview of track fitting using the Kalman filter method in the NOMAD detector at CERN, and emphasize how the wealth of by-product information can be used to analyze track breakpoints (discontinuities in track parameters caused by scattering, decay, etc.). After reviewing how this information has been previously exploited by others, we describe extensions which add power to breakpoint detection and characterization. We show how complete fits to the entire track, with breakpoint parameters added, can be easily obtained from the information from unbroken fits. Tests inspired by the Fisher F-test can then be used to judge breakpoints. Signed quantities (such as change in momentum at the breakpoint) can supplement unsigned quantities such as the various chi-squares. We illustrate the method with electrons from real data, and with Monte Carlo simulations of pion decays.
Submitted 16 December, 1999;
originally announced December 1999.
-
A Unified Approach to the Classical Statistical Analysis of Small Signals
Authors:
Gary J. Feldman,
Robert D. Cousins
Abstract:
We give a classical confidence belt construction which unifies the treatment of upper confidence limits for null results and two-sided confidence intervals for non-null results. The unified treatment solves a problem (apparently not previously recognized) that the choice of upper limit or two-sided intervals leads to intervals which are not confidence intervals if the choice is based on the data. We apply the construction to two related problems which have recently been a battle-ground between classical and Bayesian statistics: Poisson processes with background, and Gaussian errors with a bounded physical region. In contrast with the usual classical construction for upper limits, our construction avoids unphysical confidence intervals. In contrast with some popular Bayesian intervals, our intervals eliminate conservatism (frequentist coverage greater than the stated confidence) in the Gaussian case and reduce it to a level dictated by discreteness in the Poisson case. We generalize the method in order to apply it to analysis of experiments searching for neutrino oscillations. We show that this technique both gives correct coverage and is powerful, while other classical techniques that have been used by neutrino oscillation search experiments fail one or both of these criteria.
Submitted 15 December, 1999; v1 submitted 21 November, 1997;
originally announced November 1997.