-
Toroidal Probabilistic Spherical Discriminant Analysis
Authors:
Anna Silnova,
Niko Brümmer,
Albert Swart,
Lukáš Burget
Abstract:
In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring back-ends are commonly used, namely cosine scoring and PLDA. We have recently proposed PSDA, an analog to PLDA that uses Von Mises-Fisher distributions instead of Gaussians. In this paper, we present toroidal PSDA (T-PSDA). It extends PSDA with the ability to model within and between-speaker…
▽ More
In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring back-ends are commonly used, namely cosine scoring and PLDA. We have recently proposed PSDA, an analog to PLDA that uses Von Mises-Fisher distributions instead of Gaussians. In this paper, we present toroidal PSDA (T-PSDA). It extends PSDA with the ability to model within and between-speaker variabilities in toroidal submanifolds of the hypersphere. Like PLDA and PSDA, the model allows closed-form scoring and closed-form EM updates for training. On VoxCeleb, we find T-PSDA accuracy on par with cosine scoring, while PLDA accuracy is inferior. On NIST SRE'21 we find that T-PSDA gives large accuracy gains compared to both cosine scoring and PLDA.
△ Less
Submitted 27 October, 2022;
originally announced October 2022.
-
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings
Authors:
Niko Brümmer,
Albert Swart,
Ladislav Mošner,
Anna Silnova,
Oldřich Plchot,
Themos Stafylakis,
Lukáš Burget
Abstract:
In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used, namely cosine scoring or PLDA. Both have advantages and disadvantages, depending on the context. Cosine scoring follows naturally from the spherical geometry, but for PLDA the blessing is mixed -- length normalization Gaussianizes the between-speaker distribution,…
▽ More
In speaker recognition, where speech segments are mapped to embeddings on the unit hypersphere, two scoring backends are commonly used, namely cosine scoring or PLDA. Both have advantages and disadvantages, depending on the context. Cosine scoring follows naturally from the spherical geometry, but for PLDA the blessing is mixed -- length normalization Gaussianizes the between-speaker distribution, but violates the assumption of a speaker-independent within-speaker distribution. We propose PSDA, an analogue to PLDA that uses Von Mises-Fisher distributions on the hypersphere for both within and between-class distributions. We show how the self-conjugacy of this distribution gives closed-form likelihood-ratio scores, making it a drop-in replacement for PLDA at scoring time. All kinds of trials can be scored, including single-enroll and multi-enroll verification, as well as more complex likelihood-ratios that could be used in clustering and diarization. Learning is done via an EM-algorithm with closed-form updates. We explain the model and present some first experiments.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
Analyzing speaker verification embedding extractors and back-ends under language and channel mismatch
Authors:
Anna Silnova,
Themos Stafylakis,
Ladislav Mosner,
Oldrich Plchot,
Johan Rohdin,
Pavel Matejka,
Lukas Burget,
Ondrej Glembek,
Niko Brummer
Abstract:
In this paper, we analyze the behavior and performance of speaker embeddings and the back-end scoring model under domain and language mismatch. We present our findings regarding ResNet-based speaker embedding architectures and show that reduced temporal stride yields improved performance. We then consider a PLDA back-end and show how a combination of small speaker subspace, language-dependent PLDA…
▽ More
In this paper, we analyze the behavior and performance of speaker embeddings and the back-end scoring model under domain and language mismatch. We present our findings regarding ResNet-based speaker embedding architectures and show that reduced temporal stride yields improved performance. We then consider a PLDA back-end and show how a combination of small speaker subspace, language-dependent PLDA mixture, and nuisance-attribute projection can have a drastic impact on the performance of the system. Besides, we present an efficient way of scoring and fusing class posterior logit vectors recently shown to perform well for speaker verification task. The experiments are performed using the NIST SRE 2021 setup.
△ Less
Submitted 19 March, 2022;
originally announced March 2022.
-
How to use KL-divergence to construct conjugate priors, with well-defined non-informative limits, for the multivariate Gaussian
Authors:
Niko Brümmer
Abstract:
The Wishart distribution is the standard conjugate prior for the precision of the multivariate Gaussian likelihood, when the mean is known -- while the normal-Wishart can be used when the mean is also unknown. It is however not so obvious how to assign values to the hyperparameters of these distributions. In particular, when forming non-informative limits of these distributions, the shape (or degr…
▽ More
The Wishart distribution is the standard conjugate prior for the precision of the multivariate Gaussian likelihood, when the mean is known -- while the normal-Wishart can be used when the mean is also unknown. It is however not so obvious how to assign values to the hyperparameters of these distributions. In particular, when forming non-informative limits of these distributions, the shape (or degrees of freedom) parameter of the Wishart must be handled with care. The intuitive solution of directly interpreting the shape as a pseudocount and letting it go to zero, as proposed by some authors, violates the restrictions on the shape parameter. We show how to use the scaled KL-divergence between multivariate Gaussians as an energy function to construct Wishart and normal-Wishart conjugate priors. When used as informative priors, the salient feature of these distributions is the mode, while the KL scaling factor serves as the pseudocount. The scale factor can be taken down to the limit at zero, to form non-informative priors that do not violate the restrictions on the Wishart shape parameter. This limit is non-informative in the sense that the posterior mode is identical to the maximum likelihood estimate of the parameters of the Gaussian.
△ Less
Submitted 16 September, 2021; v1 submitted 15 September, 2021;
originally announced September 2021.
-
The Phonexia VoxCeleb Speaker Recognition Challenge 2021 System Description
Authors:
Josef Slavíček,
Albert Swart,
Michal Klčo,
Niko Brümmer
Abstract:
We describe the Phonexia submission for the VoxCeleb Speaker Recognition Challenge 2021 (VoxSRC-21) in the unsupervised speaker verification track. Our solution was very similar to IDLab's winning submission for VoxSRC-20. An embedding extractor was bootstrapped using momentum contrastive learning, with input augmentations as the only source of supervision. This was followed by several iterations…
▽ More
We describe the Phonexia submission for the VoxCeleb Speaker Recognition Challenge 2021 (VoxSRC-21) in the unsupervised speaker verification track. Our solution was very similar to IDLab's winning submission for VoxSRC-20. An embedding extractor was bootstrapped using momentum contrastive learning, with input augmentations as the only source of supervision. This was followed by several iterations of clustering to assign pseudo-speaker labels that were then used for supervised embedding extractor training. Finally, a score fusion was done, by averaging the zt-normalized cosine scores of five different embedding extractors. We briefly also describe unsuccessful solutions involving i-vectors instead of DNN embeddings and PLDA instead of cosine scoring.
△ Less
Submitted 8 September, 2021; v1 submitted 5 September, 2021;
originally announced September 2021.
-
Out of a hundred trials, how many errors does your speaker verifier make?
Authors:
Niko Brümmer,
Luciana Ferrer,
Albert Swart
Abstract:
Out of a hundred trials, how many errors does your speaker verifier make? For the user this is an important, practical question, but researchers and vendors typically sidestep it and supply instead the conditional error-rates that are given by the ROC/DET curve. We posit that the user's question is answered by the Bayes error-rate. We present a tutorial to show how to compute the error-rate that r…
▽ More
Out of a hundred trials, how many errors does your speaker verifier make? For the user this is an important, practical question, but researchers and vendors typically sidestep it and supply instead the conditional error-rates that are given by the ROC/DET curve. We posit that the user's question is answered by the Bayes error-rate. We present a tutorial to show how to compute the error-rate that results when making Bayes decisions with calibrated likelihood ratios, supplied by the verifier, and an hypothesis prior, supplied by the user. For perfect calibration, the Bayes error-rate is upper bounded by min(EER,P,1-P), where EER is the equal-error-rate and P, 1-P are the prior probabilities of the competing hypotheses. The EER represents the accuracy of the verifier, while min(P,1-P) represents the hardness of the classification problem. We further show how the Bayes error-rate can be computed also for non-perfect calibration and how to generalize from error-rate to expected cost. We offer some criticism of decisions made by direct score thresholding. Finally, we demonstrate by analyzing error-rates of the recently published DCA-PLDA speaker verifier.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
A Speaker Verification Backend with Robust Performance across Conditions
Authors:
Luciana Ferrer,
Mitchell McLaren,
Niko Brummer
Abstract:
In this paper, we address the problem of speaker verification in conditions unseen or unknown during development. A standard method for speaker verification consists of extracting speaker embeddings with a deep neural network and processing them through a backend composed of probabilistic linear discriminant analysis (PLDA) and global logistic regression score calibration. This method is known to…
▽ More
In this paper, we address the problem of speaker verification in conditions unseen or unknown during development. A standard method for speaker verification consists of extracting speaker embeddings with a deep neural network and processing them through a backend composed of probabilistic linear discriminant analysis (PLDA) and global logistic regression score calibration. This method is known to result in systems that work poorly on conditions different from those used to train the calibration model. We propose to modify the standard backend, introducing an adaptive calibrator that uses duration and other automatically extracted side-information to adapt to the conditions of the inputs. The backend is trained discriminatively to optimize binary cross-entropy. When trained on a number of diverse datasets that are labeled only with respect to speaker, the proposed backend consistently and, in some cases, dramatically improves calibration, compared to the standard PLDA approach, on a number of held-out datasets, some of which are markedly different from the training data. Discrimination performance is also consistently improved. We show that joint training of the PLDA and the adaptive calibrator is essential -- the same benefits cannot be achieved when freezing PLDA and fine-tuning the calibrator. To our knowledge, the results in this paper are the first evidence in the literature that it is possible to develop a speaker verification system with robust out-of-the-box performance on a large variety of conditions.
△ Less
Submitted 17 August, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
Probabilistic embeddings for speaker diarization
Authors:
Anna Silnova,
Niko Brümmer,
Johan Rohdin,
Themos Stafylakis,
Lukáš Burget
Abstract:
Speaker embeddings (x-vectors) extracted from very short segments of speech have recently been shown to give competitive performance in speaker diarization. We generalize this recipe by extracting from each speech segment, in parallel with the x-vector, also a diagonal precision matrix, thus providing a path for the propagation of information about the quality of the speech segment into a PLDA sco…
▽ More
Speaker embeddings (x-vectors) extracted from very short segments of speech have recently been shown to give competitive performance in speaker diarization. We generalize this recipe by extracting from each speech segment, in parallel with the x-vector, also a diagonal precision matrix, thus providing a path for the propagation of information about the quality of the speech segment into a PLDA scoring backend. These precisions quantify the uncertainty about what the values of the embeddings might have been if they had been extracted from high quality speech segments. The proposed probabilistic embeddings (x-vectors with precisions) are interfaced with the PLDA model by treating the x-vectors as hidden variables and marginalizing them out. We apply the proposed probabilistic embeddings as input to an agglomerative hierarchical clustering (AHC) algorithm to do diarization in the DIHARD'19 evaluation set. We compute the full PLDA likelihood 'by the book' for each clustering hypothesis that is considered by AHC. We do joint discriminative training of the PLDA parameters and of the probabilistic x-vector extractor. We demonstrate accuracy gains relative to a baseline AHC algorithm, applied to traditional xvectors (without uncertainty), and which uses averaging of binary log-likelihood-ratios, rather than by-the-book scoring.
△ Less
Submitted 6 November, 2020; v1 submitted 6 April, 2020;
originally announced April 2020.
-
Large-Scale Speaker Diarization of Radio Broadcast Archives
Authors:
Emre Yılmaz,
Adem Derinel,
Zhou Kun,
Henk van den Heuvel,
Niko Brummer,
Haizhou Li,
David A. van Leeuwen
Abstract:
This paper describes our initial efforts to build a large-scale speaker diarization (SD) and identification system on a recently digitized radio broadcast archive from the Netherlands which has more than 6500 audio tapes with 3000 hours of Frisian-Dutch speech recorded between 1950-2016. The employed large-scale diarization scheme involves two stages: (1) tape-level speaker diarization providing p…
▽ More
This paper describes our initial efforts to build a large-scale speaker diarization (SD) and identification system on a recently digitized radio broadcast archive from the Netherlands which has more than 6500 audio tapes with 3000 hours of Frisian-Dutch speech recorded between 1950-2016. The employed large-scale diarization scheme involves two stages: (1) tape-level speaker diarization providing pseudo-speaker identities and (2) speaker linking to relate pseudo-speakers appearing in multiple tapes. Having access to the speaker models of several frequently appearing speakers from the previously collected FAME! speech corpus, we further perform speaker identification by linking these known speakers to the pseudo-speakers identified at the first stage. In this work, we present a recently created longitudinal and multilingual SD corpus designed for large-scale SD research and evaluate the performance of a new speaker linking system using x-vectors with PLDA to quantify cross-tape speaker similarity on this corpus. The performance of this speaker linking system is evaluated on a small subset of the archive which is manually annotated with speaker information. The speaker linking performance reported on this subset (53 hours) and the whole archive (3000 hours) is compared to quantify the impact of scaling up in the amount of speech data.
△ Less
Submitted 28 June, 2019; v1 submitted 19 June, 2019;
originally announced June 2019.
-
Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors
Authors:
Anna Silnova,
Niko Brummer,
Daniel Garcia-Romero,
David Snyder,
Lukas Burget
Abstract:
The standard state-of-the-art backend for text-independent speaker recognizers that use i-vectors or x-vectors, is Gaussian PLDA (G-PLDA), assisted by a Gaussianization step involving length normalization. G-PLDA can be trained with both generative or discriminative methods. It has long been known that heavy-tailed PLDA (HT-PLDA), applied without length normalization, gives similar accuracy, but a…
▽ More
The standard state-of-the-art backend for text-independent speaker recognizers that use i-vectors or x-vectors, is Gaussian PLDA (G-PLDA), assisted by a Gaussianization step involving length normalization. G-PLDA can be trained with both generative or discriminative methods. It has long been known that heavy-tailed PLDA (HT-PLDA), applied without length normalization, gives similar accuracy, but at considerable extra computational cost. We have recently introduced a fast scoring algorithm for a discriminatively trained HT-PLDA backend. This paper extends that work by introducing a fast, variational Bayes, generative training algorithm. We compare old and new backends, with and without length-normalization, with i-vectors and x-vectors, on SRE'10, SRE'16 and SITW.
△ Less
Submitted 24 March, 2018;
originally announced March 2018.
-
Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model
Authors:
Niko Brummer,
Anna Silnova,
Lukas Burget,
Themos Stafylakis
Abstract:
Embeddings in machine learning are low-dimensional representations of complex input patterns, with the property that simple geometric operations like Euclidean distances and dot products can be used for classification and comparison tasks. The proposed meta-embeddings are special embeddings that live in more general inner product spaces. They are designed to propagate uncertainty to the final outp…
▽ More
Embeddings in machine learning are low-dimensional representations of complex input patterns, with the property that simple geometric operations like Euclidean distances and dot products can be used for classification and comparison tasks. The proposed meta-embeddings are special embeddings that live in more general inner product spaces. They are designed to propagate uncertainty to the final output in speaker recognition and similar applications. The familiar Gaussian PLDA model (GPLDA) can be re-formulated as an extractor for Gaussian meta-embeddings (GMEs), such that likelihood ratio scores are given by Hilbert space inner products between Gaussian likelihood functions. GMEs extracted by the GPLDA model have fixed precisions and do not propagate uncertainty. We show that a generalization to heavy-tailed PLDA gives GMEs with variable precisions, which do propagate uncertainty. Experiments on NIST SRE 2010 and 2016 show that the proposed method applied to i-vectors without length normalization is up to 20% more accurate than GPLDA applied to length-normalized ivectors.
△ Less
Submitted 27 February, 2018;
originally announced February 2018.
-
Language-depedent I-Vectors for LRE15
Authors:
Niko Brümmer,
Albert Swart
Abstract:
A standard recipe for spoken language recognition is to apply a Gaussian back-end to i-vectors. This ignores the uncertainty in the i-vector extraction, which could be important especially for short utterances. A recent paper by Cumani, Plchot and Fer proposes a solution to propagate that uncertainty into the backend. We propose an alternative method of propagating the uncertainty.
A standard recipe for spoken language recognition is to apply a Gaussian back-end to i-vectors. This ignores the uncertainty in the i-vector extraction, which could be important especially for short utterances. A recent paper by Cumani, Plchot and Fer proposes a solution to propagate that uncertainty into the backend. We propose an alternative method of propagating the uncertainty.
△ Less
Submitted 29 September, 2017;
originally announced October 2017.
-
A Generative Model for Score Normalization in Speaker Recognition
Authors:
Albert Swart,
Niko Brummer
Abstract:
We propose a theoretical framework for thinking about score normalization, which confirms that normalization is not needed under (admittedly fragile) ideal conditions. If, however, these conditions are not met, e.g. under data-set shift between training and runtime, our theory reveals dependencies between scores that could be exploited by strategies such as score normalization. Indeed, it has been…
▽ More
We propose a theoretical framework for thinking about score normalization, which confirms that normalization is not needed under (admittedly fragile) ideal conditions. If, however, these conditions are not met, e.g. under data-set shift between training and runtime, our theory reveals dependencies between scores that could be exploited by strategies such as score normalization. Indeed, it has been demonstrated over and over experimentally, that various ad-hoc score normalization recipes do work. We present a first attempt at using probability theory to design a generative score-space normalization model which gives similar improvements to ZT-norm on the text-dependent RSR 2015 database.
△ Less
Submitted 28 September, 2017;
originally announced September 2017.
-
Note on the equivalence of hierarchical variational models and auxiliary deep generative models
Authors:
Niko Brümmer
Abstract:
This note compares two recently published machine learning methods for constructing flexible, but tractable families of variational hidden-variable posteriors. The first method, called "hierarchical variational models" enriches the inference model with an extra variable, while the other, called "auxiliary deep generative models", enriches the generative model instead. We conclude that the two meth…
▽ More
This note compares two recently published machine learning methods for constructing flexible, but tractable families of variational hidden-variable posteriors. The first method, called "hierarchical variational models" enriches the inference model with an extra variable, while the other, called "auxiliary deep generative models", enriches the generative model instead. We conclude that the two methods are mathematically equivalent.
△ Less
Submitted 9 March, 2016; v1 submitted 8 March, 2016;
originally announced March 2016.
-
VB calibration to improve the interface between phone recognizer and i-vector extractor
Authors:
Niko Brümmer
Abstract:
The EM training algorithm of the classical i-vector extractor is often incorrectly described as a maximum-likelihood method. The i-vector model is however intractable: the likelihood itself and the hidden-variable posteriors needed for the EM algorithm cannot be computed in closed form. We show here that the classical i-vector extractor recipe is actually a mean-field variational Bayes (VB) recipe…
▽ More
The EM training algorithm of the classical i-vector extractor is often incorrectly described as a maximum-likelihood method. The i-vector model is however intractable: the likelihood itself and the hidden-variable posteriors needed for the EM algorithm cannot be computed in closed form. We show here that the classical i-vector extractor recipe is actually a mean-field variational Bayes (VB) recipe.
This theoretical VB interpretation turns out to be of further use, because it also offers an interpretation of the newer phonetic i-vector extractor recipe, thereby unifying the two flavours of extractor.
More importantly, the VB interpretation is also practically useful: it suggests ways of modifying existing i-vector extractors to make them more accurate. In particular, in existing methods, the approximate VB posterior for the GMM states is fixed, while only the parameters of the generative model are adapted. Here we explore the possibility of also mildly adjusting (calibrating) those posteriors, so that they better fit the generative model.
△ Less
Submitted 14 October, 2015; v1 submitted 12 October, 2015;
originally announced October 2015.
-
Constrained speaker linking
Authors:
David A. van Leeuwen,
Niko Brümmer
Abstract:
In this paper we study speaker linking (a.k.a.\ partitioning) given constraints of the distribution of speaker identities over speech recordings. Specifically, we show that the intractable partitioning problem becomes tractable when the constraints pre-partition the data in smaller cliques with non-overlap** speakers. The surprisingly common case where speakers in telephone conversations are kno…
▽ More
In this paper we study speaker linking (a.k.a.\ partitioning) given constraints of the distribution of speaker identities over speech recordings. Specifically, we show that the intractable partitioning problem becomes tractable when the constraints pre-partition the data in smaller cliques with non-overlap** speakers. The surprisingly common case where speakers in telephone conversations are known, but the assignment of channels to identities is unspecified, is treated in a Bayesian way. We show that for the Dutch CGN database, where this channel assignment task is at hand, a lightweight speaker recognition system can quite effectively solve the channel assignment problem, with 93% of the cliques solved. We further show that the posterior distribution over channel assignment configurations is well calibrated.
△ Less
Submitted 2 April, 2014; v1 submitted 26 March, 2014;
originally announced March 2014.
-
What is the `relevant population' in Bayesian forensic inference?
Authors:
Niko Brümmer,
Edward de Villiers
Abstract:
In works discussing the Bayesian paradigm for presenting forensic evidence in court, the concept of a `relevant population' is often mentioned, without a clear definition of what is meant, and without recommendations of how to select such populations. This note is to try to better understand this concept. Our analysis is intended to be general enough to be applicable to different forensic technolo…
▽ More
In works discussing the Bayesian paradigm for presenting forensic evidence in court, the concept of a `relevant population' is often mentioned, without a clear definition of what is meant, and without recommendations of how to select such populations. This note is to try to better understand this concept. Our analysis is intended to be general enough to be applicable to different forensic technologies and we shall consider both DNA profiling and speaker recognition as examples.
△ Less
Submitted 24 March, 2014;
originally announced March 2014.
-
Bayesian calibration for forensic evidence reporting
Authors:
Niko Brümmer,
Albert Swart
Abstract:
We introduce a Bayesian solution for the problem in forensic speaker recognition, where there may be very little background material for estimating score calibration parameters. We work within the Bayesian paradigm of evidence reporting and develop a principled probabilistic treatment of the problem, which results in a Bayesian likelihood-ratio as the vehicle for reporting weight of evidence. We s…
▽ More
We introduce a Bayesian solution for the problem in forensic speaker recognition, where there may be very little background material for estimating score calibration parameters. We work within the Bayesian paradigm of evidence reporting and develop a principled probabilistic treatment of the problem, which results in a Bayesian likelihood-ratio as the vehicle for reporting weight of evidence. We show in contrast, that reporting a likelihood-ratio distribution does not solve this problem. Our solution is experimentally exercised on a simulated forensic scenario, using NIST SRE'12 scores, which demonstrates a clear advantage for the proposed method compared to the traditional plugin calibration recipe.
△ Less
Submitted 10 June, 2014; v1 submitted 24 March, 2014;
originally announced March 2014.
-
A comparison of linear and non-linear calibrations for speaker recognition
Authors:
Niko Brümmer,
Albert Swart,
David van Leeuwen
Abstract:
In recent work on both generative and discriminative score to log-likelihood-ratio calibration, it was shown that linear transforms give good accuracy only for a limited range of operating points. Moreover, these methods required tailoring of the calibration training objective functions in order to target the desired region of best accuracy. Here, we generalize the linear recipes to non-linear one…
▽ More
In recent work on both generative and discriminative score to log-likelihood-ratio calibration, it was shown that linear transforms give good accuracy only for a limited range of operating points. Moreover, these methods required tailoring of the calibration training objective functions in order to target the desired region of best accuracy. Here, we generalize the linear recipes to non-linear ones. We experiment with a non-linear, non-parametric, discriminative PAV solution, as well as parametric, generative, maximum-likelihood solutions that use Gaussian, Student's T and normal-inverse-Gaussian score distributions. Experiments on NIST SRE'12 scores suggest that the non-linear methods provide wider ranges of optimal accuracy and can be trained without having to resort to objective function tailoring.
△ Less
Submitted 9 April, 2014; v1 submitted 11 February, 2014;
originally announced February 2014.
-
The EM algorithm and the Laplace Approximation
Authors:
Niko Brümmer
Abstract:
The Laplace approximation calls for the computation of second derivatives at the likelihood maximum. When the maximum is found by the EM-algorithm, there is a convenient way to compute these derivatives. The likelihood gradient can be obtained from the EM-auxiliary, while the Hessian can be obtained from this gradient with the Pearlmutter trick.
The Laplace approximation calls for the computation of second derivatives at the likelihood maximum. When the maximum is found by the EM-algorithm, there is a convenient way to compute these derivatives. The likelihood gradient can be obtained from the EM-auxiliary, while the Hessian can be obtained from this gradient with the Pearlmutter trick.
△ Less
Submitted 24 January, 2014;
originally announced January 2014.
-
Generative Modelling for Unsupervised Score Calibration
Authors:
Niko Brümmer,
Daniel Garcia-Romero
Abstract:
Score calibration enables automatic speaker recognizers to make cost-effective accept / reject decisions. Traditional calibration requires supervised data, which is an expensive resource. We propose a 2-component GMM for unsupervised calibration and demonstrate good performance relative to a supervised baseline on NIST SRE'10 and SRE'12. A Bayesian analysis demonstrates that the uncertainty associ…
▽ More
Score calibration enables automatic speaker recognizers to make cost-effective accept / reject decisions. Traditional calibration requires supervised data, which is an expensive resource. We propose a 2-component GMM for unsupervised calibration and demonstrate good performance relative to a supervised baseline on NIST SRE'10 and SRE'12. A Bayesian analysis demonstrates that the uncertainty associated with the unsupervised calibration parameter estimates is surprisingly small.
△ Less
Submitted 14 February, 2014; v1 submitted 4 November, 2013;
originally announced November 2013.
-
Likelihood-ratio calibration using prior-weighted proper scoring rules
Authors:
Niko Brümmer,
George Doddington
Abstract:
Prior-weighted logistic regression has become a standard tool for calibration in speaker recognition. Logistic regression is the optimization of the expected value of the logarithmic scoring rule. We generalize this via a parametric family of proper scoring rules. Our theoretical analysis shows how different members of this family induce different relative weightings over a spectrum of application…
▽ More
Prior-weighted logistic regression has become a standard tool for calibration in speaker recognition. Logistic regression is the optimization of the expected value of the logarithmic scoring rule. We generalize this via a parametric family of proper scoring rules. Our theoretical analysis shows how different members of this family induce different relative weightings over a spectrum of applications of which the decision thresholds range from low to high. Special attention is given to the interaction between prior weighting and proper scoring rule parameters. Experiments on NIST SRE'12 suggest that for applications with low false-alarm rate requirements, scoring rules tailored to emphasize higher score thresholds may give better accuracy than logistic regression.
△ Less
Submitted 30 July, 2013;
originally announced July 2013.
-
Generative, Fully Bayesian, Gaussian, Openset Pattern Classifier
Authors:
Niko Brummer
Abstract:
This report works out the details of a closed-form, fully Bayesian, multiclass, openset, generative pattern classifier using multivariate Gaussian likelihoods, with conjugate priors. The generative model has a common within-class covariance, which is proportional to the between-class covariance in the conjugate prior. The scalar proportionality constant is the only plugin parameter. All other mode…
▽ More
This report works out the details of a closed-form, fully Bayesian, multiclass, openset, generative pattern classifier using multivariate Gaussian likelihoods, with conjugate priors. The generative model has a common within-class covariance, which is proportional to the between-class covariance in the conjugate prior. The scalar proportionality constant is the only plugin parameter. All other model parameters are intergated out in closed form. An expression is given for the model evidence, which can be used to make plugin estimates for the proportionality constant. Pattern recognition is done via the predictive likeihoods of classes for which training data is available, as well as a predicitve likelihood for any as yet unseen class.
△ Less
Submitted 24 July, 2013; v1 submitted 23 July, 2013;
originally announced July 2013.
-
Tutorial for Bayesian forensic likelihood ratio
Authors:
Niko Brümmer
Abstract:
In the Bayesian paradigm for presenting forensic evidence to court, it is recommended that the weight of the evidence be summarized as a likelihood ratio (LR) between two opposing hypotheses of how the evidence could have been produced. Such LRs are necessarily based on probabilistic models, the parameters of which may be uncertain. It has been suggested by some authors that the value of the LR, b…
▽ More
In the Bayesian paradigm for presenting forensic evidence to court, it is recommended that the weight of the evidence be summarized as a likelihood ratio (LR) between two opposing hypotheses of how the evidence could have been produced. Such LRs are necessarily based on probabilistic models, the parameters of which may be uncertain. It has been suggested by some authors that the value of the LR, being a function of the model parameters should therefore also be considered uncertain and that this uncertainty should be communicated to the court. In this tutorial, we consider a simple example of a 'fully Bayesian' solution, where model uncertainty is integrated out to produce a value for the LR which is not uncertain. We show that this solution agrees with common sense. In particular, the LR magnitude is a function of the amount of data that is available to estimate the model parameters.
△ Less
Submitted 12 April, 2013;
originally announced April 2013.
-
The BOSARIS Toolkit: Theory, Algorithms and Code for Surviving the New DCF
Authors:
Niko Brümmer,
Edward de Villiers
Abstract:
The change of two orders of magnitude in the 'new DCF' of NIST's SRE'10, relative to the 'old DCF' evaluation criterion, posed a difficult challenge for participants and evaluator alike. Initially, participants were at a loss as to how to calibrate their systems, while the evaluator underestimated the required number of evaluation trials. After the fact, it is now obvious that both calibration and…
▽ More
The change of two orders of magnitude in the 'new DCF' of NIST's SRE'10, relative to the 'old DCF' evaluation criterion, posed a difficult challenge for participants and evaluator alike. Initially, participants were at a loss as to how to calibrate their systems, while the evaluator underestimated the required number of evaluation trials. After the fact, it is now obvious that both calibration and evaluation require very large sets of trials. This poses the challenges of (i) how to decide what number of trials is enough, and (ii) how to process such large data sets with reasonable memory and CPU requirements. After SRE'10, at the BOSARIS Workshop, we built solutions to these problems into the freely available BOSARIS Toolkit. This paper explains the principles and algorithms behind this toolkit. The main contributions of the toolkit are: 1. The Normalized Bayes Error-Rate Plot, which analyses likelihood- ratio calibration over a wide range of DCF operating points. These plots also help in judging the adequacy of the sizes of calibration and evaluation databases. 2. Efficient algorithms to compute DCF and minDCF for large score files, over the range of operating points required by these plots. 3. A new score file format, which facilitates working with very large trial lists. 4. A faster logistic regression optimizer for fusion and calibration. 5. A principled way to define EER (equal error rate), which is of practical interest when the absolute error count is small.
△ Less
Submitted 10 April, 2013;
originally announced April 2013.
-
The PAV algorithm optimizes binary proper scoring rules
Authors:
Niko Brummer,
Johan du Preez
Abstract:
There has been much recent interest in application of the pool-adjacent-violators (PAV) algorithm for the purpose of calibrating the probabilistic outputs of automatic pattern recognition and machine learning algorithms. Special cost functions, known as proper scoring rules form natural objective functions to judge the goodness of such calibration. We show that for binary pattern classifiers, the…
▽ More
There has been much recent interest in application of the pool-adjacent-violators (PAV) algorithm for the purpose of calibrating the probabilistic outputs of automatic pattern recognition and machine learning algorithms. Special cost functions, known as proper scoring rules form natural objective functions to judge the goodness of such calibration. We show that for binary pattern classifiers, the non-parametric optimization of calibration, subject to a monotonicity constraint, can be solved by PAV and that this solution is optimal for all regular binary proper scoring rules. This extends previous results which were limited to convex binary proper scoring rules. We further show that this result holds not only for calibration of probabilities, but also for calibration of log-likelihood-ratios, in which case optimality holds independently of the prior probabilities of the pattern classes.
△ Less
Submitted 8 April, 2013;
originally announced April 2013.
-
The distribution of calibrated likelihood-ratios in speaker recognition
Authors:
David A. van Leeuwen,
Niko Brümmer
Abstract:
This paper studies properties of the score distributions of calibrated log-likelihood-ratios that are used in automatic speaker recognition. We derive the essential condition for calibration that the log likelihood ratio of the log-likelihood-ratio is the log-likelihood-ratio. We then investigate what the consequence of this condition is to the probability density functions (PDFs) of the log-likel…
▽ More
This paper studies properties of the score distributions of calibrated log-likelihood-ratios that are used in automatic speaker recognition. We derive the essential condition for calibration that the log likelihood ratio of the log-likelihood-ratio is the log-likelihood-ratio. We then investigate what the consequence of this condition is to the probability density functions (PDFs) of the log-likelihood-ratio score. We show that if the PDF of the non-target distribution is Gaussian, then the PDF of the target distribution must be Gaussian as well. The means and variances of these two PDFs are interrelated, and determined completely by the discrimination performance of the recognizer characterized by the equal error rate. These relations allow for a new way of computing the offset and scaling parameters for linear calibration, and we derive closed-form expressions for these and show that for modern i-vector systems with PLDA scoring this leads to good calibration, comparable to traditional logistic regression, over a wide range of system performance.
△ Less
Submitted 8 June, 2013; v1 submitted 3 April, 2013;
originally announced April 2013.
-
Production of Z0 bosons in elastic and quasi-elastic ep collisions at HERA
Authors:
ZEUS collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
O. Arslan,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens
, et al. (278 additional authors not shown)
Abstract:
The production of Z0 bosons in the reaction ep -> eZ0p*, where p* stands for a proton or a low-mass nucleon resonance, has been studied in ep collisions at HERA using the ZEUS detector. The analysis is based on a data sample collected between 1996 and 2007, amounting to 496 pb-1 of integrated luminosity. The Z0 was measured in the hadronic decay mode. The elasticity of the events was ensured by a…
▽ More
The production of Z0 bosons in the reaction ep -> eZ0p*, where p* stands for a proton or a low-mass nucleon resonance, has been studied in ep collisions at HERA using the ZEUS detector. The analysis is based on a data sample collected between 1996 and 2007, amounting to 496 pb-1 of integrated luminosity. The Z0 was measured in the hadronic decay mode. The elasticity of the events was ensured by a cut on eta_max < 3.0, where eta_max is the maximum pseudorapidity of energy deposits in the calorimeter defined with respect to the proton beam direction. A signal was observed at the Z0 mass. The cross section of the reaction ep -> eZ0p* was measured to be sigma(ep -> eZ0p*) = 0.13 +/- 0.06 (stat.) +/- 0.01 (syst.) pb, in agreement with the Standard Model prediction of 0.16 pb. This is the first measurement of Z0 production in ep collisions.
△ Less
Submitted 19 October, 2012;
originally announced October 2012.
-
Measurement of high-Q2 neutral current deep inelastic e+p scattering cross sections with a longitudinally polarised positron beam at HERA
Authors:
ZEUS Collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
O. Arslan,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens
, et al. (278 additional authors not shown)
Abstract:
Measurements of neutral current cross sections for deep inelastic scattering in e+p collisions at HERA with a longitudinally polarised positron beam are presented. The single-differential cross-sections d(sigma)/dQ2, d(sigma)/dx and d(sigma)/dy and the reduced cross-section were measured in the kinematic region Q2 > 185 GeV2 and y < 0.9, where Q2 is the four-momentum transfer squared, x the Bjorke…
▽ More
Measurements of neutral current cross sections for deep inelastic scattering in e+p collisions at HERA with a longitudinally polarised positron beam are presented. The single-differential cross-sections d(sigma)/dQ2, d(sigma)/dx and d(sigma)/dy and the reduced cross-section were measured in the kinematic region Q2 > 185 GeV2 and y < 0.9, where Q2 is the four-momentum transfer squared, x the Bjorken scaling variable, and y the inelasticity of the interaction. The measurements were performed separately for positively and negatively polarised positron beams. The measurements are based on an integrated luminosity of 135.5 pb-1 collected with the ZEUS detector in 2006 and 2007 at a centre-of-mass energy of 318 GeV. The structure functions F3 and F3(gamma)Z were determined by combining the e+p results presented in this paper with previously published e-p neutral current results. The asymmetry parameter A+ is used to demonstrate the parity violation predicted in electroweak interactions. The measurements are well described by the predictions of the Standard Model.
△ Less
Submitted 12 May, 2014; v1 submitted 30 August, 2012;
originally announced August 2012.
-
Inclusive-jet photoproduction at HERA and determination of alphas
Authors:
ZEUS Collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens,
L. Bellagamba
, et al. (281 additional authors not shown)
Abstract:
Inclusive-jet cross sections have been measured in the reaction ep->e+jet+X for photon virtuality Q2 < 1 GeV2 and gamma-p centre-of-mass energies in the region 142 < W(gamma-p) < 293 GeV with the ZEUS detector at HERA using an integrated luminosity of 300 pb-1. Jets were identified using the kT, anti-kT or SIScone jet algorithms in the laboratory frame. Single-differential cross sections are prese…
▽ More
Inclusive-jet cross sections have been measured in the reaction ep->e+jet+X for photon virtuality Q2 < 1 GeV2 and gamma-p centre-of-mass energies in the region 142 < W(gamma-p) < 293 GeV with the ZEUS detector at HERA using an integrated luminosity of 300 pb-1. Jets were identified using the kT, anti-kT or SIScone jet algorithms in the laboratory frame. Single-differential cross sections are presented as functions of the jet transverse energy, ETjet, and pseudorapidity, etajet, for jets with ETjet > 17 GeV and -1 < etajet < 2.5. In addition, measurements of double-differential inclusive-jet cross sections are presented as functions of ETjet in different regions of etajet. Next-to-leading-order QCD calculations give a good description of the measurements, except for jets with low ETjet and high etajet. The influence of non-perturbative effects not related to hadronisation was studied. Measurements of the ratios of cross sections using different jet algorithms are also presented; the measured ratios are well described by calculations including up to O(alphas2) terms. Values of alphas(Mz) were extracted from the measurements and the energy-scale dependence of the coupling was determined. The value of alphas(Mz) extracted from the measurements based on the kT jet algorithm is alphas(Mz) = 0.1206 +0.0023 -0.0022 (exp.) +0.0042 -0.0035 (th.); the results from the anti-kT and SIScone algorithms are compatible with this value and have a similar precision.
△ Less
Submitted 28 May, 2012;
originally announced May 2012.
-
Exclusive electroproduction of two pions at HERA
Authors:
ZEUS collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
D. Ashery,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens
, et al. (280 additional authors not shown)
Abstract:
The exclusive electroproduction of two pions in the mass range 0.4 < Mππ < 2.5 GeV has been studied with the ZEUS detector at HERA using an integrated luminosity of 82 pb-1. The analysis was carried out in the kinematic range of 2 < Q2 < 80 GeV2, 32 < W < 180 GeV and |t| < 0.6 GeV2, where Q2 is the photon virtuality, W is the photon-proton centre-of-mass energy and t is the squared four-momentum t…
▽ More
The exclusive electroproduction of two pions in the mass range 0.4 < Mππ < 2.5 GeV has been studied with the ZEUS detector at HERA using an integrated luminosity of 82 pb-1. The analysis was carried out in the kinematic range of 2 < Q2 < 80 GeV2, 32 < W < 180 GeV and |t| < 0.6 GeV2, where Q2 is the photon virtuality, W is the photon-proton centre-of-mass energy and t is the squared four-momentum transfer at the proton vertex. The two-pion invariant-mass distribution is interpreted in terms of the pion electromagnetic form factor, |F(Mππ)|, assuming that the studied mass range includes the contributions of the ρ, ρ' and ρ" vector-meson states. The masses and widths of the resonances were obtained and the Q2 dependence of the cross-section ratios σ(ρ' \rightarrow ππ)/σ(ρ) and σ(ρ" \rightarrow ππ)/σ(ρ) was extracted. The pion form factor obtained in the present analysis is compared to that obtained in e+e- \rightarrow π+π-.
△ Less
Submitted 21 November, 2011;
originally announced November 2011.
-
Search for single-top production in ep collisions at HERA
Authors:
ZEUS Collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U . Behrens,
L. Bellagamba
, et al. (278 additional authors not shown)
Abstract:
A search for single-top production, $ep \rightarrow etX$, has been performed with the ZEUS detector at HERA using data corresponding to an integrated luminosity of $0.37\fbi$. No evidence for top production was found, consistent with the expectation from the Standard Model. Limits were computed for single-top production via flavour changing neutral current transitions. The result was combined with…
▽ More
A search for single-top production, $ep \rightarrow etX$, has been performed with the ZEUS detector at HERA using data corresponding to an integrated luminosity of $0.37\fbi$. No evidence for top production was found, consistent with the expectation from the Standard Model. Limits were computed for single-top production via flavour changing neutral current transitions. The result was combined with a previous ZEUS result yielding a total luminosity of 0.50fb-1. A 95% credibility level upper limit of 0.13 pb was obtained for the cross section at the centre-of-mass energy of $\sqrt{s}=315\gev$.
△ Less
Submitted 4 February, 2012; v1 submitted 16 November, 2011;
originally announced November 2011.
-
Scaled momentum distributions for K0s and Lambda/bar Lambda in DIS at HERA
Authors:
ZEUS Collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens,
L. Bellagamba
, et al. (278 additional authors not shown)
Abstract:
Scaled momentum distributions for the strange hadrons K0s and Lambda/bar Lambda were measured in deep inelastic ep scattering with the ZEUS detector at HERA using an integrated luminosity of 330 pb-1. The evolution of these distributions with the photon virtuality, Q2, was studied in the kinematic region 10<Q2<40000 GeV2 and 0.001<x<0.75, where x is the Bjorken scaling variable. Clear scaling viol…
▽ More
Scaled momentum distributions for the strange hadrons K0s and Lambda/bar Lambda were measured in deep inelastic ep scattering with the ZEUS detector at HERA using an integrated luminosity of 330 pb-1. The evolution of these distributions with the photon virtuality, Q2, was studied in the kinematic region 10<Q2<40000 GeV2 and 0.001<x<0.75, where x is the Bjorken scaling variable. Clear scaling violations are observed. Predictions based on different approaches to fragmentation were compared to the measurements. Leading-logarithm parton-shower Monte Carlo calculations interfaced to the Lund string fragmentation model describe the data reasonably well in the whole range measured. Next-to-leading-order QCD calculations based on fragmentation functions, FFs, extracted from e+e- data alone, fail to describe the measurements. The calculations based on FFs extracted from a global analysis including e+e-, ep and pp data give an improved description. The measurements presented in this paper have the potential to further constrain the FFs of quarks, anti-quarks and gluons yielding K0s and Lambda/bar Lambda strange hadrons.
△ Less
Submitted 19 April, 2012; v1 submitted 15 November, 2011;
originally announced November 2011.
-
Measurement of the t dependence in exclusive photoproduction of Upsilon(1S) mesons at HERA
Authors:
ZEUS collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens,
L. Bellagamba
, et al. (278 additional authors not shown)
Abstract:
The exclusive photoproduction reaction gamma p -> Upsilon(1S) p has been studied with the ZEUS detector in ep collisions at HERA using an integrated luminosity of 468 pb^-1. The measurement covers the kinematic range 60<W<220 GeV and Q^2<1 GeV^2, where W is the photon-proton centre-of-mass energy and Q^2 is the photon virtuality. The exponential slope, b, of the t dependence of the cross section,…
▽ More
The exclusive photoproduction reaction gamma p -> Upsilon(1S) p has been studied with the ZEUS detector in ep collisions at HERA using an integrated luminosity of 468 pb^-1. The measurement covers the kinematic range 60<W<220 GeV and Q^2<1 GeV^2, where W is the photon-proton centre-of-mass energy and Q^2 is the photon virtuality. The exponential slope, b, of the t dependence of the cross section, where t is the squared four-momentum transfer at the proton vertex, has been measured, yielding b = 4.3 +2.0 -1.3 (stat.) +0.5 -0.6 (syst.) GeV^-2. This constitutes the first measurement of the t dependence of the gamma p -> Upsilon(1S) p cross section.
△ Less
Submitted 4 February, 2012; v1 submitted 9 November, 2011;
originally announced November 2011.
-
Measurement of heavy-quark jet photoproduction at HERA
Authors:
ZEUS Collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens,
L. Bellagamba
, et al. (287 additional authors not shown)
Abstract:
Photoproduction of beauty and charm quarks in events with at least two jets has been measured with the ZEUS detector at HERA using an integrated luminosity of 133 $pb^{-1}$. The fractions of jets containing b and c quarks were extracted using the invariant mass of charged tracks associated with secondary vertices and the decay-length significance of these vertices. Differential cross sections as a…
▽ More
Photoproduction of beauty and charm quarks in events with at least two jets has been measured with the ZEUS detector at HERA using an integrated luminosity of 133 $pb^{-1}$. The fractions of jets containing b and c quarks were extracted using the invariant mass of charged tracks associated with secondary vertices and the decay-length significance of these vertices. Differential cross sections as a function of jet transverse momentum, $p_{T}^{\text{jet}}$, and pseudorapidity, $η^{\text{jet}}$, were measured. The data are compared with previous measurements and are well described by next-to-leading-order QCD predictions.
△ Less
Submitted 28 April, 2011;
originally announced April 2011.
-
Measurement of beauty production in deep inelastic scattering at HERA using decays into electrons
Authors:
ZEUS collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
N. Bartosik,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens,
L. Bellagamba
, et al. (289 additional authors not shown)
Abstract:
The production of beauty quarks in ep interactions has been studied with the ZEUS detector at HERA for exchanged four-momentum squared Q^2 > 10 GeV^2, using an integrated luminosity of 363 pb^{-1}. The beauty events were identified using electrons from semileptonic b decays with a transverse momentum 0.9 < p_T^e < 8 GeV and pseudorapidity |eta^e| < 1.5. Cross sections for beauty production were me…
▽ More
The production of beauty quarks in ep interactions has been studied with the ZEUS detector at HERA for exchanged four-momentum squared Q^2 > 10 GeV^2, using an integrated luminosity of 363 pb^{-1}. The beauty events were identified using electrons from semileptonic b decays with a transverse momentum 0.9 < p_T^e < 8 GeV and pseudorapidity |eta^e| < 1.5. Cross sections for beauty production were measured and compared with next-to-leading-order QCD calculations. The beauty contribution to the proton structure function F_2 was extracted from the double-differential cross section as a function of Bjorken-x and Q^2.
△ Less
Submitted 10 March, 2011; v1 submitted 19 January, 2011;
originally announced January 2011.
-
Measurement of beauty production in DIS and F_2^bbbar extraction at ZEUS
Authors:
ZEUS collaboration,
H. Abramowicz,
I. Abt,
L. Adamczyk,
M. Adamus,
R. Aggarwal,
S. Antonelli,
P. Antonioli,
A. Antonov,
M. Arneodo,
V. Aushev,
Y. Aushev,
O. Bachynska,
A. Bamberger,
A. N. Barakbaev,
G. Barbagli,
G. Bari,
F. Barreiro,
D. Bartsch,
M. Basile,
O. Behnke,
J. Behr,
U. Behrens,
L. Bellagamba,
A. Bertolin
, et al. (289 additional authors not shown)
Abstract:
Beauty production in deep inelastic scattering with events in which a muon and a jet are observed in the final state has been measured with the ZEUS detector at HERA using an integrated luminosity of 114 pb^-1. The fraction of events with beauty quarks in the data was determined using the distribution of the transverse momentum of the muon relative to the jet. The cross section for beauty producti…
▽ More
Beauty production in deep inelastic scattering with events in which a muon and a jet are observed in the final state has been measured with the ZEUS detector at HERA using an integrated luminosity of 114 pb^-1. The fraction of events with beauty quarks in the data was determined using the distribution of the transverse momentum of the muon relative to the jet. The cross section for beauty production was measured in the kinematic range of photon virtuality, Q^2 > 2 Gev^2, and inelasticity, 0.05 < y < 0.7, with the requirement of a muon and a jet. Total and differential cross sections are presented and compared to QCD predictions. The beauty contribution to the structure function F_2 was extracted and is compared to theoretical predictions.
△ Less
Submitted 19 May, 2010;
originally announced May 2010.
-
Experimental momentum spectra of identified hadrons at $e^+e^-$ colliders compared to QCD calculations
Authors:
Nichol Brummer,
Imperial College,
London
Abstract:
Experimental data on the shape of hadronic momentum spectra are compared to theoretical predictions in the context of calculations in the Modified Leading Log Approximation (MLLA), under the assumption of Local Parton Hadron Duality (LPHD).
Considered are experimental measurements at $e^+e^-$-colliders of $ξ_p^*$, the position of the maximum in the distribution of $ξ_p=\log(1/x_p)$, where…
▽ More
Experimental data on the shape of hadronic momentum spectra are compared to theoretical predictions in the context of calculations in the Modified Leading Log Approximation (MLLA), under the assumption of Local Parton Hadron Duality (LPHD).
Considered are experimental measurements at $e^+e^-$-colliders of $ξ_p^*$, the position of the maximum in the distribution of $ξ_p=\log(1/x_p)$, where $x_p=p/p_{beam}$. The parameter $ξ_p^*$ is determined for various hadrons at various centre of mass energies. The dependence on the hadron type poses some interesting questions about the process of hadron-formation. The dependence of $ξ^*_p$ on the centre of mass energy is seen to be described adequately by perturbation theory. A quantitative check of LPHD + MLLA is possible by extracting a value of $α_s$ from an overall fit to the scaling behaviour of $ξ^*_p$.
△ Less
Submitted 28 March, 1995; v1 submitted 27 March, 1995;
originally announced March 1995.
-
An optimal method of moments to measure the charge asymmetry at the $Z^0$
Authors:
Nichol C. Brummer
Abstract:
Parity violation at LEP or SLC can be measured through the charge asymmetry. An optimal method of moments is developed here to measure this asymmetry, as well as similar asymmetries. This method is equivalent to the likelihood fit. It is simpler in use, as it gives analytical formulas for both the asymmetry and its statistical error. These formulas give the dependence of the accuracy on the expe…
▽ More
Parity violation at LEP or SLC can be measured through the charge asymmetry. An optimal method of moments is developed here to measure this asymmetry, as well as similar asymmetries. This method is equivalent to the likelihood fit. It is simpler in use, as it gives analytical formulas for both the asymmetry and its statistical error. These formulas give the dependence of the accuracy on the experimental angular acceptance explicitly.
△ Less
Submitted 11 May, 1994;
originally announced May 1994.
-
Experimental momentum spectra of identified hadrons in jets and the predictions from LPHD + MLLA
Authors:
Nichol. C. Brummer
Abstract:
Experimental data on the shape of hadronic momentum spectra are compared with theoretical predictions in the context of calculations in the Modified Leading Log Approximation (MLLA), under the assumption of Local Parton Hadron Duality (LPHD). Considered are experimental measurements at $e^+e^-$-colliders of $ξ_p^*$, the position of the maximum in the distribution of $ξ_p=\log(1/x_p)$, where…
▽ More
Experimental data on the shape of hadronic momentum spectra are compared with theoretical predictions in the context of calculations in the Modified Leading Log Approximation (MLLA), under the assumption of Local Parton Hadron Duality (LPHD). Considered are experimental measurements at $e^+e^-$-colliders of $ξ_p^*$, the position of the maximum in the distribution of $ξ_p=\log(1/x_p)$, where $x_p=p/p_{beam}$. The parameter $ξ_p^*$ is determined for various hadrons at various centre of mass energies. It is interesting to look at the dependence of $ξ^*_p$ on the hadron type. This is used to study the influence of the hadron type on the cut-off scale $Q_0$ in the parton shower development. The dependence of $ξ^*_p$ on the centre of mass energy is seen to be described adequately by perturbation theory. The approach is made quantitative by extracting a value of $α_s(m_Z)$ from an overall fit to the scaling behaviour of $ξ^*_p$.
△ Less
Submitted 11 May, 1994;
originally announced May 1994.