-
Learning text-to-video retrieval from image captioning
Authors:
Lucas Ventura,
Cordelia Schmid,
Gül Varol
Abstract:
We describe a protocol to study text-to-video retrieval training with unlabeled videos, where we assume (i) no access to labels for any videos, i.e., no access to the set of ground-truth captions, but (ii) access to labeled images in the form of text. Using image expert models is a realistic scenario given that annotating images is cheaper therefore scalable, in contrast to expensive video labelin…
▽ More
We describe a protocol to study text-to-video retrieval training with unlabeled videos, where we assume (i) no access to labels for any videos, i.e., no access to the set of ground-truth captions, but (ii) access to labeled images in the form of text. Using image expert models is a realistic scenario given that annotating images is cheaper therefore scalable, in contrast to expensive video labeling schemes. Recently, zero-shot image experts such as CLIP have established a new strong baseline for video understanding tasks. In this paper, we make use of this progress and instantiate the image experts from two types of models: a text-to-image retrieval model to provide an initial backbone, and image captioning models to provide supervision signal into unlabeled videos. We show that automatically labeling video frames with image captioning allows text-to-video retrieval training. This process adapts the features to the target domain at no manual annotation cost, consequently outperforming the strong zero-shot CLIP baseline. During training, we sample captions from multiple video frames that best match the visual content, and perform a temporal pooling over frame representations by scoring frames according to their relevance to each caption. We conduct extensive ablations to provide insights and demonstrate the effectiveness of this simple framework by outperforming the CLIP zero-shot baselines on text-to-video retrieval on three standard datasets, namely ActivityNet, MSR-VTT, and MSVD.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
CoVR: Learning Composed Video Retrieval from Web Video Captions
Authors:
Lucas Ventura,
Antoine Yang,
Cordelia Schmid,
Gül Varol
Abstract:
Composed Image Retrieval (CoIR) has recently gained popularity as a task that considers both text and image queries together, to search for relevant images in a database. Most CoIR approaches require manually annotated datasets, comprising image-text-image triplets, where the text describes a modification from the query image to the target image. However, manual curation of CoIR triplets is expens…
▽ More
Composed Image Retrieval (CoIR) has recently gained popularity as a task that considers both text and image queries together, to search for relevant images in a database. Most CoIR approaches require manually annotated datasets, comprising image-text-image triplets, where the text describes a modification from the query image to the target image. However, manual curation of CoIR triplets is expensive and prevents scalability. In this work, we instead propose a scalable automatic dataset creation methodology that generates triplets given video-caption pairs, while also expanding the scope of the task to include composed video retrieval (CoVR). To this end, we mine paired videos with a similar caption from a large database, and leverage a large language model to generate the corresponding modification text. Applying this methodology to the extensive WebVid2M collection, we automatically construct our WebVid-CoVR dataset, resulting in 1.6 million triplets. Moreover, we introduce a new benchmark for CoVR with a manually annotated evaluation set, along with baseline results. Our experiments further demonstrate that training a CoVR model on our dataset effectively transfers to CoIR, leading to improved state-of-the-art performance in the zero-shot setup on both the CIRR and FashionIQ benchmarks. Our code, datasets, and models are publicly available at https://imagine.enpc.fr/~ventural/covr.
△ Less
Submitted 30 May, 2024; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Learning When to Say Goodbye: What Should be the Shelf Life of an Indicator of Compromise?
Authors:
Breno Tostes,
Leonardo Ventura,
Enrico Lovat,
Matheus Martins,
Daniel Sadoc Menasché
Abstract:
Indicators of Compromise (IOCs), such as IP addresses, file hashes, and domain names associated with known malware or attacks, are cornerstones of cybersecurity, serving to identify malicious activity on a network. In this work, we leverage real data to compare different parameterizations of IOC aging models. Our dataset comprises traffic at a real environment for more than 1 year. Among our trace…
▽ More
Indicators of Compromise (IOCs), such as IP addresses, file hashes, and domain names associated with known malware or attacks, are cornerstones of cybersecurity, serving to identify malicious activity on a network. In this work, we leverage real data to compare different parameterizations of IOC aging models. Our dataset comprises traffic at a real environment for more than 1 year. Among our trace-driven findings, we determine thresholds for the ratio between miss over monitoring costs such that the system benefits from storing IOCs for a finite time-to-live (TTL) before eviction. To the best of our knowledge, this is the first real world evaluation of thresholds related to IOC aging, paving the way towards realistic IOC decaying models.
△ Less
Submitted 31 July, 2023;
originally announced July 2023.
-
On approximate robust confidence distributions
Authors:
Elena Bortolato,
Laura Ventura
Abstract:
A confidence distribution is a complete tool for making frequentist inference for a parameter of interest $ψ$ based on an assumed parametric model. Indeed, it allows to reach point estimates, to assess their precision, to set up tests along with measures of evidence for statements of the type "$ψ> ψ_0$" or "$ψ_1 \leq ψ\leq ψ_2$", to derive confidence intervals, comparing the parameter of interest…
▽ More
A confidence distribution is a complete tool for making frequentist inference for a parameter of interest $ψ$ based on an assumed parametric model. Indeed, it allows to reach point estimates, to assess their precision, to set up tests along with measures of evidence for statements of the type "$ψ> ψ_0$" or "$ψ_1 \leq ψ\leq ψ_2$", to derive confidence intervals, comparing the parameter of interest with other parameters from other studies, etc.
The aim of this contribution is to discuss robust confidence distributions derived from unbiased $M-$estimating functions, which provide robust inference for $ψ$ when the assumed distribution is just an approximate parametric model or in the presence of deviant values in the observed data. Paralleling likelihood-based results and extending results available for robust scoring rules, we first illustrate how robust confidence distributions can be derived from the asymptotic theory of robust pivotal quantities. Then, we discuss the derivation of robust confidence distributions via simulation methods. An application and a simulation study are illustrated in the context of non-inferiority testing, in which null hypotheses of the form $H_0: ψ\leq ψ_0$ are of interest.
△ Less
Submitted 19 December, 2022;
originally announced December 2022.
-
Robust confidence distributions from proper scoring rules
Authors:
Erlis Ruli,
Laura Ventura,
Monica Musio
Abstract:
A confidence distribution is a distribution for a parameter of interest based on a parametric statistical model. As such, it serves the same purpose for frequentist statisticians as a posterior distribution for Bayesians, since it allows to reach point estimates, to assess their precision, to set up tests along with measures of evidence, to derive confidence intervals, comparing the parameter of i…
▽ More
A confidence distribution is a distribution for a parameter of interest based on a parametric statistical model. As such, it serves the same purpose for frequentist statisticians as a posterior distribution for Bayesians, since it allows to reach point estimates, to assess their precision, to set up tests along with measures of evidence, to derive confidence intervals, comparing the parameter of interest with other parameters from other studies, etc. A general recipe for deriving confidence distributions is based on classical pivotal quantities and their exact or approximate distributions.
However, in the presence of model misspecifications or outlying values in the observed data, classical pivotal quantities, and thus confidence distributions, may be inaccurate. The aim of this paper is to discuss the derivation and application of robust confidence distributions. In particular, we discuss a general approach based on the Tsallis scoring rule in order to compute a robust confidence distribution. Examples and simulation results are discussed for some problems often encountered in practice, such as the two-sample heteroschedastic comparison, the receiver operating characteristic curves and regression models.
△ Less
Submitted 2 September, 2021;
originally announced September 2021.
-
A new Bayesian discrepancy measure
Authors:
Francesco Bertolino,
Mara Manca,
Monica Musio,
Walter Racugno,
Laura Ventura
Abstract:
The aim of this article is to make a contribution to the Bayesian procedure of testing precise hypotheses for parametric models. For this purpose, we define the Bayesian Discrepancy Measure that allows one to evaluate the suitability of a given hypothesis with respect to the available information (prior law and data). To summarise this information, the posterior median is employed, allowing a simp…
▽ More
The aim of this article is to make a contribution to the Bayesian procedure of testing precise hypotheses for parametric models. For this purpose, we define the Bayesian Discrepancy Measure that allows one to evaluate the suitability of a given hypothesis with respect to the available information (prior law and data). To summarise this information, the posterior median is employed, allowing a simple assessment of the discrepancy with a fixed hypothesis. The Bayesian Discrepancy Measure assesses the compatibility of a single hypothesis with the observed data, as opposed to the more common comparative approach where a hypothesis is rejected in favour of a competing hypothesis. The proposed measure of evidence has properties of consistency and invariance. After presenting the definition of the measure for a parameter of interest, both in the absence and in the presence of nuisance parameters, we illustrate some examples showing its conceptual and interpretative simplicity. Finally, we compare the BDT with the Full Bayesian Significance Test, a well-known Bayesian testing procedure for sharp hypotheses.
△ Less
Submitted 18 November, 2022; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Spontaneous breaking of the Peccei-Quinn symmetry during warm inflation
Authors:
João G. Rosa,
Luís B. Ventura
Abstract:
We show that, for values of the axion decay constant parametrically close to the GUT scale, the Peccei-Quinn phase transition may naturally occur during warm inflation. This results from interactions between the Peccei-Quinn scalar field and the ambient thermal bath, which is sustained by the inflaton field through dissipative effects. It is therefore possible for the axion field to appear as a dy…
▽ More
We show that, for values of the axion decay constant parametrically close to the GUT scale, the Peccei-Quinn phase transition may naturally occur during warm inflation. This results from interactions between the Peccei-Quinn scalar field and the ambient thermal bath, which is sustained by the inflaton field through dissipative effects. It is therefore possible for the axion field to appear as a dynamical degree of freedom only after observable CMB scales have become super-horizon, thus avoiding the large-scale axion isocurvature perturbations that typically plague such models. This nevertheless yields a nearly scale-invariant spectrum of axion isocurvature perturbations on small scales, with a density contrast of up to a few percent, which may have a significant impact on the formation of gravitationally-bound axion structures such as mini-clusters.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
-
Can Everybody Sign Now? Exploring Sign Language Video Generation from 2D Poses
Authors:
Lucas Ventura,
Amanda Duarte,
Xavier Giro-i-Nieto
Abstract:
Recent work have addressed the generation of human poses represented by 2D/3D coordinates of human joints for sign language. We use the state of the art in Deep Learning for motion transfer and evaluate them on How2Sign, an American Sign Language dataset, to generate videos of signers performing sign language given a 2D pose skeleton. We evaluate the generated videos quantitatively and qualitative…
▽ More
Recent work have addressed the generation of human poses represented by 2D/3D coordinates of human joints for sign language. We use the state of the art in Deep Learning for motion transfer and evaluate them on How2Sign, an American Sign Language dataset, to generate videos of signers performing sign language given a 2D pose skeleton. We evaluate the generated videos quantitatively and qualitatively showing that the current models are not enough to generated adequate videos for Sign Language due to lack of detail in hands.
△ Less
Submitted 4 January, 2021; v1 submitted 20 December, 2020;
originally announced December 2020.
-
Warm Inflation, Neutrinos and Dark matter: a minimal extension of the Standard Model
Authors:
Miguel Levy,
João G. Rosa,
Luis B. Ventura
Abstract:
We show that warm inflation can be realized within a minimal extension of the Standard Model with three right-handed neutrinos, three complex scalars and a gauged lepton/B-L U(1) symmetry. This simple model can address all the shortcomings of the Standard Model that are not related to fine-tuning, within general relativity, with distinctive experimental signatures that can be probed in the near fu…
▽ More
We show that warm inflation can be realized within a minimal extension of the Standard Model with three right-handed neutrinos, three complex scalars and a gauged lepton/B-L U(1) symmetry. This simple model can address all the shortcomings of the Standard Model that are not related to fine-tuning, within general relativity, with distinctive experimental signatures that can be probed in the near future. The inflaton field emerges from the collective breaking of the U(1) symmetry, and interacts with two of the right-handed neutrinos, sustaining a high-temperature radiation bath during inflation. The discrete interchange symmetry of the model protects the scalar potential against large thermal corrections and leads to a stable inflaton remnant at late times which can account for dark matter. Consistency of the model and agreement with Cosmic Microwave Background observations naturally yield light neutrino masses below 0.1 eV, while thermal leptogenesis occurs naturally after a smooth exit from inflation into the radiation era.
△ Less
Submitted 7 December, 2020;
originally announced December 2020.
-
Bending of light in axion backgrounds
Authors:
Jamie I. McDonald,
Luís B. Ventura
Abstract:
In this work we examine refraction of light by computing full solutions to axion electrodynamics. We also allow for the possibility of an additional plasma component. We then specialise to wavelengths which are small compared to background scales to determine if refraction can be described by geometric optics. We also allow for the possibility of an additional plasma component. In the absence of p…
▽ More
In this work we examine refraction of light by computing full solutions to axion electrodynamics. We also allow for the possibility of an additional plasma component. We then specialise to wavelengths which are small compared to background scales to determine if refraction can be described by geometric optics. We also allow for the possibility of an additional plasma component. In the absence of plasma, for small incidence angles relative to the optical axis, axion electrodynamics and geometric optics are in good agreement, with refraction occurring at $\mathcal{O}(g_{a γγ}^2)$. However, for rays which lie far from the optical axis, the agreement with geometric optics breaks down and the dominant refraction requires a full wave-optical treatment, occurring at $\mathcal{O}(g_{a γγ})$. In the presence of sufficiently large plasma masses, the wave-like nature of light becomes suppressed and geometric optics is in good agreement with the full theory for all rays. Our results therefore suggest the necessity of a more comprehensive study of lensing and ray-tracing in axion backgrounds, including a full account of the novel $\mathcal{O}(g_{a γγ})$ wave-optical contribution to refraction.
△ Less
Submitted 29 August, 2020;
originally announced August 2020.
-
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language
Authors:
Amanda Duarte,
Shruti Palaskar,
Lucas Ventura,
Deepti Ghadiyaram,
Kenneth DeHaan,
Florian Metze,
Jordi Torres,
Xavier Giro-i-Nieto
Abstract:
One of the factors that have hindered progress in the areas of sign language recognition, translation, and production is the absence of large annotated datasets. Towards this end, we introduce How2Sign, a multimodal and multiview continuous American Sign Language (ASL) dataset, consisting of a parallel corpus of more than 80 hours of sign language videos and a set of corresponding modalities inclu…
▽ More
One of the factors that have hindered progress in the areas of sign language recognition, translation, and production is the absence of large annotated datasets. Towards this end, we introduce How2Sign, a multimodal and multiview continuous American Sign Language (ASL) dataset, consisting of a parallel corpus of more than 80 hours of sign language videos and a set of corresponding modalities including speech, English transcripts, and depth. A three-hour subset was further recorded in the Panoptic studio enabling detailed 3D pose estimation. To evaluate the potential of How2Sign for real-world impact, we conduct a study with ASL signers and show that synthesized videos using our dataset can indeed be understood. The study further gives insights on challenges that computer vision should address in order to make progress in this field.
Dataset website: http://how2sign.github.io/
△ Less
Submitted 1 April, 2021; v1 submitted 18 August, 2020;
originally announced August 2020.
-
Classification of cancer pathology reports: a large-scale comparative study
Authors:
Stefano Martina,
Leonardo Ventura,
Paolo Frasconi
Abstract:
We report about the application of state-of-the-art deep learning techniques to the automatic and interpretable assignment of ICD-O3 topography and morphology codes to free-text cancer reports. We present results on a large dataset (more than 80 000 labeled and 1 500 000 unlabeled anonymized reports written in Italian and collected from hospitals in Tuscany over more than a decade) and with a larg…
▽ More
We report about the application of state-of-the-art deep learning techniques to the automatic and interpretable assignment of ICD-O3 topography and morphology codes to free-text cancer reports. We present results on a large dataset (more than 80 000 labeled and 1 500 000 unlabeled anonymized reports written in Italian and collected from hospitals in Tuscany over more than a decade) and with a large number of classes (134 morphological classes and 61 topographical classes). We compare alternative architectures in terms of prediction accuracy and interpretability and show that our best model achieves a multiclass accuracy of 90.3% on topography site assignment and 84.8% on morphology type assignment. We found that in this context hierarchical models are not better than flat models and that an element-wise maximum aggregator is slightly better than attentive models on site classification. Moreover, the maximum aggregator offers a way to interpret the classification process.
△ Less
Submitted 29 June, 2020;
originally announced June 2020.
-
Robust inference for nonlinear regression models from the Tsallis score: application to Covid-19 contagion in Italy
Authors:
Paolo Girardi,
Luca Greco,
Valentina Mameli,
Monica Musio,
Walter Racugno,
Erlis Ruli,
Laura Ventura
Abstract:
We discuss an approach for fitting robust nonlinear regression models, which can be employed to model and predict the contagion dynamics of the Covid-19 in Italy. The focus is on the analysis of epidemic data using robust dose-response curves, but the functionality is applicable to arbitrary nonlinear regression models.
We discuss an approach for fitting robust nonlinear regression models, which can be employed to model and predict the contagion dynamics of the Covid-19 in Italy. The focus is on the analysis of epidemic data using robust dose-response curves, but the functionality is applicable to arbitrary nonlinear regression models.
△ Less
Submitted 9 April, 2020; v1 submitted 7 April, 2020;
originally announced April 2020.
-
Optical properties of dynamical axion backgrounds
Authors:
Jamie I. McDonald,
Luís B. Ventura
Abstract:
We discuss spectral distortions, time delays and refraction of light in an axion or axion-plasma background. This involves solving the full set of geodesic equations associated to the system of Hamiltonian optics, allowing us to self-consistently take into account the evolution of the momentum, frequency and position of photons. We support our arguments with analytic approximations and full numeri…
▽ More
We discuss spectral distortions, time delays and refraction of light in an axion or axion-plasma background. This involves solving the full set of geodesic equations associated to the system of Hamiltonian optics, allowing us to self-consistently take into account the evolution of the momentum, frequency and position of photons. We support our arguments with analytic approximations and full numerical solutions. Remarkably, the introduction of a plasma enhances the sensitivity to axion-induced birefringence, allowing these effects to occur at linear order in the axion-photon coupling even when the axion background is not present at either the emission or detection points. This suggests a general enhancement of axion-induced birefringence when the background refractive index is different from one.
△ Less
Submitted 22 November, 2019;
originally announced November 2019.
-
Warm Little Inflaton becomes Dark Energy
Authors:
João G. Rosa,
Luís B. Ventura
Abstract:
We present a model where the inflaton field behaves like quintessence at late times, generating the present phase of accelerated expansion. This is achieved within the framework of warm inflation, in particular the Warm Little Inflaton scenario, where the underlying symmetries guarantee a successful inflationary period in a warm regime sustained by dissipative effects without significant backreact…
▽ More
We present a model where the inflaton field behaves like quintessence at late times, generating the present phase of accelerated expansion. This is achieved within the framework of warm inflation, in particular the Warm Little Inflaton scenario, where the underlying symmetries guarantee a successful inflationary period in a warm regime sustained by dissipative effects without significant backreaction on the scalar potential. This yields a smooth transition into a radiation-dominated epoch, at which point dissipative effects naturally shut down as the temperature drops below the mass of the fermions directly coupled to the inflaton. The post-inflationary dynamics is then analogous to a thawing quintessence scenario, with no kination phase at the end of inflation. Observational signatures of this scenario include the modified consistency relation between the tensor-to-scalar ratio and tensor spectral index typical of warm inflation models, the variation of the dark energy equation of state at low redshifts characteristic of thawing quintessence scenarios, and correlated dark energy isocurvature perturbations.
△ Less
Submitted 15 July, 2019; v1 submitted 27 June, 2019;
originally announced June 2019.
-
next-gen-scraPy: Extracting NFL Tracking Data from Images to Evaluate Quarterbacks and Pass Defenses
Authors:
Sarah Mallepalle,
Ron Yurko,
Konstantinos Pelechrinis,
Samuel L. Ventura
Abstract:
The NFL collects detailed tracking data capturing the location of all players and the ball during each play. Although the raw form of this data is not publicly available, the NFL releases a set of aggregated statistics via their Next Gen Stats (NGS) platform. They also provide charts showing the locations of pass attempts and outcomes for individual quarterbacks. Our work aims to partially close t…
▽ More
The NFL collects detailed tracking data capturing the location of all players and the ball during each play. Although the raw form of this data is not publicly available, the NFL releases a set of aggregated statistics via their Next Gen Stats (NGS) platform. They also provide charts showing the locations of pass attempts and outcomes for individual quarterbacks. Our work aims to partially close the gap between what data is available privately (to NFL teams) and publicly, and our contribution is twofold. First, we introduce an image processing tool designed specifically for extracting the raw data from the NGS pass charts. We extract the pass outcome, coordinates, and other metadata. Second, we analyze the resulting dataset, examining the spatial tendencies and performances of individual quarterbacks and defenses. We use a generalized additive model for completion percentages by field location. We introduce a Naive Bayes approach for estimating the 2-D completion percentage surfaces of individual teams and quarterbacks, and we provide a one-number summary, completion percentage above expectation (CPAE), for evaluating quarterbacks and team defenses. We find that our pass location data closely matches the NFL's tracking data, and that our CPAE metric closely matches the NFL's proprietary CPAE metric.
△ Less
Submitted 5 December, 2019; v1 submitted 7 June, 2019;
originally announced June 2019.
-
Going Deep: Models for Continuous-Time Within-Play Valuation of Game Outcomes in American Football with Tracking Data
Authors:
Ronald Yurko,
Francesca Matano,
Lee F. Richardson,
Nicholas Granered,
Taylor Pospisil,
Konstantinos Pelechrinis,
Samuel L. Ventura
Abstract:
Continuous-time assessments of game outcomes in sports have become increasingly common in the last decade. In American football, only discrete-time estimates of play value were possible, since the most advanced public football datasets were recorded at the play-by-play level. While measures such as expected points and win probability are useful for evaluating football plays and game situations, th…
▽ More
Continuous-time assessments of game outcomes in sports have become increasingly common in the last decade. In American football, only discrete-time estimates of play value were possible, since the most advanced public football datasets were recorded at the play-by-play level. While measures such as expected points and win probability are useful for evaluating football plays and game situations, there has been no research into how these values change throughout the course of a play. In this work, we make two main contributions: First, we introduce a general framework for continuous-time within-play valuation in the National Football League using player-tracking data. Our modular framework incorporates several modular sub-models, to easily incorporate recent work involving player tracking data in football. Second, we use a long short-term memory recurrent neural network to construct a ball-carrier model to estimate how many yards the ball-carrier is expected to gain from their current position, conditional on the locations and trajectories of the ball-carrier, their teammates and opponents. Additionally, we demonstrate an extension with conditional density estimation so that the expectation of any measure of play value can be calculated in continuous-time, which was never before possible at such a granular level.
△ Less
Submitted 12 November, 2019; v1 submitted 4 June, 2019;
originally announced June 2019.
-
Warm Little Inflaton becomes Cold Dark Matter
Authors:
Joao G. Rosa,
Luis B. Ventura
Abstract:
We present a model where the inflaton can naturally account for all the dark matter in the Universe within the warm inflation paradigm. In particular, we show that the symmetries of the Warm Little Inflaton scenario (i) avoid large thermal and radiative corrections to the scalar potential, (ii) allow for sufficiently strong dissipative effects to sustain a radiation bath during inflation that beco…
▽ More
We present a model where the inflaton can naturally account for all the dark matter in the Universe within the warm inflation paradigm. In particular, we show that the symmetries of the Warm Little Inflaton scenario (i) avoid large thermal and radiative corrections to the scalar potential, (ii) allow for sufficiently strong dissipative effects to sustain a radiation bath during inflation that becomes dominant at the end of the slow-roll regime, and (iii) enable a stable inflaton remnant in the post-inflationary epochs. The latter behaves as dark radiation until parametrically before matter-radiation equality, leading to a non-negligible contribution to the effective number of relativistic degrees of freedom during nucleosynthesis, becoming the dominant cold dark matter component in the Universe for inflaton masses in the $10^{-4}-10^{-1}$ eV range. Cold dark matter isocurvature perturbations, anti-correlated with the main adiabatic component, provide a smoking gun for this scenario that can be tested in the near future.
△ Less
Submitted 8 May, 2019; v1 submitted 13 November, 2018;
originally announced November 2018.
-
Objective Bayesian inference with proper scoring rules
Authors:
Federica Giummolè,
Valentina Mameli,
Erlis Ruli,
Laura Ventura
Abstract:
Standard Bayesian analyses can be difficult to perform when the full likelihood, and consequently the full posterior distribution, is too complex and difficult to specify or if robustness with respect to data or to model misspecifications is required. In these situations, we suggest to resort to a posterior distribution for the parameter of interest based on proper scoring rules. Scoring rules are…
▽ More
Standard Bayesian analyses can be difficult to perform when the full likelihood, and consequently the full posterior distribution, is too complex and difficult to specify or if robustness with respect to data or to model misspecifications is required. In these situations, we suggest to resort to a posterior distribution for the parameter of interest based on proper scoring rules. Scoring rules are loss functions designed to measure the quality of a probability distribution for a random variable, given its observed value. Important examples are the Tsallis score and the Hyvärinen score, which allow us to deal with model misspecifications or with complex models. Also the full and the composite likelihoods are both special instances of scoring rules.
The aim of this paper is twofold. Firstly, we discuss the use of scoring rules in the Bayes formula in order to compute a posterior distribution, named SR-posterior distribution, and we derive its asymptotic normality. Secondly, we propose a procedure for building default priors for the unknown parameter of interest that can be used to update the information provided by the scoring rule in the SR-posterior distribution. In particular, a reference prior is obtained by maximizing the average $α-$divergence from the SR-posterior distribution. For $0 \leq |α|<1$, the result is a Jeffreys-type prior that is proportional to the square root of the determinant of the Godambe information matrix associated to the scoring rule. Some examples are discussed.
△ Less
Submitted 6 January, 2019; v1 submitted 29 November, 2017;
originally announced November 2017.
-
Robust approximate Bayesian inference
Authors:
Erlis Ruli,
Nicola Sartori,
Laura Ventura
Abstract:
We discuss an approach for deriving robust posterior distributions from $M$-estimating functions using Approximate Bayesian Computation (ABC) methods. In particular, we use $M$-estimating functions to construct suitable summary statistics in ABC algorithms. The theoretical properties of the robust posterior distributions are discussed. Special attention is given to the application of the method to…
▽ More
We discuss an approach for deriving robust posterior distributions from $M$-estimating functions using Approximate Bayesian Computation (ABC) methods. In particular, we use $M$-estimating functions to construct suitable summary statistics in ABC algorithms. The theoretical properties of the robust posterior distributions are discussed. Special attention is given to the application of the method to linear mixed models. Simulation results and an application to a clinical study demonstrate the usefulness of the method. An R implementation is also provided in the robustBLME package.
△ Less
Submitted 12 June, 2019; v1 submitted 6 June, 2017;
originally announced June 2017.
-
SPEW: Synthetic Populations and Ecosystems of the World
Authors:
Shannon Gallagher,
Lee Richardson,
Samuel L. Ventura,
William F. Eddy
Abstract:
Agent-based models (ABMs) simulate interactions between autonomous agents in constrained environments over time. ABMs are often used for modeling the spread of infectious diseases. In order to simulate disease outbreaks or other phenomena, ABMs rely on "synthetic ecosystems," or information about agents and their environments that is representative of the real world. Previous approaches for genera…
▽ More
Agent-based models (ABMs) simulate interactions between autonomous agents in constrained environments over time. ABMs are often used for modeling the spread of infectious diseases. In order to simulate disease outbreaks or other phenomena, ABMs rely on "synthetic ecosystems," or information about agents and their environments that is representative of the real world. Previous approaches for generating synthetic ecosystems have some limitations: they are not open-source, cannot be adapted to new or updated input data sources, and do not allow for alternative methods for sampling agent characteristics and locations. We introduce a general framework for generating Synthetic Populations and Ecosystems of the World (SPEW), implemented as an open-source R package. SPEW allows researchers to choose from a variety of sampling methods for agent characteristics and locations when generating synthetic ecosystems for any geographic region. SPEW can produce synthetic ecosystems for any agent (e.g. humans, mosquitoes, etc), provided that appropriate data is available. We analyze the accuracy and computational efficiency of SPEW given different sampling methods for agent characteristics and locations and provide a suite of diagnostics to screen our synthetic ecosystems. SPEW has generated over five billion human agents across approximately 100,000 geographic regions in about 70 countries, available online.
△ Less
Submitted 9 January, 2017;
originally announced January 2017.
-
Improved Laplace Approximation for Marginal Likelihoods
Authors:
Erlis Ruli,
Nicola Sartori,
Laura Ventura
Abstract:
Statistical applications often involve the calculation of intractable multidimensional integrals. The Laplace formula is widely used to approximate such integrals. However, in high-dimensional or small sample size problems, the shape of the integrand function may be far from that of the Gaussian density, and thus the standard Laplace approximation can be inaccurate. We propose an improved Laplace…
▽ More
Statistical applications often involve the calculation of intractable multidimensional integrals. The Laplace formula is widely used to approximate such integrals. However, in high-dimensional or small sample size problems, the shape of the integrand function may be far from that of the Gaussian density, and thus the standard Laplace approximation can be inaccurate. We propose an improved Laplace approximation that reduces the asymptotic error of the standard Laplace formula by one order of magnitude, thus leading to third-order accuracy. We also show, by means of practical examples of various complexity, that the proposed method is extremely accurate, even in high dimensions, improving over the standard Laplace formula. Such examples also demonstrate that the accuracy of the proposed method is comparable with that of other existing methods, which are computationally more demanding. An R implementation of the improved Laplace approximation is also provided through the R package iLaplace available on CRAN.
△ Less
Submitted 29 December, 2016; v1 submitted 23 February, 2015;
originally announced February 2015.
-
Fine-structure constant constraints on Bekenstein-type models
Authors:
P. M. M. Leal,
C. J. A. P. Martins,
L. B. Ventura
Abstract:
Astrophysical tests of the stability of dimensionless fundamental couplings, such as the fine-structure constant $α$, are an area of much increased recent activity, following some indications of possible spacetime variations at the few parts per million level. Here we obtain updated constraints on the Bekenstein-Sandvik-Barrow-Magueijo model, which is arguably the simplest model allowing for $α$ v…
▽ More
Astrophysical tests of the stability of dimensionless fundamental couplings, such as the fine-structure constant $α$, are an area of much increased recent activity, following some indications of possible spacetime variations at the few parts per million level. Here we obtain updated constraints on the Bekenstein-Sandvik-Barrow-Magueijo model, which is arguably the simplest model allowing for $α$ variations. Recent accurate spectroscopic measurements allow us to improve previous constraints by about an order of magnitude. We briefly comment on the dependence of the results on the data sample, as well as on the improvements expected from future facilities.
△ Less
Submitted 15 July, 2014;
originally announced July 2014.
-
A Comparison of Blocking Methods for Record Linkage
Authors:
Rebecca C. Steorts,
Samuel L. Ventura,
Mauricio Sadinle,
Stephen E. Fienberg
Abstract:
Record linkage seeks to merge databases and to remove duplicates when unique identifiers are not available. Most approaches use blocking techniques to reduce the computational complexity associated with record linkage. We review traditional blocking techniques, which typically partition the records according to a set of field attributes, and consider two variants of a method known as locality sens…
▽ More
Record linkage seeks to merge databases and to remove duplicates when unique identifiers are not available. Most approaches use blocking techniques to reduce the computational complexity associated with record linkage. We review traditional blocking techniques, which typically partition the records according to a set of field attributes, and consider two variants of a method known as locality sensitive hashing, sometimes referred to as "private blocking." We compare these approaches in terms of their recall, reduction ratio, and computational complexity. We evaluate these methods using different synthetic datafiles and conclude with a discussion of privacy-related issues.
△ Less
Submitted 11 July, 2014;
originally announced July 2014.
-
Minimum scoring rule inference
Authors:
Philip Dawid,
Monica Musio,
Laura Ventura
Abstract:
Proper scoring rules are methods for encouraging honest assessment of probability distributions. Just like likelihood, a proper scoring rule can be applied to supply an unbiased estimating equation for any statistical model, and the theory of such equations can be applied to understand the properties of the associated estimator. In this paper we develop some basic scoring rule estimation theory, a…
▽ More
Proper scoring rules are methods for encouraging honest assessment of probability distributions. Just like likelihood, a proper scoring rule can be applied to supply an unbiased estimating equation for any statistical model, and the theory of such equations can be applied to understand the properties of the associated estimator. In this paper we develop some basic scoring rule estimation theory, and explore robustness and interval estimation properties by means of theory and simulations.
△ Less
Submitted 16 March, 2014;
originally announced March 2014.
-
Approximate Bayesian Computation with composite score functions
Authors:
Erlis Ruli,
Nicola Sartori,
Laura Ventura
Abstract:
Both Approximate Bayesian Computation (ABC) and composite likelihood methods are useful for Bayesian and frequentist inference, respectively, when the likelihood function is intractable. We propose to use composite likelihood score functions as summary statistics in ABC in order to obtain accurate approximations to the posterior distribution. This is motivated by the use of the score function of t…
▽ More
Both Approximate Bayesian Computation (ABC) and composite likelihood methods are useful for Bayesian and frequentist inference, respectively, when the likelihood function is intractable. We propose to use composite likelihood score functions as summary statistics in ABC in order to obtain accurate approximations to the posterior distribution. This is motivated by the use of the score function of the full likelihood, and extended to general unbiased estimating functions in complex models. Moreover, we show that if the composite score is suitably standardised, the resulting ABC procedure is invariant to reparameterisations and automatically adjusts the curvature of the composite likelihood, and of the corresponding posterior distribution. The method is illustrated through examples with simulated data, and an application to modelling of spatial extreme rainfall data is discussed.
△ Less
Submitted 24 February, 2015; v1 submitted 28 November, 2013;
originally announced November 2013.
-
Trouble With The Curve: Improving MLB Pitch Classification
Authors:
Michael A. Pane,
Samuel L. Ventura,
Rebecca C. Steorts,
A. C. Thomas
Abstract:
The PITCHf/x database has allowed the statistical analysis of of Major League Baseball (MLB) to flourish since its introduction in late 2006. Using PITCHf/x, pitches have been classified by hand, requiring considerable effort, or using neural network clustering and classification, which is often difficult to interpret. To address these issues, we use model-based clustering with a multivariate Gaus…
▽ More
The PITCHf/x database has allowed the statistical analysis of of Major League Baseball (MLB) to flourish since its introduction in late 2006. Using PITCHf/x, pitches have been classified by hand, requiring considerable effort, or using neural network clustering and classification, which is often difficult to interpret. To address these issues, we use model-based clustering with a multivariate Gaussian mixture model and an appropriate adjustment factor as an alternative to current methods. Furthermore, we describe a new pitch classification algorithm based on our clustering approach to address the problems of pitch misclassification. We illustrate our methods for various pitchers from the PITCHf/x database that covers a wide variety of pitch types.
△ Less
Submitted 5 April, 2013;
originally announced April 2013.
-
A note on marginal posterior simulation via higher-order tail area approximations
Authors:
Erlis Ruli,
Nicola Sartori,
Laura Ventura
Abstract:
We explore the use of higher-order tail area approximations for Bayesian simulation. These approximations give rise to an alternative simulation scheme to MCMC for Bayesian computation of marginal posterior distributions for a scalar parameter of interest, in the presence of nuisance parameters. Its advantage over MCMC methods is that samples are drawn independently with lower computational time a…
▽ More
We explore the use of higher-order tail area approximations for Bayesian simulation. These approximations give rise to an alternative simulation scheme to MCMC for Bayesian computation of marginal posterior distributions for a scalar parameter of interest, in the presence of nuisance parameters. Its advantage over MCMC methods is that samples are drawn independently with lower computational time and the implementation requires only standard maximum likelihood routines. The method is illustrated by a genetic linkage model, a normal regression with censored data and a logistic regression model.
△ Less
Submitted 5 December, 2012;
originally announced December 2012.
-
Competing Process Hazard Function Models for Player Ratings in Ice Hockey
Authors:
A. C. Thomas,
Samuel L. Ventura,
Shane Jensen,
Stephen Ma
Abstract:
Evaluating the overall ability of players in the National Hockey League (NHL) is a difficult task. Existing methods such as the famous "plus/minus" statistic have many shortcomings. Standard linear regression methods work well when player substitutions are relatively uncommon and scoring events are relatively common, such as in basketball, but as neither of these conditions exists for hockey, we u…
▽ More
Evaluating the overall ability of players in the National Hockey League (NHL) is a difficult task. Existing methods such as the famous "plus/minus" statistic have many shortcomings. Standard linear regression methods work well when player substitutions are relatively uncommon and scoring events are relatively common, such as in basketball, but as neither of these conditions exists for hockey, we use an approach that embraces the unique characteristics of the sport. We model the scoring rate for each team as its own semi-Markov process, with hazard functions for each process that depend on the players on the ice. This method yields offensive and defensive player ability ratings which take into account quality of teammates and opponents, the game situation, and other desired factors, that themselves have a meaningful interpretation in terms of game outcomes. Additionally, since the number of parameters in this model can be quite large, we make use of two different shrinkage methods depending on the question of interest: full Bayesian hierarchical models that partially pool parameters according to player position, and penalized maximum likelihood estimation to select a smaller number of parameters that stand out as being substantially different from average. We apply the model to all five-on-five (full-strength) situations for games in five NHL seasons.
△ Less
Submitted 28 February, 2013; v1 submitted 3 August, 2012;
originally announced August 2012.