Showing 1–29 of 29 results for author: Ventura, L

  1. arXiv:2404.17498  [pdf, other]

    cs.CV

    Learning text-to-video retrieval from image captioning

    Authors: Lucas Ventura, Cordelia Schmid, Gül Varol

    Abstract: We describe a protocol to study text-to-video retrieval training with unlabeled videos, where we assume (i) no access to labels for any videos, i.e., no access to the set of ground-truth captions, but (ii) access to labeled images in the form of text. Using image expert models is a realistic scenario given that annotating images is cheaper therefore scalable, in contrast to expensive video labelin…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: A short version of this work appeared at CVPR 2023 Workshops. Project page: https://imagine.enpc.fr/~ventural/multicaps/

  2. CoVR: Learning Composed Video Retrieval from Web Video Captions

    Authors: Lucas Ventura, Antoine Yang, Cordelia Schmid, Gül Varol

    Abstract: Composed Image Retrieval (CoIR) has recently gained popularity as a task that considers both text and image queries together, to search for relevant images in a database. Most CoIR approaches require manually annotated datasets, comprising image-text-image triplets, where the text describes a modification from the query image to the target image. However, manual curation of CoIR triplets is expens…

    Submitted 30 May, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: AAAI 2024, Updated the results on CIRR with the correct evaluation. Project page: https://imagine.enpc.fr/~ventural/covr/

  3. arXiv:2307.16852  [pdf, other]

    cs.CR

    Learning When to Say Goodbye: What Should be the Shelf Life of an Indicator of Compromise?

    Authors: Breno Tostes, Leonardo Ventura, Enrico Lovat, Matheus Martins, Daniel Sadoc Menasché

    Abstract: Indicators of Compromise (IOCs), such as IP addresses, file hashes, and domain names associated with known malware or attacks, are cornerstones of cybersecurity, serving to identify malicious activity on a network. In this work, we leverage real data to compare different parameterizations of IOC aging models. Our dataset comprises traffic at a real environment for more than 1 year. Among our trace…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 2023 IEEE International Conference on Cyber Security and Resilience (IEEE CSR)

    ACM Class: K.6.5; D.4.6

  4. arXiv:2212.09552  [pdf, ps, other]

    stat.ME stat.AP stat.CO

    On approximate robust confidence distributions

    Authors: Elena Bortolato, Laura Ventura

    Abstract: A confidence distribution is a complete tool for making frequentist inference for a parameter of interest $ψ$ based on an assumed parametric model. Indeed, it allows to reach point estimates, to assess their precision, to set up tests along with measures of evidence for statements of the type "$ψ > ψ_0$" or "$ψ_1 \leq ψ \leq ψ_2$", to derive confidence intervals, comparing the parameter of interest…

    Submitted 19 December, 2022; originally announced December 2022.

  5. arXiv:2109.01219  [pdf, other]

    stat.ME

    Robust confidence distributions from proper scoring rules

    Authors: Erlis Ruli, Laura Ventura, Monica Musio

    Abstract: A confidence distribution is a distribution for a parameter of interest based on a parametric statistical model. As such, it serves the same purpose for frequentist statisticians as a posterior distribution for Bayesians, since it allows to reach point estimates, to assess their precision, to set up tests along with measures of evidence, to derive confidence intervals, comparing the parameter of i…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: submitted

  6. arXiv:2105.13716  [pdf, other]

    stat.ME

    A new Bayesian discrepancy measure

    Authors: Francesco Bertolino, Mara Manca, Monica Musio, Walter Racugno, Laura Ventura

    Abstract: The aim of this article is to make a contribution to the Bayesian procedure of testing precise hypotheses for parametric models. For this purpose, we define the Bayesian Discrepancy Measure that allows one to evaluate the suitability of a given hypothesis with respect to the available information (prior law and data). To summarise this information, the posterior median is employed, allowing a simp…

    Submitted 18 November, 2022; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: 29 pages 9 figures

    MSC Class: 62F15; 62F03; 62A; 62C10

  7. arXiv:2105.05771  [pdf, other]

    hep-ph astro-ph.CO gr-qc

    Spontaneous breaking of the Peccei-Quinn symmetry during warm inflation

    Authors: João G. Rosa, Luís B. Ventura

    Abstract: We show that, for values of the axion decay constant parametrically close to the GUT scale, the Peccei-Quinn phase transition may naturally occur during warm inflation. This results from interactions between the Peccei-Quinn scalar field and the ambient thermal bath, which is sustained by the inflaton field through dissipative effects. It is therefore possible for the axion field to appear as a dy…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 12 pages, 6 figures

  8. arXiv:2012.10941  [pdf, other]

    cs.CV cs.AI

    Can Everybody Sign Now? Exploring Sign Language Video Generation from 2D Poses

    Authors: Lucas Ventura, Amanda Duarte, Xavier Giro-i-Nieto

    Abstract: Recent work has addressed the generation of human poses represented by 2D/3D coordinates of human joints for sign language. We use the state of the art in Deep Learning for motion transfer and evaluate them on How2Sign, an American Sign Language dataset, to generate videos of signers performing sign language given a 2D pose skeleton. We evaluate the generated videos quantitatively and qualitative…

    Submitted 4 January, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

    Comments: Video here: https://youtu.be/4ve1sGzWl2g

  9. arXiv:2012.03988  [pdf, other]

    hep-ph astro-ph.CO gr-qc

    Warm Inflation, Neutrinos and Dark matter: a minimal extension of the Standard Model

    Authors: Miguel Levy, João G. Rosa, Luis B. Ventura

    Abstract: We show that warm inflation can be realized within a minimal extension of the Standard Model with three right-handed neutrinos, three complex scalars and a gauged lepton/B-L U(1) symmetry. This simple model can address all the shortcomings of the Standard Model that are not related to fine-tuning, within general relativity, with distinctive experimental signatures that can be probed in the near fu…

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 43 pages (30 main + 13 appendices), 8 figures. Comments are welcome

  10. arXiv:2008.12923  [pdf, other]

    hep-ph gr-qc

    Bending of light in axion backgrounds

    Authors: Jamie I. McDonald, Luís B. Ventura

    Abstract: In this work we examine refraction of light by computing full solutions to axion electrodynamics. We also allow for the possibility of an additional plasma component. We then specialise to wavelengths which are small compared to background scales to determine if refraction can be described by geometric optics. In the absence of p…

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: 14 pages, 7 figures. Comments are welcome

  11. arXiv:2008.08143  [pdf, other]

    cs.CV

    How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

    Authors: Amanda Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto

    Abstract: One of the factors that have hindered progress in the areas of sign language recognition, translation, and production is the absence of large annotated datasets. Towards this end, we introduce How2Sign, a multimodal and multiview continuous American Sign Language (ASL) dataset, consisting of a parallel corpus of more than 80 hours of sign language videos and a set of corresponding modalities inclu…

    Submitted 1 April, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: Accepted at CVPR 2021. Dataset website: http://how2sign.github.io/

  12. arXiv:2006.16370  [pdf, other]

    cs.LG cs.CL eess.IV stat.ML

    Classification of cancer pathology reports: a large-scale comparative study

    Authors: Stefano Martina, Leonardo Ventura, Paolo Frasconi

    Abstract: We report about the application of state-of-the-art deep learning techniques to the automatic and interpretable assignment of ICD-O3 topography and morphology codes to free-text cancer reports. We present results on a large dataset (more than 80 000 labeled and 1 500 000 unlabeled anonymized reports written in Italian and collected from hospitals in Tuscany over more than a decade) and with a larg…

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 10 pages, 6 figures, 3 tables, accepted for publication in IEEE Journal of Biomedical and Health Informatics (J-BHI)

    ACM Class: I.2.6; I.2.7; J.3

    Journal ref: IEEE Journal of Biomedical and Health Informatics 24 (11), 3085-3094 (2020)

  13. arXiv:2004.03187  [pdf, other]

    stat.AP stat.ME

    Robust inference for nonlinear regression models from the Tsallis score: application to Covid-19 contagion in Italy

    Authors: Paolo Girardi, Luca Greco, Valentina Mameli, Monica Musio, Walter Racugno, Erlis Ruli, Laura Ventura

    Abstract: We discuss an approach for fitting robust nonlinear regression models, which can be employed to model and predict the contagion dynamics of the Covid-19 in Italy. The focus is on the analysis of epidemic data using robust dose-response curves, but the functionality is applicable to arbitrary nonlinear regression models.

    Submitted 9 April, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: 15 pages, 6 figures, submitted

  14. arXiv:1911.10221  [pdf, other]

    hep-ph astro-ph.CO gr-qc

    Optical properties of dynamical axion backgrounds

    Authors: Jamie I. McDonald, Luís B. Ventura

    Abstract: We discuss spectral distortions, time delays and refraction of light in an axion or axion-plasma background. This involves solving the full set of geodesic equations associated to the system of Hamiltonian optics, allowing us to self-consistently take into account the evolution of the momentum, frequency and position of photons. We support our arguments with analytic approximations and full numeri…

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: Comments are welcome

    Journal ref: Phys. Rev. D 101, 123503 (2020)

  15. arXiv:1906.11835  [pdf, other]

    hep-ph astro-ph.CO gr-qc hep-th

    Warm Little Inflaton becomes Dark Energy

    Authors: João G. Rosa, Luís B. Ventura

    Abstract: We present a model where the inflaton field behaves like quintessence at late times, generating the present phase of accelerated expansion. This is achieved within the framework of warm inflation, in particular the Warm Little Inflaton scenario, where the underlying symmetries guarantee a successful inflationary period in a warm regime sustained by dissipative effects without significant backreact…

    Submitted 15 July, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: 6 pages, 3 figures, added references and comments. Comments are welcome

  16. arXiv:1906.03339  [pdf, other]

    stat.AP

    next-gen-scraPy: Extracting NFL Tracking Data from Images to Evaluate Quarterbacks and Pass Defenses

    Authors: Sarah Mallepalle, Ron Yurko, Konstantinos Pelechrinis, Samuel L. Ventura

    Abstract: The NFL collects detailed tracking data capturing the location of all players and the ball during each play. Although the raw form of this data is not publicly available, the NFL releases a set of aggregated statistics via their Next Gen Stats (NGS) platform. They also provide charts showing the locations of pass attempts and outcomes for individual quarterbacks. Our work aims to partially close t…

    Submitted 5 December, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  17. arXiv:1906.01760  [pdf, other]

    stat.AP

    Going Deep: Models for Continuous-Time Within-Play Valuation of Game Outcomes in American Football with Tracking Data

    Authors: Ronald Yurko, Francesca Matano, Lee F. Richardson, Nicholas Granered, Taylor Pospisil, Konstantinos Pelechrinis, Samuel L. Ventura

    Abstract: Continuous-time assessments of game outcomes in sports have become increasingly common in the last decade. In American football, only discrete-time estimates of play value were possible, since the most advanced public football datasets were recorded at the play-by-play level. While measures such as expected points and win probability are useful for evaluating football plays and game situations, th…

    Submitted 12 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

  18. arXiv:1811.05493  [pdf, other]

    hep-ph astro-ph.CO gr-qc hep-th

    Warm Little Inflaton becomes Cold Dark Matter

    Authors: Joao G. Rosa, Luis B. Ventura

    Abstract: We present a model where the inflaton can naturally account for all the dark matter in the Universe within the warm inflation paradigm. In particular, we show that the symmetries of the Warm Little Inflaton scenario (i) avoid large thermal and radiative corrections to the scalar potential, (ii) allow for sufficiently strong dissipative effects to sustain a radiation bath during inflation that beco…

    Submitted 8 May, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: 6 pages, 2 figures; Matches version published in Physical Review Letters

    Journal ref: Phys. Rev. Lett. 122, 161301 (2019)

  19. Objective Bayesian inference with proper scoring rules

    Authors: Federica Giummolè, Valentina Mameli, Erlis Ruli, Laura Ventura

    Abstract: Standard Bayesian analyses can be difficult to perform when the full likelihood, and consequently the full posterior distribution, is too complex and difficult to specify or if robustness with respect to data or to model misspecifications is required. In these situations, we suggest to resort to a posterior distribution for the parameter of interest based on proper scoring rules. Scoring rules are…

    Submitted 6 January, 2019; v1 submitted 29 November, 2017; originally announced November 2017.

    Comments: 29 pages and 9 figures

    Journal ref: Test 2019

  20. Robust approximate Bayesian inference

    Authors: Erlis Ruli, Nicola Sartori, Laura Ventura

    Abstract: We discuss an approach for deriving robust posterior distributions from $M$-estimating functions using Approximate Bayesian Computation (ABC) methods. In particular, we use $M$-estimating functions to construct suitable summary statistics in ABC algorithms. The theoretical properties of the robust posterior distributions are discussed. Special attention is given to the application of the method to…

    Submitted 12 June, 2019; v1 submitted 6 June, 2017; originally announced June 2017.

    Comments: This is a revised and personal manuscript version of the article that has been accepted for publication by Journal of Statistical Planning and Inference

  21. arXiv:1701.02383  [pdf, other]

    stat.OT physics.soc-ph q-bio.PE

    SPEW: Synthetic Populations and Ecosystems of the World

    Authors: Shannon Gallagher, Lee Richardson, Samuel L. Ventura, William F. Eddy

    Abstract: Agent-based models (ABMs) simulate interactions between autonomous agents in constrained environments over time. ABMs are often used for modeling the spread of infectious diseases. In order to simulate disease outbreaks or other phenomena, ABMs rely on "synthetic ecosystems," or information about agents and their environments that is representative of the real world. Previous approaches for genera…

    Submitted 9 January, 2017; originally announced January 2017.

  22. arXiv:1502.06440  [pdf, ps, other]

    stat.CO stat.ME

    Improved Laplace Approximation for Marginal Likelihoods

    Authors: Erlis Ruli, Nicola Sartori, Laura Ventura

    Abstract: Statistical applications often involve the calculation of intractable multidimensional integrals. The Laplace formula is widely used to approximate such integrals. However, in high-dimensional or small sample size problems, the shape of the integrand function may be far from that of the Gaussian density, and thus the standard Laplace approximation can be inaccurate. We propose an improved Laplace…

    Submitted 29 December, 2016; v1 submitted 23 February, 2015; originally announced February 2015.

    Comments: 24 pages

    Journal ref: Electronic Journal of Statistics 10(2), 3986-4009, 2016

  23. arXiv:1407.4099  [pdf, ps, other]

    astro-ph.CO gr-qc hep-ph hep-th

    Fine-structure constant constraints on Bekenstein-type models

    Authors: P. M. M. Leal, C. J. A. P. Martins, L. B. Ventura

    Abstract: Astrophysical tests of the stability of dimensionless fundamental couplings, such as the fine-structure constant $α$, are an area of much increased recent activity, following some indications of possible spacetime variations at the few parts per million level. Here we obtain updated constraints on the Bekenstein-Sandvik-Barrow-Magueijo model, which is arguably the simplest model allowing for $α$ v…

    Submitted 15 July, 2014; originally announced July 2014.

    Comments: 5 pages, 2 figures

    Journal ref: Phys. Rev. D 90, 027305 (2014)

  24. arXiv:1407.3191  [pdf, other]

    cs.DB stat.AP

    A Comparison of Blocking Methods for Record Linkage

    Authors: Rebecca C. Steorts, Samuel L. Ventura, Mauricio Sadinle, Stephen E. Fienberg

    Abstract: Record linkage seeks to merge databases and to remove duplicates when unique identifiers are not available. Most approaches use blocking techniques to reduce the computational complexity associated with record linkage. We review traditional blocking techniques, which typically partition the records according to a set of field attributes, and consider two variants of a method known as locality sens…

    Submitted 11 July, 2014; originally announced July 2014.

    Comments: 22 pages, 2 tables, 7 figures

  25. Minimum scoring rule inference

    Authors: Philip Dawid, Monica Musio, Laura Ventura

    Abstract: Proper scoring rules are methods for encouraging honest assessment of probability distributions. Just like likelihood, a proper scoring rule can be applied to supply an unbiased estimating equation for any statistical model, and the theory of such equations can be applied to understand the properties of the associated estimator. In this paper we develop some basic scoring rule estimation theory, a…

    Submitted 16 March, 2014; originally announced March 2014.

    Comments: 27 pages, 3 figures

    MSC Class: 62F99

    Journal ref: Scandinavian Journal of Statistics, Volume 43, Issue 1, pages 123-138, March 2016

  26. Approximate Bayesian Computation with composite score functions

    Authors: Erlis Ruli, Nicola Sartori, Laura Ventura

    Abstract: Both Approximate Bayesian Computation (ABC) and composite likelihood methods are useful for Bayesian and frequentist inference, respectively, when the likelihood function is intractable. We propose to use composite likelihood score functions as summary statistics in ABC in order to obtain accurate approximations to the posterior distribution. This is motivated by the use of the score function of t…

    Submitted 24 February, 2015; v1 submitted 28 November, 2013; originally announced November 2013.

    Comments: Statistics and Computing (final version)

    Report number: STCO-D-14-00149R2

  27. arXiv:1304.1756  [pdf, other]

    stat.AP

    Trouble With The Curve: Improving MLB Pitch Classification

    Authors: Michael A. Pane, Samuel L. Ventura, Rebecca C. Steorts, A. C. Thomas

    Abstract: The PITCHf/x database has allowed the statistical analysis of Major League Baseball (MLB) to flourish since its introduction in late 2006. Using PITCHf/x, pitches have been classified by hand, requiring considerable effort, or using neural network clustering and classification, which is often difficult to interpret. To address these issues, we use model-based clustering with a multivariate Gaus…

    Submitted 5 April, 2013; originally announced April 2013.

  28. A note on marginal posterior simulation via higher-order tail area approximations

    Authors: Erlis Ruli, Nicola Sartori, Laura Ventura

    Abstract: We explore the use of higher-order tail area approximations for Bayesian simulation. These approximations give rise to an alternative simulation scheme to MCMC for Bayesian computation of marginal posterior distributions for a scalar parameter of interest, in the presence of nuisance parameters. Its advantage over MCMC methods is that samples are drawn independently with lower computational time a…

    Submitted 5 December, 2012; originally announced December 2012.

    Journal ref: Bayesian Analysis 09 2014

  29. arXiv:1208.0799  [pdf, other]

    stat.AP

    Competing Process Hazard Function Models for Player Ratings in Ice Hockey

    Authors: A. C. Thomas, Samuel L. Ventura, Shane Jensen, Stephen Ma

    Abstract: Evaluating the overall ability of players in the National Hockey League (NHL) is a difficult task. Existing methods such as the famous "plus/minus" statistic have many shortcomings. Standard linear regression methods work well when player substitutions are relatively uncommon and scoring events are relatively common, such as in basketball, but as neither of these conditions exists for hockey, we u…

    Submitted 28 February, 2013; v1 submitted 3 August, 2012; originally announced August 2012.