Skip to main content

Showing 1–11 of 11 results for author: Denti, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.13310  [pdf, other

    stat.ME stat.AP

    A finite-infinite shared atoms nested model for the Bayesian analysis of large grouped data

    Authors: Laura D'Angelo, Francesco Denti

    Abstract: The use of hierarchical mixture priors with shared atoms has recently flourished in the Bayesian literature for partially exchangeable data. Leveraging on nested levels of mixtures, these models allow the estimation of a two-layered data partition: across groups and across observations. This paper discusses and compares the properties of such modeling strategies when the mixing weights are assigne… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2212.01865  [pdf, other

    stat.CO stat.AP

    Variational Inference for Semiparametric Bayesian Novelty Detection in Large Datasets

    Authors: Luca Benedetti, Eric Boniardi, Leonardo Chiani, Jacopo Ghirri, Marta Mastropietro, Andrea Cappozzo, Francesco Denti

    Abstract: After being trained on a fully-labeled training set, where the observations are grouped into a certain number of known classes, novelty detection methods aim to classify the instances of an unlabeled test set while allowing for the presence of previously unseen classes. These models are valuable in many areas, ranging from social network and food adulteration analyses to biology, where an evolving… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  3. arXiv:2205.00930  [pdf, other

    stat.ME stat.AP

    Multiple hypothesis screening using mixtures of non-local distributions with applications to genomic studies

    Authors: Francesco Denti, Stefano Peluso, Michele Guindani, Antonietta Mira

    Abstract: The analysis of large-scale datasets, especially in biomedical contexts, frequently involves a principled screening of multiple hypotheses. The celebrated two-group model jointly models the distribution of the test statistics with mixtures of two competing densities, the null and the alternative distributions. We investigate the use of weighted densities and, in particular, non-local densities as… ▽ More

    Submitted 9 March, 2023; v1 submitted 2 May, 2022; originally announced May 2022.

  4. arXiv:2203.04165  [pdf, other

    stat.AP stat.CO stat.ML

    On the intrinsic dimensionality of Covid-19 data: a global perspective

    Authors: Abhishek Varghese, Edgar Santos-Fernandez, Francesco Denti, Antonietta Mira, Kerrie Mengersen

    Abstract: This paper aims to develop a global perspective of the complexity of the relationship between the standardised per-capita growth rate of Covid-19 cases, deaths, and the OxCGRT Covid-19 Stringency Index, a measure describing a country's stringency of lockdown policies. To achieve our goal, we use a heterogeneous intrinsic dimension estimator implemented as a Bayesian mixture model, called Hidalgo.… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    MSC Class: 62P10

  5. arXiv:2106.08281  [pdf, other

    stat.ME

    A Horseshoe mixture model for Bayesian screening with an application to light sheet fluorescence microscopy in brain imaging

    Authors: Francesco Denti, Ricardo Azevedo, Chelsie Lo, Damian Wheeler, Sunil P. Gandhi, Michele Guindani, Babak Shahbaba

    Abstract: In this paper, we focus on identifying differentially activated brain regions using a light sheet fluorescence microscopy - a recently developed technique for whole-brain imaging. Most existing statistical methods solve this problem by partitioning the brain regions into two classes: significantly and non-significantly activated. However, for the brain imaging problem at the center of our study, s… ▽ More

    Submitted 27 January, 2023; v1 submitted 15 June, 2021; originally announced June 2021.

  6. arXiv:2104.13832  [pdf, other

    stat.ME

    Distributional Results for Model-Based Intrinsic Dimension Estimators

    Authors: Francesco Denti, Diego Doimo, Alessandro Laio, Antonietta Mira

    Abstract: Modern datasets are characterized by a large number of features that may conceal complex dependency structures. To deal with this type of data, dimensionality reduction techniques are essential. Numerous dimensionality reduction methods rely on the concept of intrinsic dimension, a measure of the complexity of the dataset. In this article, we first review the TWO-NN model, a likelihood-based intri… ▽ More

    Submitted 1 June, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

  7. arXiv:2102.11425  [pdf, other

    stat.CO stat.ME

    intRinsic: an R Package for Model-Based Estimation of the Intrinsic Dimension of a Dataset

    Authors: Francesco Denti

    Abstract: This article illustrates intRinsic, an R package that implements novel state-of-the-art likelihood-based estimators of the intrinsic dimension of a dataset, an essential quantity for most dimensionality reduction techniques. In order to make these novel estimators easily accessible, the package contains a small number of high-level functions that rely on a broader set of efficient, low-level routi… ▽ More

    Submitted 23 February, 2023; v1 submitted 22 February, 2021; originally announced February 2021.

  8. arXiv:2008.07077  [pdf, other

    stat.ME stat.AP

    A Common Atom Model for the Bayesian Nonparametric Analysis of Nested Data

    Authors: Francesco Denti, Federico Camerlenghi, Michele Guindani, Antonietta Mira

    Abstract: The use of high-dimensional data for targeted therapeutic interventions requires new ways to characterize the heterogeneity observed across subgroups of a specific population. In particular, models for partially exchangeable data are needed for inference on nested datasets, where the observations are assumed to be organized in different units and some sharing of information is required to learn di… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

  9. A Two-Stage Bayesian Semiparametric Model for Novelty Detection with Robust Prior Information

    Authors: Francesco Denti, Andrea Cappozzo, Francesca Greselin

    Abstract: Novelty detection methods aim at partitioning the test units into already observed and previously unseen patterns. However, two significant issues arise: there may be considerable interest in identifying specific structures within the novelty, and contamination in the known classes could completely blur the actual separation between manifest and new groups. Motivated by these problems, we propose… ▽ More

    Submitted 17 June, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  10. arXiv:2002.04148  [pdf, other

    stat.AP

    The role of intrinsic dimension in high-resolution player tracking data -- Insights in basketball

    Authors: Edgar Santos-Fernandez, Francesco Denti, Kerrie Mengersen, Antonietta Mira

    Abstract: A new range of statistical analysis has emerged in sports after the introduction of the high-resolution player tracking technology, specifically in basketball. However, this high dimensional data is often challenging for statistical inference and decision making. In this article, we employ Hidalgo, a state-of-the-art Bayesian mixture model that allows the estimation of heterogeneous intrinsic dime… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 21 pages, 16 figures, Codes + data + results can be found in https://github.com/EdgarSantos-Fernandez/id_basketball, Submitted

  11. arXiv:1902.10459  [pdf, other

    stat.ML cs.LG

    Data segmentation based on the local intrinsic dimension

    Authors: Michele Allegra, Elena Facco, Francesco Denti, Alessandro Laio, Antonietta Mira

    Abstract: One of the founding paradigms of machine learning is that a small number of variables is often sufficient to describe high-dimensional data. The minimum number of variables required is called the intrinsic dimension (ID) of the data. Contrary to common intuition, there are cases where the ID varies within the same data set. This fact has been highlighted in technical discussions, but seldom exploi… ▽ More

    Submitted 13 July, 2020; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: 11 pages, 6 figures + 9 pages Supplementary Information