Skip to main content

Showing 1–19 of 19 results for author: Chiaromonte, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.17925  [pdf, other

    cs.LG stat.AP

    Accurate and fast anomaly detection in industrial processes and IoT environments

    Authors: Simone Tonini, Andrea Vandin, Francesca Chiaromonte, Daniele Licari, Fernando Barsacchi

    Abstract: We present a novel, simple and widely applicable semi-supervised procedure for anomaly detection in industrial and IoT environments, SAnD (Simple Anomaly Detection). SAnD comprises 5 steps, each leveraging well-known statistical tools, namely; smoothing filters, variance inflation factors, the Mahalanobis distance, threshold selection algorithms and feature importance techniques. To our knowledge,… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  2. arXiv:2312.16346  [pdf, other

    stat.AP

    An efficient approach to characterize spatio-temporal dependence in cortical surface fMRI data

    Authors: Huy Dang, Marzia Cremona, Nicole Lazar, Francesca Chiaromonte

    Abstract: Functional magnetic resonance imaging (fMRI) is a neuroimaging technique known for its ability to capture brain activity non-invasively and at fine spatial resolution (2-3mm). Cortical surface fMRI (cs-fMRI) is a recent development of fMRI that focuses on signals from tissues that have neuronal activities, as opposed to the whole brain. cs-fMRI data is plagued with non-stationary spatial correlati… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  3. arXiv:2307.09820  [pdf, other

    stat.AP physics.soc-ph

    Contrasting pre-vaccine COVID-19 waves in Italy through Functional Data Analysis

    Authors: Tobia Boschi, Jacopo Di Iorio, Lorenzo Testa, Marzia A. Cremona, Francesca Chiaromonte

    Abstract: We use data from 107 Italian provinces to characterize and compare mortality patterns in the first two COVID-19 epidemic waves, which occurred prior to the introduction of vaccines. We also associate these patterns with mobility, timing of government restrictions, and socio-demographic, infrastructural, and environmental covariates. Notwithstanding limitations in the accuracy and reliability of pu… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: main: 12 pages, 5 figures supplement: 8 pages, 11 figures

  4. arXiv:2306.04254  [pdf, other

    stat.ME

    funBIalign: a hierachical algorithm for functional motif discovery based on mean squared residue scores

    Authors: Jacopo Di Iorio, Marzia A. Cremona, Francesca Chiaromonte

    Abstract: Motif discovery is gaining increasing attention in the domain of functional data analysis. Functional motifs are typical "shapes" or "patterns" that recur multiple times in different portions of a single curve and/or in misaligned portions of multiple curves. In this paper, we define functional motifs using an additive model and we propose funBIalign for their discovery and evaluation. Inspired by… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  5. arXiv:2303.14801  [pdf, other

    stat.ME stat.CO stat.ML

    FAStEN: an efficient adaptive method for feature selection and estimation in high-dimensional functional regressions

    Authors: Tobia Boschi, Lorenzo Testa, Francesca Chiaromonte, Matthew Reimherr

    Abstract: Functional regression analysis is an established tool for many contemporary scientific applications. Regression problems involving large and complex data sets are ubiquitous, and feature selection is crucial for avoiding overfitting and achieving accurate predictions. We propose a new, flexible and ultra-efficient approach to perform feature selection in a sparse high dimensional function-on-funct… ▽ More

    Submitted 4 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  6. arXiv:2206.05718  [pdf, other

    stat.ME

    smoothEM: a new approach for the simultaneous assessment of smooth patterns and spikes

    Authors: Huy Dang, Marzia Cremona, Francesca Chiaromonte

    Abstract: We consider functional data where an underlying smooth curve is composed not just with errors, but also with irregular spikes. We propose an approach that, combining regularized spline smoothing and an Expectation-Maximization algorithm, allows one to both identify spikes and estimate the smooth component. Imposing some assumptions on the error distribution, we prove consistency of EM estimates. N… ▽ More

    Submitted 16 July, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

  7. Venture Capital investments through the lens of Network and Functional Data Analysis

    Authors: Christian Esposito, Marco Gortan, Lorenzo Testa, Francesca Chiaromonte, Giorgio Fagiolo, Andrea Mina, Giulio Rossetti

    Abstract: In this paper we characterize the performance of venture capital-backed firms based on their ability to attract investment. The aim of the study is to identify relevant predictors of success built from the network structure of firms' and investors' relations. Focusing on deal-level data for the health sector, we first create a bipartite network among firms and investors, and then apply functional… ▽ More

    Submitted 10 August, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: 17 pages, 9 figures, supplementary material attached

    Journal ref: Applied Network Science 7, 42 (2022)

  8. arXiv:2111.06371  [pdf, ps, other

    cs.SI econ.GN stat.AP

    Can you always reap what you sow? Network and functional data analysis of VC investments in health-tech companies

    Authors: Christian Esposito, Marco Gortan, Lorenzo Testa, Francesca Chiaromonte, Giorgio Fagiolo, Andrea Mina, Giulio Rossetti

    Abstract: "Success" of firms in venture capital markets is hard to define, and its determinants are still poorly understood. We build a bipartite network of investors and firms in the healthcare sector, describing its structure and its communities. Then, we characterize "success" introducing progressively more refined definitions, and we find a positive association between such definitions and the centralit… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: 12 pages, 4 figures, accepted for publication in the proceedings of the 10th International Conference on Complex Networks and Their Applications

    Journal ref: Proceedings of 10th International Conference of Complex Networks and their applications 2021

  9. arXiv:2106.11941  [pdf, ps, other

    stat.ME

    Doubly Robust Feature Selection with Mean and Variance Outlier Detection and Oracle Properties

    Authors: Luca Insolia, Francesca Chiaromonte, Runze Li, Marco Riani

    Abstract: We propose a general approach to handle data contaminations that might disrupt the performance of feature selection and estimation procedures for high-dimensional linear models. Specifically, we consider the co-occurrence of mean-shift and variance-inflation outliers, which can be modeled as additional fixed and random components, respectively, and evaluated independently. Our proposal performs fe… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: 35 pages, 9 figures (including supplementary material)

  10. arXiv:2104.09452  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Epsilon Consistent Mixup: Structural Regularization with an Adaptive Consistency-Interpolation Tradeoff

    Authors: Vincent Pisztora, Yanglan Ou, Xiaolei Huang, Francesca Chiaromonte, Jia Li

    Abstract: In this paper we propose $ε$-Consistent Mixup ($ε$mu). $ε$mu is a data-based structural regularization technique that combines Mixup's linear interpolation with consistency regularization in the Mixup direction, by compelling a simple adaptive tradeoff between the two. This learnable combination of consistency and interpolation induces a more flexible structure on the evolution of the response acr… ▽ More

    Submitted 29 September, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

  11. arXiv:2008.04700  [pdf, other

    stat.AP physics.soc-ph

    The shapes of an epidemic: using Functional Data Analysis to characterize COVID-19 in Italy

    Authors: Tobia Boschi, Jacopo Di Iorio, Lorenzo Testa, Marzia A. Cremona, Francesca Chiaromonte

    Abstract: We investigate patterns of COVID-19 mortality across 20 Italian regions and their association with mobility, positivity, and socio-demographic, infrastructural and environmental covariates. Notwithstanding limitations in accuracy and resolution of the data available from public sources, we pinpoint significant trends exploiting information in curves and shapes with Functional Data Analysis techniq… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

    MSC Class: 62P10; 62R10 ACM Class: J.3

    Journal ref: Scientific Reports volume 11, Article number: 17054 (2021)

  12. arXiv:2007.06114  [pdf, ps, other

    stat.ME math.ST

    Simultaneous Feature Selection and Outlier Detection with Optimality Guarantees

    Authors: Luca Insolia, Ana Kenney, Francesca Chiaromonte, Giovanni Felici

    Abstract: Sparse estimation methods capable of tolerating outliers have been broadly investigated in the last decade. We contribute to this research considering high-dimensional regression problems contaminated by multiple mean-shift outliers which affect both the response and the design matrix. We develop a general framework for this class of problems and propose the use of mixed-integer programming to sim… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

  13. arXiv:2006.03970  [pdf, other

    stat.ML cs.LG stat.CO

    An Efficient Semi-smooth Newton Augmented Lagrangian Method for Elastic Net

    Authors: Tobia Boschi, Matthew Reimherr, Francesca Chiaromonte

    Abstract: Feature selection is an important and active research area in statistics and machine learning. The Elastic Net is often used to perform selection when the features present non-negligible collinearity or practitioners wish to incorporate additional known structure. In this article, we propose a new Semi-smooth Newton Augmented Lagrangian Method to efficiently solve the Elastic Net in ultra-high dim… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

    MSC Class: 62J07 ACM Class: G.3

  14. arXiv:2006.03141  [pdf, other

    cs.SI physics.soc-ph stat.AP

    The relationship between human mobility and viral transmissibility during the COVID-19 epidemics in Italy

    Authors: Paolo Cintia, Luca Pappalardo, Salvatore Rinzivillo, Daniele Fadda, Tobia Boschi, Fosca Giannotti, Francesca Chiaromonte, Pietro Bonato, Francesco Fabbri, Francesco Penone, Marcello Savarese, Francesco Calabrese, Giorgio Guzzetta, Flavia Riccardo, Valentina Marziano, Piero Poletti, Filippo Trentini, Antonino Bella, Xanthi Andrianou, Martina Del Manso, Massimo Fabiani, Stefania Bellino, Stefano Boros, Alberto Mateo Urdiales, Maria Fenicia Vescio , et al. (7 additional authors not shown)

    Abstract: In 2020, countries affected by the COVID-19 pandemic implemented various non-pharmaceutical interventions to contrast the spread of the virus and its impact on their healthcare systems and economies. Using Italian data at different geographic scales, we investigate the relationship between human mobility, which subsumes many facets of the population's response to the changing situation, and the sp… ▽ More

    Submitted 1 April, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

  15. arXiv:1907.11142  [pdf, other

    stat.ME cs.LG stat.ML

    On the bias of H-scores for comparing biclusters, and how to correct it

    Authors: Jacopo Di Iorio, Francesca Chiaromonte, Marzia A. Cremona

    Abstract: In the last two decades several biclustering methods have been developed as new unsupervised learning techniques to simultaneously cluster rows and columns of a data matrix. These algorithms play a central role in contemporary machine learning and in many applications, e.g. to computational biology and bioinformatics. The H-score is the evaluation score underlying the seminal biclustering algorith… ▽ More

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: 12 pages, 3 figures

    Journal ref: Bioinformatics 2020, 36(1): 2955-2957

  16. Probabilistic $K$-mean with local alignment for clustering and motif discovery in functional data

    Authors: Marzia A. Cremona, Francesca Chiaromonte

    Abstract: We develop a new method to locally cluster curves and discover functional motifs, i.e.~typical ``shapes'' that may recur several times along and across the curves capturing important local characteristics. In order to identify these shared curve portions, our method leverages ideas from functional data analysis (joint clustering and alignment of curves), bioinformatics (local alignment through the… ▽ More

    Submitted 7 July, 2020; v1 submitted 14 August, 2018; originally announced August 2018.

    Comments: 22 pages, 6 figures. This work has been presented at various conferences

    Journal ref: Journal of Computational and Graphical Statistics 2022

  17. arXiv:1808.02526  [pdf, other

    stat.ME

    MIP-BOOST: Efficient and Effective $L_0$ Feature Selection for Linear Regression

    Authors: Ana Kenney, Francesca Chiaromonte, Giovanni Felici

    Abstract: Recent advances in mathematical programming have made Mixed Integer Optimization a competitive alternative to popular regularization methods for selecting features in regression problems. The approach exhibits unquestionable foundational appeal and versatility, but also poses important challenges. Here we propose MIP-BOOST, a revision of standard Mixed Integer Programming feature selection that re… ▽ More

    Submitted 30 September, 2019; v1 submitted 7 August, 2018; originally announced August 2018.

    Comments: This work has been presented at JSM 2018 (Vancouver, Canada), ISNPS 2018 (Salerno, Italy), and various other conferences

  18. arXiv:1506.08278  [pdf, other

    math.ST stat.ME

    Composite likelihood inference in a discrete latent variable model for two-way "clustering-by-segmentation" problems

    Authors: Francesco Bartolucci, Francesca Chiaromonte, Prabhani Kuruppumullage Don, Bruce George Lindsay

    Abstract: We consider a discrete latent variable model for two-way data arrays, which allows one to simultaneously produce clusters along one of the data dimensions (e.g. exchangeable observational units or features) and contiguous groups, or segments, along the other (e.g. consecutively ordered times or locations). The model relies on a hidden Markov structure but, given its complexity, cannot be estimated… ▽ More

    Submitted 27 June, 2015; originally announced June 2015.

  19. arXiv:1401.5506  [pdf, other

    stat.ME

    An attraction-repulsion point process model for respiratory syncytial virus infections

    Authors: Joshua Goldstein, Murali Haran, Ivan Simeonov, John Fricks, Francesca Chiaromonte

    Abstract: How is the progression of a virus influenced by properties intrinsic to individual cells? We address this question by studying the susceptibility of cells infected with two strains of the human respiratory syncytial virus (RSV-A and RSV-B) in an in vitro experiment. Spatial patterns of infected cells give us insight into how local conditions influence susceptibility to the virus. We observe a comp… ▽ More

    Submitted 13 July, 2014; v1 submitted 21 January, 2014; originally announced January 2014.