Showing 1–13 of 13 results for author: Liquet, B

  1. arXiv:2404.13339  [pdf, other]

    stat.ME stat.CO

    Group COMBSS: Group Selection via Continuous Optimization

    Authors: Anant Mathur, Sarat Moka, Benoit Liquet, Zdravko Botev

    Abstract: We present a new optimization method for the group selection problem in linear regression. In this problem, predictors are assumed to have a natural group structure and the goal is to select a small set of groups that best fits the response. The incorporation of group structure in a predictor matrix is a key factor in obtaining better estimators and identifying associations between response and pr…

    Submitted 20 April, 2024; originally announced April 2024.

  2. arXiv:2403.20007  [pdf, other]

    stat.ME stat.CO stat.OT

    Best Subset Solution Path for Linear Dimension Reduction Models using Continuous Optimization

    Authors: Benoit Liquet, Sarat Moka, Samuel Muller

    Abstract: The selection of best variables is a challenging problem in supervised and unsupervised learning, especially in high dimensional contexts where the number of variables is usually much larger than the number of observations. In this paper, we focus on two multivariate statistical methods: principal components analysis and partial least squares. Both approaches are popular linear dimension-reduction…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Main paper 26 pages including references and 17 pages for the supplementary material

  3. arXiv:2403.13076  [pdf, other]

    stat.ME

    Spatial Autoregressive Model on a Dirichlet Distribution

    Authors: Teo Nguyen, Sarat Moka, Kerrie Mengersen, Benoit Liquet

    Abstract: Compositional data find broad application across diverse fields due to their efficacy in representing proportions or percentages of various components within a whole. Spatial dependencies often exist in compositional data, particularly when the data represents different land uses or ecological variables. Ignoring the spatial autocorrelations in modelling of compositional data may lead to incorrect…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 33 pages, 2 figures, submitted to "Computational Statistics & Data Analysis"

  4. arXiv:2403.12332  [pdf, other]

    stat.ME

    A maximum penalised likelihood approach for semiparametric accelerated failure time models with time-varying covariates and partly interval censoring

    Authors: Aishwarya Bhaskaran, Ding Ma, Benoit Liquet, Angela Hong, Serigne N Lo, Stephane Heritier, Jun Ma

    Abstract: Accelerated failure time (AFT) models are frequently used for modelling survival data. This approach is attractive as it quantifies the direct relationship between the time until an event occurs and various covariates. It asserts that the failure times experience either acceleration or deceleration through a multiplicative factor when these covariates are present. While existing literature provide…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 31 pages, 5 figures, 4 tables

  5. arXiv:2205.02617  [pdf, other]

    stat.ME stat.CO

    COMBSS: Best Subset Selection via Continuous Optimization

    Authors: Sarat Moka, Benoit Liquet, Houying Zhu, Samuel Muller

    Abstract: The problem of best subset selection in linear regression is considered with the aim to find a fixed size subset of features that best fits the response. This is particularly challenging when the total available number of features is very large compared to the number of data samples. Existing optimal methods for solving this problem tend to be slow while fast methods tend to have low accuracy. Ide…

    Submitted 24 November, 2023; v1 submitted 5 May, 2022; originally announced May 2022.

  6. arXiv:2203.11873  [pdf, other]

    stat.ME

    Nonstationary Spatial Process Models with Spatially Varying Covariance Kernels

    Authors: Sébastien Coube-Sisqueille, Sudipto Banerjee, Benoît Liquet

    Abstract: Spatial process models for capturing nonstationary behavior in scientific data present several challenges with regard to statistical inference and uncertainty quantification. While nonstationary spatially-varying kernels are attractive for their flexibility and richness, their practical implementation has been reported to be overwhelmingly cumbersome because of the high-dimensional parameter space…

    Submitted 28 March, 2024; v1 submitted 22 March, 2022; originally announced March 2022.

  7. Understanding links between water-quality variables and nitrate concentration in freshwater streams using high-frequency sensor data

    Authors: Claire Kermorvant, Benoit Liquet, Guy Litt, Kerrie Mengersen, Erin Peterson, Rob Hyndman, Jeremy B. Jones Jr., Catherine Leigh

    Abstract: Real time monitoring using in situ sensors is becoming a common approach for measuring water quality within watersheds. High frequency measurements produce big data sets that present opportunities to conduct new analyses for improved understanding of water quality dynamics and more effective management of rivers and streams. Of primary importance is enhancing knowledge of the relationships between…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 4 figures, 17 pages

    MSC Class: I.2.7 ACM Class: F.2.2

  8. arXiv:2010.00896  [pdf, other]

    stat.CO stat.AP

    Improving performances of MCMC for Nearest Neighbor Gaussian Process models with full data augmentation

    Authors: Sébastien Coube-Sisqueille, Benoît Liquet

    Abstract: Even though Nearest Neighbor Gaussian Processes (NNGP) considerably ease the MCMC implementation of Bayesian space-time models, they do not solve the convergence problems caused by high model dimension. Frugal alternatives such as response or collapsed algorithms are an answer. Our approach is to keep full data augmentation but to try and make it more efficient. We present two strategies to d…

    Submitted 14 September, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: 15 pages, 7 figures

  9. arXiv:2005.14462  [pdf, other]

    stat.ME

    Estimation of Semi-Markov Multi-state Models: A Comparison of the Sojourn Times and Transition Intensities Approaches

    Authors: Azam Asanjarani, Benoit Liquet, Yoni Nazarathy

    Abstract: Semi-Markov models are widely used for survival analysis and reliability analysis. In general, there are two competing parameterizations and each entails its own interpretation and inference properties. On the one hand, a semi-Markov process can be defined based on the distribution of sojourn times, often via hazard rates, together with transition probabilities of an embedded Markov chain. On the…

    Submitted 28 December, 2020; v1 submitted 29 May, 2020; originally announced May 2020.

  10. arXiv:1702.07066  [pdf, other]

    stat.ML

    A Unified Parallel Algorithm for Regularized Group PLS Scalable to Big Data

    Authors: Pierre Lafaye de Micheaux, Benoit Liquet, Matthew Sutton

    Abstract: Partial Least Squares (PLS) methods have been heavily exploited to analyse the association between two blocks of data. These powerful approaches can be applied to data sets where the number of variables is greater than the number of observations and in the presence of high collinearity between variables. Different sparse versions of PLS have been developed to integrate multiple data sets while simultan…

    Submitted 22 February, 2017; originally announced February 2017.

  11. arXiv:1503.01842  [pdf, other]

    stat.CO

    CEoptim: Cross-Entropy R Package for Optimization

    Authors: Tim Benham, Qibin Duan, Dirk P. Kroese, Benoit Liquet

    Abstract: The cross-entropy (CE) method is a simple and versatile technique for optimization, based on Kullback-Leibler (or cross-entropy) minimization. The method can be applied to a wide range of optimization tasks, including continuous, discrete, mixed and constrained optimization problems. The new package CEoptim provides the R implementation of the CE method for optimization. We describe the general CE m…

    Submitted 5 March, 2015; originally announced March 2015.

    Comments: 28 pages, 11 figures

  12. arXiv:1503.00890  [pdf, other]

    stat.CO stat.ME

    Estimation of extended mixed models using latent classes and latent processes: the R package lcmm

    Authors: Cécile Proust-Lima, Viviane Philipps, Benoit Liquet

    Abstract: The R package lcmm provides a series of functions to estimate statistical models based on linear mixed model theory. It includes the estimation of mixed models and latent class mixed models for Gaussian longitudinal outcomes (hlme), curvilinear and ordinal univariate longitudinal outcomes (lcmm) and curvilinear multivariate outcomes (multlcmm), as well as joint latent class mixed models (Jointlcmm…

    Submitted 24 January, 2016; v1 submitted 3 March, 2015; originally announced March 2015.

    Journal ref: Journal of Statistical Software (2017), 78(2), 1-56

  13. arXiv:1112.0295  [pdf, other]

    stat.CO

    ClustOfVar: An R Package for the Clustering of Variables

    Authors: M. Chavent, V. Kuentz, B. Liquet, L. Saracco

    Abstract: Clustering of variables is a way to arrange variables into homogeneous clusters, i.e., groups of variables which are strongly related to each other and thus bring the same information. These approaches can then be useful for dimension reduction and variable selection. Several specific methods have been developed for the clustering of numerical variables. However, concerning qualitative variables…

    Submitted 1 December, 2011; originally announced December 2011.

    Journal ref: Journal of Statistical Software (2012), 50(13), 1-16