Showing 1–37 of 37 results for author: Kneib, T

  1. arXiv:2404.07440

    stat.ME

    Bayesian Penalized Transformation Models: Structured Additive Location-Scale Regression for Arbitrary Conditional Distributions

    Authors: Johannes Brachem, Paul F. V. Wiemann, Thomas Kneib

    Abstract: Penalized transformation models (PTMs) are a novel form of location-scale regression. In PTMs, the shape of the response's conditional distribution is estimated directly from the data, and structured additive predictors are placed on its location and scale. The core of the model is a monotonically increasing transformation function that relates the response distribution to a reference distribution…

    Submitted 7 May, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  2. arXiv:2309.16861

    stat.ME math.ST

    Demystifying Spatial Confounding

    Authors: Emiko Dupont, Isa Marques, Thomas Kneib

    Abstract: Spatial confounding is a fundamental issue in regression models for spatially indexed data. It arises because spatial random effects, included to approximate unmeasured spatial variation, are typically not independent of the covariates in the model. This can lead to significant bias in covariate effect estimates. Despite extensive research, it is still a topic of much confusion with sometimes puzz…

    Submitted 28 September, 2023; originally announced September 2023.

  3. A simplified spatial+ approach to mitigate spatial confounding in multivariate spatial areal models

    Authors: A. Urdangarin, T. Goicoa, T. Kneib, M. D. Ugarte

    Abstract: Spatial areal models encounter the well-known and challenging problem of spatial confounding. This issue makes it arduous to distinguish between the impacts of observed covariates and spatial random effects. Despite previous research and various proposed methods to tackle this problem, finding a definitive solution remains elusive. In this paper, we propose a simplified version of the spatial+ app…

    Submitted 5 January, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Journal ref: Spatial Statistics (2024)

  4. Spatial Joint Models through Bayesian Structured Piece-wise Additive Joint Modelling for Longitudinal and Time-to-Event Data

    Authors: Anja Rappl, Thomas Kneib, Stefan Lang, Elisabeth Bergherr

    Abstract: Joint models for longitudinal and time-to-event data have seen many developments in recent years. However, spatial joint models are still rare, and the traditional proportional hazards formulation of the time-to-event part of the model is accompanied by computational challenges. We propose a joint model with a piece-wise exponential formulation of the hazard using the counting process representation…

    Submitted 14 February, 2023; originally announced February 2023.

  5. arXiv:2301.11862

    stat.ML cs.LG

    Neural Additive Models for Location Scale and Shape: A Framework for Interpretable Neural Regression Beyond the Mean

    Authors: Anton Thielmann, René-Marcel Kruse, Thomas Kneib, Benjamin Säfken

    Abstract: Deep neural networks (DNNs) have proven to be highly effective in a variety of tasks, making them the go-to method for problems requiring high-level predictive power. Despite this success, the inner workings of DNNs are often not transparent, making them difficult to interpret or understand. This lack of interpretability has led to increased research on inherently interpretable neural networks in…

    Submitted 29 February, 2024; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  6. arXiv:2209.10975

    stat.CO

    Liesel: A Probabilistic Programming Framework for Developing Semi-Parametric Regression Models and Custom Bayesian Inference Algorithms

    Authors: Hannes Riebl, Paul F. V. Wiemann, Thomas Kneib

    Abstract: Liesel is a new probabilistic programming framework developed with the aim of supporting research on Bayesian inference based on Markov chain Monte Carlo (MCMC) simulations in general and semi-parametric regression specifications in particular. Its three main components are (i) an R interface (RLiesel) for the configuration of an initial semi-parametric regression model, (ii) a graph-based model b…

    Submitted 29 November, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 27 pages, 10 figures, updated for compatibility with Liesel v0.2.7, added second case study

  7. arXiv:2208.10294

    stat.ME

    Multivariate Distributional Stochastic Frontier Models

    Authors: Rouven Schmidt, Thomas Kneib

    Abstract: The primary objective of Stochastic Frontier (SF) Analysis is the deconvolution of the estimated composed error terms into noise and inefficiency. Assuming a parametric production function (e.g. Cobb-Douglas, Translog, etc.) might lead to false inefficiency estimates. To overcome this limiting assumption, the production function can be modelled utilizing P-splines. Application of this powerful an…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 17 pages, 3 figures

  8. arXiv:2207.11236

    cs.IR cs.CL cs.LG stat.ML

    Twitmo: A Twitter Data Topic Modeling and Visualization Package for R

    Authors: Andreas Buchmüller, Gillian Kant, Christoph Weisser, Benjamin Säfken, Krisztina Kis-Katos, Thomas Kneib

    Abstract: We present Twitmo, a package that provides a broad range of methods to collect, pre-process, analyze and visualize geo-tagged Twitter data. Twitmo enables the user to collect geo-tagged Tweets from Twitter and provides a comprehensive and user-friendly toolbox to generate topic distributions from Latent Dirichlet Allocations (LDA), correlated topic models (CTM) and structural topic models (STM…

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 16 pages, 4 figures

    MSC Class: 68N30 (Primary) 62P25; 97K80 (Secondary)

  9. arXiv:2205.08594

    stat.ME stat.ML

    Bayesian Discrete Conditional Transformation Models

    Authors: Manuel Carlan, Thomas Kneib

    Abstract: We propose a novel Bayesian model framework for discrete ordinal and count data based on conditional transformations of the responses. The conditional transformation function is estimated from the data in conjunction with an a priori chosen reference distribution. For count responses, the resulting transformation model is novel in the sense that it is a Bayesian fully parametric yet distribution-f…

    Submitted 17 May, 2022; originally announced May 2022.

  10. arXiv:2204.00778

    stat.ML cs.LG

    Distributional Gradient Boosting Machines

    Authors: Alexander März, Thomas Kneib

    Abstract: We present a unified probabilistic gradient boosting framework for regression tasks that models and predicts the entire conditional distribution of a univariate response variable as a function of covariates. Our likelihood-based approach allows us to either model all conditional moments of a parametric distribution, or to approximate the conditional cumulative distribution function via Normalizing…

    Submitted 2 April, 2022; originally announced April 2022.

    Comments: Distributional Regression, LightGBM, Normalizing Flow, Probabilistic Forecasting, XGBoost

  11. arXiv:2111.14207

    stat.ME

    Using the Softplus Function to Construct Alternative Link Functions in Generalized Linear Models and Beyond

    Authors: Paul F. V. Wiemann, Thomas Kneib, Julien Hambuckers

    Abstract: Response functions linking regression predictors to properties of the response distribution are fundamental components in many statistical models. However, the choice of these functions is typically based on the domain of the modeled quantities and is not further scrutinized. For example, the exponential response function is usually assumed for parameters restricted to be positive although it impl…

    Submitted 28 November, 2021; originally announced November 2021.

  12. arXiv:2106.03737

    stat.ME

    A multivariate Gaussian random field prior against spatial confounding

    Authors: Isa Marques, Thomas Kneib, Nadja Klein

    Abstract: Spatial models are used in a variety of research areas, such as environmental sciences, epidemiology, or physics. A common phenomenon in many spatial regression models is spatial confounding. This phenomenon takes place when spatially indexed covariates modeling the mean of the response are correlated with the spatial random effect. As a result, estimates for regression coefficients of the covariates…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Submitted to Environmetrics and currently under review

  13. arXiv:2105.08686

    stat.ME stat.AP

    Flexible Bayesian Modeling of Counts: Constructing Penalized Complexity Priors

    Authors: Mahsa Nadifar, Hossein Baghishani, Thomas Kneib, Afshin Fallah

    Abstract: Much data, particularly in medicine and disease mapping, consists of counts. Under- or overdispersion in count data undermines the performance of the classical Poisson model. To account for this problem, in this paper we introduce a new Bayesian structured additive regression model, called gamma count, with enough flexibility in modeling dispersion. Setting convenient prio…

    Submitted 18 May, 2021; originally announced May 2021.

  14. arXiv:2101.05630

    stat.ME

    Adaptive shrinkage of smooth functional effects towards a predefined functional subspace

    Authors: Paul Wiemann, Thomas Kneib

    Abstract: In this paper, we propose a new horseshoe-type prior hierarchy for adaptively shrinking spline-based functional effects towards a predefined vector space of parametric functions. Instead of shrinking each spline coefficient towards zero, we use an adapted horseshoe prior to control the deviation from the predefined vector space. For this purpose, the modified horseshoe prior is set up with one sca…

    Submitted 14 January, 2021; originally announced January 2021.

  15. arXiv:2012.11016

    stat.ME

    Bayesian Conditional Transformation Models

    Authors: Manuel Carlan, Thomas Kneib, Nadja Klein

    Abstract: Recent developments in statistical regression methodology shift away from pure mean regression towards distributional regression models. One important strand thereof is that of conditional transformation models (CTMs). CTMs infer the entire conditional distribution directly by applying a transformation function to the response conditionally on a set of covariates towards a simple log-concave refer…

    Submitted 21 May, 2022; v1 submitted 20 December, 2020; originally announced December 2020.

  16. arXiv:2009.03646

    stat.ME stat.AP

    Beyond unidimensional poverty analysis using distributional copula models for mixed ordered-continuous outcomes

    Authors: Maike Hohberg, Francesco Donat, Giampiero Marra, Thomas Kneib

    Abstract: Poverty is a multidimensional concept often comprising a monetary outcome and other welfare dimensions such as education, subjective well-being or health, that are measured on an ordinal scale. In applied research, multidimensional poverty is ubiquitously assessed by studying each poverty dimension independently in univariate regression models or by combining several poverty dimensions into a scal…

    Submitted 8 September, 2020; originally announced September 2020.

  17. arXiv:2006.03459

    stat.ME stat.CO

    Analytic expressions for the Cumulative Distribution Function of the Composed Error Term in Stochastic Frontier Analysis with Truncated Normal and Exponential Inefficiencies

    Authors: Rouven Schmidt, Thomas Kneib

    Abstract: In the stochastic frontier model, the composed error term consists of the measurement error and the inefficiency term. A general assumption is that the inefficiency term follows a truncated normal or exponential distribution. In a wide variety of models evaluating the cumulative distribution function of the composed error term is required. This work introduces and proves four representation theore…

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: 16 pages, 2 figures

  18. arXiv:1910.08599

    stat.ME

    Noncrossing structured additive multiple-output Bayesian quantile regression models

    Authors: Bruno Santos, Thomas Kneib

    Abstract: Quantile regression models are a powerful tool for studying different points of the conditional distribution of univariate response variables. Their multivariate counterpart extension though is not straightforward, starting with the definition of multivariate quantiles. We propose here a flexible Bayesian quantile regression model when the response variable is multivariate, where we are able to de…

    Submitted 18 October, 2019; originally announced October 2019.

  19. arXiv:1908.00823

    stat.AP stat.ME

    Generalised Joint Regression for Count Data with a Focus on Modelling Football Matches

    Authors: Hendrik van der Wurp, Andreas Groll, Thomas Kneib, Giampiero Marra, Rosalba Radice

    Abstract: We propose a versatile joint regression framework for count responses. The method is implemented in the R add-on package GJRM and allows for modelling linear and non-linear dependence through the use of several copulae. Moreover, the parameters of the marginal distributions of the count responses and of the copula can be specified as flexible functions of covariates. Motivated by a football applic…

    Submitted 21 August, 2019; v1 submitted 2 August, 2019; originally announced August 2019.

  20. Multivariate Conditional Transformation Models

    Authors: Nadja Klein, Torsten Hothorn, Luisa Barbanti, Thomas Kneib

    Abstract: Regression models describing the joint distribution of multivariate response variables conditional on covariate information have become an important aspect of contemporary regression analysis. However, a limitation of such models is that they often rely on rather simplistic assumptions, e.g. a constant dependency structure that is not allowed to vary with the covariates or the restriction to linea…

    Submitted 3 September, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Journal ref: Scandinavian Journal of Statistics (2022); 49: 116-142

  21. Bayesian Effect Selection in Structured Additive Distributional Regression Models

    Authors: Nadja Klein, Manuel Carlan, Thomas Kneib, Stefan Lang, Helga Wagner

    Abstract: We propose a novel spike and slab prior specification with scaled beta prime marginals for the importance parameters of regression coefficients to allow for general effect selection within the class of structured additive distributional regression. This enables us to model effects on all distributional parameters for arbitrary parametric distributions, and to consider various effect types such as…

    Submitted 27 February, 2019; originally announced February 2019.

    Journal ref: Bayesian Anal., Advance publication (2020), 29 pages

  22. arXiv:1806.09386

    stat.AP

    Treatment effects beyond the mean using GAMLSS

    Authors: Maike Hohberg, Peter Pütz, Thomas Kneib

    Abstract: This paper introduces distributional regression, also known as generalized additive models for location, scale and shape (GAMLSS), as a modeling framework for analyzing treatment effects beyond the mean. By relating each parameter of the response distribution to explanatory variables, GAMLSS model the treatment effect on the whole conditional distribution. Additionally, any nonnormal outcome and n…

    Submitted 28 March, 2019; v1 submitted 25 June, 2018; originally announced June 2018.

  23. arXiv:1806.03729

    stat.ME math.ST stat.AP

    Lost in translation: On the impact of data coding on penalized regression with interactions

    Authors: Johannes W R Martini, Francisco Rosales, Ngoc-Thuy Ha, Thomas Kneib, Johannes Heise, Valentin Wimmer

    Abstract: Penalized regression approaches are standard tools in quantitative genetics. It is known that the fit of an ordinary least squares (OLS) regression is independent of certain transformations of the coding of the predictor variables, and that the standard mixed model ridge regression best linear unbiased prediction (RRBLUP) is neither affected by translations of the variable coding, no…

    Submitted 10 June, 2018; originally announced June 2018.

    MSC Class: 62J05 62J07

    Journal ref: G3 (2019) https://doi.org/10.1534/g3.118.200961

  24. arXiv:1803.05664

    stat.CO stat.AP

    Conditional Model Selection in Mixed-Effects Models with cAIC4

    Authors: Benjamin Säfken, David Rügamer, Thomas Kneib, Sonja Greven

    Abstract: Model selection in mixed models based on the conditional distribution is appropriate for many practical applications and has been a focus of recent statistical research. In this paper we introduce the R-package cAIC4 that allows for the computation of the conditional Akaike Information Criterion (cAIC). Computation of the conditional AIC needs to take into account the uncertainty of the random eff…

    Submitted 17 March, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

  25. arXiv:1711.10786

    stat.AP

    Bayesian Measurement Error Correction in Structured Additive Distributional Regression with an Application to the Analysis of Sensor Data on Soil-Plant Variability

    Authors: Alessio Pollice, Giovanna Jona Lasinio, Roberta Rossi, Mariana Amato, Thomas Kneib, Stefan Lang

    Abstract: The flexibility of the Bayesian approach to account for covariates with measurement error is combined with semiparametric regression models for a class of continuous, discrete and mixed univariate response distributions with potentially all parameters depending on a structured additive predictor. Markov chain Monte Carlo enables a modular and numerically efficient implementation of Bayesian measur…

    Submitted 29 November, 2017; originally announced November 2017.

  26. arXiv:1710.02385

    stat.ME stat.CO

    Gradient boosting in Markov-switching generalized additive models for location, scale and shape

    Authors: Timo Adam, Andreas Mayr, Thomas Kneib

    Abstract: We propose a novel class of flexible latent-state time series regression models which we call Markov-switching generalized additive models for location, scale and shape. In contrast to conventional Markov-switching regression models, the presented methodology allows us to model different state-dependent parameters of the response distribution - not only the mean, but also variance, skewness and ku…

    Submitted 17 May, 2018; v1 submitted 6 October, 2017; originally announced October 2017.

  27. arXiv:1609.02686

    stat.ML stat.ME

    Boosting Joint Models for Longitudinal and Time-to-Event Data

    Authors: Elisabeth Waldmann, David Taylor-Robinson, Nadja Klein, Thomas Kneib, Tania Pressler, Matthias Schmid, Andreas Mayr

    Abstract: Joint Models for longitudinal and time-to-event data have gained a lot of attention in the last few years as they are a helpful technique to approach a common data structure in clinical studies where longitudinal outcomes are recorded alongside event times. Those two processes are often linked and the two outcomes should thus be modeled jointly in order to prevent the potential bias introduced by…

    Submitted 22 December, 2016; v1 submitted 9 September, 2016; originally announced September 2016.

  28. Bayesian structured additive distributional regression with an application to regional income inequality in Germany

    Authors: Nadja Klein, Thomas Kneib, Stefan Lang, Alexander Sohn

    Abstract: We propose a generic Bayesian framework for inference in distributional regression models in which each parameter of a potentially complex response distribution and not only the mean is related to a structured additive predictor. The latter is composed additively of a variety of different functional effect types such as nonlinear effects, spatial effects, random coefficients, interaction surfaces…

    Submitted 17 September, 2015; originally announced September 2015.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS823 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS823

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 2, 1024-1052

  29. arXiv:1406.3774

    stat.ME stat.CO

    Markov-switching generalized additive models

    Authors: Roland Langrock, Thomas Kneib, Richard Glennie, Théo Michelot

    Abstract: We consider Markov-switching regression models, i.e. models for time series regression analyses where the functional relationship between covariates and response is subject to regime switching controlled by an unobservable Markov chain. Building on the powerful hidden Markov model machinery and the methods for penalized B-splines routinely used in regression analyses, we develop a framework for no…

    Submitted 10 May, 2015; v1 submitted 14 June, 2014; originally announced June 2014.

  30. A Unified Framework of Constrained Regression

    Authors: Benjamin Hofner, Thomas Kneib, Torsten Hothorn

    Abstract: Generalized additive models (GAMs) play an important role in modeling and understanding complex relationships in modern applied statistics. They allow for flexible, data-driven estimation of covariate effects. Yet researchers often have a priori knowledge of certain effects, which might be monotonic or periodic (cyclic) or should fulfill boundary conditions. We propose a unified framework to incor…

    Submitted 7 November, 2014; v1 submitted 27 March, 2014; originally announced March 2014.

    Comments: This is a preliminary version of the manuscript. The final publication is available at http://link.springer.com/article/10.1007/s11222-014-9520-y

  31. arXiv:1312.5054

    stat.ME

    Bayesian Geoadditive Expectile Regression

    Authors: Elisabeth Waldmann, Fabian Sobotka, Thomas Kneib

    Abstract: Regression classes modeling more than the mean of the response have found a lot of attention in the last years. Expectile regression is a special and computationally convenient case of this family of models. Expectiles offer a quantile-like characterisation of a complete distribution and include the mean as a special case. In the frequentist framework the impact of a lot of covariates with very di…

    Submitted 18 December, 2013; originally announced December 2013.

  32. arXiv:1311.1039

    stat.AP q-bio.QM stat.ME

    Maximum penalized likelihood estimation in semiparametric capture-recapture models

    Authors: Théo Michelot, Roland Langrock, Thomas Kneib, Ruth King

    Abstract: We discuss the semiparametric modeling of mark-recapture-recovery data where the temporal and/or individual variation of model parameters is explained via covariates. Typically, in such analyses a fixed (or mixed) effects parametric model is specified for the relationship between the model parameters and the covariates of interest. In this paper, we discuss the modeling of the relationship via the…

    Submitted 20 May, 2015; v1 submitted 5 November, 2013; originally announced November 2013.

  33. arXiv:1309.0423

    stat.ME

    Nonparametric inference in hidden Markov models using P-splines

    Authors: Roland Langrock, Thomas Kneib, Alexander Sohn, Stacy DeRuiter

    Abstract: Hidden Markov models (HMMs) are flexible time series models in which the distributions of the observations depend on unobserved serially correlated states. The state-dependent distributions in HMMs are usually taken from some class of parametrically specified distributions. The choice of this class can be difficult, and an unfortunate choice can have serious consequences for example on state estim…

    Submitted 17 June, 2014; v1 submitted 2 September, 2013; originally announced September 2013.

  34. arXiv:1308.5836

    stat.ME q-fin.ST

    Semiparametric stochastic volatility modelling using penalized splines

    Authors: Roland Langrock, Théo Michelot, Alexander Sohn, Thomas Kneib

    Abstract: Stochastic volatility (SV) models mimic many of the stylized facts attributed to time series of asset returns, while maintaining conceptual simplicity. The commonly made assumption of conditionally normally distributed or Student-t-distributed returns, given the volatility, has however been questioned. In this manuscript, we introduce a novel maximum penalized likelihood approach for estimating th…

    Submitted 17 June, 2014; v1 submitted 27 August, 2013; originally announced August 2013.

  35. arXiv:1303.0670

    stat.ME

    Penalized Likelihood and Bayesian Function Selection in Regression Models

    Authors: Fabian Scheipl, Thomas Kneib, Ludwig Fahrmeir

    Abstract: Challenging research in various fields has driven a wide range of methodological advances in variable selection for regression models with high-dimensional predictors. In comparison, selection of nonlinear functions in models with additive predictors has been considered only more recently. Several competing suggestions have been developed at about the same time and often do not refer to each other…

    Submitted 4 March, 2013; originally announced March 2013.

  36. Conditional Transformation Models

    Authors: Torsten Hothorn, Thomas Kneib, Peter Bühlmann

    Abstract: The ultimate goal of regression analysis is to obtain information about the conditional distribution of a response given a set of explanatory variables. This goal is, however, seldom achieved because most established regression models only estimate the conditional mean as a function of the explanatory variables and assume that higher moments are not affected by the regressors. The underlying reaso…

    Submitted 28 November, 2012; v1 submitted 27 January, 2012; originally announced January 2012.

    MSC Class: 62H12; 62G08; 62J02; 62J07

    Journal ref: Journal of the Royal Statistical Society, Series B (Methodology), 2014

  37. Spike-and-Slab Priors for Function Selection in Structured Additive Regression Models

    Authors: Fabian Scheipl, Ludwig Fahrmeir, Thomas Kneib

    Abstract: Structured additive regression provides a general framework for complex Gaussian and non-Gaussian regression models, with predictors comprising arbitrary combinations of nonlinear functions and surfaces, spatial effects, varying coefficients, random effects and further regression terms. The large flexibility of structured additive regression makes function selection a challenging and important tas…

    Submitted 2 December, 2011; v1 submitted 26 May, 2011; originally announced May 2011.

    Journal ref: Journal of the American Statistical Association (2012), 107:500, pages 1518--1532