Search | arXiv e-print repository

arXiv:2406.13691 [pdf, other]

Computationally efficient multi-level Gaussian process regression for functional data observed under completely or partially regular sampling designs

Authors: Adam Gorm Hoffmann, Claus Thorn Ekstrøm, Andreas Kryger Jensen

Abstract: Gaussian process regression is a frequently used statistical method for flexible yet fully probabilistic non-linear regression modeling. A common obstacle is its computational complexity which scales poorly with the number of observations. This is especially an issue when applying Gaussian process models to multiple functions simultaneously in various applications of functional data analysis. We consider a multi-level Gaussian process regression model where a common mean function and individual subject-specific deviations are modeled simultaneously as latent Gaussian processes. We derive exact analytic and computationally efficient expressions for the log-likelihood function and the posterior distributions in the case where the observations are sampled on either a completely or partially regular grid. This enables us to fit the model to large data sets that are currently computationally inaccessible using a standard implementation. We show through a simulation study that our analytic expressions are several orders of magnitude faster compared to a standard implementation, and we provide an implementation in the probabilistic programming language Stan.

Submitted 19 June, 2024; originally announced June 2024.

Comments: 48 pages, 3 figures

MSC Class: 62F15; 60G15; 62G08

ACM Class: G.3

arXiv:2312.09422 [pdf, other]

Joint Alignment of Multivariate Quasi-Periodic Functional Data Using Deep Learning

Authors: Vi Thanh Pham, Jonas Bille Nielsen, Klaus Fuglsang Kofoed, Jørgen Tobias Kühl, Andreas Kryger Jensen

Abstract: The joint alignment of multivariate functional data plays an important role in various fields such as signal processing, neuroscience and medicine, including the statistical analysis of data from wearable devices. Traditional methods often ignore the phase variability and instead focus on the variability in the observed amplitude. We present a novel method for joint alignment of multivariate quasi-periodic functions using deep neural networks, decomposing, but retaining all the information in the data by preserving both phase and amplitude variability. Our proposed neural network uses a special activation of the output that builds on the unit simplex transformation, and we utilize a loss function based on the Fisher-Rao metric to train our model. Furthermore, our method is unsupervised and can provide an optimal common template function as well as subject-specific templates. We demonstrate our method on two simulated datasets and one real example, comprising data from 12-lead 10s electrocardiogram recordings.

Submitted 14 November, 2023; originally announced December 2023.

Comments: 28 pages, 6 figures

arXiv:2305.10555 [pdf, other]

Sharp symbolic nonparametric bounds for measures of benefit in observational and imperfect randomized studies with ordinal outcomes

Authors: Erin E Gabriel, Michael C Sachs, Andreas Kryger Jensen

Abstract: The probability of benefit is a valuable and important measure of treatment effect, which has advantages over the average treatment effect. Particularly for an ordinal outcome, it has a better interpretation and can make apparent different aspects of the treatment impact. Unfortunately, this measure, and variations of it, are not identifiable even in randomized trials with perfect compliance. There is, for this reason, a long literature on nonparametric bounds for unidentifiable measures of benefit. These have primarily focused on perfect randomized trial settings and one or two specific estimands. We expand these bounds to observational settings with unmeasured confounders and imperfect randomized trials for all three estimands considered in the literature: the probability of benefit, the probability of no harm, and the relative treatment effect.

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2012.11915 [pdf, other]

Having a Ball: evaluating scoring streaks and game excitement using in-match trend estimation

Authors: Claus Thorn Ekstrøm, Andreas Kryger Jensen

Abstract: Many popular sports involve matches between two teams or players where each team has the possibility of scoring points throughout the match. While the overall match winner and result are interesting, they convey little information about the underlying scoring trends throughout the match. Modeling approaches that accommodate a finer granularity of the score difference throughout the match are needed to evaluate in-game strategies, discuss scoring streaks, team strengths, and other aspects of the game. We propose a latent Gaussian process to model the score difference between two teams and introduce the Trend Direction Index as an easily interpretable probabilistic measure of the current trend in the match as well as a measure of post-game trend evaluation. In addition, we propose the Excitement Trend Index - the expected number of monotonicity changes in the running score difference - as a measure of overall game excitement. Our proposed methodology is applied to all 1143 matches from the 2019-2020 National Basketball Association (NBA) season. We show how the trends can be interpreted in individual games and how the excitement score can be used to cluster teams according to how exciting they are to watch.

Submitted 22 December, 2020; originally announced December 2020.

arXiv:1912.11848 [pdf, other]

Quantifying the Trendiness of Trends

Authors: Andreas Kryger Jensen, Claus Thorn Ekstrøm

Abstract: News media often report that the trend of some public health outcome has changed. These statements are frequently based on longitudinal data, and the change in trend is typically found to have occurred at the most recent data collection time point - if no change had occurred the story is less likely to be reported. Such claims may potentially influence public health decisions on a national level. We propose two measures for quantifying the trendiness of trends. Assuming that reality evolves in continuous time we define what constitutes a trend and a change in trend, and introduce a probabilistic Trend Direction Index. This index has the interpretation of the probability that a latent characteristic has changed monotonicity at any given time conditional on observed data. We also define an index of Expected Trend Instability quantifying the expected number of changes in trend on an interval. Using a latent Gaussian Process model we show how the Trend Direction Index and the Expected Trend Instability can be estimated in a Bayesian framework and use the methods to analyze the proportion of smokers in Denmark during the last 20 years, and the development of new COVID-19 cases in Italy from February 24th onwards.

Submitted 3 October, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

arXiv:1910.12267 [pdf, other]

A novel high-power test for continuous outcomes truncated by death

Authors: Andreas Kryger Jensen, Theis Lange

Abstract: Patient-reported outcomes, including quality of life (QoL) assessments, are increasingly being included as either primary or secondary outcomes in randomized controlled trials. While this makes the outcomes more relevant for patients, it entails a challenge in cases where death or a similar event makes the outcome of interest undefined. A pragmatic - and much used - solution is to assign deceased patients the lowest possible QoL score. This makes medical sense, but creates a statistical problem since traditional tests such as t-tests or Wilcoxon tests potentially lose large amounts of statistical power. In this paper we propose a novel test that keeps the medically relevant composite outcome, but preserves full statistical power. The test is also applicable in other situations where a specific value (say 0 days alive outside hospitals) encodes a special meaning. The test is implemented in an R package which is available for download.

Submitted 27 October, 2019; originally announced October 2019.

arXiv:1808.05157 [pdf, other]

doi 10.1109/TIT.2019.2922627

Asymptotic majorization of finite probability distributions

Authors: Asger Kjærulff Jensen

Abstract: This paper studies majorization of high tensor powers of finitely supported probability distributions. Viewing probability distributions as a resource with majorization as a means of transformation corresponds to the resource theory of pure bipartite quantum states under LOCC transformations vis-à-vis Nielsen's Theorem. In [T. Fritz (2017)] a formula for the asymptotic exchange rate between any two finitely supported probability distributions was conjectured. The main result of the present paper is Theorem 3.11, which resolves this conjecture.

Submitted 15 August, 2018; originally announced August 2018.

arXiv:1807.05130 [pdf, other]

The asymptotic spectrum of LOCC transformations

Authors: Asger Kjærulff Jensen, Péter Vrana

Abstract: We study exact, non-deterministic conversion of multipartite pure quantum states into one another via local operations and classical communication (LOCC) and asymptotic entanglement transformation under such channels. In particular, we consider the maximal number of copies of any given target state that can be extracted exactly from many copies of any given initial state as a function of the exponential decay in success probability, known as the converse error exponent. We give a formula for the optimal rate presented as an infimum over the asymptotic spectrum of LOCC conversion. A full understanding of exact asymptotic extraction rates between pure states in the converse regime thus depends on a full understanding of this spectrum. We present a characterisation of spectral points and use it to describe the spectrum in the bipartite case. This leads to a full description of the spectrum and thus an explicit formula for the asymptotic extraction rate between pure bipartite states, given a converse error exponent. This extends the result on entanglement concentration in [Hayashi et al, 2003], where the target state is fixed as the Bell state. In the limit of vanishing converse error exponent the rate formula provides an upper bound on the exact asymptotic extraction rate between two states, when the probability of success goes to 1. In the bipartite case we prove that this bound holds with equality.

Submitted 16 August, 2018; v1 submitted 13 July, 2018; originally announced July 2018.

Comments: v1: 21 pages v2: 21 pages, Minor corrections v3: 17 pages, Minor corrections, new reference added, parts of Section 5 and the Appendix removed, the omitted material can be found in an extended form in arXiv:1808.05157

arXiv:1801.04852 [pdf, other]

doi 10.1137/18M1174829

Border rank is not multiplicative under the tensor product

Authors: Matthias Christandl, Fulvio Gesmundo, Asger Kjærulff Jensen

Abstract: It has recently been shown that the tensor rank can be strictly submultiplicative under the tensor product, where the tensor product of two tensors is a tensor whose order is the sum of the orders of the two factors. The necessary upper bounds were obtained with the help of border rank. It was left open whether border rank itself can be strictly submultiplicative. We answer this question in the affirmative. In order to do so, we construct lines in projective space along which the border rank drops multiple times and use this result in conjunction with a previous construction for a tensor rank drop. Our results also imply strict submultiplicativity for cactus rank and border cactus rank.

Submitted 1 May, 2019; v1 submitted 15 January, 2018; originally announced January 2018.

Comments: 25 pages, 1 figure - Revised version

MSC Class: 14M20; 15A69; 15A72

Journal ref: SIAM J. Appl. Algebra Geometry, 3(2) - 231-255 (2019)

arXiv:1705.09379 [pdf, other]

doi 10.1016/j.laa.2017.12.020

Tensor rank is not multiplicative under the tensor product

Authors: Matthias Christandl, Asger Kjærulff Jensen, Jeroen Zuiddam

Abstract: The tensor rank of a tensor t is the smallest number r such that t can be decomposed as a sum of r simple tensors. Let s be a k-tensor and let t be an l-tensor. The tensor product of s and t is a (k + l)-tensor. Tensor rank is sub-multiplicative under the tensor product. We revisit the connection between restrictions and degenerations. A result of our study is that tensor rank is not in general multiplicative under the tensor product. This answers a question of Draisma and Saptharishi. Specifically, if a tensor t has border rank strictly smaller than its rank, then the tensor rank of t is not multiplicative under taking a sufficiently high tensor product power. The "tensor Kronecker product" from algebraic complexity theory is related to our tensor product but different, namely it multiplies two k-tensors to get a k-tensor. Nonmultiplicativity of the tensor Kronecker product has been known since the work of Strassen. It remains an open question whether border rank and asymptotic rank are multiplicative under the tensor product. Interestingly, lower bounds on border rank obtained from generalised flattenings (including Young flattenings) multiply under the tensor product.

Submitted 29 September, 2022; v1 submitted 25 May, 2017; originally announced May 2017.

Comments: Fixed a typo in Remark 9

MSC Class: 15A69

Journal ref: Linear Algebra Appl. 543 (2018) 125-139

arXiv:1508.06803 [pdf, other]

Sequential rank agreement methods for comparison of ranked lists

Authors: Claus Thorn Ekstrøm, Thomas Alexander Gerds, Andreas Kryger Jensen, Kasper Brink-Jensen

Abstract: The comparison of alternative rankings of a set of items is a general and prominent task in applied statistics. Predictor variables are ranked according to magnitude of association with an outcome, prediction models rank subjects according to the personalized risk of an event, and genetic studies rank genes according to their difference in gene expression levels. This article constructs measures of the agreement of two or more ordered lists. We use the standard deviation of the ranks to define a measure of agreement that both provides an intuitive interpretation and can be applied to any number of lists even if some or all are incomplete or censored. The approach can identify change-points in the agreement of the lists and the sequential changes of agreement as a function of the depth of the lists can be compared graphically to a permutation-based reference set. The usefulness of these tools is illustrated using gene rankings, and using data from two Danish ovarian cancer studies where we assess the within and between agreement of different statistical classification methods.

Submitted 27 August, 2015; originally announced August 2015.

Showing 1–11 of 11 results for author: Jensen, A K