Skip to main content

Showing 1–15 of 15 results for author: Kümmerle, C

.
  1. arXiv:2406.19391  [pdf, other

    cs.CV

    Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

    Authors: Ali Khaleghi Rahimian, Manish Kumar Govind, Subhajit Maity, Dominick Reilly, Christian Kümmerle, Srijan Das, Aritra Dutta

    Abstract: Visual perception tasks are predominantly solved by Vision Transformer (ViT) architectures, which, despite their effectiveness, encounter a computational bottleneck due to the quadratic complexity of computing self-attention. This inefficiency is largely due to the self-attention heads capturing redundant token interactions, reflecting inherent redundancy within visual data. Many works have aimed… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: The code is publicly available at https://github.com/Charlotte-CharMLab/Fibottention

  2. arXiv:2405.15903  [pdf, other

    cs.LG

    UnitNorm: Rethinking Normalization for Transformers in Time Series

    Authors: Nan Huang, Christian Kümmerle, Xiang Zhang

    Abstract: Normalization techniques are crucial for enhancing Transformer models' performance and stability in time series analysis tasks, yet traditional methods like batch and layer normalization often lead to issues such as token shift, attention shift, and sparse attention. We propose UnitNorm, a novel approach that scales input vectors by their norms and modulates attention patterns, effectively circumv… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2306.04961  [pdf, other

    cs.LG cs.IT math.OC

    Recovering Simultaneously Structured Data via Non-Convex Iteratively Reweighted Least Squares

    Authors: Christian Kümmerle, Johannes Maly

    Abstract: We propose a new algorithm for the problem of recovering data that adheres to multiple, heterogeneous low-dimensional structures from linear observations. Focusing on data matrices that are simultaneously row-sparse and low-rank, we propose and analyze an iteratively reweighted least squares (IRLS) algorithm that is able to leverage both structures. In particular, it optimizes a combination of non… ▽ More

    Submitted 18 January, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 35 pages, 7 figures

  4. arXiv:2212.00746  [pdf, other

    cs.IT cs.LG math.OC stat.ML

    Learning Transition Operators From Sparse Space-Time Samples

    Authors: Christian Kümmerle, Mauro Maggioni, Sui Tang

    Abstract: We consider the nonlinear inverse problem of learning a transition operator $\mathbf{A}$ from partial observations at different times, in particular from sparse observations of entries of its powers $\mathbf{A},\mathbf{A}^2,\cdots,\mathbf{A}^{T}$. This Spatio-Temporal Transition Operator Recovery problem is motivated by the recent interest in learning time-varying graph signals that are driven by… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 34 pages, 12 figures

  5. arXiv:2208.11846  [pdf, other

    math.OC

    Global Linear and Local Superlinear Convergence of IRLS for Non-Smooth Robust Regression

    Authors: Liangzu Peng, Christian Kümmerle, René Vidal

    Abstract: We advance both the theory and practice of robust $\ell_p$-quasinorm regression for $p \in (0,1]$ by using novel variants of iteratively reweighted least-squares (IRLS) to solve the underlying non-smooth problem. In the convex case, $p=1$, we prove that this IRLS variant converges globally at a linear rate under a mild, deterministic condition on the feature matrix called the \textit{stable range… ▽ More

    Submitted 11 October, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: Accepted to NeurIPS 2022

  6. arXiv:2106.02119  [pdf, other

    math.OC math.NA stat.ML

    A Scalable Second Order Method for Ill-Conditioned Matrix Completion from Few Samples

    Authors: Christian Kümmerle, Claudio Mayrink Verdun

    Abstract: We propose an iterative algorithm for low-rank matrix completion that can be interpreted as an iteratively reweighted least squares (IRLS) algorithm, a saddle-esca** smoothing Newton method or a variable metric proximal gradient method applied to a non-convex rank surrogate. It combines the favorable data-efficiency of previous IRLS approaches with an improved scalability by several orders of ma… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 45 pages, 8 figures, to be published in ICML 2021. arXiv admin note: text overlap with arXiv:2009.02905

  7. arXiv:2101.08298  [pdf, ps, other

    cs.IT eess.SP math.NA math.PR

    Dictionary-Sparse Recovery From Heavy-Tailed Measurements

    Authors: Pedro Abdalla, Christian Kümmerle

    Abstract: The recovery of signals that are sparse not in a basis, but rather sparse with respect to an over-complete dictionary is one of the most flexible settings in the field of compressed sensing with numerous applications. As in the standard compressed sensing setting, it is possible that the signal can be reconstructed efficiently from few, linear measurements, for example by the so-called $\ell_1$-sy… ▽ More

    Submitted 29 September, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

    Comments: 24 pages

  8. arXiv:2012.12250  [pdf, ps, other

    math.OC cs.IT cs.LG math.NA

    Iteratively Reweighted Least Squares for Basis Pursuit with Global Linear Convergence Rate

    Authors: Christian Kümmerle, Claudio Mayrink Verdun, Dominik Stöger

    Abstract: The recovery of sparse data is at the core of many applications in machine learning and signal processing. While such problems can be tackled using $\ell_1$-regularization as in the LASSO estimator and in the Basis Pursuit approach, specialized algorithms are typically required to solve the corresponding high-dimensional non-smooth optimization for large instances. Iteratively Reweighted Least Squ… ▽ More

    Submitted 11 November, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: 26 pages, 3 figures

    Journal ref: NeurIPS 2021 (Spotlight)

  9. arXiv:2010.12402  [pdf, ps, other

    cs.IT

    On the robustness of noise-blind low-rank recovery from rank-one measurements

    Authors: Felix Krahmer, Christian Kümmerle, Oleh Melnyk

    Abstract: We prove new results about the robustness of well-known convex noise-blind optimization formulations for the reconstruction of low-rank matrices from underdetermined linear measurements. Our results are applicable for symmetric rank-one measurements as used in a formulation of the phase retrieval problem. We obtain these results by establishing that with high probability rank-one measurement ope… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 39 pages, 4 figures

  10. arXiv:2009.02905  [pdf, other

    math.OC cs.IT cs.LG math.ST

    Esca** Saddle Points in Ill-Conditioned Matrix Completion with a Scalable Second Order Method

    Authors: Christian Kümmerle, Claudio M. Verdun

    Abstract: We propose an iterative algorithm for low-rank matrix completion that can be interpreted as both an iteratively reweighted least squares (IRLS) algorithm and a saddle-esca** smoothing Newton method applied to a non-convex rank surrogate objective. It combines the favorable data efficiency of previous IRLS approaches with an improved scalability by several orders of magnitude. Our method attains… ▽ More

    Submitted 7 September, 2020; originally announced September 2020.

    Comments: 15 pages, presented at the Workshop on "Beyond first-order methods in ML systems" at the $37^th$ International Conference on Machine Learning (ICML), Vienna, Austria, 2020

  11. arXiv:1907.07258  [pdf, ps, other

    math.PR cs.IT

    On the geometry of polytopes generated by heavy-tailed random vectors

    Authors: Olivier Guédon, Felix Krahmer, Christian Kümmerle, Shahar Mendelson, Holger Rauhut

    Abstract: We study the geometry of centrally-symmetric random polytopes, generated by $N$ independent copies of a random vector $X$ taking values in $\mathbb{R}^n$. We show that under minimal assumptions on $X$, for $N \gtrsim n$ and with high probability, the polytope contains a deterministic set that is naturally associated with the random vector---namely, the polar of a certain floating body. This solves… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

    Comments: 23 pages

    MSC Class: 52A22; 46B06; 60B20; 65K10; 52A23; 46B09; 15B52

  12. arXiv:1901.05744  [pdf, ps, other

    cs.LG stat.ML

    The Oracle of DLphi

    Authors: Dominik Alfke, Weston Baines, Jan Blechschmidt, Mauricio J. del Razo Sarmina, Amnon Drory, Dennis Elbrächter, Nando Farchmin, Matteo Gambara, Silke Glas, Philipp Grohs, Peter Hinz, Danijel Kivaranovic, Christian Kümmerle, Gitta Kutyniok, Sebastian Lunz, Jan Macdonald, Ryan Malthaner, Gregory Naisat, Ariel Neufeld, Philipp Christian Petersen, Rafael Reisenhofer, Jun-Da Sheng, Laura Thesing, Philipp Trunschke, Johannes von Lindheim , et al. (2 additional authors not shown)

    Abstract: We present a novel technique based on deep learning and set theory which yields exceptional classification and prediction results. Having access to a sufficiently large amount of labelled training data, our methodology is capable of predicting the labels of the test data almost always even if the training data is entirely unrelated to the test data. In other words, we prove in a specific setting t… ▽ More

    Submitted 27 January, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

    MSC Class: 68T05; 82C32

  13. arXiv:1811.07472  [pdf, other

    math.OC cs.IT

    Denoising and Completion of Structured Low-Rank Matrices via Iteratively Reweighted Least Squares

    Authors: Christian Kümmerle, Claudio Mayrink Verdun

    Abstract: We propose a new Iteratively Reweighted Least Squares (IRLS) algorithm for the problem of completing or denoising low-rank matrices that are structured, e.g., that possess a Hankel, Toeplitz or block-Hankel/Toeplitz structure. The algorithm optimizes an objective based on a non-convex surrogate of the rank by solving a sequence of quadratic problems. Our strategy combines computational efficiency,… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.

    Comments: 3 pages, 2 figures, to appear in iTWIST'18

    Journal ref: In Proceedings of iTWIST'18, Paper-ID: 18, Marseille, France, November, 21-23, 2018

  14. arXiv:1806.04261  [pdf, other

    math.PR math.FA math.OC

    A Quotient Property for Matrices with Heavy-Tailed Entries and its Application to Noise-Blind Compressed Sensing

    Authors: Felix Krahmer, Christian Kümmerle, Holger Rauhut

    Abstract: For a large class of random matrices $A$ with i.i.d. entries we show that the $\ell_1$-quotient property holds with probability exponentially close to 1. In contrast to previous results, our analysis does not require concentration of the entrywise distributions. We provide a unified proof that recovers corresponding previous results for (sub-)Gaussian and Weibull distributions. Our findings genera… ▽ More

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: 22 pages, 2 figures

    MSC Class: 46B20; 46B09; 15A52; 65K10; 52A22

  15. arXiv:1703.05038  [pdf, other

    math.NA cs.IT math.OC

    Harmonic Mean Iteratively Reweighted Least Squares for Low-Rank Matrix Recovery

    Authors: Christian Kümmerle, Juliane Sigl

    Abstract: We propose a new iteratively reweighted least squares (IRLS) algorithm for the recovery of a matrix $X \in \mathbb{C}^{d_1\times d_2}$ of rank $r \ll\min(d_1,d_2)$ from incomplete linear observations, solving a sequence of low complexity linear problems. The easily implementable algorithm, which we call harmonic mean iteratively reweighted least squares (HM-IRLS), optimizes a non-convex Schatten-… ▽ More

    Submitted 27 February, 2018; v1 submitted 15 March, 2017; originally announced March 2017.

    Comments: 47 pages, 6 figures