Showing 1–22 of 22 results for author: Bölcskei, H

Searching in archive math.

  1. arXiv:2407.01250  [pdf, other]

    cs.LG cs.IT math.DS

    Metric-Entropy Limits on Nonlinear Dynamical System Learning

    Authors: Yang Pan, Clemens Hutter, Helmut Bölcskei

    Abstract: This paper is concerned with the fundamental limits of nonlinear dynamical system learning from input-output traces. Specifically, we show that recurrent neural networks (RNNs) are capable of learning nonlinear systems that satisfy a Lipschitz property and forget past inputs fast enough in a metric-entropy optimal manner. As the sets of sequence-to-sequence maps realized by the dynamical systems w…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.05556  [pdf, ps, other]

    math.FA

    Entropy of Compact Operators with Applications to Landau-Pollak-Slepian Theory and Sobolev Spaces

    Authors: Thomas Allard, Helmut Bölcskei

    Abstract: We derive a precise general relation between the entropy of a compact operator and its eigenvalues. It is then shown how this result along with the underlying philosophy can be applied to improve substantially on the best known characterizations of the entropy of the Landau-Pollak-Slepian operator and the metric entropy of unit balls in Sobolev spaces.

    Submitted 8 June, 2024; originally announced June 2024.

  3. arXiv:2405.11066  [pdf, other]

    math.FA math.CV

    Ellipsoid Methods for Metric Entropy Computation

    Authors: Thomas Allard, Helmut Bölcskei

    Abstract: We present a new methodology for the characterization of the metric entropy of infinite-dimensional ellipsoids with exponentially decaying semi-axes. This procedure does not rely on the explicit construction of coverings or packings and provides a unified framework for the derivation of the metric entropy of a wide variety of analytic function classes, such as periodic functions analytic on a stri…

    Submitted 17 May, 2024; originally announced May 2024.

    MSC Class: 41A46; 30E10

  4. arXiv:2211.15466  [pdf, other]

    math.DS cs.IT

    Metric entropy of causal, discrete-time LTI systems

    Authors: Clemens Hutter, Thomas Allard, Helmut Bölcskei

    Abstract: In [1] it is shown that recurrent neural networks (RNNs) can learn - in a metric entropy optimal manner - discrete time, linear time-invariant (LTI) systems. This is effected by comparing the number of bits needed to encode the approximating RNN to the metric entropy of the class of LTI systems under consideration [2, 3]. The purpose of this note is to provide an elementary self-contained proof of…

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: [1] arXiv:2105.02556

  5. arXiv:2111.12312  [pdf, ps, other]

    math.PR cs.IT

    Lossy Compression of General Random Variables

    Authors: Erwin Riegler, Helmut Bölcskei, Günther Koliander

    Abstract: This paper is concerned with the lossy compression of general random variables, specifically with rate-distortion theory and quantization of random variables taking values in general measurable spaces such as, e.g., manifolds and fractal sets. Manifold structures are prevalent in data science, e.g., in compressed sensing, machine learning, image processing, and handwritten digit recognition. Fract…

    Submitted 2 June, 2023; v1 submitted 24 November, 2021; originally announced November 2021.

  6. arXiv:2105.02556  [pdf, other]

    cs.LG cs.IT math.DS

    Metric Entropy Limits on Recurrent Neural Network Learning of Linear Dynamical Systems

    Authors: Clemens Hutter, Recep Gül, Helmut Bölcskei

    Abstract: One of the most influential results in neural network theory is the universal approximation theorem [1, 2, 3] which states that continuous functions can be approximated to within arbitrary accuracy by single-hidden-layer feedforward neural networks. The purpose of this paper is to establish a result in this spirit for the approximation of general discrete-time linear dynamical systems - including…

    Submitted 15 December, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: 28 pages

  7. arXiv:2101.09341  [pdf, ps, other]

    cs.IT eess.SY math.FA

    Beurling-type density criteria for system identification

    Authors: V. Vlačić, C. Aubel, H. Bölcskei

    Abstract: This paper addresses the problem of identifying a linear time-varying (LTV) system characterized by a (possibly infinite) discrete set of delay-Doppler shifts without a lattice (or other geometry-discretizing) constraint on the support set. Concretely, we show that a class of such LTV systems is identifiable whenever the upper uniform Beurling density of the delay-Doppler support sets, measured un…

    Submitted 22 January, 2021; originally announced January 2021.

  8. arXiv:1906.06994  [pdf, other]

    math.CO cs.AI cs.IT math.CV stat.ML

    Neural network identifiability for a family of sigmoidal nonlinearities

    Authors: Verner Vlačić, Helmut Bölcskei

    Abstract: This paper addresses the following question of neural network identifiability: Does the input-output map realized by a feed-forward neural network with respect to a given nonlinearity uniquely specify the network architecture, weights, and biases? Existing literature on the subject (Sussman 1992; Albertini, Sontag et al. 1993; Fefferman 1994) suggests that the answer should be yes, up to certain sym…

    Submitted 2 September, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 43 pages, 11 figures

  9. arXiv:1803.06887  [pdf, ps, other]

    math.FA cs.IT

    Lossless Analog Compression

    Authors: Giovanni Alberti, Helmut Bölcskei, Camillo De Lellis, Günther Koliander, Erwin Riegler

    Abstract: We establish the fundamental limits of lossless analog compression by considering the recovery of arbitrary m-dimensional real random vectors x from the noiseless linear measurements y=Ax with n x m measurement matrix A. Our theory is inspired by the groundbreaking work of Wu and Verdu (2010) on almost lossless analog compression, but applies to the nonasymptotic, i.e., fixed-m case, and considers…

    Submitted 17 July, 2019; v1 submitted 19 March, 2018; originally announced March 2018.

  10. arXiv:1707.02711  [pdf, ps, other]

    stat.ML cs.CV cs.IT cs.LG math.FA

    Topology Reduction in Deep Convolutional Feature Extraction Networks

    Authors: Thomas Wiatowski, Philipp Grohs, Helmut Bölcskei

    Abstract: Deep convolutional neural networks (CNNs) used in practice employ potentially hundreds of layers and $10{,}000$s of nodes. Such network sizes entail significant computational complexity due to the large number of convolutions that need to be carried out; in addition, a large number of parameters needs to be learned and stored. Very deep and wide CNNs may therefore not be well suited to application…

    Submitted 14 March, 2018; v1 submitted 10 July, 2017; originally announced July 2017.

    Comments: Corrected errors in arguments on spectral decay of Sobolev functions. Replaced part of the decay results (Sections 5-7) by corresponding statements for effectively band-limited functions

    Journal ref: Proc. of SPIE (Wavelets and Sparsity XVII), San Diego, USA, Vol. 10394, pp. 1039418:1-1039418:12, Aug. 2017, (invited paper)

  11. arXiv:1705.01714  [pdf, other]

    cs.LG cs.IT math.FA

    Optimal Approximation with Sparsely Connected Deep Neural Networks

    Authors: Helmut Bölcskei, Philipp Grohs, Gitta Kutyniok, Philipp Petersen

    Abstract: We derive fundamental lower bounds on the connectivity and the memory requirements of deep neural networks guaranteeing uniform approximation rates for arbitrary function classes in $L^2(\mathbb R^d)$. In other words, we establish a connection between the complexity of a function class and the complexity of deep neural networks approximating functions from this class to within a prescribed accurac…

    Submitted 16 May, 2018; v1 submitted 4 May, 2017; originally announced May 2017.

    MSC Class: 41A25; 82C32; 42C40; 42C15; 41A46; 68T05; 94A34; 94A12

  12. arXiv:1704.03636  [pdf, other]

    cs.IT cs.LG math.FA stat.ML

    Energy Propagation in Deep Convolutional Neural Networks

    Authors: Thomas Wiatowski, Philipp Grohs, Helmut Bölcskei

    Abstract: Many practical machine learning tasks employ very deep convolutional neural networks. Such large depths pose formidable computational challenges in training and operating the network. It is therefore important to understand how fast the energy contained in the propagated signals (a.k.a. feature maps) decays across layers. In addition, it is desirable that the feature extractor generated by the net…

    Submitted 1 February, 2018; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: Corrected errors in arguments on the spectral decay of Sobolev functions and on the volume of tubes, IEEE Transactions on Information Theory, 2018

  13. arXiv:1701.02538  [pdf, other]

    cs.IT math.FA math.NA math.NT

    Vandermonde Matrices with Nodes in the Unit Disk and the Large Sieve

    Authors: Céline Aubel, Helmut Bölcskei

    Abstract: We derive bounds on the extremal singular values and the condition number of NxK, with N>=K, Vandermonde matrices with nodes in the unit disk. The mathematical techniques we develop to prove our main results are inspired by a link---first established by Selberg [1] and later extended by Moitra [2]---between the extremal singular values of Vandermonde matrices with nodes on the unit circle and l…

    Submitted 3 August, 2017; v1 submitted 10 January, 2017; originally announced January 2017.

    Comments: 45 pages, 2 figures, accepted for publication in Applied and Computational Harmonic Analysis

    MSC Class: 15A12; 65F35

  14. arXiv:1605.00031  [pdf, other]

    cs.LG cs.CV math.NA stat.ML

    Deep Convolutional Neural Networks on Cartoon Functions

    Authors: Philipp Grohs, Thomas Wiatowski, Helmut Bölcskei

    Abstract: Wiatowski and Bölcskei, 2015, proved that deformation stability and vertical translation invariance of deep convolutional neural network-based feature extractors are guaranteed by the network structure per se rather than the specific convolution kernels and non-linearities. While the translation invariance result applies to square-integrable functions, the deformation stability bound holds for ban…

    Submitted 12 February, 2018; v1 submitted 29 April, 2016; originally announced May 2016.

    Comments: This is a slightly updated version of the paper published in the ISIT proceedings. Specifically, we corrected errors in the arguments on the volume of tubes. Note that this correction does not affect the main statements of the paper

    Journal ref: Proc. of IEEE International Symposium on Information Theory (ISIT), Barcelona, Spain, pp. 1163-1167, July 2016

  15. arXiv:1512.06293  [pdf, other]

    cs.IT cs.AI cs.LG math.FA stat.ML

    A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction

    Authors: Thomas Wiatowski, Helmut Bölcskei

    Abstract: Deep convolutional neural networks have led to breakthrough results in numerous practical machine learning tasks such as classification of images in the ImageNet data set, control-policy-learning to play Atari games or the board game Go, and image captioning. Many of these applications first perform feature extraction and then feed the results thereof into a trainable classifier. The mathematical…

    Submitted 24 October, 2017; v1 submitted 19 December, 2015; originally announced December 2015.

    Comments: IEEE Transactions on Information Theory, to appear

  16. arXiv:1504.05487  [pdf, ps, other]

    cs.LG cs.IT math.FA stat.ML

    Deep Convolutional Neural Networks Based on Semi-Discrete Frames

    Authors: Thomas Wiatowski, Helmut Bölcskei

    Abstract: Deep convolutional neural networks have led to breakthrough results in practical feature extraction applications. The mathematical analysis of these networks was pioneered by Mallat, 2012. Specifically, Mallat considered so-called scattering networks based on identical semi-discrete wavelet frames in each network layer, and proved translation-invariance as well as deformation stability of the resu…

    Submitted 21 April, 2015; originally announced April 2015.

    Comments: Proc. of IEEE International Symposium on Information Theory (ISIT), Hong Kong, China, June 2015, to appear

    Journal ref: Proc. of IEEE International Symposium on Information Theory (ISIT), Hong Kong, China, pp. 1212-1216, June 2015

  17. arXiv:1504.05036  [pdf, ps, other]

    cs.IT math.FA

    Density Criteria for the Identification of Linear Time-Varying Systems

    Authors: Céline Aubel, Helmut Bölcskei

    Abstract: This paper addresses the problem of identifying a linear time-varying (LTV) system characterized by a (possibly infinite) discrete set of delays and Doppler shifts. We prove that stable identifiability is possible if the upper uniform Beurling density of the delay-Doppler support set is strictly smaller than 1/2 and stable identifiability is impossible for densities strictly larger than 1/2. The p…

    Submitted 20 April, 2015; originally announced April 2015.

    Comments: IEEE International Symposium on Information Theory (ISIT), Hong Kong, China, June 2015

  18. arXiv:1305.3486  [pdf, ps, other]

    cs.IT cs.LG math.ST stat.ML

    Noisy Subspace Clustering via Thresholding

    Authors: Reinhard Heckel, Helmut Bölcskei

    Abstract: We consider the problem of clustering noisy high-dimensional data points into a union of low-dimensional subspaces and a set of outliers. The number of subspaces, their dimensions, and their orientations are unknown. A probabilistic performance analysis of the thresholding-based subspace clustering (TSC) algorithm introduced recently in [1] shows that TSC succeeds in the noisy case, even when the…

    Submitted 18 July, 2013; v1 submitted 15 May, 2013; originally announced May 2013.

    Comments: Presented at the IEEE Int. Symp. Inf. Theory (ISIT) 2013, Istanbul, Turkey. The version posted here corrects a minor error in the published version. Specifically, the exponent -c n_l in the success probability of Theorem 1 and in the corresponding proof outline has been corrected to -c(n_l-1)

  19. arXiv:1303.3716  [pdf, ps, other]

    cs.IT cs.LG math.ST stat.ML

    Subspace Clustering via Thresholding and Spectral Clustering

    Authors: Reinhard Heckel, Helmut Bölcskei

    Abstract: We consider the problem of clustering a set of high-dimensional data points into sets of low-dimensional linear subspaces. The number of subspaces, their dimensions, and their orientations are unknown. We propose a simple and low-complexity clustering algorithm based on thresholding the correlations between the data points followed by spectral clustering. A probabilistic performance analysis shows…

    Submitted 15 March, 2013; originally announced March 2013.

    Comments: ICASSP 2013

  20. Noncoherent SIMO Pre-Log via Resolution of Singularities

    Authors: Erwin Riegler, Veniamin I. Morgenshtern, Giuseppe Durisi, Shaowei Lin, Bernd Sturmfels, Helmut Bölcskei

    Abstract: We establish a lower bound on the noncoherent capacity pre-log of a temporally correlated Rayleigh block-fading single-input multiple-output (SIMO) channel. Our result holds for arbitrary rank Q of the channel correlation matrix, arbitrary block-length L > Q, and arbitrary number of receive antennas R, and includes the result in Morgenshtern et al. (2010) as a special case. It is well known that t…

    Submitted 30 May, 2011; originally announced May 2011.

    Comments: IEEE International Symposium on Information Theory 2011 (ISIT 2011), Saint Petersburg, Russia, to appear

  21. arXiv:0905.1215  [pdf, other]

    cs.IT cs.CC math.ST

    Tail Behavior of Sphere-Decoding Complexity in Random Lattices

    Authors: Dominik Seethaler, Joakim Jaldén, Christoph Studer, Helmut Bölcskei

    Abstract: We analyze the (computational) complexity distribution of sphere-decoding (SD) for random infinite lattices. In particular, we show that under fairly general assumptions on the statistics of the lattice basis matrix, the tail behavior of the SD complexity distribution is solely determined by the inverse volume of a fundamental region of the underlying lattice. Particularizing this result to NxM,…

    Submitted 8 May, 2009; originally announced May 2009.

    Comments: To be presented at IEEE ISIT 2009, Seoul, Korea

    ACM Class: C.2.1; B.7.1; F.2; I.1.2

  22. arXiv:math/0108096  [pdf, ps, other]

    math.FA cs.IT math.GR

    Geometrically Uniform Frames

    Authors: Yonina C. Eldar, H. Bolcskei

    Abstract: We introduce a new class of frames with strong symmetry properties called geometrically uniform frames (GU), that are defined over an abelian group of unitary matrices and are generated by a single generating vector. The notion of GU frames is then extended to compound GU (CGU) frames which are generated by an abelian group of unitary matrices using multiple generating vectors. The dual frame…

    Submitted 13 August, 2001; originally announced August 2001.

    Comments: Submitted to IEEE Transactions on Information Theory. LaTeX, 43 pages

    Journal ref: IEEE Trans. Inform. Theory, vol. 49, pp. 993-1006, Apr. 2003.