Skip to main content

Showing 1–6 of 6 results for author: Racz, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20278  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Length independent generalization bounds for deep SSM architectures with stability constraints

    Authors: Dániel Rácz, Mihály Petreczky, Bálint Daróczy

    Abstract: Many state-of-the-art models trained on long-range sequences, for example S4, S5 or LRU, are made of sequential blocks combining State-Space Models (SSMs) with neural networks. In this paper we provide a PAC bound that holds for these kind of architectures with stable SSM blocks and does not depend on the length of the input sequence. Imposing stability of the SSM blocks is a standard practice in… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 25 pages, no figures, under submission

    MSC Class: 68 ACM Class: I.2.6

  2. arXiv:2405.10054  [pdf, other

    cs.LG eess.SY

    A finite-sample generalization bound for stable LPV systems

    Authors: Daniel Racz, Martin Gonzalez, Mihaly Petreczky, Andras Benczur, Balint Daroczy

    Abstract: One of the main theoretical challenges in learning dynamical systems from data is providing upper bounds on the generalization error, that is, the difference between the expected prediction error and the empirical prediction error measured on some finite sample. In machine learning, a popular class of such bounds are the so-called Probably Approximately Correct (PAC) bounds. In this paper, we deri… ▽ More

    Submitted 21 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 1 figure, under review

    MSC Class: 68 ACM Class: I.2.0

  3. arXiv:2310.17378  [pdf, other

    cs.LG cs.AI

    Optimization dependent generalization bound for ReLU networks based on sensitivity in the tangent bundle

    Authors: Dániel Rácz, Mihály Petreczky, András Csertán, Bálint Daróczy

    Abstract: Recent advances in deep learning have given us some very promising results on the generalization ability of deep neural networks, however literature still lacks a comprehensive theory explaining why heavily over-parametrized models are able to generalize well while fitting the training data. In this paper we propose a PAC type bound on the generalization error of feedforward ReLU networks via esti… ▽ More

    Submitted 4 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 17 pages, 5 figures, OPT2023: 15th Annual Workshop on Optimization for Machine Learning at the 37th NeurIPS 2023, New Orleans, LA, USA

    MSC Class: 68 ACM Class: I.2.6

  4. arXiv:2307.03630  [pdf, ps, other

    cs.LG

    PAC bounds of continuous Linear Parameter-Varying systems related to neural ODEs

    Authors: Dániel Rácz, Mihály Petreczky, Bálint Daróczy

    Abstract: We consider the problem of learning Neural Ordinary Differential Equations (neural ODEs) within the context of Linear Parameter-Varying (LPV) systems in continuous-time. LPV systems contain bilinear systems which are known to be universal approximators for non-linear systems. Moreover, a large class of neural ODEs can be embedded into LPV systems. As our main contribution we provide Probably Appro… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 12 pages

    MSC Class: 68 ACM Class: I.2.0

  5. arXiv:2202.01934  [pdf, other

    cs.LG

    Smartphone-based Hard-braking Event Detection at Scale for Road Safety Services

    Authors: Luyang Liu, David Racz, Kara Vaillancourt, Julie Michelman, Matt Barnes, Stefan Mellem, Paul Eastham, Bradley Green, Charles Armstrong, Rishi Bal, Shawn O'Banion, Feng Guo

    Abstract: Road crashes are the sixth leading cause of lost disability-adjusted life-years (DALYs) worldwide. One major challenge in traffic safety research is the sparsity of crashes, which makes it difficult to achieve a fine-grain understanding of crash causations and predict future crash risk in a timely manner. Hard-braking events have been widely used as a safety surrogate due to their relatively high… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  6. arXiv:2110.13581  [pdf, other

    cs.LG cs.AI

    Gradient representations in ReLU networks as similarity functions

    Authors: Dániel Rácz, Bálint Daróczy

    Abstract: Feed-forward networks can be interpreted as map**s with linear decision surfaces at the level of the last layer. We investigate how the tangent space of the network can be exploited to refine the decision in case of ReLU (Rectified Linear Unit) activations. We show that a simple Riemannian metric parametrized on the parameters of the network forms a similarity function at least as good as the or… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at 29th ESANN 2021, 6-8 October 2021, Belgium, 7 pages, 1 figure