Skip to main content

Showing 1–14 of 14 results for author: Daroczy, B

.
  1. arXiv:2405.20278  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Length independent generalization bounds for deep SSM architectures

    Authors: Dániel Rácz, Mihály Petreczky, Bálint Daróczy

    Abstract: Many state-of-the-art models trained on long-range sequences, for example S4, S5 or LRU, are made of sequential blocks combining State-Space Models (SSMs) with neural networks. In this paper we provide a PAC bound that holds for these kind of architectures with stable SSM blocks and does not depend on the length of the input sequence. Imposing stability of the SSM blocks is a standard practice in… ▽ More

    Submitted 11 July, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 20 pages, no figures, accepted at ICML 2024 Next Generation of Sequence Modeling Architectures Workshop

    MSC Class: 68 ACM Class: I.2.6

  2. arXiv:2405.10054  [pdf, other

    cs.LG eess.SY

    A finite-sample generalization bound for stable LPV systems

    Authors: Daniel Racz, Martin Gonzalez, Mihaly Petreczky, Andras Benczur, Balint Daroczy

    Abstract: One of the main theoretical challenges in learning dynamical systems from data is providing upper bounds on the generalization error, that is, the difference between the expected prediction error and the empirical prediction error measured on some finite sample. In machine learning, a popular class of such bounds are the so-called Probably Approximately Correct (PAC) bounds. In this paper, we deri… ▽ More

    Submitted 21 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 1 figure, under review

    MSC Class: 68 ACM Class: I.2.0

  3. arXiv:2310.17378  [pdf, other

    cs.LG cs.AI

    Optimization dependent generalization bound for ReLU networks based on sensitivity in the tangent bundle

    Authors: Dániel Rácz, Mihály Petreczky, András Csertán, Bálint Daróczy

    Abstract: Recent advances in deep learning have given us some very promising results on the generalization ability of deep neural networks, however literature still lacks a comprehensive theory explaining why heavily over-parametrized models are able to generalize well while fitting the training data. In this paper we propose a PAC type bound on the generalization error of feedforward ReLU networks via esti… ▽ More

    Submitted 4 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 17 pages, 5 figures, OPT2023: 15th Annual Workshop on Optimization for Machine Learning at the 37th NeurIPS 2023, New Orleans, LA, USA

    MSC Class: 68 ACM Class: I.2.6

  4. arXiv:2307.03630  [pdf, ps, other

    cs.LG

    PAC bounds of continuous Linear Parameter-Varying systems related to neural ODEs

    Authors: Dániel Rácz, Mihály Petreczky, Bálint Daróczy

    Abstract: We consider the problem of learning Neural Ordinary Differential Equations (neural ODEs) within the context of Linear Parameter-Varying (LPV) systems in continuous-time. LPV systems contain bilinear systems which are known to be universal approximators for non-linear systems. Moreover, a large class of neural ODEs can be embedded into LPV systems. As our main contribution we provide Probably Appro… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 12 pages

    MSC Class: 68 ACM Class: I.2.0

  5. arXiv:2110.13581  [pdf, other

    cs.LG cs.AI

    Gradient representations in ReLU networks as similarity functions

    Authors: Dániel Rácz, Bálint Daróczy

    Abstract: Feed-forward networks can be interpreted as map**s with linear decision surfaces at the level of the last layer. We investigate how the tangent space of the network can be exploited to refine the decision in case of ReLU (Rectified Linear Unit) activations. We show that a simple Riemannian metric parametrized on the parameters of the network forms a similarity function at least as good as the or… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at 29th ESANN 2021, 6-8 October 2021, Belgium, 7 pages, 1 figure

  6. arXiv:2102.00949  [pdf, other

    quant-ph cs.DS cs.LG

    Quantum Inspired Adaptive Boosting

    Authors: Bálint Daróczy, Katalin Friedl, László Kabódi, Attila Pereszlényi, Dániel Szabó

    Abstract: Building on the quantum ensemble based classifier algorithm of Schuld and Petruccione [arXiv:1704.02146v1], we devise equivalent classical algorithms which show that this quantum ensemble method does not have advantage over classical algorithms. Essentially, we simplify their algorithm until it is intuitive to come up with an equivalent classical version. One of the classical algorithms is extreme… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: 11 pages, 1 figure

  7. arXiv:2006.06780  [pdf, other

    cs.LG cs.NE stat.ML

    Tangent Space Sensitivity and Distribution of Linear Regions in ReLU Networks

    Authors: Bálint Daróczy

    Abstract: Recent articles indicate that deep neural networks are efficient models for various learning problems. However they are often highly sensitive to various changes that cannot be detected by an independent observer. As our understanding of deep neural networks with traditional generalization bounds still remains incomplete, there are several measures which capture the behaviour of the model in case… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 14 pages, 4 figures, 2 tables

    MSC Class: 68T07 ACM Class: I.2.6

  8. arXiv:1912.09306  [pdf, other

    cs.LG stat.ML

    Tangent Space Separability in Feedforward Neural Networks

    Authors: Bálint Daróczy, Rita Aleksziev, András Benczúr

    Abstract: Hierarchical neural networks are exponentially more efficient than their corresponding "shallow" counterpart with the same expressive power, but involve huge number of parameters and require tedious amounts of training. By approximating the tangent subspace, we suggest a sparse representation that enables switching to shallow networks, GradNet after a very early training stage. Our experiments sho… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: 10 pages; accepted at Workshop "Beyond First-Order Optimization Methods in Machine Learning", 33rd Conference on Neural Information Processing Systems (NeurIPS 2019). arXiv admin note: substantial text overlap with arXiv:1807.06630

    MSC Class: I.2.6; I.5.1 ACM Class: I.2.6; I.5.1

  9. arXiv:1807.06630  [pdf, other

    cs.LG stat.ML

    Expressive power of outer product manifolds on feed-forward neural networks

    Authors: Bálint Daróczy, Rita Aleksziev, András Benczúr

    Abstract: Hierarchical neural networks are exponentially more efficient than their corresponding "shallow" counterpart with the same expressive power, but involve huge number of parameters and require tedious amounts of training. Our main idea is to mathematically understand and describe the hierarchical structure of feedforward neural networks by reparametrization invariant Riemannian metrics. By computing… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: 11 pages, 8 figures, under submission

  10. And Now for Something Completely Different: Visual Novelty in an Online Network of Designers

    Authors: Johannes Wachs, Bálint Daróczy, Anikó Hannák, Katinka Páll, Christoph Riedl

    Abstract: Novelty is a key ingredient of innovation but quantifying it is difficult. This is especially true for visual work like graphic design. Using designs shared on an online social network of professional digital designers, we measure visual novelty using statistical learning methods to compare an images features with those of images that have been created before. We then relate social network positio… ▽ More

    Submitted 23 April, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

    Comments: accepted to 10th International ACM Web Science Conference, 2018, May 27-30, Amsterdam, The Netherlands, 11 pages, 6 figures, 60 references

  11. Machine learning methods for multimedia information retrieval

    Authors: Bálint Zoltán Daróczy

    Abstract: In this thesis we examined several multimodal feature extraction and learning methods for retrieval and classification purposes. We reread briefly some theoretical results of learning in Section 2 and reviewed several generative and discriminative models in Section 3 while we described the similarity kernel in Section 4. We examined different aspects of the multimodal image retrieval and classific… ▽ More

    Submitted 14 May, 2017; originally announced May 2017.

    Comments: doctoral thesis, 2016

  12. arXiv:1705.02972  [pdf, ps, other

    cs.SI cs.CY

    Why Do Men Get More Attention? Exploring Factors Behind Success in an Online Design Community

    Authors: Johannes Wachs, Anikó Hannák, András Vörös, Bálint Daróczy

    Abstract: Online platforms are an increasingly popular tool for people to produce, promote or sell their work. However recent studies indicate that social disparities and biases present in the real world might transfer to online platforms and could be exacerbated by seemingly harmless design choices on the site (e.g., recommendation systems or publicly visible success measures). In this paper we analyze an… ▽ More

    Submitted 8 May, 2017; originally announced May 2017.

    Comments: in The International AAAI Conference on Web and Social Media (ICWSM2017), Montreal, May 2017

    Journal ref: ICWSM 2017

  13. arXiv:1611.01974  [pdf, other

    cs.IR

    Item-to-item recommendation based on Contextual Fisher Information

    Authors: Bálint Daróczy, Frederick Ayala-Gómez, András Benczúr

    Abstract: Web recommendation services bear great importance in e-commerce, as they aid the user in navigating through the items that are most relevant to her needs. In a typical Web site, long history of previous activities or purchases by the user is rarely available. Hence in most cases, recommenders propose items that are similar to the most recent ones viewed in the current user session. The correspondi… ▽ More

    Submitted 8 November, 2016; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: 9 pages, 8 figures, 4 tables

  14. arXiv:1505.03002  [pdf, other

    physics.soc-ph cs.IR cs.SI

    Statistical analysis of NOMAO customer votes for spots of France

    Authors: Robert Palovics, Balint Daroczy, Andras Benczur, Julia Pap, Leonardo Ermann, Samuel Phan, Alexei D. Chepelianskii, Dima L. Shepelyansky

    Abstract: We investigate the statistical properties of votes of customers for spots of France collected by the startup company NOMAO. The frequencies of votes per spot and per customer are characterized by a power law distributions which remain stable on a time scale of a decade when the number of votes is varied by almost two orders of magnitude. Using the computer science methods we explore the spectrum a… ▽ More

    Submitted 12 May, 2015; originally announced May 2015.

    Comments: 10 pages, 12 figs

    Journal ref: Eur. Phys. J. B. v.88, p.194 (2015)