Skip to main content

Showing 1–6 of 6 results for author: Somogyvári, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.15793  [pdf, other

    cs.LG cs.AI cs.CE stat.CO

    Feature space reduction method for ultrahigh-dimensional, multiclass data: Random forest-based multiround screening (RFMS)

    Authors: Gergely Hanczár, Marcell Stip**er, Dávid Hanák, Marcell T. Kurbucz, Olivér M. Törteli, Ágnes Chripkó, Zoltán Somogyvári

    Abstract: In recent years, numerous screening methods have been published for ultrahigh-dimensional data that contain hundreds of thousands of features; however, most of these features cannot handle data with thousands of classes. Prediction models built to authenticate users based on multichannel biometric data result in this type of problem. In this study, we present a novel method known as random forest-… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures, 2 tables

    MSC Class: 62G05; 68T01; 62H30 ACM Class: I.2.6; I.2.1; G.3

  2. arXiv:2206.10747  [pdf, other

    cs.LG cs.AI cs.DB stat.CO

    BiometricBlender: Ultra-high dimensional, multi-class synthetic data generator to imitate biometric feature space

    Authors: Marcell Stip**er, Dávid Hanák, Marcell T. Kurbucz, Gergely Hanczár, Olivér M. Törteli, Zoltán Somogyvári

    Abstract: The lack of freely available (real-life or synthetic) high or ultra-high dimensional, multi-class datasets may hamper the rapidly growing research on feature screening, especially in the field of biometrics, where the usage of such datasets is common. This paper reports a Python package called BiometricBlender, which is an ultra-high dimensional, multi-class synthetic data generator to benchmark a… ▽ More

    Submitted 25 April, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: DISCLAIMER: This is a preprint article. A final peer reviewed article has now been published in SoftwareX. DOI: https://doi.org/10.1016/j.softx.2023.101366

    MSC Class: 62H30 (Primary) 68T10; 94A62 (Secondary) ACM Class: G.3; I.5.2; K.6.5

    Journal ref: SoftwareX 22 (2023) 101366

  3. arXiv:2105.02322  [pdf, other

    cs.NE

    Reconstructing shared dynamics with a deep neural network

    Authors: Zsigmond Benkő, Zoltán Somogyvári

    Abstract: Determining hidden shared patterns behind dynamic phenomena can be a game-changer in multiple areas of research. Here we present the principles and show a method to identify hidden shared dynamics from time series by a two-module, feedforward neural network architecture: the Mapper-Coach network. We reconstruct unobserved, continuous latent variable input, the time series generated by a chaotic lo… ▽ More

    Submitted 14 October, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

    MSC Class: 68T07

  4. arXiv:2008.03221  [pdf, other

    stat.ME cs.LG stat.ML

    Manifold-adaptive dimension estimation revisited

    Authors: Zsigmond Benkő, Marcell Stip**er, Roberta Rehus, Attila Bencze, Dániel Fabó, Boglárka Hajnal, Loránd Erőss, András Telcs, Zoltán Somogyvári

    Abstract: Data dimensionality informs us about data complexity and sets limit on the structure of successful signal processing pipelines. In this work we revisit and improve the manifold-adaptive Farahmand-Szepesvári-Audibert (FSA) dimension estimator, making it one of the best nearest neighbor-based dimension estimators available. We compute the probability density function of local FSA estimates, if the l… ▽ More

    Submitted 10 August, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

  5. arXiv:2004.11468  [pdf, other

    cs.LG eess.SP physics.data-an stat.ML

    How to find a unicorn: a novel model-free, unsupervised anomaly detection method for time series

    Authors: Zsigmond Benkő, Tamás Bábel, Zoltán Somogyvári

    Abstract: Recognition of anomalous events is a challenging but critical task in many scientific and industrial fields, especially when the properties of anomalies are unknown. In this paper, we introduce a new anomaly concept called "unicorn" or unique event and present a new, model-free, unsupervised detection algorithm to detect unicorns. The key component of the new algorithm is the Temporal Outlier Fact… ▽ More

    Submitted 15 June, 2021; v1 submitted 23 April, 2020; originally announced April 2020.

  6. arXiv:1206.3933  [pdf, other

    cs.SI physics.soc-ph

    Prediction of Emerging Technologies Based on Analysis of the U.S. Patent Citation Network

    Authors: Péter Érdi, Kinga Makovi, Zoltán Somogyvári, Katherine Strandburg, Jan Tobochnik, Péter Volf, László Zalányi

    Abstract: The network of patents connected by citations is an evolving graph, which provides a representation of the innovation process. A patent citing another implies that the cited patent reflects a piece of previously existing knowledge that the citing patent builds upon. A methodology presented here (i) identifies actual clusters of patents: i.e. technological branches, and (ii) gives predictions about… ▽ More

    Submitted 4 April, 2013; v1 submitted 18 June, 2012; originally announced June 2012.

    Journal ref: Scientometrics: Volume 95, Issue 1 (2013), Page 225-242