Showing 1–7 of 7 results for author: Kalinke, F

  1. arXiv:2406.08401  [pdf, other]

    stat.ML cs.LG math.ST

    Nyström Kernel Stein Discrepancy

    Authors: Florian Kalinke, Zoltan Szabo, Bharath K. Sriperumbudur

    Abstract: Kernel methods underpin many of the most successful approaches in data science and statistics, and they allow representing probability measures as elements of a reproducing kernel Hilbert space without loss of information. Recently, the kernel Stein discrepancy (KSD), which combines Stein's method with kernel techniques, gained considerable attention. Through the Stein operator, KSD allows the con…

    Submitted 12 June, 2024; originally announced June 2024.

    MSC Class: 46E22 (Primary) 62G10 (Secondary) ACM Class: G.3; I.2.6

  2. arXiv:2403.07735  [pdf, ps, other]

    math.ST cs.IT cs.LG stat.ML

    The Minimax Rate of HSIC Estimation for Translation-Invariant Kernels

    Authors: Florian Kalinke, Zoltan Szabo

    Abstract: Kernel techniques are among the most influential approaches in data science and statistics. Under mild conditions, the reproducing kernel Hilbert space associated to a kernel is capable of encoding the independence of $M\ge 2$ random variables. Probably the most widespread independence measure relying on kernels is the so-called Hilbert-Schmidt independence criterion (HSIC; also referred to as dis…

    Submitted 12 March, 2024; originally announced March 2024.

    MSC Class: 62C20; 46E22; 47B32; 94A15; 62G10 ACM Class: G.3; H.1.1; I.2.6

  3. arXiv:2402.00592  [pdf, other]

    cs.LG stat.ML

    Partial-Label Learning with a Reject Option

    Authors: Tobias Fuchs, Florian Kalinke, Klemens Böhm

    Abstract: In real-world applications, one often encounters ambiguously labeled data, where different annotators assign conflicting class labels. Partial-label learning allows training classifiers in this weakly supervised setting, where state-of-the-art methods already show good predictive performance. However, even the best algorithms give incorrect predictions, which can have severe consequences when they…

    Submitted 5 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  4. arXiv:2306.12974  [pdf, other]

    cs.LG

    Adaptive Bernstein Change Detector for High-Dimensional Data Streams

    Authors: Marco Heyden, Edouard Fouché, Vadim Arzamasov, Tanja Fenn, Florian Kalinke, Klemens Böhm

    Abstract: Change detection is of fundamental importance when analyzing data streams. Detecting changes both quickly and accurately enables monitoring and prediction systems to react, e.g., by issuing an alarm or by updating a learning algorithm. However, detecting changes is challenging when observations are high-dimensional. In high-dimensional data, change detectors should not only be able to identify whe…

    Submitted 14 January, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    MSC Class: 68T05 ACM Class: I.2.6

  5. arXiv:2302.09930  [pdf, other]

    stat.ML cs.IT cs.LG

    Nyström $M$-Hilbert-Schmidt Independence Criterion

    Authors: Florian Kalinke, Zoltán Szabó

    Abstract: Kernel techniques are among the most popular and powerful approaches of data science. Among the key features that make kernels ubiquitous are (i) the number of domains they have been designed for, (ii) the Hilbert structure of the function class associated to kernels facilitating their statistical analysis, and (iii) their ability to represent probability distributions without loss of information.…

    Submitted 31 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    MSC Class: 46E22; 94A17 ACM Class: I.2.6; H.1.1

    Journal ref: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, PMLR 216:1005-1015, 2023

  6. arXiv:2205.12706  [pdf, other]

    cs.LG

    Maximum Mean Discrepancy on Exponential Windows for Online Change Detection

    Authors: Florian Kalinke, Marco Heyden, Edouard Fouché, Klemens Böhm

    Abstract: Detecting changes is of fundamental importance when analyzing data streams and has many applications, e.g., predictive maintenance, fraud detection, or medicine. A principled approach to detect changes is to compare the distributions of observations within the stream to each other via hypothesis testing. Maximum mean discrepancy (MMD; also called energy distance) is a well-known (semi-)metric on t…

    Submitted 13 March, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    ACM Class: I.2.6; H.1.1

  7. Efficient Subspace Search in Data Streams

    Authors: Edouard Fouché, Florian Kalinke, Klemens Böhm

    Abstract: In the real world, data streams are ubiquitous -- think of network traffic or sensor data. Mining patterns, e.g., outliers or clusters, from such data must take place in real time. This is challenging because (1) streams often have high dimensionality, and (2) the data characteristics may change over time. Existing approaches tend to focus on only one aspect, either high dimensionality or the spec…

    Submitted 7 January, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: Accepted Manuscript to Information Systems, Volume 97, Elsevier. Final authenticated version: https://doi.org/10.1016/j.is.2020.101705

    Journal ref: In: Information Systems 97 (2021), p. 101705. ISSN: 0306-4379