Skip to main content

Showing 1–17 of 17 results for author: Pevny, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2208.02694  [pdf, ps, other

    stat.ML cs.LG

    Explaining Classifiers Trained on Raw Hierarchical Multiple-Instance Data

    Authors: Tomáš Pevný, Viliam Lisý, Branislav Bošanský, Petr Somol, Michal Pěchouček

    Abstract: Learning from raw data input, thus limiting the need for feature engineering, is a component of many successful applications of machine learning methods in various domains. While many problems naturally translate into a vector representation directly usable in standard classifiers, a number of data sources have the natural form of structured data interchange formats (e.g., security logs in JSON/XM… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

  2. arXiv:2110.04776  [pdf, ps, other

    cs.LG stat.ML

    Fitting large mixture models using stochastic component selection

    Authors: Milan Papež, Tomáš Pevný, Václav Šmídl

    Abstract: Traditional methods for unsupervised learning of finite mixture models require to evaluate the likelihood of all components of the mixture. This becomes computationally prohibitive when the number of components is large, as it is, for example, in the sum-product (transform) networks. Therefore, we propose to apply a combination of the expectation maximization and the Metropolis-Hastings algorithm… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

  3. arXiv:2105.09107  [pdf, ps, other

    stat.ML cs.LG cs.MS

    Mill.jl and JsonGrinder.jl: automated differentiable feature extraction for learning from raw JSON data

    Authors: Simon Mandlik, Matej Racinsky, Viliam Lisy, Tomas Pevny

    Abstract: Learning from raw data input, thus limiting the need for manual feature engineering, is one of the key components of many successful applications of machine learning methods. While machine learning problems are often formulated on data that naturally translate into a vector representation suitable for classifiers, there are data sources, for example in cybersecurity, that are naturally represented… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 5 pages, 2 figures, 1 table, submitted to section on one-source software of Journal of Machine Learning Research

  4. arXiv:2006.01681  [pdf, other

    cs.LG cs.NE stat.ML

    Neural Power Units

    Authors: Niklas Heim, Tomáš Pevný, Václav Šmídl

    Abstract: Conventional Neural Networks can approximate simple arithmetic operations, but fail to generalize beyond the range of numbers that were seen during training. Neural Arithmetic Units aim to overcome this difficulty, but current arithmetic units are either limited to operate on positive numbers or can only represent a subset of arithmetic operations. We introduce the Neural Power Unit (NPU) that ope… ▽ More

    Submitted 17 December, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

  5. arXiv:2005.01297  [pdf, other

    stat.ML cs.LG

    Sum-Product-Transform Networks: Exploiting Symmetries using Invertible Transformations

    Authors: Tomas Pevny, Vasek Smidl, Martin Trapp, Ondrej Polacek, Tomas Oberhuber

    Abstract: In this work, we propose Sum-Product-Transform Networks (SPTN), an extension of sum-product networks that uses invertible transformations as additional internal nodes. The type and placement of transformations determine properties of the resulting SPTN with many interesting special cases. Importantly, SPTN with Gaussian leaves and affine transformations pose the same inference task tractable that… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

  6. arXiv:2002.10923  [pdf, other

    cs.LG stat.ML

    General Framework for Binary Classification on Top Samples

    Authors: Lukáš Adam, Václav Mácha, Václav Šmídl, Tomáš Pevný

    Abstract: Many binary classification problems minimize misclassification above (or below) a threshold. We show that instances of ranking problems, accuracy at the top or hypothesis testing may be written in this form. We propose a general framework to handle these classes of problems and show which known methods (both known and newly proposed) fall into this framework. We provide a theoretical analysis of t… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    MSC Class: 90C15; 90C26; 49M05

  7. arXiv:1912.00656  [pdf, other

    stat.ML cs.LG

    Rodent: Relevance determination in differential equations

    Authors: Niklas Heim, Václav Šmídl, Tomáš Pevný

    Abstract: We aim to identify the generating, ordinary differential equation (ODE) from a set of trajectories of a partially observed system. Our approach does not need prescribed basis functions to learn the ODE model, but only a rich set of Neural Arithmetic Units. For maximal explainability of the learnt model, we minimise the state size of the ODE as well as the number of non-zero parameters that are nee… ▽ More

    Submitted 12 March, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

  8. arXiv:1911.08756  [pdf, other

    cs.LG cs.AI stat.ML

    Classification with Costly Features in Hierarchical Deep Sets

    Authors: Jaromír Janisch, Tomáš Pevný, Viliam Lisý

    Abstract: Classification with Costly Features (CwCF) is a classification problem that includes the cost of features in the optimization criteria. Individually for each sample, its features are sequentially acquired to maximize accuracy while minimizing the acquired features' cost. However, existing approaches can only process data that can be expressed as vectors of fixed length. In real life, the data ofte… ▽ More

    Submitted 29 February, 2024; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: formerly Hierarchical Multiple-Instance Data Classification with Costly Features; RL4RealLife @ ICML2021; code available at https://github.com/jaromiru/rcwcf

  9. arXiv:1909.02564  [pdf, other

    cs.LG cs.AI stat.ML

    Classification with Costly Features as a Sequential Decision-Making Problem

    Authors: Jaromír Janisch, Tomáš Pevný, Viliam Lisý

    Abstract: This work focuses on a specific classification problem, where the information about a sample is not readily available, but has to be acquired for a cost, and there is a per-sample budget. Inspired by real-world use-cases, we analyze average and hard variations of a directly specified budget. We postulate the problem in its explicit formulation and then convert it into an equivalent MDP, that can b… ▽ More

    Submitted 5 September, 2019; originally announced September 2019.

    Journal ref: Machine Learning (2020): 1-29

  10. arXiv:1906.09084  [pdf, other

    cs.LG cs.CR stat.ML

    Joint Detection of Malicious Domains and Infected Clients

    Authors: Paul Prasse, Rene Knaebel, Lukas Machlica, Tomas Pevny, Tobias Scheffer

    Abstract: Detection of malware-infected computers and detection of malicious web domains based on their encrypted HTTPS traffic are challenging problems, because only addresses, timestamps, and data volumes are observable. The detection problems are coupled, because infected clients tend to interact with malicious domains. Traffic data can be collected at a large scale, and antivirus tools can be used to id… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: Mach Learn (2019)

  11. arXiv:1906.00764  [pdf, ps, other

    cs.LG stat.ML

    Approximation capability of neural networks on spaces of probability measures and tree-structured domains

    Authors: Tomas Pevny, Vojtech Kovarik

    Abstract: This paper extends the proof of density of neural networks in the space of continuous (or even measurable) functions on Euclidean spaces to functions on compact sets of probability measures. By doing so the work parallels a more then a decade old results on mean-map embedding of probability measures in reproducing kernel Hilbert spaces. The work has wide practical consequences for multi-instance l… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

  12. arXiv:1905.11890  [pdf, other

    stat.ML cs.LG

    Anomaly scores for generative models

    Authors: Václav Šmídl, Jan Bím, Tomáš Pevný

    Abstract: Reconstruction error is a prevalent score used to identify anomalous samples when data are modeled by generative models, such as (variational) auto-encoders or generative adversarial networks. This score relies on the assumption that normal samples are located on a manifold and all anomalous samples are located outside. Since the manifold can be learned only where the training data lie, there are… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: 9 pages, 3 figures, submitted to NeurIPS 2019

  13. arXiv:1807.05027  [pdf, ps, other

    cs.LG stat.ML

    Are generative deep models for novelty detection truly better?

    Authors: Vít Škvára, Tomáš Pevný, Václav Šmídl

    Abstract: Many deep models have been recently proposed for anomaly detection. This paper presents comparison of selected generative deep models and classical anomaly detection methods on an extensive number of non--image benchmark datasets. We provide statistical comparison of the selected models, in many configurations, architectures and hyperparamaters. We arrive to conclusion that performance of the gene… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 7 pages, ODD v5.0 - KDD 2018 workshop

  14. arXiv:1807.00173  [pdf, other

    math.OC cs.LG stat.ML

    Algorithms for solving optimization problems arising from deep neural net models: nonsmooth problems

    Authors: Vyacheslav Kungurtsev, Tomas Pevny

    Abstract: Machine Learning models incorporating multiple layered learning networks have been seen to provide effective models for various classification problems. The resulting optimization problem to solve for the optimal vector minimizing the empirical risk is, however, highly nonconvex. This alone presents a challenge to application and development of appropriate optimization algorithms for solving the p… ▽ More

    Submitted 30 June, 2018; originally announced July 2018.

    Report number: Cisco Prague WP5 Report 2016-02

  15. arXiv:1807.00172  [pdf, other

    math.OC cs.LG stat.ML

    Algorithms for solving optimization problems arising from deep neural net models: smooth problems

    Authors: Vyacheslav Kungurtsev, Tomas Pevny

    Abstract: Machine Learning models incorporating multiple layered learning networks have been seen to provide effective models for various classification problems. The resulting optimization problem to solve for the optimal vector minimizing the empirical risk is, however, highly nonlinear. This presents a challenge to application and development of appropriate optimization algorithms for solving the problem… ▽ More

    Submitted 30 June, 2018; originally announced July 2018.

    Report number: Cisco Prague WP5 Project Report 2016-01

  16. arXiv:1711.07364  [pdf, other

    cs.AI cs.LG stat.ML

    Classification with Costly Features using Deep Reinforcement Learning

    Authors: Jaromír Janisch, Tomáš Pevný, Viliam Lisý

    Abstract: We study a classification problem where each feature can be acquired for a cost and the goal is to optimize a trade-off between the expected classification error and the feature cost. We revisit a former approach that has framed the problem as a sequential decision-making problem and solved it by Q-learning with a linear approximation, where individual actions are either requests for feature value… ▽ More

    Submitted 12 November, 2018; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: AAAI 2019

  17. arXiv:1609.07257  [pdf, other

    cs.LG stat.ML

    Using Neural Network Formalism to Solve Multiple-Instance Problems

    Authors: Tomas Pevny, Petr Somol

    Abstract: Many objects in the real world are difficult to describe by a single numerical vector of a fixed length, whereas describing them by a set of vectors is more natural. Therefore, Multiple instance learning (MIL) techniques have been constantly gaining on importance throughout last years. MIL formalism represents each object (sample) by a set (bag) of feature vectors (instances) of fixed length where… ▽ More

    Submitted 7 March, 2017; v1 submitted 23 September, 2016; originally announced September 2016.

    Comments: Accepted to International Symposium on Neural Networks