Showing 1–7 of 7 results for author: Hammerschmidt, C

  1. arXiv:2203.16331  [pdf, other]

    cs.LG cs.LO cs.SE

    FlexFringe: Modeling Software Behavior by Learning Probabilistic Automata

    Authors: Sicco Verwer, Christian Hammerschmidt

    Abstract: We present the efficient implementations of probabilistic deterministic finite automaton learning methods available in FlexFringe. These implement well-known strategies for state-merging including several modifications to improve their performance in practice. We show experimentally that these algorithms obtain competitive results and significant improvements over a default implementation. We also…

    Submitted 24 August, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

  2. arXiv:2005.03773  [pdf, other]

    cs.LG stat.ML

    Minority Class Oversampling for Tabular Data with Deep Generative Models

    Authors: Ramiro Camino, Christian Hammerschmidt, Radu State

    Abstract: In practice, machine learning experts are often confronted with imbalanced data. Without accounting for the imbalance, common classifiers perform poorly and standard evaluation metrics mislead the practitioners on the model's performance. A common method to treat imbalanced datasets is under- and oversampling. In this process, samples are either removed from the majority class or synthetic samples…

    Submitted 20 July, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

  3. Beyond Labeling: Using Clustering to Build Network Behavioral Profiles of Malware Families

    Authors: Azqa Nadeem, Christian Hammerschmidt, Carlos H. Gañán, Sicco Verwer

    Abstract: Malware family labels are known to be inconsistent. They are also black-box since they do not represent the capabilities of malware. The current state-of-the-art in malware capability assessment includes mostly manual approaches, which are infeasible due to the ever-increasing volume of discovered malware samples. We propose a novel unsupervised machine learning-based method called MalPaCA, which a…

    Submitted 13 November, 2020; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: Accepted as a chapter in Springer MAAIDL 2020

  4. arXiv:1902.10666  [pdf, other]

    cs.LG stat.ML

    Improving Missing Data Imputation with Deep Generative Models

    Authors: Ramiro D. Camino, Christian A. Hammerschmidt, Radu State

    Abstract: Datasets with missing values are very common in industry applications, and they can have a negative impact on machine learning models. Recent studies introduced solutions to the problem of imputing missing values based on deep generative models. Previous experiments with Generative Adversarial Networks and Variational Autoencoders showed interesting results in this domain, but it is not clear whic…

    Submitted 27 February, 2019; originally announced February 2019.

  5. arXiv:1807.01202  [pdf, other]

    stat.ML cs.LG

    Generating Multi-Categorical Samples with Generative Adversarial Networks

    Authors: Ramiro Camino, Christian Hammerschmidt, Radu State

    Abstract: We propose a method to train generative adversarial networks on multivariate feature vectors representing multiple categorical values. In contrast to the continuous domain, where GAN-based methods have delivered considerable results, GANs struggle to perform equally well on discrete data. We propose and compare several architectures based on multiple (Gumbel) softmax output layers taking into accou…

    Submitted 4 July, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

    Journal ref: Presented at the ICML 2018 workshop on Theoretical Foundations and Applications of Deep Generative Models, Stockholm, Sweden

  6. arXiv:1707.09430  [pdf, ps, other]

    stat.ML cs.LG

    Human in the Loop: Interactive Passive Automata Learning via Evidence-Driven State-Merging Algorithms

    Authors: Christian A. Hammerschmidt, Radu State, Sicco Verwer

    Abstract: We present an interactive version of an evidence-driven state-merging (EDSM) algorithm for learning variants of finite state automata. Learning these automata often amounts to recovering or reverse engineering the model generating the data despite noisy, incomplete, or imperfectly sampled data sources rather than optimizing a purely numeric target function. Domain expertise and human knowledge abo…

    Submitted 28 July, 2017; originally announced July 2017.

    Comments: 4 pages, presented at the Human in the Loop workshop at ICML 2017

  7. arXiv:1611.07100  [pdf, other]

    stat.ML cs.AI

    Interpreting Finite Automata for Sequential Data

    Authors: Christian Albert Hammerschmidt, Sicco Verwer, Qin Lin, Radu State

    Abstract: Automaton models are often seen as interpretable models. Interpretability itself is not well defined: it remains unclear what interpretability means without first explicitly specifying objectives or desired attributes. In this paper, we identify the key properties used to interpret automata and propose a modification of a state-merging approach to learn variants of finite state automata. We apply…

    Submitted 24 November, 2016; v1 submitted 21 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

    ACM Class: I.2.6