Showing 1–15 of 15 results for author: Koch, P

Searching in archive stat.
  1. arXiv:2308.09368

    cs.CV cs.CL cs.CY cs.LG stat.ML

    A tailored Handwritten-Text-Recognition System for Medieval Latin

    Authors: Philipp Koch, Gilary Vera Nuñez, Esteban Garces Arias, Christian Heumann, Matthias Schöffel, Alexander Häberlin, Matthias Aßenmacher

    Abstract: The Bavarian Academy of Sciences and Humanities aims to digitize its Medieval Latin Dictionary. This dictionary entails record cards referring to lemmas in medieval Latin, a low-resource language. A crucial step of the digitization process is the Handwritten Text Recognition (HTR) of the handwritten lemmas found on these record cards. In our work, we introduce an end-to-end pipeline, tailored to t…

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted at the First Workshop on Ancient Language Processing, co-located with RANLP 2023. This is the author's version of the work. The definitive version of record will be published in the proceedings

  2. arXiv:2301.04856

    cs.CL cs.LG stat.ML

    Multimodal Deep Learning

    Authors: Cem Akkus, Luyang Chu, Vladana Djakovic, Steffen Jauch-Walser, Philipp Koch, Giacomo Loss, Christopher Marquardt, Marco Moldovan, Nadja Sauter, Maximilian Schneider, Rickmer Schulte, Karol Urbanczyk, Jann Goschenhofer, Christian Heumann, Rasmus Hvingelby, Daniel Schalk, Matthias Aßenmacher

    Abstract: This book is the result of a seminar in which we reviewed multimodal approaches and attempted to create a solid overview of the field, starting with the current state-of-the-art approaches in the two subfields of Deep Learning individually. Further, modeling frameworks are discussed where one modality is transformed into the other, as well as models in which one modality is utilized to enhance rep…

    Submitted 12 January, 2023; originally announced January 2023.

  3. Personalized Automatic Sleep Staging with Single-Night Data: a Pilot Study with KL-Divergence Regularization

    Authors: Huy Phan, Kaare Mikkelsen, Oliver Y. Chén, Philipp Koch, Alfred Mertins, Preben Kidmose, Maarten De Vos

    Abstract: Brain waves vary between people. An obvious way to improve automatic sleep staging for longitudinal sleep monitoring is personalization of algorithms based on individual characteristics extracted from the first night of data. As a single night is a very small amount of data to train a sleep staging model, we propose a Kullback-Leibler (KL) divergence regularized transfer learning approach to addre…

    Submitted 11 May, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: This article has been published in Physiological Measurement

  4. arXiv:2001.05532

    cs.LG cs.SD eess.AS stat.ML

    Improving GANs for Speech Enhancement

    Authors: Huy Phan, Ian V. McLoughlin, Lam Pham, Oliver Y. Chén, Philipp Koch, Maarten De Vos, Alfred Mertins

    Abstract: Generative adversarial networks (GAN) have recently been shown to be efficient for speech enhancement. However, most, if not all, existing speech enhancement GANs (SEGAN) make use of a single generator to perform one-stage enhancement mapping. In this work, we propose to use multiple generators that are chained to perform multi-stage enhancement mapping, which gradually refines the noisy input sig…

    Submitted 12 September, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: This letter has been accepted for publication in IEEE Signal Processing Letters

  5. arXiv:1909.09223

    cs.LG stat.ML

    InterpretML: A Unified Framework for Machine Learning Interpretability

    Authors: Harsha Nori, Samuel Jenkins, Paul Koch, Rich Caruana

    Abstract: InterpretML is an open-source Python package which exposes machine learning interpretability algorithms to practitioners and researchers. InterpretML exposes two types of interpretability - glassbox models, which are machine learning models designed for interpretability (ex: linear models, rule lists, generalized additive models), and blackbox explainability techniques for explaining existing syst…

    Submitted 19 September, 2019; originally announced September 2019.

  6. arXiv:1908.04909

    cs.LG cs.DC cs.NE stat.ML

    Constrained Multi-Objective Optimization for Automated Machine Learning

    Authors: Steven Gardner, Oleg Golovidov, Joshua Griffin, Patrick Koch, Wayne Thompson, Brett Wujek, Yan Xu

    Abstract: Automated machine learning has gained a lot of attention recently. Building and selecting the right machine learning models is often a multi-objective optimization problem. General purpose machine learning software that simultaneously supports multiple objectives and constraints is scant, though the potential benefits are great. In this work, we present a framework called Autotune that effectively…

    Submitted 13 August, 2019; originally announced August 2019.

    Comments: 10 pages, 8 figures, accepted at DSAA 2019

  7. arXiv:1907.13177

    cs.LG eess.SP stat.ML

    Towards More Accurate Automatic Sleep Staging via Deep Transfer Learning

    Authors: Huy Phan, Oliver Y. Chén, Philipp Koch, Zongqing Lu, Ian McLoughlin, Alfred Mertins, Maarten De Vos

    Abstract: Background: Despite recent significant progress in the development of automatic sleep staging methods, building a good model still remains a big challenge for sleep studies with a small cohort due to the data-variability and data-inefficiency issues. This work presents a deep transfer learning approach to overcome these issues and enable transferring knowledge from a large dataset to a small cohor…

    Submitted 27 August, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: This article has been published in IEEE Transactions on Biomedical Engineering

  8. arXiv:1904.05945

    cs.LG stat.ML

    Deep Transfer Learning for Single-Channel Automatic Sleep Staging with Channel Mismatch

    Authors: Huy Phan, Oliver Y. Chén, Philipp Koch, Alfred Mertins, Maarten De Vos

    Abstract: Many sleep studies suffer from the problem of insufficient data to fully utilize deep neural networks as different labs use different recordings set ups, leading to the need of training automated algorithms on rather small databases, whereas large annotated databases are around but cannot be directly included into these studies for data compensation due to channel mismatch. This work presents a de…

    Submitted 18 June, 2019; v1 submitted 11 April, 2019; originally announced April 2019.

    Comments: Accepted for 27th European Signal Processing Conference (EUSIPCO 2019)

  9. arXiv:1904.03543

    cs.SD cs.LG eess.AS stat.ML

    Spatio-Temporal Attention Pooling for Audio Scene Classification

    Authors: Huy Phan, Oliver Y. Chén, Lam Pham, Philipp Koch, Maarten De Vos, Ian McLoughlin, Alfred Mertins

    Abstract: Acoustic scenes are rich and redundant in their content. In this work, we present a spatio-temporal attention pooling layer coupled with a convolutional recurrent neural network to learn from patterns that are discriminative while suppressing those that are irrelevant for acoustic scene classification. The convolutional layers in this network learn invariant features from time-frequency input. The…

    Submitted 28 June, 2019; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: To appear at the 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)

  10. arXiv:1811.01092

    cs.LG cs.SD eess.AS stat.ML

    Unifying Isolated and Overlapping Audio Event Detection with Multi-Label Multi-Task Convolutional Recurrent Neural Networks

    Authors: Huy Phan, Oliver Y. Chén, Philipp Koch, Lam Pham, Ian McLoughlin, Alfred Mertins, Maarten De Vos

    Abstract: We propose a multi-label multi-task framework based on a convolutional recurrent neural network to unify detection of isolated and overlapping audio events. The framework leverages the power of convolutional recurrent neural network architectures; convolutional layers learn effective features over which higher recurrent layers perform sequential modelling. Furthermore, the output layer is designed…

    Submitted 18 February, 2019; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: Accepted for the 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019)

  11. arXiv:1810.09092

    cs.LG stat.ML

    Axiomatic Interpretability for Multiclass Additive Models

    Authors: Xuezhou Zhang, Sarah Tan, Paul Koch, Yin Lou, Urszula Chajewska, Rich Caruana

    Abstract: Generalized additive models (GAMs) are favored in many regression and binary classification problems because they are able to fit complex, nonlinear functions while still remaining interpretable. In the first part of this paper, we generalize a state-of-the-art GAM learning algorithm based on boosted trees to the multiclass setting, and show that this multiclass algorithm outperforms existing GAM…

    Submitted 30 May, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: KDD 2019

  12. arXiv:1804.07824

    cs.LG cs.DC cs.NE stat.ML

    Autotune: A Derivative-free Optimization Framework for Hyperparameter Tuning

    Authors: Patrick Koch, Oleg Golovidov, Steven Gardner, Brett Wujek, Joshua Griffin, Yan Xu

    Abstract: Machine learning applications often require hyperparameter tuning. The hyperparameters usually drive both the efficiency of the model training process and the resulting model quality. For hyperparameter tuning, machine learning algorithms are complex black-boxes. This creates a class of challenging optimization problems, whose objective functions tend to be nonsmooth, discontinuous, unpredictably…

    Submitted 2 August, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: 10 pages, 9 figures, accepted by KDD 2018

  13. Considerations When Learning Additive Explanations for Black-Box Models

    Authors: Sarah Tan, Giles Hooker, Paul Koch, Albert Gordo, Rich Caruana

    Abstract: Many methods to explain black-box models, whether local or global, are additive. In this paper, we study global additive explanations for non-additive models, focusing on four explanation methods: partial dependence, Shapley explanations adapted to a global setting, distilled additive explanations, and gradient-based explanations. We show that different explanation methods characterize non-additiv…

    Submitted 31 July, 2023; v1 submitted 25 January, 2018; originally announced January 2018.

    Comments: Published at Machine Learning (2023). Previously titled "Learning Global Additive Explanations for Neural Nets Using Model Distillation". A short version was presented at NeurIPS 2018 Machine Learning for Health Workshop

  14. arXiv:1612.04468

    cs.CV cs.AI stat.ML

    Sparse Factorization Layers for Neural Networks with Limited Supervision

    Authors: Parker Koch, Jason J. Corso

    Abstract: Whereas CNNs have demonstrated immense progress in many vision problems, they suffer from a dependence on monumental amounts of labeled training data. On the other hand, dictionary learning does not scale to the size of problems that CNNs can handle, despite being very effective at low-level vision tasks such as denoising and inpainting. Recently, interest has grown in adapting dictionary learning…

    Submitted 13 December, 2016; originally announced December 2016.

  15. arXiv:0911.4397

    stat.ML

    How slow is slow? SFA detects signals that are slower than the driving force

    Authors: Wolfgang Konen, Patrick Koch

    Abstract: Slow feature analysis (SFA) is a method for extracting slowly varying driving forces from quickly varying nonstationary time series. We show here that it is possible for SFA to detect a component which is even slower than the driving force itself (e.g. the envelope of a modulated sine wave). It is shown that it depends on circumstances like the embedding dimension, the time series predictability…

    Submitted 23 November, 2009; originally announced November 2009.