Skip to main content

Showing 1–42 of 42 results for author: Pernkopf, F

.
  1. arXiv:2405.15514  [pdf, other

    stat.ML cs.AI cs.LG

    On the Convexity and Reliability of the Bethe Free Energy Approximation

    Authors: Harald Leisenberger, Christian Knoll, Franz Pernkopf

    Abstract: The Bethe free energy approximation provides an effective way for relaxing NP-hard problems of probabilistic inference. However, its accuracy depends on the model parameters and particularly degrades if a phase transition in the model occurs. In this work, we analyze when the Bethe approximation is reliable and how this can be verified. We argue and show by experiment that it is mostly accurate if… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the Journal of Machine Learning Research (JMLR) for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  2. arXiv:2402.14781  [pdf, ps, other

    cs.LG cs.AI stat.ME stat.ML

    Rao-Blackwellising Bayesian Causal Inference

    Authors: Christian Toth, Christian Knoll, Franz Pernkopf, Robert Peharz

    Abstract: Bayesian causal inference, i.e., inferring a posterior over causal models for the use in downstream causal reasoning tasks, poses a hard computational inference problem that is little explored in literature. In this work, we combine techniques from order-based MCMC structure learning with recent advances in gradient-based graph learning into an effective Bayesian causal inference framework. Specif… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 8 pages + references + appendices (19 pages total)

  3. Angle-Equivariant Convolutional Neural Networks for Interference Mitigation in Automotive Radar

    Authors: Christian Oswald, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: In automotive applications, frequency modulated continuous wave (FMCW) radar is an established technology to determine the distance, velocity and angle of objects in the vicinity of the vehicle. The quality of predictions might be seriously impaired if mutual interference between radar sensors occurs. Previous work processes data from the entire receiver array in parallel to increase interference… ▽ More

    Submitted 18 December, 2023; originally announced January 2024.

    Comments: 4 pages, 3 figures

    Journal ref: 2023 20th European Radar Conference (EuRAD) (pp. 135-138). IEEE

  4. arXiv:2312.09790  [pdf, other

    cs.LG eess.SP

    End-to-End Training of Neural Networks for Automotive Radar Interference Mitigation

    Authors: Christian Oswald, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: In this paper we propose a new method for training neural networks (NNs) for frequency modulated continuous wave (FMCW) radar mutual interference mitigation. Instead of training NNs to regress from interfered to clean radar signals as in previous work, we train NNs directly on object detection maps. We do so by performing a continuous relaxation of the cell-averaging constant false alarm rate (CA-… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 2023 IEEE International Radar Conference (RADAR), 6 pages, 4 figures

  5. arXiv:2311.10478  [pdf, other

    eess.SP

    "UWBCarGraz" Dataset for Car Occupancy Detection using Ultra-Wideband Radar

    Authors: Jakob Möderl, Stefan Posch, Franz Pernkopf, Klaus Witrisal

    Abstract: We present a data-driven car occupancy detection algorithm using ultra-wideband radar based on the ResNet architecture. The algorithm is trained on a dataset of channel impulse responses obtained from measurements at three different activity levels of the occupants (i.e. breathing, talking, moving). We compare the presented algorithm against a state-of-the-art car occupancy detection algorithm bas… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: v1 (17.11.2023). 6 pages, 5 figures

  6. arXiv:2306.00442  [pdf, other

    eess.SP

    Fast Variational Block-Sparse Bayesian Learning

    Authors: Jakob Möderl, Franz Pernkopf, Klaus Witrisal, Erik Leitinger

    Abstract: We present a fast update rule for variational block-sparse Bayesian learning (SBL) methods. Based on a variational Bayesian approximation, we show that iterative updates of probability density functions (PDFs) of the prior precisions and weights can be expressed as a nonlinear first-order recurrence from one estimate of the parameters of the proxy PDFs to the next. In particular, for commonly used… ▽ More

    Submitted 13 December, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 10 pages, 4 figures, submitted to IEEE Transactions on Signal Processing on 1st of June, 2023, Major Revision on Dec. 3, 2023

  7. arXiv:2303.07821  [pdf, ps, other

    cs.IT eess.SP

    Self-attention for Enhanced OAMP Detection in MIMO Systems

    Authors: Alexander Fuchs, Christian Knoll, Nima N. Moghadam, Alexey Pak **liang Huang, Erik Leitinger, Franz Pernkopf

    Abstract: Multiple-Input Multiple-Output (MIMO) systems are essential for wireless communications. Sinceclassical algorithms for symbol detection in MIMO setups require large computational resourcesor provide poor results, data-driven algorithms are becoming more popular. Most of the proposedalgorithms, however, introduce approximations leading to degraded performance for realistic MIMOsystems. In this pape… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 8 pages, 2 figures, ICASSP 2023

    ACM Class: I.2.1; H.1.1

  8. arXiv:2303.03017  [pdf, other

    eess.SP eess.AS

    Variational Inference of Structured Line Spectra Exploiting Group-Sparsity

    Authors: Jakob Möderl, Franz Pernkopf, Klaus Witrisal, Erik Leitinger

    Abstract: In this paper, we present a variational inference algorithm that decomposes a signal into multiple groups of related spectral lines. The spectral lines in each group are associated with a group parameter common to all spectral lines within the group. The proposed algorithm jointly estimates the group parameters, the number of spetral lines within a group, and the number of groups exploiting a Bern… ▽ More

    Submitted 31 May, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 13 Pages, 5 Figures. Submitted to IEEE Transactions on Signal Processing on 6th of March, 2023. Update 31.05.2023: Fixed wrong/missing internal references

  9. arXiv:2210.07619  [pdf, other

    eess.SP

    Variational Message Passing-Based Respiratory Motion Estimation and Detection Using Radar Signals

    Authors: Jakob Möderl, Erik Leitinger, Franz Pernkopf, Klaus Witrisal

    Abstract: We present a variational message passing (VMP) approach to detect the presence of a person based on their respiratory chest motion using multistatic ultra-wideband (UWB) radar. In the process, the respiratory motion is estimated for contact-free vital sign monitoring. The received signal is modeled by a backscatter channel and the respiratory motion and propagation channels are estimated using VMP… ▽ More

    Submitted 29 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: 29.10.22: Updated with extension to multistatic radar systems. Submitted to ICASSP 2023, 4 pages + references, 4 figures, UWB radar rar occupancy detection, variational message passing

  10. arXiv:2206.02063  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Active Bayesian Causal Inference

    Authors: Christian Toth, Lars Lorch, Christian Knoll, Andreas Krause, Franz Pernkopf, Robert Peharz, Julius von Kügelgen

    Abstract: Causal discovery and causal reasoning are classically treated as separate and consecutive tasks: one first infers the causal graph, and then uses it to estimate causal effects of interventions. However, such a two-stage approach is uneconomical, especially in terms of actively collected interventional data, since the causal query of interest may not require a fully-specified causal model. From a B… ▽ More

    Submitted 15 October, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version. RP & JvK are shared last authors. 10 pages + Bibliography + Appendix (34 pages total)

  11. Explainable Machine Learning for Breakdown Prediction in High Gradient RF Cavities

    Authors: Christoph Obermair, Thomas Cartier-Michaud, Andrea Apollonio, William Millar, Lukas Felsberger, Lorenz Fischl, Holger Severin Bovbjerg, Daniel Wollmann, Walter Wuensch, Nuria Catalan-Lasheras, Marçà Boronat, Franz Pernkopf, Graeme Burt

    Abstract: The occurrence of vacuum arcs or radio frequency (rf) breakdowns is one of the most prevalent factors limiting the high-gradient performance of normal conducting rf cavities in particle accelerators. In this paper, we search for the existence of previously unrecognized features related to the incidence of rf breakdowns by applying a machine learning strategy to high-gradient cavity data from CERN'… ▽ More

    Submitted 8 December, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

  12. Resource-efficient Deep Neural Networks for Automotive Radar Interference Mitigation

    Authors: Johanna Rock, Wolfgang Roth, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: Radar sensors are crucial for environment perception of driver assistance systems as well as autonomous vehicles. With a rising number of radar sensors and the so far unregulated automotive radar frequency band, mutual interference is inevitable and must be dealt with. Algorithms and models operating on radar data are required to run the early processing steps on specialized radar sensor hardware.… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 15 pages; published in IEEE Journal of Selected Topics in Signal Processing, Special Issue on Recent Advances in Automotive Radar Signal Processing, Volume: 15, Issue: 4, June 2021. arXiv admin note: text overlap with arXiv:2011.12706

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 4, pp. 927-940, June 2021

  13. arXiv:2110.01955  [pdf, other

    cs.LG cs.CV

    Distribution Mismatch Correction for Improved Robustness in Deep Neural Networks

    Authors: Alexander Fuchs, Christian Knoll, Franz Pernkopf

    Abstract: Deep neural networks rely heavily on normalization methods to improve their performance and learning behavior. Although normalization methods spurred the development of increasingly deep and efficient architectures, they also increase the vulnerability with respect to noise and input corruptions. In most applications, however, noise is ubiquitous and diverse; this can often lead to complete failur… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    ACM Class: I.2.0; I.4.0

  14. arXiv:2108.01991  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Lung Sound Classification Using Co-tuning and Stochastic Normalization

    Authors: Truc Nguyen, Franz Pernkopf

    Abstract: In this paper, we use pre-trained ResNet models as backbone architectures for classification of adventitious lung sounds and respiratory diseases. The knowledge of the pre-trained model is transferred by using vanilla fine-tuning, co-tuning, stochastic normalization and the combination of the co-tuning and stochastic normalization techniques. Furthermore, data augmentation in both time domain and… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

    Comments: Submitted to IEEE BE Transaction

  15. arXiv:2105.00929  [pdf, other

    eess.SP cs.CV

    Complex-valued Convolutional Neural Networks for Enhanced Radar Signal Denoising and Interference Mitigation

    Authors: Alexander Fuchs, Johanna Rock, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: Autonomous driving highly depends on capable sensors to perceive the environment and to deliver reliable information to the vehicles' control systems. To increase its robustness, a diversified set of sensors is used, including radar sensors. Radar is a vital contribution of sensory information, providing high resolution range as well as velocity measurements. The increased use of radar sensors in… ▽ More

    Submitted 29 April, 2021; originally announced May 2021.

    Journal ref: IEEE International Radar Conference 2021

  16. arXiv:2104.14921  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Crackle Detection In Lung Sounds Using Transfer Learning And Multi-Input Convolitional Neural Networks

    Authors: Truc Nguyen, Franz Pernkopf

    Abstract: Large annotated lung sound databases are publicly available and might be used to train algorithms for diagnosis systems. However, it might be a challenge to develop a well-performing algorithm for small non-public data, which have only a few subjects and show differences in recording devices and setup. In this paper, we use transfer learning to tackle the mismatch of the recording setup. This allo… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: Under Review in Proceeding of EMBC 2021

  17. arXiv:2104.06666  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    End-to-end Keyword Spotting using Neural Architecture Search and Quantization

    Authors: David Peter, Wolfgang Roth, Franz Pernkopf

    Abstract: This paper introduces neural architecture search (NAS) for the automatic discovery of end-to-end keyword spotting (KWS) models in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) operating on raw audio waveforms. After a suitable KWS model is found with NAS, we conduct quantization of weights and activations to… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.10138

  18. arXiv:2103.13443  [pdf, other

    cs.SD cs.LG eess.AS

    Blind Speech Separation and Dereverberation using Neural Beamforming

    Authors: Lukas Pfeifenberger, Franz Pernkopf

    Abstract: In this paper, we present the Blind Speech Separation and Dereverberation (BSSD) network, which performs simultaneous speaker separation, dereverberation and speaker identification in a single neural network. Speaker separation is guided by a set of predefined spatial cues. Dereverberation is performed by using neural beamforming, and speaker identification is aided by embedding vectors and triple… ▽ More

    Submitted 4 November, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: 13 pages, 9 figures

  19. arXiv:2012.10138  [pdf, other

    eess.AS cs.LG

    Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

    Authors: David Peter, Wolfgang Roth, Franz Pernkopf

    Abstract: This paper introduces neural architecture search (NAS) for the automatic discovery of small models for keyword spotting (KWS) in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) to maximize the classification accuracy while minimizing the number of operations per inference. Using NAS only, we were able to obtai… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

  20. Deep Interference Mitigation and Denoising of Real-World FMCW Radar Signals

    Authors: Johanna Rock, Mate Toth, Paul Meissner, Franz Pernkopf

    Abstract: Radar sensors are crucial for environment perception of driver assistance systems as well as autonomous cars. Key performance factors are a fine range resolution and the possibility to directly measure velocity. With a rising number of radar sensors and the so far unregulated automotive radar frequency band, mutual interference is inevitable and must be dealt with. Sensors must be capable of detec… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 2020 IEEE International Radar Conference (RADAR)

  21. arXiv:2011.12706  [pdf, other

    eess.SP cs.LG

    Quantized Neural Networks for Radar Interference Mitigation

    Authors: Johanna Rock, Wolfgang Roth, Paul Meissner, Franz Pernkopf

    Abstract: Radar sensors are crucial for environment perception of driver assistance systems as well as autonomous vehicles. Key performance factors are weather resistance and the possibility to directly measure velocity. With a rising number of radar sensors and the so far unregulated automotive radar frequency band, mutual interference is inevitable and must be dealt with. Algorithms and models operating o… ▽ More

    Submitted 1 December, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: ITEM Workshop at ECML-PKDD 2020

  22. arXiv:2010.11773  [pdf, other

    cs.LG cs.AI stat.ML

    On Resource-Efficient Bayesian Network Classifiers and Deep Neural Networks

    Authors: Wolfgang Roth, Günther Schindler, Holger Fröning, Franz Pernkopf

    Abstract: We present two methods to reduce the complexity of Bayesian network (BN) classifiers. First, we introduce quantization-aware training using the straight-through gradient estimator to quantize the parameters of BNs to few bits. Second, we extend a recently proposed differentiable tree-augmented naive Bayes (TAN) structure learning approach by also considering the model size. Both methods are motiva… ▽ More

    Submitted 22 September, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted at ICPR 2020, fixed Figure 5

  23. arXiv:2008.09566  [pdf, other

    cs.LG cs.AI stat.ML

    Differentiable TAN Structure Learning for Bayesian Network Classifiers

    Authors: Wolfgang Roth, Franz Pernkopf

    Abstract: Learning the structure of Bayesian networks is a difficult combinatorial optimization problem. In this paper, we consider learning of tree-augmented naive Bayes (TAN) structures for Bayesian network classifiers with discrete input features. Instead of performing a combinatorial optimization over the space of possible graph structures, the proposed method learns a distribution over graph structures… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: Accepted at PGM 2020

  24. arXiv:2007.11477  [pdf, other

    eess.AS cs.LG cs.SD

    Resource-Efficient Speech Mask Estimation for Multi-Channel Speech Enhancement

    Authors: Lukas Pfeifenberger, Matthias Zöhrer, Günther Schindler, Wolfgang Roth, Holger Fröning, Franz Pernkopf

    Abstract: While machine learning techniques are traditionally resource intensive, we are currently witnessing an increased interest in hardware and energy efficient approaches. This need for resource-efficient machine learning is primarily driven by the demand for embedded systems and their usage in ubiquitous computing and IoT applications. In this article, we provide a resource-efficient approach for mult… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

  25. arXiv:2007.11465  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Wasserstein Routed Capsule Networks

    Authors: Alexander Fuchs, Franz Pernkopf

    Abstract: Capsule networks offer interesting properties and provide an alternative to today's deep neural network architectures. However, recent approaches have failed to consistently achieve competitive results across different image datasets. We propose a new parameter efficient capsule architecture, that is able to tackle complex tasks by using neural networks trained with an approximate Wasserstein obje… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 8 pages, 3 figures

    ACM Class: I.2.10

  26. arXiv:2001.03048  [pdf, other

    stat.ML cs.LG

    Resource-Efficient Neural Networks for Embedded Systems

    Authors: Wolfgang Roth, Günther Schindler, Bernhard Klein, Robert Peharz, Sebastian Tschiatschek, Holger Fröning, Franz Pernkopf, Zoubin Ghahramani

    Abstract: While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel the interest in resource-efficient approaches. These approaches aim for a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. The development of such approaches is among the major challenges… ▽ More

    Submitted 7 April, 2024; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1812.02240; accepted at JMLR

  27. arXiv:1910.04536  [pdf, other

    cs.LG stat.ML

    Deep Structured Mixtures of Gaussian Processes

    Authors: Martin Trapp, Robert Peharz, Franz Pernkopf, Carl E. Rasmussen

    Abstract: Gaussian Processes (GPs) are powerful non-parametric Bayesian regression models that allow exact posterior inference, but exhibit high computational and memory costs. In order to improve scalability of GPs, approximate posterior inference is frequently employed, where a prominent class of approximation techniques is based on local GP experts. However, local-expert techniques proposed so far are ei… ▽ More

    Submitted 26 April, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: AISTATS 2020

  28. arXiv:1907.04708  [pdf, other

    cs.LG stat.ML

    Learning a Behavior Model of Hybrid Systems Through Combining Model-Based Testing and Machine Learning (Full Version)

    Authors: Bernhard K. Aichernig, Roderick Bloem, Masoud Ebrahimi, Martin Horn, Franz Pernkopf, Wolfgang Roth, Astrid Rupp, Martin Tappler, Markus Tranninger

    Abstract: Models play an essential role in the design process of cyber-physical systems. They form the basis for simulation and analysis and help in identifying design problems as early as possible. However, the construction of models that comprise physical and digital behavior is challenging. Therefore, there is considerable interest in learning such hybrid behavior by means of machine learning which requi… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: This is an extended version of the conference paper "Learning a Behavior Model of Hybrid Systems Through Combining Model-Based Testing and Machine Learning" accepted for presentation at IFIP-ICTSS 2019, the 31st International Conference on Testing Software and Systems in Paris, France

  29. arXiv:1906.10044  [pdf, other

    eess.SP cs.CV

    Complex Signal Denoising and Interference Mitigation for Automotive Radar Using Convolutional Neural Networks

    Authors: Johanna Rock, Mate Toth, Elmar Messner, Paul Meissner, Franz Pernkopf

    Abstract: Driver assistance systems as well as autonomous cars have to rely on sensors to perceive their environment. A heterogeneous set of sensors is used to perform this task robustly. Among them, radar sensors are indispensable because of their range resolution and the possibility to directly measure velocity. Since more and more radar sensors are deployed on the streets, mutual interference must be dea… ▽ More

    Submitted 25 June, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: FUSION 2019; 8 pages

  30. arXiv:1906.05180  [pdf, other

    cs.LG stat.ML

    Parameterized Structured Pruning for Deep Neural Networks

    Authors: Guenther Schindler, Wolfgang Roth, Franz Pernkopf, Holger Froening

    Abstract: As a result of the growing size of Deep Neural Networks (DNNs), the gap to hardware capabilities in terms of memory and compute increases. To effectively compress DNNs, quantization and connection pruning are usually considered. However, unconstrained pruning usually leads to unstructured parallelism, which maps poorly to massively parallel processors, and substantially reduces the efficiency of g… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

  31. arXiv:1905.10884  [pdf, other

    cs.LG stat.ML

    Bayesian Learning of Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Hong Ge, Franz Pernkopf, Zoubin Ghahramani

    Abstract: Sum-product networks (SPNs) are flexible density estimators and have received significant attention due to their attractive inference properties. While parameter learning in SPNs is well developed, structure learning leaves something to be desired: Even though there is a plethora of SPN structure learners, most of them are somewhat ad-hoc and based on intuition rather than a clear learning princip… ▽ More

    Submitted 4 November, 2019; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019; See conference page for supplement

  32. arXiv:1905.08196  [pdf, other

    cs.LG stat.ML

    Optimisation of Overparametrized Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Franz Pernkopf

    Abstract: It seems to be a pearl of conventional wisdom that parameter learning in deep sum-product networks is surprisingly fast compared to shallow mixture models. This paper examines the effects of overparameterization in sum-product networks on the speed of parameter optimisation. Using theoretical analysis and empirical experiments, we show that deep sum-product networks exhibit an implicit acceleratio… ▽ More

    Submitted 29 May, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

    Comments: Workshop on Tractable Probabilistic Models (TPM) at ICML 2019

  33. arXiv:1812.02240  [pdf, other

    cs.LG stat.ML

    Efficient and Robust Machine Learning for Real-World Systems

    Authors: Franz Pernkopf, Wolfgang Roth, Matthias Zoehrer, Lukas Pfeifenberger, Guenther Schindler, Holger Froening, Sebastian Tschiatschek, Robert Peharz, Matthew Mattina, Zoubin Ghahramani

    Abstract: While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation and the vision of the Internet-of-Things fuel the interest in resource efficient approaches. These approaches require a carefully chosen trade-off between performance and resource consumption in terms of computation and energy. On top of this, it is crucial to treat uncertainty in a consisten… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

  34. arXiv:1812.01339  [pdf, other

    stat.ML cs.LG

    Self-Guided Belief Propagation -- A Homotopy Continuation Method

    Authors: Christian Knoll, Adrian Weller, Franz Pernkopf

    Abstract: Belief propagation (BP) is a popular method for performing probabilistic inference on graphical models. In this work, we enhance BP and propose self-guided belief propagation (SBP) that incorporates the pairwise potentials only gradually. This homotopy continuation method converges to a unique solution and increases the accuracy without increasing the computational burden. We provide a formal anal… ▽ More

    Submitted 19 March, 2021; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  35. arXiv:1810.06897  [pdf, other

    cs.SD eess.AS

    Sound event detection using weakly-labeled semi-supervised data with GCRNNS, VAT and Self-Adaptive Label Refinement

    Authors: Robert Harb, Franz Pernkopf

    Abstract: In this paper, we present a gated convolutional recurrent neural network based approach to solve task 4, large-scale weakly labelled semi-supervised sound event detection in domestic environments, of the DCASE 2018 challenge. Gated linear units and a temporal attention layer are used to predict the onset and offset of sound events in 10s long audio clips. Whereby for training only weakly-labelled… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: Accepted at DCASE 2018 Workshop for oral presentation

  36. arXiv:1809.04400  [pdf, other

    cs.LG stat.ML

    Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product Networks

    Authors: Martin Trapp, Robert Peharz, Carl E. Rasmussen, Franz Pernkopf

    Abstract: While Gaussian processes (GPs) are the method of choice for regression tasks, they also come with practical difficulties, as inference cost scales cubic in time and quadratic in memory. In this paper, we introduce a natural and expressive way to tackle these problems, by incorporating GPs in sum-product networks (SPNs), a recently proposed tractable probabilistic model allowing exact and efficient… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Presented at the Workshop on Tractable Probabilistic Models (TPM 2018), ICML 2018

  37. arXiv:1807.02324  [pdf, other

    cs.LG stat.ML

    Sum-Product Networks for Sequence Labeling

    Authors: Martin Ratajczak, Sebastian Tschiatschek, Franz Pernkopf

    Abstract: We consider higher-order linear-chain conditional random fields (HO-LC-CRFs) for sequence modelling, and use sum-product networks (SPNs) for representing higher-order input- and output-dependent factors. SPNs are a recently introduced class of deep models for which exact and efficient inference can be performed. By combining HO-LC-CRFs with SPNs, expressive models over both the output labels and t… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

  38. arXiv:1806.00981  [pdf, other

    cs.LG cs.CR stat.ML

    Automatic Clustering of a Network Protocol with Weakly-Supervised Clustering

    Authors: Tobias Schrank, Franz Pernkopf

    Abstract: Abstraction is a fundamental part when learning behavioral models of systems. Usually the process of abstraction is manually defined by domain experts. This paper presents a method to perform automatic abstraction for network protocols. In particular a weakly supervised clustering algorithm is used to build an abstraction with a small vocabulary size for the widely used TLS protocol. To show the e… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.

  39. arXiv:1710.03444  [pdf, other

    stat.ML cs.LG

    Safe Semi-Supervised Learning of Sum-Product Networks

    Authors: Martin Trapp, Tamas Madl, Robert Peharz, Franz Pernkopf, Robert Trappl

    Abstract: In several domains obtaining class annotations is expensive while at the same time unlabelled data are abundant. While most semi-supervised approaches enforce restrictive assumptions on the data distribution, recent work has managed to learn semi-supervised models in a non-restrictive regime. However, so far such approaches have only been proposed for linear models. In this work, we introduce semi… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI), 2017

  40. arXiv:1605.06451  [pdf, other

    stat.ML math.AG

    Fixed Points of Belief Propagation -- An Analysis via Polynomial Homotopy Continuation

    Authors: Christian Knoll, Franz Pernkopf, Dhagash Mehta, Tianran Chen

    Abstract: Belief propagation (BP) is an iterative method to perform approximate inference on arbitrary graphical models. Whether BP converges and if the solution is a unique fixed point depends on both the structure and the parametrization of the model. To understand this dependence it is interesting to find \emph{all} fixed points. In this work, we formulate a set of polynomial equations, the solutions of… ▽ More

    Submitted 30 May, 2017; v1 submitted 20 May, 2016; originally announced May 2016.

  41. arXiv:1601.06180  [pdf, ps, other

    cs.AI cs.LG

    On the Latent Variable Interpretation in Sum-Product Networks

    Authors: Robert Peharz, Robert Gens, Franz Pernkopf, Pedro Domingos

    Abstract: One of the central themes in Sum-Product networks (SPNs) is the interpretation of sum nodes as marginalized latent variables (LVs). This interpretation yields an increased syntactic or semantic structure, allows the application of the EM algorithm and to efficiently perform MPE inference. In literature, the LV interpretation was justified by explicitly introducing the indicator variables correspon… ▽ More

    Submitted 28 October, 2016; v1 submitted 22 January, 2016; originally announced January 2016.

    Comments: Revised version, accepted for publication in IEEE Transactions on Machine Intelligence and Pattern Analysis (TPAMI). Shortened and revised Section 4: Thanks to our reviewers, pointing out that Theorem 2 holds for selective SPNs. Added paragraph in Section 2.1, relating sizes of original/augmented SPNs. Fixed typos, rephrased sentences, revised references

    MSC Class: 62

  42. arXiv:1206.6431  [pdf

    cs.LG stat.ML

    Exact Maximum Margin Structure Learning of Bayesian Networks

    Authors: Robert Peharz, Franz Pernkopf

    Abstract: Recently, there has been much interest in finding globally optimal Bayesian network structures. These techniques were developed for generative scores and can not be directly extended to discriminative scores, as desired for classification. In this paper, we propose an exact method for finding network structures maximizing the probabilistic soft margin, a successfully applied discriminative score.… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: ICML