Skip to main content

Showing 1–50 of 55 results for author: Geiger, B C

.
  1. arXiv:2406.02146  [pdf, other

    cs.LG

    Activation Bottleneck: Sigmoidal Neural Networks Cannot Forecast a Straight Line

    Authors: Maximilian Toller, Hussain Hussain, Bernhard C Geiger

    Abstract: A neural network has an activation bottleneck if one of its hidden layers has a bounded image. We show that networks with an activation bottleneck cannot forecast unbounded sequences such as straight lines, random walks, or any sequence with a trend: The difference between prediction and ground truth becomes arbitrary large, regardless of the training procedure. Widely-used neural network architec… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2402.09090  [pdf, other

    nlin.AO

    Software in the natural world: A computational approach to hierarchical emergence

    Authors: Fernando E. Rosas, Bernhard C. Geiger, Andrea I Luppi, Anil K. Seth, Daniel Polani, Michael Gastpar, Pedro A. M. Mediano

    Abstract: Understanding the functional architecture of complex systems is crucial to illuminate their inner workings and enable effective methods for their prediction and control. Recent advances have introduced tools to characterise emergent macroscopic levels; however, while these approaches are successful in identifying when emergence takes place, they are limited in the extent they can determine how it… ▽ More

    Submitted 5 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 33 pages, 13 figures

  3. arXiv:2402.08313  [pdf, other

    cs.LG

    Approximating Families of Sharp Solutions to Fisher's Equation with Physics-Informed Neural Networks

    Authors: Franz M. Rohrhofer, Stefan Posch, Clemens Gößnitzer, Bernhard C. Geiger

    Abstract: This paper employs physics-informed neural networks (PINNs) to solve Fisher's equation, a fundamental representation of a reaction-diffusion system with both simplicity and significance. The focus lies specifically in investigating Fisher's equation under conditions of large reaction rate coefficients, wherein solutions manifest as traveling waves, posing a challenge for numerical methods due to t… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  4. arXiv:2308.01954  [pdf, other

    cs.LG cs.CE

    Bringing Chemistry to Scale: Loss Weight Adjustment for Multivariate Regression in Deep Learning of Thermochemical Processes

    Authors: Franz M. Rohrhofer, Stefan Posch, Clemens Gößnitzer, José M. García-Oliver, Bernhard C. Geiger

    Abstract: Flamelet models are widely used in computational fluid dynamics to simulate thermochemical processes in turbulent combustion. These models typically employ memory-expensive lookup tables that are predetermined and represent the combustion process to be simulated. Artificial neural networks (ANNs) offer a deep learning approach that can store this tabular data using a small number of network weight… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 8 pages. Part of Scientific Computing 2023 Conference Proceedings (ISBN e-Book: 978-3-903318-20-5)

  5. arXiv:2308.01743  [pdf, other

    cs.CE cs.LG

    Finding the Optimum Design of Large Gas Engines Prechambers Using CFD and Bayesian Optimization

    Authors: Stefan Posch, Clemens Gößnitzer, Franz Rohrhofer, Bernhard C. Geiger, Andreas Wimmer

    Abstract: The turbulent jet ignition concept using prechambers is a promising solution to achieve stable combustion at lean conditions in large gas engines, leading to high efficiency at low emission levels. Due to the wide range of design and operating parameters for large gas engine prechambers, the preferred method for evaluating different designs is computational fluid dynamics (CFD), as testing in test… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 9 pages. Part of Scientific Computing 2023 Conference Proceedings (ISBN e-Book: 978-3-903318-20-5)

  6. arXiv:2303.00596  [pdf, other

    cs.IT

    Information Plane Analysis for Dropout Neural Networks

    Authors: Linara Adilova, Bernhard C. Geiger, Asja Fischer

    Abstract: The information-theoretic framework promises to explain the predictive power of neural networks. In particular, the information plane analysis, which measures mutual information (MI) between input and representation as well as representation and output, should give rich insights into the training process. This approach, however, was shown to strongly depend on the choice of estimator of the MI. Th… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Published as a conference paper at ICLR2023

  7. Cluster Purging: Efficient Outlier Detection based on Rate-Distortion Theory

    Authors: Maximilian B. Toller, Bernhard C. Geiger, Roman Kern

    Abstract: Rate-distortion theory-based outlier detection builds upon the rationale that a good data compression will encode outliers with unique symbols. Based on this rationale, we propose Cluster Purging, which is an extension of clustering-based outlier detection. This extension allows one to assess the representivity of clusterings, and to find data that are best represented by individual unique cluster… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Journal ref: IEEE Transactions on Knowledge and Data Engineering 35 (2023) 1270-1282

  8. Robust Bayesian Target Value Optimization

    Authors: Johannes G. Hoffer, Sascha Ranftl, Bernhard C. Geiger

    Abstract: We consider the problem of finding an input to a stochastic black box function such that the scalar output of the black box function is as close as possible to a target value in the sense of the expected squared error. While the optimization of stochastic black boxes is classic in (robust) Bayesian optimization, the current approaches based on Gaussian processes predominantly focus either on i) ma… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: 24 pages; submitted to Computers and Industrial Engineering

    MSC Class: 90C26; 60G15 ACM Class: G.1.6

    Journal ref: Computers & Industrial Engineering, vol. 180, 2023, 109279

  9. arXiv:2211.01446  [pdf, other

    cs.LG

    FUNCK: Information Funnels and Bottlenecks for Invariant Representation Learning

    Authors: João Machado de Freitas, Bernhard C. Geiger

    Abstract: Learning invariant representations that remain useful for a downstream task is still a key challenge in machine learning. We investigate a set of related information funnels and bottleneck problems that claim to learn invariant representations from the data. We also propose a new element to this family of information-theoretic objectives: The Conditional Privacy Funnel with Side Information, which… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 28 pages

  10. Compressed Hierarchical Representations for Multi-Task Learning and Task Clustering

    Authors: João Machado de Freitas, Sebastian Berg, Bernhard C. Geiger, Manfred Mücke

    Abstract: In this paper, we frame homogeneous-feature multi-task learning (MTL) as a hierarchical representation learning problem, with one task-agnostic and multiple task-specific latent representations. Drawing inspiration from the information bottleneck principle and assuming an additive independent noise model between the task-agnostic and task-specific latent representations, we limit the information c… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Accepted by the 2022 International Joint Conference on Neural Networks (IJCNN 2022)

    Journal ref: 2022 International Joint Conference on Neural Networks (IJCNN), 2022

  11. Generating Simple Directed Social Network Graphs for Information Spreading

    Authors: Christoph Schweimer, Christine Gfrerer, Florian Lugstein, David Pape, Jan A. Velimsky, Robert Elsässer, Bernhard C. Geiger

    Abstract: Online social networks are a dominant medium in everyday life to stay in contact with friends and to share information. In Twitter, users can connect with other users by following them, who in turn can follow back. In recent years, researchers studied several properties of social networks and designed random graph models to describe them. Many of these approaches either focus on the generation of… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 11 pages, 7 figures; published at ACM Web Conference 2022

    Journal ref: Proc. ACM Web Conf., p. 1475-1485, Apr. 2022

  12. arXiv:2204.13896  [pdf, other

    cs.IT math.PR

    Information-Theoretic Reduction of Markov Chains

    Authors: Bernhard C. Geiger

    Abstract: We survey information-theoretic approaches to the reduction of Markov chains. Our survey is structured in two parts: The first part considers Markov chain coarse graining, which focuses on projecting the Markov chain to a process on a smaller state space that is informative}about certain quantities of interest. The second part considers Markov chain model reduction, which focuses on replacing the… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 16 pages, 3 figures; survey paper

    MSC Class: 60J10; 94A16;

  13. arXiv:2203.13648  [pdf, other

    cs.LG

    On the Role of Fixed Points of Dynamical Systems in Training Physics-Informed Neural Networks

    Authors: Franz M. Rohrhofer, Stefan Posch, Clemens Gößnitzer, Bernhard C. Geiger

    Abstract: This paper empirically studies commonly observed training difficulties of Physics-Informed Neural Networks (PINNs) on dynamical systems. Our results indicate that fixed points which are inherent to these systems play a key role in the optimization of the in PINNs embedded physics loss function. We observe that the loss landscape exhibits local optima that are shaped by the presence of fixed points… ▽ More

    Submitted 13 February, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: 22 pages

    Journal ref: Transactions on Machine Learning Research, 2023(1)

  14. Knock Detection in Combustion Engine Time Series Using a Theory-Guided 1D Convolutional Neural Network Approach

    Authors: Andreas B. Ofner, Achilles Kefalas, Stefan Posch, Bernhard C. Geiger

    Abstract: This paper introduces a method for the detection of knock occurrences in an internal combustion engine (ICE) using a 1D convolutional neural network trained on in-cylinder pressure data. The model architecture was based on considerations regarding the expected frequency characteristics of knocking combustion. To aid the feature extraction, all cycles were reduced to 60° CA long windows, with no fu… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: accepted for publication in IEEE/ASME Transactions on Mechatronics. (c) IEEE 2022

    Journal ref: IEEE/ASME Trans. on Mechatronics, 27(5):4101-4111, Oct. 2022

  15. Semi-Supervised Clustering via Information-Theoretic Markov Chain Aggregation

    Authors: Sophie Steger, Bernhard C. Geiger, Marek Smieja

    Abstract: We connect the problem of semi-supervised clustering to constrained Markov aggregation, i.e., the task of partitioning the state space of a Markov chain. We achieve this connection by considering every data point in the dataset as an element of the Markov chain's state space, by defining the transition probabilities between states via similarities between corresponding data points, and by incorpor… ▽ More

    Submitted 7 February, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: 13 pages, 6 figures; this is an extended version of a short paper accepted at ACM SAC 2022 (minor changes to the text; error in source code corrected)

    ACM Class: H.1.1; I.5.3; I.2.0

    Journal ref: Proc. of ACM/SIGAPP Symposium on Applied Computing, pp. 1136-1139, 2022

  16. Data vs. Physics: The Apparent Pareto Front of Physics-Informed Neural Networks

    Authors: Franz M. Rohrhofer, Stefan Posch, Clemens Gößnitzer, Bernhard C. Geiger

    Abstract: Physics-informed neural networks (PINNs) have emerged as a promising deep learning method, capable of solving forward and inverse problems governed by differential equations. Despite their recent advance, it is widely acknowledged that PINNs are difficult to train and often require a careful tuning of loss weights when data and physics loss functions are combined by scalarization of a multi-object… ▽ More

    Submitted 10 June, 2024; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 11 pages

    Journal ref: IEEE Access, vol. 11, pp. 86252-86261, 2023

  17. arXiv:2102.00191  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Importance of feature engineering and database selection in a machine learning model: A case study on carbon crystal structures

    Authors: Franz M. Rohrhofer, Santanu Saha, Simone Di Cataldo, Bernhard C. Geiger, Wolfgang von der Linden, Lilia Boeri

    Abstract: Drive towards improved performance of machine learning models has led to the creation of complex features representing a database of condensed matter systems. The complex features, however, do not offer an intuitive explanation on which physical attributes do improve the performance. The effect of the database on the performance of the trained model is often neglected. In this work we seek to unde… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

    Comments: 18 pages, 11 figures

  18. arXiv:2101.08623  [pdf, other

    cs.SI cs.LG physics.soc-ph

    Synwalk -- Community Detection via Random Walk Modelling

    Authors: Christian Toth, Denis Helic, Bernhard C. Geiger

    Abstract: Complex systems, abstractly represented as networks, are ubiquitous in everyday life. Analyzing and understanding these systems requires, among others, tools for community detection. As no single best community detection algorithm can exist, robustness across a wide variety of problem settings is desirable. In this work, we present Synwalk, a random walk-based community detection method. Synwalk b… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: 31 pages, 13 figures

    Journal ref: Data Mining and Knowledge Discovery, 2022, Special Issue of the Journal Track of ECML PKDD 2022

  19. arXiv:2008.07865  [pdf, other

    cs.LG stat.ML

    A Formally Robust Time Series Distance Metric

    Authors: Maximilian Toller, Bernhard C. Geiger, Roman Kern

    Abstract: Distance-based classification is among the most competitive classification methods for time series data. The most critical component of distance-based classification is the selected distance function. Past research has proposed various different distance metrics or measures dedicated to particular aspects of real-world time series data, yet there is an important aspect that has not been considered… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: MileTS Workshop at KDD'19

  20. On Functions of Markov Random Fields

    Authors: Bernhard C. Geiger, Ali Al-Bashabsheh

    Abstract: We derive two sufficient conditions for a function of a Markov random field (MRF) on a given graph to be a MRF on the same graph. The first condition is information-theoretic and parallels a recent information-theoretic characterization of lumpability of Markov chains. The second condition, which is easier to check, is based on the potential functions of the corresponding Gibbs field. We illustrat… ▽ More

    Submitted 15 October, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: 7 pages, submitted to IEEE Information Theory Workshop

    Journal ref: Proc. IEEE Information Theory Workshop, pp. 316-320, 2021. (c) IEEE

  21. arXiv:2003.09671  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    On Information Plane Analyses of Neural Network Classifiers -- A Review

    Authors: Bernhard C. Geiger

    Abstract: We review the current literature concerned with information plane analyses of neural network classifiers. While the underlying information bottleneck theory and the claim that information-theoretic compression is causally linked to generalization are plausible, empirical evidence was found to be both supporting and conflicting. We review this evidence together with a detailed analysis of how the r… ▽ More

    Submitted 10 June, 2021; v1 submitted 21 March, 2020; originally announced March 2020.

    Comments: 12 pages, 3 figures; accepted for publication in IEEE Transactions on Neural Networks and Learning Systems. (c) 2021 IEEE

    Journal ref: IEEE Trans. Neural Networks and Learning Systems 33(12):7039-7051

  22. arXiv:1906.09333  [pdf, other

    cs.LG cs.AI

    SeGMA: Semi-Supervised Gaussian Mixture Auto-Encoder

    Authors: Marek Śmieja, Maciej Wołczyk, Jacek Tabor, Bernhard C. Geiger

    Abstract: We propose a semi-supervised generative model, SeGMA, which learns a joint probability distribution of data and their classes and which is implemented in a typical Wasserstein auto-encoder framework. We choose a mixture of Gaussians as a target distribution in latent space, which provides a natural splitting of data into clusters. To connect Gaussian components with correct classes, we use a small… ▽ More

    Submitted 27 August, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

  23. arXiv:1906.02576  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Class-Conditional Compression and Disentanglement: Bridging the Gap between Neural Networks and Naive Bayes Classifiers

    Authors: Rana Ali Amjad, Bernhard C. Geiger

    Abstract: In this draft, which reports on work in progress, we 1) adapt the information bottleneck functional by replacing the compression term by class-conditional compression, 2) relax this functional using a variational bound related to class-conditional disentanglement, 3) consider this functional as a training objective for stochastic neural networks, and 4) show that the latent representations are lea… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: draft; work in progress

  24. arXiv:1812.02059  [pdf, other

    cs.IT

    A Short Note on the Jensen-Shannon Divergence between Simple Mixture Distributions

    Authors: Bernhard C. Geiger

    Abstract: This short note presents results about the symmetric Jensen-Shannon divergence between two discrete mixture distributions $p_1$ and $p_2$. Specifically, for $i=1,2$, $p_i$ is the mixture of a common distribution $q$ and a distribution $\tilde{p}_i$ with mixture proportion $λ_i$. In general, $\tilde{p}_1\neq \tilde{p}_2$ and $λ_1\neqλ_2$. We provide experimental and theoretical insight to the behav… ▽ More

    Submitted 6 December, 2018; v1 submitted 5 December, 2018; originally announced December 2018.

    Comments: four-page tech note

  25. arXiv:1804.06679  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation

    Authors: Rana Ali Amjad, Kairen Liu, Bernhard C. Geiger

    Abstract: In this work, we investigate the use of three information-theoretic quantities -- entropy, mutual information with the class variable, and a class selectivity measure based on Kullback-Leibler divergence -- to understand and study the behavior of already trained fully-connected feed-forward neural networks. We analyze the connection between these information-theoretic quantities and classification… ▽ More

    Submitted 9 June, 2021; v1 submitted 18 April, 2018; originally announced April 2018.

    Comments: 12 pages; accepted for publication in IEEE Transactions on Neural Networks and Learning Systems

    Journal ref: IEEE Trans. Neural Networks and Learning Systems 33(12):7842-7852

  26. Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle

    Authors: Rana Ali Amjad, Bernhard C. Geiger

    Abstract: In this theory paper, we investigate training deep neural networks (DNNs) for classification via minimizing the information bottleneck (IB) functional. We show that the resulting optimization problem suffers from two severe issues: First, for deterministic DNNs, either the IB functional is infinite for almost all values of network parameters, making the optimization problem ill-posed, or it is pie… ▽ More

    Submitted 11 April, 2019; v1 submitted 27 February, 2018; originally announced February 2018.

    Comments: 16 pages, to appear in IEEE Trans. Pattern Analysis and Machine Intelligence

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 42(9):2225-2239, 2020. (c) IEEE

  27. Co-Clustering via Information-Theoretic Markov Aggregation

    Authors: Clemens Bloechl, Rana Ali Amjad, Bernhard C. Geiger

    Abstract: We present an information-theoretic cost function for co-clustering, i.e., for simultaneous clustering of two sets based on similarities between their elements. By constructing a simple random walk on the corresponding bipartite graph, our cost function is derived from a recently proposed generalized framework for information-theoretic Markov chain aggregation. The goal of our cost function is to… ▽ More

    Submitted 15 June, 2018; v1 submitted 2 January, 2018; originally announced January 2018.

    Comments: accepted for publication in IEEE Trans. on Knowledge and Data Engineering; (c) 2018 IEEE

  28. On the Information Dimension of Multivariate Gaussian Processes

    Authors: Bernhard C. Geiger, Tobias Koch

    Abstract: The authors have recently defined the Rényi information dimension rate $d(\{X_t\})$ of a stationary stochastic process $\{X_t,\,t\in\mathbb{Z}\}$ as the entropy rate of the uniformly-quantized process divided by minus the logarithm of the quantizer step size $1/m$ in the limit as $m\to\infty$ (B. Geiger and T. Koch, "On the information dimension rate of stochastic processes," in Proc. IEEE Int. Sy… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: This work will be presented in part at the 2018 International Zurich Seminar on Information and Communication

    Journal ref: IEEE Trans. on Information Theory 65(10):6496-6518. (C) IEEE 2019

  29. arXiv:1709.05907  [pdf, ps, other

    eess.SY cs.IT math.OC

    A Generalized Framework for Kullback-Leibler Markov Aggregation

    Authors: Rana Ali Amjad, Clemens Blöchl, Bernhard C. Geiger

    Abstract: This paper proposes an information-theoretic cost function for aggregating a Markov chain via a (possibly stochastic) map**. The cost function is motivated by two objectives: 1) The process obtained by observing the Markov chain through the map** should be close to a Markov chain, and 2) the aggregated Markov chain should retain as much of the temporal dependence structure of the original Mark… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: 12 pages, 3 figures; submitted to a journal

  30. Semi-supervised cross-entropy clustering with information bottleneck constraint

    Authors: Marek Śmieja, Bernhard C. Geiger

    Abstract: In this paper, we propose a semi-supervised clustering method, CEC-IB, that models data with a set of Gaussian distributions and that retrieves clusters based on a partial labeling provided by the user (partition-level side information). By combining the ideas from cross-entropy clustering (CEC) with those from the information bottleneck method (IB), our method trades between three conflicting goa… ▽ More

    Submitted 3 May, 2017; originally announced May 2017.

    Journal ref: Information Sciences, vol. 421, Dec. 2017, pp. 254-271

  31. On the Information Dimension of Stochastic Processes

    Authors: Bernhard C. Geiger, Tobias Koch

    Abstract: In 1959, Rényi proposed the information dimension and the $d$-dimensional entropy to measure the information content of general random variables. This paper proposes a generalization of information dimension to stochastic processes by defining the information dimension rate as the entropy rate of the uniformly-quantized stochastic process divided by minus the logarithm of the quantizer step size… ▽ More

    Submitted 11 June, 2019; v1 submitted 2 February, 2017; originally announced February 2017.

    Comments: 23 pages, double column. Accepted for publication in the IEEE Transactions on Information Theory, copyright (c) 2019 IEEE. This version supersedes our previous submissions arXiv:1702.00645v2 and arXiv:1712.07863

  32. Divergence Scaling of Fixed-Length, Binary-Output, One-to-One Distribution Matching

    Authors: Patrick Schulte, Bernhard C. Geiger

    Abstract: Distribution matching is the process of invertibly map** a uniformly distributed input sequence onto sequences that approximate the output of a desired discrete memoryless source. The special case of a binary output alphabet and one-to-one map** is studied. A fixed-length distribution matcher is proposed that is optimal in the sense of minimizing the unnormalized informational divergence betwe… ▽ More

    Submitted 16 May, 2017; v1 submitted 25 January, 2017; originally announced January 2017.

    Comments: 5 pages, 1 figure; Lemma 6 updated; This work will be presented at ISIT 2017

    Journal ref: Proc. IEEE Int. Symp. on Information Theory 2017, pp. 3075-3079

  33. A Sufficient Condition for a Unique Invariant Distribution of a Higher-Order Markov Chain

    Authors: Bernhard C. Geiger

    Abstract: We derive a sufficient condition for a $k$-th order homogeneous Markov chain $\mathbf{Z}$ with finite alphabet $\mathcal{Z}$ to have a unique invariant distribution on $\mathcal{Z}^k$. Specifically, let $\mathbf{X}$ be a first-order, stationary Markov chain with finite alphabet $\mathcal{X}$ and a single recurrent class, let $g{:}\ \mathcal{X}\to\mathcal{Z}$ be non-injective, and define the (possi… ▽ More

    Submitted 7 April, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

    Comments: 11 pages, 1 figure

    MSC Class: 60J10

    Journal ref: Statistics & Probability Letters, vol. 130, Nov. 2017

  34. arXiv:1610.07304  [pdf, other

    cs.IT

    A Rate-Distortion Approach to Caching

    Authors: Roy Timo, Shirin Saeedi Bidokhti, Michèle Wigger, Bernhard C. Geiger

    Abstract: This paper takes a rate-distortion approach to understanding the information-theoretic laws governing cache-aided communications systems. Specifically, we characterise the optimal tradeoffs between the delivery rate, cache capacity and reconstruction distortions for a single-user problem and some special cases of a two-user problem. Our analysis considers discrete memoryless sources, expected- and… ▽ More

    Submitted 24 October, 2016; originally announced October 2016.

  35. arXiv:1608.04872  [pdf, ps, other

    cs.IT cs.IR cs.LG

    Hard Clusters Maximize Mutual Information

    Authors: Bernhard C. Geiger, Rana Ali Amjad

    Abstract: In this paper, we investigate mutual information as a cost function for clustering, and show in which cases hard, i.e., deterministic, clusters are optimal. Using convexity properties of mutual information, we show that certain formulations of the information bottleneck problem are solved by hard clusters. Similarly, hard clusters are optimal for the information-theoretic co-clustering problem tha… ▽ More

    Submitted 17 August, 2016; originally announced August 2016.

  36. arXiv:1608.04637  [pdf, ps, other

    cs.IT

    Higher-Order Kullback-Leibler Aggregation of Markov Chains

    Authors: Bernhard C. Geiger, Yuchen Wu

    Abstract: We consider the problem of reducing a first-order Markov chain on a large alphabet to a higher-order Markov chain on a small alphabet. We present information-theoretic cost functions that are related to predictability and lumpability, show relations between these cost functions, and discuss heuristics to minimize them. Our experiments suggest that the generalization to higher orders is useful for… ▽ More

    Submitted 16 August, 2016; originally announced August 2016.

  37. Greedy Algorithms for Optimal Distribution Approximation

    Authors: Bernhard C. Geiger, Georg Böcherer

    Abstract: The approximation of a discrete probability distribution $\mathbf{t}$ by an $M$-type distribution $\mathbf{p}$ is considered. The approximation error is measured by the informational divergence $\mathbb{D}(\mathbf{t}\Vert\mathbf{p})$, which is an appropriate measure, e.g., in the context of data compression. Properties of the optimal approximation are derived and bounds on the approximation error… ▽ More

    Submitted 22 January, 2016; originally announced January 2016.

    Comments: 5 pages

    Journal ref: Entropy 2016, 18(7), 262

  38. Graph-Based Lossless Markov Lum**s

    Authors: Bernhard C. Geiger, Christoph Hofer-Temmel

    Abstract: We use results from zero-error information theory to determine the set of non-injective functions through which a Markov chain can be projected without losing information. These lum** functions can be found by clique partitioning of a graph related to the Markov chain. Lossless lum** is made possible by exploiting the (sufficiently sparse) temporal structure of the Markov chain. Eliminating ed… ▽ More

    Submitted 22 January, 2016; v1 submitted 22 September, 2015; originally announced September 2015.

    Comments: 6 pages

    MSC Class: 60J10; 68R10

    Journal ref: Proc. IEEE Int. Sym. on Information Theory (ISIT) 2015

  39. The Fractality of Polar and Reed-Muller Codes

    Authors: Bernhard C. Geiger

    Abstract: The generator matrices of polar codes and Reed-Muller codes are obtained by selecting rows from the Kronecker product of a lower-triangular binary square matrix. For polar codes, the selection is based on the Bhattacharyya parameter of the row, which is closely related to the error probability of the corresponding input bit under sequential decoding. For Reed-Muller codes, the selection is based o… ▽ More

    Submitted 25 February, 2016; v1 submitted 17 June, 2015; originally announced June 2015.

    Comments: 9 pages, one figure

    Journal ref: a slightly extended version of this manuscript is published in Entropy 2018, 20(1), 70

  40. arXiv:1506.04518  [pdf, ps, other

    math.PR cs.IT

    Cepstral Analysis of Random Variables: Muculants

    Authors: Christian Knoll, Bernhard C. Geiger, Gernot Kubin

    Abstract: An alternative parametric description for discrete random variables, called muculants, is proposed. In contrast to cumulants, muculants are based on the Fourier series expansion, rather than on the Taylor series expansion, of the logarithm of the characteristic function. We utilize results from cepstral theory to derive elementary properties of muculants, some of which demonstrate behavior superio… ▽ More

    Submitted 13 November, 2017; v1 submitted 15 June, 2015; originally announced June 2015.

    Comments: 5 pages

    MSC Class: 60E10

  41. arXiv:1412.1770  [pdf

    physics.bio-ph physics.flu-dyn q-bio.QM

    Non-constrictive bead immobilization leading to decreased and uniform shear stress in microfluidic bead-based ELISA

    Authors: Kinshuk Mitra, Brett C. Geiger, Preethi Chidambaram, Aaron P. Maharry, Ronald X. Xu, Michael F. Tweedle

    Abstract: Microfluidic biosensors have been utilized for sensing a wide range of antigens using numerous configurations. Bead based microfluidic sensors have been a popular modality due to the plug and play nature of analyte choice and the favorable geometry of spherical sensor scaffolds. While constriction of beads against fluid flow remains a popular method to immobilize the sensor, it results in poor flu… ▽ More

    Submitted 3 December, 2014; originally announced December 2014.

    Comments: 15 pages, 11 figures

  42. arXiv:1310.8487  [pdf, ps, other

    cs.IT

    Information Loss and Anti-Aliasing Filters in Multirate Systems

    Authors: Bernhard C. Geiger, Gernot Kubin

    Abstract: This work investigates the information loss in a decimation system, i.e., in a downsampler preceded by an anti-aliasing filter. It is shown that, without a specific signal model in mind, the anti-aliasing filter cannot reduce information loss, while, e.g., for a simple signal-plus-noise model it can. For the Gaussian case, the optimal anti-aliasing filter is shown to coincide with the one obtained… ▽ More

    Submitted 7 July, 2014; v1 submitted 31 October, 2013; originally announced October 2013.

    Comments: 12 pages; a shorter version of this paper was published at the 2014 International Zurich Seminar on Communications

    Journal ref: Proc. Int. Zurich Seminar on Communications, 2014, pp. 148 - 151

  43. Optimal Quantization for Distribution Synthesis

    Authors: Georg Böcherer, Bernhard C. Geiger

    Abstract: Finite precision approximations of discrete probability distributions are considered, applicable for distribution synthesis, e.g., probabilistic sha**. Two algorithms are presented that find the optimal $M$-type approximation $Q$ of a distribution $P$ in terms of the variational distance $| Q-P|_1$ and the informational divergence $\mathbb{D}(Q| P)$. Bounds on the approximation errors are derive… ▽ More

    Submitted 19 January, 2016; v1 submitted 25 July, 2013; originally announced July 2013.

    Comments: Submitted to the IEEE Transactions on Information Theory

  44. Optimal Kullback-Leibler Aggregation via Information Bottleneck

    Authors: Bernhard C. Geiger, Tatjana Petrov, Gernot Kubin, Heinz Koeppl

    Abstract: In this paper, we present a method for reducing a regular, discrete-time Markov chain (DTMC) to another DTMC with a given, typically much smaller number of states. The cost of reduction is defined as the Kullback-Leibler divergence rate between a projection of the original process through a partition function and a DTMC on the correspondingly partitioned state space. Finding the reduced model with… ▽ More

    Submitted 10 February, 2015; v1 submitted 24 April, 2013; originally announced April 2013.

    Comments: 13 pages, 4 figures

    Journal ref: IEEE Trans. Autom. Control, vol. 60, no. 4, p. 1010 - 1022, 2015

  45. arXiv:1304.5075  [pdf, ps, other

    cs.IT

    On the Rate of Information Loss in Memoryless Systems

    Authors: Bernhard C. Geiger, Gernot Kubin

    Abstract: In this work we present results about the rate of (relative) information loss induced by passing a real-valued, stationary stochastic process through a memoryless system. We show that for a special class of systems the information loss rate is closely related to the difference of differential entropy rates of the input and output processes. It is further shown that the rate of (relative) informati… ▽ More

    Submitted 18 April, 2013; originally announced April 2013.

    Comments: 9 pages, 4 figures; submitted to a conference

  46. arXiv:1304.0920  [pdf, ps, other

    cs.IT

    Information-Preserving Markov Aggregation

    Authors: Bernhard C. Geiger, Christoph Temmel

    Abstract: We present a sufficient condition for a non-injective function of a Markov chain to be a second-order Markov chain with the same entropy rate as the original chain. This permits an information-preserving state space reduction by merging states or, equivalently, lossless compression of a Markov source on a sample-by-sample basis. The cardinality of the reduced state space is bounded from below by t… ▽ More

    Submitted 24 July, 2013; v1 submitted 3 April, 2013; originally announced April 2013.

    Comments: 7 pages, 3 figures, 2 tables

    Journal ref: Proc. IEEE Information Theory Workshop, 2013, pp. 258-262

  47. arXiv:1303.6409  [pdf, ps, other

    cs.IT

    Information Measures for Deterministic Input-Output Systems

    Authors: Bernhard C. Geiger, Gernot Kubin

    Abstract: In this work the information loss in deterministic, memoryless systems is investigated by evaluating the conditional entropy of the input random variable given the output random variable. It is shown that for a large class of systems the information loss is finite, even if the input is continuously distributed. Based on this finiteness, the problem of perfectly reconstructing the input is addresse… ▽ More

    Submitted 17 April, 2013; v1 submitted 26 March, 2013; originally announced March 2013.

    Comments: 23 pages, 12 figures; submitted

  48. Lum**s of Markov chains, entropy rate preservation, and higher-order lumpability

    Authors: Bernhard C. Geiger, Christoph Temmel

    Abstract: A lum** of a Markov chain is a coordinate-wise projection of the chain. We characterise the entropy rate preservation of a lum** of an aperiodic and irreducible Markov chain on a finite state space by the random growth rate of the cardinality of the realisable preimage of a finite-length trajectory of the lumped chain and by the information needed to reconstruct original trajectories from thei… ▽ More

    Submitted 20 April, 2015; v1 submitted 18 December, 2012; originally announced December 2012.

    MSC Class: 60J10 (60G17 94A17 60G10 65C40)

  49. arXiv:1205.6935  [pdf, ps, other

    cs.IT

    Signal Enhancement as Minimization of Relevant Information Loss

    Authors: Bernhard C. Geiger, Gernot Kubin

    Abstract: We introduce the notion of relevant information loss for the purpose of casting the signal enhancement problem in information-theoretic terms. We show that many algorithms from machine learning can be reformulated using relevant information loss, which allows their application to the aforementioned problem. As a particular example we analyze principle component analysis for dimensionality reductio… ▽ More

    Submitted 16 January, 2013; v1 submitted 31 May, 2012; originally announced May 2012.

    Comments: 9 pages; 4 figures; accepted for presentation at a conference

    Journal ref: Proc. ITG Conf. on Systems, Communication and Coding, 2013, pp. 1-6

  50. Relative Information Loss in the PCA

    Authors: Bernhard C. Geiger, Gernot Kubin

    Abstract: In this work we analyze principle component analysis (PCA) as a deterministic input-output system. We show that the relative information loss induced by reducing the dimensionality of the data after performing the PCA is the same as in dimensionality reduction without PCA. Finally, we analyze the case where the PCA uses the sample covariance matrix to compute the rotation. If the rotation matrix i… ▽ More

    Submitted 31 July, 2012; v1 submitted 2 April, 2012; originally announced April 2012.

    Comments: 9 pages, 4 figure; extended version of a paper accepted for publication

    Journal ref: Proc. IEEE Information Theory Workshop, 2012, pp. 562 - 566