Search | arXiv e-print repository

FPGA-Placement via Quantum Annealing

Authors: Thore Gerlach, Stefan Knipp, David Biesner, Stelios Emmanouilidis, Klaus Hauber, Nico Piatkowski

Abstract: Field-Programmable Gate Arrays (FPGAs) have asserted themselves as vital assets in contemporary computing by offering adaptable, reconfigurable hardware platforms. FPGA-based accelerators incubate opportunities for breakthroughs in areas, such as real-time data processing, machine learning or cryptography -- to mention just a few. The procedure of placement -- determining the optimal spatial arran… ▽ More Field-Programmable Gate Arrays (FPGAs) have asserted themselves as vital assets in contemporary computing by offering adaptable, reconfigurable hardware platforms. FPGA-based accelerators incubate opportunities for breakthroughs in areas, such as real-time data processing, machine learning or cryptography -- to mention just a few. The procedure of placement -- determining the optimal spatial arrangement of functional blocks on an FPGA to minimize communication delays and enhance performance -- is an NP-hard problem, notably requiring sophisticated algorithms for proficient solutions. Clearly, improving the placement leads to a decreased resource utilization during the implementation phase. Adiabatic quantum computing (AQC), with its capability to traverse expansive solution spaces, has potential for addressing such combinatorial problems. In this paper, we re-formulate the placement problem as a series of so called quadratic unconstrained binary optimization (QUBO) problems which are subsequently solved via AQC. Our novel formulation facilitates a straight-forward integration of design constraints. Moreover, the size of the sub-problems can be conveniently adapted to the available hardware capabilities. Beside the sole proposal of a novel method, we ask whether contemporary quantum hardware is resilient enough to find placements for real-world-sized FPGAs. A numerical evaluation on a D-Wave Advantage 5.4 quantum annealer suggests that the answer is in the affirmative. △ Less

Submitted 24 December, 2023; originally announced December 2023.

Comments: Poster will be presented at International Symposium on Field-Programmable Gate Arrays

arXiv:2310.09129 [pdf, other]

Computing Marginal and Conditional Divergences between Decomposable Models with Applications

Authors: Loong Kuan Lee, Geoffrey I. Webb, Daniel F. Schmidt, Nico Piatkowski

Abstract: The ability to compute the exact divergence between two high-dimensional distributions is useful in many applications but doing so naively is intractable. Computing the alpha-beta divergence -- a family of divergences that includes the Kullback-Leibler divergence and Hellinger distance -- between the joint distribution of two decomposable models, i.e chordal Markov networks, can be done in time ex… ▽ More The ability to compute the exact divergence between two high-dimensional distributions is useful in many applications but doing so naively is intractable. Computing the alpha-beta divergence -- a family of divergences that includes the Kullback-Leibler divergence and Hellinger distance -- between the joint distribution of two decomposable models, i.e chordal Markov networks, can be done in time exponential in the treewidth of these models. However, reducing the dissimilarity between two high-dimensional objects to a single scalar value can be uninformative. Furthermore, in applications such as supervised learning, the divergence over a conditional distribution might be of more interest. Therefore, we propose an approach to compute the exact alpha-beta divergence between any marginal or conditional distribution of two decomposable models. Doing so tractably is non-trivial as we need to decompose the divergence between these distributions and therefore, require a decomposition over the marginal and conditional distributions of these models. Consequently, we provide such a decomposition and also extend existing work to compute the marginal and conditional alpha-beta divergence between these decompositions. We then show how our method can be used to analyze distributional changes by first applying it to a benchmark image dataset. Finally, based on our framework, we propose a novel way to quantify the error in contemporary superconducting quantum computers. Code for all experiments is available at: https://lklee.dev/pub/2023-icdm/code △ Less

Submitted 13 October, 2023; originally announced October 2023.

Comments: 10 pages, 8 figures, Accepted at the IEEE International Conference on Data Mining (ICDM) 2023

arXiv:2307.02195 [pdf, other]

Optimum-Preserving QUBO Parameter Compression

Authors: Sascha Mücke, Thore Gerlach, Nico Piatkowski

Abstract: Quadratic unconstrained binary optimization (QUBO) problems are well-studied, not least because they can be approached using contemporary quantum annealing or classical hardware acceleration. However, due to limited precision and hardware noise, the effective set of feasible parameter values is severely restricted. As a result, otherwise solvable problems become harder or even intractable. In this… ▽ More Quadratic unconstrained binary optimization (QUBO) problems are well-studied, not least because they can be approached using contemporary quantum annealing or classical hardware acceleration. However, due to limited precision and hardware noise, the effective set of feasible parameter values is severely restricted. As a result, otherwise solvable problems become harder or even intractable. In this work, we study the implications of solving QUBO problems under limited precision. Specifically, it is shown that the problem's dynamic range has a crucial impact on the problem's robustness against distortions. We show this by formalizing the notion of preserving optima between QUBO instances and explore to which extend parameters can be modified without changing the set of minimizing solutions. Based on these insights, we introduce techniques to reduce the dynamic range of a given QUBO instance based on theoretical bounds of the minimal energy value. An experimental evaluation on random QUBO instances as well as QUBO-encoded Binary Clustering and Subset Sum problems show that our theoretical findings manifest in practice. Results on quantum annealing hardware show that the performance can be improved drastically when following our methodology. △ Less

Submitted 5 July, 2023; originally announced July 2023.

arXiv:2301.09138 [pdf, other]

Explaining Quantum Circuits with Shapley Values: Towards Explainable Quantum Machine Learning

Authors: Raoul Heese, Thore Gerlach, Sascha Mücke, Sabine Müller, Matthias Jakobs, Nico Piatkowski

Abstract: Methods of artificial intelligence (AI) and especially machine learning (ML) have been growing ever more complex, and at the same time have more and more impact on people's lives. This leads to explainable AI (XAI) manifesting itself as an important research field that helps humans to better comprehend ML systems. In parallel, quantum machine learning (QML) is emerging with the ongoing improvement… ▽ More Methods of artificial intelligence (AI) and especially machine learning (ML) have been growing ever more complex, and at the same time have more and more impact on people's lives. This leads to explainable AI (XAI) manifesting itself as an important research field that helps humans to better comprehend ML systems. In parallel, quantum machine learning (QML) is emerging with the ongoing improvement of quantum computing hardware combined with its increasing availability via cloud services. QML enables quantum-enhanced ML in which quantum mechanics is exploited to facilitate ML tasks, typically in form of quantum-classical hybrid algorithms that combine quantum and classical resources. Quantum gates constitute the building blocks of gate-based quantum hardware and form circuits that can be used for quantum computations. For QML applications, quantum circuits are typically parameterized and their parameters are optimized classically such that a suitably defined objective function is minimized. Inspired by XAI, we raise the question of explainability of such circuits by quantifying the importance of (groups of) gates for specific goals. To this end, we transfer and adapt the well-established concept of Shapley values to the quantum realm. The resulting attributions can be interpreted as explanations for why a specific circuit works well for a given task, improving the understanding of how to construct parameterized (or variational) quantum circuits, and fostering their human interpretability in general. An experimental evaluation on simulators and two superconducting quantum hardware devices demonstrates the benefits of the proposed framework for classification, generative modeling, transpilation, and optimization. Furthermore, our results shed some light on the role of specific gates in popular QML approaches. △ Less

Submitted 2 March, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

Comments: 36 pages, 27 figures, 3 tables

arXiv:2301.08086 [pdf, other]

doi 10.1007/978-3-031-30047-9_13

Shapley Values with Uncertain Value Functions

Authors: Raoul Heese, Sascha Mücke, Matthias Jakobs, Thore Gerlach, Nico Piatkowski

Abstract: We propose a novel definition of Shapley values with uncertain value functions based on first principles using probability theory. Such uncertain value functions can arise in the context of explainable machine learning as a result of non-deterministic algorithms. We show that random effects can in fact be absorbed into a Shapley value with a noiseless but shifted value function. Hence, Shapley val… ▽ More We propose a novel definition of Shapley values with uncertain value functions based on first principles using probability theory. Such uncertain value functions can arise in the context of explainable machine learning as a result of non-deterministic algorithms. We show that random effects can in fact be absorbed into a Shapley value with a noiseless but shifted value function. Hence, Shapley values with uncertain value functions can be used in analogy to regular Shapley values. However, their reliable evaluation typically requires more computational effort. △ Less

Submitted 19 January, 2023; originally announced January 2023.

Comments: 12 pages, 1 figure, 1 table

Journal ref: Advances in Intelligent Data Analysis XXI. IDA 2023. Lecture Notes in Computer Science, vol 13876

arXiv:2209.02055 [pdf, other]

Full Kullback-Leibler-Divergence Loss for Hyperparameter-free Label Distribution Learning

Authors: Maurice Günder, Nico Piatkowski, Christian Bauckhage

Abstract: The concept of Label Distribution Learning (LDL) is a technique to stabilize classification and regression problems with ambiguous and/or imbalanced labels. A prototypical use-case of LDL is human age estimation based on profile images. Regarding this regression problem, a so called Deep Label Distribution Learning (DLDL) method has been developed. The main idea is the joint regression of the labe… ▽ More The concept of Label Distribution Learning (LDL) is a technique to stabilize classification and regression problems with ambiguous and/or imbalanced labels. A prototypical use-case of LDL is human age estimation based on profile images. Regarding this regression problem, a so called Deep Label Distribution Learning (DLDL) method has been developed. The main idea is the joint regression of the label distribution and its expectation value. However, the original DLDL method uses loss components with different mathematical motivation and, thus, different scales, which is why the use of a hyperparameter becomes necessary. In this work, we introduce a loss function for DLDL whose components are completely defined by Kullback-Leibler (KL) divergences and, thus, are directly comparable to each other without the need of additional hyperparameters. It generalizes the concept of DLDL with regard to further use-cases, in particular for multi-dimensional or multi-scale distribution learning tasks. △ Less

Submitted 5 September, 2022; originally announced September 2022.

Comments: 8 pages, 4 figures

arXiv:2206.00398 [pdf, other]

On Quantum Circuits for Discrete Graphical Models

Authors: Nico Piatkowski, Christa Zoufal

Abstract: Graphical models are useful tools for describing structured high-dimensional probability distributions. Development of efficient algorithms for generating unbiased and independent samples from graphical models remains an active research topic. Sampling from graphical models that describe the statistics of discrete variables is a particularly challenging problem, which is intractable in the presenc… ▽ More Graphical models are useful tools for describing structured high-dimensional probability distributions. Development of efficient algorithms for generating unbiased and independent samples from graphical models remains an active research topic. Sampling from graphical models that describe the statistics of discrete variables is a particularly challenging problem, which is intractable in the presence of high dimensions. In this work, we provide the first method that allows one to provably generate unbiased and independent samples from general discrete factor models with a quantum circuit. Our method is compatible with multi-body interactions and its success probability does not depend on the number of variables. To this end, we identify a novel embedding of the graphical model into unitary operators and provide rigorous guarantees on the resulting quantum state. Moreover, we prove a unitary Hammersley-Clifford theorem -- showing that our quantum embedding factorizes over the cliques of the underlying conditional independence structure. Importantly, the quantum embedding allows for maximum likelihood learning as well as maximum a posteriori state approximation via state-of-the-art hybrid quantum-classical methods. Finally, the proposed quantum method can be implemented on current quantum processors. Experiments with quantum simulation as well as actual quantum hardware show that our method can carry out sampling and parameter learning on quantum computers. △ Less

Submitted 1 June, 2022; originally announced June 2022.

arXiv:2205.11433 [pdf, other]

Informed Pre-Training on Prior Knowledge

Authors: Laura von Rueden, Sebastian Houben, Kostadin Cvejoski, Christian Bauckhage, Nico Piatkowski

Abstract: When training data is scarce, the incorporation of additional prior knowledge can assist the learning process. While it is common to initialize neural networks with weights that have been pre-trained on other large data sets, pre-training on more concise forms of knowledge has rather been overlooked. In this paper, we propose a novel informed machine learning approach and suggest to pre-train on p… ▽ More When training data is scarce, the incorporation of additional prior knowledge can assist the learning process. While it is common to initialize neural networks with weights that have been pre-trained on other large data sets, pre-training on more concise forms of knowledge has rather been overlooked. In this paper, we propose a novel informed machine learning approach and suggest to pre-train on prior knowledge. Formal knowledge representations, e.g. graphs or equations, are first transformed into a small and condensed data set of knowledge prototypes. We show that informed pre-training on such knowledge prototypes (i) speeds up the learning processes, (ii) improves generalization capabilities in the regime where not enough training data is available, and (iii) increases model robustness. Analyzing which parts of the model are affected most by the prototypes reveals that improvements come from deeper layers that typically represent high-level features. This confirms that informed pre-training can indeed transfer semantic knowledge. This is a novel effect, which shows that knowledge-based pre-training has additional and complementary strengths to existing approaches. △ Less

Submitted 23 May, 2022; originally announced May 2022.

arXiv:2204.11133 [pdf, other]

Towards Bundle Adjustment for Satellite Imaging via Quantum Machine Learning

Authors: Nico Piatkowski, Thore Gerlach, Romain Hugues, Rafet Sifa, Christian Bauckhage, Frederic Barbaresco

Abstract: Given is a set of images, where all images show views of the same area at different points in time and from different viewpoints. The task is the alignment of all images such that relevant information, e.g., poses, changes, and terrain, can be extracted from the fused image. In this work, we focus on quantum methods for keypoint extraction and feature matching, due to the demanding computational c… ▽ More Given is a set of images, where all images show views of the same area at different points in time and from different viewpoints. The task is the alignment of all images such that relevant information, e.g., poses, changes, and terrain, can be extracted from the fused image. In this work, we focus on quantum methods for keypoint extraction and feature matching, due to the demanding computational complexity of these sub-tasks. To this end, k-medoids clustering, kernel density clustering, nearest neighbor search, and kernel methods are investigated and it is explained how these methods can be re-formulated for quantum annealers and gate-based quantum computers. Experimental results obtained on digital quantum emulation hardware, quantum annealers, and quantum gate computers show that classical systems still deliver superior results. However, the proposed methods are ready for the current and upcoming generations of quantum computing devices which have the potential to outperform classical systems in the near future. △ Less

Submitted 23 April, 2022; originally announced April 2022.

ACM Class: C.3; I.2; I.4

arXiv:2203.13261 [pdf, other]

doi 10.1007/s42484-023-00099-z

Feature Selection on Quantum Computers

Authors: Sascha Mücke, Raoul Heese, Sabine Müller, Moritz Wolter, Nico Piatkowski

Abstract: In machine learning, fewer features reduce model complexity. Carefully assessing the influence of each input feature on the model quality is therefore a crucial preprocessing step. We propose a novel feature selection algorithm based on a quadratic unconstrained binary optimization (QUBO) problem, which allows to select a specified number of features based on their importance and redundancy. In co… ▽ More In machine learning, fewer features reduce model complexity. Carefully assessing the influence of each input feature on the model quality is therefore a crucial preprocessing step. We propose a novel feature selection algorithm based on a quadratic unconstrained binary optimization (QUBO) problem, which allows to select a specified number of features based on their importance and redundancy. In contrast to iterative or greedy methods, our direct approach yields higherquality solutions. QUBO problems are particularly interesting because they can be solved on quantum hardware. To evaluate our proposed algorithm, we conduct a series of numerical experiments using a classical computer, a quantum gate computer and a quantum annealer. Our evaluation compares our method to a range of standard methods on various benchmark datasets. We observe competitive performance. △ Less

Submitted 27 January, 2023; v1 submitted 24 March, 2022; originally announced March 2022.

Comments: 30 pages

Journal ref: Quantum Mach. Intell. 5, 11 (2023)

arXiv:2203.08815 [pdf, other]

QUBOs for Sorting Lists and Building Trees

Authors: Christian Bauckhage, Thore Gerlach, Nico Piatkowski

Abstract: We show that the fundamental tasks of sorting lists and building search trees or heaps can be modeled as quadratic unconstrained binary optimization problems (QUBOs). The idea is to understand these tasks as permutation problems and to devise QUBOs whose solutions represent appropriate permutation matrices. We discuss how to construct such QUBOs and how to solve them using Hopfield nets or adiabat… ▽ More We show that the fundamental tasks of sorting lists and building search trees or heaps can be modeled as quadratic unconstrained binary optimization problems (QUBOs). The idea is to understand these tasks as permutation problems and to devise QUBOs whose solutions represent appropriate permutation matrices. We discuss how to construct such QUBOs and how to solve them using Hopfield nets or adiabatic) quantum computing. In short, we show that neurocomputing methods or quantum computers can solve problems usually associated with abstract data structures. △ Less

Submitted 15 March, 2022; originally announced March 2022.

arXiv:2112.10712 [pdf, other]

Evolutionary Hierarchical Harvest Schedule Optimization for Food Waste Prevention

Authors: Maurice Günder, Nico Piatkowski, Laura von Rueden, Rafet Sifa, Christian Bauckhage

Abstract: In order to avoid disadvantages of monocrop** for soil and environment, it is advisable to practice intercrop** of various plant species whenever possible. However, intercrop** is challenging as it requires a balanced planting schedule due to individual cultivation time frames. Maintaining a continuous harvest reduces logistical costs and related greenhouse gas emissions, and contributes to… ▽ More In order to avoid disadvantages of monocrop** for soil and environment, it is advisable to practice intercrop** of various plant species whenever possible. However, intercrop** is challenging as it requires a balanced planting schedule due to individual cultivation time frames. Maintaining a continuous harvest reduces logistical costs and related greenhouse gas emissions, and contributes to food waste prevention. In this work, we address these issues and propose an optimization method for a full harvest season of large crop ensembles that complies with given constraints. By using an approach based on an evolutionary algorithm combined with a novel hierarchical loss function and adaptive mutation rate, we transfer the multi-objective into a pseudo-single-objective optimization problem and obtain faster convergence and better solutions than for conventional approaches. △ Less

Submitted 20 December, 2021; originally announced December 2021.

Comments: 4 pages, AAAI-2022 Workshop AI for Agriculture and Food Systems (AIAFS)

arXiv:2112.04583 [pdf, ps, other]

doi 10.1609/aaai.v37i10.26443

Computing Divergences between Discrete Decomposable Models

Authors: Loong Kuan Lee, Nico Piatkowski, François Petitjean, Geoffrey I. Webb

Abstract: There are many applications that benefit from computing the exact divergence between 2 discrete probability measures, including machine learning. Unfortunately, in the absence of any assumptions on the structure or independencies within these distributions, computing the divergence between them is an intractable problem in high dimensions. We show that we are able to compute a wide family of funct… ▽ More There are many applications that benefit from computing the exact divergence between 2 discrete probability measures, including machine learning. Unfortunately, in the absence of any assumptions on the structure or independencies within these distributions, computing the divergence between them is an intractable problem in high dimensions. We show that we are able to compute a wide family of functionals and divergences, such as the alpha-beta divergence, between two decomposable models, i.e. chordal Markov networks, in time exponential to the treewidth of these models. The alpha-beta divergence is a family of divergences that include popular divergences such as the Kullback-Leibler divergence, the Hellinger distance, and the chi-squared divergence. Thus, we can accurately compute the exact values of any of this broad class of divergences to the extent to which we can accurately model the two distributions using decomposable models. △ Less

Submitted 30 November, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

Comments: 13 pages, 4 Figures, 3 Tables. Accepted to the 37th AAAI Conference on Artificial Intelligence (AAAI 2023)

Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence. 37, 10 (Jun. 2023), 12243-12251

arXiv:2108.13329 [pdf, other]

doi 10.1007/s10994-023-06490-y

On the effects of biased quantum random numbers on the initialization of artificial neural networks

Authors: Raoul Heese, Moritz Wolter, Sascha Mücke, Lukas Franken, Nico Piatkowski

Abstract: Recent advances in practical quantum computing have led to a variety of cloud-based quantum computing platforms that allow researchers to evaluate their algorithms on noisy intermediate-scale quantum (NISQ) devices. A common property of quantum computers is that they can exhibit instances of true randomness as opposed to pseudo-randomness obtained from classical systems. Investigating the effects… ▽ More Recent advances in practical quantum computing have led to a variety of cloud-based quantum computing platforms that allow researchers to evaluate their algorithms on noisy intermediate-scale quantum (NISQ) devices. A common property of quantum computers is that they can exhibit instances of true randomness as opposed to pseudo-randomness obtained from classical systems. Investigating the effects of such true quantum randomness in the context of machine learning is appealing, and recent results vaguely suggest that benefits can indeed be achieved from the use of quantum random numbers. To shed some more light on this topic, we empirically study the effects of hardware-biased quantum random numbers on the initialization of artificial neural network weights in numerical experiments. We find no statistically significant difference in comparison with unbiased quantum random numbers as well as biased and unbiased random numbers from a classical pseudo-random number generator. The quantum random numbers for our experiments are obtained from real quantum hardware. △ Less

Submitted 19 December, 2023; v1 submitted 30 August, 2021; originally announced August 2021.

Comments: 26 pages, 12 figures, 3 tables

Journal ref: Mach Learn (2024)

arXiv:2106.00512 [pdf, other]

The Care Label Concept: A Certification Suite for Trustworthy and Resource-Aware Machine Learning

Authors: Katharina Morik, Helena Kotthaus, Lukas Heppe, Danny Heinrich, Raphael Fischer, Andreas Pauly, Nico Piatkowski

Abstract: Machine learning applications have become ubiquitous. This has led to an increased effort of making machine learning trustworthy. Explainable and fair AI have already matured. They address knowledgeable users and application engineers. For those who do not want to invest time into understanding the method or the learned model, we offer care labels: easy to understand at a glance, allowing for meth… ▽ More Machine learning applications have become ubiquitous. This has led to an increased effort of making machine learning trustworthy. Explainable and fair AI have already matured. They address knowledgeable users and application engineers. For those who do not want to invest time into understanding the method or the learned model, we offer care labels: easy to understand at a glance, allowing for method or model comparisons, and, at the same time, scientifically well-based. On one hand, this transforms descriptions as given by, e.g., Fact Sheets or Model Cards, into a form that is well-suited for end-users. On the other hand, care labels are the result of a certification suite that tests whether stated guarantees hold. In this paper, we present two experiments with our certification suite. One shows the care labels for configurations of Markov random fields (MRFs). Based on the underlying theory of MRFs, each choice leads to its specific rating of static properties like, e.g., expressivity and reliability. In addition, the implementation is tested and resource consumption is measured yielding dynamic properties. This two-level procedure is followed by another experiment certifying deep neural network (DNN) models. There, we draw the static properties from the literature on a particular model and data set. At the second level, experiments are generated that deliver measurements of robustness against certain attacks. We illustrate this by ResNet-18 and MobileNetV3 applied to ImageNet. △ Less

Submitted 1 June, 2021; originally announced June 2021.

arXiv:2105.10197 [pdf]

doi 10.3389/frai.2022.975029

Yes We Care! -- Certification for Machine Learning Methods through the Care Label Framework

Authors: Katharina Morik, Helena Kotthaus, Raphael Fischer, Sascha Mücke, Matthias Jakobs, Nico Piatkowski, Andreas Pauly, Lukas Heppe, Danny Heinrich

Abstract: Machine learning applications have become ubiquitous. Their applications range from embedded control in production machines over process optimization in diverse areas (e.g., traffic, finance, sciences) to direct user interactions like advertising and recommendations. This has led to an increased effort of making machine learning trustworthy. Explainable and fair AI have already matured. They addre… ▽ More Machine learning applications have become ubiquitous. Their applications range from embedded control in production machines over process optimization in diverse areas (e.g., traffic, finance, sciences) to direct user interactions like advertising and recommendations. This has led to an increased effort of making machine learning trustworthy. Explainable and fair AI have already matured. They address the knowledgeable user and the application engineer. However, there are users that want to deploy a learned model in a similar way as their washing machine. These stakeholders do not want to spend time in understanding the model, but want to rely on guaranteed properties. What are the relevant properties? How can they be expressed to the stakeholder without presupposing machine learning knowledge? How can they be guaranteed for a certain implementation of a machine learning model? These questions move far beyond the current state of the art and we want to address them here. We propose a unified framework that certifies learning methods via care labels. They are easy to understand and draw inspiration from well-known certificates like textile labels or property cards of electronic devices. Our framework considers both, the machine learning theory and a given implementation. We test the implementation's compliance with theoretical properties and bounds. △ Less

Submitted 22 December, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

Journal ref: Frontiers in Artificial Intelligence, September 2022

arXiv:2104.07538 [pdf, other]

Street-Map Based Validation of Semantic Segmentation in Autonomous Driving

Authors: Laura von Rueden, Tim Wirtz, Fabian Hueger, Jan David Schneider, Nico Piatkowski, Christian Bauckhage

Abstract: Artificial intelligence for autonomous driving must meet strict requirements on safety and robustness, which motivates the thorough validation of learned models. However, current validation approaches mostly require ground truth data and are thus both cost-intensive and limited in their applicability. We propose to overcome these limitations by a model agnostic validation using a-priori knowledge… ▽ More Artificial intelligence for autonomous driving must meet strict requirements on safety and robustness, which motivates the thorough validation of learned models. However, current validation approaches mostly require ground truth data and are thus both cost-intensive and limited in their applicability. We propose to overcome these limitations by a model agnostic validation using a-priori knowledge from street maps. In particular, we show how to validate semantic segmentation masks and demonstrate the potential of our approach using OpenStreetMap. We introduce validation metrics that indicate false positive or negative road segments. Besides the validation approach, we present a method to correct the vehicle's GPS position so that a more accurate localization can be used for the street-map based validation. Lastly, we present quantitative results on the Cityscapes dataset indicating that our validation approach can indeed uncover errors in semantic segmentation masks. △ Less

Submitted 15 April, 2021; originally announced April 2021.

Comments: Final version accepted at the International Conference on Pattern Recognition (ICPR). arXiv admin note: substantial text overlap with arXiv:2011.08008

arXiv:2012.13453 [pdf, other]

doi 10.1109/CEC55065.2022.9870269

Quantum Circuit Evolution on NISQ Devices

Authors: Lukas Franken, Bogdan Georgiev, Sascha Mücke, Moritz Wolter, Raoul Heese, Christian Bauckhage, Nico Piatkowski

Abstract: Variational quantum circuits build the foundation for various classes of quantum algorithms. In a nutshell, the weights of a parametrized quantum circuit are varied until the empirical sampling distribution of the circuit is sufficiently close to a desired outcome. Numerical first-order methods are applied frequently to fit the parameters of the circuit, but most of the time, the circuit itself, t… ▽ More Variational quantum circuits build the foundation for various classes of quantum algorithms. In a nutshell, the weights of a parametrized quantum circuit are varied until the empirical sampling distribution of the circuit is sufficiently close to a desired outcome. Numerical first-order methods are applied frequently to fit the parameters of the circuit, but most of the time, the circuit itself, that is, the actual composition of gates, is fixed. Methods for optimizing the circuit design jointly with the weights have been proposed, but empirical results are rather scarce. Here, we consider a simple evolutionary strategy that addresses the trade-off between finding appropriate circuit architectures and parameter tuning. We evaluate our method both via simulation and on actual quantum hardware. Our benchmark problems include the transverse field Ising Hamiltonian and the Sherrington-Kirkpatrick spin model. Despite the shortcomings of current noisy intermediate-scale quantum hardware, we find only a minor slowdown on actual quantum machines compared to simulations. Moreover, we investigate which mutation operations most significantly contribute to the optimization. The results provide intuition on how randomized search heuristics behave on actual quantum hardware and lay out a path for further refinement of evolutionary quantum gate circuits. △ Less

Submitted 23 May, 2022; v1 submitted 23 December, 2020; originally announced December 2020.

Comments: 8 pages, 7 figures. To appear in the proceedings of IEEE Congress on Evolutionary Computation (CEC) 2022

Journal ref: 2022 IEEE Congress on Evolutionary Computation (CEC), pp. 1-8

arXiv:2009.12098 [pdf, other]

Resource-Constrained On-Device Learning by Dynamic Averaging

Authors: Lukas Heppe, Michael Kamp, Linara Adilova, Danny Heinrich, Nico Piatkowski, Katharina Morik

Abstract: The communication between data-generating devices is partially responsible for a growing portion of the world's power consumption. Thus reducing communication is vital, both, from an economical and an ecological perspective. For machine learning, on-device learning avoids sending raw data, which can reduce communication substantially. Furthermore, not centralizing the data protects privacy-sensiti… ▽ More The communication between data-generating devices is partially responsible for a growing portion of the world's power consumption. Thus reducing communication is vital, both, from an economical and an ecological perspective. For machine learning, on-device learning avoids sending raw data, which can reduce communication substantially. Furthermore, not centralizing the data protects privacy-sensitive data. However, most learning algorithms require hardware with high computation power and thus high energy consumption. In contrast, ultra-low-power processors, like FPGAs or micro-controllers, allow for energy-efficient learning of local models. Combined with communication-efficient distributed learning strategies, this reduces the overall energy consumption and enables applications that were yet impossible due to limited energy on local devices. The major challenge is then, that the low-power processors typically only have integer processing capabilities. This paper investigates an approach to communication-efficient on-device learning of integer exponential families that can be executed on low-power processors, is privacy-preserving, and effectively minimizes communication. The empirical evaluation shows that the approach can reach a model quality comparable to a centrally learned regular model with an order of magnitude less communication. Comparing the overall energy consumption, this reduces the required energy for solving the machine learning task by a significant amount. △ Less

Submitted 25 September, 2020; originally announced September 2020.

arXiv:2003.09827 [pdf, other]

doi 10.1109/JIOT.2020.2983207

The Channel as a Traffic Sensor: Vehicle Detection and Classification based on Radio Fingerprinting

Authors: Benjamin Sliwa, Niko Piatkowski, Christian Wietfeld

Abstract: Ubiquitously deployed Internet of Things (IoT)- based automatic vehicle classification systems will catalyze data-driven traffic flow optimization in future smart cities and will transform the road infrastructure itself into a dynamically sensing Cyber-physical System (CPS). Although a wide range of different traffic sensing systems has been proposed, the existing solutions are not yet able to sim… ▽ More Ubiquitously deployed Internet of Things (IoT)- based automatic vehicle classification systems will catalyze data-driven traffic flow optimization in future smart cities and will transform the road infrastructure itself into a dynamically sensing Cyber-physical System (CPS). Although a wide range of different traffic sensing systems has been proposed, the existing solutions are not yet able to simultaneously satisfy the multitude of requirements, e.g., accuracy, robustness, cost-efficiency, and privacy preservation. In this paper, we present a novel approach, which exploits radio fingerprints - multidimensional attenuation patterns of wireless signals - for accurate and robust vehicle detection and classification. The proposed system can be deployed in a highly cost-efficient manner as it relies on off-the-shelf embedded devices which are installed into existing delineator posts. In a comprehensive field evaluation campaign, the performance of the radio fingerprinting-based approach is analyzed within an experimental live deployment on a German highway, where it is able to achieve a binary classification success ratio of more than 99% and an overall accuracy of 93.83% for a classification task with seven different classes. △ Less

Submitted 25 March, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

Journal ref: IEEE Internet of Things Journal 2020

arXiv:2001.10189 [pdf, other]

LIMITS: Lightweight Machine Learning for IoT Systems with Resource Limitations

Authors: Benjamin Sliwa, Nico Piatkowski, Christian Wietfeld

Abstract: Exploiting big data knowledge on small devices will pave the way for building truly cognitive Internet of Things (IoT) systems. Although machine learning has led to great advancements for IoT-based data analytics, there remains a huge methodological gap for the deployment phase of trained machine learning models. For given resource-constrained platforms such as Microcontroller Units (MCUs), model… ▽ More Exploiting big data knowledge on small devices will pave the way for building truly cognitive Internet of Things (IoT) systems. Although machine learning has led to great advancements for IoT-based data analytics, there remains a huge methodological gap for the deployment phase of trained machine learning models. For given resource-constrained platforms such as Microcontroller Units (MCUs), model choice and parametrization are typically performed based on heuristics or analytical models. However, these approaches are only able to provide rough estimates of the required system resources as they do not consider the interplay of hardware, compiler specific optimizations, and code dependencies. In this paper, we present the novel open source framework LIghtweight Machine learning for IoT Systems (LIMITS), which applies a platform-in-the-loop approach explicitly considering the actual compilation toolchain of the target IoT platform. LIMITS focuses on high level tasks such as experiment automation, platform-specific code generation, and sweet spot determination. The solid foundations of validated low-level model implementations are provided by the coupled well-established data analysis framework Waikato Environment for Knowledge Analysis (WEKA). We apply and validate LIMITS in two case studies focusing on cellular data rate prediction and radio-based vehicle classification, where we compare different learning models and real world IoT platforms with memory constraints from 16 kB to 4 MB and demonstrate its potential to catalyze the development of machine learning enabled IoT systems. △ Less

Submitted 28 January, 2020; originally announced January 2020.

arXiv:1907.00697 [pdf, other]

The Trustworthy Pal: Controlling the False Discovery Rate in Boolean Matrix Factorization

Authors: Sibylle Hess, Nico Piatkowski, Katharina Morik

Abstract: Boolean matrix factorization (BMF) is a popular and powerful technique for inferring knowledge from data. The mining result is the Boolean product of two matrices, approximating the input dataset. The Boolean product is a disjunction of rank-1 binary matrices, each describing a feature-relation, called pattern, for a group of samples. Yet, there are no guarantees that any of the returned patterns… ▽ More Boolean matrix factorization (BMF) is a popular and powerful technique for inferring knowledge from data. The mining result is the Boolean product of two matrices, approximating the input dataset. The Boolean product is a disjunction of rank-1 binary matrices, each describing a feature-relation, called pattern, for a group of samples. Yet, there are no guarantees that any of the returned patterns do not actually arise from noise, i.e., are false discoveries. In this paper, we propose and discuss the usage of the false discovery rate in the unsupervised BMF setting. We prove two bounds on the probability that a found pattern is constituted of random Bernoulli-distributed noise. Each bound exploits a specific property of the factorization which minimizes the approximation error---yielding new insights on the minimizers of Boolean matrix factorization. This leads to improved BMF algorithms by replacing heuristic rank selection techniques with a theoretically well-based approach. Our empirical demonstration shows that both bounds deliver excellent results in various practical settings. △ Less

Submitted 1 July, 2019; originally announced July 2019.

arXiv:1906.09722 [pdf, other]

The PRIM** Routine -- Tiling through Proximal Alternating Linearized Minimization

Authors: Sibylle Hess, Katharina Morik, Nico Piatkowski

Abstract: Mining and exploring databases should provide users with knowledge and new insights. Tiles of data strive to unveil true underlying structure and distinguish valuable information from various kinds of noise. We propose a novel Boolean matrix factorization algorithm to solve the tiling problem, based on recent results from optimization theory. In contrast to existing work, the new algorithm minimiz… ▽ More Mining and exploring databases should provide users with knowledge and new insights. Tiles of data strive to unveil true underlying structure and distinguish valuable information from various kinds of noise. We propose a novel Boolean matrix factorization algorithm to solve the tiling problem, based on recent results from optimization theory. In contrast to existing work, the new algorithm minimizes the description length of the resulting factorization. This approach is well known for model selection and data compression, but not for finding suitable factorizations via numerical optimization. We demonstrate the superior robustness of the new approach in the presence of several kinds of noise and types of underlying structure. Moreover, our general framework can work with any cost measure having a suitable real-valued relaxation. Thereby, no convexity assumptions have to be met. The experimental results on synthetic data and image data show that the new method identifies interpretable patterns which explain the data almost always better than the competing algorithms. △ Less

Submitted 17 June, 2019; originally announced June 2019.

Journal ref: Data Mining and Knowledge Discovery 31(4): 1090-1131 (2017)

arXiv:1904.10186 [pdf, other]

doi 10.1109/TITS.2019.2930109

Boosting Vehicle-to-cloud Communication by Machine Learning-enabled Context Prediction

Authors: Benjamin Sliwa, Robert Falkenberg, Thomas Liebig, Nico Piatkowski, Christian Wietfeld

Abstract: The exploitation of vehicles as mobile sensors acts as a catalyst for novel crowdsensing-based applications such as intelligent traffic control and distributed weather forecast. However, the massive increases in Machine-type Communication (MTC) highly stress the capacities of the network infrastructure. With the system-immanent limitation of resources in cellular networks and the resource competit… ▽ More The exploitation of vehicles as mobile sensors acts as a catalyst for novel crowdsensing-based applications such as intelligent traffic control and distributed weather forecast. However, the massive increases in Machine-type Communication (MTC) highly stress the capacities of the network infrastructure. With the system-immanent limitation of resources in cellular networks and the resource competition between human cell users and MTC, more resource-efficient channel access methods are required in order to improve the coexistence of the different communicating entities. In this paper, we present a machine learning-enabled transmission scheme for client-side opportunistic data transmission. By considering the measured channel state as well as the predicted future channel behavior, delay-tolerant MTC is performed with respect to the anticipated resource-efficiency. The proposed mechanism is evaluated in comprehensive field evaluations in public Long Term Evolution (LTE) networks, where it is able to increase the mean data rate by 194% while simultaneously reducing the average power consumption by up to 54%. △ Less

Submitted 19 July, 2019; v1 submitted 23 April, 2019; originally announced April 2019.

Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2019

arXiv:1807.00464 [pdf, other]

doi 10.1109/ITSC.2018.8569391

Leveraging the Channel as a Sensor: Real-time Vehicle Classification Using Multidimensional Radio-fingerprinting

Authors: Benjamin Sliwa, Nico Piatkowski, Marcus Haferkamp, Dennis Dorn, Christian Wietfeld

Abstract: Upcoming Intelligent Transportation Systems (ITSs) will transform roads from static resources to dynamic Cyber Physical Systems (CPSs) in order to satisfy the requirements of future vehicular traffic in smart city environments. Up-to-date information serves as the basis for changing street directions as well as guiding individual vehicles to a fitting parking slot. In this context, not only abstra… ▽ More Upcoming Intelligent Transportation Systems (ITSs) will transform roads from static resources to dynamic Cyber Physical Systems (CPSs) in order to satisfy the requirements of future vehicular traffic in smart city environments. Up-to-date information serves as the basis for changing street directions as well as guiding individual vehicles to a fitting parking slot. In this context, not only abstract indicators like traffic flow and density are required, but also data about mobility parameters and class information of individual vehicles. Consequently, accurate and reliable systems that are capable of providing these kinds of information in real-time are highly demanded. In this paper, we present a system for classifying vehicles based on their radio-fingerprints which applies cutting-edge machine learning models and can be non-intrusively installed into the existing road infrastructure in an ad-hoc manner. In contrast to other approaches, it is able to provide accurate classification results without causing privacy-violations or being vulnerable to challenging weather conditions. Moreover, it is a promising candidate for large-scale city deployments due to its cost-efficient installation and maintenance properties. The proposed system is evaluated in a comprehensive field evaluation campaign within an experimental live deployment on a German highway, where it is able to achieve a binary classification success ratio of more than 99% and an overall accuracy of 89.15% for a fine-grained classification task with nine different classes. △ Less

Submitted 2 July, 2018; originally announced July 2018.

Journal ref: 2018 21st International Conference on Intelligent Transportation Systems (ITSC)

arXiv:1806.06620 [pdf, other]

doi 10.1109/VTCFall.2018.8690629

Machine Learning Based Uplink Transmission Power Prediction for LTE and Upcoming 5G Networks using Passive Downlink Indicators

Authors: Robert Falkenberg, Benjamin Sliwa, Nico Piatkowski, Christian Wietfeld

Abstract: Energy-aware system design is an important optimization task for static and mobile Internet of Things (IoT)-based sensor nodes, especially for highly resource-constrained vehicles such as mobile robotic systems. For 4G/5G-based cellular communication systems, the effective transmission power of uplink data transmissions is of crucial importance for the overall system power consumption. Unfortunate… ▽ More Energy-aware system design is an important optimization task for static and mobile Internet of Things (IoT)-based sensor nodes, especially for highly resource-constrained vehicles such as mobile robotic systems. For 4G/5G-based cellular communication systems, the effective transmission power of uplink data transmissions is of crucial importance for the overall system power consumption. Unfortunately, this information is usually hidden within off-the-shelf modems and mobile handsets and can therefore not be exploited for enabling green communication. Moreover, the dynamic transmission power control behavior of the mobile device is not even explicitly modeled in most of the established simulation frameworks. In this paper, we present a novel machine learning-based approach for forecasting the resulting uplink transmission power used for data transmissions based on the available passive network quality indicators and application-level information. The model is derived from comprehensive field measurements of drive tests performed in a public cellular network and can be parameterized for integrating all measurements a given target platform is able to provide into the prediction process. In a comparison of three different machine learning methods, Random-Forest models thoroughly performed best with a mean average error of 3.166 dB. As the absolute sum of errors converges towards zero and falls below 1 dB after 28 predictions in average, the approach is well-suited for long-term power estimations. △ Less

Submitted 19 February, 2020; v1 submitted 18 June, 2018; originally announced June 2018.

Journal ref: 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall)

Showing 1–26 of 26 results for author: Piatkowski, N