Search | arXiv e-print repository

The Polynomial Connection between Morphological Dilation and Discrete Convolution

Authors: Vivek Sridhar, Keyvan Shahin, Michael Breuß, Marc Reichenbach

Abstract: In this paper we consider the fundamental operations dilation and erosion of mathematical morphology. Many powerful image filtering operations are based on their combinations. We establish homomorphism between max-plus semi-ring of integers and subset of polynomials over the field of real numbers. This enables to reformulate the task of computing morphological dilation to that of computing sums an… ▽ More In this paper we consider the fundamental operations dilation and erosion of mathematical morphology. Many powerful image filtering operations are based on their combinations. We establish homomorphism between max-plus semi-ring of integers and subset of polynomials over the field of real numbers. This enables to reformulate the task of computing morphological dilation to that of computing sums and products of polynomials. Therefore, dilation and its dual operation erosion can be computed by convolution of discrete linear signals, which is efficiently accomplished using a Fast Fourier Transform technique. The novel method may deal with non-flat filters and incorporates no restrictions on shape or size of the structuring element, unlike many other fast methods in the field. In contrast to previous fast Fourier techniques it gives exact results and is not an approximation. The new method is in practice particularly suitable for filtering images with small tonal range or when employing large filter sizes. We explore the benefits by investigating an implementation on FPGA hardware. Several experiments demonstrate the exactness and efficiency of the proposed method. △ Less

Submitted 4 May, 2023; originally announced May 2023.

arXiv:2203.01771 [pdf, other]

doi 10.1109/IPDPSW.2015.58

Estimation of Non-Functional Properties for Embedded Hardware with Application to Image Processing

Authors: Christian Herglotz, Jürgen Seiler, André Kaup, Arne Hendricks, Marc Reichenbach, Dietmar Fey

Abstract: In recent years, due to a higher demand for portable devices, which provide restricted amounts of processing capacity and battery power, the need for energy and time efficient hard- and software solutions has increased. Preliminary estimations of time and energy consumption can thus be valuable to improve implementations and design decisions. To this end, this paper presents a method to estimate t… ▽ More In recent years, due to a higher demand for portable devices, which provide restricted amounts of processing capacity and battery power, the need for energy and time efficient hard- and software solutions has increased. Preliminary estimations of time and energy consumption can thus be valuable to improve implementations and design decisions. To this end, this paper presents a method to estimate the time and energy consumption of a given software solution, without having to rely on the use of a traditional Cycle Accurate Simulator (CAS). Instead, we propose to utilize a combination of high-level functional simulation with a mechanistic extension to include non-functional properties: Instruction counts from virtual execution are multiplied with corresponding specific energies and times. By evaluating two common image processing algorithms on an FPGA-based CPU, where a mean relative estimation error of 3% is achieved for cacheless systems, we show that this estimation tool can be a valuable aid in the development of embedded processor architectures. The tool allows the developer to reach well-suited design decisions regarding the optimal processor hardware configuration for a given algorithm at an early stage in the design process. △ Less

Submitted 3 March, 2022; originally announced March 2022.

Comments: 6 pages, 4 figures, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPS)

arXiv:2203.00466 [pdf, ps, other]

doi 10.1109/TCSVT.2016.2598705

Modeling the Energy Consumption of the HEVC Decoding Process

Authors: Christian Herglotz, Dominic Springer, Marc Reichenbach, Benno Stabernack, André Kaup

Abstract: In this paper, we present a bit stream feature based energy model that accurately estimates the energy required to decode a given HEVC-coded bit stream. Therefore, we take a model from literature and extend it by explicitly modeling the inloop filters, which was not done before. Furthermore, to prove its superior estimation performance, it is compared to seven different energy models from literatu… ▽ More In this paper, we present a bit stream feature based energy model that accurately estimates the energy required to decode a given HEVC-coded bit stream. Therefore, we take a model from literature and extend it by explicitly modeling the inloop filters, which was not done before. Furthermore, to prove its superior estimation performance, it is compared to seven different energy models from literature. By using a unified evaluation framework we show how accurately the required decoding energy for different decoding systems can be approximated. We give thorough explanations on the model parameters and explain how the model variables are derived. To show the modeling capabilities in general, we test the estimation performance for different decoding software and hardware solutions, where we find that the proposed model outperforms the models from literature by reaching frame-wise mean estimation errors of less than 7% for software and less than 15% for hardware based systems. △ Less

Submitted 1 March, 2022; originally announced March 2022.

Comments: 13 pages, 4 figures

Journal ref: IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), volume 28, issue 1, pp. 217 - 229, Jan. 2018

arXiv:2102.06018 [pdf]

Transparent FPGA Acceleration with TensorFlow

Authors: Simon Pfenning, Philipp Holzinger, Marc Reichenbach

Abstract: Today, artificial neural networks are one of the major innovators pushing the progress of machine learning. This has particularly affected the development of neural network accelerating hardware. However, since most of these architectures require specialized toolchains, there is a certain amount of additional effort for developers each time they want to make use of a new deep learning accelerator.… ▽ More Today, artificial neural networks are one of the major innovators pushing the progress of machine learning. This has particularly affected the development of neural network accelerating hardware. However, since most of these architectures require specialized toolchains, there is a certain amount of additional effort for developers each time they want to make use of a new deep learning accelerator. Furthermore the flexibility of the device is bound to the architecture itself, as well as to the functionality of the runtime environment. In this paper we propose a toolflow using TensorFlow as frontend, thus offering developers the opportunity of using a familiar environment. On the backend we use an FPGA, which is addressable via an HSA runtime environment. In this way we are able to hide the complexity of controlling new hardware from the user, while at the same time maintaining a high amount of flexibility. This can be achieved by our HSA toolflow, since the hardware is not statically configured with the structure of the network. Instead, it can be dynamically reconfigured during runtime with the respective kernels executed by the network and simultaneously from other sources e.g. OpenCL/OpenMP. △ Less

Submitted 2 February, 2021; originally announced February 2021.

Comments: Presented at DATE Friday Workshop on System-level Design Methods for Deep Learning on Heterogeneous Architectures (SLOHA 2021) (arXiv:2102.00818)

Report number: SLOHA/2021/09

arXiv:1502.07453 [pdf]

A Holistic Approach for Modeling and Synthesis of Image Processing Applications for Heterogeneous Computing Architectures

Authors: Christian Hartmann, Anna Yupatova, Marc Reichenbach, Dietmar Fey, Reinhard German

Abstract: Image processing applications are common in every field of our daily life. However, most of them are very complex and contain several tasks with different complexities which result in varying requirements for computing architectures. Nevertheless, a general processing scheme in every image processing application has a similar structure, called image processing pipeline: (1) capturing an image, (2)… ▽ More Image processing applications are common in every field of our daily life. However, most of them are very complex and contain several tasks with different complexities which result in varying requirements for computing architectures. Nevertheless, a general processing scheme in every image processing application has a similar structure, called image processing pipeline: (1) capturing an image, (2) pre-processing using local operators, (3) processing with global operators and (4) post-processing using complex operations. Therefore, application-specialized hardware solutions based on heterogeneous architectures are used for image processing. Unfortunately the development of applications for heterogeneous hardware architectures is challenging due to the distribution of computational tasks among processors and programmable logic units. Nowadays, image processing systems are started from scratch which is time-consuming, error-prone and inflexible. A new methodology for modeling and implementing is needed in order to reduce the development time of heterogenous image processing systems. This paper introduces a new holistic top down approach for image processing systems. Two challenges have to be investigated. First, designers ought to be able to model their complete image processing pipeline on an abstract layer using UML. Second, we want to close the gap between the abstract system and the system architecture. △ Less

Submitted 26 February, 2015; originally announced February 2015.

Comments: Presented at DATE Friday Workshop on Heterogeneous Architectures and Design Methods for Embedded Image Systems (HIS 2015) (arXiv:1502.07241)

Report number: DATEHIS/2015/06

arXiv:1502.07448 [pdf]

Automatic Optimization of Hardware Accelerators for Image Processing

Authors: Oliver Reiche, Konrad Häublein, Marc Reichenbach, Frank Hannig, Jürgen Teich, Dietmar Fey

Abstract: In the domain of image processing, often real-time constraints are required. In particular, in safety-critical applications, such as X-ray computed tomography in medical imaging or advanced driver assistance systems in the automotive domain, timing is of utmost importance. A common approach to maintain real-time capabilities of compute-intensive applications is to offload those computations to ded… ▽ More In the domain of image processing, often real-time constraints are required. In particular, in safety-critical applications, such as X-ray computed tomography in medical imaging or advanced driver assistance systems in the automotive domain, timing is of utmost importance. A common approach to maintain real-time capabilities of compute-intensive applications is to offload those computations to dedicated accelerator hardware, such as Field Programmable Gate Arrays (FPGAs). Programming such architectures is a challenging task, with respect to the typical FPGA-specific design criteria: Achievable overall algorithm latency and resource usage of FPGA primitives (BRAM, FF, LUT, and DSP). High-Level Synthesis (HLS) dramatically simplifies this task by enabling the description of algorithms in well-known higher languages (C/C++) and its automatic synthesis that can be accomplished by HLS tools. However, algorithm developers still need expert knowledge about the target architecture, in order to achieve satisfying results. Therefore, in previous work, we have shown that elevating the description of image algorithms to an even higher abstraction level, by using a Domain-Specific Language (DSL), can significantly cut down the complexity for designing such algorithms for FPGAs. To give the developer even more control over the common trade-off, latency vs. resource usage, we will present an automatic optimization process where these criteria are analyzed and fed back to the DSL compiler, in order to generate code that is closer to the desired design specifications. Finally, we generate code for stereo block matching algorithms and compare it with handwritten implementations to quantify the quality of our results. △ Less

Submitted 26 February, 2015; originally announced February 2015.

Comments: Presented at DATE Friday Workshop on Heterogeneous Architectures and Design Methods for Embedded Image Systems (HIS 2015) (arXiv:1502.07241)

Report number: DATEHIS/2015/03

arXiv:hep-th/9608138 [pdf, ps, other]

doi 10.1016/S0550-3213(96)00629-3

Field theoretical approach to non-local interactions: 1d electrons and fermionic impurities

Authors: C. M. Naon, M. C. von Reichenbach, M. L. Trobo

Abstract: We apply a recently proposed path-integral approach to non-local bosonization to a Thirring-like system modeling non-relativistic massless particles interacting with localized fermionic impurities. We consider forward scattering processes described by symmetric potentials including interactions between charge, current, spin and spin-current densities. In the general (spin-flip**) problem we ob… ▽ More We apply a recently proposed path-integral approach to non-local bosonization to a Thirring-like system modeling non-relativistic massless particles interacting with localized fermionic impurities. We consider forward scattering processes described by symmetric potentials including interactions between charge, current, spin and spin-current densities. In the general (spin-flip**) problem we obtain an effective action for the collective modes of the model at T = 0, containing WZW-type terms. When spin-flip** processes are disregarded the structure of the action is considerably simplified, allowing us to derive exact expressions for the dispersion relations of collective modes and two point fermionic correlation functions as functionals of the potentials. Finally, as an example, we compute the momentum distribution for the case in which electrons and impurities are coupled through spin and spin-current densities only. The formulae we get suggest that our formalism could be useful in order to seek for a mechanism able to restore Fermi liquid behavior. △ Less

Submitted 20 August, 1996; originally announced August 1996.

Comments: 27 pages, Latex file, no figures

Report number: La Plata-th 96/10

Journal ref: Nucl.Phys. B485 (1997) 665-684

arXiv:cond-mat/9510166 [pdf, ps, other]

Fermi edge restoration in the Tomonaga-Luttinger model with impurities

Authors: C. M. Naón, M. C. von Reichenbach, M. L. Trobo

Abstract: We study the Tomonaga-Luttinger model in the presence of magnetic (Kondo-like) impurities. By using a recently proposed field-theoretical approach to non-local bosonization we obtain the effective action describing the low-energy charge and spin density fluctuations. From this action the dispersion relations of the collective modes are readily found. We also compute the momentum distribution and… ▽ More We study the Tomonaga-Luttinger model in the presence of magnetic (Kondo-like) impurities. By using a recently proposed field-theoretical approach to non-local bosonization we obtain the effective action describing the low-energy charge and spin density fluctuations. From this action the dispersion relations of the collective modes are readily found. We also compute the momentum distribution and show that the electron-impurity scattering allows to have restoration of the Fermi liquid behavior. △ Less

Submitted 30 October, 1995; originally announced October 1995.

Comments: 11 pages, RevTex, 1 figure available upon request on e-mail [email protected]

Report number: La Plata-TH 95-28

arXiv:hep-th/9409085 [pdf, ps, other]

doi 10.1016/0550-3213(94)00534-L

Path-Integral bosonization of a non-local interaction and its application to the study of 1-d many-body systems

Authors: C. M. Naón, M. C. von Reichenbach, M. L. Trobo

Abstract: We extend the path-integral approach to bosonization to the case in which the fermionic interaction is non-local. In particular we obtain a completely bosonized version of a Thirring-like model with currents coupled by general (symmetric) bilocal potentials. The model contains the Tomonaga-Luttinger model as a special case; exploiting this fact we study the basic properties of the 1-d spinless f… ▽ More We extend the path-integral approach to bosonization to the case in which the fermionic interaction is non-local. In particular we obtain a completely bosonized version of a Thirring-like model with currents coupled by general (symmetric) bilocal potentials. The model contains the Tomonaga-Luttinger model as a special case; exploiting this fact we study the basic properties of the 1-d spinless fermionic gas: fermionic correlators, the spectrum of collective modes, etc. Finally we discuss the generalization of our procedure to the non-Abelian case, thus providing a new tool to be used in the study of 1-d many-body systems with spin-flip** interactions. △ Less

Submitted 15 September, 1994; originally announced September 1994.

Comments: 26 pages LATEX, La Plata 94-09

Journal ref: Nucl.Phys. B435 (1995) 567-584

Showing 1–9 of 9 results for author: Reichenbach, M