Search | arXiv e-print repository

Decoupled Boundary Handling in SPH

Abstract: Particle-based boundary representations are frequently used in Smoothed Particle Hydrodynamics (SPH) due to their simple integration into fluid solvers. Commonly, incompressible fluid solvers estimate the current density and corresponding forces in case the current density exceeds the rest density to push fluid particles apart. Close to the boundary, the calculation of the fluid particles' density… ▽ More Particle-based boundary representations are frequently used in Smoothed Particle Hydrodynamics (SPH) due to their simple integration into fluid solvers. Commonly, incompressible fluid solvers estimate the current density and corresponding forces in case the current density exceeds the rest density to push fluid particles apart. Close to the boundary, the calculation of the fluid particles' density involves both, neighboring fluid and neighboring boundary particles, yielding an overestimation of density, and, subsequently, wrong pressure forces and wrong velocities leading to the disturbed fluid particles' behavior in the vicinity of the boundary. In this paper, we present a detailed explanation of this disturbed fluid particle behavior, which is mainly due to the combined or coupled handling of the fluid-fluid particle and the fluid-boundary particle interaction. We propose the decoupled handling of both interaction types, leading to two densities for a given fluid particle, i.e., fluid-induced density and boundary-induced density. In our approach, we alternately apply the corresponding fluid-induced and boundary-induced forces during pressure estimation. This separation avoids force overestimation and reduces unintended fluid dynamics near the boundary, as well as a consistent fluid-boundary distance across different fluid amounts and different particle-based boundary handling methods. We compare our method with two regular state-of-the-art methods in different experiments and show how our method handles detailed boundary shapes. △ Less

Submitted 21 June, 2023; originally announced June 2023.

Comments: 12 pages, 9 figures, 3 tables

arXiv:2304.14736 [pdf, other]

Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

Authors: Hendrik Sommerhoff, Shashank Agnihotri, Mohamed Saleh, Michael Moeller, Margret Keuper, Andreas Kolb

Abstract: The success of deep learning is frequently described as the ability to train all parameters of a network on a specific application in an end-to-end fashion. Yet, several design choices on the camera level, including the pixel layout of the sensor, are considered as pre-defined and fixed, and high resolution, regular pixel layouts are considered to be the most generic ones in computer vision and gr… ▽ More The success of deep learning is frequently described as the ability to train all parameters of a network on a specific application in an end-to-end fashion. Yet, several design choices on the camera level, including the pixel layout of the sensor, are considered as pre-defined and fixed, and high resolution, regular pixel layouts are considered to be the most generic ones in computer vision and graphics, treating all regions of an image as equally important. While several works have considered non-uniform, \eg, hexagonal or foveated, pixel layouts in hardware and image processing, the layout has not been integrated into the end-to-end learning paradigm so far. In this work, we present the first truly end-to-end trained imaging pipeline that optimizes the size and distribution of pixels on the imaging sensor jointly with the parameters of a given neural network on a specific task. We derive an analytic, differentiable approach for the sensor layout parameterization that allows for task-specific, local varying pixel resolutions. We present two pixel layout parameterization functions: rectangular and curvilinear grid shapes that retain a regular topology. We provide a drop-in module that approximates sensor simulation given existing high-resolution images to directly connect our method with existing deep learning models. We show that network predictions benefit from learnable pixel layouts for two different downstream tasks, classification and semantic segmentation. △ Less

Submitted 28 April, 2023; originally announced April 2023.

arXiv:2212.04666 [pdf, other]

Neural Volume Super-Resolution

Authors: Yuval Bahat, Yuxuan Zhang, Hendrik Sommerhoff, Andreas Kolb, Felix Heide

Abstract: Neural volumetric representations have become a widely adopted model for radiance fields in 3D scenes. These representations are fully implicit or hybrid function approximators of the instantaneous volumetric radiance in a scene, which are typically learned from multi-view captures of the scene. We investigate the new task of neural volume super-resolution - rendering high-resolution views corresp… ▽ More Neural volumetric representations have become a widely adopted model for radiance fields in 3D scenes. These representations are fully implicit or hybrid function approximators of the instantaneous volumetric radiance in a scene, which are typically learned from multi-view captures of the scene. We investigate the new task of neural volume super-resolution - rendering high-resolution views corresponding to a scene captured at low resolution. To this end, we propose a neural super-resolution network that operates directly on the volumetric representation of the scene. This approach allows us to exploit an advantage of operating in the volumetric domain, namely the ability to guarantee consistent super-resolution across different viewing directions. To realize our method, we devise a novel 3D representation that hinges on multiple 2D feature planes. This allows us to super-resolve the 3D scene representation by applying 2D convolutional networks on the 2D feature planes. We validate the proposed method by super-resolving multi-view consistent views on a diverse set of unseen 3D scenes, confirming qualitative and quantitatively favorable quality over existing approaches. △ Less

Submitted 5 May, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

arXiv:2206.05061 [pdf, other]

Glyph from Icon -- Automated Generation of Metaphoric Glyphs

Authors: Dmitri Presnov, Andreas Kolb

Abstract: Metaphoric glyphs enhance the readability and learnability of abstract glyphs used for the visualization of quantitative multidimensional data by building upon graphical entities that are intuitively related to the underlying problem domain. Their construction is, however, a predominantly manual process. In this paper, we introduce the Glyph-from-Icon (GfI) approach that allows the automated gener… ▽ More Metaphoric glyphs enhance the readability and learnability of abstract glyphs used for the visualization of quantitative multidimensional data by building upon graphical entities that are intuitively related to the underlying problem domain. Their construction is, however, a predominantly manual process. In this paper, we introduce the Glyph-from-Icon (GfI) approach that allows the automated generation of metaphoric glyphs from user specified icons. Our approach modifies the icon's visual appearance using up to seven quantifiable visual variables, three of which manipulate its geometry while four affect its color. Depending on the visualization goal, specific combinations of these visual variables define the glyphs's variables used for data encoding. Technically, we propose a diffusion-curve based parametric icon representation, which comprises the degrees-of-freedom related to the geometric and color-based visual variables. Moreover, we extend our GfI approach to achieve scalability of the generated glyphs. Based on a user study we evaluate the perception of the glyph's main variables, i.e., amplitude and frequency of geometric and color modulation, as function of the stimuli and deduce functional relations as well as quantization levels to achieve perceptual monotonicity and readability. Finally, we propose a robustly perceivable combination of visual variables, which we apply to the visualization of COVID-19 data. △ Less

Submitted 9 June, 2022; originally announced June 2022.

arXiv:2112.00185 [pdf, other]

Light Field Implicit Representation for Flexible Resolution Reconstruction

Authors: Paramanand Chandramouli, Hendrik Sommerhoff, Andreas Kolb

Abstract: Inspired by the recent advances in implicitly representing signals with trained neural networks, we aim to learn a continuous representation for narrow-baseline 4D light fields. We propose an implicit representation model for 4D light fields which is conditioned on a sparse set of input views. Our model is trained to output the light field values for a continuous range of query spatio-angular coor… ▽ More Inspired by the recent advances in implicitly representing signals with trained neural networks, we aim to learn a continuous representation for narrow-baseline 4D light fields. We propose an implicit representation model for 4D light fields which is conditioned on a sparse set of input views. Our model is trained to output the light field values for a continuous range of query spatio-angular coordinates. Given a sparse set of input views, our scheme can super-resolve the input in both spatial and angular domains by flexible factors. consists of a feature extractor and a decoder which are trained on a dataset of light field patches. The feature extractor captures per-pixel features from the input views. These features can be resized to a desired spatial resolution and fed to the decoder along with the query coordinates. This formulation enables us to reconstruct light field views at any desired spatial and angular resolution. Additionally, our network can handle scenarios in which input views are either of low-resolution or with missing pixels. Experiments show that our method achieves state-of-the-art performance for the task of view synthesis while being computationally fast. △ Less

Submitted 30 November, 2021; originally announced December 2021.

arXiv:2005.06508 [pdf, other]

A Generative Model for Generic Light Field Reconstruction

Authors: Paramanand Chandramouli, Kanchana Vaishnavi Gandikota, Andreas Goerlitz, Andreas Kolb, Michael Moeller

Abstract: Recently deep generative models have achieved impressive progress in modeling the distribution of training data. In this work, we present for the first time a generative model for 4D light field patches using variational autoencoders to capture the data distribution of light field patches. We develop a generative model conditioned on the central view of the light field and incorporate this as a pr… ▽ More Recently deep generative models have achieved impressive progress in modeling the distribution of training data. In this work, we present for the first time a generative model for 4D light field patches using variational autoencoders to capture the data distribution of light field patches. We develop a generative model conditioned on the central view of the light field and incorporate this as a prior in an energy minimization framework to address diverse light field reconstruction tasks. While pure learning-based approaches do achieve excellent results on each instance of such a problem, their applicability is limited to the specific observation model they have been trained on. On the contrary, our trained light field generative model can be incorporated as a prior into any model-based optimization approach and therefore extend to diverse reconstruction tasks including light field view synthesis, spatial-angular super resolution and reconstruction from coded projections. Our proposed method demonstrates good reconstruction, with performance approaching end-to-end trained networks, while outperforming traditional model-based approaches on both synthetic and real scenes. Furthermore, we show that our approach enables reliable light field recovery despite distortions in the input. △ Less

Submitted 17 June, 2020; v1 submitted 13 May, 2020; originally announced May 2020.

arXiv:1907.04099 [pdf, other]

doi 10.1111/cgf.13808

Progressive Refinement Imaging

Authors: Markus Kluge, Tim Weyrich, Andreas Kolb

Abstract: This paper presents a novel technique for progressive online integration of uncalibrated image sequences with substantial geometric and/or photometric discrepancies into a single, geometrically and photometrically consistent image. Our approach can handle large sets of images, acquired from a nearly planar or infinitely distant scene at different resolutions in object domain and under variable loc… ▽ More This paper presents a novel technique for progressive online integration of uncalibrated image sequences with substantial geometric and/or photometric discrepancies into a single, geometrically and photometrically consistent image. Our approach can handle large sets of images, acquired from a nearly planar or infinitely distant scene at different resolutions in object domain and under variable local or global illumination conditions. It allows for efficient user guidance as its progressive nature provides a valid and consistent reconstruction at any moment during the online refinement process. Our approach avoids global optimization techniques, as commonly used in the field of image refinement, and progressively incorporates new imagery into a dynamically extendable and memory-efficient Laplacian pyramid. Our image registration process includes a coarse homography and a local refinement stage using optical flow. Photometric consistency is achieved by retaining the photometric intensities given in a reference image, while it is being refined. Globally blurred imagery and local geometric inconsistencies due to, e.g. motion are detected and removed prior to image fusion. We demonstrate the quality and robustness of our approach using several image and video sequences, including handheld acquisition with mobile phones and zooming sequences with consumer cameras. △ Less

Submitted 9 September, 2019; v1 submitted 9 July, 2019; originally announced July 2019.

Comments: This article has been published in final form at https://doi.org/10.1111/cgf.13808

arXiv:1907.01377 [pdf, other]

doi 10.1007/978-3-030-33676-9_7

Training Auto-encoder-based Optimizers for Terahertz Image Reconstruction

Authors: Tak Ming Wong, Matthias Kahl, Peter Haring Bolívar, Andreas Kolb, Michael Möller

Abstract: Terahertz (THz) sensing is a promising imaging technology for a wide variety of different applications. Extracting the interpretable and physically meaningful parameters for such applications, however, requires solving an inverse problem in which a model function determined by these parameters needs to be fitted to the measured data. Since the underlying optimization problem is nonconvex and very… ▽ More Terahertz (THz) sensing is a promising imaging technology for a wide variety of different applications. Extracting the interpretable and physically meaningful parameters for such applications, however, requires solving an inverse problem in which a model function determined by these parameters needs to be fitted to the measured data. Since the underlying optimization problem is nonconvex and very costly to solve, we propose learning the prediction of suitable parameters from the measured data directly. More precisely, we develop a model-based autoencoder in which the encoder network predicts suitable parameters and the decoder is fixed to a physically meaningful model function, such that we can train the encoding network in an unsupervised way. We illustrate numerically that the resulting network is more than 140 times faster than classical optimization techniques while making predictions with only slightly higher objective values. Using such predictions as starting points of local optimization techniques allows us to converge to better local minima about twice as fast as optimization without the network-based initialization. △ Less

Submitted 29 October, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

Comments: This is a pre-print of a conference paper published in German Conference on Pattern Recognition (GCPR) 2019

Journal ref: Pattern Recognition. DAGM GCPR 2019. Lecture Notes in Computer Science, vol 11824. Springer, Cham

arXiv:1906.07720 [pdf, other]

On-Body Visualization of Patient Data for Cooperative Tasks

Authors: Dmitri Presnov, Julia Kurz, Judith Willkomm, Daniel Alt, Johannes Dillmann, Robert Zilke, Veit Braun, Cornelius Schubert, Andreas Kolb

Abstract: Electronic health records (EHR) systematically represent patient data in digital form. However, text and visualization based EHR systems are poorly integrated in the hospital workflow due to their complex and rather non-intuitive access structure. This is especially disadvantageous in clinical cooperative situations that require an efficient, task specific information transfer. In this paper we… ▽ More Electronic health records (EHR) systematically represent patient data in digital form. However, text and visualization based EHR systems are poorly integrated in the hospital workflow due to their complex and rather non-intuitive access structure. This is especially disadvantageous in clinical cooperative situations that require an efficient, task specific information transfer. In this paper we introduce a novel concept of anatomically integrated in-place visualization designed to engage with cooperative tasks on a neurosurgical ward. Based on the findings of our field studies and the derived design goals, we propose an approach that follows a visual tradition in medicine, which is tightly related with anatomy, by using a virtual patient's body as spatial representation of visually encoded abstract medical data. More specifically, we provide a generic set of formal requirements for these kinds of in-place visualizations, we apply these requirements in order to achieve a specific visualization of neurological symptoms related to the differential diagnosis of spinal disc herniation, and we present a prototypical implementation of the visualization concept on a mobile device. Moreover, we discuss various challenges related to visual encoding and visibility of the body model components. Finally, the prototype is evaluated by 10 neurosurgeons, who assess the validity and the further potential of the proposed approach. △ Less

Submitted 27 February, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

arXiv:1905.03580 [pdf]

Designing technology, develo** theory. Towards a symmetrical approach

Authors: Cornelius Schubert, Andreas Kolb

Abstract: We focus on collaborative activities that engage computer graphics designers and social scientists in systems design processes. Our conceptual symmetrical account of technology design and theory development is elaborated as a mode of mutual engagement occurring in an interdisciplinary trading zone, where neither discipline is placed at the service of the other, and nor do disciplinary boundaries d… ▽ More We focus on collaborative activities that engage computer graphics designers and social scientists in systems design processes. Our conceptual symmetrical account of technology design and theory development is elaborated as a mode of mutual engagement occurring in an interdisciplinary trading zone, where neither discipline is placed at the service of the other, and nor do disciplinary boundaries dissolve. To this end, we draw on analyses of mutual engagements between computer and social scientists stemming from the fields of computer-supported cooperative work (CSCW), human-computer interaction (HCI), and science and technology studies (STS). We especially build on theoretical work in STS concerning information technology (IT) in health care and extend recent contributions from STS with respect to the modes of engagement and trading zones between computer and social sciences. We conceive participative digital systems design as a form of inquiry for the analysis of cooperative work settings, particularly when social science becomes part of design processes. We illustrate our conceptual approach using data from an interdisciplinary project involving computer graphics designers, sociologists, and neurosurgeons with the aim of develo** patient-centered visualizations for clinical cooperation on a hospital ward. △ Less

Submitted 10 June, 2020; v1 submitted 9 May, 2019; originally announced May 2019.

Comments: 36 pages, 2 figures

arXiv:1812.04042 [pdf, ps, other]

Supervised Deep Kriging for Single-Image Super-Resolution

Authors: Gianni Franchi, Angela Yao, Andreas Kolb

Abstract: We propose a novel single-image super-resolution approach based on the geostatistical method of kriging. Kriging is a zero-bias minimum-variance estimator that performs spatial interpolation based on a weighted average of known observations. Rather than solving for the kriging weights via the traditional method of inverting covariance matrices, we propose a supervised form in which we learn a deep… ▽ More We propose a novel single-image super-resolution approach based on the geostatistical method of kriging. Kriging is a zero-bias minimum-variance estimator that performs spatial interpolation based on a weighted average of known observations. Rather than solving for the kriging weights via the traditional method of inverting covariance matrices, we propose a supervised form in which we learn a deep network to generate said weights. We combine the kriging weight generation and kriging process into a joint network that can be learned end-to-end. Our network achieves competitive super-resolution results as other state-of-the-art methods. In addition, since the super-resolution process follows a known statistical framework, we are able to estimate bias and variance, something which is rarely possible for other deep networks. △ Less

Submitted 10 December, 2018; originally announced December 2018.

Comments: 3 figures, for a better quality read the hal or GCPR version

arXiv:1811.02396 [pdf, other]

A Bit Too Much? High Speed Imaging from Sparse Photon Counts

Authors: Paramanand Chandramouli, Samuel Burri, Claudio Bruschini, Edoardo Charbon, Andreas Kolb

Abstract: Recent advances in photographic sensing technologies have made it possible to achieve light detection in terms of a single photon. Photon counting sensors are being increasingly used in many diverse applications. We address the problem of jointly recovering spatial and temporal scene radiance from very few photon counts. Our ConvNet-based scheme effectively combines spatial and temporal informatio… ▽ More Recent advances in photographic sensing technologies have made it possible to achieve light detection in terms of a single photon. Photon counting sensors are being increasingly used in many diverse applications. We address the problem of jointly recovering spatial and temporal scene radiance from very few photon counts. Our ConvNet-based scheme effectively combines spatial and temporal information present in measurements to reduce noise. We demonstrate that using our method one can acquire videos at a high frame rate and still achieve good quality signal-to-noise ratio. Experiments show that the proposed scheme performs quite well in different challenging scenarios while the existing approaches are unable to handle them. △ Less

Submitted 11 May, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

arXiv:1809.10089 [pdf, ps, other]

Residuum-Condition Diagram and Reduction of Over-Complete Endmember-Sets

Authors: Christoph Schikora, Markus Plack, Andreas Kolb

Abstract: Extracting reference spectra, or endmembers (EMs) from a given multi- or hyperspectral image, as well as estimating the size of the EM set, plays an important role in multispectral image processing. In this paper, we present condition-residuum-diagrams. By plotting the residuum resulting from the unmixing and reconstruction and the condition number of various EM sets, the resulting diagram provide… ▽ More Extracting reference spectra, or endmembers (EMs) from a given multi- or hyperspectral image, as well as estimating the size of the EM set, plays an important role in multispectral image processing. In this paper, we present condition-residuum-diagrams. By plotting the residuum resulting from the unmixing and reconstruction and the condition number of various EM sets, the resulting diagram provides insight into the behavior of the spectral unmixing under a varying amount of endmembers (EMs). Furthermore, we utilize condition-residuum-diagrams to realize an EM reduction algorithm that starts with an initially extracted, over-complete EM set. An over-complete EM set commonly exhibits a good unmixing result, i.e. a lower reconstruction residuum, but due to its partial redundancy, the unmixing gets numerically unstable, i.e. the unmixed abundances values are less reliable. Our greedy reduction scheme improves the EM set by reducing the condition number, i.e. enhancing the set's stability, while kee** the reconstruction error as low as possible. The resulting set sequence gives hint to the optimal EM set and its size. We demonstrate the benefit of our condition-residuum-diagram and reduction scheme on well-studied datasets with known reference EM set sizes for several well-known EE algorithms. △ Less

Submitted 26 September, 2018; originally announced September 2018.

arXiv:1602.02022 [pdf]

doi 10.1117/12.877660

Preoperative Volume Determination for Pituitary Adenoma

Authors: Dzenan Zukic, Jan Egger, Miriam H. A. Bauer, Daniela Kuhnt, Barbara Carl, Bernd Freisleben, Andreas Kolb, Christopher Nimsky

Abstract: The most common sellar lesion is the pituitary adenoma, and sellar tumors are approximately 10-15% of all intracranial neoplasms. Manual slice-by-slice segmentation takes quite some time that can be reduced by using the appropriate algorithms. In this contribution, we present a segmentation method for pituitary adenoma. The method is based on an algorithm that we have applied recently to segmentin… ▽ More The most common sellar lesion is the pituitary adenoma, and sellar tumors are approximately 10-15% of all intracranial neoplasms. Manual slice-by-slice segmentation takes quite some time that can be reduced by using the appropriate algorithms. In this contribution, we present a segmentation method for pituitary adenoma. The method is based on an algorithm that we have applied recently to segmenting glioblastoma multiforme. A modification of this scheme is used for adenoma segmentation that is much harder to perform, due to lack of contrast-enhanced boundaries. In our experimental evaluation, neurosurgeons performed manual slice-by-slice segmentation of ten magnetic resonance imaging (MRI) cases. The segmentations were compared to the segmentation results of the proposed method using the Dice Similarity Coefficient (DSC). The average DSC for all datasets was 75.92% +/- 7.24%. A manual segmentation took about four minutes and our algorithm required about one second. △ Less

Submitted 5 February, 2016; originally announced February 2016.

Comments: 7 pages, 6 figures, 1 table, 16 references in Proc. SPIE 7963, Medical Imaging 2011: Computer-Aided Diagnosis, 79632T (9 March 2011). arXiv admin note: text overlap with arXiv:1103.1778

arXiv:1505.05459 [pdf, other]

Kinect Range Sensing: Structured-Light versus Time-of-Flight Kinect

Authors: Hamed Sarbolandi, Damien Lefloch, Andreas Kolb

Abstract: Recently, the new Kinect One has been issued by Microsoft, providing the next generation of real-time range sensing devices based on the Time-of-Flight (ToF) principle. As the first Kinect version was using a structured light approach, one would expect various differences in the characteristics of the range data delivered by both devices. This paper presents a detailed and in-depth comparison betw… ▽ More Recently, the new Kinect One has been issued by Microsoft, providing the next generation of real-time range sensing devices based on the Time-of-Flight (ToF) principle. As the first Kinect version was using a structured light approach, one would expect various differences in the characteristics of the range data delivered by both devices. This paper presents a detailed and in-depth comparison between both devices. In order to conduct the comparison, we propose a framework of seven different experimental setups, which is a generic basis for evaluating range cameras such as Kinect. The experiments have been designed with the goal to capture individual effects of the Kinect devices as isolatedly as possible and in a way, that they can also be adopted, in order to apply them to any other range sensing device. The overall goal of this paper is to provide a solid insight into the pros and cons of either device. Thus, scientists that are interested in using Kinect range sensing cameras in their specific application scenario can directly assess the expected, specific benefits and potential problem of either device. △ Less

Submitted 20 May, 2015; originally announced May 2015.

Comments: 58 pages, 23 figures. Accepted for publication in Computer Vision and Image Understanding (CVIU)

arXiv:1302.2024 [pdf, other]

User Interface for Volume Rendering in Virtual Reality Environments

Authors: Jonathan Klein, Dennis Reuling, Jan Grimm, Andreas Pfau, Damien Lefloch, Martin Lambers, Andreas Kolb

Abstract: Volume Rendering applications require sophisticated user interaction for the definition and refinement of transfer functions. Traditional 2D desktop user interface elements have been developed to solve this task, but such concepts do not map well to the interaction devices available in Virtual Reality environments. In this paper, we propose an intuitive user interface for Volume Rendering specif… ▽ More Volume Rendering applications require sophisticated user interaction for the definition and refinement of transfer functions. Traditional 2D desktop user interface elements have been developed to solve this task, but such concepts do not map well to the interaction devices available in Virtual Reality environments. In this paper, we propose an intuitive user interface for Volume Rendering specifically designed for Virtual Reality environments. The proposed interface allows transfer function design and refinement based on intuitive two-handed operation of Wand-like controllers. Additional interaction modes such as navigation and clip plane manipulation are supported as well. The system is implemented using the Sony PlayStation Move controller system. This choice is based on controller device capabilities as well as application and environment constraints. Initial results document the potential of our approach. △ Less

Submitted 8 February, 2013; originally announced February 2013.

arXiv:1110.5450 [pdf, ps, other]

Hand Tracking based on Hierarchical Clustering of Range Data

Authors: Roberto Cespi, Andreas Kolb, Marvin Lindner

Abstract: Fast and robust hand segmentation and tracking is an essential basis for gesture recognition and thus an important component for contact-less human-computer interaction (HCI). Hand gesture recognition based on 2D video data has been intensively investigated. However, in practical scenarios purely intensity based approaches suffer from uncontrollable environmental conditions like cluttered backgrou… ▽ More Fast and robust hand segmentation and tracking is an essential basis for gesture recognition and thus an important component for contact-less human-computer interaction (HCI). Hand gesture recognition based on 2D video data has been intensively investigated. However, in practical scenarios purely intensity based approaches suffer from uncontrollable environmental conditions like cluttered background colors. In this paper we present a real-time hand segmentation and tracking algorithm using Time-of-Flight (ToF) range cameras and intensity data. The intensity and range information is fused into one pixel value, representing its combined intensity-depth homogeneity. The scene is hierarchically clustered using a GPU based parallel merging algorithm, allowing a robust identification of both hands even for inhomogeneous backgrounds. After the detection, both hands are tracked on the CPU. Our tracking algorithm can cope with the situation that one hand is temporarily covered by the other hand. △ Less

Submitted 25 October, 2011; originally announced October 2011.

Comments: Technical Report

MSC Class: 68U10 ACM Class: I.4.6

arXiv:1102.2382 [pdf]

A Comparison of Two Human Brain Tumor Segmentation Methods for MRI Data

Authors: Jan Egger, Dženan Zukić, Miriam H. A. Bauer, Daniela Kuhnt, Barbara Carl, Bernd Freisleben, Andreas Kolb, Christopher Nimsky

Abstract: The most common primary brain tumors are gliomas, evolving from the cerebral supportive cells. For clinical follow-up, the evaluation of the preoperative tumor volume is essential. Volumetric assessment of tumor volume with manual segmentation of its outlines is a time-consuming process that can be overcome with the help of computerized segmentation methods. In this contribution, two methods for W… ▽ More The most common primary brain tumors are gliomas, evolving from the cerebral supportive cells. For clinical follow-up, the evaluation of the preoperative tumor volume is essential. Volumetric assessment of tumor volume with manual segmentation of its outlines is a time-consuming process that can be overcome with the help of computerized segmentation methods. In this contribution, two methods for World Health Organization (WHO) grade IV glioma segmentation in the human brain are compared using magnetic resonance imaging (MRI) patient data from the clinical routine. One method uses balloon inflation forces, and relies on detection of high intensity tumor boundaries that are coupled with the use of contrast agent gadolinium. The other method sets up a directed and weighted graph and performs a min-cut for optimal segmentation results. The ground truth of the tumor boundaries - for evaluating the methods on 27 cases - is manually extracted by neurosurgeons with several years of experience in the resection of gliomas. A comparison is performed using the Dice Similarity Coefficient (DSC), a measure for the spatial overlap of different segmentation results. △ Less

Submitted 10 March, 2011; v1 submitted 9 February, 2011; originally announced February 2011.

Comments: 4 pages, 5 figures, Proc. of the 6th Russian-Bavarian Conference on Bio-Medical Engineering

arXiv:1102.0634 [pdf]

Glioblastoma Multiforme Segmentation in MRI Data with a Balloon Inflation Approach

Authors: Dženan Zukić, Jan Egger, Miriam H. A. Bauer, Daniela Kuhnt, Barbara Carl, Bernd Freisleben, Andreas Kolb, Christopher Nimsky

Abstract: Gliomas are the most common primary brain tumors, evolving from the cerebral supportive cells. For clinical follow-up, the evaluation of the preoperative tumor volume is essential. Volumetric assessment of tumor volume with manual segmentation of its outlines is a time-consuming process that can be overcome with the help of computer-assisted segmentation methods. In this paper, a semi-automatic ap… ▽ More Gliomas are the most common primary brain tumors, evolving from the cerebral supportive cells. For clinical follow-up, the evaluation of the preoperative tumor volume is essential. Volumetric assessment of tumor volume with manual segmentation of its outlines is a time-consuming process that can be overcome with the help of computer-assisted segmentation methods. In this paper, a semi-automatic approach for World Health Organization (WHO) grade IV glioma segmentation is introduced that uses balloon inflation forces, and relies on the detection of high-intensity tumor boundaries that are coupled by using contrast agent gadolinium. The presented method is evaluated on 27 magnetic resonance imaging (MRI) data sets and the ground truth data of the tumor boundaries - for evaluation of the results - are manually extracted by neurosurgeons. △ Less

Submitted 3 February, 2011; originally announced February 2011.

Comments: 4 pages, 4 figures, Proc. of the 6th Russian-Bavarian Conference on Bio-Medical Engineering

arXiv:0906.2274 [pdf, other]

A Neural Network Classifier of Volume Datasets

Authors: Dženan Zukić, Christof Rezk-Salama, Andreas Kolb

Abstract: Many state-of-the art visualization techniques must be tailored to the specific type of dataset, its modality (CT, MRI, etc.), the recorded object or anatomical region (head, spine, abdomen, etc.) and other parameters related to the data acquisition process. While parts of the information (imaging modality and acquisition sequence) may be obtained from the meta-data stored with the volume scan,… ▽ More Many state-of-the art visualization techniques must be tailored to the specific type of dataset, its modality (CT, MRI, etc.), the recorded object or anatomical region (head, spine, abdomen, etc.) and other parameters related to the data acquisition process. While parts of the information (imaging modality and acquisition sequence) may be obtained from the meta-data stored with the volume scan, there is important information which is not stored explicitly (anatomical region, tracing compound). Also, meta-data might be incomplete, inappropriate or simply missing. This paper presents a novel and simple method of determining the type of dataset from previously defined categories. 2D histograms based on intensity and gradient magnitude of datasets are used as input to a neural network, which classifies it into one of several categories it was trained with. The proposed method is an important building block for visualization systems to be used autonomously by non-experts. The method has been tested on 80 datasets, divided into 3 classes and a "rest" class. A significant result is the ability of the system to classify datasets into a specific class after being trained with only one dataset of that class. Other advantages of the method are its easy implementation and its high computational performance. △ Less

Submitted 12 June, 2009; originally announced June 2009.

Comments: 10 pages, 10 figures, 1 table, 3IA conference http://3ia.teiath.gr/

ACM Class: I.3.6; I.2.6

Journal ref: International Conference on Computer Graphics and Artificial Intelligence, Proceedings (2009) 53-62

Showing 1–20 of 20 results for author: Kolb, A