-
On the Convexity and Reliability of the Bethe Free Energy Approximation
Authors:
Harald Leisenberger,
Christian Knoll,
Franz Pernkopf
Abstract:
The Bethe free energy approximation provides an effective way for relaxing NP-hard problems of probabilistic inference. However, its accuracy depends on the model parameters and particularly degrades if a phase transition in the model occurs. In this work, we analyze when the Bethe approximation is reliable and how this can be verified. We argue and show by experiment that it is mostly accurate if it is convex on a submanifold of its domain, the 'Bethe box'. For verifying its convexity, we derive two sufficient conditions that are based on the definiteness properties of the Bethe Hessian matrix: the first uses the concept of diagonal dominance, and the second decomposes the Bethe Hessian matrix into a sum of sparse matrices and characterizes the definiteness properties of the individual matrices in that sum. These theoretical results provide a simple way to estimate the critical phase transition temperature of a model. As a practical contribution we propose $\texttt{BETHE-MIN}$, a projected quasi-Newton method to efficiently find a minimum of the Bethe free energy.
Submitted 24 May, 2024;
originally announced May 2024.
-
PointCompress3D -- A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems
Authors:
Walter Zimmer,
Ramandika Pranamulia,
Xingcheng Zhou,
Mingyu Liu,
Alois C. Knoll
Abstract:
In the context of Intelligent Transportation Systems (ITS), efficient data compression is crucial for managing large-scale point cloud data acquired by roadside LiDAR sensors. The demand for efficient storage, streaming, and real-time object detection capabilities for point cloud data is substantial. This work introduces PointCompress3D, a novel point cloud compression framework tailored specifically for roadside LiDARs. Our framework addresses the challenges of compressing high-resolution point clouds while maintaining accuracy and compatibility with roadside LiDAR sensors. We adapt, extend, integrate, and evaluate three cutting-edge compression methods using our real-world-based TUMTraf dataset family. We achieve a frame rate of 10 FPS while keeping compression sizes below 105 Kb, a reduction of 50 times, and maintaining object detection performance on par with the original data. In extensive experiments and ablation studies, we finally achieved a PSNR d2 of 94.46 and a BPP of 6.54 on our dataset. Future work includes the deployment on the live system. The code is available on our project website: https://pointcompress3d.github.io.
Submitted 2 May, 2024;
originally announced May 2024.
-
Tensions between Preference and Performance: Designing for Visual Exploration of Multi-frequency Medical Network Data
Authors:
Christian Knoll,
Laura Koesten,
Isotta Rigoni,
Serge Vulliémoz,
Torsten Möller
Abstract:
The analysis of complex high-dimensional data is a common task in many domains, resulting in bespoke visual exploration tools. Expectations and practices of domain experts as users do not always align with visualization theory. In this paper, we report on a design study in the medical domain where we developed two high-fidelity prototypes encoding EEG-derived brain network data with different types of visualizations. We evaluate these prototypes regarding effectiveness, efficiency, and preference with two groups: participants with domain knowledge (domain experts in medical research) and those without domain knowledge, both groups having little or no visualization experience. A requirement analysis and study of low-fidelity prototypes revealed a strong preference for a novel and aesthetically pleasing visualization design, as opposed to a design that is considered more optimal based on visualization theory. Our study highlights the pros and cons of both approaches, discussing trade-offs between task-specific measurements and subjective preference. While the aesthetically pleasing and novel low-fidelity prototype was favored, the results of our evaluation show that, in most cases, this was not reflected in participants' performance or subjective preference for the high-fidelity prototypes.
Submitted 5 April, 2024;
originally announced April 2024.
-
TUMTraf V2X Cooperative Perception Dataset
Authors:
Walter Zimmer,
Gerhard Arya Wardana,
Suren Sritharan,
Xingcheng Zhou,
Rui Song,
Alois C. Knoll
Abstract:
Cooperative perception offers several benefits for enhancing the capabilities of autonomous vehicles and improving road safety. Using roadside sensors in addition to onboard sensors increases reliability and extends the sensor range. External sensors offer higher situational awareness for automated vehicles and prevent occlusions. We propose CoopDet3D, a cooperative multi-modal fusion model, and TUMTraf-V2X, a perception dataset, for the cooperative 3D object detection and tracking task. Our dataset contains 2,000 labeled point clouds and 5,000 labeled images from five roadside and four onboard sensors. It includes 30k 3D boxes with track IDs and precise GPS and IMU data. We labeled eight categories and covered occlusion scenarios with challenging driving maneuvers, like traffic violations, near-miss events, overtaking, and U-turns. Through multiple experiments, we show that our CoopDet3D camera-LiDAR fusion model achieves an increase of +14.36 3D mAP compared to a vehicle camera-LiDAR fusion model. Finally, we make our dataset, model, labeling tool, and dev-kit publicly available on our website: https://tum-traffic-dataset.github.io/tumtraf-v2x.
Submitted 2 March, 2024;
originally announced March 2024.
-
Rao-Blackwellising Bayesian Causal Inference
Authors:
Christian Toth,
Christian Knoll,
Franz Pernkopf,
Robert Peharz
Abstract:
Bayesian causal inference, i.e., inferring a posterior over causal models for the use in downstream causal reasoning tasks, poses a hard computational inference problem that is little explored in literature. In this work, we combine techniques from order-based MCMC structure learning with recent advances in gradient-based graph learning into an effective Bayesian causal inference framework. Specifically, we decompose the problem of inferring the causal structure into (i) inferring a topological order over variables and (ii) inferring the parent sets for each variable. When limiting the number of parents per variable, we can exactly marginalise over the parent sets in polynomial time. We further use Gaussian processes to model the unknown causal mechanisms, which also allows their exact marginalisation. This introduces a Rao-Blackwellization scheme, where all components are eliminated from the model, except for the causal order, for which we learn a distribution via gradient-based optimisation. The combination of Rao-Blackwellization with our sequential inference procedure for causal orders yields state-of-the-art on linear and non-linear additive noise benchmarks with scale-free and Erdos-Renyi graph structures.
Submitted 22 February, 2024;
originally announced February 2024.
-
ActiveAnno3D -- An Active Learning Framework for Multi-Modal 3D Object Detection
Authors:
Ahmed Ghita,
Bjørk Antoniussen,
Walter Zimmer,
Ross Greer,
Christian Creß,
Andreas Møgelmose,
Mohan M. Trivedi,
Alois C. Knoll
Abstract:
The curation of large-scale datasets is still costly and requires much time and resources. Data is often manually labeled, and the challenge of creating high-quality datasets remains. In this work, we fill the research gap using active learning for multi-modal 3D object detection. We propose ActiveAnno3D, an active learning framework to select data samples for labeling that are of maximum informativeness for training. We explore various continuous training methods and integrate the most efficient method regarding computational demand and detection performance. Furthermore, we perform extensive experiments and ablation studies with BEVFusion and PV-RCNN on the nuScenes and TUM Traffic Intersection dataset. We show that we can achieve almost the same performance with PV-RCNN and the entropy-based query strategy when using only half of the training data (77.25 mAP compared to 83.50 mAP) of the TUM Traffic Intersection dataset. BEVFusion achieved an mAP of 64.31 when using half of the training data and 75.0 mAP when using the complete nuScenes dataset. We integrate our active learning framework into the proAnno labeling tool to enable AI-assisted data selection and labeling and minimize the labeling costs. Finally, we provide code, weights, and visualization results on our website: https://active3d-framework.github.io/active3d-framework.
Submitted 5 February, 2024;
originally announced February 2024.
-
GPT-4V as Traffic Assistant: An In-depth Look at Vision Language Model on Complex Traffic Events
Authors:
Xingcheng Zhou,
Alois C. Knoll
Abstract:
The recognition and understanding of traffic incidents, particularly traffic accidents, is a topic of paramount importance in the realm of intelligent transportation systems and intelligent vehicles. This area has continually captured the extensive focus of both the academic and industrial sectors. Identifying and comprehending complex traffic events is highly challenging, primarily due to the intricate nature of traffic environments, diverse observational perspectives, and the multifaceted causes of accidents. These factors have persistently impeded the development of effective solutions. The advent of large vision-language models (VLMs) such as GPT-4V, has introduced innovative approaches to addressing this issue. In this paper, we explore the ability of GPT-4V with a set of representative traffic incident videos and delve into the model's capacity of understanding these complex traffic situations. We observe that GPT-4V demonstrates remarkable cognitive, reasoning, and decision-making ability in certain classic traffic events. Concurrently, we also identify certain limitations of GPT-4V, which constrain its understanding in more intricate scenarios. These limitations merit further exploration and resolution.
Submitted 7 February, 2024; v1 submitted 3 February, 2024;
originally announced February 2024.
-
TUMTraf Event: Calibration and Fusion Resulting in a Dataset for Roadside Event-Based and RGB Cameras
Authors:
Christian Creß,
Walter Zimmer,
Nils Purschke,
Bach Ngoc Doan,
Sven Kirchner,
Venkatnarayanan Lakshminarasimhan,
Leah Strand,
Alois C. Knoll
Abstract:
Event-based cameras are predestined for Intelligent Transportation Systems (ITS). They provide very high temporal resolution and dynamic range, which can eliminate motion blur and improve detection performance at night. However, event-based images lack color and texture compared to images from a conventional RGB camera. Considering that, data fusion between event-based and conventional cameras can combine the strengths of both modalities. For this purpose, extrinsic calibration is necessary. To the best of our knowledge, no targetless calibration between event-based and RGB cameras can handle multiple moving objects, nor does data fusion optimized for the domain of roadside ITS exist. Furthermore, synchronized event-based and RGB camera datasets considering roadside perspective are not yet published. To fill these research gaps, based on our previous work, we extended our targetless calibration approach with clustering methods to handle multiple moving objects. Furthermore, we developed an early fusion, simple late fusion, and a novel spatiotemporal late fusion method. Lastly, we published the TUMTraf Event Dataset, which contains more than 4,111 synchronized event-based and RGB images with 50,496 labeled 2D boxes. During our extensive experiments, we verified the effectiveness of our calibration method with multiple moving objects. Furthermore, compared to a single RGB camera, we increased the detection performance of up to +9 % mAP in the day and up to +13 % mAP during the challenging night with our presented event-based sensor fusion methods. The TUMTraf Event Dataset is available at https://innovation-mobility.com/tumtraf-dataset.
Submitted 9 March, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
A Survey on Autonomous Driving Datasets: Statistics, Annotation Quality, and a Future Outlook
Authors:
Mingyu Liu,
Ekim Yurtsever,
Jonathan Fossaert,
Xingcheng Zhou,
Walter Zimmer,
Yuning Cui,
Bare Luka Zagar,
Alois C. Knoll
Abstract:
Autonomous driving has rapidly developed and shown promising performance due to recent advances in hardware and deep learning techniques. High-quality datasets are fundamental for developing reliable autonomous driving algorithms. Previous dataset surveys either focused on a limited number or lacked detailed investigation of dataset characteristics. To this end, we present an exhaustive study of 265 autonomous driving datasets from multiple perspectives, including sensor modalities, data size, tasks, and contextual conditions. We introduce a novel metric to evaluate the impact of datasets, which can also be a guide for creating new datasets. Besides, we analyze the annotation processes, existing labeling tools, and the annotation quality of datasets, showing the importance of establishing a standard annotation pipeline. On the other hand, we thoroughly analyze the impact of geographical and adversarial environmental conditions on the performance of autonomous driving systems. Moreover, we exhibit the data distribution of several vital datasets and discuss their pros and cons accordingly. Finally, we discuss the current challenges and the development trend of the future autonomous driving datasets.
Submitted 23 April, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
Interpretability is in the eye of the beholder: Human versus artificial classification of image segments generated by humans versus XAI
Authors:
Romy Müller,
Marius Thoß,
Julian Ullrich,
Steffen Seitz,
Carsten Knoll
Abstract:
The evaluation of explainable artificial intelligence is challenging, because automated and human-centred metrics of explanation quality may diverge. To clarify their relationship, we investigated whether human and artificial image classification will benefit from the same visual explanations. In three experiments, we analysed human reaction times, errors, and subjective ratings while participants classified image segments. These segments either reflected human attention (eye movements, manual selections) or the outputs of two attribution methods explaining a ResNet (Grad-CAM, XRAI). We also had this model classify the same segments. Humans and the model largely agreed on the interpretability of attribution methods: Grad-CAM was easily interpretable for indoor scenes and landscapes, but not for objects, while the reverse pattern was observed for XRAI. Conversely, human and model performance diverged for human-generated segments. Our results caution against general statements about interpretability, as it varies with the explanation method, the explained images, and the agent interpreting them.
Submitted 12 February, 2024; v1 submitted 21 November, 2023;
originally announced November 2023.
-
Vision Language Models in Autonomous Driving: A Survey and Outlook
Authors:
Xingcheng Zhou,
Mingyu Liu,
Ekim Yurtsever,
Bare Luka Zagar,
Walter Zimmer,
Hu Cao,
Alois C. Knoll
Abstract:
The applications of Vision-Language Models (VLMs) in the field of Autonomous Driving (AD) have attracted widespread attention due to their outstanding performance and the ability to leverage Large Language Models (LLMs). By incorporating language data, driving systems can gain a better understanding of real-world environments, thereby enhancing driving safety and efficiency. In this work, we present a comprehensive and systematic survey of the advances in vision language models in this domain, encompassing perception and understanding, navigation and planning, decision-making and control, end-to-end autonomous driving, and data generation. We introduce the mainstream VLM tasks in AD and the commonly utilized metrics. Additionally, we review current studies and applications in various areas and summarize the existing language-enhanced autonomous driving datasets thoroughly. Lastly, we discuss the benefits and challenges of VLMs in AD and provide researchers with the current research gaps and future trends.
Submitted 20 June, 2024; v1 submitted 22 October, 2023;
originally announced October 2023.
-
3D Understanding of Deformable Linear Objects: Datasets and Transferability Benchmark
Authors:
Bare Luka Žagar,
Tim Hertel,
Mingyu Liu,
Ekim Yurtsever,
Alois C. Knoll
Abstract:
Deformable linear objects are vastly represented in our everyday lives. It is often challenging even for humans to visually understand them, as the same object can be entangled so that it appears completely different. Examples of deformable linear objects include blood vessels and wiring harnesses, vital to the functioning of their corresponding systems, such as the human body and a vehicle. However, no point cloud datasets exist for studying 3D deformable linear objects. Therefore, we are introducing two point cloud datasets, PointWire and PointVessel. We evaluated state-of-the-art methods on the proposed large-scale 3D deformable linear object benchmarks. Finally, we analyzed the generalization capabilities of these methods by conducting transferability experiments on the PointWire and PointVessel datasets.
Submitted 13 October, 2023;
originally announced October 2023.
-
The Gulf of Interpretation: From Chart to Message and Back Again
Authors:
Christian Knoll,
Torsten Möller,
Kathleen Gregory,
Laura Koesten
Abstract:
Charts are used to communicate data visually, but designing an effective chart that a broad set of people can understand is challenging. Usually, we do not know whether a chart's intended message aligns with the message readers perceive. In this mixed-methods study, we investigate how data journalists encode data and how a broad audience engages with, experiences, and understands these data visualizations. We conducted a series of workshops and interviews with school students, university students, job seekers, designers, and senior citizens to collect perceived messages and subjective feedback on a sample of eight real-world charts. We analyzed these messages and compared them to the intended message of the chart producer. Four of the collected messages from consumers were then provided to data journalists (including the ones that created the original charts) as a starting point to re-design the charts accordingly. The results from our work underline the difficulty of complex charts such as stacked bar charts and Sankey diagrams. Consumers are often overwhelmed with the amount of data provided and are easily confused with terms (as text) not well known. Chart producers tend to be faithful with data but are willing to abstract further when asked to transport particular messages visually. There are strong conventions on how to visually encode particular information that might not be to the benefit of many consumers.
Submitted 9 October, 2023;
originally announced October 2023.
-
Do humans and Convolutional Neural Networks attend to similar areas during scene classification: Effects of task and image type
Authors:
Romy Müller,
Marcel Dürschmidt,
Julian Ullrich,
Carsten Knoll,
Sascha Weber,
Steffen Seitz
Abstract:
Deep Learning models like Convolutional Neural Networks (CNN) are powerful image classifiers, but what factors determine whether they attend to similar image areas as humans do? While previous studies have focused on technological factors, little is known about the role of factors that affect human attention. In the present study, we investigated how the tasks used to elicit human attention maps interact with image characteristics in modulating the similarity between humans and CNN. We varied the intentionality of human tasks, ranging from spontaneous gaze during categorization over intentional gaze-pointing up to manual area selection. Moreover, we varied the type of image to be categorized, using either singular, salient objects, indoor scenes consisting of object arrangements, or landscapes without distinct objects defining the category. The human attention maps generated in this way were compared to the CNN attention maps revealed by explainable artificial intelligence (Grad-CAM). The influence of human tasks strongly depended on image type: For objects, human manual selection produced maps that were most similar to CNN, while the specific eye movement task had little impact. For indoor scenes, spontaneous gaze produced the least similarity, while for landscapes, similarity was equally low across all human tasks. To better understand these results, we also compared the different human attention maps to each other. Our results highlight the importance of taking human factors into account when comparing the attention of humans and CNN.
Submitted 15 October, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Multi-Task Consistency for Active Learning
Authors:
Aral Hekimoglu,
Philipp Friedrich,
Walter Zimmer,
Michael Schmidt,
Alvaro Marcos-Ramiro,
Alois C. Knoll
Abstract:
Learning-based solutions for vision tasks require a large amount of labeled training data to ensure their performance and reliability. In single-task vision-based settings, inconsistency-based active learning has proven to be effective in selecting informative samples for annotation. However, there is a lack of research exploiting the inconsistency between multiple tasks in multi-task networks. To address this gap, we propose a novel multi-task active learning strategy for two coupled vision tasks: object detection and semantic segmentation. Our approach leverages the inconsistency between them to identify informative samples across both tasks. We propose three constraints that specify how the tasks are coupled and introduce a method for determining the pixels belonging to the object detected by a bounding box, to later quantify the constraints as inconsistency scores. To evaluate the effectiveness of our approach, we establish multiple baselines for multi-task active learning and introduce a new metric, mean Detection Segmentation Quality (mDSQ), tailored for the multi-task active learning comparison that addresses the performance of both tasks. We conduct extensive experiments on the nuImages and A9 datasets, demonstrating that our approach outperforms existing state-of-the-art methods by up to 3.4% mDSQ on nuImages. Our approach achieves 95% of the fully-trained performance using only 67% of the available data, corresponding to 20% fewer labels compared to random selection and 5% fewer labels compared to state-of-the-art selection strategy. Our code will be made publicly available after the review process.
Submitted 21 June, 2023;
originally announced June 2023.
-
A9 Intersection Dataset: All You Need for Urban 3D Camera-LiDAR Roadside Perception
Authors:
Walter Zimmer,
Christian Creß,
Huu Tung Nguyen,
Alois C. Knoll
Abstract:
Intelligent Transportation Systems (ITS) allow a drastic expansion of the visibility range and decrease occlusions for autonomous driving. To obtain accurate detections, detailed labeled sensor data for training is required. Unfortunately, high-quality 3D labels of LiDAR point clouds from the infrastructure perspective of an intersection are still rare. Therefore, we provide the A9 Intersection Dataset, which consists of labeled LiDAR point clouds and synchronized camera images. Here, we recorded the sensor output from two roadside cameras and LiDARs mounted on intersection gantry bridges. The point clouds were labeled in 3D by experienced annotators. Furthermore, we provide calibration data between all sensors, which allow the projection of the 3D labels into the camera images and an accurate data fusion. Our dataset consists of 4.8k images and point clouds with more than 57.4k manually labeled 3D boxes. With ten object classes, it has a high diversity of road users in complex driving maneuvers, such as left and right turns, overtaking, and U-turns. In experiments, we provided multiple baselines for the perception tasks. Overall, our dataset is a valuable contribution to the scientific community to perform complex 3D camera-LiDAR roadside perception tasks. Find data, code, and more information at https://a9-dataset.com.
Submitted 15 June, 2023;
originally announced June 2023.
-
InfraDet3D: Multi-Modal 3D Object Detection based on Roadside Infrastructure Camera and LiDAR Sensors
Authors:
Walter Zimmer,
Joseph Birkner,
Marcel Brucker,
Huu Tung Nguyen,
Stefan Petrovski,
Bohan Wang,
Alois C. Knoll
Abstract:
Current multi-modal object detection approaches focus on the vehicle domain and are limited in the perception range and the processing capabilities. Roadside sensor units (RSUs) introduce a new domain for perception systems and leverage altitude to observe traffic. Cameras and LiDARs mounted on gantry bridges increase the perception range and produce a full digital twin of the traffic. In this work, we introduce InfraDet3D, a multi-modal 3D object detector for roadside infrastructure sensors. We fuse two LiDARs using early fusion and further incorporate detections from monocular cameras to increase the robustness and to detect small objects. Our monocular 3D detection module uses HD maps to ground object yaw hypotheses, improving the final perception results. The perception framework is deployed on a real-world intersection that is part of the A9 Test Stretch in Munich, Germany. We perform several ablation studies and experiments and show that fusing two LiDARs with two cameras leads to an improvement of +1.90 mAP compared to a camera-only solution. We evaluate our results on the A9 infrastructure dataset and achieve 68.48 mAP on the test set. The dataset and code will be available at https://a9-dataset.com to allow the research community to further improve the perception results and make autonomous driving safer.
Submitted 29 April, 2023;
originally announced May 2023.
-
What is the message? Perspectives on Visual Data Communication
Authors:
Laura Koesten,
Kathleen Gregory,
Regina Schuster,
Christian Knoll,
Sarah Davies,
Torsten Möller
Abstract:
Data visualizations are used to communicate messages to diverse audiences. It is unclear whether interpretations of these visualizations match the messages their creators aim to convey. In a mixed-methods study, we investigate how data in the popular science magazine Scientific American are visually communicated and understood. We first analyze visualizations about climate change and pandemics published in the magazine over a fifty-year period. Acting as chart readers, we then interpret visualizations with and without textual elements, identifying takeaway messages and creating field notes. Finally, we compare a sample of our interpreted messages to the intended messages of chart producers, drawing on interviews conducted with magazine staff. These data allow us to explore understanding visualizations through three perspectives: that of the charts, visualization readers, and visualization producers. Building on our findings from a thematic analysis, we present in-depth insights into data visualization sensemaking, particularly regarding the role of messages and textual elements; we propose a message typology, and we consider more broadly how messages can be conceptualized and understood.
Submitted 12 April, 2023;
originally announced April 2023.
-
Self-attention for Enhanced OAMP Detection in MIMO Systems
Authors:
Alexander Fuchs,
Christian Knoll,
Nima N. Moghadam,
Alexey Pak,
**liang Huang,
Erik Leitinger,
Franz Pernkopf
Abstract:
Multiple-Input Multiple-Output (MIMO) systems are essential for wireless communications. Since classical algorithms for symbol detection in MIMO setups require large computational resources or provide poor results, data-driven algorithms are becoming more popular. Most of the proposed algorithms, however, introduce approximations leading to degraded performance for realistic MIMO systems. In this paper, we introduce a neural-enhanced hybrid model, augmenting the analytic backbone algorithm with state-of-the-art neural network components. In particular, we introduce a self-attention model for the enhancement of the iterative Orthogonal Approximate Message Passing (OAMP)-based decoding algorithm. In our experiments, we show that the proposed model can outperform existing data-driven approaches for OAMP while having improved generalization to other SNR values at limited computational overhead.
Submitted 14 March, 2023;
originally announced March 2023.
-
Understanding the Behavior of Belief Propagation
Authors:
Christian Knoll
Abstract:
Probabilistic graphical models are a powerful concept for modeling high-dimensional distributions. Besides modeling distributions, probabilistic graphical models also provide an elegant framework for performing statistical inference; because of the high-dimensional nature, however, one must often use approximate methods for this purpose. Belief propagation performs approximate inference, is efficient, and has a long history of success. Yet, in most cases, belief propagation lacks any performance and convergence guarantees. Many realistic problems, however, are presented by graphical models with loops, in which case belief propagation is guaranteed neither to provide accurate estimates nor to converge at all. This thesis investigates how the model parameters influence the performance of belief propagation. We are particularly interested in their influence on (i) the number of fixed points, (ii) the convergence properties, and (iii) the approximation quality.
Submitted 5 September, 2022;
originally announced September 2022.
-
Sim-to-Real Transfer of Robotic Assembly with Visual Inputs Using CycleGAN and Force Control
Authors:
Chengjie Yuan,
Yunlei Shi,
Qian Feng,
Chunyang Chang,
Zhaopeng Chen,
Alois Christian Knoll,
Jianwei Zhang
Abstract:
Recently, deep reinforcement learning (RL) has shown some impressive successes in robotic manipulation applications. However, training robots in the real world is nontrivial owing to sample efficiency and safety concerns. Sim-to-real transfer is proposed to address the aforementioned concerns but introduces a new issue called the reality gap. In this work, we introduce a sim-to-real learning framework for vision-based assembly tasks and perform training in a simulated environment by employing inputs from a single camera to address the aforementioned issues. We present a domain adaptation method based on cycle-consistent generative adversarial networks (CycleGAN) and a force control transfer approach to bridge the reality gap. We demonstrate that the proposed framework trained in a simulated environment can be successfully transferred to a real peg-in-hole setup.
Submitted 30 August, 2022;
originally announced August 2022.
-
Real-Time And Robust 3D Object Detection with Roadside LiDARs
Authors:
Walter Zimmer,
Jialong Wu,
Xingcheng Zhou,
Alois C. Knoll
Abstract:
This work aims to address the challenges in autonomous driving by focusing on the 3D perception of the environment using roadside LiDARs. We design a 3D object detection model that can detect traffic participants in roadside LiDARs in real-time. Our model uses an existing 3D detector as a baseline and improves its accuracy. To prove the effectiveness of our proposed modules, we train and evaluate the model on three different vehicle and infrastructure datasets. To show the domain adaptation ability of our detector, we train it on an infrastructure dataset from China and perform transfer learning on a different dataset recorded in Germany. We do several sets of experiments and ablation studies for each module in the detector that show that our model outperforms the baseline by a significant margin, while the inference speed is at 45 Hz (22 ms). We make a significant contribution with our LiDAR-based 3D detector that can be used for smart city applications to provide connected and automated vehicles with a far-reaching view. Vehicles that are connected to the roadside sensors can get information about other vehicles around the corner to improve their path and maneuver planning and to increase road traffic safety.
Submitted 11 July, 2022;
originally announced July 2022.
-
Active Bayesian Causal Inference
Authors:
Christian Toth,
Lars Lorch,
Christian Knoll,
Andreas Krause,
Franz Pernkopf,
Robert Peharz,
Julius von Kügelgen
Abstract:
Causal discovery and causal reasoning are classically treated as separate and consecutive tasks: one first infers the causal graph, and then uses it to estimate causal effects of interventions. However, such a two-stage approach is uneconomical, especially in terms of actively collected interventional data, since the causal query of interest may not require a fully-specified causal model. From a Bayesian perspective, it is also unnatural, since a causal query (e.g., the causal graph or some causal effect) can be viewed as a latent quantity subject to posterior inference -- other unobserved quantities that are not of direct interest (e.g., the full causal model) ought to be marginalized out in this process and contribute to our epistemic uncertainty. In this work, we propose Active Bayesian Causal Inference (ABCI), a fully-Bayesian active learning framework for integrated causal discovery and reasoning, which jointly infers a posterior over causal models and queries of interest. In our approach to ABCI, we focus on the class of causally-sufficient, nonlinear additive noise models, which we model using Gaussian processes. We sequentially design experiments that are maximally informative about our target causal query, collect the corresponding interventional data, and update our beliefs to choose the next experiment. Through simulations, we demonstrate that our approach is more data-efficient than several baselines that only focus on learning the full causal graph. This allows us to accurately learn downstream causal queries from fewer samples while providing well-calibrated uncertainty estimates for the quantities of interest.
Submitted 15 October, 2022; v1 submitted 4 June, 2022;
originally announced June 2022.
-
Intelligent Transportation Systems Using External Infrastructure: A Literature Survey
Authors:
Christian Creß,
Zhenshan Bing,
Alois C. Knoll
Abstract:
The main problems in transportation are accidents, increasingly slow traffic flow, and pollution. An intelligent transportation system (ITS) using external infrastructure can overcome these problems. For this reason, the number of such systems is increasing dramatically, and therefore requires an adequate overview. To the best of our knowledge, no current systematic review of existing ITS solutions exists. To fill this knowledge gap, our paper provides an overview of existing ITS that use external infrastructure worldwide. Accordingly, this paper addresses current questions and challenges. For this purpose, we performed a literature review of documents that describe existing ITS solutions from 2009 until today. We categorized the results according to technology levels and analyzed its hardware system setup and value-added contributions. In doing so, we made the ITS solutions comparable and highlighted past development alongside current trends. We analyzed more than 357 papers, including 52 test bed projects. In summary, current ITSs can deliver accurate information about individuals in traffic situations in real-time. However, further research into ITS should focus on more reliable perception of the traffic using modern sensors, plug-and-play mechanisms, and secure real-time distribution of the digital twins in a decentralized manner. By addressing these topics, the development of intelligent transportation systems will be able to take a step towards its comprehensive roll-out.
Submitted 25 August, 2022; v1 submitted 10 December, 2021;
originally announced December 2021.
-
Distribution Mismatch Correction for Improved Robustness in Deep Neural Networks
Authors:
Alexander Fuchs,
Christian Knoll,
Franz Pernkopf
Abstract:
Deep neural networks rely heavily on normalization methods to improve their performance and learning behavior. Although normalization methods spurred the development of increasingly deep and efficient architectures, they also increase the vulnerability with respect to noise and input corruptions. In most applications, however, noise is ubiquitous and diverse; this can often lead to complete failure of machine learning systems as they fail to cope with mismatches between the input distribution during training- and test-time. The most common normalization method, batch normalization, reduces the distribution shift during training but is agnostic to changes in the input distribution during test time. This makes batch normalization prone to performance degradation whenever noise is present during test-time. Sample-based normalization methods can correct linear transformations of the activation distribution but cannot mitigate changes in the distribution shape; this makes the network vulnerable to distribution changes that cannot be reflected in the normalization parameters. We propose an unsupervised non-parametric distribution correction method that adapts the activation distribution of each layer. This reduces the mismatch between the training and test-time distribution by minimizing the 1-D Wasserstein distance. In our experiments, we empirically show that the proposed method effectively reduces the impact of intense image corruptions and thus improves the classification performance without the need for retraining or fine-tuning the model.
Submitted 5 October, 2021;
originally announced October 2021.
-
Einstein-Dirac-Maxwell wormholes: ansatz, construction and properties of symmetric solutions
Authors:
Jose Luis Blázquez-Salcedo,
Christian Knoll,
E. Radu
Abstract:
We present a discussion of the traversable wormholes in Einstein-Dirac-Maxwell theory recently reported in e-Print: 2010.07317. This includes a detailed description of the ansatz and junction condition, together with an investigation of the domain of existence of the solutions. In this study, we assume symmetry under interchange of the two asymptotically flat regions of a wormhole. Possible issues and limitations of the approach are also discussed.
Submitted 27 August, 2021;
originally announced August 2021.
-
An Empirical Study of UMLS Concept Extraction from Clinical Notes using Boolean Combination Ensembles
Authors:
Greg M. Silverman,
Raymond L. Finzel,
Michael V. Heinz,
Jake Vasilakes,
Jacob C. Solinsky,
Reed McEwan,
Benjamin C. Knoll,
Christopher J. Tignanelli,
Hongfang Liu,
Hua Xu,
Xiaoqian Jiang,
Genevieve B. Melton,
Serguei VS Pakhomov
Abstract:
Our objective in this study is to investigate the behavior of Boolean operators on combining annotation output from multiple Natural Language Processing (NLP) systems across multiple corpora and to assess how filtering by aggregation of Unified Medical Language System (UMLS) Metathesaurus concepts affects system performance for Named Entity Recognition (NER) of UMLS concepts. We used three corpora annotated for UMLS concepts: 2010 i2b2 VA challenge set (31,161 annotations), Multi-source Integrated Platform for Answering Clinical Questions (MiPACQ) corpus (17,457 annotations including UMLS concept unique identifiers), and Fairview Health Services corpus (44,530 annotations). Our results showed that for UMLS concept matching, Boolean ensembling of the MiPACQ corpus trended towards higher performance over individual systems. Use of an approximate grid-search can help optimize the precision-recall tradeoff and can provide a set of heuristics for choosing an optimal set of ensembles.
Submitted 4 August, 2021;
originally announced August 2021.
-
Traversable wormholes in Einstein-Dirac-Maxwell theory
Authors:
Jose Luis Blázquez-Salcedo,
Christian Knoll,
Eugen Radu
Abstract:
We construct a specific example of a class of traversable wormholes in Einstein-Dirac-Maxwell theory in four spacetime dimensions, without needing any form of exotic matter. Restricting to a model with two massive fermions in a singlet spinor state, we show the existence of spherically symmetric asymptotically flat configurations which are free of singularities, representing localized states. These solutions satisfy a generalized Smarr relation, being connected with the extremal Reissner-Nordström black holes. They also possess a finite mass $M$ and electric charge $Q_e$, with $Q_e/M>1$. An exact wormhole solution with ungauged, massless fermions is also reported.
Submitted 12 March, 2021; v1 submitted 14 October, 2020;
originally announced October 2020.
-
Constructing spherically symmetric Einstein-Dirac systems with multiple spinors: Ansatz, wormholes and other analytical solutions
Authors:
Jose Luis Blázquez-Salcedo,
Christian Knoll
Abstract:
In this paper we present a detailed calculation of an Ansatz that allows one to obtain spherically symmetric Einstein-Dirac configurations in $d$ dimensions. We show that this is possible by combining $2^{\lfloor \frac{d-2}{2} \rfloor}$ Dirac fields, making use of the properties of the angular dependence of the spinors in a spherical background. By applying this Ansatz, we investigate some simple analytical solutions. One of them is a regular wormhole supported by the Dirac fields. Other solutions include a pathological black hole and a naked singularity. We analyze the domain of existence and properties of all these solutions.
Submitted 25 February, 2020; v1 submitted 8 October, 2019;
originally announced October 2019.
-
Boson and Dirac stars in $D\geq 4$ dimensions
Authors:
Jose Luis Blázquez-Salcedo,
Christian Knoll,
Eugen Radu
Abstract:
We present a comparative study of spherically symmetric, localized, particle-like solutions for spin $s=0,1/2$ and $1$ gravitating fields in a $D$-dimensional, asymptotically flat spacetime. These fields are massive, possessing a harmonic time dependence and no self-interaction. Special attention is paid to the mathematical similarities and physical differences between the bosonic and fermionic cases. We find that the generic pattern of solutions is similar for any value of the spin $s$, depending only on the dimensionality of spacetime, the cases $D=4,5$ being special.
Submitted 15 February, 2019;
originally announced February 2019.
-
Self-Guided Belief Propagation -- A Homotopy Continuation Method
Authors:
Christian Knoll,
Adrian Weller,
Franz Pernkopf
Abstract:
Belief propagation (BP) is a popular method for performing probabilistic inference on graphical models. In this work, we enhance BP and propose self-guided belief propagation (SBP) that incorporates the pairwise potentials only gradually. This homotopy continuation method converges to a unique solution and increases the accuracy without increasing the computational burden. We provide a formal analysis to demonstrate that SBP finds the global optimum of the Bethe approximation for attractive models where all variables favor the same state. Moreover, we apply SBP to various graphs with random potentials and empirically show that: (i) SBP is superior in terms of accuracy whenever BP converges, and (ii) SBP obtains a unique, stable, and accurate solution whenever BP does not converge.
Submitted 19 March, 2021; v1 submitted 4 December, 2018;
originally announced December 2018.
-
Quasinormal modes of Dirac spinors in the background of rotating black holes in four and five dimensions
Authors:
Jose Luis Blázquez-Salcedo,
Christian Knoll
Abstract:
We study the quasinormal modes of massive Dirac spinors in the background of rotating black holes. In particular, we consider the Kerr geometry as well as the five dimensional Myers-Perry spacetime with equal angular momenta. We decouple the equations using the standard methods from the literature. In the five dimensional Myers-Perry black hole the angular equation is solved analytically. Using the continued fraction method, we calculate the spectrum of quasinormal modes for the ground modes and first excited modes. We analyze, in a systematic way, its dependence on the different parameters of the black hole and fermionic field. We compare our values with previous results available in the literature for Kerr and for the static limit. The numerical results show several differences between the four and five dimensional cases. For instance, in five dimensions the symmetry between the positive and negative (real) frequency of the modes breaks down, which results in a richer spectrum.
Submitted 30 November, 2018; v1 submitted 5 November, 2018;
originally announced November 2018.
-
Solutions of the massive Dirac equation in the near-horizon metric of the extremal five dimensional Myers-Perry black hole with equal angular momenta
Authors:
Jose Luis Blázquez-Salcedo,
Christian Knoll
Abstract:
We study massive Dirac fields in the background of the near-horizon limit of the extremal Myers-Perry black hole in five dimensions. We consider the case in which both angular momenta have equal magnitude. The resulting Dirac equation can be decoupled into an angular and a radial part. The solution of the angular part results in some algebraic relations that determine completely the angular quantum numbers of the fermionic field. The radial part can be analytically solved in terms of special functions, which allow us to analyze the near-horizon radial current of the Dirac field.
Submitted 1 August, 2018;
originally announced August 2018.
-
Slowly damped quasinormal modes of the massive Dirac field in d-dimensional Tangherlini spacetime
Authors:
Jose Luis Blázquez-Salcedo,
Christian Knoll
Abstract:
We consider quasinormal modes of the massive Dirac field in the background of a Schwarzschild-Tangherlini black hole. Different dimensions of the spacetime are considered, from d = 4 to d = 9. The quasinormal modes are calculated using two independent methods: WKB and continued fraction. We obtain the spectrum of quasinormal modes for different values of the overtone number and angular quantum number. An analytical approximation of the spectrum valid in the case of large values of the angular quantum number and mass is calculated. Although we don't find unstable modes in the spectrum, we show that for large values of the mass, the quasinormal modes can become very slowly damped, giving rise to quasistationary perturbations.
Submitted 15 February, 2018; v1 submitted 22 September, 2017;
originally announced September 2017.
-
Fixed Points of Belief Propagation -- An Analysis via Polynomial Homotopy Continuation
Authors:
Christian Knoll,
Franz Pernkopf,
Dhagash Mehta,
Tianran Chen
Abstract:
Belief propagation (BP) is an iterative method to perform approximate inference on arbitrary graphical models. Whether BP converges and if the solution is a unique fixed point depends on both the structure and the parametrization of the model. To understand this dependence it is interesting to find \emph{all} fixed points. In this work, we formulate a set of polynomial equations, the solutions of which correspond to BP fixed points. To solve such a nonlinear system we present the numerical polynomial-homotopy-continuation (NPHC) method. Experiments on binary Ising models and on error-correcting codes show how our method is capable of obtaining all BP fixed points. On Ising models with fixed parameters we show how the structure influences both the number of fixed points and the convergence properties. We further assess the accuracy of the marginals and weighted combinations thereof. Weighting marginals with their respective partition function increases the accuracy in all experiments. Contrary to the conjecture that uniqueness of BP fixed points implies convergence, we find graphs for which BP fails to converge, even though a unique fixed point exists. Moreover, we show that this fixed point gives a good approximation, and the NPHC method is able to obtain this fixed point.
Submitted 30 May, 2017; v1 submitted 20 May, 2016;
originally announced May 2016.
-
Scalability in Neural Control of Musculoskeletal Robots
Authors:
Christoph Richter,
Sören Jentzsch,
Rafael Hostettler,
Jesús A. Garrido,
Eduardo Ros,
Alois C. Knoll,
Florian Röhrbein,
Patrick van der Smagt,
Jörg Conradt
Abstract:
Anthropomimetic robots are robots that sense, behave, interact and feel like humans. By this definition, anthropomimetic robots require human-like physical hardware and actuation, but also brain-like control and sensing. The most self-evident realization to meet those requirements would be a human-like musculoskeletal robot with a brain-like neural controller. While both musculoskeletal robotic hardware and neural control software have existed for decades, a scalable approach that could be used to build and control an anthropomimetic human-scale robot has not been demonstrated yet. Combining Myorobotics, a framework for musculoskeletal robot development, with SpiNNaker, a neuromorphic computing platform, we present the proof-of-principle of a system that can scale to dozens of neurally-controlled, physically compliant joints. At its core, it implements a closed-loop cerebellar model which provides real-time low-level neural control at minimal power consumption and maximal extensibility: higher-order (e.g., cortical) neural networks and neuromorphic sensors like silicon-retinae or -cochleae can naturally be incorporated.
Submitted 19 January, 2016;
originally announced January 2016.
-
Charged rotating dilaton black holes with Kaluza-Klein asymptotics
Authors:
Christian Knoll,
Petya Nedkova
Abstract:
We construct a class of stationary and axisymmetric solutions to the 5D Einstein-Maxwell-dilaton gravity, which describe configurations of charged rotating black objects with Kaluza-Klein asymptotics. The solutions are constructed by uplifting a vacuum seed solution to six dimensions, performing a boost, and a subsequent circle reduction. We investigate the physical properties of the charged solutions, and obtain their general relations to the properties of the vacuum seed. We also derive the gyromagnetic ratio and the Smarr-like relations. As particular cases we study three solutions, which describe a charged rotating black string, a charged rotating black ring on Kaluza-Klein bubbles, and a superposition of two black holes and a Kaluza-Klein bubble.
Submitted 4 December, 2015;
originally announced December 2015.
-
Cepstral Analysis of Random Variables: Muculants
Authors:
Christian Knoll,
Bernhard C. Geiger,
Gernot Kubin
Abstract:
An alternative parametric description for discrete random variables, called muculants, is proposed. In contrast to cumulants, muculants are based on the Fourier series expansion, rather than on the Taylor series expansion, of the logarithm of the characteristic function. We utilize results from cepstral theory to derive elementary properties of muculants, some of which demonstrate behavior superior to that of cumulants. For example, muculants and cumulants are both additive. While the existence of cumulants is linked to how often the characteristic function is differentiable, all muculants exist if the characteristic function satisfies a Paley-Wiener condition. Moreover, the muculant sequence and, if the random variable has finite expectation, the reconstruction of the characteristic function from its muculants converge. We furthermore develop a connection between muculants and cumulants and present the muculants of selected discrete random variables. Specifically, it is shown that the Poisson distribution is the only distribution where only the first two muculants are nonzero.
Submitted 13 November, 2017; v1 submitted 15 June, 2015;
originally announced June 2015.
-
Using a computer algebra system to simplify expressions for Titchmarsh-Weyl m-functions associated with the Hydrogen Atom on the half line
Authors:
Cecilia Knoll,
Charles Fulton
Abstract:
In this paper we give simplified formulas for certain polynomials which arise in some new Titchmarsh-Weyl m-functions for the radial part of the separated Hydrogen atom on the half line and two independent programs for generating them using the symbolic manipulator Mathematica.
Submitted 29 December, 2008;
originally announced December 2008.