
Showing 1–29 of 29 results for author: Bauer, D

Searching in archive cs.
  1. arXiv:2406.08474  [pdf, other]

    cs.CV cs.AI cs.LG

    Real2Code: Reconstruct Articulated Objects via Code Generation

    Authors: Zhao Mandi, Yijia Weng, Dominik Bauer, Shuran Song

    Abstract: We present Real2Code, a novel approach to reconstructing articulated objects via code generation. Given visual observations of an object, we first reconstruct its part geometry using an image segmentation model and a shape completion model. We then represent the object parts with oriented bounding boxes, which are input to a fine-tuned large language model (LLM) to predict joint articulation as co…

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2404.12524  [pdf, other]

    cs.CV cs.LG cs.RO

    DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

    Authors: Dominik Bauer, Zhenjia Xu, Shuran Song

    Abstract: Manipulation of elastoplastic objects like dough often involves topological changes such as splitting and merging. The ability to accurately predict these topological changes that a specific action might incur is critical for planning interactions with elastoplastic objects. We present DoughNet, a Transformer-based architecture for handling these challenges, consisting of two components. First, a…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Under review. 17 pages, 14 figures

  3. arXiv:2404.08371  [pdf, other]

    cs.CE

    Code Generation and Performance Engineering for Matrix-Free Finite Element Methods on Hybrid Tetrahedral Grids

    Authors: Fabian Böhm, Daniel Bauer, Nils Kohl, Christie Alappat, Dominik Thönnes, Marcus Mohr, Harald Köstler, Ulrich Rüde

    Abstract: This paper introduces a code generator designed for node-level optimized, extreme-scalable, matrix-free finite element operators on hybrid tetrahedral grids. It optimizes the local evaluation of bilinear forms through various techniques including tabulation, relocation of loop invariants, and inter-element vectorization - implemented as transformations of an abstract syntax tree. A key contributio…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 22 pages

    MSC Class: 65F50; 65N30; 65N55; 65Y20; 65F10

  4. arXiv:2308.01792  [pdf, other]

    cs.CE

    Fundamental Data Structures for Matrix-Free Finite Elements on Hybrid Tetrahedral Grids

    Authors: Nils Kohl, Daniel Bauer, Fabian Böhm, Ulrich Rüde

    Abstract: This paper presents efficient data structures for the implementation of matrix-free finite element methods on block-structured, hybrid tetrahedral grids. It provides a complete categorization of all geometric sub-objects that emerge from the regular refinement of the unstructured, tetrahedral coarse grid and describes efficient iteration patterns and analytical linearization functions for the mapp…

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 21 pages

  5. arXiv:2307.15671  [pdf, other]

    cs.CV cs.RO

    TrackAgent: 6D Object Tracking via Reinforcement Learning

    Authors: Konstantin Röhrl, Dominik Bauer, Timothy Patten, Markus Vincze

    Abstract: Tracking an object's 6D pose, while either the object itself or the observing camera is moving, is important for many robotics and augmented reality applications. While exploiting temporal priors eases this problem, object-specific knowledge is required to recover when tracking is lost. Under the tight time constraints of the tracking task, RGB(D)-based methods are often conceptionally complex or…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: International Conference on Computer Vision Systems (ICVS) 2023

  6. arXiv:2307.12172  [pdf, ps, other]

    cs.RO cs.CV

    Challenges for Monocular 6D Object Pose Estimation in Robotics

    Authors: Stefan Thalhammer, Dominik Bauer, Peter Hönig, Jean-Baptiste Weibel, José García-Rodríguez, Markus Vincze

    Abstract: Object pose estimation is a core perception task that enables, for example, object grasping and scene understanding. The widely available, inexpensive and high-resolution RGB sensors and CNNs that allow for fast inference based on this modality make monocular approaches especially well suited for robotics applications. We observe that previous surveys on object pose estimation establish the state…

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.11827

  7. arXiv:2306.09922  [pdf, ps, other]

    cs.RO cs.AI cs.CL cs.LG

    Learning to Summarize and Answer Questions about a Virtual Robot's Past Actions

    Authors: Chad DeChant, Iretiayo Akinola, Daniel Bauer

    Abstract: When robots perform long action sequences, users will want to easily and reliably find out what they have done. We therefore demonstrate the task of learning to summarize and answer questions about a robot agent's past actions using natural language alone. A single system with a large language model at its core is trained to both summarize and answer questions about action sequences given ego-cent…

    Submitted 16 June, 2023; originally announced June 2023.

  8. arXiv:2306.04784  [pdf, other]

    cs.RO

    Designing Anthropomorphic Soft Hands through Interaction

    Authors: Pragna Mannam, Kenneth Shaw, Dominik Bauer, Jean Oh, Deepak Pathak, Nancy Pollard

    Abstract: Modeling and simulating soft robot hands can aid in design iteration for complex and high degree-of-freedom (DoF) morphologies. This can be further supplemented by iterating on the design based on its performance in real world manipulation tasks. However, iterating in the real world requires an approach that allows us to test new designs quickly at low costs. In this paper, we leverage rapid proto…

    Submitted 14 March, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Journal ref: 2023 IEEE-RAS 22nd International Conference on Humanoid Robots (Humanoids)

  9. arXiv:2304.07338  [pdf, other]

    cs.GR cs.LG

    Photon Field Networks for Dynamic Real-Time Volumetric Global Illumination

    Authors: David Bauer, Qi Wu, Kwan-Liu Ma

    Abstract: Volume data is commonly found in many scientific disciplines, like medicine, physics, and biology. Experts rely on robust scientific visualization techniques to extract valuable insights from the data. Recent years have shown path tracing to be the preferred approach for volumetric rendering, given its high levels of realism. However, real-time volumetric path tracing often suffers from stochastic…

    Submitted 14 April, 2023; originally announced April 2023.

  10. arXiv:2304.04188  [pdf, other]

    cs.GR cs.LG

    HyperINR: A Fast and Predictive Hypernetwork for Implicit Neural Representations via Knowledge Distillation

    Authors: Qi Wu, David Bauer, Yuyang Chen, Kwan-Liu Ma

    Abstract: Implicit Neural Representations (INRs) have recently exhibited immense potential in the field of scientific visualization for both data generation and visualization tasks. However, these representations often consist of large multi-layer perceptrons (MLPs), necessitating millions of operations for a single forward pass, consequently hindering interactive visual exploration. While reducing the size…

    Submitted 9 April, 2023; originally announced April 2023.

  11. arXiv:2210.07270  [pdf, other]

    cs.CL

    Multi-Task Learning for Joint Semantic Role and Proto-Role Labeling

    Authors: Aashish Arora, Harshitha Malireddi, Daniel Bauer, Asad Sayeed, Yuval Marton

    Abstract: We put forward an end-to-end multi-step machine learning model which jointly labels semantic roles and the proto-roles of Dowty (1991), given a sentence and the predicates therein. Our best architecture first learns argument spans followed by learning the argument's syntactic heads. This information is shared with the next steps for predicting the semantic roles and proto-roles. We also experiment…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 10 pages including references. 2 figures. First 2 authors contributed significantly

  12. arXiv:2209.09965  [pdf, other]

    cs.GR cs.LG

    FoVolNet: Fast Volume Rendering using Foveated Deep Neural Networks

    Authors: David Bauer, Qi Wu, Kwan-Liu Ma

    Abstract: Volume data is found in many important scientific and engineering applications. Rendering this data for visualization at high quality and interactive rates for demanding applications such as virtual reality is still not easily achievable even using professional-grade hardware. We introduce FoVolNet -- a method to significantly increase the performance of volume data visualization. We develop a cos…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: To appear at IEEE VIS 2022 and later TVCG

  13. arXiv:2208.04683  [pdf, other]

    cs.CY cs.AI cs.LG stat.AP

    Applying data technologies to combat AMR: current status, challenges, and opportunities on the way forward

    Authors: Leonid Chindelevitch, Elita Jauneikaite, Nicole E. Wheeler, Kasim Allel, Bede Yaw Ansiri-Asafoakaa, Wireko A. Awuah, Denis C. Bauer, Stephan Beisken, Kara Fan, Gary Grant, Michael Graz, Yara Khalaf, Veranja Liyanapathirana, Carlos Montefusco-Pereira, Lawrence Mugisha, Atharv Naik, Sylvia Nanono, Anthony Nguyen, Timothy Rawson, Kessendri Reddy, Juliana M. Ruzante, Anneke Schmider, Roman Stocker, Leonhardt Unruh, Daniel Waruingi, et al. (2 additional authors not shown)

    Abstract: Antimicrobial resistance (AMR) is a growing public health threat, estimated to cause over 10 million deaths per year and cost the global economy 100 trillion USD by 2050 under status quo projections. These losses would mainly result from an increase in the morbidity and mortality from treatment failure, AMR infections during medical procedures, and a loss of quality of life attributed to AMR. Nume…

    Submitted 11 August, 2022; v1 submitted 5 July, 2022; originally announced August 2022.

    Comments: 65 pages, 3 figures

    ACM Class: I.2.1; J.3

  14. arXiv:2207.11620  [pdf, other]

    cs.GR cs.LG

    Interactive Volume Visualization via Multi-Resolution Hash Encoding based Neural Representation

    Authors: Qi Wu, David Bauer, Michael J. Doyle, Kwan-Liu Ma

    Abstract: Neural networks have shown great potential in compressing volume data for visualization. However, due to the high cost of training and inference, such volumetric neural representations have thus far only been applied to offline data processing and non-interactive rendering. In this paper, we demonstrate that by simultaneously leveraging modern GPU tensor cores, a native CUDA neural network framewo…

    Submitted 29 June, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

    Comments: There is a supplementary video for this manuscript, which can be accessed via this link: https://drive.google.com/file/d/17wSgIm_VsoeGhfyZwMpOnCYy2Mj3ydGv/view?usp=sharing

  15. arXiv:2203.06671  [pdf, other]

    cs.CL cs.RO

    Summarizing a virtual robot's past actions in natural language

    Authors: Chad DeChant, Daniel Bauer

    Abstract: We propose and demonstrate the task of giving natural language summaries of the actions of a robotic agent in a virtual environment. We explain why such a task is important, what makes it difficult, and discuss how it might be addressed. To encourage others to work on this, we show how a popular existing dataset that matches robot actions with natural language descriptions designed for an instruct…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: 12 pages, 3 figures

  16. arXiv:2201.05230  [pdf, other]

    cs.CL cs.CY

    NLP in Human Rights Research -- Extracting Knowledge Graphs About Police and Army Units and Their Commanders

    Authors: Daniel Bauer, Tom Longley, Yueen Ma, Tony Wilson

    Abstract: In this working paper we explore the use of an NLP system to assist the work of Security Force Monitor (SFM). SFM creates data about the organizational structure, command personnel and operations of police, army and other security forces, which assists human rights researchers, journalists and litigators in their work to help identify and bring to account specific units and personnel alleged to ha…

    Submitted 13 January, 2022; originally announced January 2022.

    Comments: Equal contributions. for associated text corpus see https://github.com/security-force-monitor/nlp_starter_dataset

  17. arXiv:2201.00239  [pdf, other]

    cs.CV

    SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

    Authors: Dominik Bauer, Timothy Patten, Markus Vincze

    Abstract: Observational noise, inaccurate segmentation and ambiguity due to symmetry and occlusion lead to inaccurate object pose estimates. While depth- and RGB-based pose refinement approaches increase the accuracy of the resulting pose estimates, they are susceptible to ambiguity in the observation as they consider visual alignment. We propose to leverage the fact that we often observe static, rigid scen…

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2022

  18. arXiv:2112.10272  [pdf, other]

    cs.HC

    A Multi-Layout Design for Immersive Visualization of Network Data

    Authors: David Bauer, Chengbo Zheng, Oh-Hyun Kwon, Kwan-Liu Ma

    Abstract: Visualization plays a vital role in making sense of complex network data. Recent studies have shown the potential of using extended reality (XR) for the immersive exploration of networks. The additional depth cues offered by XR help users perform better in certain tasks when compared to using traditional desktop setups. However, prior works on immersive network visualization rely on mostly static…

    Submitted 26 January, 2023; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: 13 pages, 6 figures, this manuscript is currently under revision

  19. arXiv:2111.01869  [pdf, other]

    cs.RO

    Towards Very Low-Cost Iterative Prototyping for Fully Printable Dexterous Soft Robotic Hands

    Authors: Dominik Bauer, Cornelia Bauer, Arjun Lakshmipathy, Roberto Shu, Nancy S. Pollard

    Abstract: The design and fabrication of soft robot hands is still a time-consuming and difficult process. Advances in rapid prototyping have accelerated the fabrication process significantly while introducing new complexities into the design process. In this work, we present an approach that utilizes novel low-cost fabrication techniques in conjunction with design tools helping soft hand designers to system…

    Submitted 16 April, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

  20. arXiv:2110.15532  [pdf, other]

    cs.RO

    Contact Transfer: A Direct, User-Driven Method for Human to Robot Transfer of Grasps and Manipulations

    Authors: Arjun Lakshmipathy, Dominik Bauer, Cornelia Bauer, Nancy S. Pollard

    Abstract: We present a novel method for the direct transfer of grasps and manipulations between objects and hands through utilization of contact areas. Our method fully preserves contact shapes, and in contrast to existing techniques, is not dependent on grasp families, requires no model training or grasp sampling, makes no assumptions about manipulator morphology or kinematics, and allows user control over…

    Submitted 1 June, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

  21. arXiv:2109.09500  [pdf, other]

    stat.ML cs.LG

    Deep Learning-Based Estimation and Goodness-of-Fit for Large-Scale Confirmatory Item Factor Analysis

    Authors: Christopher J. Urban, Daniel J. Bauer

    Abstract: We investigate novel parameter estimation and goodness-of-fit (GOF) assessment methods for large-scale confirmatory item factor analysis (IFA) with many respondents, items, and latent factors. For parameter estimation, we extend Urban and Bauer's (2021) deep learning algorithm for exploratory IFA to the confirmatory setting by showing how to handle constraints on loadings and factor correlations.…

    Submitted 15 March, 2023; v1 submitted 20 September, 2021; originally announced September 2021.

  22. arXiv:2103.15231  [pdf, other]

    cs.CV

    ReAgent: Point Cloud Registration using Imitation and Reinforcement Learning

    Authors: Dominik Bauer, Timothy Patten, Markus Vincze

    Abstract: Point cloud registration is a common step in many 3D computer vision tasks such as object pose estimation, where a 3D model is aligned to an observation. Classical registration methods generalize well to novel domains but fail when given a noisy observation or a bad initialization. Learning-based methods, in contrast, are more robust but lack in generalization capacity. We propose to consider iter…

    Submitted 28 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR 2021

  23. arXiv:2012.02111  [pdf, other]

    cs.RO cs.LG

    Deep Inverse Sensor Models as Priors for evidential Occupancy Mapping

    Authors: Daniel Bauer, Lars Kuhnert, Lutz Eckstein

    Abstract: With the recent boost in autonomous driving, increased attention has been paid on radars as an input for occupancy mapping. Besides their many benefits, the inference of occupied space based on radar detections is notoriously difficult because of the data sparsity and the environment dependent noise (e.g. multipath reflections). Recently, deep learning-based inverse sensor models, from here on cal…

    Submitted 2 December, 2020; originally announced December 2020.

  24. arXiv:2001.07859  [pdf, other]

    stat.ME cs.LG stat.ML

    A Deep Learning Algorithm for High-Dimensional Exploratory Item Factor Analysis

    Authors: Christopher J. Urban, Daniel J. Bauer

    Abstract: Marginal maximum likelihood (MML) estimation is the preferred approach to fitting item response theory models in psychometrics due to the MML estimator's consistency, normality, and efficiency as the sample size tends to infinity. However, state-of-the-art MML estimation procedures such as the Metropolis-Hastings Robbins-Monro (MH-RM) algorithm as well as approximate MML estimation procedures such…

    Submitted 4 February, 2021; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: 30 pages; 12 figures; accepted for publication in Psychometrika

  25. arXiv:1909.05730  [pdf, other]

    cs.CV cs.RO

    VeREFINE: Integrating Object Pose Verification with Physics-guided Iterative Refinement

    Authors: Dominik Bauer, Timothy Patten, Markus Vincze

    Abstract: Accurate and robust object pose estimation for robotics applications requires verification and refinement steps. In this work, we propose to integrate hypotheses verification with object pose refinement guided by physics simulation. This allows the physical plausibility of individual object pose estimates and the stability of the estimated scene to be considered in a unified optimization. The prop…

    Submitted 18 May, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Revised version

  26. arXiv:1904.00842  [pdf, other]

    cs.CV cs.AI cs.RO

    Deep, spatially coherent Inverse Sensor Models with Uncertainty Incorporation using the evidential Framework

    Authors: Daniel Bauer, Lars Kuhnert, Lutz Eckstein

    Abstract: To perform high speed tasks, sensors of autonomous cars have to provide as much information in as few time steps as possible. However, radars, one of the sensor modalities autonomous cars heavily rely on, often only provide sparse, noisy detections. These have to be accumulated over time to reach a high enough confidence about the static parts of the environment. For radars, the state is typically…

    Submitted 29 March, 2019; originally announced April 2019.

    Comments: Submitted for Intelligent Vehicle Symposium 2019

  27. arXiv:1903.12467  [pdf, other]

    cs.RO cs.AI

    Deep, spatially coherent Occupancy Maps based on Radar Measurements

    Authors: Daniel Bauer, Lars Kuhnert, Lutz Eckstein

    Abstract: One essential step to realize modern driver assistance technology is the accurate knowledge about the location of static objects in the environment. In this work, we use artificial neural networks to predict the occupation state of a whole scene in an end-to-end manner. This stands in contrast to the traditional approach of accumulating each detection's influence on the occupancy state and allows…

    Submitted 29 March, 2019; originally announced March 2019.

    Comments: Submitted for Automotive Meets Electronics 2019

  28. Personalized and situation-aware multimodal route recommendations: the FAVOUR algorithm

    Authors: Paolo Campigotto, Christian Rudloff, Maximilian Leodolter, Dietmar Bauer

    Abstract: Route choice in multimodal networks shows a considerable variation between different individuals as well as the current situational context. Personalization of recommendation algorithms are already common in many areas, e.g., online retail. However, most online routing applications still provide shortest distance or shortest travel-time routes only, neglecting individual preferences as well as the…

    Submitted 29 February, 2016; originally announced February 2016.

    Comments: 12 pages, 6 figures, 1 table. Submitted to IEEE Transactions on Intelligent Transportation Systems journal for publication

  29. arXiv:1207.1115  [pdf, other]

    stat.ML cs.LG physics.data-an physics.soc-ph

    Inferring land use from mobile phone activity

    Authors: Jameson L. Toole, Michael Ulm, Dietmar Bauer, Marta C. Gonzalez

    Abstract: Understanding the spatiotemporal distribution of people within a city is crucial to many planning applications. Obtaining data to create required knowledge, currently involves costly survey methods. At the same time ubiquitous mobile sensors from personal GPS devices to mobile phones are collecting massive amounts of data on urban systems. The locations, communications, and activities of millions…

    Submitted 3 July, 2012; originally announced July 2012.

    Comments: To be presented at ACM UrbComp2012

    ACM Class: H.2.8