Skip to main content

Showing 1–14 of 14 results for author: Herzog, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02477  [pdf, other

    eess.IV cs.CV cs.LG

    Inpainting Pathology in Lumbar Spine MRI with Latent Diffusion

    Authors: Colin Hansen, Simas Glinskis, Ashwin Raju, Micha Kornreich, **Hyeong Park, Jayashri Pawar, Richard Herzog, Li Zhang, Benjamin Odry

    Abstract: Data driven models for automated diagnosis in radiology suffer from insufficient and imbalanced datasets due to low representation of pathology in a population and the cost of expert annotations. Datasets can be bolstered through data augmentation. However, even when utilizing a full suite of transformations during model training, typical data augmentations do not address variations in human anato… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2405.20800  [pdf, other

    cs.LG cs.SC

    Shape Constraints in Symbolic Regression using Penalized Least Squares

    Authors: Viktor Martinek, Julia Reuter, Ophelia Frotscher, Sanaz Mostaghim, Markus Richter, Roland Herzog

    Abstract: We study the addition of shape constraints and their consideration during the parameter estimation step of symbolic regression (SR). Shape constraints serve as a means to introduce prior knowledge about the shape of the otherwise unknown model function into SR. Unlike previous works that have explored shape constraints in SR, we propose minimizing shape constraint violations during parameter estim… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  3. arXiv:2405.18896  [pdf, other

    cs.LG cs.SC

    Unit-Aware Genetic Programming for the Development of Empirical Equations

    Authors: Julia Reuter, Viktor Martinek, Roland Herzog, Sanaz Mostaghim

    Abstract: When develo** empirical equations, domain experts require these to be accurate and adhere to physical laws. Often, constants with unknown units need to be discovered alongside the equations. Traditional unit-aware genetic programming (GP) approaches cannot be used when unknown constants with undetermined units are included. This paper presents a method for dimensional analysis that propagates un… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Submitted to Conference Proceedings of PPSN2024

  4. arXiv:2311.16956  [pdf, other

    math.OC cs.LG

    Adaptive Step Sizes for Preconditioned Stochastic Gradient Descent

    Authors: Frederik Köhne, Leonie Kreis, Anton Schiela, Roland Herzog

    Abstract: This paper proposes a novel approach to adaptive step sizes in stochastic gradient descent (SGD) by utilizing quantities that we have identified as numerically traceable -- the Lipschitz constant for gradients and a concept of the local variance in search directions. Our findings yield a nearly hyperparameter-free algorithm for stochastic optimization, which has provable convergence properties whe… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  5. arXiv:2311.15995  [pdf, other

    cs.LG math.OC

    Sensitivity-Based Layer Insertion for Residual and Feedforward Neural Networks

    Authors: Evelyn Herberg, Roland Herzog, Frederik Köhne, Leonie Kreis, Anton Schiela

    Abstract: The training of neural networks requires tedious and often manual tuning of the network architecture. We propose a systematic method to insert new layers during the training process, which eliminates the need to choose a fixed network size before training. Our technique borrows techniques from constrained optimization and is based on first-order sensitivity information of the objective with respec… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  6. arXiv:2311.15419  [pdf, other

    cs.LG math.NA

    Frobenius-Type Norms and Inner Products of Matrices and Linear Maps with Applications to Neural Network Training

    Authors: Roland Herzog, Frederik Köhne, Leonie Kreis, Anton Schiela

    Abstract: The Frobenius norm is a frequent choice of norm for matrices. In particular, the underlying Frobenius inner product is typically used to evaluate the gradient of an objective with respect to matrix variable, such as those occuring in the training of neural networks. We provide a broader view on the Frobenius norm and inner product for linear maps or matrices, and establish their dependence on inne… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  7. arXiv:2309.02805  [pdf, other

    cs.LG physics.data-an

    Introducing Thermodynamics-Informed Symbolic Regression -- A Tool for Thermodynamic Equations of State Development

    Authors: Viktor Martinek, Ophelia Frotscher, Markus Richter, Roland Herzog

    Abstract: Thermodynamic equations of state (EOS) are essential for many industries as well as in academia. Even leaving aside the expensive and extensive measurement campaigns required for the data acquisition, the development of EOS is an intensely time-consuming process, which does often still heavily rely on expert knowledge and iterative fine-tuning. To improve upon and accelerate the EOS development pr… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  8. arXiv:2306.16111  [pdf, other

    cs.LG math.OC

    Time Regularization in Optimal Time Variable Learning

    Authors: Evelyn Herberg, Roland Herzog, Frederik Köhne

    Abstract: Recently, optimal time variable learning in deep neural networks (DNNs) was introduced in arXiv:2204.08528. In this manuscript we extend the concept by introducing a regularization term that directly relates to the time horizon in discrete dynamical systems. Furthermore, we propose an adaptive pruning approach for Residual Neural Networks (ResNets), which reduces network complexity without comprom… ▽ More

    Submitted 6 December, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  9. Identifying public values and spatial conflicts in urban planning

    Authors: Rico H. Herzog, Juliana E. Gonçalves, Geertje Slingerland, Reinout Kleinhans, Holger Prang, Frances Brazier, Trivik Verma

    Abstract: Identifying the diverse and often competing values of citizens, and resolving the consequent public value conflicts, are of significant importance for inclusive and integrated urban development. Scholars have highlighted that relational, value-laden urban space gives rise to many diverse conflicts that vary both spatially and temporally. Although notions of public value conflicts have been conceiv… ▽ More

    Submitted 18 July, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

  10. arXiv:2205.02979  [pdf, other

    cs.LG cs.AI cs.CL

    Explaining the Effectiveness of Multi-Task Learning for Efficient Knowledge Extraction from Spine MRI Reports

    Authors: Arijit Sehanobish, McCullen Sandora, Nabila Abraham, Jayashri Pawar, Danielle Torres, Anasuya Das, Murray Becker, Richard Herzog, Benjamin Odry, Ron Vianu

    Abstract: Pretrained Transformer based models finetuned on domain specific corpora have changed the landscape of NLP. However, training or fine-tuning these models for individual tasks can be time consuming and resource intensive. Thus, a lot of current research is focused on using transformers for multi-task learning (Raffel et al.,2020) and how to group the tasks to help a multi-task model to learn effect… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: To appear at NAACL-2022, Industry Track. Follow-up of previous work: arXiv:2204.04544

  11. arXiv:2204.04544  [pdf, other

    cs.LG cs.AI cs.CL

    Efficient Extraction of Pathologies from C-Spine Radiology Reports using Multi-Task Learning

    Authors: Arijit Sehanobish, Nathaniel Brown, Ishita Daga, Jayashri Pawar, Danielle Torres, Anasuya Das, Murray Becker, Richard Herzog, Benjamin Odry, Ron Vianu

    Abstract: Pretrained Transformer based models finetuned on domain specific corpora have changed the landscape of NLP. Generally, if one has multiple tasks on a given dataset, one may finetune different models or use task specific adapters. In this work, we show that a multi-task model can beat or achieve the performance of multiple BERT-based models finetuned on various tasks and various task specific adapt… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted at 6th International Workshop on Health Intelligence, AAAI-2022. To appear in as a book chapter published by Springer in Studies in Computational Intelligence

  12. arXiv:2202.09206  [pdf, other

    cs.CV

    Spatio-Temporal Outdoor Lighting Aggregation on Image Sequences using Transformer Networks

    Authors: Haebom Lee, Christian Homeyer, Robert Herzog, Jan Rexilius, Carsten Rother

    Abstract: In this work, we focus on outdoor lighting estimation by aggregating individual noisy estimates from images, exploiting the rich image information from wide-angle cameras and/or temporal image sequences. Photographs inherently encode information about the scene's lighting in the form of shading and shadows. Recovering the lighting is an inverse rendering problem and as that ill-posed. Recent work… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: 11 pages, 7 figures, 1 table, currently under a review process

  13. Using Probabilistic Movement Primitives in Analyzing Human Motion Difference under Transcranial Current Stimulation

    Authors: Honghu Xue, Rebecca Herzog, Till M Berger, Tobias Bäumer, Anne Weissbach, Elmar Rueckert

    Abstract: In medical tasks such as human motion analysis, computer-aided auxiliary systems have become preferred choice for human experts for its high efficiency. However, conventional approaches are typically based on user-defined features such as movement onset times, peak velocities, motion vectors or frequency domain analyses. Such approaches entail careful data post-processing or specific domain knowle… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Journal ref: https://www.frontiersin.org/articles/10.3389/frobt.2021.721890/full

  14. arXiv:2012.11748  [pdf, other

    math.NA cs.CG

    Mesh Denoising and Inpainting using the Total Variation of the Normal and a Shape Newton Approach

    Authors: Lukas Baumgärtner, Ronny Bergmann, Roland Herzog, Stephan Schmidt, José Vidal-Núñez, Manuel Weiß

    Abstract: We present a novel approach to denoising and inpainting problems for surface meshes. The purpose of these problems is to remove noise or fill in missing parts while preserving important features such as sharp edges. A discrete variant of the total variation of the unit normal vector field serves as a regularizing functional to achieve these goals. In order to solve the resulting problem, we use a… ▽ More

    Submitted 12 March, 2024; v1 submitted 21 December, 2020; originally announced December 2020.