Showing 1–26 of 26 results for author: Luedtke, S

  1. arXiv:2407.02112  [pdf, other]

    cs.LG cs.AI

    A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular Data

    Authors: Andrej Tschalzev, Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Tabular data is prevalent in real-world machine learning applications, and new models for supervised learning of tabular data are frequently proposed. Comparative studies assessing the performance of models typically consist of model-centric evaluation setups with overly standardized data preprocessing. This paper demonstrates that such model-centric evaluations are biased, as real-world modeling…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.01115  [pdf, other]

    cs.LG stat.ML

    Enabling Mixed Effects Neural Networks for Diverse, Clustered Data Using Monte Carlo Methods

    Authors: Andrej Tschalzev, Paul Nitschke, Lukas Kirchdorfer, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Neural networks often assume independence among input data samples, disregarding correlations arising from inherent clustering patterns in real-world datasets (e.g., due to different sites or repeated measurements). Recently, mixed effects neural networks (MENNs) which separate cluster-specific 'random effects' from cluster-invariant 'fixed effects' have been proposed to improve generalization and…

    Submitted 1 July, 2024; originally announced July 2024.

  3. Community recommendations on cryoEM data archiving and validation

    Authors: Gerard J. Kleywegt, Paul D. Adams, Sarah J. Butcher, Cathy Lawson, Alexis Rohou, Peter B. Rosenthal, Sriram Subramaniam, Maya Topf, Sanja Abbott, Philip R. Baldwin, John M. Berrisford, Gérard Bricogne, Preeti Choudhary, Tristan I. Croll, Radostin Danev, Sai J. Ganesan, Timothy Grant, Aleksandras Gutmanas, Richard Henderson, J. Bernard Heymann, Juha T. Huiskonen, Andrei Istrate, Takayuki Kato, Gabriel C. Lander, Shee-Mei Lok, et al. (22 additional authors not shown)

    Abstract: In January 2020, a workshop was held at EMBL-EBI (Hinxton, UK) to discuss data requirements for deposition and validation of cryoEM structures, with a focus on single-particle analysis. The meeting was attended by 45 experts in data processing, model building and refinement, validation, and archiving of such structures. This report describes the workshop's motivation and history, the topics discus…

    Submitted 2 February, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Outcomes of a wwPDB/EMDB workshop on cryoEM data management, deposition and validation

  4. arXiv:2309.17130  [pdf, other]

    cs.LG

    GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Despite the success of deep learning for text and image data, tree-based ensemble models are still state-of-the-art for machine learning with heterogeneous tabular data. However, there is a significant need for tabular-specific gradient-based methods due to their high flexibility. In this paper, we propose $\text{GRANDE}$, $\text{GRA}$die$\text{N}$t-Based $\text{D}$ecision Tree $\text{E}$nsembles,…

    Submitted 12 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  5. arXiv:2309.00306  [pdf, ps, other]

    cs.AI

    On the Aggregation of Rules for Knowledge Graph Completion

    Authors: Patrick Betz, Stefan Lüdtke, Christian Meilicke, Heiner Stuckenschmidt

    Abstract: Rule learning approaches for knowledge graph completion are efficient, interpretable and competitive with purely neural models. The rule aggregation problem is concerned with finding one plausibility score for a candidate fact which was simultaneously predicted by multiple rules. Although the problem is ubiquitous, as data-driven rule learning can result in noisy and large rulesets, it is underrepre…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: KLR Workshop@ICML2023

  6. arXiv:2308.03403  [pdf, other]

    cs.LG

    Towards Machine Learning-based Fish Stock Assessment

    Authors: Stefan Lüdtke, Maria E. Pierce

    Abstract: The accurate assessment of fish stocks is crucial for sustainable fisheries management. However, existing statistical stock assessment models can have low forecast performance of relevant stock parameters like recruitment or spawning stock biomass, especially in ecosystems that are changing due to global warming and other anthropogenic stressors. In this paper, we investigate the use of machine le…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted at Fragile Earth Workshop 2023

  7. arXiv:2305.03515  [pdf, other]

    cs.LG cs.AI

    GradTree: Learning Axis-Aligned Decision Trees with Gradient Descent

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Decision Trees (DTs) are commonly used for many machine learning tasks due to their high degree of interpretability. However, learning a DT from data is a difficult optimization problem, as it is non-convex and non-differentiable. Therefore, common approaches learn DTs using a greedy growth algorithm that minimizes the impurity locally at each internal node. Unfortunately, this greedy procedure ca…

    Submitted 12 March, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  8. arXiv:2301.10571  [pdf, other]

    cs.AI

    Leveraging Planning Landmarks for Hybrid Online Goal Recognition

    Authors: Nils Wilken, Lea Cohausz, Johannes Schaum, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Goal recognition is an important problem in many application domains (e.g., pervasive computing, intrusion detection, computer games, etc.). In many application scenarios it is important that goal recognition algorithms can recognize goals of an observed agent as fast as possible and with minimal domain knowledge. Hence, in this paper, we propose a hybrid method for online goal recognition that co…

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 9 pages. Presented at SPARK 2022 (https://icaps22.icaps-conference.org/workshops/SPARK/)

  9. arXiv:2301.05608  [pdf, other]

    cs.AI

    Investigating the Combination of Planning-Based and Data-Driven Methods for Goal Recognition

    Authors: Nils Wilken, Lea Cohausz, Johannes Schaum, Stefan Lüdtke, Heiner Stuckenschmidt

    Abstract: An important feature of pervasive, intelligent assistance systems is the ability to dynamically adapt to the current needs of their users. Hence, it is critical for such systems to be able to recognize those goals and needs based on observations of the user's actions and state of the environment. In this work, we investigate the application of two state-of-the-art, planning-based plan recognition…

    Submitted 13 January, 2023; originally announced January 2023.

  10. arXiv:2210.16425  [pdf, other]

    physics.acc-ph physics.plasm-ph

    Feasibility study for the hard x-ray free electron laser based on synergistic use of conventional and plasma accelerator technologies

    Authors: Nikolai Yampolsky, Sandra Biedron, Bjorn Manuel Hegelich, Scott Luedtke, Evgenya Simakov, Stephen Milton

    Abstract: We assess the possibility of using the conventional RF accelerator as an injector for the plasma driven wakefield accelerator. Conventional accelerators deliver high quality beams with low emittance and low energy spread. Once injected into the plasma wake, the emittance may be preserved upon proper beam matching while the energy spread may not due to long beam duration delivered by the convention…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: Internal report from Los Alamos National Laboratory. LA-UR-22-30406

  11. arXiv:2207.11492  [pdf]

    physics.plasm-ph physics.acc-ph

    High-charge 10 GeV electron acceleration in a 10 cm nanoparticle-assisted hybrid wakefield accelerator

    Authors: Constantin Aniculaesei, Thanh Ha, Samuel Yoffe, Edward McCary, Michael M Spinks, Hernan J. Quevedo, Lance Labun, Ou Z. Labun, Ritwik Sain, Andrea Hannasch, Rafal Zgadzaj, Isabella Pagano, Jose A. Franco-Altamirano, Martin L. Ringuette, Erhart Gaul, Scott V. Luedtke, Ganesh Tiwari, Bernhard Ersfeld, Enrico Brunetti, Hartmut Ruhl, Todd Ditmire, Sandra Bruce, Michael E. Donovan, Dino A. Jaroszynski, Michael C. Downer, et al. (1 additional author not shown)

    Abstract: In an electron wakefield accelerator, an intense laser pulse or charged particle beam excites plasma waves. Under proper conditions, electrons from the background plasma are trapped in the plasma wave and accelerated to ultra-relativistic velocities. We present recent results from a proof-of-principle wakefield acceleration experiment that reveal a unique synergy between a laser-driven and particl…

    Submitted 18 August, 2023; v1 submitted 23 July, 2022; originally announced July 2022.

    Journal ref: Matter Radiat. Extremes 9, 014001 (2024)

  12. arXiv:2207.08816  [pdf, other]

    cs.LG cs.AI

    Discovering Behavioral Predispositions in Data to Improve Human Activity Recognition

    Authors: Maximilian Popko, Sebastian Bader, Stefan Lüdtke, Thomas Kirste

    Abstract: The automatic, sensor-based assessment of challenging behavior of persons with dementia is an important task to support the selection of interventions. However, predicting behaviors like apathy and agitation is challenging due to the large inter- and intra-patient variability. The goal of this paper is to improve the recognition performance by making use of the observation that patients tend to show s…

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: Submitted to iWOAR 2022 - 7th international Workshop on Sensor-Based Activity Recognition and Artificial Intelligence

  13. arXiv:2207.08414  [pdf, other]

    cs.LG

    Outlier Explanation via Sum-Product Networks

    Authors: Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Outlier explanation is the task of identifying a set of features that distinguish a sample from normal data, which is important for downstream (human) decision-making. Existing methods are based on beam search in the space of feature subsets. They quickly become computationally expensive, as they require running an outlier detection algorithm from scratch for each feature subset. To alleviate this…

    Submitted 18 July, 2022; originally announced July 2022.

  14. Explaining Neural Networks without Access to Training Data

    Authors: Sascha Marton, Stefan Lüdtke, Christian Bartelt, Andrej Tschalzev, Heiner Stuckenschmidt

    Abstract: We consider generating explanations for neural networks in cases where the network's training data is not accessible, for instance due to privacy or safety issues. Recently, $\mathcal{I}$-Nets have been proposed as a sample-free approach to post-hoc, global model interpretability that does not require access to training data. They formulate interpretation as a machine learning task that maps netwo…

    Submitted 10 June, 2022; originally announced June 2022.

    Journal ref: Machine Learning (2024)

  15. arXiv:2202.00332  [pdf, other]

    cs.AI

    Activity Recognition in Assembly Tasks by Bayesian Filtering in Multi-Hypergraphs

    Authors: Timon Felske, Stefan Lüdtke, Sebastian Bader, Thomas Kirste

    Abstract: We study sensor-based human activity recognition in manual work processes like assembly tasks. In such processes, the system states often have a rich structure, involving object properties and relations. Thus, estimating the hidden system state from sensor observations by recursive Bayesian filtering can be very challenging, due to the combinatorial explosion in the number of system states. To all…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Accepted for presentation at the 2nd GCLR workshop in conjunction with AAAI 2022

  16. arXiv:2111.04564  [pdf, other]

    eess.SP cs.AI cs.LG

    Human Activity Recognition using Attribute-Based Neural Networks and Context Information

    Authors: Stefan Lüdtke, Fernando Moya Rueda, Waqas Ahmed, Gernot A. Fink, Thomas Kirste

    Abstract: We consider human activity recognition (HAR) from wearable sensor data in manual-work processes, like warehouse order-picking. Such structured domains can often be partitioned into distinct process steps, e.g., packaging or transporting. Each process step can have a different prior distribution over activity classes, e.g., standing or walking, and different system dynamics. Here, we show how such…

    Submitted 28 October, 2021; originally announced November 2021.

    Comments: 3rd International Workshop on Deep Learning for Human Activity Recognition

  17. arXiv:2110.05165  [pdf, other]

    cs.LG cs.AI stat.ML

    Exchangeability-Aware Sum-Product Networks

    Authors: Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Sum-Product Networks (SPNs) are expressive probabilistic models that provide exact, tractable inference. They achieve this efficiency by making use of local independence. On the other hand, mixtures of exchangeable variable models (MEVMs) are a class of tractable probabilistic models that make use of exchangeability of discrete random variables to render inference tractable. Exchangeability, which…

    Submitted 28 April, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: accepted at IJCAI 2022

  18. VPIC 2.0: Next Generation Particle-in-Cell Simulations

    Authors: Robert Bird, Nigel Tan, Scott V. Luedtke, Stephen Lien Harrell, Michela Taufer, Brian Albright

    Abstract: VPIC is a general purpose Particle-in-Cell simulation code for modeling plasma phenomena such as magnetic reconnection, fusion, solar weather, and laser-plasma interaction in three dimensions using large numbers of particles. VPIC's capacity in both fidelity and scale makes it particularly well-suited for plasma research on pre-exascale and exascale platforms. In this paper we demonstrate the uniq…

    Submitted 25 February, 2021; originally announced February 2021.

  19. Deep learning based mixed-dimensional GMM for characterizing variability in CryoEM

    Authors: Muyuan Chen, Steven Ludtke

    Abstract: Structural flexibility and/or dynamic interactions with other molecules is a critical aspect of protein function. CryoEM provides direct visualization of individual macromolecules sampling different conformational and compositional states. While numerous methods are available for computational classification of discrete states, characterization of continuous conformational changes or large numbers…

    Submitted 23 May, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: 31 pages, 5 main figures and 8 supplementary figures

    Journal ref: Nature Methods 18, 930-936 (2021)

  20. Creating QED Photon Jets with Present-Day Lasers

    Authors: Scott V. Luedtke, Lin Yin, Lance A. Labun, Ou Z. Labun, B. J. Albright, Robert F. Bird, W. D. Nystrom, Björn Manuel Hegelich

    Abstract: Large-scale, relativistic particle-in-cell simulations with quantum electrodynamics (QED) models show that high energy (1$<E_γ\lesssim$ 75 MeV) QED photon jets with a flux of $10^{12}$ sr$^{-1}$ can be created with present-day lasers and planar, unstructured targets. This process involves a self-forming channel in the target in response to a laser pulse focused tightly ($f$ number unity) onto the…

    Submitted 15 September, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 5 pages, 4 figures

    Journal ref: Phys. Rev. Research 3, 032061 (2021)

  21. arXiv:1902.03978  [pdf]

    q-bio.QM eess.IV

    A complete data processing workflow for CryoET and subtomogram averaging

    Authors: Muyuan Chen, James M. Bell, Xiaodong Shi, Stella Y. Sun, Zhao Wang, Steven J. Ludtke

    Abstract: Electron cryotomography (CryoET) is currently the only method capable of visualizing cells in 3D at nanometer resolutions. While modern instruments produce massive amounts of tomography data containing extremely rich structural information, the data processing is very labor intensive and results are often limited by the skills of the personnel rather than the data. We present an integrated workflo…

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: 21 pages, 4+2 figures

    Journal ref: Nature Methods 16 (2019) 1161-1168

  22. arXiv:1808.07067  [pdf, other]

    physics.plasm-ph

    Jet Observable for Photons from High-Intensity Laser-Plasma Interactions

    Authors: Scott V. Luedtke, Lance A. Labun, Ou Z. Labun, Karl-Ulrich Bamberg, Hartmut Ruhl, Björn Manuel Hegelich

    Abstract: The goals of discovering quantum radiation dynamics in high-intensity laser-plasma interactions and engineering new laser-driven high-energy particle sources both require accurate and robust predictions. Experiments rely on particle-in-cell simulations to predict and interpret outcomes, but unknowns in modeling the interaction limit the simulations to qualitative predictions, too uncertain to test…

    Submitted 21 August, 2018; originally announced August 2018.

  23. State-Space Abstractions for Probabilistic Inference: A Systematic Review

    Authors: Stefan Lüdtke, Max Schröder, Frank Krüger, Sebastian Bader, Thomas Kirste

    Abstract: Tasks such as social network analysis, human behavior recognition, or modeling biochemical reactions, can be solved elegantly by using the probabilistic inference framework. However, standard probabilistic inference algorithms work at a propositional level, and thus cannot capture the symmetries and redundancies that are present in these tasks. Algorithms that exploit those symmetries have been de…

    Submitted 4 December, 2018; v1 submitted 18 April, 2018; originally announced April 2018.

  24. arXiv:1801.10495  [pdf, other]

    cs.AI

    Lifted Filtering via Exchangeable Decomposition

    Authors: Stefan Lüdtke, Max Schröder, Sebastian Bader, Kristian Kersting, Thomas Kirste

    Abstract: We present a model for exact recursive Bayesian filtering based on lifted multiset states. Combining multisets with lifting makes it possible to simultaneously exploit multiple strategies for reducing inference complexity when compared to list-based grounded state representations. The core idea is to borrow the concept of Maximally Parallel Multiset Rewriting Systems and to enhance it by concepts…

    Submitted 7 May, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

  25. arXiv:1707.06446  [pdf, other]

    cs.AI

    Sequential Lifted Bayesian Filtering in Multiset Rewriting Systems

    Authors: Max Schröder, Stefan Lüdtke, Sebastian Bader, Frank Krüger, Thomas Kirste

    Abstract: Bayesian Filtering for plan and activity recognition is challenging for scenarios that contain many observation equivalent entities (i.e. entities that produce the same observations). This is due to the combinatorial explosion in the number of hypotheses that need to be tracked. However, this class of problems exhibits a certain symmetry that can be exploited for state space representation and inf…

    Submitted 14 August, 2017; v1 submitted 20 July, 2017; originally announced July 2017.

    Comments: 7 pages, 3 figures, accepted at UAI-17 Statistical Relational AI (StarAI) workshop

  26. Convolutional Neural Networks for Automated Annotation of Cellular Cryo-Electron Tomograms

    Authors: Muyuan Chen, Wei Dai, Ying Sun, Darius Jonasch, Cynthia Y He, Michael F. Schmid, Wah Chiu, Steven J Ludtke

    Abstract: Cellular Electron Cryotomography (CryoET) offers the ability to look inside cells and observe macromolecules frozen in action. A primary challenge for this technique is identifying and extracting the molecular components within the crowded cellular environment. We introduce a method using neural networks to dramatically reduce the time and human effort required for subcellular annotation and featu…

    Submitted 11 June, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

    Comments: 21 pages, 8 figures

    Journal ref: Nature Methods volume 14, 983-985 (2017)