Skip to main content

Showing 1–10 of 10 results for author: Stuhmer, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.00050  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph

    Grappa -- A Machine Learned Molecular Mechanics Force Field

    Authors: Leif Seute, Eric Hartmann, Jan Stühmer, Frauke Gräter

    Abstract: Simulating large molecular systems over long timescales requires force fields that are both accurate and efficient. In recent years, E(3) equivariant neural networks have lifted the tension between computational efficiency and accuracy of force fields, but they are still several orders of magnitude more expensive than classical molecular mechanics (MM) force fields. Here, we propose a novel mach… ▽ More

    Submitted 25 March, 2024; originally announced April 2024.

  2. arXiv:2302.14102  [pdf, other

    cs.LG cond-mat.mtrl-sci physics.chem-ph

    Connectivity Optimized Nested Graph Networks for Crystal Structures

    Authors: Robin Ruff, Patrick Reiser, Jan Stühmer, Pascal Friederich

    Abstract: Graph neural networks (GNNs) have been applied to a large variety of applications in materials science and chemistry. Here, we recapitulate the graph construction for crystalline (periodic) materials and investigate its impact on the GNNs model performance. We suggest the asymmetric unit cell as a representation to reduce the number of atoms by using all symmetries of the system. This substantiall… ▽ More

    Submitted 9 August, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 19 pages, 13 figures

    ACM Class: J.2

  3. arXiv:2302.12712  [pdf, other

    cs.CV

    Amortised Invariance Learning for Contrastive Self-Supervision

    Authors: Ruchika Chavhan, Henry Gouk, Jan Stuehmer, Calum Heggan, Mehrdad Yaghoobi, Timothy Hospedales

    Abstract: Contrastive self-supervised learning methods famously produce high quality transferable representations by learning invariances to different data augmentations. Invariances established during pre-training can be interpreted as strong inductive biases. However these may or may not be helpful, depending on if they match the invariance requirements of downstream tasks or not. This has led to several… ▽ More

    Submitted 3 April, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: ICLR 2023, Code available here: https://github.com/ruchikachavhan/amortized-invariance-learning-ssl/

  4. arXiv:2207.08304  [pdf, other

    cs.LG cs.CV

    HyperInvariances: Amortizing Invariance Learning

    Authors: Ruchika Chavhan, Henry Gouk, Jan Stühmer, Timothy Hospedales

    Abstract: Providing invariances in a given learning task conveys a key inductive bias that can lead to sample-efficient learning and good generalisation, if correctly specified. However, the ideal invariances for many problems of interest are often not known, which has led both to a body of engineering lore as well as attempts to provide frameworks for invariance learning. However, invariance learning is ex… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: ICML 2022, Workshop on Spurious Correlations, Invariance, and Stability

  5. arXiv:2204.07305  [pdf, other

    cs.CV cs.LG

    Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference

    Authors: Shell Xu Hu, Da Li, Jan Stühmer, Minyoung Kim, Timothy M. Hospedales

    Abstract: Few-shot learning (FSL) is an important and topical problem in computer vision that has motivated extensive research into numerous methods spanning from sophisticated meta-learning methods to simple transfer learning baselines. We seek to push the limits of a simple-but-effective pipeline for more realistic and practical settings of few-shot image classification. To this end, we explore few-shot l… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by CVPR2022

  6. arXiv:2104.11181  [pdf, other

    cs.CV

    H2O: Two Hands Manipulating Objects for First Person Interaction Recognition

    Authors: Taein Kwon, Bugra Tekin, Jan Stuhmer, Federica Bogo, Marc Pollefeys

    Abstract: We present a comprehensive framework for egocentric interaction recognition using markerless 3D annotations of two hands manipulating objects. To this end, we propose a method to create a unified dataset for egocentric 3D interaction recognition. Our method produces annotations of the 3D pose of two hands and the 6D pose of the manipulated objects, along with their interaction labels for each fram… ▽ More

    Submitted 24 August, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to ICCV 2021

  7. arXiv:2008.11239  [pdf, other

    cs.CV

    HoloLens 2 Research Mode as a Tool for Computer Vision Research

    Authors: Dorin Ungureanu, Federica Bogo, Silvano Galliani, Pooja Sama, Xin Duan, Casey Meekhof, Jan Stühmer, Thomas J. Cashman, Bugra Tekin, Johannes L. Schönberger, Pawel Olszta, Marc Pollefeys

    Abstract: Mixed reality headsets, such as the Microsoft HoloLens 2, are powerful sensing devices with integrated compute capabilities, which makes it an ideal platform for computer vision research. In this technical report, we present HoloLens 2 Research Mode, an API and a set of tools enabling access to the raw sensor streams. We provide an overview of the API and explain how it can be used to build mixed… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

  8. arXiv:1910.05639  [pdf, other

    cs.LG cs.SI nlin.CD stat.ML

    Disentangling Interpretable Generative Parameters of Random and Real-World Graphs

    Authors: Niklas Stoehr, Emine Yilmaz, Marc Brockschmidt, Jan Stuehmer

    Abstract: While a wide range of interpretable generative procedures for graphs exist, matching observed graph topologies with such procedures and choices for its parameters remains an open problem. Devising generative models that closely reproduce real-world graphs requires domain knowledge and time-consuming simulation. While existing deep learning approaches rely on less manual modelling, they offer littl… ▽ More

    Submitted 6 November, 2019; v1 submitted 12 October, 2019; originally announced October 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Workshop on Graph Representation Learning

  9. arXiv:1909.05063  [pdf, other

    stat.ML cs.LG

    Independent Subspace Analysis for Unsupervised Learning of Disentangled Representations

    Authors: Jan Stühmer, Richard E. Turner, Sebastian Nowozin

    Abstract: Recently there has been an increased interest in unsupervised learning of disentangled representations using the Variational Autoencoder (VAE) framework. Most of the existing work has focused largely on modifying the variational cost function to achieve this goal. We first show that these modifications, e.g. beta-VAE, simplify the tendency of variational inference to underfit causing pathological… ▽ More

    Submitted 5 September, 2019; originally announced September 2019.

  10. arXiv:1805.09281  [pdf, other

    stat.ML cs.LG

    Variational Inference for Data-Efficient Model Learning in POMDPs

    Authors: Sebastian Tschiatschek, Kai Arulkumaran, Jan Stühmer, Katja Hofmann

    Abstract: Partially observable Markov decision processes (POMDPs) are a powerful abstraction for tasks that require decision making under uncertainty, and capture a wide range of real world tasks. Today, effective planning approaches exist that generate effective strategies given black-box models of a POMDP task. Yet, an open question is how to acquire accurate models for complex domains. In this paper we p… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.