Showing 1–5 of 5 results for author: Behrmann, N

Searching in archive cs.
  1. arXiv:2310.12956

    cs.LG cs.AI cs.CV

    Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems

    Authors: David T. Hoffmann, Simon Schrodi, Jelena Bratulić, Nadine Behrmann, Volker Fischer, Thomas Brox

    Abstract: In this work, we study rapid improvements of the training loss in transformers when being confronted with multi-step decision tasks. We found that transformers struggle to learn the intermediate task and both training and validation loss saturate for hundreds of epochs. When transformers finally learn the intermediate task, they do this rapidly and unexpectedly. We call these abrupt improvements E…

    Submitted 6 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted at ICML 2024

  2. arXiv:2209.00638

    cs.CV

    Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation

    Authors: Nadine Behrmann, S. Alireza Golestaneh, Zico Kolter, Juergen Gall, Mehdi Noroozi

    Abstract: This paper introduces a unified framework for video action segmentation via sequence to sequence (seq2seq) translation in a fully and timestamp supervised setup. In contrast to current state-of-the-art frame-level prediction methods, we view action segmentation as a seq2seq translation task, i.e., mapping a sequence of video frames to a sequence of action segments. Our proposed method involves a s…

    Submitted 11 October, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: ECCV 2022 (Main Conference)

  3. arXiv:2201.11736

    cs.CV

    Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives

    Authors: David T. Hoffmann, Nadine Behrmann, Juergen Gall, Thomas Brox, Mehdi Noroozi

    Abstract: This paper introduces Ranking Info Noise Contrastive Estimation (RINCE), a new member in the family of InfoNCE losses that preserves a ranked ordering of positive samples. In contrast to the standard InfoNCE loss, which requires a strict binary separation of the training pairs into similar and dissimilar samples, RINCE can exploit information about a similarity ranking for learning a corresponding…

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: AAAI 2022 (Main Track)

  4. arXiv:2109.11593

    cs.CV

    Long Short View Feature Decomposition via Contrastive Video Representation Learning

    Authors: Nadine Behrmann, Mohsen Fayyaz, Juergen Gall, Mehdi Noroozi

    Abstract: Self-supervised video representation methods typically focus on the representation of temporal attributes in videos. However, the role of stationary versus non-stationary attributes is less explored: Stationary features, which remain similar throughout the video, enable the prediction of video-level action classes. Non-stationary features, which represent temporally varying attributes, are more be…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: ICCV 2021 (Main Conference)

  5. arXiv:2011.06037

    cs.CV cs.LG

    Unsupervised Video Representation Learning by Bidirectional Feature Prediction

    Authors: Nadine Behrmann, Juergen Gall, Mehdi Noroozi

    Abstract: This paper introduces a novel method for self-supervised video representation learning via feature prediction. In contrast to the previous methods that focus on future feature prediction, we argue that a supervisory signal arising from unobserved past frames is complementary to one that originates from the future frames. The rationale behind our method is to encourage the network to explore the te…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted at WACV 2021