Skip to main content

Showing 1–14 of 14 results for author: Brunner, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.05271  [pdf, other

    cs.CV eess.IV stat.ML

    A Field of Experts Prior for Adapting Neural Networks at Test Time

    Authors: Neerav Karani, Georg Brunner, Ertunc Erdil, Simin Fei, Kerem Tezcan, Krishna Chaitanya, Ender Konukoglu

    Abstract: Performance of convolutional neural networks (CNNs) in image analysis tasks is often marred in the presence of acquisition-related distribution shifts between training and test images. Recently, it has been proposed to tackle this problem by fine-tuning trained CNNs for each test image. Such test-time-adaptation (TTA) is a promising and practical strategy for improving robustness to distribution s… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Manuscript under review

  2. arXiv:2101.04547  [pdf, other

    cs.CL cs.AI

    Of Non-Linearity and Commutativity in BERT

    Authors: Sumu Zhao, Damian Pascual, Gino Brunner, Roger Wattenhofer

    Abstract: In this work we provide new insights into the transformer architecture, and in particular, its best-known variant, BERT. First, we propose a method to measure the degree of non-linearity of different elements of transformers. Next, we focus our investigation on the feed-forward networks (FFN) inside transformers, which contain 2/3 of the model parameters and have so far not received much attention… ▽ More

    Submitted 7 May, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

    Comments: Accepted at IJCNN 2021

  3. arXiv:2008.11159  [pdf, other

    cs.SD cs.LG eess.AS

    Medley2K: A Dataset of Medley Transitions

    Authors: Lukas Faber, Sandro Luck, Damian Pascual, Andreas Roth, Gino Brunner, Roger Wattenhofer

    Abstract: The automatic generation of medleys, i.e., musical pieces formed by different songs concatenated via smooth transitions, is not well studied in the current literature. To facilitate research on this topic, we make available a dataset called Medley2K that consists of 2,000 medleys and 7,712 labeled transitions. Our dataset features a rich variety of song transitions across different music genres. W… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: MML 2020 - 13th Int. Workshop on Machine Learning and Music at ECML-PKDD 2020

  4. arXiv:2004.05916  [pdf, other

    cs.LG cs.CL

    Telling BERT's full story: from Local Attention to Global Aggregation

    Authors: Damian Pascual, Gino Brunner, Roger Wattenhofer

    Abstract: We take a deep look into the behavior of self-attention heads in the transformer architecture. In light of recent work discouraging the use of attention distributions for explaining a model's behavior, we show that attention distributions can nevertheless provide insights into the local behavior of attention heads. This way, we propose a distinction between local patterns revealed by attention and… ▽ More

    Submitted 13 January, 2021; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: Accepted at EACL 2021

  5. arXiv:1908.04211  [pdf, other

    cs.CL cs.LG

    On Identifiability in Transformers

    Authors: Gino Brunner, Yang Liu, Damián Pascual, Oliver Richter, Massimiliano Ciaramita, Roger Wattenhofer

    Abstract: In this paper we delve deep in the Transformer architecture by investigating two of its core components: self-attention and contextual embeddings. In particular, we study the identifiability of attention weights and token embeddings, and the aggregation of context into hidden tokens. We show that, for sequences longer than the attention head dimension, attention weights are not identifiable. We pr… ▽ More

    Submitted 7 February, 2020; v1 submitted 12 August, 2019; originally announced August 2019.

    Comments: Published as a conference paper at ICLR 2020

    MSC Class: I.2.7; I.7.0 ACM Class: I.2.7; I.7.0

  6. arXiv:1907.02874  [pdf, other

    cs.LG cs.AI stat.ML

    Attentive Multi-Task Deep Reinforcement Learning

    Authors: Timo Bram, Gino Brunner, Oliver Richter, Roger Wattenhofer

    Abstract: Sharing knowledge between tasks is vital for efficient learning in a multi-task setting. However, most research so far has focused on the easier case where knowledge transfer is not harmful, i.e., where knowledge from one task cannot negatively impact the performance on another task. In contrast, we present an approach to multi-task deep reinforcement learning based on attention that does not requ… ▽ More

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: Accepted as conference paper at ECML PKDD 2019

    MSC Class: 93E35 ACM Class: I.2.6; I.2.8

  7. arXiv:1810.00361  [pdf, other

    cs.LG cs.AI stat.ML

    Using State Predictions for Value Regularization in Curiosity Driven Deep Reinforcement Learning

    Authors: Gino Brunner, Manuel Fritsche, Oliver Richter, Roger Wattenhofer

    Abstract: Learning in sparse reward settings remains a challenge in Reinforcement Learning, which is often addressed by using intrinsic rewards. One promising strategy is inspired by human curiosity, requiring the agent to learn to predict the future. In this paper a curiosity-driven agent is extended to use these predictions directly for training. To achieve this, the agent predicts the value function of t… ▽ More

    Submitted 30 September, 2018; originally announced October 2018.

  8. arXiv:1809.08022  [pdf, other

    cs.RO eess.SY

    The Urban Last Mile Problem: Autonomous Drone Delivery to Your Balcony

    Authors: Gino Brunner, Bence Szebedy, Simon Tanner, Roger Wattenhofer

    Abstract: Drone delivery has been a hot topic in the industry in the past few years. However, existing approaches either focus on rural areas or rely on centralized drop-off locations from where the last mile delivery is performed. In this paper we tackle the problem of autonomous last mile delivery in urban environments using an off-the-shelf drone. We build a prototype system that is able to fly to the ap… ▽ More

    Submitted 21 September, 2018; originally announced September 2018.

  9. arXiv:1809.07600  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer

    Authors: Gino Brunner, Andres Konrad, Yuyi Wang, Roger Wattenhofer

    Abstract: We introduce MIDI-VAE, a neural network model based on Variational Autoencoders that is capable of handling polyphonic music with multiple instrument tracks, as well as modeling the dynamics of music by incorporating note durations and velocities. We show that MIDI-VAE can perform style transfer on symbolic music by automatically changing pitches, dynamics and instruments of a music piece from, e.… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: Paper accepted at the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France

    ACM Class: I.2.1; I.2.4; I.2.6; H.5.5

  10. arXiv:1809.07575  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Symbolic Music Genre Transfer with CycleGAN

    Authors: Gino Brunner, Yuyi Wang, Roger Wattenhofer, Sumu Zhao

    Abstract: Deep generative models such as Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs) have recently been applied to style and domain transfer for images, and in the case of VAEs, music. GAN-based models employing several generators and some form of cycle consistency loss have been among the most successful for image domain transfer. In this paper we apply such a model to symbol… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: Paper accepted at the 30th International Conference on Tools with Artificial Intelligence, ICTAI 2018, Volos, Greece

    ACM Class: I.2.1; I.2.4; I.2.6; H.5.5

  11. arXiv:1801.06024  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Natural Language Multitasking: Analyzing and Improving Syntactic Saliency of Hidden Representations

    Authors: Gino Brunner, Yuyi Wang, Roger Wattenhofer, Michael Weigelt

    Abstract: We train multi-task autoencoders on linguistic tasks and analyze the learned hidden sentence representations. The representations change significantly when translation and part-of-speech decoders are added. The more decoders a model employs, the better it clusters sentences according to their syntactic similarity, as the representation space becomes less entangled. We explore the structure of the… ▽ More

    Submitted 18 January, 2018; originally announced January 2018.

    Comments: The 31st Annual Conference on Neural Information Processing (NIPS) - Workshop on Learning Disentangled Features: from Perception to Control, Long Beach, CA, December 2017

  12. arXiv:1711.07682  [pdf, ps, other

    cs.SD cs.AI cs.IT cs.LG eess.AS stat.ML

    JamBot: Music Theory Aware Chord Based Generation of Polyphonic Music with LSTMs

    Authors: Gino Brunner, Yuyi Wang, Roger Wattenhofer, Jonas Wiesendanger

    Abstract: We propose a novel approach for the generation of polyphonic music based on LSTMs. We generate music in two steps. First, a chord LSTM predicts a chord progression based on a chord embedding. A second LSTM then generates polyphonic music from the predicted chord progression. The generated music sounds pleasing and harmonic, with only few dissonant notes. It has clear long-term structure that is si… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: Paper presented at the 29th International Conference on Tools with Artificial Intelligence, ICTAI 2017, Boston, MA, USA

    ACM Class: I.2.1; I.2.4; I.2.6; H.5.5

  13. arXiv:1711.07479  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Teaching a Machine to Read Maps with Deep Reinforcement Learning

    Authors: Gino Brunner, Oliver Richter, Yuyi Wang, Roger Wattenhofer

    Abstract: The ability to use a 2D map to navigate a complex 3D environment is quite remarkable, and even difficult for many humans. Localization and navigation is also an important problem in domains such as robotics, and has recently become a focus of the deep reinforcement learning community. In this paper we teach a reinforcement learning agent to read a map in order to find the shortest way out of a ran… ▽ More

    Submitted 20 November, 2017; originally announced November 2017.

    Comments: Paper accepted at 32nd AAAI Conference on Artificial Intelligence, AAAI 2018, New Orleans, Louisiana, USA

    ACM Class: I.2.0; I.2.6; I.2.9; I.2.10

  14. arXiv:1605.09185  [pdf, other

    cs.RO

    RAFCON: a Graphical Tool for Task Programming and Mission Control

    Authors: Sebastian G. Brunner, Franz Steinmetz, Rico Belder, Andreas Dömel

    Abstract: There are many application fields for robotic systems including service robotics, search and rescue missions, industry and space robotics. As the scenarios in these areas grow more and more complex, there is a high demand for powerful tools to efficiently program heterogeneous robotic systems. Therefore, we created RAFCON, a graphical tool to develop robotic tasks and to be used for mission contro… ▽ More

    Submitted 30 May, 2016; originally announced May 2016.

    Comments: 8 pages, 5 figures