Skip to main content

Showing 1–16 of 16 results for author: Peter, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.03506  [pdf, other

    cs.HC

    Spin-Wave Voices: Sonification of Nanoscale Spin Waves as an Engagement and Research Tool

    Authors: Santa Pile, Oleg Lesota, Silvan David Peter, Christina Humer, Martin Gasser

    Abstract: Magnonics is an emerging research field that addresses the use of spin waves (magnons), purely magnetic waves, for information transport and processing. Spin waves are a potential replacement for electric current in modern computational devices that would make them more compact and energy efficient. The field is yet little known, even among physicists. Additionally, with the development of new mea… ▽ More

    Submitted 21 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted to The 29th International Conference on Auditory Display (ICAD 2024) conference proceedings

  2. arXiv:2401.02979  [pdf, other

    cs.CL cs.AI cs.IR

    Are we describing the same sound? An analysis of word embedding spaces of expressive piano performance

    Authors: Silvan David Peter, Shreyan Chowdhury, Carlos Eduardo Cancino-Chacón, Gerhard Widmer

    Abstract: Semantic embeddings play a crucial role in natural language-based information retrieval. Embedding models represent words and contexts as vectors whose spatial configuration is derived from the distribution of words in large text corpora. While such representations are generally very powerful, they might fail to account for fine-grained domain-specific nuances. In this article, we investigate this… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Journal ref: Proceedings of the Forum for Information Retrieval Evaluation, FIRE, 2023, Panjim, India

  3. Sounding Out Reconstruction Error-Based Evaluation of Generative Models of Expressive Performance

    Authors: Silvan David Peter, Carlos Eduardo Cancino-Chacón, Emmanouil Karystinaios, Gerhard Widmer

    Abstract: Generative models of expressive piano performance are usually assessed by comparing their predictions to a reference human performance. A generative algorithm is taken to be better than competing ones if it produces performances that are closer to a human reference performance. However, expert human performers can (and do) interpret music in different ways, making for different possible references… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Journal ref: 10th International Conference on Digital Libraries for Musicology, November 10, 2023, Milan, Italy

  4. arXiv:2401.00466  [pdf, other

    cs.SD cs.LG eess.AS

    Online Symbolic Music Alignment with Offline Reinforcement Learning

    Authors: Silvan David Peter

    Abstract: Symbolic Music Alignment is the process of matching performed MIDI notes to corresponding score notes. In this paper, we introduce a reinforcement learning (RL)-based online symbolic music alignment technique. The RL agent - an attention-based neural network - iteratively estimates the current score position from local score and performance contexts. For this symbolic alignment task, environment s… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Journal ref: Proceedings of the 24th International Society for Music Information Retrieval Conference, {ISMIR} 2023, Milan, Italy, November 5-9, 2023

  5. arXiv:2208.14958  [pdf, other

    cs.CV cs.AI cs.LG

    A Realism Metric for Generated LiDAR Point Clouds

    Authors: Larissa T. Triess, Christoph B. Rist, David Peter, J. Marius Zöllner

    Abstract: A considerable amount of research is concerned with the generation of realistic sensor data. LiDAR point clouds are generated by complex simulations or learned generative models. The generated data is usually exploited to enable or improve downstream perception algorithms. Two major questions arise from these procedures: First, how to evaluate the realism of the generated data? Second, does more r… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:2109.11775

  6. arXiv:2206.01104  [pdf, other

    cs.SD cs.DL eess.AS

    The match file format: Encoding Alignments between Scores and Performances

    Authors: Francesco Foscarin, Emmanouil Karystinaios, Silvan David Peter, Carlos Cancino-Chacón, Maarten Grachten, Gerhard Widmer

    Abstract: This paper presents the specifications of match: a file format that extends a MIDI human performance with note-, beat-, and downbeat-level alignments to a corresponding musical score. This enables advanced analyses of the performance that are relevant for various tasks, such as expressive performance modeling, score following, music transcription, and performer classification. The match file inclu… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Proceedings of the Music Encoding Conference (MEC), 2022, Halifax, Canada

  7. arXiv:2206.01071  [pdf, other

    cs.SD cs.DL eess.AS

    Partitura: A Python Package for Symbolic Music Processing

    Authors: Carlos Cancino-Chacón, Silvan David Peter, Emmanouil Karystinaios, Francesco Foscarin, Maarten Grachten, Gerhard Widmer

    Abstract: Partitura is a lightweight Python package for handling symbolic musical information. It provides easy access to features commonly used in music information retrieval tasks, like note arrays (lists of timed pitched events) and 2D piano roll matrices, as well as other score elements such as time and key signatures, performance directives, and repeat structures. Partitura can load musical scores (in… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Journal ref: Proceedings of the Music Encoding Conference (MEC), 2022, Halifax, Canada

  8. arXiv:2202.08526  [pdf, other

    cs.CV cs.AI cs.LG

    Point Cloud Generation with Continuous Conditioning

    Authors: Larissa T. Triess, Andre Bühler, David Peter, Fabian B. Flohr, J. Marius Zöllner

    Abstract: Generative models can be used to synthesize 3D objects of high quality and diversity. However, there is typically no control over the properties of the generated object.This paper proposes a novel generative adversarial network (GAN) setup that generates 3D point cloud shapes conditioned on a continuous parameter. In an exemplary application, we use this to guide the generative process to create a… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: Accepted at International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

    Journal ref: 2022 International Conference on Artificial Intelligence and Statistics (AISTATS), PMLR 151:4462-4481

  9. arXiv:2111.15615  [pdf, other

    cs.CV cs.LG

    Semi-Local Convolutions for LiDAR Scan Processing

    Authors: Larissa T. Triess, David Peter, J. Marius Zöllner

    Abstract: A number of applications, such as mobile robots or automated vehicles, use LiDAR sensors to obtain detailed information about their three-dimensional surroundings. Many methods use image-like projections to efficiently process these LiDAR measurements and use deep convolutional neural networks to predict semantic classes for each point in the scan. The spatial stationary assumption enables the usa… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:2004.11803

    Journal ref: ICBINB Workshop at NeurIPS 2021

  10. Quantifying point cloud realism through adversarially learned latent representations

    Authors: Larissa T. Triess, David Peter, Stefan A. Baur, J. Marius Zöllner

    Abstract: Judging the quality of samples synthesized by generative models can be tedious and time consuming, especially for complex data structures, such as point clouds. This paper presents a novel approach to quantify the realism of local regions in LiDAR point clouds. Relevant features are learned from real-world and synthetic point clouds by training on a proxy classification task. Inspired by fair netw… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: 2021 German Conference on Pattern Recognition (GCPR). Project Page: http://ltriess.github.io/lidar-metric

    Journal ref: 2021 German Conference on Pattern Recognition (GCPR)

  11. arXiv:2104.06666  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    End-to-end Keyword Spotting using Neural Architecture Search and Quantization

    Authors: David Peter, Wolfgang Roth, Franz Pernkopf

    Abstract: This paper introduces neural architecture search (NAS) for the automatic discovery of end-to-end keyword spotting (KWS) models in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) operating on raw audio waveforms. After a suitable KWS model is found with NAS, we conduct quantization of weights and activations to… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.10138

  12. arXiv:2012.10138  [pdf, other

    eess.AS cs.LG

    Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

    Authors: David Peter, Wolfgang Roth, Franz Pernkopf

    Abstract: This paper introduces neural architecture search (NAS) for the automatic discovery of small models for keyword spotting (KWS) in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) to maximize the classification accuracy while minimizing the number of operations per inference. Using NAS only, we were able to obtai… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

  13. Scan-based Semantic Segmentation of LiDAR Point Clouds: An Experimental Study

    Authors: Larissa T. Triess, David Peter, Christoph B. Rist, J. Marius Zöllner

    Abstract: Autonomous vehicles need to have a semantic understanding of the three-dimensional world around them in order to reason about their environment. State of the art methods use deep neural networks to predict semantic classes for each point in a LiDAR scan. A powerful and efficient way to process LiDAR measurements is to use two-dimensional, image-like projections. In this work, we perform a comprehe… ▽ More

    Submitted 24 September, 2021; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Project Page: http://ltriess.github.io/scan-semseg

    Journal ref: IEEE Intelligent Vehicles Symposium (IV), 2020, pp. 1116-1121

  14. arXiv:1907.00787  [pdf, other

    eess.IV cs.LG stat.ML

    CNN-based synthesis of realistic high-resolution LiDAR data

    Authors: Larissa T. Triess, David Peter, Christoph B. Rist, Markus Enzweiler, J. Marius Zöllner

    Abstract: This paper presents a novel CNN-based approach for synthesizing high-resolution LiDAR point cloud data. Our approach generates semantically and perceptually realistic results with guidance from specialized loss-functions. First, we utilize a modified per-point loss that addresses missing LiDAR point measurements. Second, we align the quality of our generated output with real-world sensor data by a… ▽ More

    Submitted 24 September, 2021; v1 submitted 28 June, 2019; originally announced July 2019.

    Comments: Project Page: http://ltriess.github.io/pc-upsampling

    Journal ref: IEEE Intelligent Vehicles Symposium (IV), 2019, pp. 1512-1519

  15. arXiv:1901.10183  [pdf, other

    cs.DC cs.LG cs.PF

    A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep Learning

    Authors: Tal Ben-Nun, Maciej Besta, Simon Huber, Alexandros Nikolaos Ziogas, Daniel Peter, Torsten Hoefler

    Abstract: We introduce Deep500: the first customizable benchmarking infrastructure that enables fair comparison of the plethora of deep learning frameworks, algorithms, libraries, and techniques. The key idea behind Deep500 is its modular design, where deep learning is factorized into four distinct levels: operators, network processing, training, and distributed training. Our evaluation illustrates that Dee… ▽ More

    Submitted 13 June, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Accepted to IPDPS 2019

  16. arXiv:1804.09915  [pdf, other

    cs.CV

    Boosting LiDAR-based Semantic Labeling by Cross-Modal Training Data Generation

    Authors: Florian Piewak, Peter **gera, Manuel Schäfer, David Peter, Beate Schwarz, Nick Schneider, David Pfeiffer, Markus Enzweiler, Marius Zöllner

    Abstract: Mobile robots and autonomous vehicles rely on multi-modal sensor setups to perceive and understand their surroundings. Aside from cameras, LiDAR sensors represent a central component of state-of-the-art perception systems. In addition to accurate spatial perception, a comprehensive semantic understanding of the environment is essential for efficient and safe operation. In this paper we present a n… ▽ More

    Submitted 26 April, 2018; originally announced April 2018.