Showing 1–3 of 3 results for author: Chatziioannou, V

Searching in archive cs.
  1. arXiv:2305.11727

    cs.SD eess.AS

    Direction Specific Ambisonics Source Separation with End-To-End Deep Learning

    Authors: Francesc Lluís, Nils Meyer-Kahlen, Vasileios Chatziioannou, Alex Hofmann

    Abstract: Ambisonics is a scene-based spatial audio format that has several useful features compared to object-based formats, such as efficient whole scene rotation and versatility. However, it does not provide direct access to the individual source signals, so that these have to be separated from the mixture when required. Typically, this is done with linear spherical harmonics (SH) beamforming. In this pa…

    Submitted 20 June, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Code and listening examples: https://github.com/francesclluis/direction-ambisonics-source-separation

    Journal ref: Acta Acustica 2023, 7, 29

  2. arXiv:2104.12462

    cs.SD cs.CV eess.AS

    Points2Sound: From mono to binaural audio using 3D point cloud scenes

    Authors: Francesc Lluís, Vasileios Chatziioannou, Alex Hofmann

    Abstract: For immersive applications, the generation of binaural sound that matches its visual counterpart is crucial to bring meaningful experiences to people in a virtual environment. Recent studies have shown the possibility of using neural networks for synthesizing binaural audio from mono audio by using 2D visual information as guidance. Extending this approach by guiding the audio with 3D visual infor…

    Submitted 19 May, 2023; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Code, data, and listening examples: https://github.com/francesclluis/points2sound

    Journal ref: EURASIP Journal on Audio, Speech, and Music Processing 2022 (1), 1-15

  3. arXiv:2102.02028

    cs.SD cs.CV eess.AS

    Music source separation conditioned on 3D point clouds

    Authors: Francesc Lluís, Vasileios Chatziioannou, Alex Hofmann

    Abstract: Recently, significant progress has been made in audio source separation by the application of deep learning techniques. Current methods that combine both audio and visual information use 2D representations such as images to guide the separation process. However, in order to (re)-create acoustically correct scenes for 3D virtual/augmented reality applications from recordings of real music ensembles…

    Submitted 3 February, 2021; originally announced February 2021.