Skip to main content

Showing 1–5 of 5 results for author: Marafioti, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2106.11048  [pdf, other

    eess.IV cs.CV

    CataNet: Predicting remaining cataract surgery duration

    Authors: Andrés Marafioti, Michel Hayoz, Mathias Gallardo, Pablo Márquez Neila, Sebastian Wolf, Martin Zinkernagel, Raphael Sznitman

    Abstract: Cataract surgery is a sight saving surgery that is performed over 10 million times each year around the world. With such a large demand, the ability to organize surgical wards and operating rooms efficiently is critical to delivery this therapy in routine clinical care. In this context, estimating the remaining surgical duration (RSD) during procedures is one way to help streamline patient through… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: Accepted at MICCAI 2021

  2. arXiv:2106.05148  [pdf, other

    eess.SP cs.SD eess.AS

    Time-Frequency Phase Retrieval for Audio -- The Effect of Transform Parameters

    Authors: Andrés Marafioti, Nicki Holighaus, Piotr Majdak

    Abstract: In audio processing applications, phase retrieval (PR) is often performed from the magnitude of short-time Fourier transform (STFT) coefficients. Although PR performance has been observed to depend on the considered STFT parameters and audio data, the extent of this dependence has not been systematically evaluated yet. To address this, we studied the performance of three PR algorithms for various… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted for publication as a regular paper in the IEEE Transactions on Signal Processing

  3. arXiv:2005.05032  [pdf, other

    cs.SD eess.AS stat.ML

    GACELA -- A generative adversarial context encoder for long audio inpainting

    Authors: Andres Marafioti, Piotr Majdak, Nicki Holighaus, Nathanaël Perraudin

    Abstract: We introduce GACELA, a generative adversarial network (GAN) designed to restore missing musical audio data with a duration ranging between hundreds of milliseconds to a few seconds, i.e., to perform long-gap audio inpainting. While previous work either addressed shorter gaps or relied on exemplars by copying available information from other signal parts, GACELA addresses the inpainting of long gap… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 1, pp. 120-131, Jan. 2021

  4. arXiv:1902.04072  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Adversarial Generation of Time-Frequency Features with application in audio synthesis

    Authors: Andrés Marafioti, Nicki Holighaus, Nathanaël Perraudin, Piotr Majdak

    Abstract: Time-frequency (TF) representations provide powerful and intuitive features for the analysis of time series such as audio. But still, generative modeling of audio in the TF domain is a subtle matter. Consequently, neural audio synthesis widely relies on directly modeling the waveform and previous attempts at unconditionally synthesizing audio from neurally generated invertible TF features still st… ▽ More

    Submitted 16 May, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: Accepted for publication at ICML 2019

  5. arXiv:1810.12138  [pdf, other

    cs.SD cs.LG eess.AS

    Audio inpainting of music by means of neural networks

    Authors: Andrés Marafioti, Nicki Holighaus, Piotr Majdak, Nathanaël Perraudin

    Abstract: We studied the ability of deep neural networks (DNNs) to restore missing audio content based on its context, a process usually referred to as audio inpainting. We focused on gaps in the range of tens of milliseconds. The proposed DNN structure was trained on audio signals containing music and musical instruments, separately, with 64-ms long gaps. The input to the DNN was the context, i.e., the sig… ▽ More

    Submitted 18 February, 2022; v1 submitted 29 October, 2018; originally announced October 2018.

    Comments: Presented at the 146th AES Convention [arXiv:1810.12138v2]. For the journal version, published in published in IEEE TASLP, see [arXiv:1810.12138v2]