Showing 1–3 of 3 results for author: Iliescu, D A

  1. arXiv:2303.09446  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG

    Controllable Prosody Generation With Partial Inputs

    Authors: Dan Andrei Iliescu, Devang Savita Ram Mohan, Tian Huey Teh, Zack Hodari

    Abstract: We address the problem of human-in-the-loop control for generating prosody in the context of text-to-speech synthesis. Controlling prosody is challenging because existing generative models lack an efficient interface through which users can modify the output quickly and precisely. To solve this, we introduce a novel framework whereby the user provides partial inputs and the generative model genera…

    Submitted 15 April, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 5 pages

  2. arXiv:2202.07285  [pdf, other]

    cs.DC

    Disentangling Domain and Content

    Authors: Dan Andrei Iliescu, Aliaksei Mikhailiuk, Damon Wischik, Rafal Mantiuk

    Abstract: Many real-world datasets can be divided into groups according to certain salient features (e.g. grouping images by subject, grouping text by font, etc.). Often, machine learning tasks require that these features be represented separately from those manifesting independently of the grouping. For example, image translation entails changing the style of an image while preserving its content. We forma…

    Submitted 15 February, 2022; originally announced February 2022.

  3. arXiv:2103.14616  [pdf, other]

    eess.IV cs.CV

    Training a Task-Specific Image Reconstruction Loss

    Authors: Aamir Mustafa, Aliaksei Mikhailiuk, Dan Andrei Iliescu, Varun Babbar, Rafal K. Mantiuk

    Abstract: The choice of a loss function is an important factor when training neural networks for image restoration problems, such as single image super resolution. The loss function should encourage natural and perceptually pleasing results. A popular choice for a loss is a pre-trained network, such as VGG, which is used as a feature extractor for computing the difference between restored and reference imag…

    Submitted 17 October, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted at WACV 2022