Showing 1–3 of 3 results for author: Demiroglu, C

  1. arXiv:2206.07373

    cs.CL cs.SD eess.AS

    NatiQ: An End-to-end Text-to-Speech System for Arabic

    Authors: Ahmed Abdelali, Nadir Durrani, Cenk Demiroglu, Fahim Dalvi, Hamdy Mubarak, Kareem Darwish

    Abstract: NatiQ is an end-to-end text-to-speech system for Arabic. Our speech synthesizer uses an encoder-decoder architecture with attention. We used both Tacotron-based models (Tacotron-1 and Tacotron-2) and the faster transformer model for generating mel-spectrograms from characters. We concatenated Tacotron1 with the WaveRNN vocoder, Tacotron2 with the WaveGlow vocoder and ESPnet transformer with the paral…

    Submitted 16 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

  2. arXiv:1610.03009

    cs.SD cs.CL

    Investigation of Synthetic Speech Detection Using Frame- and Segment-Specific Importance Weighting

    Authors: Ali Khodabakhsh, Cenk Demiroglu

    Abstract: Speaker verification systems are vulnerable to spoofing attacks, which presents a major problem in their real-life deployment. To date, most of the proposed synthetic speech detectors (SSDs) have weighted the importance of different segments of speech equally. However, different attack methods have different strengths and weaknesses, and the traces that they leave may be short- or long-term acoustic…

    Submitted 10 October, 2016; originally announced October 2016.

  3. arXiv:1608.02272

    cs.SD cs.CL

    Incorporation of Speech Duration Information in Score Fusion of Speaker Recognition Systems

    Authors: Ali Khodabakhsh, Seyyed Saeed Sarfjoo, Umut Uludag, Osman Soyyigit, Cenk Demiroglu

    Abstract: In recent years, identity-vector (i-vector) based speaker verification (SV) systems have become very successful. Nevertheless, environmental noise and speech duration variability still significantly degrade the performance of these systems. In many real-life applications, recordings are very short; as a result, extracted i-vectors cannot reliably represent the attribute…

    Submitted 7 August, 2016; originally announced August 2016.