Showing 1–1 of 1 results for author: Rotaru, C

  1. arXiv:2309.03378

    cs.CL cs.SD eess.AS

    RoDia: A New Dataset for Romanian Dialect Identification from Speech

    Authors: Codrut Rotaru, Nicolae-Catalin Ristea, Radu Tudor Ionescu

    Abstract: We introduce RoDia, the first dataset for Romanian dialect identification from speech. The RoDia dataset includes a varied compilation of speech samples from five distinct regions of Romania, covering both urban and rural environments, totaling 2 hours of manually annotated speech data. Along with our dataset, we introduce a set of competitive models to be used as baselines for future research. Th…

    Submitted 20 March, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at NAACL 2024