Skip to main content

Showing 1–1 of 1 results for author: Bishop, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2109.01164  [pdf, other

    eess.AS cs.LG cs.SD

    Scalable Data Annotation Pipeline for High-Quality Large Speech Datasets Development

    Authors: Mingkuan Liu, Chi Zhang, Hua Xing, Chao Feng, Monchu Chen, Judith Bishop, Grace Ngapo

    Abstract: This paper introduces a human-in-the-loop (HITL) data annotation pipeline to generate high-quality, large-scale speech datasets. The pipeline combines human and machine advantages to more quickly, accurately, and cost-effectively annotate datasets with machine pre-labeling and fully manual auditing. Quality control mechanisms such as blind testing, behavior monitoring, and data validation have bee… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: Submitted to NeurIPS 2021 Datasets and Benchmarks Track (Round 2)