Showing 1–8 of 8 results for author: Cwitkowitz, F

Searching in archive eess.
  1. arXiv:2402.15569  [pdf, other]

    eess.AS cs.LG cs.SD

    Toward Fully Self-Supervised Multi-Pitch Estimation

    Authors: Frank Cwitkowitz, Zhiyao Duan

    Abstract: Multi-pitch estimation is a decades-long research problem involving the detection of pitch activity associated with concurrent musical events within multi-instrument mixtures. Supervised learning techniques have demonstrated solid performance on more narrow characterizations of the task, but suffer from limitations concerning the shortage of large-scale and diverse polyphonic music datasets with m…

    Submitted 23 February, 2024; originally announced February 2024.

  2. arXiv:2309.15717  [pdf, other]

    eess.AS cs.LG cs.SD

    Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription

    Authors: Frank Cwitkowitz, Kin Wai Cheuk, Woosung Choi, Marco A. Martínez-Ramírez, Keisuke Toyama, Wei-Hsiang Liao, Yuki Mitsufuji

    Abstract: In recent years, research on music transcription has focused mainly on architecture design and instrument-specific data acquisition. With the lack of availability of diverse datasets, progress is often limited to solo-instrument tasks such as piano transcription. Several works have explored multi-instrument transcription as a means to bolster the performance of models on low-resource tasks, but th…

    Submitted 24 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024

  3. arXiv:2309.09085  [pdf, other]

    cs.SD cs.IR cs.MM eess.AS eess.SP

    SynthTab: Leveraging Synthesized Data for Guitar Tablature Transcription

    Authors: Yongyi Zang, Yi Zhong, Frank Cwitkowitz, Zhiyao Duan

    Abstract: Guitar tablature is a form of music notation widely used among guitarists. It captures not only the musical content of a piece, but also its implementation and ornamentation on the instrument. Guitar Tablature Transcription (GTT) is an important task with broad applications in music education, composition, and entertainment. Existing GTT datasets are quite limited in size and scope, rendering mode…

    Submitted 24 January, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP 2024

  4. FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature Transcription

    Authors: Frank Cwitkowitz, Toni Hirvonen, Anssi Klapuri

    Abstract: In recent years, the task of Automatic Music Transcription (AMT), whereby various attributes of music notes are estimated from audio, has received increasing attention. At the same time, the related task of Multi-Pitch Estimation (MPE) remains a challenging but necessary component of almost all AMT approaches, even if only implicitly. In the context of AMT, pitch information is typically quantized…

    Submitted 14 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: Accepted to ICASSP 2023

  5. arXiv:2204.08094  [pdf, other]

    eess.AS cs.LG cs.SD

    A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablature Transcription Systems

    Authors: Frank Cwitkowitz, Jonathan Driedger, Zhiyao Duan

    Abstract: Guitar tablature transcription is an important but understudied problem within the field of music information retrieval. Traditional signal processing approaches offer only limited performance on the task, and there is little acoustic data with transcription labels for training machine learning models. However, guitar transcription labels alone are more widely available in the form of tablature, w…

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: Sound and Music Computing Conference (SMC) 2022

  6. arXiv:2110.04265  [pdf, other]

    eess.AS cs.SD

    A study of the robustness of raw waveform based speaker embeddings under mismatched conditions

    Authors: Ge Zhu, Frank Cwitkowitz, Zhiyao Duan

    Abstract: In this paper, we conduct a cross-dataset study on parametric and non-parametric raw-waveform based speaker embeddings through speaker verification experiments. In general, we observe a more significant performance degradation of these raw-waveform systems compared to spectral based systems. We then propose two strategies to improve the performance of raw-waveform based systems on cross-dataset te…

    Submitted 11 October, 2021; v1 submitted 8 October, 2021; originally announced October 2021.

  7. arXiv:2108.10382  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Learning Sparse Analytic Filters for Piano Transcription

    Authors: Frank Cwitkowitz, Mojtaba Heydari, Zhiyao Duan

    Abstract: In recent years, filterbank learning has become an increasingly popular strategy for various audio-related machine learning tasks. This is partly due to its ability to discover task-specific audio characteristics which can be leveraged in downstream processing. It is also a natural extension of the nearly ubiquitous deep learning methods employed to tackle a diverse array of audio applications. In…

    Submitted 10 November, 2022; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: Sound and Music Computing Conference (SMC) 2022

  8. arXiv:2108.03576  [pdf, other]

    eess.AS cs.AI cs.IR cs.LG cs.SD eess.SP

    BeatNet: CRNN and Particle Filtering for Online Joint Beat Downbeat and Meter Tracking

    Authors: Mojtaba Heydari, Frank Cwitkowitz, Zhiyao Duan

    Abstract: The online estimation of rhythmic information, such as beat positions, downbeat positions, and meter, is critical for many real-time music applications. Musical rhythm comprises complex hierarchical relationships across time, rendering its analysis intrinsically challenging and at times subjective. Furthermore, systems which attempt to estimate rhythmic information in real-time must be causal and…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: 22nd International Society for Music Information Retrieval (ISMIR) Conference Paper, Fall 2021. 8 Pages (Total), 3 Figures, 2 Tables, 1 Algorithm