Skip to main content

Showing 1–2 of 2 results for author: Wang, M L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19328  [pdf, other

    cs.SD cs.LG eess.AS

    Subtractive Training for Music Stem Insertion using Latent Diffusion Models

    Authors: Ivan Villa-Renteria, Mason L. Wang, Zachary Shah, Zhe Li, Soohyun Kim, Neelesh Ramachandran, Mert Pilanci

    Abstract: We present Subtractive Training, a simple and novel method for synthesizing individual musical instrument stems given other instruments as context. This method pairs a dataset of complete music mixes with 1) a variant of the dataset lacking a specific stem, and 2) LLM-generated instructions describing how the missing stem should be reintroduced. We then fine-tune a pretrained text-to-audio diffusi… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2008.13715  [pdf, other

    cs.CV cs.LG eess.IV

    Extracting full-field subpixel structural displacements from videos via deep learning

    Authors: Lele Luan, **gwei Zheng, Yongchao Yang, Ming L. Wang, Hao Sun

    Abstract: This paper develops a deep learning framework based on convolutional neural networks (CNNs) that enable real-time extraction of full-field subpixel structural displacements from videos. In particular, two new CNN architectures are designed and trained on a dataset generated by the phase-based motion extraction method from a single lab-recorded high-speed video of a dynamic structure. As displaceme… ▽ More

    Submitted 3 September, 2020; v1 submitted 31 August, 2020; originally announced August 2020.

    Comments: 22 figures; 24 figures