Showing 1–2 of 2 results for author: Pomerantz, R

Searching in archive cs.
  1. arXiv:2107.08661  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Translatotron 2: High-quality direct speech-to-speech translation with voice preservation

    Authors: Ye Jia, Michelle Tadmor Ramanovich, Tal Remez, Roi Pomerantz

    Abstract: We present Translatotron 2, a neural direct speech-to-speech translation model that can be trained end-to-end. Translatotron 2 consists of a speech encoder, a linguistic decoder, an acoustic synthesizer, and a single attention module that connects them together. Experimental results on three datasets consistently show that Translatotron 2 outperforms the original Translatotron by a large margin on…

    Submitted 17 May, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: ICML 2022

  2. arXiv:1807.10215  [pdf, other]

    cs.CV cs.LG

    DeepSPINE: Automated Lumbar Vertebral Segmentation, Disc-level Designation, and Spinal Stenosis Grading Using Deep Learning

    Authors: Jen-Tang Lu, Stefano Pedemonte, Bernardo Bizzo, Sean Doyle, Katherine P. Andriole, Mark H. Michalski, R. Gilberto Gonzalez, Stuart R. Pomerantz

    Abstract: The high prevalence of spinal stenosis results in a large volume of MRI imaging, yet interpretation can be time-consuming with high inter-reader variability even among the most specialized radiologists. In this paper, we develop an efficient methodology to leverage the subject-matter-expertise stored in large-scale archival reporting and image data for a deep-learning approach to fully-automated l…

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: Accepted as spotlight talk at Machine Learning for Healthcare (MLHC) 2018. Supplementary Video: https://bit.ly/DeepSPINE