Skip to main content

Showing 1–1 of 1 results for author: Abts, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.11062  [pdf, other

    cs.LG cs.CL

    Answer Fast: Accelerating BERT on the Tensor Streaming Processor

    Authors: Ibrahim Ahmed, Sahil Parmar, Matthew Boyd, Michael Beidler, Kris Kang, Bill Liu, Kyle Roach, John Kim, Dennis Abts

    Abstract: Transformers have become a predominant machine learning workload, they are not only the de-facto standard for natural language processing tasks, but they are also being deployed in other domains such as vision and speech recognition. Many of the transformer-based applications are real-time systems such as machine translation and web search. These real time systems often come with strict end-to-end… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.