Skip to main content

Showing 1–2 of 2 results for author: Webb, T J

.
  1. arXiv:1801.08058  [pdf, other

    cs.DC cs.LG

    Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning

    Authors: Scott Cyphers, Arjun K. Bansal, Anahita Bhiwandiwalla, Jayaram Bobba, Matthew Brookhart, Avijit Chakraborty, Will Constable, Christian Convey, Leona Cook, Omar Kanawi, Robert Kimball, Jason Knight, Nikolay Korovaiko, Varun Kumar, Yixing Lao, Christopher R. Lishka, Jaikrishnan Menon, Jennifer Myers, Sandeep Aswath Narayana, Adam Procter, Tristan J. Webb

    Abstract: The Deep Learning (DL) community sees many novel topologies published each year. Achieving high performance on each new topology remains challenging, as each requires some level of manual effort. This issue is compounded by the proliferation of frameworks and hardware platforms. The current approach, which we call "direct optimization", requires deep changes within each framework to improve the tr… ▽ More

    Submitted 29 January, 2018; v1 submitted 24 January, 2018; originally announced January 2018.

  2. arXiv:1711.02213  [pdf, other

    cs.LG math.NA stat.ML

    Flexpoint: An Adaptive Numerical Format for Efficient Training of Deep Neural Networks

    Authors: Urs Köster, Tristan J. Webb, Xin Wang, Marcel Nassar, Arjun K. Bansal, William H. Constable, Oğuz H. Elibol, Scott Gray, Stewart Hall, Luke Hornof, Amir Khosrowshahi, Carey Kloss, Ruby J. Pai, Naveen Rao

    Abstract: Deep neural networks are commonly developed and trained in 32-bit floating point format. Significant gains in performance and energy efficiency could be realized by training and inference in numerical formats optimized for deep learning. Despite advances in limited precision inference in recent years, training of neural networks in low bit-width remains a challenging problem. Here we present the F… ▽ More

    Submitted 2 December, 2017; v1 submitted 6 November, 2017; originally announced November 2017.

    Comments: 14 pages, 5 figures, accepted in Neural Information Processing Systems 2017