-
65 GOPS/neuron Photonic Tensor Core with Thin-film Lithium Niobate Photonics
Authors:
Zhong** Lin,
Bhavin J. Shastri,
Shangxuan Yu,
**gxiang Song,
Yuntao Zhu,
Arman Safarnejadian,
Wangning Cai,
Yanmei Lin,
Wei Ke,
Mustafa Hammood,
Tianye Wang,
Mengyue Xu,
Zibo Zheng,
Mohammed Al-Qadasi,
Omid Esmaeeli,
Mohamed Rahim,
Grzegorz Pakulski,
Jens Schmid,
Pedro Barrios,
Weihong Jiang,
Hugh Morison,
Matthew Mitchell,
Xiaogang Qiang,
Xun Guan,
Nicolas A. F. Jaeger, et al. (6 additional authors not shown)
Abstract:
Photonics offers a transformative approach to artificial intelligence (AI) and neuromorphic computing by providing low-latency, high-bandwidth, and energy-efficient computation. Here, we introduce a photonic tensor core processor enabled by time-multiplexed inputs and charge-integrated outputs. This fully integrated processor, comprising only two thin-film lithium niobate (TFLN) modulators, a III-V laser, and a charge-integration photoreceiver, can implement an entire layer of a neural network. It can execute 65 billion operations per second (GOPS) per neuron, including simultaneous weight updates, a hitherto unachieved speed. Unlike conventional photonic processors, whose weights are static once set during training, our processor supports fast "hardware-in-the-loop" training and can dynamically adjust the inputs (fan-in) and outputs (fan-out) within a layer, which enhances its versatility. It can perform large-scale dot-product operations with vector dimensions up to 131,072. Furthermore, it successfully classifies (supervised learning) and clusters (unsupervised learning) 112 × 112-pixel images after "hardware-in-the-loop" training. To support "hardware-in-the-loop" training for clustering tasks, we provide a solution, based on our processor, for multiplications involving two negative numbers.
Submitted 30 November, 2023; v1 submitted 28 November, 2023;
originally announced November 2023.
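The time-multiplexed, charge-integrated dot product described in this abstract can be illustrated with a small software model. This is only a sketch under stated assumptions: the function names (`integrate_dot`, `split_signed`, `signed_dot`) are hypothetical, and the signed-value handling shown here (splitting each vector into non-negative parts and combining four non-negative dot products) is a generic technique, not necessarily the solution the authors use for multiplying two negative numbers.

```python
def integrate_dot(x, w):
    """Accumulate x[t] * w[t] over successive time slots.

    Models (as an assumption) two cascaded intensity modulators whose
    product is summed as charge on an integrating photoreceiver.
    Optical intensities cannot be negative, so inputs must be >= 0.
    """
    charge = 0.0
    for xt, wt in zip(x, w):
        if xt < 0 or wt < 0:
            raise ValueError("intensities must be non-negative")
        charge += xt * wt  # one multiply-accumulate per time slot
    return charge


def split_signed(v):
    """Split a signed vector into non-negative parts so that v = pos - neg."""
    pos = [max(a, 0.0) for a in v]
    neg = [max(-a, 0.0) for a in v]
    return pos, neg


def signed_dot(x, w):
    """Signed dot product built from four non-negative integrations.

    Uses (xp - xn) . (wp - wn) = xp.wp + xn.wn - xp.wn - xn.wp.
    This decomposition is an illustrative assumption, not the paper's method.
    """
    xp, xn = split_signed(x)
    wp, wn = split_signed(w)
    return (integrate_dot(xp, wp) + integrate_dot(xn, wn)
            - integrate_dot(xp, wn) - integrate_dot(xn, wp))
```

In this model the vector length is just the number of time slots integrated, which is why a single input/weight modulator pair can in principle handle long vectors; a product of two negative entries lands in the `xn . wn` term and contributes positively.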
-
Silicon Photonic Architecture for Training Deep Neural Networks with Direct Feedback Alignment
Authors:
Matthew J. Filipovich,
Zhimu Guo,
Mohammed Al-Qadasi,
Bicky A. Marquez,
Hugh D. Morison,
Volker J. Sorger,
Paul R. Prucnal,
Sudip Shekhar,
Bhavin J. Shastri
Abstract:
There has been growing interest in using photonic processors for performing neural network inference operations; however, these networks are currently trained using standard digital electronics. Here, we propose on-chip training of neural networks enabled by a CMOS-compatible silicon photonic architecture to harness the potential for massively parallel, efficient, and fast data operations. Our scheme employs the direct feedback alignment training algorithm, which trains neural networks using error feedback rather than error backpropagation, and can operate at speeds of trillions of multiply-accumulate (MAC) operations per second while consuming less than one picojoule per MAC operation. The photonic architecture exploits parallelized matrix-vector multiplications using arrays of microring resonators for processing multi-channel analog signals along single waveguide buses to calculate the gradient vector for each neural network layer in situ. We also experimentally demonstrate training deep neural networks with the MNIST dataset using on-chip MAC operation results. Our novel approach for efficient, ultra-fast neural network training showcases photonics as a promising platform for executing AI applications.
Submitted 18 August, 2022; v1 submitted 12 November, 2021;
originally announced November 2021.
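For readers unfamiliar with direct feedback alignment (DFA), the following is a minimal software sketch of one training step for a network with a single hidden layer. It is not the photonic architecture from the abstract: the function names, the tanh activation, the squared-error loss, and the learning rate are all illustrative assumptions. The point it shows is that the hidden-layer update uses the output error projected through a fixed feedback matrix `B1` instead of the transpose of the forward weights used by backpropagation.

```python
import math


def matvec(M, v):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(m * a for m, a in zip(row, v)) for row in M]


def dfa_step(W1, W2, B1, x, y, lr=0.1):
    """One DFA update for a two-layer network; W1 and W2 are updated in place.

    B1 is a fixed feedback matrix (hidden_size x output_size) that is
    never trained. Returns the squared-error loss before the update.
    """
    # Forward pass.
    h1 = [math.tanh(a) for a in matvec(W1, x)]
    y_hat = matvec(W2, h1)
    e = [p - t for p, t in zip(y_hat, y)]  # output error

    # Hidden-layer signal: error fed back directly through B1,
    # scaled by the tanh derivative (1 - h^2).
    fb = matvec(B1, e)
    d1 = [f * (1.0 - h * h) for f, h in zip(fb, h1)]

    # Weight updates are outer products of a layer's signal and its input.
    for i, ei in enumerate(e):
        for j, hj in enumerate(h1):
            W2[i][j] -= lr * ei * hj
    for i, di in enumerate(d1):
        for j, xj in enumerate(x):
            W1[i][j] -= lr * di * xj

    return 0.5 * sum(v * v for v in e)
```

Because each layer's update depends only on the output error and a fixed projection, the per-layer gradient terms reduce to matrix-vector multiplications that can be evaluated independently of one another.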
-
Scaling Up Silicon Photonic-based Accelerators: Challenges and Opportunities
Authors:
M. A. Al-Qadasi,
L. Chrostowski,
B. J. Shastri,
S. Shekhar
Abstract:
Digital accelerators in the latest generation of CMOS processes support multiply-and-accumulate (MAC) operations at energy efficiencies of 10 to 100 fJ/Op, but the operating speed for such MAC operations is often limited to a few hundred MHz. Optical or optoelectronic MAC operations on today's SOI-based silicon photonic integrated circuit platforms can be realized at speeds of tens of GHz, leading to much lower latency and higher throughput. In this paper, we study the energy efficiency of integrated silicon photonic MAC circuits based on Mach-Zehnder modulators and microring resonators. We describe the bounds on energy efficiency and the scaling limits for N×N optical networks with today's technology, based on the optical and electrical link budget. We also describe research directions that can overcome the current limitations.
Submitted 27 February, 2022; v1 submitted 14 September, 2021;
originally announced September 2021.
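The figures quoted in this abstract relate through simple arithmetic: the power drawn by one MAC unit is its energy per operation times its operation rate. The sketch below works through that relation. The specific rates (300 MHz for "a few hundred MHz", 30 GHz for "tens of GHz") are illustrative assumptions chosen within the quoted ranges, and the function names are hypothetical.

```python
def mac_power_watts(energy_fj_per_op, rate_hz):
    """Power of one MAC unit: energy per operation times operations per second."""
    return energy_fj_per_op * 1e-15 * rate_hz


def speedup(optical_rate_hz, digital_rate_hz):
    """Per-unit throughput ratio between two operating rates."""
    return optical_rate_hz / digital_rate_hz


# Assumed operating points, picked from the ranges quoted in the abstract.
DIGITAL_RATE_HZ = 300e6   # "a few hundred MHz" (assumption)
OPTICAL_RATE_HZ = 30e9    # "tens of GHz" (assumption)

# Digital MAC at the two ends of the quoted 10 to 100 fJ/Op range.
digital_low_w = mac_power_watts(10, DIGITAL_RATE_HZ)
digital_high_w = mac_power_watts(100, DIGITAL_RATE_HZ)

# Throughput advantage per MAC unit at the assumed rates.
ratio = speedup(OPTICAL_RATE_HZ, DIGITAL_RATE_HZ)
```

The same relation read in reverse shows why energy per operation is the quantity to bound: at a rate of tens of GHz, a fixed power budget per MAC unit translates directly into a maximum allowable energy per operation.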
-
Prospects and applications of photonic neural networks
Authors:
Chaoran Huang,
Volker J. Sorger,
Mario Miscuglio,
Mohammed Al-Qadasi,
Avilash Mukherjee,
Sudip Shekhar,
Lukas Chrostowski,
Lutz Lampe,
Mitchell Nichols,
Mable P. Fok,
Daniel Brunner,
Alexander N. Tait,
Thomas Ferreira de Lima,
Bicky A. Marquez,
Paul R. Prucnal,
Bhavin J. Shastri
Abstract:
Neural networks have enabled applications in artificial intelligence through machine learning and neuromorphic computing. Software implementations of neural networks on conventional computers, which have separate memory and processor and operate sequentially, are limited in speed and energy efficiency. Neuromorphic engineering aims to build processors in which hardware mimics neurons and synapses in the brain for distributed and parallel processing. Neuromorphic engineering enabled by photonics (optical physics) can offer sub-nanosecond latencies and high bandwidth at low energies, extending the domain of artificial intelligence and neuromorphic computing applications to machine learning acceleration, nonlinear programming, intelligent signal processing, and more. Photonic neural networks have been demonstrated on integrated platforms and in free-space optics, depending on the class of applications being targeted. Here, we discuss the prospects and demonstrated applications of these photonic neural networks.
Submitted 20 May, 2021;
originally announced May 2021.