Marvel: A Data-centric Compiler for DNN Operators on Spatial Accelerators
Authors:
Prasanth Chatarasi,
Hyoukjun Kwon,
Natesh Raina,
Saurabh Malik,
Vaisakh Haridas,
Angshuman Parashar,
Michael Pellauer,
Tushar Krishna,
Vivek Sarkar
Abstract:
The efficiency of a spatial DNN accelerator depends heavily on the compiler and its cost model ability to generate optimized map**s for various operators of DNN models on to the accelerator's compute and memory resources. But, existing cost models lack a formal boundary over the operators for precise and tractable analysis, which poses adaptability challenges for new DNN operators. To address th…
▽ More
The efficiency of a spatial DNN accelerator depends heavily on the compiler and its cost model ability to generate optimized map**s for various operators of DNN models on to the accelerator's compute and memory resources. But, existing cost models lack a formal boundary over the operators for precise and tractable analysis, which poses adaptability challenges for new DNN operators. To address this challenge, we leverage the recently introduced Maestro Data-Centric (MDC) notation. We develop a formal understanding of DNN operators whose map**s can be described in the MDC notation, because any map** adhering to the notation is always analyzable by the MDC's cost model. Furthermore, we introduce a transformation for translating map**s into the MDC notation for exploring the map** space.
Searching for the optimal map**s is challenging because of the large space of map**s, and this challenge gets exacerbated with new operators and diverse accelerator configurations.To address this challenge, we propose a decoupled off-chip/on-chip approach that decomposes the map** space into off-chip and on-chip subspaces, and first optimizes the off-chip subspace followed by the on-chip subspace. The motivation for this decomposition is to reduce the size of the search space dramatically and also to prioritize the optimization of off-chip data movement, which is 2-3 orders of magnitude more compared to the on-chip data movement. We implemented our approach in a tool called {\em Marvel}, and another major benefit of our approach is that it is applicable to any DNN operator conformable with the MDC notation.
△ Less
Submitted 11 June, 2020; v1 submitted 18 February, 2020;
originally announced February 2020.