NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

Yi, Yun; Zhang, Haokui; Hu, Wenze; Wang, Nannan; Wang, Xiaoyu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2211.08024 (cs)

[Submitted on 15 Nov 2022 (v1), last revised 23 Mar 2023 (this version, v3)]

Title:NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

Authors:Yun Yi, Haokui Zhang, Wenze Hu, Nannan Wang, Xiaoyu Wang

View PDF

Abstract:With the wide and deep adoption of deep learning models in real applications, there is an increasing need to model and learn the representations of the neural networks themselves. These models can be used to estimate attributes of different neural network architectures such as the accuracy and latency, without running the actual training or inference tasks. In this paper, we propose a neural architecture representation model that can be used to estimate these attributes holistically. Specifically, we first propose a simple and effective tokenizer to encode both the operation and topology information of a neural network into a single sequence. Then, we design a multi-stage fusion transformer to build a compact vector representation from the converted sequence. For efficient model training, we further propose an information flow consistency augmentation and correspondingly design an architecture consistency loss, which brings more benefits with less augmentation samples compared with previous random augmentation strategies. Experiment results on NAS-Bench-101, NAS-Bench-201, DARTS search space and NNLQP show that our proposed framework can be used to predict the aforementioned latency and accuracy attributes of both cell architectures and whole deep neural networks, and achieves promising performance. Code is available at this https URL.

Comments:	8 pages, 4 figures, 7 tables. Accepted by IEEE Conference on Computer Vision and Pattern Recognition(CVPR) 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2211.08024 [cs.CV]
	(or arXiv:2211.08024v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.08024

Submission history

From: Yun Yi [view email]
[v1] Tue, 15 Nov 2022 10:15:21 UTC (1,311 KB)
[v2] Mon, 6 Mar 2023 07:38:08 UTC (2,253 KB)
[v3] Thu, 23 Mar 2023 03:03:56 UTC (2,254 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators