Showing 1–11 of 11 results for author: Akin, B

Searching in archive cs.
  1. arXiv:2404.10518

    cs.CV

    MobileNetV4 -- Universal Models for the Mobile Ecosystem

    Authors: Danfeng Qin, Chas Leichner, Manolis Delakis, Marco Fornoni, Shixin Luo, Fan Yang, Weijun Wang, Colby Banbury, Chengxi Ye, Berkin Akin, Vaibhav Aggarwal, Tenghui Zhu, Daniele Moro, Andrew Howard

    Abstract: We present the latest generation of MobileNets, known as MobileNetV4 (MNv4), featuring universally efficient architecture designs for mobile devices. At its core, we introduce the Universal Inverted Bottleneck (UIB) search block, a unified and flexible structure that merges Inverted Bottleneck (IB), ConvNext, Feed Forward Network (FFN), and a novel Extra Depthwise (ExtraDW) variant. Alongside UIB,…

    Submitted 16 April, 2024; originally announced April 2024.

  2. arXiv:2204.14007

    cs.DC cs.CV cs.LG

    Searching for Efficient Neural Architectures for On-Device ML on Edge TPUs

    Authors: Berkin Akin, Suyog Gupta, Yun Long, Anton Spiridonov, Zhuo Wang, Marie White, Hao Xu, ** Zhou, Yanqi Zhou

    Abstract: On-device ML accelerators are becoming a standard in modern mobile system-on-chips (SoC). Neural architecture search (NAS) comes to the rescue for efficiently utilizing the high compute throughput offered by these accelerators. However, existing NAS frameworks have several practical limitations in scaling to multiple tasks and different target platforms. In this work, we provide a two-pronged appr…

    Submitted 8 April, 2022; originally announced April 2022.

  3. arXiv:2109.14320

    cs.AR cs.LG

    Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks

    Authors: Amirali Boroumand, Saugata Ghose, Berkin Akin, Ravi Narayanaswami, Geraldo F. Oliveira, Xiaoyu Ma, Eric Shiu, Onur Mutlu

    Abstract: Emerging edge computing platforms often contain machine learning (ML) accelerators that can accelerate inference for a wide range of neural network (NN) models. These models are designed to fit within the limited area and energy constraints of the edge computing platforms, each targeting various applications (e.g., face detection, speech recognition, translation, image captioning, video analytics)…

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: This work appears at the 30th International Conference on Parallel Architectures and Compilation Techniques (PACT 2021). arXiv admin note: substantial text overlap with arXiv:2103.00768

  4. arXiv:2103.00768

    cs.AR cs.LG

    Mitigating Edge Machine Learning Inference Bottlenecks: An Empirical Study on Accelerating Google Edge Models

    Authors: Amirali Boroumand, Saugata Ghose, Berkin Akin, Ravi Narayanaswami, Geraldo F. Oliveira, Xiaoyu Ma, Eric Shiu, Onur Mutlu

    Abstract: As the need for edge computing grows, many modern consumer devices now contain edge machine learning (ML) accelerators that can compute a wide range of neural network (NN) models while still fitting within tight resource constraints. We analyze a commercial Edge TPU using 24 Google edge NN models (including CNNs, LSTMs, transducers, and RCNNs), and find that the accelerator suffers from three shor…

    Submitted 1 March, 2021; originally announced March 2021.

  5. arXiv:2102.10423

    cs.LG cs.AR

    An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks

    Authors: Kiran Seshadri, Berkin Akin, James Laudon, Ravi Narayanaswami, Amir Yazdanbakhsh

    Abstract: Edge TPUs are a domain of accelerators for low-power, edge devices and are widely used in various Google products such as Coral and Pixel devices. In this paper, we first discuss the major microarchitectural details of Edge TPUs. Then, we extensively evaluate three classes of Edge TPUs, covering different computing ecosystems, that are either currently deployed in Google products or are the produc…

    Submitted 11 October, 2022; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: 13 pages, 15 figures, 8 tables, published in IISWC 2022

  6. arXiv:2102.08619

    cs.LG cs.AR

    Rethinking Co-design of Neural Architectures and Hardware Accelerators

    Authors: Yanqi Zhou, Xuanyi Dong, Berkin Akin, Mingxing Tan, Daiyi Peng, Tianjian Meng, Amir Yazdanbakhsh, Da Huang, Ravi Narayanaswami, James Laudon

    Abstract: Neural architectures and hardware accelerators have been two driving forces for the progress in deep learning. Previous works typically attempt to optimize hardware given a fixed model architecture or model architecture given fixed hardware. And the dominant hardware architecture explored in this prior work is FPGAs. In our work, we target the optimization of hardware and software configurations o…

    Submitted 17 February, 2021; originally announced February 2021.

  7. arXiv:2102.01723

    cs.LG cs.AR

    Apollo: Transferable Architecture Exploration

    Authors: Amir Yazdanbakhsh, Christof Angermueller, Berkin Akin, Yanqi Zhou, Albin Jones, Milad Hashemi, Kevin Swersky, Satrajit Chatterjee, Ravi Narayanaswami, James Laudon

    Abstract: The looming end of Moore's Law and ascending use of deep learning drives the design of custom accelerators that are optimized for specific neural architectures. Architecture exploration for such accelerators forms a challenging constrained optimization problem over a complex, high-dimensional, and structured input space with a costly to evaluate objective function. Existing approaches for accelera…

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: 10 pages, 5 figures, Accepted to Workshop on ML for Systems at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  8. arXiv:2008.08178

    cs.CV

    Discovering Multi-Hardware Mobile Models via Architecture Search

    Authors: Grace Chu, Okan Arikan, Gabriel Bender, Weijun Wang, Achille Brighton, Pieter-Jan Kindermans, Hanxiao Liu, Berkin Akin, Suyog Gupta, Andrew Howard

    Abstract: Hardware-aware neural architecture designs have been predominantly focusing on optimizing model performance on single hardware and model development complexity, where another important factor, model deployment complexity, has been largely ignored. In this paper, we argue that, for applications that may be deployed on multiple hardware, having different single-hardware models across the deployed ha…

    Submitted 23 April, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: CVPR Workshop 2021

  9. arXiv:2004.14525

    cs.CV

    MobileDets: Searching for Object Detection Architectures for Mobile Accelerators

    Authors: Yunyang Xiong, Hanxiao Liu, Suyog Gupta, Berkin Akin, Gabriel Bender, Yongzhe Wang, Pieter-Jan Kindermans, Mingxing Tan, Vikas Singh, Bo Chen

    Abstract: Inverted bottleneck layers, which are built upon depthwise convolutions, have been the predominant building blocks in state-of-the-art object detection models on mobile devices. In this work, we investigate the optimality of this design pattern over a broad range of mobile accelerators by revisiting the usefulness of regular convolutions. We discover that regular convolutions are a potent componen…

    Submitted 30 March, 2021; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: Accepted at CVPR 2021; Code and models are available in the TensorFlow Object Detection API: https://github.com/tensorflow/models/tree/master/research/object_detection

  10. arXiv:2003.02838

    eess.SP cs.LG stat.ML

    Accelerator-aware Neural Network Design using AutoML

    Authors: Suyog Gupta, Berkin Akin

    Abstract: While neural network hardware accelerators provide a substantial amount of raw compute throughput, the models deployed on them must be co-designed for the underlying hardware architecture to obtain the optimal system performance. We present a class of computer vision models designed using hardware-aware neural architecture search and customized to run on the Edge TPU, Google's neural network hardw…

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted paper at the On-device Intelligence Workshop at MLSys Conference 2020

  11. arXiv:1401.4543

    cs.SI cs.CY physics.soc-ph

    On the Potential of Twitter for Understanding the Tunisia of the Post-Arab Spring

    Authors: Meriem Ben-Salah Akin

    Abstract: Micro-blogging through Twitter has made information short and to the point, and more importantly systematically searchable. This work is the first of a series in which quotidian observations about Tunisia are obtained using the micro-blogging site Twitter. Data was extracted using the open source Twitter API v1.1. Specific tweets were obtained using functional search operators in particular themat…

    Submitted 18 January, 2014; originally announced January 2014.

    Comments: 7 pages, 5 figures