Showing 1–10 of 10 results for author: Meyer, B H

  1. arXiv:2304.11207

    cs.CV cs.AI

    SSS3D: Fast Neural Architecture Search For Efficient Three-Dimensional Semantic Segmentation

    Authors: Olivier Therrien, Marihan Amein, Zhuoran Xiong, Warren J. Gross, Brett H. Meyer

    Abstract: We present SSS3D, a fast multi-objective NAS framework designed to find computationally efficient 3D semantic scene segmentation networks. It uses RandLA-Net, an off-the-shelf point-based network, as a super-network to enable weight sharing and reduce search time by 99.67% for single-stage searches. SSS3D has a complex search space composed of sampling and architectural parameters that can form 2.…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: Accepted as a full paper by the TinyML Research Symposium 2023

  2. arXiv:2303.16322

    cs.CV cs.AI

    FMAS: Fast Multi-Objective SuperNet Architecture Search for Semantic Segmentation

    Authors: Zhuoran Xiong, Marihan Amein, Olivier Therrien, Warren J. Gross, Brett H. Meyer

    Abstract: We present FMAS, a fast multi-objective neural architecture search framework for semantic segmentation. FMAS subsamples the structure and pre-trained parameters of DeepLabV3+, without fine-tuning, dramatically reducing training time during search. To further reduce candidate evaluation time, we use a subset of the validation dataset during the search. Only the final, Pareto non-dominated, candidat…

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted as a full paper by the TinyML Research Symposium 2023

  3. arXiv:2212.12965

    cs.CV

    BD-KD: Balancing the Divergences for Online Knowledge Distillation

    Authors: Ibtihel Amara, Nazanin Sepahvand, Brett H. Meyer, Warren J. Gross, James J. Clark

    Abstract: Knowledge distillation (KD) has gained a lot of attention in the field of model compression for edge devices thanks to its effectiveness in compressing large powerful networks into smaller lower-capacity models. Online distillation, in which both the teacher and the student are learning collaboratively, has also gained much interest due to its ability to improve on the performance of the networks…

    Submitted 25 December, 2022; originally announced December 2022.

  4. arXiv:2209.07606

    cs.CV cs.LG

    CES-KD: Curriculum-based Expert Selection for Guided Knowledge Distillation

    Authors: Ibtihel Amara, Maryam Ziaeefard, Brett H. Meyer, Warren Gross, James J. Clark

    Abstract: Knowledge distillation (KD) is an effective tool for compressing deep classification models for edge devices. However, the performance of KD is affected by the large capacity gap between the teacher and student networks. Recent methods have resorted to a multiple teacher assistant (TA) setting for KD, which sequentially decreases the size of the teacher model to relatively bridge the size gap betw…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: ICPR2022

  5. arXiv:2208.02070

    cs.CL cs.LG

    Efficient Fine-Tuning of Compressed Language Models with Learners

    Authors: Danilo Vucetic, Mohammadreza Tayaranian, Maryam Ziaeefard, James J. Clark, Brett H. Meyer, Warren J. Gross

    Abstract: Fine-tuning BERT-based models is resource-intensive in memory, computation, and time. While many prior works aim to improve inference efficiency via compression techniques, e.g., pruning, these works do not explicitly address the computational challenges of training to downstream tasks. We introduce Learner modules and priming, novel methods for fine-tuning that exploit the overparameterization of…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 8 pages, 9 figures, 2 tables, presented at ICML 2022 workshop on Hardware-Aware Efficient Training (HAET 2022)

  6. Efficient Fine-Tuning of BERT Models on the Edge

    Authors: Danilo Vucetic, Mohammadreza Tayaranian, Maryam Ziaeefard, James J. Clark, Brett H. Meyer, Warren J. Gross

    Abstract: Resource-constrained devices are increasingly the deployment targets of machine learning applications. Static models, however, do not always suffice for dynamic environments. On-device training of models allows for quick adaptability to new scenarios. With the increasing size of deep neural networks, as noted with the likes of BERT and other natural language processing models, comes increased reso…

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 4 pages, 2 figures, 3 tables. To be published in ISCAS 2022 and made available on IEEE Xplore

  7. Worst-Case Execution Time Calculation for Query-Based Monitors by Witness Generation

    Authors: Márton Búr, Kristóf Marussy, Brett H. Meyer, Dániel Varró

    Abstract: Runtime monitoring plays a key role in the assurance of modern intelligent cyber-physical systems, which are frequently data-intensive and safety-critical. While graph queries can serve as an expressive yet formally precise specification language to capture the safety properties of interest, there are no timeliness guarantees for such auto-generated runtime monitoring programs, which prevents thei…

    Submitted 3 November, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: 36 pages, 11 figures, submitted to ACM Transactions on Embedded Computing Systems (accepted version)

    ACM Class: C.3.3

    Journal ref: ACM Transactions on Embedded Computing Systems, Volume 20, Issue 6, 2021 November

  8. arXiv:2006.01659

    cs.LG stat.ML

    Surprisal-Triggered Conditional Computation with Neural Networks

    Authors: Loren Lugosch, Derek Nowrouzezahrai, Brett H. Meyer

    Abstract: Autoregressive neural network models have been used successfully for sequence generation, feature extraction, and hypothesis scoring. This paper presents yet another use for these models: allocating more computation to more difficult inputs. In our model, an autoregressive model is used both to extract features and to predict observations in a stream of input observations. The surprisal of the inp…

    Submitted 2 June, 2020; originally announced June 2020.

  9. arXiv:1809.11086

    cs.LG cs.CC stat.ML

    Learning Recurrent Binary/Ternary Weights

    Authors: Arash Ardakani, Zhengyun Ji, Sean C. Smithson, Brett H. Meyer, Warren J. Gross

    Abstract: Recurrent neural networks (RNNs) have shown excellent performance in processing sequence data. However, they are both complex and memory intensive due to their recursive nature. These limitations make RNNs difficult to embed on mobile devices requiring real-time processes with limited hardware resources. To address the above issues, we introduce a method that can learn binary and ternary weights d…

    Submitted 24 January, 2019; v1 submitted 28 September, 2018; originally announced September 2018.

    Comments: Published as a conference paper at ICLR 2019

  10. arXiv:1611.02120

    cs.NE cs.LG

    Neural Networks Designing Neural Networks: Multi-Objective Hyper-Parameter Optimization

    Authors: Sean C. Smithson, Guang Yang, Warren J. Gross, Brett H. Meyer

    Abstract: Artificial neural networks have gone through a recent rise in popularity, achieving state-of-the-art results in various fields, including image classification, speech recognition, and automated control. Both the performance and computational complexity of such models are heavily dependent on the design of characteristic hyper-parameters (e.g., number of hidden layers, nodes per layer, or choice of…

    Submitted 7 November, 2016; originally announced November 2016.

    Comments: To appear in ICCAD'16. The authoritative version will appear in the ACM Digital Library