Showing 1–3 of 3 results for author: Borup, K

  1. arXiv:2304.12314  [pdf, other]

    cs.CV cs.AI cs.LG

    Distilling from Similar Tasks for Transfer Learning on a Budget

    Authors: Kenneth Borup, Cheng Perng Phoo, Bharath Hariharan

    Abstract: We address the challenge of getting efficient yet accurate recognition systems with limited labels. While recognition models improve with model size and amount of data, many specialized applications of computer vision have severe resource constraints both during training and inference. Transfer learning is an effective solution for training with few labels, however often at the expense of a comput…

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: 11 pages

  2. arXiv:2304.02641  [pdf, other]

    stat.ML cs.LG

    Self-Distillation for Gaussian Process Regression and Classification

    Authors: Kenneth Borup, Lars Nørvang Andersen

    Abstract: We propose two approaches to extend the notion of knowledge distillation to Gaussian Process Regression (GPR) and Gaussian Process Classification (GPC): data-centric and distribution-centric. The data-centric approach resembles most current distillation techniques for machine learning, and refits a model on deterministic predictions from the teacher, while the distribution-centric approach, re-use…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 10 pages; code at https://github.com/Kennethborup/gaussian_process_self_distillation

  3. arXiv:2102.13088  [pdf, other]

    cs.LG stat.ML

    Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

    Authors: Kenneth Borup, Lars N. Andersen

    Abstract: Knowledge distillation is classically a procedure where a neural network is trained on the output of another network along with the original targets in order to transfer knowledge between the architectures. The special case of self-distillation, where the network architectures are identical, has been observed to improve generalization accuracy. In this paper, we consider an iterative variant of se…

    Submitted 15 October, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: To be published at NeurIPS 2021; 21 pages, 14 figures