Skip to main content

Showing 1–12 of 12 results for author: Ericsson, L

.
  1. arXiv:2405.20838  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    einspace: Searching for Neural Architectures from Fundamental Operations

    Authors: Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

    Abstract: Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they did not e.g. create a shift from convolutional structures to transformers. This is not least because the search spaces in NAS often aren't diverse enough to include such transformations a priori. Instead, for NAS to provide greater potential for fundamental design shift… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Project page at https://linusericsson.github.io/einspace/

  2. arXiv:2404.06466  [pdf, other

    cs.LG stat.ML

    Hyperparameter Selection in Continual Learning

    Authors: Thomas L. Lee, Sigrid Passano Hellan, Linus Ericsson, Elliot J. Crowley, Amos Storkey

    Abstract: In continual learning (CL) -- where a learner trains on a stream of data -- standard hyperparameter optimisation (HPO) cannot be applied, as a learner does not have access to all of the data at the same time. This has prompted the development of CL-specific HPO frameworks. The most popular way to tune hyperparameters in CL is to repeatedly train over the whole data stream with different hyperparam… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint, 9 pages

  3. arXiv:2403.17695  [pdf, other

    cs.CV cs.LG

    PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

    Authors: Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley

    Abstract: We present PlainMamba: a simple non-hierarchical state space model (SSM) designed for general visual recognition. The recent Mamba model has shown how SSMs can be highly competitive with other architectures on sequential data and initial attempts have been made to apply it to images. In this paper, we further adapt the selective scanning process of Mamba to the visual domain, enhancing its ability… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  4. arXiv:2311.11819  [pdf

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Generalized super-resolution 4D Flow MRI $\unicode{x2013}$ using ensemble learning to extend across the cardiovascular system

    Authors: Leon Ericsson, Adam Hjalmarsson, Muhammad Usman Akbar, Edward Ferdian, Mia Bonini, Brandon Hardy, Jonas Schollenberger, Maria Aristova, Patrick Winter, Nicholas Burris, Alexander Fyrdahl, Andreas Sigfridsson, Susanne Schnell, C. Alberto Figueroa, David Nordsletten, Alistair A. Young, David Marlevi

    Abstract: 4D Flow Magnetic Resonance Imaging (4D Flow MRI) is a non-invasive measurement technique capable of quantifying blood flow across the cardiovascular system. While practical use is limited by spatial resolution and image noise, incorporation of trained super-resolution (SR) networks has potential to enhance image quality post-scan. However, these efforts have predominantly been restricted to narrow… ▽ More

    Submitted 21 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  5. arXiv:2311.08815  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To address this, we introduce a more principled approach that seeks to disentangle style f… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  6. arXiv:2309.03879  [pdf, other

    cs.LG cs.CV

    Better Practices for Domain Adaptation

    Authors: Linus Ericsson, Da Li, Timothy M. Hospedales

    Abstract: Distribution shifts are all too common in real-world applications of machine learning. Domain adaptation (DA) aims to address this by providing various frameworks for adapting models to the deployment data without using labels. However, the domain shift scenario raises a second more subtle challenge: the difficulty of performing hyperparameter optimisation (HPO) for these adaptation algorithms wit… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: AutoML 2023 (Best paper award)

  7. arXiv:2305.08252  [pdf, other

    cs.CV cs.AI

    Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity

    Authors: Raman Dutt, Linus Ericsson, Pedro Sanchez, Sotirios A. Tsaftaris, Timothy Hospedales

    Abstract: Foundation models have significantly advanced medical image analysis through the pre-train fine-tune paradigm. Among various fine-tuning algorithms, Parameter-Efficient Fine-Tuning (PEFT) is increasingly utilized for knowledge transfer across diverse tasks, including vision-language and text-to-image generation. However, its application in medical image analysis is relatively unexplored due to the… ▽ More

    Submitted 10 June, 2024; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted as Oral Presentation at MIDL 2024

  8. Label-Efficient Object Detection via Region Proposal Network Pre-Training

    Authors: Nanqing Dong, Linus Ericsson, Yongxin Yang, Ales Leonardis, Steven McDonagh

    Abstract: Self-supervised pre-training, based on the pretext task of instance discrimination, has fueled the recent advance in label-efficient object detection. However, existing studies focus on pre-training only a feature extractor network to learn transferable representations for downstream detection tasks. This leads to the necessity of training multiple detection-specific modules from scratch in the fi… ▽ More

    Submitted 15 February, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Accepted by Neurocomputing

  9. arXiv:2111.11398  [pdf, other

    cs.CV

    Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks

    Authors: Linus Ericsson, Henry Gouk, Timothy M. Hospedales

    Abstract: Self-supervised learning is a powerful paradigm for representation learning on unlabelled images. A wealth of effective new methods based on instance matching rely on data-augmentation to drive learning, and these have reached a rough agreement on an augmentation scheme that optimises popular recognition benchmarks. However, there is strong reason to suspect that different tasks in computer vision… ▽ More

    Submitted 10 October, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

    Comments: Code available at https://github.com/linusericsson/ssl-invariances

  10. arXiv:2110.09327  [pdf, other

    cs.LG cs.CV stat.ML

    Self-Supervised Representation Learning: Introduction, Advances and Challenges

    Authors: Linus Ericsson, Henry Gouk, Chen Change Loy, Timothy M. Hospedales

    Abstract: Self-supervised representation learning methods aim to provide powerful deep feature learning without the requirement of large annotated datasets, thus alleviating the annotation bottleneck that is one of the main barriers to practical deployment of deep learning today. These methods have advanced rapidly in recent years, with their efficacy approaching and sometimes surpassing fully supervised pr… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  11. arXiv:2011.13377  [pdf, other

    cs.CV

    How Well Do Self-Supervised Models Transfer?

    Authors: Linus Ericsson, Henry Gouk, Timothy M. Hospedales

    Abstract: Self-supervised visual representation learning has seen huge progress recently, but no large scale evaluation has compared the many models now available. We evaluate the transfer performance of 13 top self-supervised models on 40 downstream tasks, including many-shot and few-shot recognition, object detection, and dense prediction. We compare their performance to a supervised baseline and show tha… ▽ More

    Submitted 29 March, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: CVPR 2021. Code available at https://github.com/linusericsson/ssl-transfer

  12. arXiv:2006.12360  [pdf, other

    cs.LG stat.ML

    Don't Wait, Just Weight: Improving Unsupervised Representations by Learning Goal-Driven Instance Weights

    Authors: Linus Ericsson, Henry Gouk, Timothy M. Hospedales

    Abstract: In the absence of large labelled datasets, self-supervised learning techniques can boost performance by learning useful representations from unlabelled data, which is often more readily available. However, there is often a domain shift between the unlabelled collection and the downstream target problem data. We show that by learning Bayesian instance weights for the unlabelled data, we can improve… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.