Skip to main content

Showing 1–15 of 15 results for author: McDonnell, M D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.07356  [pdf, other

    cs.CV cs.LG

    Premonition: Using Generative Models to Preempt Future Data Changes in Continual Learning

    Authors: Mark D. McDonnell, Dong Gong, Ehsan Abbasnejad, Anton van den Hengel

    Abstract: Continual learning requires a model to adapt to ongoing changes in the data distribution, and often to the set of tasks to be performed. It is rare, however, that the data and task changes are completely unpredictable. Given a description of an overarching goal or data theme, which we call a realm, humans can often guess what concepts are associated with it. We show here that the combination of a… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 31 pages total (14 main paper, 5 references, 12 appendices)

  2. arXiv:2307.02251  [pdf, other

    cs.LG cs.CV

    RanPAC: Random Projections and Pre-trained Models for Continual Learning

    Authors: Mark D. McDonnell, Dong Gong, Amin Parveneh, Ehsan Abbasnejad, Anton van den Hengel

    Abstract: Continual learning (CL) aims to incrementally learn different tasks (such as classification) in a non-stationary data stream without forgetting old ones. Most CL works focus on tackling catastrophic forgetting under a learning-from-scratch paradigm. However, with the increasing prominence of foundation models, pre-trained models equipped with informative representations have become available for v… ▽ More

    Submitted 15 January, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: 32 pages, 11 figures

    Journal ref: 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023), Dec 2023, New Orleans, United States

  3. arXiv:1907.06916  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Single-bit-per-weight deep convolutional neural networks without batch-normalization layers for embedded systems

    Authors: Mark D. McDonnell, Hesham Mostafa, Runchun Wang, Andre van Schaik

    Abstract: Batch-normalization (BN) layers are thought to be an integrally important layer type in today's state-of-the-art deep convolutional neural networks for computer vision tasks such as classification and detection. However, BN layers introduce complexity and computational overheads that are highly undesirable for training and/or inference on low-power custom hardware implementations of real-time embe… ▽ More

    Submitted 22 July, 2019; v1 submitted 16 July, 2019; originally announced July 2019.

    Comments: 8 pages, published IEEE conference paper

  4. arXiv:1810.03241  [pdf, ps, other

    cs.CV

    Diagnosing Convolutional Neural Networks using their Spectral Response

    Authors: Victor Stamatescu, Mark D. McDonnell

    Abstract: Convolutional Neural Networks (CNNs) are a class of artificial neural networks whose computational blocks use convolution, together with other linear and non-linear operations, to perform classification or regression. This paper explores the spectral response of CNNs and its potential use in diagnosing problems with their training. We measure the gain of CNNs trained for image classification on Im… ▽ More

    Submitted 7 October, 2018; originally announced October 2018.

    ACM Class: I.5.2; I.4.7

  5. arXiv:1802.08530  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Training wide residual networks for deployment using a single bit for each weight

    Authors: Mark D. McDonnell

    Abstract: For fast and energy-efficient deployment of trained deep neural networks on resource-constrained embedded hardware, each learned weight parameter should ideally be represented and stored using a single bit. Error-rates usually increase when this requirement is imposed. Here, we report large improvements in error rates on multiple datasets, for deep convolutional neural networks deployed with 1-bit… ▽ More

    Submitted 23 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

    Journal ref: ICLR 2018 - International Conference on Learning Representations, Apr 2018, Vancouver, Canada. 2018

  6. Track Everything: Limiting Prior Knowledge in Online Multi-Object Recognition

    Authors: Sebastien C. Wong, Victor Stamatescu, Adam Gatt, David Kearney, Ivan Lee, Mark D. McDonnell

    Abstract: This paper addresses the problem of online tracking and classification of multiple objects in an image sequence. Our proposed solution is to first track all objects in the scene without relying on object-specific prior knowledge, which in other systems can take the form of hand-crafted features or user-based track initialization. We then classify the tracked objects with a fast-learning image clas… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.

    Comments: 15 pages

    ACM Class: I.4.8

  7. arXiv:1609.08764  [pdf, ps, other

    cs.CV

    Understanding data augmentation for classification: when to warp?

    Authors: Sebastien C. Wong, Adam Gatt, Victor Stamatescu, Mark D. McDonnell

    Abstract: In this paper we investigate the benefit of augmenting data with synthetically created samples when training a machine learning classifier. Two approaches for creating additional training samples are data war**, which generates additional samples through transformations applied in the data-space, and synthetic over-sampling, which creates additional samples in feature-space. We experimentally ev… ▽ More

    Submitted 26 November, 2016; v1 submitted 28 September, 2016; originally announced September 2016.

    Comments: 6 pages, 6 figures, DICTA 2016 conference

    ACM Class: I.5.2; I.4.7

  8. arXiv:1503.04596  [pdf, other

    cs.NE cs.CV cs.LG

    Enhanced Image Classification With a Fast-Learning Shallow Convolutional Neural Network

    Authors: Mark D. McDonnell, Tony Vladusich

    Abstract: We present a neural network architecture and training method designed to enable very rapid training and low implementation complexity. Due to its training speed and very few tunable parameters, the method has strong potential for applications requiring frequent retraining or online training. The approach is characterized by (a) convolutional filters based on biologically inspired visual processing… ▽ More

    Submitted 15 August, 2015; v1 submitted 16 March, 2015; originally announced March 2015.

    Comments: 7 pages, 2 figures, Paper at IJCNN 2015 (International Joint Conference on Neural Networks, 2015)

  9. Fast, simple and accurate handwritten digit classification by training shallow neural network classifiers with the 'extreme learning machine' algorithm

    Authors: Mark D. McDonnell, Migel D. Tissera, Tony Vladusich, André van Schaik, Jonathan Tapson

    Abstract: Recent advances in training deep (multi-layer) architectures have inspired a renaissance in neural network use. For example, deep convolutional networks are becoming the default option for difficult tasks on large datasets, such as image and speech recognition. However, here we show that error rates below 1% on the MNIST handwritten digit benchmark can be replicated with shallow non-convolutional… ▽ More

    Submitted 22 July, 2015; v1 submitted 29 December, 2014; originally announced December 2014.

    Comments: Accepted for publication; 9 pages of text, 6 figures and 1 table

  10. Transmit Pulse Sha** for Molecular Communications

    Authors: Siyi Wang, Weisi Guo, Mark D. McDonnell

    Abstract: This paper presents a method for sha** the transmit pulse of a molecular signal such that the diffusion channel's response is a sharp pulse. The impulse response of a diffusion channel is typically characterised as having an infinitely long transient response. This can cause severe inter-symbol-interference, and reduce the achievable reliable bit rate. We achieve the desired chemical channel res… ▽ More

    Submitted 11 April, 2014; originally announced April 2014.

    Comments: 2 pages, 1 figure, IEEE Conference on Computer Communications (INFOCOM)

  11. Distance Distributions for Real Cellular Networks

    Authors: Siyi Wang, Weisi Guo, Mark D. McDonnell

    Abstract: This paper presents the general distribution for the distance between a mobile user and any base station (BS). We show that a random variable proportional to the distance squared is Gamma distributed. In the case of the nearest BS, it can be reduced to the well established result of the distance being Rayleigh distributed. We validate our results using a random node simulation and real Vodafone 3G… ▽ More

    Submitted 11 April, 2014; originally announced April 2014.

    Comments: 2 pages, 1 figure, IEEE Conference on Computer Communications (INFOCOM)

  12. Performance of Macro-Scale Molecular Communications with Sensor Cleanse Time

    Authors: Siyi Wang, Weisi Guo, Song Qiu, Mark D. McDonnell

    Abstract: In this paper, we consider a molecular diffusion based communications link that conveys information on the macro-scale (several metres). The motivation is to apply molecular-based communications to challenging electromagnetic environments. We first derive a novel capture probability expression of a finite sized receiver. The paper then introduces the concept of time-aggregated molecular noise at t… ▽ More

    Submitted 1 April, 2014; originally announced April 2014.

    Comments: 6 pages, 6 figures, IEEE International Conference on Telecommunications (ICT)

  13. Downlink Interference Estimation without Feedback for Heterogeneous Network Interference Avoidance

    Authors: Siyi Wang, Weisi Guo, Mark D. McDonnell

    Abstract: In this paper, we present a novel method for a base station (BS) to estimate the total downlink interference power received by any given mobile receiver, without information feedback from the user or information exchange between neighbouring BSs. The prediction method is deterministic and can be computed rapidly. This is achieved by first abstracting the cellular network into a mathematical model,… ▽ More

    Submitted 1 April, 2014; originally announced April 2014.

    Comments: 6 pages, 5 figures, IEEE International Conference on Telecommunications (ICT)

  14. An Introductory Review of Information Theory in the Context of Computational Neuroscience

    Authors: Mark D. McDonnell, Shiro Ikeda, Jonathan H. Manton

    Abstract: This paper introduces several fundamental concepts in information theory from the perspective of their origins in engineering. Understanding such concepts is important in neuroscience for two reasons. Simply applying formulae from information theory without understanding the assumptions behind their definitions can lead to erroneous results and conclusions. Furthermore, this century will see a con… ▽ More

    Submitted 14 July, 2011; originally announced July 2011.

    Comments: 18 pages, 7 figures, to appear in Biological Cybernetics

    Journal ref: Biological Cybernetics, 105(1), 1-16, 2011

  15. Signal acquisition via polarization modulation in single photon sources

    Authors: Mark D. McDonnell, Adrian P. Flitney

    Abstract: A simple model system is introduced for demonstrating how a single photon source might be used to transduce classical analog information. The theoretical scheme results in measurements of analog source samples that are (i) quantized in the sense of analog-to-digital conversion and (ii) corrupted by random noise that is solely due to the quantum uncertainty in detecting the polarization state of… ▽ More

    Submitted 25 November, 2009; v1 submitted 18 November, 2009; originally announced November 2009.

    Comments: 7 pages, 2 figures, accepted by Physical Review E. This version adds a reference

    Journal ref: Physical Review E 80, 060102(R) (2009)