Showing 1–45 of 45 results for author: Stowell, D

Searching in archive cs.
  1. arXiv:2406.01253  [pdf, other]

    cs.SD cs.AI eess.AS q-bio.QM stat.AP

    animal2vec and MeerKAT: A self-supervised transformer for rare-event raw audio input and a large-scale reference dataset for bioacoustics

    Authors: Julian C. Schäfer-Zimmermann, Vlad Demartsev, Baptiste Averly, Kiran Dhanjal-Adams, Mathieu Duteil, Gabriella Gall, Marius Faiß, Lily Johnson-Ulrich, Dan Stowell, Marta B. Manser, Marie A. Roch, Ariana Strandburg-Peshkin

    Abstract: Bioacoustic research provides invaluable insights into the behavior, ecology, and conservation of animals. Most bioacoustic datasets consist of long recordings where events of interest, such as vocalizations, are exceedingly rare. Analyzing these datasets poses a monumental challenge to researchers, where deep learning techniques have emerged as a standard method. Their adaptation remains challeng…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Code available at: https://github.com/livingingroups/animal2vec | Dataset available at: https://doi.org/10.17617/3.0J0DYB

  2. arXiv:2404.03474  [pdf, other]

    cs.CV cs.AI

    Performance of computer vision algorithms for fine-grained classification using crowdsourced insect images

    Authors: Rita Pucci, Vincent J. Kalkman, Dan Stowell

    Abstract: With fine-grained classification, we identify unique characteristics to distinguish among classes of the same super-class. We are focusing on species recognition in Insecta, as they are critical for biodiversity monitoring and at the base of many ecosystems. With citizen science campaigns, billions of images are collected in the wild. Once these are labelled, experts can use them to create distrib…

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2312.09269  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Efficient speech detection in environmental audio using acoustic recognition and knowledge distillation

    Authors: Drew Priebe, Burooj Ghani, Dan Stowell

    Abstract: The ongoing biodiversity crisis, driven by factors such as land-use change and global warming, emphasizes the need for effective ecological monitoring methods. Acoustic monitoring of biodiversity has emerged as an important monitoring tool. Detecting human voices in soundscape monitoring projects is useful both for analysing human disturbance and for privacy filtering. Despite significant strides…

    Submitted 14 December, 2023; originally announced December 2023.

  4. arXiv:2311.04945  [pdf, other]

    cs.LG cs.AI cs.SD eess.AS

    Auto deep learning for bioacoustic signals

    Authors: Giulio Tosato, Abdelrahman Shehata, Joshua Janssen, Kees Kamp, Pramatya Jati, Dan Stowell

    Abstract: This study investigates the potential of automated deep learning to enhance the accuracy and efficiency of multi-class classification of bird vocalizations, compared against traditional manually-designed deep learning models. Using the Western Mediterranean Wetland Birds dataset, we investigated the use of AutoKeras, an automated machine learning framework, to automate neural architecture search a…

    Submitted 26 December, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  5. arXiv:2311.01526  [pdf, other]

    cs.SD cs.LG eess.AS

    ATGNN: Audio Tagging Graph Neural Network

    Authors: Shubhr Singh, Christian J. Steinmetz, Emmanouil Benetos, Huy Phan, Dan Stowell

    Abstract: Deep learning models such as CNNs and Transformers have achieved impressive performance for end-to-end audio tagging. Recent works have shown that despite stacking multiple layers, the receptive field of CNNs remains severely limited. Transformers on the other hand are able to map global context through self-attention, but treat the spectrogram as a sequence of patches which is not flexible enough…

    Submitted 2 November, 2023; originally announced November 2023.

  6. arXiv:2307.11112  [pdf, other]

    cs.CV cs.LG

    Comparison between transformers and convolutional models for fine-grained classification of insects

    Authors: Rita Pucci, Vincent J. Kalkman, Dan Stowell

    Abstract: Fine-grained classification is challenging due to the difficulty of finding discriminatory features. This problem is exacerbated when applied to identifying species within the same taxonomical class. This is because species are often sharing morphological characteristics that make them difficult to differentiate. We consider the taxonomical class of Insecta. The identification of insects is essent…

    Submitted 20 July, 2023; originally announced July 2023.

  7. arXiv:2306.09223  [pdf, other]

    cs.SD cs.LG eess.AS

    Few-shot bioacoustic event detection at the DCASE 2023 challenge

    Authors: Ines Nolasco, Burooj Ghani, Shubhr Singh, Ester Vidaña-Vila, Helen Whitehead, Emily Grout, Michael Emmerson, Frants Jensen, Ivan Kiskin, Joe Morford, Ariana Strandburg-Peshkin, Lisa Gill, Hanna Pamuła, Vincent Lostanlen, Dan Stowell

    Abstract: Few-shot bioacoustic event detection consists in detecting sound events of specified types, in varying soundscapes, while having access to only a few examples of the class of interest. This task ran as part of the DCASE challenge for the third time this year with an evaluation set expanded to include new animal species, and a new rule: ensemble models were no longer allowed. The 2023 few shot task…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: submitted to DCASE 2023 workshop

  8. arXiv:2305.13210  [pdf, other]

    cs.SD eess.AS q-bio.QM

    Learning to detect an animal sound from five examples

    Authors: Inês Nolasco, Shubhr Singh, Veronica Morfi, Vincent Lostanlen, Ariana Strandburg-Peshkin, Ester Vidaña-Vila, Lisa Gill, Hanna Pamuła, Helen Whitehead, Ivan Kiskin, Frants H. Jensen, Joe Morford, Michael G. Emmerson, Elisabetta Versace, Emily Grout, Haohe Liu, Dan Stowell

    Abstract: Automatic detection and classification of animal sounds has many applications in biodiversity monitoring and animal behaviour. In the past twenty years, the volume of digitised wildlife sound available has massively increased, and automatic classification through deep learning now shows strong results. However, bioacoustics is not a single task but a vast range of small-scale tasks (such as indivi…

    Submitted 22 May, 2023; originally announced May 2023.

  9. arXiv:2304.12739  [pdf]

    cs.SD eess.AS q-bio.QM

    Adaptive Representations of Sound for Automatic Insect Recognition

    Authors: Marius Faiß, Dan Stowell

    Abstract: Insect population numbers and biodiversity have been rapidly declining with time, and monitoring these trends has become increasingly important for conservation measures to be effectively implemented. But monitoring methods are often invasive, time and resource intense, and prone to various biases. Many insect species produce characteristic sounds that can easily be detected and recorded without l…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 35 pages, 11 figures. arXiv admin note: substantial text overlap with arXiv:2211.09503

  10. arXiv:2210.07685  [pdf]

    cs.SD eess.AS

    Full-Stack Bioacoustics: Field Kit to AI to Action (Workshop report)

    Authors: Dan Stowell, Caitlin Black, Florencia Noriega, Sarab S. Sethi

    Abstract: Acoustic data (sound recordings) are a vital source of evidence for detecting, counting, and distinguishing wildlife. This domain of "bioacoustics" has grown in the past decade due to the massive advances in signal processing and machine learning, recording devices, and the capacity of data processing and storage. Numerous research papers describe the use of Raspberry Pi or similar devices for aco…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Workshop report: Lorentz Center, Leiden, the Netherlands, 1-5 August 2022

  11. arXiv:2207.07911  [pdf, other]

    cs.SD cs.LG eess.AS

    Few-shot bioacoustic event detection at the DCASE 2022 challenge

    Authors: I. Nolasco, S. Singh, E. Vidana-Villa, E. Grout, J. Morford, M. Emmerson, F. Jensens, H. Whitehead, I. Kiskin, A. Strandburg-Peshkin, L. Gill, H. Pamula, V. Lostanlen, V. Morfi, D. Stowell

    Abstract: Few-shot sound event detection is the task of detecting sound events, despite having only a few labelled examples of the class of interest. This framework is particularly useful in bioacoustics, where often there is a need to annotate very long recordings but the expert annotator time is limited. This paper presents an overview of the second edition of the few-shot bioacoustic sound event detectio…

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: submitted to DCASE2022 workshop

  12. arXiv:2207.06349  [pdf]

    cs.SD eess.AS

    Polyphonic sound event detection for highly dense birdsong scenes

    Authors: Alberto García Arroba Parrilla, Dan Stowell

    Abstract: One hour before sunrise, one can experience the dawn chorus where birds from different species sing together. In this scenario, high levels of polyphony, as in the number of overlapping sound sources, are prone to happen resulting in a complex acoustic outcome. Sound Event Detection (SED) tasks analyze acoustic scenarios in order to identify the occurring events and their respective temporal infor…

    Submitted 13 July, 2022; originally announced July 2022.

  13. arXiv:2112.06725  [pdf, other]

    cs.SD eess.AS q-bio.QM

    Computational bioacoustics with deep learning: a review and roadmap

    Authors: Dan Stowell

    Abstract: Animal vocalisations and natural soundscapes are fascinating objects of study, and contain valuable evidence about animal behaviours, populations and ecosystems. They are studied in bioacoustics and ecoacoustics, with signal processing and analysis an important component. Computational bioacoustics has accelerated in recent decades due to the growth of affordable digital sound recording devices, a…

    Submitted 13 December, 2021; originally announced December 2021.

  14. Rank-based loss for learning hierarchical representations

    Authors: Ines Nolasco, Dan Stowell

    Abstract: Hierarchical taxonomies are common in many contexts, and they are a very natural structure humans use to organise information. In machine learning, the family of methods that use the 'extra' information is called hierarchical classification. However, applied to audio classification, this remains relatively unexplored. Here we focus on how to integrate the hierarchical information of a problem to l…

    Submitted 11 February, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: This version corrects a bug in the baseline results

  15. arXiv:2012.03216  [pdf, other]

    cs.SD cs.LG eess.AS

    Guitar Effects Recognition and Parameter Estimation with Convolutional Neural Networks

    Authors: Marco Comunità, Dan Stowell, Joshua D. Reiss

    Abstract: Despite the popularity of guitar effects, there is very little existing research on classification and parameter estimation of specific plugins or effect units from guitar recordings. In this paper, convolutional neural networks were used for classification and parameter estimation for 13 overdrive, distortion and fuzz guitar effects. A novel dataset of processed electric guitar samples was assemb…

    Submitted 6 December, 2020; originally announced December 2020.

    Journal ref: JAES Volume 69 Issue 7/8 pp. 594-604; July 2021

  16. arXiv:2010.02275  [pdf, other]

    cs.LG

    Short-term prediction of photovoltaic power generation using Gaussian process regression

    Authors: Yahya Al Lawati, Jack Kelly, Dan Stowell

    Abstract: Photovoltaic (PV) power is affected by weather conditions, making the power generated from the PV systems uncertain. Solving this problem would help improve the reliability and cost effectiveness of the grid, and could help reduce reliance on fossil fuel plants. The present paper focuses on evaluating predictions of the energy generated by PV systems in the United Kingdom Gaussian process regressi…

    Submitted 5 October, 2020; originally announced October 2020.

  17. arXiv:1908.04672  [pdf, other]

    eess.AS cs.SD eess.SP

    Estimating & Mitigating the Impact of Acoustic Environments on Machine-to-Machine Signalling

    Authors: Amogh Matt, Dan Stowell

    Abstract: The advance of technology for transmitting Data-over-Sound in various IoT and telecommunication applications has led to the concept of machine-to-machine over-the-air acoustic signalling. Reverberation can have a detrimental effect on such machine-to-machine signals while decoding. Various methods have been studied to combat the effects of reverberation in speech and audio signals, but it is not c…

    Submitted 13 August, 2019; originally announced August 2019.

  18. Efficient On-line Computation of Visibility Graphs

    Authors: Delia Fano Yela, Florian Thalmann, Vincenzo Nicosia, Dan Stowell, Mark Sandler

    Abstract: A visibility algorithm maps time series into complex networks following a simple criterion. The resulting visibility graph has recently proven to be a powerful tool for time series analysis. However its straightforward computation is time-consuming and rigid, motivating the development of more efficient algorithms. Here we present a highly efficient method to compute visibility graphs with the fur…

    Submitted 8 May, 2019; originally announced May 2019.

    Comments: code https://github.com/delialia/bst

    Journal ref: Phys. Rev. Research 2, 023069 (2020)

  19. arXiv:1903.01976  [pdf, other]

    cs.SD eess.AS

    Spectral Visibility Graphs: Application to Similarity of Harmonic Signals

    Authors: Delia Fano Yela, Dan Stowell, Mark Sandler

    Abstract: Graph theory is emerging as a new source of tools for time series analysis. One promising method is to transform a signal into its visibility graph, a representation which captures many interesting aspects of the signal. Here we introduce the visibility graph for audio spectra and propose a novel representation for audio analysis: the spectral visibility graph degree. Such representation inherentl…

    Submitted 20 June, 2019; v1 submitted 5 March, 2019; originally announced March 2019.

    Comments: European Signal Processing Conference (EUSIPCO)

  20. arXiv:1901.11436  [pdf, other]

    stat.ML cs.LG cs.SD eess.AS eess.SP

    End-to-End Probabilistic Inference for Nonstationary Audio Analysis

    Authors: William J. Wilkinson, Michael Riis Andersen, Joshua D. Reiss, Dan Stowell, Arno Solin

    Abstract: A typical audio signal processing pipeline includes multiple disjoint analysis stages, including calculation of a time-frequency representation followed by spectrogram-based feature analysis. We show how time-frequency analysis and nonnegative matrix factorisation can be jointly formulated as a spectral mixture Gaussian process model with nonstationary priors over the amplitude variance parameters…

    Submitted 27 April, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: Accepted to the Thirty-sixth International Conference on Machine Learning (ICML) 2019

  21. arXiv:1811.02489  [pdf, other]

    eess.SP cs.LG cs.SD eess.AS stat.ML

    Unifying Probabilistic Models for Time-Frequency Analysis

    Authors: William J. Wilkinson, Michael Riis Andersen, Joshua D. Reiss, Dan Stowell, Arno Solin

    Abstract: In audio signal processing, probabilistic time-frequency models have many benefits over their non-probabilistic counterparts. They adapt to the incoming signal, quantify uncertainty, and measure correlation between the signal's amplitude and phase information, making time domain resynthesis straightforward. However, these models are still not widely used since they come at a high computational cos…

    Submitted 12 February, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: Accepted to International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019

  22. arXiv:1811.02275  [pdf, other]

    cs.SD cs.DL eess.AS

    NIPS4Bplus: a richly annotated birdsong audio dataset

    Authors: Veronica Morfi, Yves Bas, Hanna Pamuła, Hervé Glotin, Dan Stowell

    Abstract: Recent advances in birdsong detection and classification have approached a limit due to the lack of fully annotated recordings. In this paper, we present NIPS4Bplus, the first richly annotated birdsong audio dataset, that is comprised of recordings containing bird vocalisations along with their active species tags plus the temporal annotations acquired for them. Statistical information about the r…

    Submitted 14 November, 2018; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: 5 pages, 5 figures, submitted to ICASSP 2019

  23. arXiv:1810.12679  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP stat.ML

    Sparse Gaussian Process Audio Source Separation Using Spectrum Priors in the Time-Domain

    Authors: Pablo A. Alvarado, Mauricio A. Álvarez, Dan Stowell

    Abstract: Gaussian process (GP) audio source separation is a time-domain approach that circumvents the inherent phase approximation issue of spectrogram based methods. Furthermore, through its kernel, GPs elegantly incorporate prior knowledge about the sources into the separation model. Despite these compelling advantages, the computational complexity of GP inference scales cubically with the number of audi…

    Submitted 21 November, 2018; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: Paper submitted to the 44th International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019. To be held in Brighton, United Kingdom, between May 12 and May 17, 2019

  24. arXiv:1810.09273  [pdf, other]

    cs.SD eess.AS

    Automatic acoustic identification of individual animals: Improving generalisation across species and recording conditions

    Authors: Dan Stowell, Tereza Petrusková, Martin Šálek, Pavel Linhart

    Abstract: Many animals emit vocal sounds which, independently from the sounds' function, embed some individually-distinctive signature. Thus the automatic recognition of individuals by sound is a potentially powerful tool for zoology and ecology research and practical monitoring. Here we present a general automatic identification method, that can work across multiple animal species with various levels of co…

    Submitted 22 October, 2018; originally announced October 2018.

  25. arXiv:1807.06972  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Data-Efficient Weakly Supervised Learning for Low-Resource Audio Event Detection Using Deep Learning

    Authors: Veronica Morfi, Dan Stowell

    Abstract: We propose a method to perform audio event detection under the common constraint that only limited training data are available. In training a deep learning system to perform audio event detection, two practical problems arise. Firstly, most datasets are "weakly labelled" having only a list of events present in each recording without any temporal information for training. Secondly, deep neural netw…

    Submitted 26 October, 2018; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: 5 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:1807.03697

  26. Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge

    Authors: Dan Stowell, Yannis Stylianou, Mike Wood, Hanna Pamuła, Hervé Glotin

    Abstract: Assessing the presence and abundance of birds is important for monitoring specific species as well as overall ecosystem health. Many birds are most readily detected by their sounds, and thus passive acoustic monitoring is highly appropriate. Yet acoustic monitoring is often held back by practical limitations such as the need for manual configuration, reliance on example sound libraries, low accura…

    Submitted 16 July, 2018; originally announced July 2018.

  27. arXiv:1807.03697  [pdf, other]

    cs.LG stat.ML

    Deep Learning for Audio Transcription on Low-Resource Datasets

    Authors: Veronica Morfi, Dan Stowell

    Abstract: In training a deep learning system to perform audio transcription, two practical problems may arise. Firstly, most datasets are weakly labelled, having only a list of events present in each recording without any temporal information for training. Secondly, deep neural networks need a very large amount of labelled training data to achieve good quality performance, yet in practice it is difficult to…

    Submitted 11 July, 2018; v1 submitted 10 July, 2018; originally announced July 2018.

    Comments: 20 pages, 5 figures

  28. arXiv:1804.02325  [pdf, other]

    cs.SD eess.AS

    Does k Matter? k-NN Hubness Analysis for Kernel Additive Modelling Vocal Separation

    Authors: Delia Fano Yela, Dan Stowell, Mark Sandler

    Abstract: Kernel Additive Modelling (KAM) is a framework for source separation aiming to explicitly model inherent properties of sound sources to help with their identification and separation. KAM separates a given source by applying robust statistics on the selection of time-frequency bins obtained through a source-specific kernel, typically the k-NN function. Even though the parameter k appears to be key…

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: LVA-ICA 2018 - Feedback always welcome

  29. arXiv:1802.00680  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    A Generative Model for Natural Sounds Based on Latent Force Modelling

    Authors: William J. Wilkinson, Joshua D. Reiss, Dan Stowell

    Abstract: Recent advances in analysis of subband amplitude envelopes of natural sounds have resulted in convincing synthesis, showing subband amplitudes to be a crucial component of perception. Probabilistic latent variable analysis is particularly revealing, but existing approaches don't incorporate prior knowledge about the physical behaviour of amplitude envelopes, such as exponential decay and feedback.…

    Submitted 27 March, 2019; v1 submitted 2 February, 2018; originally announced February 2018.

    Comments: 10 pages, 5 figures

  30. arXiv:1705.07104  [pdf, other]

    stat.ML cs.SD

    Efficient Learning of Harmonic Priors for Pitch Detection in Polyphonic Music

    Authors: Pablo A. Alvarado, Dan Stowell

    Abstract: Automatic music transcription (AMT) aims to infer a latent symbolic representation of a piece of music (piano-roll), given a corresponding observed audio recording. Transcribing polyphonic music (when multiple notes are played simultaneously) is a challenging problem, due to highly structured overlapping between harmonics. We study whether the introduction of physically inspired Gaussian process (…

    Submitted 16 November, 2018; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: Updated version with appendix section about derivation of amplitude modulated GP

  31. arXiv:1612.05489  [pdf, other]

    cs.SD

    On-bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts

    Authors: Dan Stowell, Emmanouil Benetos, Lisa F. Gill

    Abstract: We introduce a novel approach to studying animal behaviour and the context in which it occurs, through the use of microphone backpacks carried on the backs of individual free-flying birds. These sensors are increasingly used by animal behaviour researchers to study individual vocalisations of freely behaving animals, even in the field. However such devices may record more than an animal's vocal beh…

    Submitted 16 December, 2016; originally announced December 2016.

  32. Bird detection in audio: a survey and a challenge

    Authors: Dan Stowell, Mike Wood, Yannis Stylianou, Hervé Glotin

    Abstract: Many biological monitoring projects rely on acoustic detection of birds. Despite increasingly large datasets, this detection is often manual or semi-automatic, requiring manual tuning/postprocessing. We review the state of the art in automatic bird sound detection, and identify a widespread need for tuning-free and species-agnostic approaches. We introduce new datasets and an IEEE research challen…

    Submitted 11 August, 2016; originally announced August 2016.

    Comments: Slightly extended preprint of paper accepted for MLSP 2016

  33. arXiv:1606.01039  [pdf, ps, other]

    stat.ML cs.SD

    Gaussian Processes for Music Audio Modelling and Content Analysis

    Authors: Pablo A. Alvarado, Dan Stowell

    Abstract: Real music signals are highly variable, yet they have strong statistical structure. Prior information about the underlying physical mechanisms by which sounds are generated and rules by which complex sound structure is constructed (notes, chords, a complete musical score), can be naturally unified using Bayesian modelling techniques. Typically algorithms for Automatic Music Transcription independe…

    Submitted 10 June, 2016; v1 submitted 3 June, 2016; originally announced June 2016.

  34. arXiv:1603.07236  [pdf, other]

    cs.SD

    Individual identity in songbirds: signal representations and metric learning for locating the information in complex corvid calls

    Authors: Dan Stowell, Veronica Morfi, Lisa F. Gill

    Abstract: Bird calls range from simple tones to rich dynamic multi-harmonic structures. The more complex calls are very poorly understood at present, such as those of the scientifically important corvid family (jackdaws, crows, ravens, etc.). Individual birds can recognise familiar individuals from calls, but where in the signal is this identity encoded? We studied the question by applying a combination of…

    Submitted 26 April, 2016; v1 submitted 23 March, 2016; originally announced March 2016.

  35. arXiv:1603.07173  [pdf, other]

    cs.SD

    Deductive Refinement of Species Labelling in Weakly Labelled Birdsong Recordings

    Authors: Veronica Morfi, Dan Stowell

    Abstract: Many approaches have been used in bird species classification from their sound in order to provide labels for the whole of a recording. However, a more precise classification of each bird vocalization would be of great importance to the use and management of sound archives and bird monitoring. In this work, we introduce a technique that using a two step process can first automatically detect all b…

    Submitted 23 March, 2016; originally announced March 2016.

    Comments: 11 pages, 1 figure

  36. arXiv:1601.05449  [pdf, other]

    q-bio.QM cs.SI

    Detailed temporal structure of communication networks in groups of songbirds

    Authors: Dan Stowell, Lisa Gill, David Clayton

    Abstract: Animals in groups often exchange calls, in patterns whose temporal structure may be influenced by contextual factors such as physical location and the social network structure of the group. We introduce a model-based analysis for temporal patterns of animal call timing, originally developed for networks of firing neurons. This has advantages over cross-correlation analysis in that it can correctly…

    Submitted 20 January, 2016; originally announced January 2016.

  37. arXiv:1509.05982  [pdf, other]

    cs.NE cs.LG

    Denoising without access to clean data using a partitioned autoencoder

    Authors: Dan Stowell, Richard E. Turner

    Abstract: Training a denoising autoencoder neural network requires access to truly clean data, a requirement which is often impractical. To remedy this, we introduce a method to train an autoencoder using only noisy data, having examples with and without the signal class of interest. The autoencoder learns a partitioned representation of signal and noise, learning to reconstruct each separately. We illustra…

    Submitted 22 September, 2015; v1 submitted 20 September, 2015; originally announced September 2015.

  38. arXiv:1503.07150  [pdf, other]

    cs.SD

    Acoustic event detection for multiple overlapping similar sources

    Authors: Dan Stowell, David Clayton

    Abstract: Many current paradigms for acoustic event detection (AED) are not adapted to the organic variability of natural sounds, and/or they assume a limit on the number of simultaneous sources: often only one source, or one source of each type, may be active. These aspects are highly undesirable for applications such as bird population monitoring. We introduce a simple method modelling the onsets, duratio…

    Submitted 9 July, 2015; v1 submitted 24 March, 2015; originally announced March 2015.

    Comments: Accepted for WASPAA 2015

  39. Acoustic Scene Classification

    Authors: Daniele Barchiesi, Dimitrios Giannoulis, Dan Stowell, Mark D. Plumbley

    Abstract: In this article we present an account of the state-of-the-art in acoustic scene classification (ASC), the task of classifying environments from the sounds they produce. Starting from a historical review of previous research in this area, we define a general framework for ASC and present different implementations of its components. We then describe a range of different algorithms submitted for a…

    Submitted 13 November, 2014; originally announced November 2014.

    Journal ref: IEEE Signal Processing Magazine 32(3) (May 2015) 16-34

  40. Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning

    Authors: Dan Stowell, Mark D. Plumbley

    Abstract: Automatic species classification of birds from their sound is a computational tool of increasing importance in ecology, conservation monitoring and vocal communication studies. To make classification useful in practice, it is crucial to improve its accuracy while ensuring that it can run at big data scales. Many approaches use acoustic measures based on spectrogram-type data, such as the Mel-frequ…

    Submitted 26 May, 2014; originally announced May 2014.

    Journal ref: PeerJ 2:e488, 2014

  41. Large-scale analysis of frequency modulation in birdsong databases

    Authors: Dan Stowell, Mark D. Plumbley

    Abstract: Birdsong often contains large amounts of rapid frequency modulation (FM). It is believed that the use or otherwise of FM is adaptive to the acoustic environment, and also that there are specific social uses of FM such as trills in aggressive territorial encounters. Yet temporal fine detail of FM is often absent or obscured in standard audio signal analysis methods such as Fourier analysis or linea…

    Submitted 19 November, 2013; originally announced November 2013.

    Journal ref: Methods in Ecology and Evolution, Volume 5, Issue 9, pages 901-912, September 2014

  42. arXiv:1309.5275  [pdf, other]

    cs.SD cs.DL

    An open dataset for research on audio field recording archives: freefield1010

    Authors: Dan Stowell, Mark D. Plumbley

    Abstract: We introduce a free and open dataset of 7690 audio clips sampled from the field-recording tag in the Freesound audio archive. The dataset is designed for use in research related to data mining in audio archives of field recordings / soundscapes. Audio is standardised, and audio and metadata are Creative Commons licensed. We describe the data preparation process, characterise the dataset descriptiv…

    Submitted 1 October, 2013; v1 submitted 20 September, 2013; originally announced September 2013.

  43. Improved multiple birdsong tracking with distribution derivative method and Markov renewal process clustering

    Authors: Dan Stowell, Sašo Muševič, Jordi Bonada, Mark D. Plumbley

    Abstract: Segregating an audio mixture containing multiple simultaneous bird sounds is a challenging task. However, birdsong often contains rapid pitch modulations, and these modulations carry information which may be of use in automatic recognition. In this paper we demonstrate that an improved spectrogram representation, based on the distribution derivative method, leads to improved performance of a segre…

    Submitted 15 February, 2013; v1 submitted 14 February, 2013; originally announced February 2013.

    Comments: Submitted to ICASSP 2013

  44. arXiv:1302.0136  [pdf, other]

    cs.SD

    Maximum a posteriori estimation of piecewise arcs in tempo time-series

    Authors: Dan Stowell, Elaine Chew

    Abstract: In musical performances with expressive tempo modulation, the tempo variation can be modelled as a sequence of tempo arcs. Previous authors have used this idea to estimate series of piecewise arc segments from data. In this paper we describe a probabilistic model for a time-series process of this nature, and use this to perform inference of single- and multi-level arc processes from data. We descr…

    Submitted 1 February, 2013; originally announced February 2013.

    Comments: Submitted to postprint volume for Computer Music Modeling and Retrieval (CMMR) 2012

  45. arXiv:1211.2972  [pdf, other]

    cs.AI

    Segregating event streams and noise with a Markov renewal process model

    Authors: Dan Stowell, Mark D. Plumbley

    Abstract: We describe an inference task in which a set of timestamped event observations must be clustered into an unknown number of temporal sequences with independent and varying rates of observations. Various existing approaches to multi-object tracking assume a fixed number of sources and/or a fixed observation rate; we develop an approach to inferring structure in timestamped data produced by a mixture…

    Submitted 13 November, 2012; originally announced November 2012.

    ACM Class: I.5.1

    Journal ref: Journal of Machine Learning Research, 14(Aug):2213-2238, 2013