-
A comparison of Fourier and POD mode decomposition methods for high-speed Hall thruster video
Authors:
J. W. Brooks,
M. S. McDonald,
A. A. Kaptanoglu
Abstract:
Hall thrusters are susceptible to large-amplitude plasma oscillations that impact thruster performance and lifetime and are also difficult to model. High-speed cameras are a popular tool for studying these dynamics because of their spatial resolution, and they are a nonintrusive complement to in-situ probes. High-speed video of thruster oscillations can be isolated (decomposed) into coherent structures (modes) with algorithms that help us better understand the evolution and interactions of each. This work provides an introduction, comparison, and step-by-step tutorial on established Fourier and newer Proper Orthogonal Decomposition (POD) algorithms as applied to high-speed video of the unshielded H6 6-kW laboratory model Hall thruster. From this dataset, both sets of algorithms identify and characterize $m=0$ and $m>0$ modes in the discharge channel and cathode regions of the thruster plume, as well as mode hopping between the $m=3$ and $m=4$ rotating spokes in the channel. The Fourier methods are ideal for characterizing linear modal structures and also provide intuitive dispersion relationships. By contrast, the POD method tailors a basis set using energy minimization techniques that better captures the nonlinear nature of these structures and is simpler to implement. Together, the Fourier and POD methods provide a more complete toolkit for studying Hall thruster plasma instabilities and mode dynamics. Specifically, we recommend applying POD first to quickly identify the nature and location of global dynamics and then using Fourier methods to isolate dispersion plots and other wave-based physics.
Submitted 27 May, 2022;
originally announced May 2022.
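The POD step described in the abstract above can be illustrated with a minimal sketch. It assumes the common snapshot formulation, in which the video is flattened into a pixels-by-frames matrix, the temporal mean is subtracted, and a singular value decomposition supplies the spatial modes, temporal coefficients, and relative mode energies. The function name `pod` and the synthetic test signal (a uniform $m=0$ oscillation plus a rotating $m=3$ pattern on a ring of 64 azimuthal bins) are hypothetical and are not taken from the paper.

```python
import numpy as np

def pod(frames):
    """Snapshot POD via SVD (illustrative sketch, not the paper's code).

    frames: array of shape (n_frames, ...) holding one image per frame.
    Returns spatial modes U, singular values s, temporal coefficients Vt,
    and the fraction of fluctuation energy captured by each mode.
    """
    n_frames = frames.shape[0]
    # Flatten each frame into a column: X has shape (n_pixels, n_frames).
    X = frames.reshape(n_frames, -1).T.astype(float)
    # Remove the time-averaged image so the modes describe fluctuations only.
    X = X - X.mean(axis=1, keepdims=True)
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    energy = s**2 / np.sum(s**2)
    return U, s, Vt, energy

# Hypothetical synthetic "video": 400 frames of a 64-bin azimuthal ring.
theta = np.linspace(0.0, 2.0 * np.pi, 64, endpoint=False)
t = np.arange(400) / 400.0
# m=0 mode: the whole ring brightens and dims together.
breathing = np.cos(2.0 * np.pi * 10.0 * t)[:, None] * np.ones_like(theta)[None, :]
# m=3 mode: a three-lobed pattern rotating around the ring.
spoke = 0.5 * np.cos(3.0 * theta[None, :] - 2.0 * np.pi * 25.0 * t[:, None])
frames = 2.0 + breathing + spoke

U, s, Vt, energy = pod(frames)
print("energy fraction of first four modes:", np.round(energy[:4], 4))
```

In this synthetic case the rotating pattern appears as a pair of modes with equal energy, because a travelling wave is the sum of two standing waves in quadrature. Real camera data would add noise and nonlinearity, so the energy would spread over more modes.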
-
Learning Models for Query by Vocal Percussion: A Comparative Study
Authors:
Alejandro Delgado,
SkoT McDonald,
Ning Xu,
Charalampos Saitis,
Mark Sandler
Abstract:
The imitation of percussive sounds via the human voice is a natural and effective tool for communicating rhythmic ideas on the fly. Thus, the automatic retrieval of drum sounds using vocal percussion can help artists prototype drum patterns in a comfortable and quick way, smoothing the creative workflow as a result. Here we explore different strategies to perform this type of query, making use of both traditional machine learning algorithms and recent deep learning techniques. The main hyperparameters from the models involved are carefully selected by feeding performance metrics to a grid search algorithm. We also look into several audio data augmentation techniques, which can potentially regularise deep learning models and improve generalisation. We compare the final performances in terms of effectiveness (classification accuracy), efficiency (computational speed), stability (performance consistency), and interpretability (decision patterns), and discuss the relevance of these results when it comes to the design of successful query-by-vocal-percussion systems.
Submitted 18 October, 2021;
originally announced October 2021.
-
A New Dataset for Amateur Vocal Percussion Analysis
Authors:
Alejandro Delgado,
SKoT McDonald,
Ning Xu,
Mark Sandler
Abstract:
The imitation of percussive instruments via the human voice is a natural way for us to communicate rhythmic ideas and, for this reason, it attracts the interest of music makers. Specifically, the automatic mapping of these vocal imitations to their emulated instruments would allow creators to realistically prototype rhythms in a faster way. The contribution of this study is two-fold. Firstly, a new Amateur Vocal Percussion (AVP) dataset is introduced to investigate how people with little or no experience in beatboxing approach the task of vocal percussion. The end goal of this analysis is to help mapping algorithms generalise better between subjects and achieve higher performance. The dataset comprises a total of 9780 utterances recorded by 28 participants with fully annotated onsets and labels (kick drum, snare drum, closed hi-hat and opened hi-hat). Secondly, we conducted baseline experiments on audio onset detection with the recorded dataset, comparing the performance of four state-of-the-art algorithms in a vocal percussion context.
Submitted 24 September, 2020;
originally announced September 2020.
-
Adversarial Attacks in Sound Event Classification
Authors:
Vinod Subramanian,
Emmanouil Benetos,
Ning Xu,
SKoT McDonald,
Mark Sandler
Abstract:
Adversarial attacks refer to a set of methods that perturb the input to a classification model in order to fool the classifier. In this paper we apply different gradient-based adversarial attack algorithms on five deep learning models trained for sound event classification. Four of the models use mel-spectrogram input and one model uses raw audio input. The models represent standard architectures such as convolutional, recurrent and dense networks. The dataset used for training is the Freesound dataset released for task 2 of the DCASE 2018 challenge and the models used are from participants of the challenge who open-sourced their code. Our experiments show that adversarial attacks can be generated with high confidence and low perturbation. In addition, we show that the adversarial attacks are very effective across the different models.
Submitted 15 August, 2019; v1 submitted 4 July, 2019;
originally announced July 2019.
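The idea of a gradient-based attack mentioned in the abstract above can be shown with a small sketch. The abstract does not name the specific algorithms used, so this example assumes the fast gradient sign method (FGSM) as a representative technique and substitutes a toy logistic-regression classifier on a random 16-dimensional feature vector for the paper's deep audio models. All names (`fgsm`, `bce_loss`, `w`, `b`, `eps`) are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bce_loss(w, b, x, y):
    """Binary cross-entropy of a logistic classifier on one example."""
    p = sigmoid(w @ x + b)
    return -(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))

def fgsm(w, b, x, y, eps):
    """One FGSM step: move x by eps along the sign of the loss gradient.

    For logistic regression the gradient of the loss with respect to the
    input has the closed form (p - y) * w, so no autodiff is needed here.
    """
    p = sigmoid(w @ x + b)
    grad_x = (p - y) * w
    return x + eps * np.sign(grad_x)

# Toy stand-in for a trained model and an input feature vector (assumed).
rng = np.random.default_rng(0)
w = rng.normal(size=16)
b = 0.1
x = rng.normal(size=16)
y = 1.0

x_adv = fgsm(w, b, x, y, eps=0.05)
loss_clean = bce_loss(w, b, x, y)
loss_adv = bce_loss(w, b, x_adv, y)
print("loss before attack:", loss_clean)
print("loss after attack: ", loss_adv)
```

The perturbation is bounded by `eps` in every coordinate, which is one way to read the abstract's "low perturbation"; attacking a deep network would follow the same pattern with the gradient computed by automatic differentiation.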