Search | arXiv e-print repository

Investigating Design Choices in Joint-Embedding Predictive Architectures for General Audio Representation Learning

Authors: Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters

Abstract: This paper addresses the problem of self-supervised general-purpose audio representation learning. We explore the use of Joint-Embedding Predictive Architectures (JEPA) for this task, which consists of splitting an input mel-spectrogram into two parts (context and target), computing neural representations for each, and training the neural network to predict the target representations from the context representations. We investigate several design choices within this framework and study their influence through extensive experiments by evaluating our models on various audio classification benchmarks, including environmental sounds, speech and music downstream tasks. We focus notably on which part of the input data is used as context or target and show experimentally that it significantly impacts the model's quality. In particular, we notice that some effective design choices in the image domain lead to poor performance on audio, thus highlighting major differences between these two modalities.

Submitted 14 May, 2024; originally announced May 2024.

Comments: Self-supervision in Audio, Speech and Beyond workshop, IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

arXiv:2403.00688 [pdf, ps, other]

Degradation-Invariant Music Indexing

Authors: Rémi Mignot, Geoffroy Peeters

Abstract: For music indexing robust to sound degradations and scalable for big music catalogs, this scientific report presents an approach based on audio descriptors relevant to the music content and invariant to sound transformations (e.g. noise addition, distortion, lossy coding, pitch/time transformations, or filtering). To achieve this task, one of the key points of the proposed method is the definition of high-dimensional audio prints, which are intrinsically (by design) robust to some sound degradations. The high dimensionality of this first representation is then used to learn a linear projection to a significantly smaller sub-space, which further reduces the sensitivity to sound degradations using a series of discriminant analyses. Finally, anchoring the analysis times on local maxima of a selected onset function, approximate hashing is performed to provide better tolerance to bit corruptions and, at the same time, to make the method easier to scale.

Submitted 1 March, 2024; originally announced March 2024.

arXiv:2312.14507 [pdf, other]

Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport

Authors: Bernardo Torres, Geoffroy Peeters, Gaël Richard

Abstract: In neural audio signal processing, pitch conditioning has been used to enhance the performance of synthesizers. However, jointly training pitch estimators and synthesizers is a challenge when using standard audio-to-audio reconstruction loss, leading to reliance on external pitch trackers. To address this issue, we propose using a spectral loss function inspired by optimal transportation theory that minimizes the displacement of spectral energy. We validate this approach through an unsupervised autoencoding task that fits a harmonic template to harmonic signals. We jointly estimate the fundamental frequency and amplitudes of harmonics using a lightweight encoder and reconstruct the signals using a differentiable harmonic synthesizer. The proposed approach offers a promising direction for improving unsupervised parameter estimation in neural audio applications.

Submitted 15 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

Comments: Accepted in ICASSP 2024

Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2024, Seoul, South Korea

arXiv:2312.14005 [pdf, ps, other]

On the choice of the optimal temporal support for audio classification with Pre-trained embeddings

Authors: Aurian Quelennec, Michel Olvera, Geoffroy Peeters, Slim Essid

Abstract: Current state-of-the-art audio analysis systems rely on pre-trained embedding models, often used off-the-shelf as (frozen) feature extractors. Choosing the best one for a set of tasks is the subject of many recent publications. However, one aspect often overlooked in these works is the influence of the duration of audio input considered to extract an embedding, which we refer to as Temporal Support (TS). In this work, we study the influence of the TS for well-established or emerging pre-trained embeddings, chosen to represent different types of architectures and learning paradigms. We conduct this evaluation using both musical instrument and environmental sound datasets, namely OpenMIC, TAU Urban Acoustic Scenes 2020 Mobile, and ESC-50. We especially highlight that Audio Spectrogram Transformer-based systems (PaSST and BEATs) remain effective with smaller TS, which therefore allows for a drastic reduction in memory and computational cost. Moreover, we show that by choosing the optimal TS we reach competitive results across all tasks. In particular, we improve the state-of-the-art results on OpenMIC, using BEATs and PaSST without any fine-tuning.

Submitted 21 December, 2023; originally announced December 2023.

Comments: Copyright 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

arXiv:2310.11781 [pdf, other]

Blind estimation of audio effects using an auto-encoder approach and differentiable digital signal processing

Authors: Côme Peladeau, Geoffroy Peeters

Abstract: Blind Estimation of Audio Effects (BE-AFX) aims at estimating the Audio Effects (AFXs) applied to an original, unprocessed audio sample solely based on the processed audio sample. To train such a system, traditional approaches optimize a loss between ground truth and estimated AFX parameters. This involves knowing the exact implementation of the AFXs used for the process. In this work, we propose an alternative solution that eliminates the requirement for knowing this implementation. Instead, we introduce an auto-encoder approach, which optimizes an audio quality metric. We explore, suggest, and compare various implementations of commonly used mastering AFXs, using differentiable signal processing or neural approximations. Our findings demonstrate that our auto-encoder approach yields superior estimates of the audio quality produced by a chain of AFXs, compared to the traditional parameter-based approach, even if the latter provides a more accurate parameter estimation.

Submitted 9 February, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

arXiv:2309.02265 [pdf, other]

PESTO: Pitch Estimation with Self-supervised Transposition-equivariant Objective

Authors: Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters

Abstract: In this paper, we address the problem of pitch estimation using Self Supervised Learning (SSL). The SSL paradigm we use is equivariance to pitch transposition, which enables our model to accurately perform pitch estimation on monophonic audio after being trained only on a small unlabeled dataset. We use a lightweight ($<$ 30k parameters) Siamese neural network that takes as inputs two different pitch-shifted versions of the same audio represented by its Constant-Q Transform. To prevent the model from collapsing in an encoder-only setting, we propose a novel class-based transposition-equivariant objective which captures pitch information. Furthermore, we design the architecture of our network to be transposition-preserving by introducing learnable Toeplitz matrices. We evaluate our model for the two tasks of singing voice and musical instrument pitch estimation and show that our model is able to generalize across tasks and datasets while being lightweight, hence remaining compatible with low-resource devices and suitable for real-time applications. In particular, our results surpass self-supervised baselines and narrow the performance gap between self-supervised and supervised methods for pitch estimation.

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2309.02243 [pdf, other]

Self-Similarity-Based and Novelty-based loss for music structure analysis

Authors: Geoffroy Peeters

Abstract: Music Structure Analysis (MSA) is the task of identifying the musical segments that compose a music track and possibly labeling them based on their similarity. In this paper, we propose a supervised approach for the task of music boundary detection. In our approach, we simultaneously learn features and convolution kernels. For this, we jointly optimize (i) a loss based on the Self-Similarity-Matrix (SSM) obtained with the learned features, denoted by SSM-loss, and (ii) a loss based on the novelty score obtained by applying the learned kernels to the estimated SSM, denoted by novelty-loss. We also demonstrate that relative feature learning, through self-attention, is beneficial for the task of MSA. Finally, we compare the performance of our approach to previously proposed approaches on the standard RWC-Pop, and various subsets of SALAMI.

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2306.07187 [pdf, other]

doi 10.1109/TMM.2022.3152598

Video-to-Music Recommendation using Temporal Alignment of Segments

Authors: Laure Prétet, Gaël Richard, Clément Souchier, Geoffroy Peeters

Abstract: We study cross-modal recommendation of music tracks to be used as soundtracks for videos. This problem is known as the music supervision task. We build on a self-supervised system that learns a content association between music and video. In addition to the adequacy of content, adequacy of structure is crucial in music supervision to obtain relevant recommendations. We propose a novel approach to significantly improve the system's performance using structure-aware recommendation. The core idea is to consider not only the full audio-video clips, but rather shorter segments for training and inference. We find that using semantic segments and ranking the tracks according to sequence alignment costs significantly improves the results. We investigate the impact of different ranking metrics and segmentation methods.

Submitted 12 June, 2023; originally announced June 2023.

Journal ref: IEEE Transactions on Multimedia, 18 February 2022

arXiv:2211.08141 [pdf, other]

SSM-Net: feature learning for Music Structure Analysis using a Self-Similarity-Matrix based loss

Authors: Geoffroy Peeters, Florian Angulo

Abstract: In this paper, we propose a new paradigm to learn audio features for Music Structure Analysis (MSA). We train a deep encoder to learn features such that the Self-Similarity-Matrix (SSM) resulting from those approximates a ground-truth SSM. This is done by minimizing a loss between both SSMs. Since this loss is differentiable w.r.t. its input features we can train the encoder in a straightforward way. We successfully demonstrate the use of this training paradigm using the Area Under the Curve ROC (AUC) on the RWC-Pop dataset.

Submitted 15 November, 2022; originally announced November 2022.

Comments: Extended Abstracts for the Late-Breaking Demo Session of the 23rd Int. Society for Music Information Retrieval Conf., Bengaluru, India, 2022

arXiv:2211.07250 [pdf, other]

Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts

Authors: Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard

Abstract: As music has become more available especially on music streaming platforms, people have started to have distinct preferences to fit to their varying listening situations, also known as context. Hence, there has been a growing interest in considering the user's situation when recommending music to users. Previous works have proposed user-aware autotaggers to infer situation-related tags from music content and user's global listening preferences. However, in a practical music retrieval system, the autotagger could be only used by assuming that the context class is explicitly provided by the user. In this work, for designing a fully automatised music retrieval system, we propose to disambiguate the user's listening information from their stream data. Namely, we propose a system which can generate a situational playlist for a user at a certain time 1) by leveraging user-aware music autotaggers, and 2) by automatically inferring the user's situation from stream data (e.g. device, network) and user's general profile information (e.g. age). Experiments show that such a context-aware personalized music retrieval system is feasible, but the performance decreases in the case of new users, new tracks or when the number of context classes increases.

Submitted 14 November, 2022; originally announced November 2022.

Comments: Published in ISMIR

arXiv:2210.12016 [pdf, other]

doi 10.1063/5.0131736

Optical spin-wave detection beyond the diffraction limit

Authors: J. Lucassen, M. J. G. Peeters, C. F. Schippers, R. A. Duine, H. J. M. Swagten, B. Koopmans, R. Lavrijsen

Abstract: Spin waves are proposed as information carriers for next-generation computing devices because of their low power consumption. Moreover, their wave-like nature allows for novel computing paradigms. Conventional methods to detect spin waves are based either on electrical induction, limiting the downscaling and efficiency complicating eventual implementation, or on light scattering, where the minimum detectable spin-wave wavelength is set by the wavelength of the laser. In this Article we demonstrate magneto-optical detection of spin waves beyond the diffraction limit using a metallic grating that selectively absorbs laser light. Specifically, we demonstrate the detection of propagating spin waves with a wavelength of 700 nm using a diffraction-limited laser spot with a size of 10 $\mu$m in 20 nm thick Py strips. Additionally, we show that this grating is selective to the wavelength of the spin wave, providing wavevector-selective spin-wave detection. This should open up new avenues towards the integration of the burgeoning fields of photonics and magnonics, and aid in the optical detection of spin waves in the short-wavelength exchange regime for fundamental research.

Submitted 21 October, 2022; originally announced October 2022.

Comments: Includes supplementary

arXiv:2202.09198 [pdf, other]

Deep-Learning Architectures for Multi-Pitch Estimation: Towards Reliable Evaluation

Authors: Christof Weiß, Geoffroy Peeters

Abstract: Extracting pitch information from music recordings is a challenging but important problem in music signal processing. Frame-wise transcription or multi-pitch estimation aims for detecting the simultaneous activity of pitches in polyphonic music recordings and has recently seen major improvements thanks to deep-learning techniques, with a variety of proposed network architectures. In this paper, we realize different architectures based on CNNs, the U-net structure, and self-attention components. We propose several modifications to these architectures including self-attention modules for skip connections, recurrent layers to replace the self-attention, and a multi-task strategy with simultaneous prediction of the degree of polyphony. We compare variants of these architectures in different sizes for multi-pitch estimation, focusing on Western classical music beyond the piano-solo scenario using the MusicNet and Schubert Winterreise datasets. Our experiments indicate that most architectures yield competitive results and that larger model variants seem to be beneficial. However, we find that these results substantially depend on randomization effects and the particular choice of the training-test split, which questions the claim of superiority for particular architectures given only small improvements. We therefore investigate the influence of dataset splits in the presence of several movements of a work cycle (cross-version evaluation) and propose a best-practice splitting strategy for MusicNet, which weakens the influence of individual test tracks and suppresses overfitting to specific works and recording conditions. A final evaluation on a mixed dataset suggests that improvements on one specific dataset do not necessarily generalize to other scenarios, thus emphasizing the need for further high-quality multi-pitch datasets in order to reliably measure progress in music transcription tasks.

Submitted 18 February, 2022; originally announced February 2022.

arXiv:2110.12063 [pdf, other]

doi 10.1063/5.0077491

Ultra-low energy threshold engineering for all-optical switching of magnetization in dielectric-coated Co/Gd based synthetic-ferrimagnet

Authors: **zhi Li, Mark J. G. Peeters, Youri L. W. van Hees, Reinoud Lavrijsen, Bert Koopmans

Abstract: A femtosecond laser pulse is able to switch the magnetic state of a 3d-4f ferrimagnetic material on a pico-second time scale. Devices based on this all-optical switching (AOS) mechanism are competitive candidates for ultrafast memory applications. However, a large portion of the light energy is lost by reflection from the metal thin film as well as transmission to the substrate. In this paper, we explore the use of dielectric coatings to increase the light absorption by the magnetic metal layer based on the principle of constructive interference. We experimentally show that the switching energy oscillates with the dielectric layer thickness following the light interference profile as obtained from theoretical calculations. Furthermore, the switching threshold fluence can be reduced by at least $80\%$ to 0.6 mJ/cm$^2$ using two dielectric SiO$_2$ layers sandwiching the metal stack, which scales to 15 fJ of incident energy for a cell size of $50^2$ nm$^2$.

Submitted 22 October, 2021; originally announced October 2021.

arXiv:2110.11341 [pdf, ps, other]

doi 10.3847/1538-4357/ab2df6

The impact of accretion heating and thermal conduction on the dead zone of protoplanetary disks

Authors: B. N. Schobert, A. G. Peeters, F. Rath

Abstract: The paper investigates the influence of accretion heating and turbulent heat conduction on the equilibrium of protoplanetary disks, extending the 2D axis-symmetric passive disk model of Flock (Flock et al. 2016, ApJ 827, 144). The model includes dust sublimation and radiative transfer with the flux-limited diffusion approximation, and predicts the density and temperature profiles as well as the dust to gas ratio of the disk. It is shown that the accretion heating can have a large impact: For accretion rates above $5 \times 10^{-8}$ $M_\odot$/yr a zone forms behind the silicate condensation front with sufficiently high temperature to sublimate the dust and form a gaseous cavity. Assuming a Prandtl number ~ 0.7, it is furthermore shown that the turbulent heat conduction cannot be neglected in the evaluation of the temperature profile. While the inner rim position is not affected by viscous heating, the dead zone edge shifts radially outward for higher accretion rates.

Submitted 20 October, 2021; originally announced October 2021.

Journal ref: The Astrophysical Journal,881:56(10pp), 2019 August 10

arXiv:2110.10525 [pdf, ps, other]

doi 10.1051/0004-6361/202039398

Impact of dust diffusion on the rim shape of protoplanetary disks

Authors: B. N. Schobert, A. G. Peeters

Abstract: Context. Multiple mechanisms are known to give rise to turbulence in protoplanetary disks, which facilitates the accretion onto the central star. Small dust particles that are well coupled to the gas undergo diffusion due to this turbulent motion. Aims. This paper investigates the influence of turbulence-induced dust diffusion on the equilibrium of protoplanetary disks. Methods. The model accounts for dust sublimation, radiative transfer with the flux-limited diffusion approximation and dust diffusion. It predicts the density and temperature profiles as well as the dust-to-gas ratio of the disk. Results. It is shown that dust diffusion can have a large impact: if the dust survives for $10^4$ seconds or longer before it can be evaporated, dust diffusion widens the inner disk considerably. The latter effect is generated through a feedback mechanism as the diffusion length is much smaller than the disk width. With increasing dust diffusion, the inclination of the inner rim towards the stellar radiation becomes steeper until it is almost vertical. The temperature range of evaporation and condensation, which is linked to the dust composition, has no influence on this effect. Conclusions. For realistic parameters dust diffusion cannot be neglected when determining the equilibrium of the disk. Stronger turbulence inside the disk induces more dust diffusion. Therefore, the dust density grows more gradually over a greater distance and less radiation reaches the disk surface. The new equilibrium shape of the disk is more inclined towards the star. This effect is universal and independent of the specific dust composition.

Submitted 20 October, 2021; originally announced October 2021.

Journal ref: A&A 651, A27 (2021)

arXiv:2108.00970 [pdf, other]

Is there a "language of music-video clips" ? A qualitative and quantitative study

Authors: Laure Prétet, Gaël Richard, Geoffroy Peeters

Abstract: Automatically recommending a video for a given piece of music, or a piece of music for a given video, has become an important asset for the audiovisual industry, with user-generated or professional content. While both music and video have specific temporal organizations, most current works do not consider those and only focus on recommending a medium globally. As a first step toward the improvement of these recommendation systems, we study in this paper the relationship between music and video temporal organization. We do this for the case of official music videos, with a quantitative and a qualitative approach. Our assumption is that the movements in the music are correlated with those in the video. To validate this, we first interview a set of internationally recognized music video experts. We then perform a large-scale analysis of official music-video clips (which we manually annotated into video genres) using MIR description tools (downbeats and functional segments estimation) and Computer Vision tools (shot detection). Our study confirms that a "language of music-video clips" exists; i.e. editors favor the co-occurrence of music and video events using strategies such as anticipation. It also highlights that the amount of co-occurrence depends on the music and video genres.

Submitted 2 August, 2021; originally announced August 2021.

arXiv:2105.13862 [pdf, other]

doi 10.1103/PhysRevB.105.014429

Influence of magnetic fields on ultrafast laser-induced switching dynamics in Co/Gd bilayers

Authors: M. J. G. Peeters, Y. M. van Ballegooie, B. Koopmans

Abstract: Recently it has been shown that not only GdFeCo alloys exhibit single-pulse helicity-independent all-optical switching (HI-AOS), but that this effect is also seen in Co/Gd bilayers. However, there have been no reports on the explicit time dynamics of the switching process in these bilayers as of yet. Furthermore, time-resolved measurements of switching of other materials are typically done with a constant applied field to reset the magnetization between consecutive pulses and thus ensure repeatable behavior. In this paper we experimentally resolve the explicit dynamics of the switching process in Co/Gd, and the influence of applied magnetic fields on the switching process. We observe that after a switch within several picoseconds, the magnetization switches back at a timescale of hundreds of picoseconds. This backswitch includes a strong dependence on the magnetic field strength even at sub-tesla fields, significantly smaller than the exchange fields that govern the switching dynamics. This surprising behaviour is explained by a combination of longitudinal switching (on a picosecond timescale), precessional switching (on a nanosecond timescale) and domain-wall motion (on a timescale of 10 ns and beyond). We discuss these different switching regimes and their relative importance using simple model calculations.

Submitted 28 May, 2021; originally announced May 2021.

arXiv:2104.14799 [pdf, other]

Cross-Modal Music-Video Recommendation: A Study of Design Choices

Authors: Laure Pretet, Gael Richard, Geoffroy Peeters

Abstract: In this work, we study music/video cross-modal recommendation, i.e. recommending a music track for a video or vice versa. We rely on a self-supervised learning paradigm to learn from a large amount of unlabelled data. More precisely, we jointly learn audio and video embeddings by using their co-occurrence in music-video clips. In this work, we build upon a recent video-music retrieval system (the VM-NET), which originally relies on an audio representation obtained by a set of statistics computed over handcrafted features. We demonstrate here that using audio representation learning such as the audio embeddings provided by the pre-trained MuSimNet, OpenL3, MusicCNN or by AudioSet, largely improves recommendations. We also validate the use of the cross-modal triplet loss originally proposed in the VM-NET compared to the binary cross-entropy loss commonly used in self-supervised learning. We perform all our experiments using the Music Video Dataset (MVD).

Submitted 30 April, 2021; originally announced April 2021.

arXiv:2008.02070 [pdf, other]

Content based singing voice source separation via strong conditioning using aligned phonemes

Authors: Gabriel Meseguer-Brocal, Geoffroy Peeters

Abstract: Informed source separation has recently gained renewed interest with the introduction of neural networks and the availability of large multitrack datasets containing both the mixture and the separated sources. These approaches use prior information about the target source to improve separation. Historically, Music Information Retrieval researchers have focused primarily on score-informed source separation, but more recent approaches explore lyrics-informed source separation. However, because of the lack of multitrack datasets with time-aligned lyrics, models use weak conditioning with non-aligned lyrics. In this paper, we present a multimodal multitrack dataset with lyrics aligned in time at the word level with phonetic information as well as explore strong conditioning using the aligned phonemes. Our model follows a U-Net architecture and takes as input both the magnitude spectrogram of a musical mixture and a matrix with aligned phonetic information. The phoneme matrix is embedded to obtain the parameters that control Feature-wise Linear Modulation (FiLM) layers. These layers condition the U-Net feature maps to adapt the separation process to the presence of different phonemes via affine transformations. We show that phoneme conditioning can be successfully applied to improve singing voice source separation.

Submitted 5 August, 2020; originally announced August 2020.

Comments: 21st International Society for Music Information Retrieval Conference 11-15 October 2020, Montreal, Canada

arXiv:2005.12977 [pdf, other]

doi 10.1109/ICASSP40776.2020.9053135

Learning to rank music tracks using triplet loss

Authors: Laure Prétet, Gaël Richard, Geoffroy Peeters

Abstract: Most music streaming services rely on automatic recommendation algorithms to exploit their large music catalogs. These algorithms aim at retrieving a ranked list of music tracks based on their similarity with a target music track. In this work, we propose a method for direct recommendation based on the audio content without explicitly tagging the music tracks. To that aim, we propose several strategies to perform triplet mining from ranked lists. We train a Convolutional Neural Network to learn the similarity via triplet loss. These different strategies are compared and validated on a large-scale experiment against an auto-tagging based approach. The results obtained highlight the efficiency of our system, especially when associated with an Auto-pooling layer.

Submitted 18 May, 2020; originally announced May 2020.

arXiv:1910.09862 [pdf, other]

A Prototypical Triplet Loss for Cover Detection

Authors: Guillaume Doras, Geoffroy Peeters

Abstract: Automatic cover detection -- the task of finding in an audio dataset all covers of a query track -- has long been a challenging theoretical problem in the MIR community. It also became a practical need for music composers societies requiring to detect automatically if an audio excerpt embeds musical content belonging to their catalog. In a recent work, we addressed this problem with a convolutional neural network mapping each track's dominant melody to an embedding vector, and trained to minimize cover pairs distance in the embeddings space, while maximizing it for non-covers. We showed in particular that training this model with enough works having five or more covers yields state-of-the-art results. This however does not reflect the realistic use case, where music catalogs typically contain works with zero or at most one or two covers. We thus introduce here a new test set incorporating these constraints, and propose two contributions to improve our model's accuracy under these stricter conditions: we replace dominant melody with multi-pitch representation as input data, and describe a novel prototypical triplet loss designed to improve covers clustering. We show that these changes improve results significantly for two concrete use cases, large dataset lookup and live songs identification.

Submitted 9 April, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

Comments: Corrections after reviewers comments. Correct erroneous figure 5 in original version

arXiv:1907.01824 [pdf, other]

Cover Detection using Dominant Melody Embeddings

Authors: Guillaume Doras, Geoffroy Peeters

Abstract: Automatic cover detection -- the task of finding in an audio database all the covers of one or several query tracks -- has long been seen as a challenging theoretical problem in the MIR community and as an acute practical problem for authors and composers societies. Original algorithms proposed for this task have proven their accuracy on small datasets, but are unable to scale up to modern real-life audio corpora. On the other hand, faster approaches designed to process thousands of pairwise comparisons resulted in lower accuracy, making them unsuitable for practical use. In this work, we propose a neural network architecture that is trained to represent each track as a single embedding vector. The computation burden is therefore left to the embedding extraction -- that can be conducted offline and stored, while the pairwise comparison task reduces to a simple Euclidean distance computation. We further propose to extract each track's embedding out of its dominant melody representation, obtained by another neural network trained for this task. We then show that this architecture improves state-of-the-art accuracy both on small and large datasets, and is able to scale to query databases of thousands of tracks in a few seconds.

Submitted 3 July, 2019; originally announced July 2019.

Journal ref: 20th International Society for Music Information Retrieval Conference, Delft, The Netherlands, 2019

arXiv:1907.01277 [pdf, other]

doi 10.5281/zenodo.3527766

Conditioned-U-Net: Introducing a Control Mechanism in the U-Net for Multiple Source Separations

Authors: Gabriel Meseguer-Brocal, Geoffroy Peeters

Abstract: Data-driven models for audio source separation such as U-Net or Wave-U-Net are usually models dedicated to and specifically trained for a single task, e.g. a particular instrument isolation. Training them for various tasks at once commonly results in worse performances than training them for a single specialized task. In this work, we introduce the Conditioned-U-Net (C-U-Net) which adds a control mechanism to the standard U-Net. The control mechanism allows us to train a unique and generic U-Net to perform the separation of various instruments. The C-U-Net decides the instrument to isolate according to a one-hot-encoding input vector. The input vector is embedded to obtain the parameters that control Feature-wise Linear Modulation (FiLM) layers. FiLM layers modify the U-Net feature maps in order to separate the desired instrument via affine transformations. The C-U-Net performs different instrument separations, all with a single model achieving the same performances as the dedicated ones at a lower cost.

Submitted 21 November, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

Journal ref: Proceedings of the 20th International Society for Music Information Retrieval Conference, ISMIR, Delft, Netherlands, 2019

arXiv:1906.10606 [pdf, other]

doi 10.5281/zenodo.1492443

DALI: a large Dataset of synchronized Audio, LyrIcs and notes, automatically created using teacher-student machine learning paradigm

Authors: Gabriel Meseguer-Brocal, Alice Cohen-Hadria, Geoffroy Peeters

Abstract: The goal of this paper is twofold. First, we introduce DALI, a large and rich multimodal dataset containing 5358 audio tracks with their time-aligned vocal melody notes and lyrics at four levels of granularity. The second goal is to explain our methodology where dataset creation and learning models interact using a teacher-student machine learning paradigm that benefits each other. We start with a set of manual annotations of draft time-aligned lyrics and notes made by non-expert users of Karaoke games. This set comes without audio. Therefore, we need to find the corresponding audio and adapt the annotations to it. To that end, we retrieve audio candidates from the Web. Each candidate is then turned into a singing-voice probability over time using a teacher, a deep convolutional neural network singing-voice detection system (SVD), trained on cleaned data. Comparing the time-aligned lyrics and the singing-voice probability, we detect matches and update the time-alignment lyrics accordingly. From this, we obtain new audio sets. They are then used to train new SVD students used to perform again the above comparison. The process could be repeated iteratively. We show that this allows us to progressively improve the performances of our SVD and get better audio-matching and alignment.

Submitted 25 June, 2019; originally announced June 2019.

Journal ref: Proceedings of the 19th International Society for Music Information Retrieval Conference, ISMIR, Paris, France, pp. 431-437, 2018

arXiv:1903.01415 [pdf, other]

Improving singing voice separation using Deep U-Net and Wave-U-Net with data augmentation

Authors: Alice Cohen-Hadria, Axel Roebel, Geoffroy Peeters

Abstract: State-of-the-art singing voice separation is based on deep learning making use of CNN structures with skip connections (like U-net model, Wave-U-Net model, or MSDENSELSTM). A key to the success of these models is the availability of a large amount of training data. In the following study, we are interested in singing voice separation for mono signals and compare the U-Net and the Wave-U-Net, which are structurally similar but work on different input representations. First, we report a few results on variations of the U-Net model. Second, we discuss the potential of state-of-the-art speech and music transformation algorithms for augmentation of existing data sets and demonstrate that the effect of these augmentations depends on the signal representations used by the model. The results demonstrate a considerable improvement due to the augmentation for both models. But pitch transposition is the most effective augmentation strategy for the U-Net model, while transposition, time stretching, and formant shifting have a much more balanced effect on the Wave-U-Net model. Finally, we compare the two models on the same dataset.

Submitted 4 March, 2019; originally announced March 2019.

Journal ref: Published in Proceedings of the 27th European Signal Processing Conference (EUSIPCO), 2019

arXiv:1805.01201 [pdf, ps, other]

Single-Channel Blind Source Separation for Singing Voice Detection: A Comparative Study

Authors: Dominique Fourer, Geoffroy Peeters

Abstract: We propose a novel unsupervised singing voice detection method which uses a single-channel Blind Audio Source Separation (BASS) algorithm as a preliminary step. To reach this goal, we investigate three promising BASS approaches which operate through a morphological filtering of the analyzed mixture spectrogram. The contributions of this paper are manifold. First, the investigated BASS methods are reworded with the same formalism and we investigate their respective hyperparameters by numerical simulations. Second, we propose an extension of the KAM method for which we propose a novel training algorithm used to compute a source-specific kernel from a given isolated source signal. Third, the BASS methods are compared together in terms of source separation accuracy and in terms of singing voice detection accuracy when they are used in our new singing voice detection framework. Finally, we do an exhaustive singing voice detection evaluation for which we compare both supervised and unsupervised singing voice detection methods. Our comparison explores different combinations of the proposed BASS methods with new features such as the new proposed KAM features and the scattering transform through a machine learning framework and also considers convolutional neural networks methods.

Submitted 3 May, 2018; originally announced May 2018.

arXiv:1801.10600 [pdf, ps, other]

doi 10.1088/1741-4326/aab22f

Global gyrokinetic simulations of intrinsic rotation in ASDEX Upgrade Ohmic L-mode plasmas

Authors: W. A. Hornsby, C. Angioni, Z. X. Lu, E. Fable, I. Erofeev, R. McDermott, A. Medvedeva, A. Lebschy, A. G. Peeters

Abstract: Non-linear, radially global, turbulence simulations of ASDEX Upgrade (AUG) plasmas are performed and the nonlinearly generated intrinsic flow shows agreement with the intrinsic flow gradients measured in the core of Ohmic L-mode plasmas at nominal parameters. Simulations utilising the kinetic electron model show hollow intrinsic flow profiles as seen in a predominant number of experiments performed at similar plasma parameters. In addition, significantly larger flow gradients are seen than in a previous flux-tube analysis (Hornsby et al., Nucl. Fusion (2017)). Adiabatic electron model simulations can show a flow profile with opposing sign in the gradient with respect to a kinetic electron simulation, implying a reversal in the sign of the residual stress due to kinetic electrons. The shaping of the intrinsic flow is strongly determined by the density gradient profile. The sensitivity of the residual stress to variations in density profile curvature is calculated and seen to be significantly stronger than to neoclassical flows (Hornsby et al., Nucl. Fusion (2017)). This variation is strong enough on its own to explain the large variations in the intrinsic flow gradients seen in some AUG experiments. Analysis of the symmetry breaking properties of the turbulence shows that profile shearing is the dominant mechanism in producing a finite parallel wave-number, with turbulence gradient effects contributing a smaller portion of the parallel wave-vector.

Submitted 31 January, 2018; originally announced January 2018.

arXiv:1701.08095 [pdf, ps, other]

doi 10.1088/1361-6587/aa543a

Experimental observations and modelling of intrinsic rotation reversals in tokamaks

Authors: Y. Camenen, C. Angioni, A. Bortolon, B. P. Duval, E. Fable, W. A. Hornsby, R. M. Mcdermott, D. H. Na, Y-S. Na, A. G. Peeters, J. E. Rice

Abstract: The progress made in understanding spontaneous toroidal rotation reversals in tokamaks is reviewed and current ideas to solve this ten-year-old puzzle are explored. The paper includes a summarial synthesis of the experimental observations in AUG, C-Mod, KSTAR, MAST and TCV tokamaks, reasons why turbulent momentum transport is thought to be responsible for the reversals, a review of the theory of turbulent momentum transport and suggestions for future investigations.

Submitted 27 January, 2017; originally announced January 2017.

Journal ref: Plasma Physics and Controlled Fusion, IOP Publishing, 2017, 59, pp.34001 - 34001

arXiv:1610.08852 [pdf, other]

doi 10.1063/1.4975048

Precession-torque-driven domain-wall motion in out-of-plane materials

Authors: M. J. G. Peeters, F. C. Ummelen, M. L. M. Lalieu, J. -S. Kim, H. J. M. Swagten, B. Koopmans

Abstract: Domain-wall (DW) motion in magnetic nanostrips is intensively studied, in particular because of the possible applications in data storage. In this work, we will investigate a novel method of DW motion using magnetic field pulses, with the precession torque as the driving mechanism. We use a one dimensional (1D) model to show that it is possible to drive DWs in out-of-plane materials using the precession torque, and we identify the key parameters that influence this motion. Because the DW moves back to its initial position at the end of the field pulse, thereby severely complicating direct detection of the DW motion, depinning experiments are used to indirectly observe the effect of the precession torque. The 1D model is extended to include an energy landscape in order to predict the influence of the precession torque in the depinning experiments. Although preliminary experiments did not yet show an effect of the precession torque, our calculations indicate that depinning experiments can be used to demonstrate this novel method of DW motion in out-of-plane materials, which even allows for coherent motion of multiple domains when the Dzyaloshinskii-Moriya interaction is taken into account.

Submitted 23 November, 2016; v1 submitted 27 October, 2016; originally announced October 2016.

arXiv:1507.02841 [pdf, ps, other]

The non-linear evolution of the tearing mode in electromagnetic turbulence using gyrokinetic simulations

Authors: William A Hornsby, Pierluigi Migliano, Rico Buchholz, Stefan Grosshauser, Arne Weikl, David Zarzoso, Francis J Casson, Emanuele Poli, Artur G Peeters

Abstract: The non-linear evolution of a magnetic island is studied using the Vlasov gyro-kinetic code GKW. The interaction of electromagnetic turbulence with a self-consistently growing magnetic island, generated by a tearing unstable $Δ' > 0$ current profile, is considered. The turbulence is able to seed the magnetic island and bypass the linear growth phase by generating structures that are approximately an ion gyro-radius in width. The non-linear evolution of the island width and its rotation frequency, after this seeding phase, is found to be modified and is dependent on the value of the plasma beta and equilibrium pressure gradients. At low values of beta the island evolves largely independent of the turbulence, while at higher values the interaction has a dramatic effect on island growth, causing the island to grow exponentially at the growth rate of its linear phase, even though the island is larger than linear theory validity. The turbulence forces the island to rotate in the ion-diamagnetic direction as opposed to the electron diamagnetic direction in which it rotates when no turbulence is present. In addition, it is found that the mode rotation slows as the island grows in size.

Submitted 10 July, 2015; originally announced July 2015.

arXiv:1408.1345 [pdf, ps, other]

doi 10.1088/0741-3335/57/5/054008

On the radial propagation of turbulence in gyro-kinetic toroidal systems

Authors: P. Migliano, R. Buchholz, S. R. Grosshauser, W. A. Hornsby, A. G. Peeters

Abstract: In this paper a conservation equation is derived for the radially dependent entropy in toroidal geometry using the local approximation of the gyro-kinetic framework. This equation naturally leads to an operative definition for the turbulence intensity. It is shown that the conservation equation can be split in two separate conservation equations, one describing the dynamics of the zonal modes and one for the non-zonal modes. In essence the paper provides an operative tool for both analytic as well as numeric studies of the radial propagation of turbulence in tokamak plasmas.

Submitted 6 August, 2014; originally announced August 2014.

arXiv:1407.7767 [pdf, ps, other]

doi 10.1088/0029-5515/55/1/012002

Effect of turbulence on electron cyclotron current drive and heating in ITER

Authors: F. J. Casson, E. Poli, C. Angioni, R. Buchholz, A. G. Peeters

Abstract: Non-linear local electromagnetic gyrokinetic turbulence simulations of the ITER standard scenario H-mode are presented for the q=3/2 and q=2 surfaces. The turbulent transport is examined in regions of velocity space characteristic of electrons heated by electron cyclotron waves. Electromagnetic fluctuations and sub-dominant micro-tearing modes are found to contribute significantly to the transport of the accelerated electrons, even though they have only a small impact on the transport of the bulk species. The particle diffusivity for resonant passing electrons is found to be less than 0.15 m^2/s, and their heat conductivity is found to be less than 2 m^2/s. Implications for the broadening of the current drive and energy deposition in ITER are discussed.

Submitted 29 July, 2014; originally announced July 2014.

Comments: Letter, 5 pages, 5 figures, for submission to Nuclear Fusion

Journal ref: Nucl. Fusion 55 (2015) 012002

arXiv:1306.4557 [pdf, ps, other]

doi 10.1088/0029-5515/53/3/033007

Analysis of Lithium Driven Electron Density Peaking in FTU Liquid Lithium Limiter Experiments

Authors: G. Szepesi, M. Romanelli, F. Militello, A. G. Peeters, Y. Camenen, F. J. Casson, W. A. Hornsby, A. P. Snodin, D. Wagner, FTU team

Abstract: The impact of lithium impurities on the microstability and turbulent transport characteristics in the core of a typical FTU Liquid Lithium Limiter (LLL) (Mazzitelli et al., Nucl. Fusion, 2011) discharge during the density ramp-up phase is studied. A non-linear gyrokinetic analysis performed with GKW (Peeters et al., Comp. Phys. Comm., 2009) accompanied by a quasi-linear fluid analysis is presented. We show that a centrally peaked, high concentration lithium profile contributes to the electron peaking by reducing the outward electron flux, and that it leads to inward turbulent deuterium transport through ion flux separation.

Submitted 19 June, 2013; originally announced June 2013.

Comments: 23 pages, 12 figures

Journal ref: Nuclear Fusion 53 033007 (2013)

arXiv:1302.6453 [pdf, ps, other]

doi 10.1063/1.4799750

Toroidal momentum transport in a tokamak caused by symmetry breaking parallel derivatives

Authors: Tobias Sung, Rico Buchholz, Francis Casson, Emiliano Fable, Stefan R. Grosshauser, William Hornsby, Pierluigi Migliano, Arthur G. Peeters

Abstract: A new mechanism for toroidal momentum transport in a tokamak is investigated using the gyro-kinetic model. First, an analytic model is developed through the use of the ballooning transform. The terms that generate the momentum transport are then connected with the poloidal derivative of the ballooning envelope, which are one order smaller in the normalised Larmor radius, compared with the derivative of the eikonal. The mechanism, therefore, does not introduce an inhomogeneity in the radial direction, in contrast with the effect of profile shearing. Numerical simulations of the linear ion temperature gradient mode with adiabatic electrons, retaining the finite ρ* effects in the ExB velocity, the drift, and the gyro-average, are presented. The momentum flux is found to be linear in the normalised Larmor radius (ρ*) but is, nevertheless, generating a sizeable counter-current rotation. The total momentum flux scales linearly with the aspect ratio of the considered magnetic surface, and increases with increasing magnetic shear, safety factor, and density and temperature gradients.

Submitted 26 February, 2013; originally announced February 2013.

Journal ref: Phys. Plasmas 20, 042506 (2013)

arXiv:1102.3717 [pdf, other]

doi 10.1063/1.3586332

Up-down symmetry of the turbulent transport of toroidal angular momentum in tokamaks

Authors: Felix I. Parra, Michael Barnes, Arthur G. Peeters

Abstract: Two symmetries of the local nonlinear delta-f gyrokinetic system of equations in tokamaks in the high flow regime are presented. The turbulent transport of toroidal angular momentum changes sign under an up-down reflection of the tokamak and a sign change of both the rotation and the rotation shear. Thus, the turbulent transport of toroidal angular momentum must vanish for up-down symmetric tokamaks in the absence of both rotation and rotation shear. This has important implications for the modeling of spontaneous rotation.

Submitted 27 June, 2011; v1 submitted 17 February, 2011; originally announced February 2011.

Comments: 15 pages, 2 figures

Journal ref: Phys. Plasmas 18, 062501 (2011)

arXiv:physics/0701185 [pdf, ps, other]

doi 10.1088/0029-5515/47/9/035

On the extrapolation to ITER of discharges in present tokamaks

Authors: A. G. Peeters, C. Angioni, A. C. C. Sips

Abstract: An expression for the extrapolated fusion gain G = Pfusion /5 Pheat (Pfusion being the total fusion power and Pheat the total heating power) of ITER in terms of the confinement improvement factor (H) and the normalised beta (betaN) is derived in this paper. It is shown that an increase in normalised beta can be expected to have a negative or neutral influence on G depending on the chosen confinement scaling law. Figures of merit like H betaN / q95^2 should be used with care, since large values of this quantity do not guarantee high values of G, and might not be attainable with the heating power installed on ITER.

Submitted 16 January, 2007; originally announced January 2007.

Comments: 6 Pages, 3 figures, Submitted to Nuclear Fusion on the 29th of November 2006

arXiv:physics/0701147 [pdf, ps, other]

The toroidal momentum pinch velocity

Authors: A. G. Peeters, C. Angioni, D. Strintzi

Abstract: In this letter a pinch velocity of toroidal momentum is shown to exist for the first time. Using the gyro-kinetic equations in the frame moving with the equilibrium toroidal velocity, it is shown that the physics effect can be elegantly formulated through the "Coriolis" drift. A fluid model is used to highlight the main coupling mechanisms between the density and temperature perturbations on the one hand and the perturbed parallel flow on the other. Gyro-kinetic calculations are used to accurately assess the magnitude of the pinch. The pinch velocity leads to a radial gradient of the toroidal velocity profile even in the absence of a torque on the plasma. It is shown to be sizeable in the plasmas of the International Thermonuclear Experimental Reactor (ITER) leading to a moderately peaked rotation profile. Finally, the pinch also affects the interpretation of current experiments.

Submitted 12 January, 2007; originally announced January 2007.

Comments: 4 Pages, 2 Figures. Submitted to Phys. Rev. Letters on the 17th of Nov. 2006

Showing 1–37 of 37 results for author: Peeters, G