-
Speech dereverberation constrained on room impulse response characteristics
Authors:
Louis Bahrman,
Mathieu Fontaine,
Jonathan Le Roux,
Gaël Richard
Abstract:
Single-channel speech dereverberation aims at extracting a dry speech signal from a recording affected by the acoustic reflections in a room. However, most current deep learning-based approaches for speech dereverberation are not interpretable for room acoustics, and can be considered as black-box systems in that regard. In this work, we address this problem by regularizing the training loss using…
▽ More
Single-channel speech dereverberation aims at extracting a dry speech signal from a recording affected by the acoustic reflections in a room. However, most current deep learning-based approaches for speech dereverberation are not interpretable for room acoustics, and can be considered as black-box systems in that regard. In this work, we address this problem by regularizing the training loss using a novel physical coherence loss which encourages the room impulse response (RIR) induced by the dereverberated output of the model to match the acoustic properties of the room in which the signal was recorded. Our investigation demonstrates the preservation of the original dereverberated signal alongside the provision of a more physically coherent RIR.
△ Less
Submitted 10 July, 2024;
originally announced July 2024.
-
Winner-takes-all learners are geometry-aware conditional density estimators
Authors:
Victor Letzelter,
David Perera,
Cédric Rommel,
Mathieu Fontaine,
Slim Essid,
Gael Richard,
Patrick Pérez
Abstract:
Winner-takes-all training is a simple learning paradigm, which handles ambiguous tasks by predicting a set of plausible hypotheses. Recently, a connection was established between Winner-takes-all training and centroidal Voronoi tessellations, showing that, once trained, hypotheses should quantize optimally the shape of the conditional distribution to predict. However, the best use of these hypothe…
▽ More
Winner-takes-all training is a simple learning paradigm, which handles ambiguous tasks by predicting a set of plausible hypotheses. Recently, a connection was established between Winner-takes-all training and centroidal Voronoi tessellations, showing that, once trained, hypotheses should quantize optimally the shape of the conditional distribution to predict. However, the best use of these hypotheses for uncertainty quantification is still an open question. In this work, we show how to leverage the appealing geometric properties of the Winner-takes-all learners for conditional density estimation, without modifying its original training scheme. We theoretically establish the advantages of our novel estimator both in terms of quantization and density estimation, and we demonstrate its competitiveness on synthetic and real-world datasets, including audio data.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
A lightweight dual-stage framework for personalized speech enhancement based on DeepFilterNet2
Authors:
Thomas Serre,
Mathieu Fontaine,
Éric Benhaim,
Geoffroy Dutour,
Slim Essid
Abstract:
Isolating the desired speaker's voice amidst multiplespeakers in a noisy acoustic context is a challenging task. Per-sonalized speech enhancement (PSE) endeavours to achievethis by leveraging prior knowledge of the speaker's voice.Recent research efforts have yielded promising PSE mod-els, albeit often accompanied by computationally intensivearchitectures, unsuitable for resource-constrained embed…
▽ More
Isolating the desired speaker's voice amidst multiplespeakers in a noisy acoustic context is a challenging task. Per-sonalized speech enhancement (PSE) endeavours to achievethis by leveraging prior knowledge of the speaker's voice.Recent research efforts have yielded promising PSE mod-els, albeit often accompanied by computationally intensivearchitectures, unsuitable for resource-constrained embeddeddevices. In this paper, we introduce a novel method to per-sonalize a lightweight dual-stage Speech Enhancement (SE)model and implement it within DeepFilterNet2, a SE modelrenowned for its state-of-the-art performance. We seek anoptimal integration of speaker information within the model,exploring different positions for the integration of the speakerembeddings within the dual-stage enhancement architec-ture. We also investigate a tailored training strategy whenadapting DeepFilterNet2 to a PSE task. We show that ourpersonalization method greatly improves the performancesof DeepFilterNet2 while preserving minimal computationaloverhead.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model
Authors:
Haocheng Liu,
Teysir Baoueb,
Mathieu Fontaine,
Jonathan Le Roux,
Gael Richard
Abstract:
Diffusion models are receiving a growing interest for a variety of signal generation tasks such as speech or music synthesis. WaveGrad, for example, is a successful diffusion model that conditionally uses the mel spectrogram to guide a diffusion process for the generation of high-fidelity audio. However, such models face important challenges concerning the noise diffusion process for training and…
▽ More
Diffusion models are receiving a growing interest for a variety of signal generation tasks such as speech or music synthesis. WaveGrad, for example, is a successful diffusion model that conditionally uses the mel spectrogram to guide a diffusion process for the generation of high-fidelity audio. However, such models face important challenges concerning the noise diffusion process for training and inference, and they have difficulty generating high-quality speech for speakers that were not seen during training. With the aim of minimizing the conditioning error and increasing the efficiency of the noise diffusion process, we propose in this paper a new scheme called GLA-Grad, which consists in introducing a phase recovery algorithm such as the Griffin-Lim algorithm (GLA) at each step of the regular diffusion process. Furthermore, it can be directly applied to an already-trained waveform generation model, without additional training or fine-tuning. We show that our algorithm outperforms state-of-the-art diffusion models for speech generation, especially when generating speech for a previously unseen target speaker.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis
Authors:
Teysir Baoueb,
Haocheng Liu,
Mathieu Fontaine,
Jonathan Le Roux,
Gael Richard
Abstract:
Generative adversarial network (GAN) models can synthesize highquality audio signals while ensuring fast sample generation. However, they are difficult to train and are prone to several issues including mode collapse and divergence. In this paper, we introduce SpecDiff-GAN, a neural vocoder based on HiFi-GAN, which was initially devised for speech synthesis from mel spectrogram. In our model, the…
▽ More
Generative adversarial network (GAN) models can synthesize highquality audio signals while ensuring fast sample generation. However, they are difficult to train and are prone to several issues including mode collapse and divergence. In this paper, we introduce SpecDiff-GAN, a neural vocoder based on HiFi-GAN, which was initially devised for speech synthesis from mel spectrogram. In our model, the training stability is enhanced by means of a forward diffusion process which consists in injecting noise from a Gaussian distribution to both real and fake samples before inputting them to the discriminator. We further improve the model by exploiting a spectrally-shaped noise distribution with the aim to make the discriminator's task more challenging. We then show the merits of our proposed model for speech and music synthesis on several datasets. Our experiments confirm that our model compares favorably in audio quality and efficiency compared to several baselines.
△ Less
Submitted 30 January, 2024;
originally announced February 2024.
-
Online speaker diarization of meetings guided by speech separation
Authors:
Elio Gruttadauria,
Mathieu Fontaine,
Slim Essid
Abstract:
Overlapped speech is notoriously problematic for speaker diarization systems. Consequently, the use of speech separation has recently been proposed to improve their performance. Although promising, speech separation models struggle with realistic data because they are trained on simulated mixtures with a fixed number of speakers. In this work, we introduce a new speech separation-guided diarizatio…
▽ More
Overlapped speech is notoriously problematic for speaker diarization systems. Consequently, the use of speech separation has recently been proposed to improve their performance. Although promising, speech separation models struggle with realistic data because they are trained on simulated mixtures with a fixed number of speakers. In this work, we introduce a new speech separation-guided diarization scheme suitable for the online speaker diarization of long meeting recordings with a variable number of speakers, as present in the AMI corpus. We envisage ConvTasNet and DPRNN as alternatives for the separation networks, with two or three output sources. To obtain the speaker diarization result, voice activity detection is applied on each estimated source. The final model is fine-tuned end-to-end, after first adapting the separation to real data using AMI. The system operates on short segments, and inference is performed by stitching the local predictions using speaker embeddings and incremental clustering. The results show that our system improves the state-of-the-art on the AMI headset mix, using no oracle information and under full evaluation (no collar and including overlapped speech). Finally, we show the strength of our system particularly on overlapped speech sections.
△ Less
Submitted 30 January, 2024;
originally announced February 2024.
-
Quality-Diversity Generative Sampling for Learning with Synthetic Data
Authors:
Allen Chang,
Matthew C. Fontaine,
Serena Booth,
Maja J. Matarić,
Stefanos Nikolaidis
Abstract:
Generative models can serve as surrogates for some real data sources by creating synthetic training datasets, but in doing so they may transfer biases to downstream tasks. We focus on protecting quality and diversity when generating synthetic training datasets. We propose quality-diversity generative sampling (QDGS), a framework for sampling data uniformly across a user-defined measure space, desp…
▽ More
Generative models can serve as surrogates for some real data sources by creating synthetic training datasets, but in doing so they may transfer biases to downstream tasks. We focus on protecting quality and diversity when generating synthetic training datasets. We propose quality-diversity generative sampling (QDGS), a framework for sampling data uniformly across a user-defined measure space, despite the data coming from a biased generator. QDGS is a model-agnostic framework that uses prompt guidance to optimize a quality objective across measures of diversity for synthetically generated data, without fine-tuning the generative model. Using balanced synthetic datasets generated by QDGS, we first debias classifiers trained on color-biased shape datasets as a proof-of-concept. By applying QDGS to facial data synthesis, we prompt for desired semantic concepts, such as skin tone and age, to create an intersectional dataset with a combined blend of visual features. Leveraging this balanced data for training classifiers improves fairness while maintaining accuracy on facial recognition benchmarks. Code available at: https://github.com/Cylumn/qd-generative-sampling.
△ Less
Submitted 27 February, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Density Descent for Diversity Optimization
Authors:
David H. Lee,
Anishalakshmi V. Palaparthi,
Matthew C. Fontaine,
Bryon Tjanaka,
Stefanos Nikolaidis
Abstract:
Diversity optimization seeks to discover a set of solutions that elicit diverse features. Prior work has proposed Novelty Search (NS), which, given a current set of solutions, seeks to expand the set by finding points in areas of low density in the feature space. However, to estimate density, NS relies on a heuristic that considers the k-nearest neighbors of the search point in the feature space,…
▽ More
Diversity optimization seeks to discover a set of solutions that elicit diverse features. Prior work has proposed Novelty Search (NS), which, given a current set of solutions, seeks to expand the set by finding points in areas of low density in the feature space. However, to estimate density, NS relies on a heuristic that considers the k-nearest neighbors of the search point in the feature space, which yields a weaker stability guarantee. We propose Density Descent Search (DDS), an algorithm that explores the feature space via CMA-ES on a continuous density estimate of the feature space that also provides a stronger stability guarantee. We experiment with DDS and two density estimation methods: kernel density estimation (KDE) and continuous normalizing flow (CNF). On several standard diversity optimization benchmarks, DDS outperforms NS, the recently proposed MAP-Annealing algorithm, and other state-of-the-art baselines. Additionally, we prove that DDS with KDE provides stronger stability guarantees than NS, making it more suitable for adaptive optimizers. Furthermore, we prove that NS is a special case of DDS that descends a KDE of the feature space.
△ Less
Submitted 30 May, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Symmetry, topology, duality, chirality, and criticality in a spin-1/2 XXZ ladder with a four-spin interaction
Authors:
Mateo Fontaine,
Koudai Sugimoto,
Shunsuke Furukawa
Abstract:
We study the ground-state phase diagram of a spin-1/2 XXZ model with a chirality-chirality interaction (CCI) on a two-leg ladder. This model offers a minimal setup to study an interplay between spin and chirality degrees of freedom. The spin-chirality duality transformation allows us to relate the regimes of weak and strong CCIs. By applying the Abelian bosonization and the duality, we obtain a ri…
▽ More
We study the ground-state phase diagram of a spin-1/2 XXZ model with a chirality-chirality interaction (CCI) on a two-leg ladder. This model offers a minimal setup to study an interplay between spin and chirality degrees of freedom. The spin-chirality duality transformation allows us to relate the regimes of weak and strong CCIs. By applying the Abelian bosonization and the duality, we obtain a rich phase diagram that contains distinct gapped featureless and ordered phases. In particular, Neel and vector chiral orders appear for easy-axis anisotropy, while two distinct symmetry protected topological (SPT) phases appear for easy-plane anisotropy. The two SPT phases can be viewed as twisted variants of the Haldane phase. We also present an effective description in terms of (spinor) hard-core bosons, which reveals critical behavior on the self-dual line in the easy-axis and easy-plane regimes. We perform numerical simulations to confirm the predicted phase structure and critical properties. We further demonstrate that the two SPT phases and a trivial phase are distinguished by topological indices in the presence of certain symmetries. A similar phase structure is expected in a spin-1/2 XXZ ladder with four-spin ring exchange.
△ Less
Submitted 7 April, 2024; v1 submitted 12 November, 2023;
originally announced November 2023.
-
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis
Authors:
Victor Letzelter,
Mathieu Fontaine,
Mickaël Chen,
Patrick Pérez,
Slim Essid,
Gaël Richard
Abstract:
We introduce Resilient Multiple Choice Learning (rMCL), an extension of the MCL approach for conditional distribution estimation in regression settings where multiple targets may be sampled for each training input. Multiple Choice Learning is a simple framework to tackle multimodal density estimation, using the Winner-Takes-All (WTA) loss for a set of hypotheses. In regression settings, the existi…
▽ More
We introduce Resilient Multiple Choice Learning (rMCL), an extension of the MCL approach for conditional distribution estimation in regression settings where multiple targets may be sampled for each training input. Multiple Choice Learning is a simple framework to tackle multimodal density estimation, using the Winner-Takes-All (WTA) loss for a set of hypotheses. In regression settings, the existing MCL variants focus on merging the hypotheses, thereby eventually sacrificing the diversity of the predictions. In contrast, our method relies on a novel learned scoring scheme underpinned by a mathematical framework based on Voronoi tessellations of the output space, from which we can derive a probabilistic interpretation. After empirically validating rMCL with experiments on synthetic data, we further assess its merits on the sound source localization problem, demonstrating its practical usefulness and the relevance of its interpretation.
△ Less
Submitted 16 November, 2023; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Arbitrarily Scalable Environment Generators via Neural Cellular Automata
Authors:
Yulun Zhang,
Matthew C. Fontaine,
Varun Bhatt,
Stefanos Nikolaidis,
Jiaoyang Li
Abstract:
We study the problem of generating arbitrarily large environments to improve the throughput of multi-robot systems. Prior work proposes Quality Diversity (QD) algorithms as an effective method for optimizing the environments of automated warehouses. However, these approaches optimize only relatively small environments, falling short when it comes to replicating real-world warehouse sizes. The chal…
▽ More
We study the problem of generating arbitrarily large environments to improve the throughput of multi-robot systems. Prior work proposes Quality Diversity (QD) algorithms as an effective method for optimizing the environments of automated warehouses. However, these approaches optimize only relatively small environments, falling short when it comes to replicating real-world warehouse sizes. The challenge arises from the exponential increase in the search space as the environment size increases. Additionally, the previous methods have only been tested with up to 350 robots in simulations, while practical warehouses could host thousands of robots. In this paper, instead of optimizing environments, we propose to optimize Neural Cellular Automata (NCA) environment generators via QD algorithms. We train a collection of NCA generators with QD algorithms in small environments and then generate arbitrarily large environments from the generators at test time. We show that NCA environment generators maintain consistent, regularized patterns regardless of environment size, significantly enhancing the scalability of multi-robot systems in two different domains with up to 2,350 robots. Additionally, we demonstrate that our method scales a single-agent reinforcement learning policy to arbitrarily large environments with similar patterns. We include the source code at \url{https://github.com/lunjohnzhang/warehouse_env_gen_nca_public}.
△ Less
Submitted 28 October, 2023;
originally announced October 2023.
-
Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning
Authors:
Sumeet Batra,
Bryon Tjanaka,
Matthew C. Fontaine,
Aleksei Petrenko,
Stefanos Nikolaidis,
Gaurav Sukhatme
Abstract:
Training generally capable agents that thoroughly explore their environment and learn new and diverse skills is a long-term goal of robot learning. Quality Diversity Reinforcement Learning (QD-RL) is an emerging research area that blends the best aspects of both fields -- Quality Diversity (QD) provides a principled form of exploration and produces collections of behaviorally diverse agents, while…
▽ More
Training generally capable agents that thoroughly explore their environment and learn new and diverse skills is a long-term goal of robot learning. Quality Diversity Reinforcement Learning (QD-RL) is an emerging research area that blends the best aspects of both fields -- Quality Diversity (QD) provides a principled form of exploration and produces collections of behaviorally diverse agents, while Reinforcement Learning (RL) provides a powerful performance improvement operator enabling generalization across tasks and dynamic environments. Existing QD-RL approaches have been constrained to sample efficient, deterministic off-policy RL algorithms and/or evolution strategies, and struggle with highly stochastic environments. In this work, we, for the first time, adapt on-policy RL, specifically Proximal Policy Optimization (PPO), to the Differentiable Quality Diversity (DQD) framework and propose additional improvements over prior work that enable efficient optimization and discovery of novel skills on challenging locomotion tasks. Our new algorithm, Proximal Policy Gradient Arborescence (PPGA), achieves state-of-the-art results, including a 4x improvement in best reward over baselines on the challenging humanoid domain.
△ Less
Submitted 29 January, 2024; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Multi-Robot Coordination and Layout Design for Automated Warehousing
Authors:
Yulun Zhang,
Matthew C. Fontaine,
Varun Bhatt,
Stefanos Nikolaidis,
Jiaoyang Li
Abstract:
With the rapid progress in Multi-Agent Path Finding (MAPF), researchers have studied how MAPF algorithms can be deployed to coordinate hundreds of robots in large automated warehouses. While most works try to improve the throughput of such warehouses by develo** better MAPF algorithms, we focus on improving the throughput by optimizing the warehouse layout. We show that, even with state-of-the-a…
▽ More
With the rapid progress in Multi-Agent Path Finding (MAPF), researchers have studied how MAPF algorithms can be deployed to coordinate hundreds of robots in large automated warehouses. While most works try to improve the throughput of such warehouses by develo** better MAPF algorithms, we focus on improving the throughput by optimizing the warehouse layout. We show that, even with state-of-the-art MAPF algorithms, commonly used human-designed layouts can lead to congestion for warehouses with large numbers of robots and thus have limited scalability. We extend existing automatic scenario generation methods to optimize warehouse layouts. Results show that our optimized warehouse layouts (1) reduce traffic congestion and thus improve throughput, (2) improve the scalability of the automated warehouses by doubling the number of robots in some cases, and (3) are capable of generating layouts with user-specified diversity measures. We include the source code at: https://github.com/lunjohnzhang/warehouse_env_gen_public
△ Less
Submitted 2 September, 2023; v1 submitted 10 May, 2023;
originally announced May 2023.
-
Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Source Positions
Authors:
Diego Di Carlo,
Aditya Arie Nugraha,
Mathieu Fontaine,
Mathieu Fontaine,
Kazuyoshi Yoshii
Abstract:
We address the problem of accurately interpolating measured anechoic steering vectors with a deep learning framework called the neural field. This task plays a pivotal role in reducing the resource-intensive measurements required for precise sound source separation and localization, essential as the front-end of speech recognition. Classical approaches to interpolation rely on linear weighting of…
▽ More
We address the problem of accurately interpolating measured anechoic steering vectors with a deep learning framework called the neural field. This task plays a pivotal role in reducing the resource-intensive measurements required for precise sound source separation and localization, essential as the front-end of speech recognition. Classical approaches to interpolation rely on linear weighting of nearby measurements in space on a fixed, discrete set of frequencies. Drawing inspiration from the success of neural fields for novel view synthesis in computer vision, we introduce the neural steerer, a continuous complex-valued function that takes both frequency and direction as input and produces the corresponding steering vector. Importantly, it incorporates inter-channel phase difference information and a regularization term enforcing filter causality, essential for accurate steering vector modeling. Our experiments, conducted using a dataset of real measured steering vectors, demonstrate the effectiveness of our resolution-free model in interpolating such measurements.
△ Less
Submitted 1 March, 2024; v1 submitted 7 May, 2023;
originally announced May 2023.
-
Surrogate Assisted Generation of Human-Robot Interaction Scenarios
Authors:
Varun Bhatt,
Heramb Nemlekar,
Matthew C. Fontaine,
Bryon Tjanaka,
Hejia Zhang,
Ya-Chuan Hsu,
Stefanos Nikolaidis
Abstract:
As human-robot interaction (HRI) systems advance, so does the difficulty of evaluating and understanding the strengths and limitations of these systems in different environments and with different users. To this end, previous methods have algorithmically generated diverse scenarios that reveal system failures in a shared control teleoperation task. However, these methods require directly evaluatin…
▽ More
As human-robot interaction (HRI) systems advance, so does the difficulty of evaluating and understanding the strengths and limitations of these systems in different environments and with different users. To this end, previous methods have algorithmically generated diverse scenarios that reveal system failures in a shared control teleoperation task. However, these methods require directly evaluating generated scenarios by simulating robot policies and human actions. The computational cost of these evaluations limits their applicability in more complex domains. Thus, we propose augmenting scenario generation systems with surrogate models that predict both human and robot behaviors. In the shared control teleoperation domain and a more complex shared workspace collaboration task, we show that surrogate assisted scenario generation efficiently synthesizes diverse datasets of challenging scenarios. We demonstrate that these failures are reproducible in real-world interactions.
△ Less
Submitted 31 October, 2023; v1 submitted 26 April, 2023;
originally announced April 2023.
-
The James Webb Space Telescope Mission
Authors:
Jonathan P. Gardner,
John C. Mather,
Randy Abbott,
James S. Abell,
Mark Abernathy,
Faith E. Abney,
John G. Abraham,
Roberto Abraham,
Yasin M. Abul-Huda,
Scott Acton,
Cynthia K. Adams,
Evan Adams,
David S. Adler,
Maarten Adriaensen,
Jonathan Albert Aguilar,
Mansoor Ahmed,
Nasif S. Ahmed,
Tanjira Ahmed,
Rüdeger Albat,
Loïc Albert,
Stacey Alberts,
David Aldridge,
Mary Marsha Allen,
Shaune S. Allen,
Martin Altenburg
, et al. (983 additional authors not shown)
Abstract:
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono…
▽ More
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
pyribs: A Bare-Bones Python Library for Quality Diversity Optimization
Authors:
Bryon Tjanaka,
Matthew C. Fontaine,
David H. Lee,
Yulun Zhang,
Nivedit Reddy Balam,
Nathaniel Dennler,
Sujay S. Garlanka,
Nikitas Dimitri Klapsis,
Stefanos Nikolaidis
Abstract:
Recent years have seen a rise in the popularity of quality diversity (QD) optimization, a branch of optimization that seeks to find a collection of diverse, high-performing solutions to a given problem. To grow further, we believe the QD community faces two challenges: develo** a framework to represent the field's growing array of algorithms, and implementing that framework in software that supp…
▽ More
Recent years have seen a rise in the popularity of quality diversity (QD) optimization, a branch of optimization that seeks to find a collection of diverse, high-performing solutions to a given problem. To grow further, we believe the QD community faces two challenges: develo** a framework to represent the field's growing array of algorithms, and implementing that framework in software that supports a range of researchers and practitioners. To address these challenges, we have developed pyribs, a library built on a highly modular conceptual QD framework. By replacing components in the conceptual framework, and hence in pyribs, users can compose algorithms from across the QD literature; equally important, they can identify unexplored algorithm variations. Furthermore, pyribs makes this framework simple, flexible, and accessible, with a user-friendly API supported by extensive documentation and tutorials. This paper overviews the creation of pyribs, focusing on the conceptual framework that it implements and the design principles that have guided the library's development.
△ Less
Submitted 14 April, 2023; v1 submitted 28 February, 2023;
originally announced March 2023.
-
Design Project of an Open-Source, Low-Cost, and Lightweight Robotic Manipulator for High School Students
Authors:
Isabella Huang,
Qianwen Zhao,
Maxine Fontaine,
Long Wang
Abstract:
In recent years, there is an increasing interest in high school robotics extracurriculars such as robotics clubs and robotics competitions. The growing demand is a result of more ubiquitous open-source software and affordable off-the-shelf hardware kits, which significantly help lower the barrier for entry-level robotics hobbyists. In this project, we present an open-source, low-cost, and lightwei…
▽ More
In recent years, there is an increasing interest in high school robotics extracurriculars such as robotics clubs and robotics competitions. The growing demand is a result of more ubiquitous open-source software and affordable off-the-shelf hardware kits, which significantly help lower the barrier for entry-level robotics hobbyists. In this project, we present an open-source, low-cost, and lightweight robotic manipulator designed and developed by a high school researcher under the guidance of a university faculty and a Ph.D. student. We believe the presented project is suitable for high school robotics research and educational activities. Our open-source package consists of mechanical design models, mechatronics specifications, and software program source codes. The mechanical design models include CAD (Computer Aided Design) files that are ready for prototy** (3D printing technology) and serve as an assembly guide accommodated with a complete bill of materials. Electrical wiring diagrams and low-level controllers are documented in detail as part of the open-source software package. The educational objective of this project is to enable high school student teams to replicate and build a robotic manipulator. The engineering experience that high school students acquire in the proposed project is full-stack, including mechanical design, mechatronics, and programming. The project significantly enriches their hands-on engineering experience in a project-based environment. Throughout this project, we discovered that the high school researcher was able to apply multidisciplinary knowledge from K-12 STEM courses to build the robotic manipulator. The researcher was able to go through a system engineering design and development process and obtain skills to use professional engineering tools including SolidWorks and Arduino microcontrollers.
△ Less
Submitted 16 March, 2023; v1 submitted 21 February, 2023;
originally announced February 2023.
-
Artificial Intelligence and Natural Language Processing and Understanding in Space: A Methodological Framework and Four ESA Case Studies
Authors:
José Manuel Gómez-Pérez,
Andrés García-Silva,
Rosemarie Leone,
Mirko Albani,
Moritz Fontaine,
Charles Poncet,
Leopold Summerer,
Alessandro Donati,
Ilaria Roma,
Stefano Scaglioni
Abstract:
The European Space Agency is well known as a powerful force for scientific discovery in numerous areas related to Space. The amount and depth of the knowledge produced throughout the different missions carried out by ESA and their contribution to scientific progress is enormous, involving large collections of documents like scientific publications, feasibility studies, technical reports, and quali…
▽ More
The European Space Agency is well known as a powerful force for scientific discovery in numerous areas related to Space. The amount and depth of the knowledge produced throughout the different missions carried out by ESA and their contribution to scientific progress is enormous, involving large collections of documents like scientific publications, feasibility studies, technical reports, and quality management procedures, among many others. Through initiatives like the Open Space Innovation Platform, ESA also acts as a hub for new ideas coming from the wider community across different challenges, contributing to a virtuous circle of scientific discovery and innovation. Handling such wealth of information, of which large part is unstructured text, is a colossal task that goes beyond human capabilities, hence requiring automation. In this paper, we present a methodological framework based on artificial intelligence and natural language processing and understanding to automatically extract information from Space documents, generating value from it, and illustrate such framework through several case studies implemented across different functional areas of ESA, including Mission Design, Quality Assurance, Long-Term Data Preservation, and the Open Space Innovation Platform. In doing so, we demonstrate the value of these technologies in several tasks ranging from effortlessly searching and recommending Space information to automatically determining how innovative an idea can be, answering questions about Space, and generating quizzes regarding quality procedures. Each of these accomplishments represents a step forward in the application of increasingly intelligent AI systems in Space, from structuring and facilitating information access to intelligent systems capable to understand and reason with such information.
△ Less
Submitted 24 October, 2022; v1 submitted 7 October, 2022;
originally announced October 2022.
-
Training Diverse High-Dimensional Controllers by Scaling Covariance Matrix Adaptation MAP-Annealing
Authors:
Bryon Tjanaka,
Matthew C. Fontaine,
David H. Lee,
Aniruddha Kalkar,
Stefanos Nikolaidis
Abstract:
Pre-training a diverse set of neural network controllers in simulation has enabled robots to adapt online to damage in robot locomotion tasks. However, finding diverse, high-performing controllers requires expensive network training and extensive tuning of a large number of hyperparameters. On the other hand, Covariance Matrix Adaptation MAP-Annealing (CMA-MAE), an evolution strategies (ES)-based…
▽ More
Pre-training a diverse set of neural network controllers in simulation has enabled robots to adapt online to damage in robot locomotion tasks. However, finding diverse, high-performing controllers requires expensive network training and extensive tuning of a large number of hyperparameters. On the other hand, Covariance Matrix Adaptation MAP-Annealing (CMA-MAE), an evolution strategies (ES)-based quality diversity algorithm, does not have these limitations and has achieved state-of-the-art performance on standard QD benchmarks. However, CMA-MAE cannot scale to modern neural network controllers due to its quadratic complexity. We leverage efficient approximation methods in ES to propose three new CMA-MAE variants that scale to high dimensions. Our experiments show that the variants outperform ES-based baselines in benchmark robotic locomotion tasks, while being comparable with or exceeding state-of-the-art deep reinforcement learning-based quality diversity algorithms.
△ Less
Submitted 15 September, 2023; v1 submitted 5 October, 2022;
originally announced October 2022.
-
DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF
Authors:
Aditya Arie Nugraha,
Kouhei Sekiguchi,
Mathieu Fontaine,
Yoshiaki Bando,
Kazuyoshi Yoshii
Abstract:
This paper describes a practical dual-process speech enhancement system that adapts environment-sensitive frame-online beamforming (front-end) with help from environment-free block-online source separation (back-end). To use minimum variance distortionless response (MVDR) beamforming, one may train a deep neural network (DNN) that estimates time-frequency masks used for computing the covariance ma…
▽ More
This paper describes a practical dual-process speech enhancement system that adapts environment-sensitive frame-online beamforming (front-end) with help from environment-free block-online source separation (back-end). To use minimum variance distortionless response (MVDR) beamforming, one may train a deep neural network (DNN) that estimates time-frequency masks used for computing the covariance matrices of sources (speech and noise). Backpropagation-based run-time adaptation of the DNN was proposed for dealing with the mismatched training-test conditions. Instead, one may try to directly estimate the source covariance matrices with a state-of-the-art blind source separation method called fast multichannel non-negative matrix factorization (FastMNMF). In practice, however, neither the DNN nor the FastMNMF can be updated in a frame-online manner due to its computationally-expensive iterative nature. Our DNN-free system leverages the posteriors of the latest source spectrograms given by block-online FastMNMF to derive the current source covariance matrices for frame-online beamforming. The evaluation shows that our frame-online system can quickly respond to scene changes caused by interfering speaker movements and outperformed an existing block-online system with DNN-based beamforming by 5.0 points in terms of the word error rate.
△ Less
Submitted 22 July, 2022;
originally announced July 2022.
-
Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
Authors:
Kouhei Sekiguchi,
Aditya Arie Nugraha,
Yicheng Du,
Yoshiaki Bando,
Mathieu Fontaine,
Kazuyoshi Yoshii
Abstract:
This paper describes the practical response- and performance-aware development of online speech enhancement for an augmented reality (AR) headset that helps a user understand conversations made in real noisy echoic environments (e.g., cocktail party). One may use a state-of-the-art blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) that works well i…
▽ More
This paper describes the practical response- and performance-aware development of online speech enhancement for an augmented reality (AR) headset that helps a user understand conversations made in real noisy echoic environments (e.g., cocktail party). One may use a state-of-the-art blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) that works well in various environments thanks to its unsupervised nature. Its heavy computational cost, however, prevents its application to real-time processing. In contrast, a supervised beamforming method that uses a deep neural network (DNN) for estimating spatial information of speech and noise readily fits real-time processing, but suffers from drastic performance degradation in mismatched conditions. Given such complementary characteristics, we propose a dual-process robust online speech enhancement method based on DNN-based beamforming with FastMNMF-guided adaptation. FastMNMF (back end) is performed in a mini-batch style and the noisy and enhanced speech pairs are used together with the original parallel training data for updating the direction-aware DNN (front end) with backpropagation at a computationally-allowable interval. This method is used with a blind dereverberation method called weighted prediction error (WPE) for transcribing the noisy reverberant speech of a speaker, which can be detected from video or selected by a user's hand gesture or eye gaze, in a streaming manner and spatially showing the transcriptions with an AR technique. Our experiment showed that the word error rate was improved by more than 10 points with the run-time adaptation using only twelve minutes of observation.
△ Less
Submitted 15 July, 2022;
originally announced July 2022.
-
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
Authors:
Yicheng Du,
Aditya Arie Nugraha,
Kouhei Sekiguchi,
Yoshiaki Bando,
Mathieu Fontaine,
Kazuyoshi Yoshii
Abstract:
This paper describes noisy speech recognition for an augmented reality headset that helps verbal communication within real multiparty conversational environments. A major approach that has actively been studied in simulated environments is to sequentially perform speech enhancement and automatic speech recognition (ASR) based on deep neural networks (DNNs) trained in a supervised manner. In our ta…
▽ More
This paper describes noisy speech recognition for an augmented reality headset that helps verbal communication within real multiparty conversational environments. A major approach that has actively been studied in simulated environments is to sequentially perform speech enhancement and automatic speech recognition (ASR) based on deep neural networks (DNNs) trained in a supervised manner. In our task, however, such a pretrained system fails to work due to the mismatch between the training and test conditions and the head movements of the user. To enhance only the utterances of a target speaker, we use beamforming based on a DNN-based speech mask estimator that can adaptively extract the speech components corresponding to a head-relative particular direction. We propose a semi-supervised adaptation method that jointly updates the mask estimator and the ASR model at run-time using clean speech signals with ground-truth transcriptions and noisy speech signals with highly-confident estimated transcriptions. Comparative experiments using the state-of-the-art distant speech recognition system show that the proposed method significantly improves the ASR performance.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Generating Diverse Indoor Furniture Arrangements
Authors:
Ya-Chuan Hsu,
Matthew C. Fontaine,
Sam Earle,
Maria Edwards,
Julian Togelius,
Stefanos Nikolaidis
Abstract:
We present a method for generating arrangements of indoor furniture from human-designed furniture layout data. Our method creates arrangements that target specified diversity, such as the total price of all furniture in the room and the number of pieces placed. To generate realistic furniture arrangement, we train a generative adversarial network (GAN) on human-designed layouts. To target specific…
▽ More
We present a method for generating arrangements of indoor furniture from human-designed furniture layout data. Our method creates arrangements that target specified diversity, such as the total price of all furniture in the room and the number of pieces placed. To generate realistic furniture arrangement, we train a generative adversarial network (GAN) on human-designed layouts. To target specific diversity in the arrangements, we optimize the latent space of the GAN via a quality diversity algorithm to generate a diverse arrangement collection. Experiments show our approach discovers a set of arrangements that are similar to human-designed layouts but varies in price and number of furniture pieces.
△ Less
Submitted 20 June, 2022;
originally announced June 2022.
-
Deep Surrogate Assisted Generation of Environments
Authors:
Varun Bhatt,
Bryon Tjanaka,
Matthew C. Fontaine,
Stefanos Nikolaidis
Abstract:
Recent progress in reinforcement learning (RL) has started producing generally capable agents that can solve a distribution of complex environments. These agents are typically tested on fixed, human-authored environments. On the other hand, quality diversity (QD) optimization has been proven to be an effective component of environment generation algorithms, which can generate collections of high-q…
▽ More
Recent progress in reinforcement learning (RL) has started producing generally capable agents that can solve a distribution of complex environments. These agents are typically tested on fixed, human-authored environments. On the other hand, quality diversity (QD) optimization has been proven to be an effective component of environment generation algorithms, which can generate collections of high-quality environments that are diverse in the resulting agent behaviors. However, these algorithms require potentially expensive simulations of agents on newly generated environments. We propose Deep Surrogate Assisted Generation of Environments (DSAGE), a sample-efficient QD environment generation algorithm that maintains a deep surrogate model for predicting agent behaviors in new environments. Results in two benchmark domains show that DSAGE significantly outperforms existing QD environment generation algorithms in discovering collections of environments that elicit diverse behaviors of a state-of-the-art RL agent and a planning agent. Our source code and videos are available at https://dsagepaper.github.io/.
△ Less
Submitted 11 October, 2022; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Covariance Matrix Adaptation MAP-Annealing
Authors:
Matthew C. Fontaine,
Stefanos Nikolaidis
Abstract:
Single-objective optimization algorithms search for the single highest-quality solution with respect to an objective. Quality diversity (QD) optimization algorithms, such as Covariance Matrix Adaptation MAP-Elites (CMA-ME), search for a collection of solutions that are both high-quality with respect to an objective and diverse with respect to specified measure functions. However, CMA-ME suffers fr…
▽ More
Single-objective optimization algorithms search for the single highest-quality solution with respect to an objective. Quality diversity (QD) optimization algorithms, such as Covariance Matrix Adaptation MAP-Elites (CMA-ME), search for a collection of solutions that are both high-quality with respect to an objective and diverse with respect to specified measure functions. However, CMA-ME suffers from three major limitations highlighted by the QD community: prematurely abandoning the objective in favor of exploration, struggling to explore flat objectives, and having poor performance for low-resolution archives. We propose a new quality diversity algorithm, Covariance Matrix Adaptation MAP-Annealing (CMA-MAE), that addresses all three limitations. We provide theoretical justifications for the new algorithm with respect to each limitation. Our theory informs our experiments, which support the theory and show that CMA-MAE achieves state-of-the-art performance and robustness.
△ Less
Submitted 5 June, 2023; v1 submitted 22 May, 2022;
originally announced May 2022.
-
Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
Authors:
Mathieu Fontaine,
Kouhei Sekiguchi,
Aditya Nugraha,
Yoshiaki Bando,
Kazuyoshi Yoshii
Abstract:
This paper describes heavy-tailed extensions of a state-of-the-art versatile blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) from a unified point of view. The common way of deriving such an extension is to replace the multivariate complex Gaussian distribution in the likelihood function with its heavy-tailed generalization, e.g., the multivariate…
▽ More
This paper describes heavy-tailed extensions of a state-of-the-art versatile blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) from a unified point of view. The common way of deriving such an extension is to replace the multivariate complex Gaussian distribution in the likelihood function with its heavy-tailed generalization, e.g., the multivariate complex Student's t and leptokurtic generalized Gaussian distributions, and tailor-make the corresponding parameter optimization algorithm. Using a wider class of heavy-tailed distributions called a Gaussian scale mixture (GSM), i.e., a mixture of Gaussian distributions whose variances are perturbed by positive random scalars called impulse variables, we propose GSM-FastMNMF and develop an expectationmaximization algorithm that works even when the probability density function of the impulse variables have no analytical expressions. We show that existing heavy-tailed FastMNMF extensions are instances of GSM-FastMNMF and derive a new instance based on the generalized hyperbolic distribution that include the normal-inverse Gaussian, Student's t, and Gaussian distributions as the special cases. Our experiments show that the normalinverse Gaussian FastMNMF outperforms the state-of-the-art FastMNMF extensions and ILRMA model in speech enhancement and separation in terms of the signal-to-distortion ratio.
△ Less
Submitted 11 May, 2022;
originally announced May 2022.
-
Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning
Authors:
Bryon Tjanaka,
Matthew C. Fontaine,
Julian Togelius,
Stefanos Nikolaidis
Abstract:
Consider the problem of training robustly capable agents. One approach is to generate a diverse collection of agent polices. Training can then be viewed as a quality diversity (QD) optimization problem, where we search for a collection of performant policies that are diverse with respect to quantified behavior. Recent work shows that differentiable quality diversity (DQD) algorithms greatly accele…
▽ More
Consider the problem of training robustly capable agents. One approach is to generate a diverse collection of agent polices. Training can then be viewed as a quality diversity (QD) optimization problem, where we search for a collection of performant policies that are diverse with respect to quantified behavior. Recent work shows that differentiable quality diversity (DQD) algorithms greatly accelerate QD optimization when exact gradients are available. However, agent policies typically assume that the environment is not differentiable. To apply DQD algorithms to training agent policies, we must approximate gradients for performance and behavior. We propose two variants of the current state-of-the-art DQD algorithm that compute gradients via approximation methods common in reinforcement learning (RL). We evaluate our approach on four simulated locomotion tasks. One variant achieves results comparable to the current state-of-the-art in combining QD and RL, while the other performs comparably in two locomotion tasks. These results provide insight into the limitations of current DQD algorithms in domains where gradients must be approximated. Source code is available at https://github.com/icaros-usc/dqd-rl
△ Less
Submitted 15 April, 2022; v1 submitted 8 February, 2022;
originally announced February 2022.
-
Deep Surrogate Assisted MAP-Elites for Automated Hearthstone Deckbuilding
Authors:
Yulun Zhang,
Matthew C. Fontaine,
Amy K. Hoover,
Stefanos Nikolaidis
Abstract:
We study the problem of efficiently generating high-quality and diverse content in games. Previous work on automated deckbuilding in Hearthstone shows that the quality diversity algorithm MAP-Elites can generate a collection of high-performing decks with diverse strategic gameplay. However, MAP-Elites requires a large number of expensive evaluations to discover a diverse collection of decks. We pr…
▽ More
We study the problem of efficiently generating high-quality and diverse content in games. Previous work on automated deckbuilding in Hearthstone shows that the quality diversity algorithm MAP-Elites can generate a collection of high-performing decks with diverse strategic gameplay. However, MAP-Elites requires a large number of expensive evaluations to discover a diverse collection of decks. We propose assisting MAP-Elites with a deep surrogate model trained online to predict game outcomes with respect to candidate decks. MAP-Elites discovers a diverse dataset to improve the surrogate model accuracy, while the surrogate model helps guide MAP-Elites towards promising new content. In a Hearthstone deckbuilding case study, we show that our approach improves the sample efficiency of MAP-Elites and outperforms a model trained offline with random decks, as well as a linear surrogate model baseline, setting a new state-of-the-art for quality diversity approaches in automated Hearthstone deckbuilding. We include the source code for all the experiments at: https://github.com/icaros-usc/EvoStone2.
△ Less
Submitted 16 April, 2022; v1 submitted 7 December, 2021;
originally announced December 2021.
-
Illuminating Diverse Neural Cellular Automata for Level Generation
Authors:
Sam Earle,
Justin Snider,
Matthew C. Fontaine,
Stefanos Nikolaidis,
Julian Togelius
Abstract:
We present a method of generating diverse collections of neural cellular automata (NCA) to design video game levels. While NCAs have so far only been trained via supervised learning, we present a quality diversity (QD) approach to generating a collection of NCA level generators. By framing the problem as a QD problem, our approach can train diverse level generators, whose output levels vary based…
▽ More
We present a method of generating diverse collections of neural cellular automata (NCA) to design video game levels. While NCAs have so far only been trained via supervised learning, we present a quality diversity (QD) approach to generating a collection of NCA level generators. By framing the problem as a QD problem, our approach can train diverse level generators, whose output levels vary based on aesthetic or functional criteria. To efficiently generate NCAs, we train generators via Covariance Matrix Adaptation MAP-Elites (CMA-ME), a quality diversity algorithm which specializes in continuous search spaces. We apply our new method to generate level generators for several 2D tile-based games: a maze game, Sokoban, and Zelda. Our results show that CMA-ME can generate small NCAs that are diverse yet capable, often satisfying complex solvability criteria for deterministic agents. We compare against a Compositional Pattern-Producing Network (CPPN) baseline trained to produce diverse collections of generators and show that the NCA representation yields a better exploration of level-space.
△ Less
Submitted 17 February, 2022; v1 submitted 12 September, 2021;
originally announced September 2021.
-
On the Importance of Environments in Human-Robot Coordination
Authors:
Matthew C. Fontaine,
Ya-Chuan Hsu,
Yulun Zhang,
Bryon Tjanaka,
Stefanos Nikolaidis
Abstract:
When studying robots collaborating with humans, much of the focus has been on robot policies that coordinate fluently with human teammates in collaborative tasks. However, less emphasis has been placed on the effect of the environment on coordination behaviors. To thoroughly explore environments that result in diverse behaviors, we propose a framework for procedural generation of environments that…
▽ More
When studying robots collaborating with humans, much of the focus has been on robot policies that coordinate fluently with human teammates in collaborative tasks. However, less emphasis has been placed on the effect of the environment on coordination behaviors. To thoroughly explore environments that result in diverse behaviors, we propose a framework for procedural generation of environments that are (1) stylistically similar to human-authored environments, (2) guaranteed to be solvable by the human-robot team, and (3) diverse with respect to coordination measures. We analyze the procedurally generated environments in the Overcooked benchmark domain via simulation and an online user study. Results show that the environments result in qualitatively different emerging behaviors and statistically significant differences in collaborative fluency metrics, even when the robot runs the same planning algorithm.
△ Less
Submitted 28 June, 2021; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Differentiable Quality Diversity
Authors:
Matthew C. Fontaine,
Stefanos Nikolaidis
Abstract:
Quality diversity (QD) is a growing branch of stochastic optimization research that studies the problem of generating an archive of solutions that maximize a given objective function but are also diverse with respect to a set of specified measure functions. However, even when these functions are differentiable, QD algorithms treat them as "black boxes", ignoring gradient information. We present th…
▽ More
Quality diversity (QD) is a growing branch of stochastic optimization research that studies the problem of generating an archive of solutions that maximize a given objective function but are also diverse with respect to a set of specified measure functions. However, even when these functions are differentiable, QD algorithms treat them as "black boxes", ignoring gradient information. We present the differentiable quality diversity (DQD) problem, a special case of QD, where both the objective and measure functions are first order differentiable. We then present MAP-Elites via a Gradient Arborescence (MEGA), a DQD algorithm that leverages gradient information to efficiently explore the joint range of the objective and measure functions. Results in two QD benchmark domains and in searching the latent space of a StyleGAN show that MEGA significantly outperforms state-of-the-art QD algorithms, highlighting DQD's promise for efficient quality diversity optimization when gradient information is available. Source code is available at https://github.com/icaros-usc/dqd.
△ Less
Submitted 26 October, 2021; v1 submitted 7 June, 2021;
originally announced June 2021.
-
Braids of the N-body problem II: carousel solutions by cabling central configurations
Authors:
Marine Fontaine,
Carlos García-Azpeitia
Abstract:
We prove the existence of relative periodic solutions of the planar $N=\sum_{j=1}^n k_j$-body problem starting with $n$ bodies moving close to a non-degenerate central configuration and replacing each of them with clusters of $k_j$ bodies that move close to a small central configuration. We name these solutions carousel solutions. The proof relies on blow-up techniques for variational methods used…
▽ More
We prove the existence of relative periodic solutions of the planar $N=\sum_{j=1}^n k_j$-body problem starting with $n$ bodies moving close to a non-degenerate central configuration and replacing each of them with clusters of $k_j$ bodies that move close to a small central configuration. We name these solutions carousel solutions. The proof relies on blow-up techniques for variational methods used in our previous work.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
The large inner Micromegas modules for the Atlas Muon Spectrometer Upgrade: construction, quality control and characterization
Authors:
J. Allard,
M. Anfreville,
N. Andari,
D. Attié,
S. Aune,
H. Bachacou,
F. Balli,
F. Bauer,
J. Bennet,
T. Benoit,
J. Beltramelli,
H. Bervas,
T. Bey,
S. Bouaziz,
M. Boyer,
T. Challey,
T. Chevalérias,
X. Copollani,
J. Costa,
G. Cara,
G. Decock,
F. Deliot,
D. Denysiuk,
D. Desforge,
G. Disset
, et al. (49 additional authors not shown)
Abstract:
The steadily increasing luminosity of the LHC requires an upgrade with high-rate and high-resolution detector technology for the inner end cap of the ATLAS muon spectrometer: the New Small Wheels (NSW). In order to achieve the goal of precision tracking at a hit rate of about 15 kHz/cm$^2$ at the inner radius of the NSW, large area Micromegas quadruplets with 100\,\microns spatial resolution per p…
▽ More
The steadily increasing luminosity of the LHC requires an upgrade with high-rate and high-resolution detector technology for the inner end cap of the ATLAS muon spectrometer: the New Small Wheels (NSW). In order to achieve the goal of precision tracking at a hit rate of about 15 kHz/cm$^2$ at the inner radius of the NSW, large area Micromegas quadruplets with 100\,\microns spatial resolution per plane have been produced. % IRFU, from the CEA research center of Saclay, is responsible for the production and validation of LM1 Micromegas modules. The construction, production, qualification and validation of the largest Micromegas detectors ever built are reported here. Performance results under cosmic muon characterisation will also be discussed.
△ Less
Submitted 28 May, 2021;
originally announced May 2021.
-
A Quality Diversity Approach to Automatically Generating Human-Robot Interaction Scenarios in Shared Autonomy
Authors:
Matthew Fontaine,
Stefanos Nikolaidis
Abstract:
The growth of scale and complexity of interactions between humans and robots highlights the need for new computational methods to automatically evaluate novel algorithms and applications. Exploring diverse scenarios of humans and robots interacting in simulation can improve understanding of the robotic system and avoid potentially costly failures in real-world settings. We formulate this problem a…
▽ More
The growth of scale and complexity of interactions between humans and robots highlights the need for new computational methods to automatically evaluate novel algorithms and applications. Exploring diverse scenarios of humans and robots interacting in simulation can improve understanding of the robotic system and avoid potentially costly failures in real-world settings. We formulate this problem as a quality diversity (QD) problem, where the goal is to discover diverse failure scenarios by simultaneously exploring both environments and human actions. We focus on the shared autonomy domain, where the robot attempts to infer the goal of a human operator, and adopt the QD algorithm MAP-Elites to generate scenarios for two published algorithms in this domain: shared autonomy via hindsight optimization and linear policy blending. Some of the generated scenarios confirm previous theoretical findings, while others are surprising and bring about a new understanding of state-of-the-art implementations. Our experiments show that MAP-Elites outperforms Monte-Carlo simulation and optimization based methods in effectively searching the scenario space, highlighting its promise for automatic evaluation of algorithms in human-robot interaction.
△ Less
Submitted 21 June, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Video Game Level Repair via Mixed Integer Linear Programming
Authors:
Hejia Zhang,
Matthew C. Fontaine,
Amy K. Hoover,
Julian Togelius,
Bistra Dilkina,
Stefanos Nikolaidis
Abstract:
Recent advancements in procedural content generation via machine learning enable the generation of video-game levels that are aesthetically similar to human-authored examples. However, the generated levels are often unplayable without additional editing. We propose a generate-then-repair framework for automatic generation of playable levels adhering to specific styles. The framework constructs lev…
▽ More
Recent advancements in procedural content generation via machine learning enable the generation of video-game levels that are aesthetically similar to human-authored examples. However, the generated levels are often unplayable without additional editing. We propose a generate-then-repair framework for automatic generation of playable levels adhering to specific styles. The framework constructs levels using a generative adversarial network (GAN) trained with human-authored examples and repairs them using a mixed-integer linear program (MIP) with playability constraints. A key component of the framework is computing minimum cost edits between the GAN generated level and the solution of the MIP solver, which we cast as a minimum cost network flow problem. Results show that the proposed framework generates a diverse range of playable levels, that capture the spatial relationships between objects exhibited in the human-authored levels.
△ Less
Submitted 13 October, 2020;
originally announced October 2020.
-
Real Forms of Holomorphic Hamiltonian Systems
Authors:
Philip Arathoon,
Marine Fontaine
Abstract:
By complexifying a Hamiltonian system one obtains dynamics on a holomorphic symplectic manifold. To invert this construction we present a theory of real forms which not only recovers the original system but also yields different real Hamiltonian systems which share the same complexification. This provides a notion of real forms for holomorphic Hamiltonian systems analogous to that of real forms fo…
▽ More
By complexifying a Hamiltonian system one obtains dynamics on a holomorphic symplectic manifold. To invert this construction we present a theory of real forms which not only recovers the original system but also yields different real Hamiltonian systems which share the same complexification. This provides a notion of real forms for holomorphic Hamiltonian systems analogous to that of real forms for complex Lie algebras. Our main result is that the complexification of any analytic mechanical system on a Grassmannian admits a real form on a compact symplectic manifold. This produces a `unitary trick' for Hamiltonian systems which curiously requires an essential use of hyperkähler geometry. We demonstrate this result by finding compact real forms for the simple pendulum, the spherical pendulum, and the rigid body.
△ Less
Submitted 7 February, 2023; v1 submitted 22 September, 2020;
originally announced September 2020.
-
Illuminating Mario Scenes in the Latent Space of a Generative Adversarial Network
Authors:
Matthew C. Fontaine,
Ruilin Liu,
Ahmed Khalifa,
Jignesh Modi,
Julian Togelius,
Amy K. Hoover,
Stefanos Nikolaidis
Abstract:
Generative adversarial networks (GANs) are quickly becoming a ubiquitous approach to procedurally generating video game levels. While GAN generated levels are stylistically similar to human-authored examples, human designers often want to explore the generative design space of GANs to extract interesting levels. However, human designers find latent vectors opaque and would rather explore along dim…
▽ More
Generative adversarial networks (GANs) are quickly becoming a ubiquitous approach to procedurally generating video game levels. While GAN generated levels are stylistically similar to human-authored examples, human designers often want to explore the generative design space of GANs to extract interesting levels. However, human designers find latent vectors opaque and would rather explore along dimensions the designer specifies, such as number of enemies or obstacles. We propose using state-of-the-art quality diversity algorithms designed to optimize continuous spaces, i.e. MAP-Elites with a directional variation operator and Covariance Matrix Adaptation MAP-Elites, to efficiently explore the latent space of a GAN to extract levels that vary across a set of specified gameplay measures. In the benchmark domain of Super Mario Bros, we demonstrate how designers may specify gameplay measures to our system and extract high-quality (playable) levels with a diverse range of level mechanics, while still maintaining stylistic similarity to human authored examples. An online user study shows how the different mechanics of the automatically generated levels affect subjective ratings of their perceived difficulty and appearance.
△ Less
Submitted 21 June, 2021; v1 submitted 10 July, 2020;
originally announced July 2020.
-
Covariance Matrix Adaptation for the Rapid Illumination of Behavior Space
Authors:
Matthew C. Fontaine,
Julian Togelius,
Stefanos Nikolaidis,
Amy K. Hoover
Abstract:
We focus on the challenge of finding a diverse collection of quality solutions on complex continuous domains. While quality diver-sity (QD) algorithms like Novelty Search with Local Competition (NSLC) and MAP-Elites are designed to generate a diverse range of solutions, these algorithms require a large number of evaluations for exploration of continuous spaces. Meanwhile, variants of the Covarianc…
▽ More
We focus on the challenge of finding a diverse collection of quality solutions on complex continuous domains. While quality diver-sity (QD) algorithms like Novelty Search with Local Competition (NSLC) and MAP-Elites are designed to generate a diverse range of solutions, these algorithms require a large number of evaluations for exploration of continuous spaces. Meanwhile, variants of the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) are among the best-performing derivative-free optimizers in single-objective continuous domains. This paper proposes a new QD algorithm called Covariance Matrix Adaptation MAP-Elites (CMA-ME). Our new algorithm combines the self-adaptation techniques of CMA-ES with archiving and map** techniques for maintaining diversity in QD. Results from experiments based on standard continuous optimization benchmarks show that CMA-ME finds better-quality solutions than MAP-Elites; similarly, results on the strategic game Hearthstone show that CMA-ME finds both a higher overall quality and broader diversity of strategies than both CMA-ES and MAP-Elites. Overall, CMA-ME more than doubles the performance of MAP-Elites using standard QD performance metrics. These results suggest that QD algorithms augmented by operators from state-of-the-art optimization algorithms can yield high-performing methods for simultaneously exploring and optimizing continuous search spaces, with significant applications to design, testing, and reinforcement learning among other domains.
△ Less
Submitted 7 May, 2020; v1 submitted 5 December, 2019;
originally announced December 2019.
-
Evolving the Hearthstone Meta
Authors:
Fernando de Mesentier Silva,
Rodrigo Canaan,
Scott Lee,
Matthew C. Fontaine,
Julian Togelius,
Amy K. Hoover
Abstract:
Balancing an ever growing strategic game of high complexity, such as Hearthstone is a complex task. The target of making strategies diverse and customizable results in a delicate intricate system. Tuning over 2000 cards to generate the desired outcome without disrupting the existing environment becomes a laborious challenge. In this paper, we discuss the impacts that changes to existing cards can…
▽ More
Balancing an ever growing strategic game of high complexity, such as Hearthstone is a complex task. The target of making strategies diverse and customizable results in a delicate intricate system. Tuning over 2000 cards to generate the desired outcome without disrupting the existing environment becomes a laborious challenge. In this paper, we discuss the impacts that changes to existing cards can have on strategy in Hearthstone. By analyzing the win rate on match-ups across different decks, being played by different strategies, we propose to compare their performance before and after changes are made to improve or worsen different cards. Then, using an evolutionary algorithm, we search for a combination of changes to the card attributes that cause the decks to approach equal, 50% win rates. We then expand our evolutionary algorithm to a multi-objective solution to search for this result, while making the minimum amount of changes, and as a consequence disruption, to the existing cards. Lastly, we propose and evaluate metrics to serve as heuristics with which to decide which cards to target with balance changes.
△ Less
Submitted 2 July, 2019;
originally announced July 2019.
-
Braids of the N-body problem by cabling a body in a central configuration
Authors:
Marine Fontaine,
Carlos García-Azpeitia
Abstract:
We prove the existence of periodic solutions of the N=(n+1)-body problem starting with n bodies whose reduced motion is close to a non-degenerate central configuration and replacing one of them by the center of mass of a pair of bodies rotating uniformly. When the motion takes place in the standard Euclidean plane, these solutions are a special type of braid solutions obtained numerically by C. Mo…
▽ More
We prove the existence of periodic solutions of the N=(n+1)-body problem starting with n bodies whose reduced motion is close to a non-degenerate central configuration and replacing one of them by the center of mass of a pair of bodies rotating uniformly. When the motion takes place in the standard Euclidean plane, these solutions are a special type of braid solutions obtained numerically by C. Moore. The proof uses blow-up techniques to separate the problem into the n-body problem, the Kepler problem, and a coupling which is small if the distance of the pair is small. The formulation is variational and the result is obtained by applying a Lyapunov-Schmidt reduction and by using the equivariant Lyusternik-Schnirelmann category.
△ Less
Submitted 19 July, 2020; v1 submitted 18 June, 2019;
originally announced June 2019.
-
Map** Hearthstone Deck Spaces through MAP-Elites with Sliding Boundaries
Authors:
Matthew C. Fontaine,
Scott Lee,
L. B. Soros,
Fernando De Mesentier Silva,
Julian Togelius,
Amy K. Hoover
Abstract:
Quality diversity (QD) algorithms such as MAP-Elites have emerged as a powerful alternative to traditional single-objective optimization methods. They were initially applied to evolutionary robotics problems such as locomotion and maze navigation, but have yet to see widespread application. We argue that these algorithms are perfectly suited to the rich domain of video games, which contains many r…
▽ More
Quality diversity (QD) algorithms such as MAP-Elites have emerged as a powerful alternative to traditional single-objective optimization methods. They were initially applied to evolutionary robotics problems such as locomotion and maze navigation, but have yet to see widespread application. We argue that these algorithms are perfectly suited to the rich domain of video games, which contains many relevant problems with a multitude of successful strategies and often also multiple dimensions along which solutions can vary.
This paper introduces a novel modification of the MAP-Elites algorithm called MAP-Elites with Sliding Boundaries (MESB) and applies it to the design and rebalancing of Hearthstone, a popular collectible card game chosen for its number of multidimensional behavior features relevant to particular styles of play. To avoid overpopulating cells with conflated behaviors, MESB slides the boundaries of cells based on the distribution of evolved individuals. Experiments in this paper demonstrate the performance of MESB in Hearthstone. Results suggest MESB finds diverse ways of playing the game well along the selected behavioral dimensions. Further analysis of the evolved strategies reveals common patterns that recur across behavioral dimensions and explores how MESB can help rebalance the game.
△ Less
Submitted 24 April, 2019;
originally announced April 2019.
-
Time resolution studies for scintillating plastics coupled to silicon photo-multipliers
Authors:
Mauricio Alvarado,
Alejandro Ayala,
Marco Alberto Ayala-Torres,
Wolfgang Bietenholz,
Isabel Dominguez,
Marcos Fontaine,
P. González-Zamora,
Luis Manuel Montaño,
E. Moreno Barbosa,
Miguel Enrique Patiño Salazar,
V. Z. Reyna Ortiz,
M. Rodríguez Cahuantzi,
G. Tejeda Muńoz,
Maria Elena Tejeda-Yeomans,
Luis Valenzuela-Cázares,
C. H. Zepeda Fernández
Abstract:
We present results for time resolution studies performed on three different scintillating plastics and two silicon photo-multipliers. These studies are intended to determine whether scintillating plastic/silicon photo-multiplier systems can be employed to provide a fast trigger signal for NICA's Multi Purpose Detector (MPD). Our results show that such a system made of cells with transverse dimensi…
▽ More
We present results for time resolution studies performed on three different scintillating plastics and two silicon photo-multipliers. These studies are intended to determine whether scintillating plastic/silicon photo-multiplier systems can be employed to provide a fast trigger signal for NICA's Multi Purpose Detector (MPD). Our results show that such a system made of cells with transverse dimensions of order of a few cm, coupled to silicon photo-multipliers, provides a time resolution of about 50 ps, which can be even further improved to attain the MPD trigger requirements of 20 ps.
△ Less
Submitted 15 January, 2019;
originally announced January 2019.
-
A beam-beam monitoring detector for the MPD experiment at NICA
Authors:
Mauricio Alvarado,
Alejandro Ayala,
Marco Alberto Ayala-Torres,
Wolfgang Bietenholz,
Isabel Dominguez,
Marcos Fontaine,
P. González-Zamora,
Luis Manuel Montaño,
E. Moreno-Barbosa,
Miguel Enrique Patiño Salazar,
L. A. P. Moreno,
P. A. Nieto-Marín,
V. Z. Reyna Ortiz,
M. Rodríguez-Cahuantzi,
G. Tejeda-Muñoz,
Maria Elena Tejeda-Yeomans,
A. Villatoro-Tello,
C. H. Zepeda Fernández
Abstract:
The Multi-Purpose Detector (MPD) is to be installed at the Nuclotron Ion Collider fAcility (NICA) of the Joint Institute for Nuclear Research (JINR). Its main goal is to study the phase diagram of the strongly interacting matter produced in heavy-ion collisions. These studies, while providing insight into the physics of heavy-ion collisions, are relevant for improving our understanding of the evol…
▽ More
The Multi-Purpose Detector (MPD) is to be installed at the Nuclotron Ion Collider fAcility (NICA) of the Joint Institute for Nuclear Research (JINR). Its main goal is to study the phase diagram of the strongly interacting matter produced in heavy-ion collisions. These studies, while providing insight into the physics of heavy-ion collisions, are relevant for improving our understanding of the evolution of the early Universe and the formation of neutron stars. In order to extend the MPD trigger capabilities, we propose to include a high granularity beam-beam monitoring detector (BE-BE) to provide a level-0 trigger signal with an expected time resolution of 30 ps. This new detector will improve the determination of the reaction plane by the MPD experiment, a key measurement for flow studies that provides physics insight into the early stages of the reaction. In this work, we use simulated Au+Au collisions at NICA energies to show the potential of such a detector to determine the event plane resolution, providing further redundancy to the detectors originally considered for this purpose namely, the Fast Forward Detector (FFD) and the Hadron Calorimeter (HCAL). We also show our results for the time resolution studies of two prototype cells carried out at the T10 beam line at the CERN PS complex.
△ Less
Submitted 4 December, 2019; v1 submitted 25 September, 2018;
originally announced September 2018.
-
Symplectic slice for subgroup actions
Authors:
Marine Fontaine
Abstract:
Given a symplectic manifold $(M,ω)$ endowed with a proper Hamiltonian action of a Lie group $G$, we consider the action induced by a Lie subgroup $H$ of $G$. We propose a construction for two compatible Witt-Artin decompositions of the tangent space of $M$, one relative to the $G$-action and one relative to the $H$-action. In particular, we provide an explicit relation between the respective sympl…
▽ More
Given a symplectic manifold $(M,ω)$ endowed with a proper Hamiltonian action of a Lie group $G$, we consider the action induced by a Lie subgroup $H$ of $G$. We propose a construction for two compatible Witt-Artin decompositions of the tangent space of $M$, one relative to the $G$-action and one relative to the $H$-action. In particular, we provide an explicit relation between the respective symplectic slices.
△ Less
Submitted 19 June, 2019; v1 submitted 29 December, 2017;
originally announced December 2017.
-
A localization formula for equivariant Lyusternik-Schnirelmann category
Authors:
Marine Fontaine,
James Montaldi
Abstract:
The LS-category of a topological space is a numerical homotopy invariant, introduced originally in a course on the global calculus of variations by Lyusternik and Schnirelmann, to estimate the number of critical points of a smooth function. When the topological space is a smooth manifold equipped with a proper action of a Lie group, we give a localization formula to calculate the equivariant analo…
▽ More
The LS-category of a topological space is a numerical homotopy invariant, introduced originally in a course on the global calculus of variations by Lyusternik and Schnirelmann, to estimate the number of critical points of a smooth function. When the topological space is a smooth manifold equipped with a proper action of a Lie group, we give a localization formula to calculate the equivariant analogue of this category in terms of the minimal orbit-type strata. The formula holds provided that the manifold admits a specific cover. We show that such a cover exists on every symplectic toric manifold. The known result stating that the LS-category of a symplectic toric manifold is equal to the number of fixed points of the torus action follows from our localization formula.
△ Less
Submitted 19 December, 2017;
originally announced December 2017.
-
Persistence of stationary motion under explicit symmetry breaking perturbation
Authors:
Marine Fontaine,
James Montaldi
Abstract:
Explicit symmetry breaking occurs when a dynamical system having a certain symmetry group is perturbed in a way that the perturbation preserves only some symmetries of the original system. We give a geometric approach to study this phenomenon in the setting of equivariant Hamiltonian systems. A lower bound for the number of orbits of equilibria and orbits of relative equilibria which persist after…
▽ More
Explicit symmetry breaking occurs when a dynamical system having a certain symmetry group is perturbed in a way that the perturbation preserves only some symmetries of the original system. We give a geometric approach to study this phenomenon in the setting of equivariant Hamiltonian systems. A lower bound for the number of orbits of equilibria and orbits of relative equilibria which persist after a small perturbation is given. This bound is given in terms of the equivariant Lyusternik-Schnirelmann category of the group orbit.
△ Less
Submitted 19 June, 2019; v1 submitted 16 December, 2017;
originally announced December 2017.
-
Stable Ground States for the HMF Poisson Model
Authors:
Marine Fontaine,
Mohammed Lemou,
Florian Méhats
Abstract:
In this paper we prove the nonlinear orbital stability of a large class of steady states solutions to the Hamiltonian Mean Field (HMF) system with a Poisson interaction potential. These steady states are obtained as minimizers of an energy functional under one, two or infinitely many constraints. The singularity of the Poisson potential prevents from a direct run of the general strategy in [20, 16…
▽ More
In this paper we prove the nonlinear orbital stability of a large class of steady states solutions to the Hamiltonian Mean Field (HMF) system with a Poisson interaction potential. These steady states are obtained as minimizers of an energy functional under one, two or infinitely many constraints. The singularity of the Poisson potential prevents from a direct run of the general strategy in [20, 16] which was based on generalized rearrangement techniques, and which has been recently extended to the case of the usual (smooth) cosine potential [17]. Our strategy is rather based on variational techniques. However, due to the boundedness of the space domain, our variational problems do not enjoy the usual scaling invariances which are, in general, very important in the analysis of variational problems. To replace these scaling arguments, we introduce new transformations which, although specific to our context, remain somehow in the same spirit of rearrangements tools introduced in the references above. In particular, these transformations allow for the incorporation of an arbitrary number of constraints, and yield a stability result for a large class of steady states.
△ Less
Submitted 11 September, 2017; v1 submitted 7 September, 2017;
originally announced September 2017.