-
Factor-Conditioned Speaking-Style Captioning
Authors:
Atsushi Ando,
Takafumi Moriya,
Shota Horiguchi,
Ryo Masumura
Abstract:
This paper presents a novel speaking-style captioning method that generates diverse descriptions while accurately predicting speaking-style information. Conventional learning criteria directly use original captions that contain not only speaking-style factor terms but also syntax words, which disturbs learning speaking-style information. To solve this problem, we introduce factor-conditioned capti…
▽ More
This paper presents a novel speaking-style captioning method that generates diverse descriptions while accurately predicting speaking-style information. Conventional learning criteria directly use original captions that contain not only speaking-style factor terms but also syntax words, which disturbs learning speaking-style information. To solve this problem, we introduce factor-conditioned captioning (FCC), which first outputs a phrase representing speaking-style factors (e.g., gender, pitch, etc.), and then generates a caption to ensure the model explicitly learns speaking-style factors. We also propose greedy-then-sampling (GtS) decoding, which first predicts speaking-style factors deterministically to guarantee semantic accuracy, and then generates a caption based on factor-conditioned sampling to ensure diversity. Experiments show that FCC outperforms the original caption-based training, and with GtS, it generates more diverse captions while kee** style prediction performance.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis
Authors:
Kenichi Fujita,
Atsushi Ando,
Yusuke Ijima
Abstract:
This paper proposes a speech rhythm-based method for speaker embeddings to model phoneme duration using a few utterances by the target speaker. Speech rhythm is one of the essential factors among speaker characteristics, along with acoustic features such as F0, for reproducing individual utterances in speech synthesis. A novel feature of the proposed method is the rhythm-based embeddings extracted…
▽ More
This paper proposes a speech rhythm-based method for speaker embeddings to model phoneme duration using a few utterances by the target speaker. Speech rhythm is one of the essential factors among speaker characteristics, along with acoustic features such as F0, for reproducing individual utterances in speech synthesis. A novel feature of the proposed method is the rhythm-based embeddings extracted from phonemes and their durations, which are known to be related to speaking rhythm. They are extracted with a speaker identification model similar to the conventional spectral feature-based one. We conducted three experiments, speaker embeddings generation, speech synthesis with generated embeddings, and embedding space analysis, to evaluate the performance. The proposed method demonstrated a moderate speaker identification performance (15.2% EER), even with only phonemes and their duration information. The objective and subjective evaluation results demonstrated that the proposed method can synthesize speech with speech rhythm closer to the target speaker than the conventional method. We also visualized the embeddings to evaluate the relationship between the distance of the embeddings and the perceptual similarity. The visualization of the embedding space and the relation analysis between the closeness indicated that the distribution of embeddings reflects the subjective and objective similarity.
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization
Authors:
Naohiro Tawara,
Marc Delcroix,
Atsushi Ando,
Atsunori Ogawa
Abstract:
This paper details our speaker diarization system designed for multi-domain, multi-microphone casual conversations. The proposed diarization pipeline uses weighted prediction error (WPE)-based dereverberation as a front end, then applies end-to-end neural diarization with vector clustering (EEND-VC) to each channel separately. It integrates the diarization result obtained from each channel using d…
▽ More
This paper details our speaker diarization system designed for multi-domain, multi-microphone casual conversations. The proposed diarization pipeline uses weighted prediction error (WPE)-based dereverberation as a front end, then applies end-to-end neural diarization with vector clustering (EEND-VC) to each channel separately. It integrates the diarization result obtained from each channel using diarization output voting error reduction plus overlap (DOVER-LAP). To harness the knowledge from the target domain and results integrated across all channels, we apply self-supervised adaptation for each session by retraining the EEND-VC with pseudo-labels derived from DOVER-LAP. The proposed system was incorporated into NTT's submission for the distant automatic speech recognition task in the CHiME-7 challenge. Our system achieved 65 % and 62 % relative improvements on development and eval sets compared to the organizer-provided VC-based baseline diarization system, securing third place in diarization performance.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
Adversarial Finetuning with Latent Representation Constraint to Mitigate Accuracy-Robustness Tradeoff
Authors:
Satoshi Suzuki,
Shin'ya Yamaguchi,
Shoichiro Takeda,
Sekitoshi Kanai,
Naoki Makishima,
Atsushi Ando,
Ryo Masumura
Abstract:
This paper addresses the tradeoff between standard accuracy on clean examples and robustness against adversarial examples in deep neural networks (DNNs). Although adversarial training (AT) improves robustness, it degrades the standard accuracy, thus yielding the tradeoff. To mitigate this tradeoff, we propose a novel AT method called ARREST, which comprises three components: (i) adversarial finetu…
▽ More
This paper addresses the tradeoff between standard accuracy on clean examples and robustness against adversarial examples in deep neural networks (DNNs). Although adversarial training (AT) improves robustness, it degrades the standard accuracy, thus yielding the tradeoff. To mitigate this tradeoff, we propose a novel AT method called ARREST, which comprises three components: (i) adversarial finetuning (AFT), (ii) representation-guided knowledge distillation (RGKD), and (iii) noisy replay (NR). AFT trains a DNN on adversarial examples by initializing its parameters with a DNN that is standardly pretrained on clean examples. RGKD and NR respectively entail a regularization term and an algorithm to preserve latent representations of clean examples during AFT. RGKD penalizes the distance between the representations of the standardly pretrained and AFT DNNs. NR switches input adversarial examples to nonadversarial ones when the representation changes significantly during AFT. By combining these components, ARREST achieves both high standard accuracy and robustness. Experimental results demonstrate that ARREST mitigates the tradeoff more effectively than previous AT-based methods do.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
Nonuniqueness phenomena in discontinuous dynamical systems and their regularizations
Authors:
Alessia andò,
Roderick Edwards,
Nicola Guglielmi
Abstract:
In a recent paper by Guglielmi and Hairer (SIADS 2015), an analysis in the $\varepsilon\to 0$ limit was proposed of regularized discontinuous ODEs in codimension-2 switching domains; this was obtained by studying a certain 2-dimensional system describing the so-called hidden dynamics. In particular, the existence of a unique limit solution was not proved in all cases, a few of which were labeled a…
▽ More
In a recent paper by Guglielmi and Hairer (SIADS 2015), an analysis in the $\varepsilon\to 0$ limit was proposed of regularized discontinuous ODEs in codimension-2 switching domains; this was obtained by studying a certain 2-dimensional system describing the so-called hidden dynamics. In particular, the existence of a unique limit solution was not proved in all cases, a few of which were labeled as ambiguous, and it was not clear whether or not the ambiguity could be resolved. In this paper, we show that it cannot be resolved in general. A first contribution of this paper is an illustration of the dependence of the limit solution on the form of the switching function. Considering the parameter dependence in the ambiguous class of discontinuous systems, a second contribution is a bifurcation analysis, revealing a range of possible behaviors. Finally, we investigate the sensitivity of solutions in the transition from codimension-2 domains to codimension-3 when there is a limit cycle in the hidden dynamics.
△ Less
Submitted 1 June, 2024; v1 submitted 24 July, 2023;
originally announced July 2023.
-
End-to-End Joint Target and Non-Target Speakers ASR
Authors:
Ryo Masumura,
Naoki Makishima,
Taiga Yamane,
Yoshihiko Yamazaki,
Saki Mizuno,
Mana Ihori,
Mihiro Uchida,
Keita Suzuki,
Hiroshi Sato,
Tomohiro Tanaka,
Akihiko Takashima,
Satoshi Suzuki,
Takafumi Moriya,
Nobukatsu Hojo,
Atsushi Ando
Abstract:
This paper proposes a novel automatic speech recognition (ASR) system that can transcribe individual speaker's speech while identifying whether they are target or non-target speakers from multi-talker overlapped speech. Target-speaker ASR systems are a promising way to only transcribe a target speaker's speech by enrolling the target speaker's information. However, in conversational ASR applicatio…
▽ More
This paper proposes a novel automatic speech recognition (ASR) system that can transcribe individual speaker's speech while identifying whether they are target or non-target speakers from multi-talker overlapped speech. Target-speaker ASR systems are a promising way to only transcribe a target speaker's speech by enrolling the target speaker's information. However, in conversational ASR applications, transcribing both the target speaker's speech and non-target speakers' ones is often required to understand interactive information. To naturally consider both target and non-target speakers in a single ASR model, our idea is to extend autoregressive modeling-based multi-talker ASR systems to utilize the enrollment speech of the target speaker. Our proposed ASR is performed by recursively generating both textual tokens and tokens that represent target or non-target speakers. Our experiments demonstrate the effectiveness of our proposed method.
△ Less
Submitted 4 June, 2023;
originally announced June 2023.
-
Piecewise orthogonal collocation for computing periodic solutions of coupled delay equations
Authors:
Alessia andò,
Dimitri Breda
Abstract:
We extend the piecewise orthogonal collocation method to computing periodic solutions of coupled renewal and delay differential equations. Through a rigorous error analysis, we prove convergence of the relevant finite-element method and provide a theoretical estimate of the error. We conclude with some numerical experiments to further support the theoretical results.
We extend the piecewise orthogonal collocation method to computing periodic solutions of coupled renewal and delay differential equations. Through a rigorous error analysis, we prove convergence of the relevant finite-element method and provide a theoretical estimate of the error. We conclude with some numerical experiments to further support the theoretical results.
△ Less
Submitted 20 May, 2023;
originally announced May 2023.
-
RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving
Authors:
Angelika Ando,
Spyros Gidaris,
Andrei Bursuc,
Gilles Puy,
Alexandre Boulch,
Renaud Marlet
Abstract:
Casting semantic segmentation of outdoor LiDAR point clouds as a 2D problem, e.g., via range projection, is an effective and popular approach. These projection-based methods usually benefit from fast computations and, when combined with techniques which use other point cloud representations, achieve state-of-the-art results. Today, projection-based methods leverage 2D CNNs but recent advances in c…
▽ More
Casting semantic segmentation of outdoor LiDAR point clouds as a 2D problem, e.g., via range projection, is an effective and popular approach. These projection-based methods usually benefit from fast computations and, when combined with techniques which use other point cloud representations, achieve state-of-the-art results. Today, projection-based methods leverage 2D CNNs but recent advances in computer vision show that vision transformers (ViTs) have achieved state-of-the-art results in many image-based benchmarks. In this work, we question if projection-based methods for 3D semantic segmentation can benefit from these latest improvements on ViTs. We answer positively but only after combining them with three key ingredients: (a) ViTs are notoriously hard to train and require a lot of training data to learn powerful representations. By preserving the same backbone architecture as for RGB images, we can exploit the knowledge from long training on large image collections that are much cheaper to acquire and annotate than point clouds. We reach our best results with pre-trained ViTs on large image datasets. (b) We compensate ViTs' lack of inductive bias by substituting a tailored convolutional stem for the classical linear embedding layer. (c) We refine pixel-wise predictions with a convolutional decoder and a skip connection from the convolutional stem to combine low-level but fine-grained features of the the convolutional stem with the high-level but coarse predictions of the ViT encoder. With these ingredients, we show that our method, called RangeViT, outperforms existing projection-based methods on nuScenes and SemanticKITTI. The code is available at https://github.com/valeoai/rangevit.
△ Less
Submitted 25 April, 2023; v1 submitted 24 January, 2023;
originally announced January 2023.
-
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis
Authors:
Atsushi Ando,
Ryo Masumura,
Akihiko Takashima,
Satoshi Suzuki,
Naoki Makishima,
Keita Suzuki,
Takafumi Moriya,
Takanori Ashihara,
Hiroshi Sato
Abstract:
This paper investigates the effectiveness and implementation of modality-specific large-scale pre-trained encoders for multimodal sentiment analysis~(MSA). Although the effectiveness of pre-trained encoders in various fields has been reported, conventional MSA methods employ them for only linguistic modality, and their application has not been investigated. This paper compares the features yielded…
▽ More
This paper investigates the effectiveness and implementation of modality-specific large-scale pre-trained encoders for multimodal sentiment analysis~(MSA). Although the effectiveness of pre-trained encoders in various fields has been reported, conventional MSA methods employ them for only linguistic modality, and their application has not been investigated. This paper compares the features yielded by large-scale pre-trained encoders with conventional heuristic features. One each of the largest pre-trained encoders publicly available for each modality are used; CLIP-ViT, WavLM, and BERT for visual, acoustic, and linguistic modalities, respectively. Experiments on two datasets reveal that methods with domain-specific pre-trained encoders attain better performance than those with conventional features in both unimodal and multimodal scenarios. We also find it better to use the outputs of the intermediate layers of the encoders than those of the output layer. The codes are available at https://github.com/ando-hub/MSA_Pretrain.
△ Less
Submitted 28 October, 2022;
originally announced October 2022.
-
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data
Authors:
Naoki Makishima,
Satoshi Suzuki,
Atsushi Ando,
Ryo Masumura
Abstract:
In this paper, we investigate the semi-supervised joint training of text to speech (TTS) and automatic speech recognition (ASR), where a small amount of paired data and a large amount of unpaired text data are available. Conventional studies form a cycle called the TTS-ASR pipeline, where the multispeaker TTS model synthesizes speech from text with a reference speech and the ASR model reconstructs…
▽ More
In this paper, we investigate the semi-supervised joint training of text to speech (TTS) and automatic speech recognition (ASR), where a small amount of paired data and a large amount of unpaired text data are available. Conventional studies form a cycle called the TTS-ASR pipeline, where the multispeaker TTS model synthesizes speech from text with a reference speech and the ASR model reconstructs the text from the synthesized speech, after which both models are trained with a cycle-consistency loss. However, the synthesized speech does not reflect the speaker characteristics of the reference speech and the synthesized speech becomes overly easy for the ASR model to recognize after training. This not only decreases the TTS model quality but also limits the ASR model improvement. To solve this problem, we propose improving the cycleconsistency-based training with a speaker consistency loss and step-wise optimization. The speaker consistency loss brings the speaker characteristics of the synthesized speech closer to that of the reference speech. In the step-wise optimization, we first freeze the parameter of the TTS model before both models are trained to avoid over-adaptation of the TTS model to the ASR model. Experimental results demonstrate the efficacy of the proposed method.
△ Less
Submitted 11 July, 2022;
originally announced July 2022.
-
A pseudospectral method for investigating the stability of linear population models with two physiological structures
Authors:
Alessia Andò,
Simone De Reggi,
Davide Liessi,
Francesca Scarabel
Abstract:
The asymptotic stability of the null equilibrium of a linear population model with two physiological structures formulated as a first-order hyperbolic PDE is determined by the spectrum of its infinitesimal generator. We propose an equivalent reformulation of the problem in the space of absolutely continuous functions in the sense of Carathéodory, so that the domain of the corresponding infinitesim…
▽ More
The asymptotic stability of the null equilibrium of a linear population model with two physiological structures formulated as a first-order hyperbolic PDE is determined by the spectrum of its infinitesimal generator. We propose an equivalent reformulation of the problem in the space of absolutely continuous functions in the sense of Carathéodory, so that the domain of the corresponding infinitesimal generator is defined by trivial boundary conditions. Via bivariate collocation, we discretize the reformulated operator as a finite-dimensional matrix, which can be used to approximate the spectrum of the original infinitesimal generator. Finally, we provide test examples illustrating the converging behavior of the approximated eigenvalues and eigenfunctions, and its dependence on the regularity of the model coefficients.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Convergence analysis of collocation methods for computing periodic solutions of retarded functional differential equations
Authors:
Alessia Andò,
Dimitri Breda
Abstract:
We analyze the convergence of piecewise collocation methods for computing periodic solutions of general retarded functional differential equations under the abstract framework recently developed in [S. Maset, Numer. Math. (2016) 133(3):525-555], [S. Maset, SIAM J. Numer. Anal. (2015) 53(6):2771--2793] and [S. Maset, SIAM J. Numer. Anal. (2015) 53(6):2794--2821]. We rigorously show that a reformula…
▽ More
We analyze the convergence of piecewise collocation methods for computing periodic solutions of general retarded functional differential equations under the abstract framework recently developed in [S. Maset, Numer. Math. (2016) 133(3):525-555], [S. Maset, SIAM J. Numer. Anal. (2015) 53(6):2771--2793] and [S. Maset, SIAM J. Numer. Anal. (2015) 53(6):2794--2821]. We rigorously show that a reformulation as a boundary value problem requires a proper infinite-dimensional boundary periodic condition in order to be amenable of such analysis. In this regard, we also highlight the role of the period acting as an unknown parameter, which is critical since it is directly linked to the course of time. Finally, we prove that the finite element method is convergent, while we limit ourselves to commenting on the infeasibility of this approach as far as the spectral element method is concerned.
△ Less
Submitted 26 November, 2020; v1 submitted 17 August, 2020;
originally announced August 2020.
-
Does the Lombard Effect Improve Emotional Communication in Noise? - Analysis of Emotional Speech Acted in Noise -
Authors:
Yi Zhao,
Atsushi Ando,
Shinji Takaki,
Junichi Yamagishi,
Satoshi Kobashikawa
Abstract:
Speakers usually adjust their way of talking in noisy environments involuntarily for effective communication. This adaptation is known as the Lombard effect. Although speech accompanying the Lombard effect can improve the intelligibility of a speaker's voice, the changes in acoustic features (e.g. fundamental frequency, speech intensity, and spectral tilt) caused by the Lombard effect may also aff…
▽ More
Speakers usually adjust their way of talking in noisy environments involuntarily for effective communication. This adaptation is known as the Lombard effect. Although speech accompanying the Lombard effect can improve the intelligibility of a speaker's voice, the changes in acoustic features (e.g. fundamental frequency, speech intensity, and spectral tilt) caused by the Lombard effect may also affect the listener's judgment of emotional content. To the best of our knowledge, there is no published study on the influence of the Lombard effect in emotional speech. Therefore, we recorded parallel emotional speech waveforms uttered by 12 speakers under both quiet and noisy conditions in a professional recording studio in order to explore how the Lombard effect interacts with emotional speech. By analyzing confusion matrices and acoustic features, we aim to answer the following questions: 1) Can speakers express their emotions correctly even under adverse conditions? 2) Can listeners recognize the emotion contained in speech signals even under noise? 3) How does emotional speech uttered in noise differ from emotional speech uttered in quiet conditions in terms of acoustic characteristic?
△ Less
Submitted 9 April, 2019; v1 submitted 28 March, 2019;
originally announced March 2019.
-
Ultrafast Dynamics of Electron-phonon Coupling in Transition-metal Dichalcogenides
Authors:
Kotaro Makino,
Yuta Saito,
Shuuto Horii,
Paul Fons,
Alexander V. Kolobov,
Atsushi Ando,
Keiji Ueno,
Richarj Mondal,
Muneaki Hase
Abstract:
Time-domain femtosecond laser spectroscopic measurements of the ultrafast lattice dynamics in 2H-MoTe2 bulk crystals were carried out to understand the carrier-phonon interactions that govern electronic transport properties. An unusually long lifetime coherent A1g phonon mode was observed even in the presence of very large density of photo-excited carriers at room temperature. The decay rate was o…
▽ More
Time-domain femtosecond laser spectroscopic measurements of the ultrafast lattice dynamics in 2H-MoTe2 bulk crystals were carried out to understand the carrier-phonon interactions that govern electronic transport properties. An unusually long lifetime coherent A1g phonon mode was observed even in the presence of very large density of photo-excited carriers at room temperature. The decay rate was observed to decrease with increasing excitation laser fluence. Based on the laser fluence dependence including the inducement of significant phonon softening and a peculiar decrease in phonon decay rate, we attribute the long lifetime lattice dynamics to weak anharmonic phonon-phonon coupling and a carrier-density-dependent deformation potential electron-phonon coupling.
△ Less
Submitted 27 July, 2018;
originally announced July 2018.
-
Measurement and comparison of individual external doses of high-school students living in Japan, France, Poland and Belarus -- the "D-shuttle" project --
Authors:
N. Adachi,
V. Adamovitch,
Y. Adjovi,
K. Aida,
H. Akamatsu,
S. Akiyama,
A. Akli,
A. Ando,
T. Andrault,
H. Antonietti,
S. Anzai,
G. Arkoun,
C. Avenoso,
D. Ayrault,
M. Banasiewicz,
M. Banaśkiewicz,
L. Bernandini,
E. Bernard,
E. Berthet,
M. Blanchard,
D. Boreyko,
K. Boros,
S. Charron,
P. Cornette,
K. Czerkas
, et al. (208 additional authors not shown)
Abstract:
Twelve high schools in Japan (of which six are in Fukushima Prefecture), four in France, eight in Poland and two in Belarus cooperated in the measurement and comparison of individual external doses in 2014. In total 216 high-school students and teachers participated in the study. Each participant wore an electronic personal dosimeter "D-shuttle" for two weeks, and kept a journal of his/her whereab…
▽ More
Twelve high schools in Japan (of which six are in Fukushima Prefecture), four in France, eight in Poland and two in Belarus cooperated in the measurement and comparison of individual external doses in 2014. In total 216 high-school students and teachers participated in the study. Each participant wore an electronic personal dosimeter "D-shuttle" for two weeks, and kept a journal of his/her whereabouts and activities. The distributions of annual external doses estimated for each region overlap with each other, demonstrating that the personal external individual doses in locations where residence is currently allowed in Fukushima Prefecture and in Belarus are well within the range of estimated annual doses due to the background radiation level of other regions/countries.
△ Less
Submitted 18 November, 2015; v1 submitted 21 June, 2015;
originally announced June 2015.
-
Enhancement of phonon effects in photoexcited states of one-dimensional Mott insulators
Authors:
Hiroaki Matsueda,
Akihiro Ando,
Takami Tohyama,
Sadamichi Maekawa
Abstract:
We examine how the electron correlation affects the electron-phonon (EP) interaction in the linear optical absorption spectrum of the one-dimensional (1D) extended Hubbard-Holstein model. A density matrix renormalization group (DMRG) calculation shows that the effect of the EP interaction on an exciton is enhanced by increasing the on-site Coulomb repulsion. This enhancement is in contrast to th…
▽ More
We examine how the electron correlation affects the electron-phonon (EP) interaction in the linear optical absorption spectrum of the one-dimensional (1D) extended Hubbard-Holstein model. A density matrix renormalization group (DMRG) calculation shows that the effect of the EP interaction on an exciton is enhanced by increasing the on-site Coulomb repulsion. This enhancement is in contrast to the effect of the EP interaction on the ground state where the Peierls instability is suppressed by the on-site Coulomb repulsion. The DMRG data with the EP interaction fit with absorption experiments in 1D cuprates better than those for the extended Hubbard model.
△ Less
Submitted 27 February, 2008;
originally announced February 2008.
-
Development of supersonic plasma flows by use of a magnetic nozzle and an ICRF heating
Authors:
M. Inutake,
A. Ando,
K. Hattori,
H. Tobari,
Y. Hosokawa,
R. Sato,
M. Hatanaka,
K. Harata
Abstract:
A high-beta, supersonic plasma flow plays a crucial role in MHD phenomena in space and fusion plasmas. There are a few experimental researches on production and control of a fast flowing plasma in spite of a growing significance in the magnetized-plasma flow dynamics. A magneto-plasma-dynamic arcjet (MPDA) is one of promising devices to produce a supersonic plasma flow and has been utilized as a…
▽ More
A high-beta, supersonic plasma flow plays a crucial role in MHD phenomena in space and fusion plasmas. There are a few experimental researches on production and control of a fast flowing plasma in spite of a growing significance in the magnetized-plasma flow dynamics. A magneto-plasma-dynamic arcjet (MPDA) is one of promising devices to produce a supersonic plasma flow and has been utilized as an electric propulsion device with a higher specific impulse and a relatively larger thrust. We have improved the performance of an MPDA to produce a quasi-steady plasma flow with a transonic and supersonic Mach number in a highly-ionized state. There are two methods in order to control an ion-acoustic Mach number of the plasma flow exhausted from an MPDA: one is to use a magnetic Laval nozzle to convert a thermal energy to a flow energy and the other is a combined system of an ion heating and a divergent magnetic nozzle. The former is an analogous method to a compressible air flow and the latter is the method proposed in an advanced thruster for a manned interplanetary space mission. We have clarified the plasma flow characteristics in various shapes of a magnetic field configuration. It was demonstrated that the Mach number of the plasma flow could increase up to almost 3 in a divergent magnetic nozzle field. This paper reports recent results on the flow field improvements: one is on a magnetic-Laval-nozzle effects observed at the muzzle region of the MPDA, and the other is on ICRF (ion-cyclotron-range of frequency) heating of a supersonic plasma by use of a helical antenna.
△ Less
Submitted 22 October, 2004;
originally announced October 2004.