Search | arXiv e-print repository

arXiv:2406.07250 [pdf, other]

Description and Discussion on DCASE 2024 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Authors: Tomoya Nishida, Noboru Harada, Daisuke Niizumi, Davide Albertini, Roberto Sannino, Simone Pradolini, Filippo Augusti, Keisuke Imoto, Kota Dohi, Harsh Purohit, Takashi Endo, Yohei Kawaguchi

Abstract: We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2024 Challenge Task 2: First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring. Continuing from last year's DCASE 2023 Challenge Task 2, we organize the task as a first-shot problem under domain generalization required settings. The main goal of the first-shot… ▽ More We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2024 Challenge Task 2: First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring. Continuing from last year's DCASE 2023 Challenge Task 2, we organize the task as a first-shot problem under domain generalization required settings. The main goal of the first-shot problem is to enable rapid deployment of ASD systems for new kinds of machines without the need for machine-specific hyperparameter tunings. This problem setting was realized by (1) giving only one section for each machine type and (2) having completely different machine types for the development and evaluation datasets. For the DCASE 2024 Challenge Task 2, data of completely new machine types were newly collected and provided as the evaluation dataset. In addition, attribute information such as the machine operation conditions were concealed for several machine types to mimic situations where such information are unavailable. We will add challenge results and analysis of the submissions after the challenge submission deadline. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: anomaly detection, acoustic condition monitoring, domain shift, first-shot problem, DCASE Challenge. arXiv admin note: text overlap with arXiv:2305.07828

arXiv:2406.02032 [pdf, other]

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Masahiro Yasuda, Shunsuke Tsubaki, Keisuke Imoto

Abstract: Contrastive language-audio pre-training (CLAP) enables zero-shot (ZS) inference of audio and exhibits promising performance in several classification tasks. However, conventional audio representations are still crucial for many tasks where ZS is not applicable (e.g., regression problems). Here, we explore a new representation, a general-purpose audio-language representation, that performs well in… ▽ More Contrastive language-audio pre-training (CLAP) enables zero-shot (ZS) inference of audio and exhibits promising performance in several classification tasks. However, conventional audio representations are still crucial for many tasks where ZS is not applicable (e.g., regression problems). Here, we explore a new representation, a general-purpose audio-language representation, that performs well in both ZS and transfer learning. To do so, we propose a new method, M2D-CLAP, which combines self-supervised learning Masked Modeling Duo (M2D) and CLAP. M2D learns an effective representation to model audio signals, and CLAP aligns the representation with text embedding. As a result, M2D-CLAP learns a versatile representation that allows for both ZS and transfer learning. Experiments show that M2D-CLAP performs well on linear evaluation, fine-tuning, and ZS classification with a GTZAN state-of-the-art of 75.17%, thus achieving a general-purpose audio-language representation. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 5 pages, 1 figure, 5 tables. Accepted by Interspeech 2024

MSC Class: 68T07

arXiv:2405.09029 [pdf, other]

A temperature or FUV tracer? The HNC/HCN ratio in M83 on the GMC scale

Authors: Nanase Harada, Toshiki Saito, Yuri Nishimura, Yoshimasa Watanabe, Kazushi Sakamoto

Abstract: The HNC/HCN ratio is observationally known as a thermometer in Galactic interstellar molecular clouds. A recent study has alternatively suggested that the HNC/HCN ratio is affected by the ultraviolet (UV) field, not by the temperature. We aim to study this ratio on the scale of giant molecular clouds in the barred spiral galaxy M83 towards the southwestern bar end and the central region from ALMA… ▽ More The HNC/HCN ratio is observationally known as a thermometer in Galactic interstellar molecular clouds. A recent study has alternatively suggested that the HNC/HCN ratio is affected by the ultraviolet (UV) field, not by the temperature. We aim to study this ratio on the scale of giant molecular clouds in the barred spiral galaxy M83 towards the southwestern bar end and the central region from ALMA observations, and if possible, distinguish the above scenarios. We compare the high (40-50 pc) resolution HNC/HCN ratios with the star formation rate from the 3-mm continuum intensity and the molecular mass inferred from the HCN intensities. Our results show that the HNC/HCN ratios do not vary with the star formation rates, star formation efficiencies, or column densities in the bar-end region. In the central region, the HNC/HCN ratios become higher with higher star formation rates, which tend to cause higher temperatures. This result is not consistent with the previously proposed scenario in which the HNC/HCN ratio decreases with increasing temperature. Spectral shapes suggest that this trend may be due to optically thick HCN and optically thin HNC. In addition, we compare the large-scale ($\sim 200$ pc) correlation between the dust temperature from the FIR ratio and the HNC/HCN ratio for the southwestern bar-end region. The HNC/HCN ratio is lower when the dust temperatures are higher. We suggest from the above results that the HNC/HCN ratio depends on the UV radiation field that affects the interstellar medium on the $\sim100\,$pc scale where the column densities are low. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: 17 pages, 9 figures. Accepted for publication in ApJ

arXiv:2405.08408 [pdf, other]

An ALCHEMI inspection of sulphur-bearing species towards the central molecular zone of NGC 253

Authors: M. Bouvier, S. Viti, E. Behrens, J. Butterworth, K. -Y. Huang, J. G. Mangum, N. Harada, S. Martín, V. M. Rivilla, S. Muller, K. Sakamoto, Y. Yoshimura, K. Tanaka, K. Nakanishi, R. Herrero-Illana, L. Colzi, M. D. Gorski, C. Henkel, P. K. Humire, D. S. Meier, P. P. van der Werf, Y. T. Yan

Abstract: Sulphur-bearing species are detected in various environments within Galactic star-forming regions and are particularly abundant in the gas phase of outflows and shocks, and photo-dissociation regions. In this work, we aim to investigate the nature of the emission from the most common sulphur-bearing species observable at millimetre wavelengths towards the nuclear starburst of the galaxy NGC 253. W… ▽ More Sulphur-bearing species are detected in various environments within Galactic star-forming regions and are particularly abundant in the gas phase of outflows and shocks, and photo-dissociation regions. In this work, we aim to investigate the nature of the emission from the most common sulphur-bearing species observable at millimetre wavelengths towards the nuclear starburst of the galaxy NGC 253. We intend to understand which type of regions are probed by sulphur-bearing species and which process(es) dominate(s) the release of sulphur into the gas phase. We used the high-angular resolution (1.6" or 27 pc) observations from the ALCHEMI ALMA Large Program to image several sulphur-bearing species towards the central molecular zone (CMZ) of NGC 253. We performed local thermodynamic equilibrium (LTE) and non-LTE large velocity gradient (LVG) analyses to derive the physical conditions of the gas in which S-bearing species are emitted, and their abundance ratios across the CMZ. Finally, we compared our results with previous ALCHEMI studies and a few selected Galactic environments. We found that not all sulphur-bearing species trace the same type of gas: strong evidence indicates that H2S and part of the emission of OCS, H2CS, and SO, are tracing shocks whilst part of SO and CS emission rather trace the dense molecular gas. For some species, such as CCS and SO2, we could not firmly conclude on their origin of emission. The present analysis indicates that the emission from most sulphur-bearing species throughout the CMZ is likely dominated by shocks associated with ongoing star formation. In the inner part of the CMZ where the presence of super star clusters was previously indicated, we could not distinguish between shocks or thermal evaporation as the main process releasing the S-bearing species. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: 44 pages, 20 figures, Accepted for publication in A&A

arXiv:2404.19531 [pdf, other]

MoST: Multi-modality Scene Tokenization for Motion Prediction

Authors: Norman Mu, **gwei Ji, Zhenpei Yang, Nate Harada, Haotian Tang, Kan Chen, Charles R. Qi, Runzhou Ge, Kratarth Goel, Zoey Yang, Scott Ettinger, Rami Al-Rfou, Dragomir Anguelov, Yin Zhou

Abstract: Many existing motion prediction approaches rely on symbolic perception outputs to generate agent trajectories, such as bounding boxes, road graph information and traffic lights. This symbolic representation is a high-level abstraction of the real world, which may render the motion prediction model vulnerable to perception errors (e.g., failures in detecting open-vocabulary obstacles) while missing… ▽ More Many existing motion prediction approaches rely on symbolic perception outputs to generate agent trajectories, such as bounding boxes, road graph information and traffic lights. This symbolic representation is a high-level abstraction of the real world, which may render the motion prediction model vulnerable to perception errors (e.g., failures in detecting open-vocabulary obstacles) while missing salient information from the scene context (e.g., poor road conditions). An alternative paradigm is end-to-end learning from raw sensors. However, this approach suffers from the lack of interpretability and requires significantly more training resources. In this work, we propose tokenizing the visual world into a compact set of scene elements and then leveraging pre-trained image foundation models and LiDAR neural networks to encode all the scene elements in an open-vocabulary manner. The image foundation model enables our scene tokens to encode the general knowledge of the open world while the LiDAR neural network encodes geometry information. Our proposed representation can efficiently encode the multi-frame multi-modality observations with a few hundred tokens and is compatible with most transformer-based architectures. To evaluate our method, we have augmented Waymo Open Motion Dataset with camera embeddings. Experiments over Waymo Open Motion Dataset show that our approach leads to significant performance improvements over the state-of-the-art. △ Less

Submitted 30 April, 2024; originally announced April 2024.

Comments: CVPR 2024

arXiv:2404.17107 [pdf, other]

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Abstract: To reduce the need for skilled clinicians in heart sound interpretation, recent studies on automating cardiac auscultation have explored deep learning approaches. However, despite the demands for large data for deep learning, the size of the heart sound datasets is limited, and no pre-trained model is available. On the contrary, many pre-trained models for general audio tasks are available as gene… ▽ More To reduce the need for skilled clinicians in heart sound interpretation, recent studies on automating cardiac auscultation have explored deep learning approaches. However, despite the demands for large data for deep learning, the size of the heart sound datasets is limited, and no pre-trained model is available. On the contrary, many pre-trained models for general audio tasks are available as general-purpose audio representations. This study explores the potential of general-purpose audio representations pre-trained on large-scale datasets for transfer learning in heart murmur detection. Experiments on the CirCor DigiScope heart sound dataset show that the recent self-supervised learning Masked Modeling Duo (M2D) outperforms previous methods with the results of a weighted accuracy of 0.832 and an unweighted average recall of 0.713. Experiments further confirm improved performance by ensembling M2D with other models. These results demonstrate the effectiveness of general-purpose audio representation in processing heart sounds and open the way for further applications. Our code is available online which runs on a 24 GB consumer GPU at https://github.com/nttcslab/m2d/tree/master/app/circor △ Less

Submitted 25 April, 2024; originally announced April 2024.

Comments: 4 pages, 1 figure, and 4 tables. Accepted by IEEE EMBC 2024

MSC Class: 68T07

arXiv:2404.11113 [pdf, other]

Internal 1000 AU-scale Structures of the R CrA Cluster-forming Cloud -- I: Filamentary Structures

Authors: Kengo Tachihara, Naofumi Fukaya, Kazuki Tokuda, Yasumasa Yamasaki, Takeru Nishioka, Daisei Abe, Tsuyoshi Inoue, Naoto Harada, Ayumu Shoshi, Shingo Nozaki, Asako Sato, Mitsuki Omura, Kakeru Fujishiro, Misato Fukagawa, Masahiro N. Machida, Takahiro Kanai, Yumiko Oasa, Toshikazu Onishi, Kazuya Saigo, Yasuo Fukui

Abstract: We report on ALMA ACA observations of a high-density region of the Corona Australis cloud forming a young star cluster, and the results of resolving internal structures. In addition to embedded Class 0/I protostars in continuum, a number of complex dense filamentary structures are detected in the C18O and SO lines by the 7m array. These are sub-structures of the molecular clump that are detected b… ▽ More We report on ALMA ACA observations of a high-density region of the Corona Australis cloud forming a young star cluster, and the results of resolving internal structures. In addition to embedded Class 0/I protostars in continuum, a number of complex dense filamentary structures are detected in the C18O and SO lines by the 7m array. These are sub-structures of the molecular clump that are detected by the TP array as the extended emission. We identify 101 and 37 filamentary structures with a few thousand AU widths in C18O and SO, respectively, called as feathers. The typical column density of the feathers in C18O is about 10^{22} cm^{-2}, and the volume density and line mass are ~ 10^5 cm^{-3}, and a few times M_{sun} pc^{-1}, respectively. This line mass is significantly smaller than the critical line mass expected for cold and dense gas. These structures have complex velocity fields, indicating a turbulent internal property. The number of feathers associated with Class 0/I protostars is only ~ 10, indicating that most of them do not form stars but rather being transient structures. The formation of feathers can be interpreted as a result of colliding gas flow as the morphology well reproduced by MHD simulations, supported by the the presence of HI shells in the vicinity. The colliding gas flows may accumulate gas and form filaments and feathers, and trigger the active star formation of the R CrA cluster. △ Less

Submitted 17 April, 2024; originally announced April 2024.

Comments: 24 pages, 13 figures; Accepted for publication in ApJ

arXiv:2404.08264 [pdf, other]

Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis

Authors: Masahiro Yasuda, Noboru Harada, Yasunori Ohishi, Shoichiro Saito, Akira Nakayama, Nobutaka Ono

Abstract: Observations with distributed sensors are essential in analyzing a series of human and machine activities (referred to as 'events' in this paper) in complex and extensive real-world environments. This is because the information obtained from a single sensor is often missing or fragmented in such an environment; observations from multiple locations and modalities should be integrated to analyze eve… ▽ More Observations with distributed sensors are essential in analyzing a series of human and machine activities (referred to as 'events' in this paper) in complex and extensive real-world environments. This is because the information obtained from a single sensor is often missing or fragmented in such an environment; observations from multiple locations and modalities should be integrated to analyze events comprehensively. However, a learning method has yet to be established to extract joint representations that effectively combine such distributed observations. Therefore, we propose Guided Masked sELf-Distillation modeling (Guided-MELD) for inter-sensor relationship modeling. The basic idea of Guided-MELD is to learn to supplement the information from the masked sensor with information from other sensors needed to detect the event. Guided-MELD is expected to enable the system to effectively distill the fragmented or redundant target event information obtained by the sensors without being overly dependent on any specific sensors. To validate the effectiveness of the proposed method in novel tasks of distributed multimedia sensor event analysis, we recorded two new datasets that fit the problem setting: MM-Store and MM-Office. These datasets consist of human activities in a convenience store and an office, recorded using distributed cameras and microphones. Experimental results on these datasets show that the proposed Guided-MELD improves event tagging and detection performance and outperforms conventional inter-sensor relationship modeling methods. Furthermore, the proposed method performed robustly even when sensors were reduced. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: 13page, 7figure, under review

arXiv:2404.06095 [pdf, other]

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Abstract: Self-supervised learning (SSL) using masked prediction has made great strides in general-purpose audio representation. This study proposes Masked Modeling Duo (M2D), an improved masked prediction SSL, which learns by predicting representations of masked input signals that serve as training signals. Unlike conventional methods, M2D obtains a training signal by encoding only the masked part, encoura… ▽ More Self-supervised learning (SSL) using masked prediction has made great strides in general-purpose audio representation. This study proposes Masked Modeling Duo (M2D), an improved masked prediction SSL, which learns by predicting representations of masked input signals that serve as training signals. Unlike conventional methods, M2D obtains a training signal by encoding only the masked part, encouraging the two networks in M2D to model the input. While M2D improves general-purpose audio representations, a specialized representation is essential for real-world applications, such as in industrial and medical domains. The often confidential and proprietary data in such domains is typically limited in size and has a different distribution from that in pre-training datasets. Therefore, we propose M2D for X (M2D-X), which extends M2D to enable the pre-training of specialized representations for an application X. M2D-X learns from M2D and an additional task and inputs background noise. We make the additional task configurable to serve diverse applications, while the background noise helps learn on small data and forms a denoising task that makes representation robust. With these design choices, M2D-X should learn a representation specialized to serve various application needs. Our experiments confirmed that the representations for general-purpose audio, specialized for the highly competitive AudioSet and speech domain, and a small-data medical task achieve top-level performance, demonstrating the potential of using our models as a universal audio pre-training framework. Our code is available online for future studies at https://github.com/nttcslab/m2d △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: 15 pages, 6 figures, 15 tables. Accepted by TASLP

MSC Class: 68T07

arXiv:2404.04791 [pdf, other]

Physical Properties of the Southwest Outflow Streamer in the Starburst Galaxy NGC 253 with ALCHEMI

Authors: Min Bao, Nanase Harada, Kotaro Kohno, Yuki Yoshimura, Fumi Egusa, Yuri Nishimura, Kunihiko Tanaka, Kouichiro Nakanishi, Sergio Martín, Jeffrey G. Mangum, Kazushi Sakamoto, Sébastien Muller, Mathilde Bouvier, Laura Colzi, Kimberly L. Emig, David S. Meier, Christian Henkel, Pedro Humire, Ko-Yun Huang, Víctor M. Rivilla, Paul van der Werf, Serena Viti

Abstract: The physical properties of galactic molecular outflows are important as they could constrain outflow formation mechanisms. We study the properties of the southwest (SW) outflow streamer including gas kinematics, optical depth, dense gas fraction, and shock strength in the central molecular zone of the starburst galaxy NGC 253. We image the molecular emission at a spatial resolution of $\sim$27 pc… ▽ More The physical properties of galactic molecular outflows are important as they could constrain outflow formation mechanisms. We study the properties of the southwest (SW) outflow streamer including gas kinematics, optical depth, dense gas fraction, and shock strength in the central molecular zone of the starburst galaxy NGC 253. We image the molecular emission at a spatial resolution of $\sim$27 pc based on data from the ALCHEMI program. We trace the kinematics of molecular gas with CO(1-0) line. We constrain the optical depth of CO emission with CO/$^{13}$CO(1-0) ratio, the dense gas fraction with HCN/CO(1-0) ratio, as well as the shock strength with SiO(2-1)/$^{13}$CO(1-0) ratio. The CO/$^{13}$CO(1-0) integrated intensity ratio is $\sim$21 in the SW streamer region, which approximates the C/$^{13}$C isotopic abundance ratio. The higher integrated intensity ratio compared to the disk can be attributed to the optically thinner environment for CO(1-0) emission inside the SW streamer. The HCN/CO(1-0) and SiO(2-1)/$^{13}$CO(1-0) integrated intensity ratios both approach $\sim$0.2 in three giant molecular clouds (GMCs) at the base of the outflow streamers, which implies the higher dense gas fraction and enhanced strength of fast shocks in those GMCs than in the disk. The contours of those two integrated intensity ratios are extended towards the directions of outflow streamers, which connects the enhanced dense gas fraction and shock strength with molecular outflow. Moreover, the molecular gas with enhanced dense gas fraction and shock strength located at the base of the SW streamer shares the same velocity with the outflow. These phenomena suggest that the star formation inside the GMCs can trigger the shocks and further drive the molecular outflow. △ Less

Submitted 6 April, 2024; originally announced April 2024.

Comments: Accepted for publication in A&A, 14 pages, 11 figures

arXiv:2403.16759 [pdf, other]

doi 10.1051/0004-6361/202348821

A spectacular galactic scale magnetohydrodynamic powered wind in ESO 320-G030

Authors: M. D. Gorski, S. Aalto, S. König, C. F. Wethers, C. Yang, S. Muller, K. Onishi, M. Sato, N. Falstad, Jeffrey G. Mangum, S. T. Linden, F. Combes, S. Martín, M. Imanishi, Keiichi Wada, L. Barcos-Muñoz, F. Stanley, S. García-Burillo, P. P. van der Werf, A. S. Evans, C. Henkel, S. Viti, N. Harada, T. Díaz-Santos, J. S. Gallagher , et al. (1 additional authors not shown)

Abstract: How galaxies regulate nuclear growth through gas accretion by supermassive black holes (SMBHs) is one of the most fundamental questions in galaxy evolution. One potential way to regulate nuclear growth is through a galactic wind that removes gas from the nucleus. It is unclear whether galactic winds are powered by jets, mechanical winds, radiation, or via magnetohydrodynamic (MHD) processes. Compa… ▽ More How galaxies regulate nuclear growth through gas accretion by supermassive black holes (SMBHs) is one of the most fundamental questions in galaxy evolution. One potential way to regulate nuclear growth is through a galactic wind that removes gas from the nucleus. It is unclear whether galactic winds are powered by jets, mechanical winds, radiation, or via magnetohydrodynamic (MHD) processes. Compact obscured nuclei (CONs) represent a significant phase of galactic nuclear growth. These galaxies hide growing SMBHs or unusual starbursts in their very opaque, extremely compact (r $<$ 100 pc) centres. They are found in approximately 30 % of the luminous and ultra-luminous infrared galaxy (LIRG and ULIRG) population. Here, we present high-resolution ALMA observations ($\sim$30 mas, $\sim$5 pc) of ground-state and vibrationally excited HCN towards ESO 320-G030 (IRAS 11506-3851). ESO 320-G030 is an isolated luminous infrared galaxy known to host a compact obscured nucleus and a kiloparsec-scale molecular wind. Our analysis of these high-resolution observations excludes the possibility of a starburst-driven wind, a mechanically or energy driven active galactic nucleus (AGN) wind, and exposes a molecular MDH wind. These results imply that the nuclear evolution of galaxies and the growth of SMBHs are similar to the growth of hot cores or protostars where gravitational collapse of the nuclear torus drives a MHD wind. These results mean galaxies are capable, in part, of regulating the evolution of their nuclei without feedback. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: 9 pages, 10 figures

Journal ref: A&A 684, L11 (2024)

arXiv:2403.10756 [pdf, other]

Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval

Authors: Shunsuke Tsubaki, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Keisuke Imoto

Abstract: The aim of this research is to refine knowledge transfer on audio-image temporal agreement for audio-text cross retrieval. To address the limited availability of paired non-speech audio-text data, learning methods for transferring the knowledge acquired from a large amount of paired audio-image data to shared audio-text representation have been investigated, suggesting the importance of how audio-… ▽ More The aim of this research is to refine knowledge transfer on audio-image temporal agreement for audio-text cross retrieval. To address the limited availability of paired non-speech audio-text data, learning methods for transferring the knowledge acquired from a large amount of paired audio-image data to shared audio-text representation have been investigated, suggesting the importance of how audio-image co-occurrence is learned. Conventional approaches in audio-image learning assign a single image randomly selected from the corresponding video stream to the entire audio clip, assuming their co-occurrence. However, this method may not accurately capture the temporal agreement between the target audio and image because a single image can only represent a snapshot of a scene, though the target audio changes from moment to moment. To address this problem, we propose two methods for audio and image matching that effectively capture the temporal information: (i) Nearest Match wherein an image is selected from multiple time frames based on similarity with audio, and (ii) Multiframe Match wherein audio and image pairs of multiple time frames are used. Experimental results show that method (i) improves the audio-text retrieval performance by selecting the nearest image that aligns with the audio information and transferring the learned knowledge. Conversely, method (ii) improves the performance of audio-image retrieval while not showing significant improvements in audio-text retrieval performance. These results indicate that refining audio-image temporal agreement may contribute to better knowledge transfer to audio-text retrieval. △ Less

Submitted 15 March, 2024; originally announced March 2024.

Comments: Submitted to EUSIPCO2024

arXiv:2403.01670 [pdf, other]

6DoF SELD: Sound Event Localization and Detection Using Microphones and Motion Tracking Sensors on self-motioning human

Authors: Masahiro Yasuda, Shoichiro Saito, Akira Nakayama, Noboru Harada

Abstract: We aim to perform sound event localization and detection (SELD) using wearable equipment for a moving human, such as a pedestrian. Conventional SELD tasks have dealt only with microphone arrays located in static positions. However, self-motion with three rotational and three translational degrees of freedom (6DoF) shall be considered for wearable microphone arrays. A system trained only with a dat… ▽ More We aim to perform sound event localization and detection (SELD) using wearable equipment for a moving human, such as a pedestrian. Conventional SELD tasks have dealt only with microphone arrays located in static positions. However, self-motion with three rotational and three translational degrees of freedom (6DoF) shall be considered for wearable microphone arrays. A system trained only with a dataset using microphone arrays in a fixed position would be unable to adapt to the fast relative motion of sound events associated with self-motion, resulting in the degradation of SELD performance. To address this, we designed 6DoF SELD Dataset for wearable systems, the first SELD dataset considering the self-motion of microphones. Furthermore, we proposed a multi-modal SELD system that jointly utilizes audio and motion tracking sensor signals. These sensor signals are expected to help the system find useful acoustic cues for SELD on the basis of the current self-motion state. Experimental results on our dataset show that the proposed method effectively improves SELD performance with a mechanism to extract acoustic features conditioned by sensor signals. △ Less

Submitted 3 March, 2024; originally announced March 2024.

Comments: ICASSP2024 accepted

arXiv:2403.00305 [pdf, other]

doi 10.3847/1538-4357/ad2f9a

Discovery of Asymmetric Spike-like Structures of the 10 au Disk around the Very Low-luminosity Protostar Embedded in the Taurus Dense Core MC 27/L1521F with ALMA

Authors: Kazuki Tokuda, Naoto Harada, Mitsuki Omura, Tomoaki Matsumoto, Toshikazu Onishi, Kazuya Saigo, Ayumu Shoshi, Shingo Nozaki, Kengo Tachihara, Naofumi Fukaya, Yasuo Fukui, Shu-ichiro Inutsuka, Masahiro N. Machida

Abstract: Recent Atacama Large Millimeter/submillimeter Array (ALMA) observations have revealed an increasing number of compact protostellar disks with radii of less than a few tens of astronomical units and that young Class 0/I objects have an intrinsic size diversity. To deepen our understanding of the origin of such tiny disks, we performed the highest-resolution configuration observations with ALMA at a… ▽ More Recent Atacama Large Millimeter/submillimeter Array (ALMA) observations have revealed an increasing number of compact protostellar disks with radii of less than a few tens of astronomical units and that young Class 0/I objects have an intrinsic size diversity. To deepen our understanding of the origin of such tiny disks, we performed the highest-resolution configuration observations with ALMA at a beam size of $\sim$0$''$03 (4 au) on the very low-luminosity Class 0 protostar embedded in the Taurus dense core MC 27/L1521F. The 1.3 mm continuum measurement successfully resolved a tiny, faint ($\sim$1 mJy) disk with a major axis length of $\sim$10 au, one of the smallest examples in the ALMA protostellar studies. In addition, we detected spike-like components in the northeastern direction at the disk edge. Gravitational instability or other fragmentation mechanisms cannot explain the structures, given the central stellar mass of $\sim$0.2 $M_{\odot}$ and the disk mass of $\gtrsim$10$^{-4}$ $M_{\odot}$. Instead, we propose that these small spike structures were formed by a recent dynamic magnetic flux transport event due to interchange instability that would be favorable to occur if the parental core has a strong magnetic field. The presence of complex arc-like structures on a larger ($\sim$2000 au) scale in the same direction as the spike structures suggests that the event was not single. Such episodic, dynamical events may play an important role in maintaining the compact nature of the protostellar disk in the complex gas envelope during the main accretion phase. △ Less

Submitted 3 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: 13 pages, 5 figures, Accepted for publication in ApJ

arXiv:2402.15436 [pdf, other]

doi 10.1051/0004-6361/202348331

CON-quest II. Spatially and spectrally resolved HCN/HCO+ line ratios in local luminous and ultraluminous infrared galaxies

Authors: Y. Nishimura, S. Aalto, M. D. Gorski, S. König, K. Onishi, C. Wethers, C. Yang, L. Barcos-Muñoz, F. Combes, T. Díaz-Santos, J. S. Gallagher, S. García-Burillo, E. González-Alfonso, T. R. Greve, N. Harada, C. Henkel, M. Imanishi, K. Kohno, S. T. Linden, J. G. Mangum, S. Martín, S. Muller, G. C. Privon, C. Ricci, F. Stanley , et al. (2 additional authors not shown)

Abstract: Nuclear regions of ultraluminous and luminous infrared galaxies (U/LIRGs) are powered by starbursts and/or active galactic nuclei (AGNs). These regions are often obscured by extremely high columns of gas and dust. Molecular lines in the submillimeter windows have the potential to determine the physical conditions of these compact obscured nuclei (CONs). We aim to reveal the distributions of HCN an… ▽ More Nuclear regions of ultraluminous and luminous infrared galaxies (U/LIRGs) are powered by starbursts and/or active galactic nuclei (AGNs). These regions are often obscured by extremely high columns of gas and dust. Molecular lines in the submillimeter windows have the potential to determine the physical conditions of these compact obscured nuclei (CONs). We aim to reveal the distributions of HCN and HCO$^+$ emission in local U/LIRGs and investigate whether and how they are related to galaxy properties. Using ALMA, we have conducted sensitive observations of the HCN J=3--2 and HCO$^+$ J=3--2 lines toward 23 U/LIRGs in the local Universe (z < 0.07) with a spatial resolution of ~0.3" (~50--400 pc). We detected both HCN and HCO$^+$ in 21 galaxies, only HCN in one galaxy, and neither in one galaxy. The global HCN/HCO$^+$ line ratios, averaged over scales of ~0.5--4 kpc, range from 0.4 to 2.3, with an unweighted mean of 1.1. These line ratios appear to have no systematic trend with bolometric AGN luminosity or star formation rate. The line ratio varies with position and velocity within each galaxy, with an average interquartile range of 0.38 on a spaxel-by-spaxel basis. In eight out of ten galaxies known to have outflows and/or inflows, we found spatially and kinematically symmetric structures of high line ratios. These structures appear as a collimated bicone in two galaxies and as a thin spherical shell in six galaxies. Non-LTE analysis suggests that the high HCN/HCO$^+$ line ratio in outflows is predominantly influenced by the abundance ratio. Chemical model calculations indicate that the enhancement of HCN abundance in outflows is likely due to high-temperature chemistry triggered by shock heating. These results imply that the HCN/HCO$^+$ line ratio can aid in identifying the outflow geometry when the shock velocity of the outflows is sufficiently high to heat the gas. △ Less

Submitted 25 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

Comments: 52 pages, 35 figures, accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 686, A48 (2024)

arXiv:2402.14445 [pdf]

Rare-earth doped yttrium silicate (Y2SiO5) thin films grown by chemical vapour deposition for quantum technologies

Authors: Suma Al-Hunaishi, Anna Blin, Nao Harada, Pauline Perrin, Philippe Goldner, Diana Serrano, Alexandre Tallaire

Abstract: Yttrium orthosilicate (Y2SiO5 - YSO) is one of the most promising crystals to host rare-earth (RE) ions for quantum technologies applications. In this matrix, they indeed exhibit narrow optical and spin linewidths that can be exploited to develop quantum memories or quantum information processing capabilities. In this paper, we propose a new method to grow RE doped silicate thin films on silicon w… ▽ More Yttrium orthosilicate (Y2SiO5 - YSO) is one of the most promising crystals to host rare-earth (RE) ions for quantum technologies applications. In this matrix, they indeed exhibit narrow optical and spin linewidths that can be exploited to develop quantum memories or quantum information processing capabilities. In this paper, we propose a new method to grow RE doped silicate thin films on silicon wafers based on direct liquid injection chemical vapour deposition (DLI-CVD). We optimize the deposition and annealing conditions to achieve formation of the high temperature X2-YSO phase. The phase purity and crystalline quality of the films are assessed by evaluating the optical properties of Eu3+ ions embedded in this oxide matrix. In view of the results, we discuss the possible phase formation mechanisms, and the potential of this new wafer-compatible form of YSO for quantum technologies applications. △ Less

Submitted 22 February, 2024; originally announced February 2024.

Comments: 14 pages, 7 figures

arXiv:2402.10721 [pdf, other]

doi 10.1051/0004-6361/202348787

Molecular isotopologue measurements toward super star clusters and the relation to their ages in NGC253 with ALCHEMI

Authors: J. Butterworth, S. Viti, P. P. Van der Werf, J. G. Mangum, S. Martín, N. Harada, K. L. Emig, S. Muller, K. Sakamoto, Y. Yoshimura, K. Tanaka, R. Herrero-Illana, L. Colzi, V. M. Rivilla, K. Y. Huang, M. Bouvier, E. Behrens, C. Henkel, Y. T. Yan, D. S. Meier, D. Zhou

Abstract: Determining the evolution of the CNO isotopes in the interstellar medium (ISM) of starburst galaxies can yield important constraints on the ages of superstar clusters (SSCs), or on other aspects and contributing factors of their evolution. Due to the time-dependent nature of the abundances of isotopes within the ISM as they are supplied from processes such as nucleosynthesis or chemical fractionat… ▽ More Determining the evolution of the CNO isotopes in the interstellar medium (ISM) of starburst galaxies can yield important constraints on the ages of superstar clusters (SSCs), or on other aspects and contributing factors of their evolution. Due to the time-dependent nature of the abundances of isotopes within the ISM as they are supplied from processes such as nucleosynthesis or chemical fractionation, this provides the possible opportunity to probe the ability of isotopes ratios to trace the ages of high star forming regions, such as SSCs. The goal of this study is to investigate whether the isotopic variations in SSC regions within NGC253 are correlated with their different ages as derived from stellar population modelling. We have measured abundance ratios of CO, HCN and HCO$^+$ isotopologues in six regions containing SSCs within NGC253 using high spatial resolution (1.6",$\sim 28$pc) data from the ALCHEMI (ALma Comprehensive High-resolution Extragalactic Molecular Inventory) ALMA Large program. We have then analysed these ratios using RADEX radiative transfer modelling, with the parameter space sampled using the nested sampling Monte Carlo algorithm MLFriends. These abundance ratios were then compared to ages predicted in each region via the fitting of observed star formation tracers (such as Br$γ$) to starburst stellar population evolution models. We do not find any significant trend with age for the CO and HCN isotopologue ratios on the timescales for the ages of the SSC* regions observed. The driving factors of these ratios within SSCs could be the Initial Mass Function as well as possibly fractionation effects. To further probe these effects in SSCs over time a larger sample of SSCs must be observed spanning a larger age range. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: 44 pages, 43 Figures, Accepted for Publication to A&A

Journal ref: A&A 686, A31 (2024)

arXiv:2402.08252 [pdf, other]

Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN

Authors: Shiqi Zhang, Zheng Qiu, Daiki Takeuchi, Noboru Harada, Shoji Makino

Abstract: With the rapid development of neural networks in recent years, the ability of various networks to enhance the magnitude spectrum of noisy speech in the single-channel speech enhancement domain has become exceptionally outstanding. However, enhancing the phase spectrum using neural networks is often ineffective, which remains a challenging problem. In this paper, we found that the human ear cannot… ▽ More With the rapid development of neural networks in recent years, the ability of various networks to enhance the magnitude spectrum of noisy speech in the single-channel speech enhancement domain has become exceptionally outstanding. However, enhancing the phase spectrum using neural networks is often ineffective, which remains a challenging problem. In this paper, we found that the human ear cannot sensitively perceive the difference between a precise phase spectrum and a biased phase (BP) spectrum. Therefore, we propose an optimization method of phase reconstruction, allowing freedom on the global-phase bias instead of reconstructing the precise phase spectrum. We applied it to a Conformer-based Metric Generative Adversarial Networks (CMGAN) baseline model, which relaxes the existing constraints of precise phase and gives the neural network a broader learning space. Results show that this method achieves a new state-of-the-art performance without incurring additional computational overhead. △ Less

Submitted 4 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: Accepted by ICASSP 2024 Updated on 2024/06/04 to add one more citation in appendix

arXiv:2401.13204 [pdf, other]

An Extremely Young Protostellar Core, MMS 1/ OMC-3: Episodic Mass Ejection History Traced by the Micro SiO Jet

Authors: Satoko Takahashi, Masahiro N. Machida, Mitsuki Omura, Doug Johnstone, Kazuya Saigo, Naoto Harada, Kohji Tomisaka, Paul T. P. Ho, Luis A. Zapata, Steve Mairs, Gregory J. Herczeg, Kotomi Taniguchi, Yuhua Liu, Asako Sato

Abstract: We present ${\sim}0.2$ arcsec ($\sim$80 au) resolution observations of the CO (2-1) and SiO (5-4) lines made with the Atacama large millimeter/submillimeter array toward an extremely young intermediate-mass protostellar source (t$_{\rm dyn}<$1000 years), MMS 1 located in the Orion Molecular Cloud-3 region. We have successfully imaged a very compact CO molecular outflow associated with MMS 1, havin… ▽ More We present ${\sim}0.2$ arcsec ($\sim$80 au) resolution observations of the CO (2-1) and SiO (5-4) lines made with the Atacama large millimeter/submillimeter array toward an extremely young intermediate-mass protostellar source (t$_{\rm dyn}<$1000 years), MMS 1 located in the Orion Molecular Cloud-3 region. We have successfully imaged a very compact CO molecular outflow associated with MMS 1, having deprojected lobe sizes of $\sim$18000 au (red-shifted lobe) and $\sim$35000 au (blue-shifted lobe). We have also detected an extremely compact ($\lesssim$1000 au) and collimated SiO protostellar jet within the CO outflow. The maximum deprojected jet speed is measured to be as high as 93 km s$^{-1}$. The SiO jet wiggles and displays a chain of knots. Our detection of the molecular outflow and jet is the first direct evidence that MMS 1 already hosts a protostar. The position-velocity diagram obtained from the SiO emission shows two distinct structures: (i) bow-shocks associated with the tips of the outflow, and (ii) a collimated jet, showing the jet velocities linearly increasing with the distance from the driving source. Comparisons between the observations and numerical simulations quantitatively share similarities such as multiple-mass ejection events within the jet and Hubble-like flow associated with each mass ejection event. Finally, while there is a weak flux decline seen in the 850 $μ$m light curve obtained with JCMT/SCUBA 2 toward MMS 1, no dramatic flux change events are detected. This suggests that there has not been a clear burst event within the last 8 years. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: 19 pages, 9 figures, Accepted for publication in ApJ

arXiv:2401.05976 [pdf, other]

The ALMaQUEST Survey XII: Dense Molecular Gas as traced by HCN and HCO$^{+}$ in Green Valley Galaxies

Authors: Lihwai Lin, Hsi-An Pan, Sara L. Ellison, Nanase Harada, Maria J. Jimenez-Donaire, K. Decker French, William M. Baker, Bau-Ching Hsieh, Yusei Koyama, Carlos Lopez-Coba, Tomonari Michiyama, Kate Rowlands, Sebastian F. Sanchez, Mallory Thorp

Abstract: We present ALMA observations of two dense gas tracers, HCN(1-0) and HCO$^{+}$(1-0), for three galaxies in the green valley and two galaxies on the star-forming main sequence with comparable molecular gas fractions as traced by the CO(1-0) emissions, selected from the ALMaQUEST survey. We investigate whether the deficit of molecular gas star formation efficiency (SFE$_{\rm mol}$) that leads to the… ▽ More We present ALMA observations of two dense gas tracers, HCN(1-0) and HCO$^{+}$(1-0), for three galaxies in the green valley and two galaxies on the star-forming main sequence with comparable molecular gas fractions as traced by the CO(1-0) emissions, selected from the ALMaQUEST survey. We investigate whether the deficit of molecular gas star formation efficiency (SFE$_{\rm mol}$) that leads to the low specific star formation rate in these green valley galaxies is due to a lack of dense gas (characterized by the dense gas fraction $f_{\rm dense}$) or the low star formation efficiency of dense gas (SFE$_{\rm dense}$). We find that SFE$_{\rm mol}$ as traced by the CO emissions, when considering both star-forming and retired spaxels together, is tightly correlated with SFE$_{\rm dense}$ and depends only weakly on $f_{\rm dense}$. The specific star formation rate (sSFR) on kpc scales is primarily driven by SFE$_{\rm mol}$ and SFE$_{\rm dense}$, followed by the dependence on $f_{\rm mol}$, and is least correlated with $f_{\rm dense}$ or the dense-to-stellar mass ratio ($R_{\rm dense}$). When compared with other works in the literature, we find that our green valley sample shows lower global SFE$_{\rm mol}$ as well as lower SFE$_{\rm dense}$ while exhibiting similar dense gas fractions when compared to star-forming and starburst galaxies. We conclude that the star formation of the 3 green valley galaxies with a normal abundance of molecular gas is suppressed mainly due to the reduced SFE$_{\rm dense}$ rather than the lack of dense gas. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 20 pages, 13 figures, ApJ accepted

arXiv:2401.02578 [pdf, other]

The ALCHEMI atlas: principal component analysis reveals starburst evolution in NGC 253

Authors: Nanase Harada, David S. Meier, Sergio Martín, Sebastien Muller, Kazushi Sakamoto, Toshiki Saito, Mark D. Gorski, Christian Henkel, Kunihiko Tanaka, Jeffrey G. Mangum, Susanne Aalto, Rebeca Aladro, Mathilde Bouvier, Laura Colzi, Kimberly L. Emig, Rubén Herrero-Illana, Ko-Yun Huang, Kotaro Kohno, Sabine König, Kouichiro Nakanishi, Yuri Nishimura, Shuro Takano, Víctor M. Rivilla, Serena Viti, Yoshimasa Watanabe , et al. (2 additional authors not shown)

Abstract: Molecular lines are powerful diagnostics of the physical and chemical properties of the interstellar medium (ISM). These ISM properties, which affect future star formation, are expected to differ in starburst galaxies from those of more quiescent galaxies. We investigate the ISM properties in the central molecular zone of the nearby starburst galaxy NGC 253 using the ultra-wide millimeter spectral… ▽ More Molecular lines are powerful diagnostics of the physical and chemical properties of the interstellar medium (ISM). These ISM properties, which affect future star formation, are expected to differ in starburst galaxies from those of more quiescent galaxies. We investigate the ISM properties in the central molecular zone of the nearby starburst galaxy NGC 253 using the ultra-wide millimeter spectral scan survey from the ALMA Large Program ALCHEMI. We present an atlas of velocity-integrated images at a 1".6 resolution of 148 unblended transitions from 44 species, including the first extragalactic detection of HCNH$^+$ and the first interferometric images of C$_3$H$^+$, NO, HCS$^+$. We conduct a principal component analysis (PCA) on these images to extract correlated chemical species and to identify key groups of diagnostic transitions. To the best of our knowledge, our dataset is currently the largest astronomical set of molecular lines to which PCA has been applied. The PCA can categorize transitions coming from different physical components in NGC 253 such as i) young starburst tracers characterized by high-excitation transitions of HC$_3$N and complex organic molecules (COMs) versus tracers of on-going star formation (radio recombination lines) and high-excitation transitions of CCH and CN tracing PDRs, ii) tracers of cloud-collision-induced shocks (low-excitation transitions of CH$_3$OH, HNCO, HOCO$^+$, and OCS) versus shocks from star-formation-induced outflows (high-excitation transitions of SiO), as well as iii) outflows showing emission from HOC$^+$, CCH, H$_3$O$^+$, CO isotopologues, HCN, HCO$^+$, CS, and CN. Our findings show these intensities vary with galactic dynamics, star formation activities, and stellar feedback. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Comments: 65 pages, 39 figures. Accepted for publication in ApJS

arXiv:2312.02504 [pdf, other]

Ring Gap Structure around Class I Protostar WL 17

Authors: Ayumu Shoshi, Naoto Harada, Kazuki Tokuda, Yoshihiro Kawasaki, Hayao Yamasaki, Asako Sato, Mitsuki Omura, Masayuki Yamaguchi, Kengo Tachihara, Masahiro N. Machida

Abstract: WL 17 is a Class I object and was considered to have a ring-hole structure. We analyzed the structure around WL 17 to investigate the detailed properties of WL 17. We used ALMA archival data, which have a higher angular resolution than previous observations. We investigated the WL 17 system with the 1.3 mm dust continuum and 12CO and C18O (J = 2-1) line emissions. The dust continuum emission showe… ▽ More WL 17 is a Class I object and was considered to have a ring-hole structure. We analyzed the structure around WL 17 to investigate the detailed properties of WL 17. We used ALMA archival data, which have a higher angular resolution than previous observations. We investigated the WL 17 system with the 1.3 mm dust continuum and 12CO and C18O (J = 2-1) line emissions. The dust continuum emission showed a clear ring structure with inner and outer edges of ~11 and ~21 au, respectively. In addition, we detected an inner disk of < 5 au radius enclosing the central star within the ring, the first observation of this structure. Thus, WL 17 has a ring-gap structure, not a ring-hole structure. We did not detect any marked emission in either the gap or inner disk, indicating that there is no sign of a planet, circumplanetary disk, or binary companion. We identified the base of both blue-shifted and red-shifted outflows based on the 12CO emission, which is clearly associated with the disk around WL 17. The outflow mass ejection rate is ~3.6x10^-7 Msun yr-1 and the dynamical timescale is as short as ~ 10^4 yr. The C18O emission showed that an inhomogeneous infalling envelope, which can induce episodic mass accretion, is distributed in the region within ~1000 au from the central protostar. With these new findings, we can constrain the planet formation and dust growth scenarios in the accretion phase of star formation. △ Less

Submitted 5 December, 2023; originally announced December 2023.

Comments: 22 pages, 9 figures, Accepted for publication in the Astrophysical Journal

arXiv:2311.12106 [pdf, other]

Volume density structure of the NGC253 CMZ through ALCHEMI excitation analysis

Authors: Kunihiko Tanaka, Jeffrey G. Mangum, Serena Viti, Sergio Martin, Nanase Harada, Kazushi Sakamoto, Sebastien Muller, Yuki Yoshimura, Kouichiro Nakanishi, Ruben Herrero Illana, Kimberly L. Emig, S. Muhle, Hiroyuki Kaneko, Tomoka Tosaki, Erica Behrens, Victor M. Rivilla, Laura Colzi, Yuri Nishimura, P. K. Humire, Mathilde Bouvier, Ko-Yun Huang, Joshua Butterworth, David S. Meier, Paul P. van der Werf

Abstract: We present a spatially-resolved excitation analysis for the central molecular zone (CMZ) of the starburst galaxy NGC 253 using the data from the ALMA Large program ALCHEMI, whereby we explore parameters distinguishing NGC 253 from the quiescent Milky Way's Galactic Center (GC). Non-LTE analyses employing a hierarchical Bayesian framework are applied to Band 3-7 transitions from nine molecular spec… ▽ More We present a spatially-resolved excitation analysis for the central molecular zone (CMZ) of the starburst galaxy NGC 253 using the data from the ALMA Large program ALCHEMI, whereby we explore parameters distinguishing NGC 253 from the quiescent Milky Way's Galactic Center (GC). Non-LTE analyses employing a hierarchical Bayesian framework are applied to Band 3-7 transitions from nine molecular species to delineate the position-position-velocity distributions of column density ($N_\mathrm{H_2}$), volume density ($n_\mathrm{H_2}$), and temperature ($T_\mathrm{kin}$) at 27 pc resolution. Two distinct components are detected: a low-density component with $(n_\mathrm{H_2},\ T_\mathrm{kin})\sim(10^{3.3}\ \mathrm{cm}^{-3}, 85 K)$ and a high-density component with $(n_\mathrm{H_2},\ T_\mathrm{kin})\sim (10^{4.4}\ \mathrm{cm}^{-3}, 110\ \mathrm{K})$, separated at $n_\mathrm{H_2}\sim10^{3.8}\ \mathrm{cm}^{-3}$. NGC 253 has $\sim10$ times the high-density gas mass and $\sim3$ times the dense-gas mass fraction of the GC. These properties are consistent with their HCN/CO ratio but cannot alone explain the factor of $\sim30$ difference in their star formation efficiencies (SFEs), contradicting the dense-gas mass to star formation rate scaling law. The $n_\mathrm{H_2}$ histogram toward NGC 253 exhibits a shallow declining slope up to $n_\mathrm{H_2}\sim10^6\ \mathrm{cm}^{-3}$, while that of the GC steeply drops in $n_\mathrm{H_2}\gtrsim10^{4.5}\ \mathrm{cm}^{-3}$ and vanishes at $10^5\ \mathrm{cm}^{-3}$. Their dense-gas mass fraction ratio becomes consistent with their SFEs when the threshold $n_\mathrm{H_2}$ for the dense gas is taken at $\sim 10^{4.2\mbox{-}4.6}\ \mathrm{cm}^{-3}$. The rich abundance of gas above this density range in the NGC 253 CMZ, or its scarcity in the GC, is likely to be the critical difference characterizing the contrasting star formation in the centers of the two galaxies. △ Less

Submitted 20 November, 2023; originally announced November 2023.

Comments: 49 pages, 27 figures, 7 tables, accepted for publication in the Astrophysical Journal

arXiv:2311.01715 [pdf, other]

Acousto-optic reconstruction of exterior sound field based on concentric circle sampling with circular harmonic expansion

Authors: Phuc Duc Nguyen, Kenji Ishikawa, Noboru Harada, Takehiro Moriya

Abstract: Acousto-optic sensing provides an alternative approach to traditional microphone arrays by shedding light on the interaction of light with an acoustic field. Sound field reconstruction is a fascinating and advanced technique used in acousto-optics sensing. Current challenges in sound-field reconstruction methods pertain to scenarios in which the sound source is located within the reconstruction ar… ▽ More Acousto-optic sensing provides an alternative approach to traditional microphone arrays by shedding light on the interaction of light with an acoustic field. Sound field reconstruction is a fascinating and advanced technique used in acousto-optics sensing. Current challenges in sound-field reconstruction methods pertain to scenarios in which the sound source is located within the reconstruction area, known as the exterior problem. Existing reconstruction algorithms, primarily designed for interior scenarios, often exhibit suboptimal performance when applied to exterior cases. This paper introduces a novel technique for exterior sound-field reconstruction. The proposed method leverages concentric circle sampling and a two-dimensional exterior sound-field reconstruction approach based on circular harmonic extensions. To evaluate the efficacy of this approach, both numerical simulations and practical experiments are conducted. The results highlight the superior accuracy of the proposed method when compared to conventional reconstruction methods, all while utilizing a minimal amount of measured projection data. △ Less

Submitted 3 November, 2023; originally announced November 2023.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2310.06055 [pdf, other]

Secondary outflow driven by the protostar Ser-emb 15 in Serpens

Authors: Asako Sato, Kazuki Tokuda, Masahiro N. Machida, Kengo Tachihara, Naoto Harada, Hayao Yamasaki, Shingo Hirano, Toshikazu Onishi, Yuko Matsushita

Abstract: We present the detection of a secondary outflow associated with a Class I source, Ser-emb 15, in the Serpens Molecular Cloud. We reveal two pairs of molecular outflows consisting of three lobes, namely primary and secondary outflows, using ALMA 12CO and SiO line observations at a resolution of 318 au. The secondary outflow is elongated approximately perpendicular to the axis of the primary outflow… ▽ More We present the detection of a secondary outflow associated with a Class I source, Ser-emb 15, in the Serpens Molecular Cloud. We reveal two pairs of molecular outflows consisting of three lobes, namely primary and secondary outflows, using ALMA 12CO and SiO line observations at a resolution of 318 au. The secondary outflow is elongated approximately perpendicular to the axis of the primary outflow in the plane of the sky. We also identify two compact structures, Sources A and B, within an extended structure associated with Ser-emb 15 in the 1.3 mm continuum emission at a resolution of 40 au. The projected sizes of Sources A and B are 137 au and 60 au, respectively. Assuming a dust temperature of 20 K, we estimate the dust mass to be 0.0024 Msun for Source A and 0.00033 Msun for Source B. C18O line data imply the existence of rotational motion around the extended structure, however, cannot resolve rotational motion in Source A and/or B, due to insufficient angular and frequency resolutions. Therefore, we cannot conclude whether Ser-emb 15 is a single or binary system. Thus, either Source A or B could drive the secondary outflow. We discuss two scenarios to explain the driving mechanism of the primary and secondary outflows: the Ser-emb 15 system is (1) a binary system composed of Source A and B or (2) a single star system composed of only Source A. In either case, the system could be a suitable target for investigating the disk and/or binary formation processes in complicated environments. Detecting these outflows should contribute to understanding complex star-forming environments, which may be common in the star-formation processes. △ Less

Submitted 9 October, 2023; originally announced October 2023.

Comments: 21 pages, 10 figures, Accepted for publication in the Astrophysical Journal

arXiv:2309.13821 [pdf, other]

An ALMA-resolved view of 7000 au Protostellar Gas Ring around the Class I source CrA-IRS 2 as a possible sign of magnetic flux advection

Authors: Kazuki Tokuda, Naofumi Fukaya, Kengo Tachihara, Mitsuki Omura, Naoto Harada, Shingo Nozaki, Ayumu Shoshi, Masahiro N. Machida

Abstract: Transferring a significant fraction of the magnetic flux from a dense cloud core is essential in the star formation process. A ring-like structure produced by magnetic flux loss has been predicted theoretically, but no observational identification has been presented. We have performed ALMA observations of the Class I protostar IRS 2 in the Corona Australis star-forming region and resolved a distin… ▽ More Transferring a significant fraction of the magnetic flux from a dense cloud core is essential in the star formation process. A ring-like structure produced by magnetic flux loss has been predicted theoretically, but no observational identification has been presented. We have performed ALMA observations of the Class I protostar IRS 2 in the Corona Australis star-forming region and resolved a distinctive gas ring in the C$^{18}$O ($J$ = 2-1) line emission. The center of this gas ring is $\sim$5,000 au away from the protostar, with a diameter of $\sim$7,000 au. The radial velocity of the gas is $\lesssim1$ km s$^{-1}$ blueshifted from that of the protostar, with a possible expanding feature judged from the velocity-field (moment 1) map and position-velocity diagram. These features are either observationally new or have been discovered but not discussed in depth because they are difficult to explain by well-studied protostellar phenomena such as molecular outflows and accretion streamers. A plausible interpretation is a magnetic wall created by the advection of magnetic flux which is theoretically expected in the Class 0/I phase during star formation as a removal mechanism of magnetic flux. Similar structures reported in the other young stellar sources could likely be candidates formed by the same mechanism, encouraging us to revisit the issue of magnetic flux transport in the early stages of star formation from an observational perspective. △ Less

Submitted 15 October, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

Comments: 9 pages, 3 figures, Accepted for publication in the Astronomical Journal Letters

Journal ref: 10.3847/2041-8213/acfca9

arXiv:2309.05307 [pdf, other]

FOREVER22: Gas and metal outflow from massive galaxies in protocluster regions

Authors: Naoki Harada, Hidenobu Yajima, Makito Abe

Abstract: We study gas and metal outflow from massive galaxies in protocluster regions at $z=3-9$ by using the results of the FOREVER22 simulation project. Our simulations contain massive haloes with $M_{\rm h} \gtrsim 10^{13}~\rm M_{\odot}$, showing high star formation rates of $> 100~\rm M_{\odot}~yr^{-1}$ and hosting supermassive black holes with $M_{\rm BH} \gtrsim 10^{8}~\rm M_{\odot}$. We show that th… ▽ More We study gas and metal outflow from massive galaxies in protocluster regions at $z=3-9$ by using the results of the FOREVER22 simulation project. Our simulations contain massive haloes with $M_{\rm h} \gtrsim 10^{13}~\rm M_{\odot}$, showing high star formation rates of $> 100~\rm M_{\odot}~yr^{-1}$ and hosting supermassive black holes with $M_{\rm BH} \gtrsim 10^{8}~\rm M_{\odot}$. We show that the mass loading factor ($η_{\rm M}$) sensitively depends on the halo mass and it is $η_{\rm M} = 1.2~(9.2)$ for $M_{\rm h} = 10^{13}~(10^{11})~\rm M_{\odot}$. Once the halo mass exceeds $\sim 10^{12.5}~\rm M_{\odot}$, the outflow velocity of the gas rapidly decreases near a virial radius, and the gas returns to a galactic centre finally as a fountain flow. Also, the metal inflow and outflow rates sensitively depend on the halo mass and redshift. At $z=3$, the inflow rate becomes larger than the outflow one if $M_{\rm h} \gtrsim 10^{13.0}~\rm M_{\odot}$. Thus, we suggest that massive haloes cannot be efficient metal enrichment sources beyond virial radii that will be probed in future observations, e.g., studies of metal absorption lines with the Prime Focus Spectrograph on the Subaru telescope. △ Less

Submitted 11 September, 2023; originally announced September 2023.

Comments: 13 pages, 10 figures, accepted for publication in MNRAS

arXiv:2309.02586 [pdf, other]

The Detection of Higher-Order Millimeter Hydrogen Recombination Lines in the Large Magellanic Cloud

Authors: Marta Sewiło, Kazuki Tokuda, Stan E. Kurtz, Steven B. Charnley, Thomas Möller, Jennifer Wiseman, C. -H. Rosie Chen, Remy Indebetouw, Álvaro Sánchez-Monge, Kei E. I. Tanaka, Peter Schilke, Toshikazu Onishi, Naoto Harada

Abstract: We report the first extragalactic detection of the higher-order millimeter hydrogen recombination lines ($Δn>2$). The $γ$-, $ε$-, and $η$-transitions have been detected toward the millimeter continuum source N105-1A in the star-forming region N105 in the Large Magellanic Cloud (LMC) with the Atacama Large Millimeter/submillimeter Array (ALMA). We use the H40$α$ line, the brightest of the detected… ▽ More We report the first extragalactic detection of the higher-order millimeter hydrogen recombination lines ($Δn>2$). The $γ$-, $ε$-, and $η$-transitions have been detected toward the millimeter continuum source N105-1A in the star-forming region N105 in the Large Magellanic Cloud (LMC) with the Atacama Large Millimeter/submillimeter Array (ALMA). We use the H40$α$ line, the brightest of the detected recombination lines (H40$α$, H36$β$, H50$β$, H41$γ$, H57$γ$, H49$ε$, H53$η$, and H54$η$), and/or the 3 mm free-free continuum emission to determine the physical parameters of N105-1A (the electron temperature, emission measure, electron density, and size) and study ionized gas kinematics. We compare the physical properties of N105-1A to a large sample of Galactic compact and ultracompact (UC) H II regions and conclude that N105-1A is similar to the most luminous ($L>10^5$ $L_{\odot}$) UC H II regions in the Galaxy. N105-1A is ionized by an O5.5 V star, it is deeply embedded in its natal molecular clump, and likely associated with a (proto)cluster. We incorporate high-resolution molecular line data including CS, SO, SO$_2$, and CH$_3$OH ($\sim$0.12 pc), and HCO$^{+}$ and CO ($\sim$0.087 pc) to explore the molecular environment of N105-1A. Based on the CO data, we find evidence for a cloud-cloud collision that likely triggered star formation in the region. We find no clear outflow signatures, but the presence of filaments and streamers indicates on-going accretion onto the clump hosting the UC H II region. Sulfur chemistry in N105-1A is consistent with the accretion shock model predictions. △ Less

Submitted 5 September, 2023; originally announced September 2023.

Comments: 51 pages, 30 figures, 2 tables (including appendices); accepted for publication in The Astrophysical Journal (ApJ)

arXiv:2308.11923 [pdf, other]

Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement

Authors: Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada, Kunio Kashino

Abstract: We proposed Audio Difference Captioning (ADC) as a new extension task of audio captioning for describing the semantic differences between input pairs of similar but slightly different audio clips. The ADC solves the problem that conventional audio captioning sometimes generates similar captions for similar audio clips, failing to describe the difference in content. We also propose a cross-attentio… ▽ More We proposed Audio Difference Captioning (ADC) as a new extension task of audio captioning for describing the semantic differences between input pairs of similar but slightly different audio clips. The ADC solves the problem that conventional audio captioning sometimes generates similar captions for similar audio clips, failing to describe the difference in content. We also propose a cross-attention-concentrated transformer encoder to extract differences by comparing a pair of audio clips and a similarity-discrepancy disentanglement to emphasize the difference in the latent space. To evaluate the proposed methods, we built an AudioDiffCaps dataset consisting of pairs of similar but slightly different audio clips with human-annotated descriptions of their differences. The experiment with the AudioDiffCaps dataset showed that the proposed methods solve the ADC task effectively and improve the attention weights to extract the difference by visualizing them in the transformer encoder. △ Less

Submitted 23 August, 2023; originally announced August 2023.

Comments: Accepted to DCASE2023 Workshop

arXiv:2308.05568 [pdf, other]

doi 10.3847/1538-4357/acefb7

An ALMA Glimpse of Dense Molecular Filaments Associated with High-mass Protostellar Systems in the Large Magellanic Cloud

Authors: Kazuki Tokuda, Naoto Harada, Kei E. I. Tanaka, Tsuyoshi Inoue, Takashi Shimonishi, Yichen Zhang, Marta Sewiło, Yuri Kunitoshi, Ayu Konishi, Yasuo Fukui, Akiko Kawamura, Toshikazu Onishi, Masahiro N. Machida

Abstract: Recent millimeter/sub-millimeter facilities have revealed the physical properties of filamentary molecular clouds in relation to high-mass star formation. A uniform survey of the nearest, face-on star-forming galaxy, the Large Magellanic Cloud (LMC), complements the Galactic knowledge. We present ALMA survey data with a spatial resolution of $\sim$0.1 pc in the 0.87 mm continuum and HCO$^{+}$(4-3)… ▽ More Recent millimeter/sub-millimeter facilities have revealed the physical properties of filamentary molecular clouds in relation to high-mass star formation. A uniform survey of the nearest, face-on star-forming galaxy, the Large Magellanic Cloud (LMC), complements the Galactic knowledge. We present ALMA survey data with a spatial resolution of $\sim$0.1 pc in the 0.87 mm continuum and HCO$^{+}$(4-3) emission toward 30 protostellar objects with luminosities of 10$^4$-10$^{5.5}$ $L_{\odot}$ in the LMC. The spatial distributions of the HCO$^{+}$(4-3) line and thermal dust emission are well correlated, indicating that the line effectively traces dense, filamentary gas with an H$_2$ volume density of $\gtrsim$10$^5$ cm$^{-3}$ and a line mass of $\sim$10$^3$-10$^{4}$ $M_{\odot}$ pc$^{-1}$. Furthermore, we obtain an increase in the velocity linewidths of filamentary clouds, which follows a power-law dependence on their H$_2$ column densities with an exponent of $\sim$0.5. This trend is consistent with observations toward filamentary clouds in nearby star-forming regions withiin $ \lesssim$1 kpc from us and suggests enhanced internal turbulence within the filaments owing to surrounding gas accretion. Among the 30 sources, we find that 14 are associated with hub-filamentary structures, and these complex structures predominantly appear in protostellar luminosities exceeding $\sim$5 $\times$10$^4$ $L_{\odot}$. The hub-filament systems tend to appear in the latest stages of their natal cloud evolution, often linked to prominent H$\;${\sc ii} regions and numerous stellar clusters. Our preliminary statistics suggest that the massive filaments accompanied by hub-type complex features may be a necessary intermediate product in forming extremely luminous high-mass stellar systems capable of ultimately dispersing the parent cloud. △ Less

Submitted 10 August, 2023; originally announced August 2023.

Comments: 21 pages, 8 figures, 4 tables, accepted for publication in ApJ

arXiv:2307.02320 [pdf, ps, other]

Molecular Abundance of the Circumnuclear Region Surrounding an Active Galactic Nucleus in NGC 1068 based on Imaging Line Survey in the 3-mm Band with ALMA

Authors: Taku Nakajima, Shuro Takano, Tomoka Tosaki, Akio Taniguchi, Nanase Harada, Toshiki Saito, Masatoshi Imanishi, Yuri Nishimura, Takuma Izumi, Yoichi Tamura, Kotaro Kohno, Eric Herbst

Abstract: We present an imaging molecular line survey in the 3-mm band (85-114 GHz) focused on one of the nearest galaxies with an active galactic nucleus (AGN), NGC 1068, based on observations taken with the Atacama Large Millimeter/submillimeter Array (ALMA). Distributions of 23 molecular transitions are obtained in the central ~3 kpc region, including both the circumnuclear disk (CND) and starburst ring… ▽ More We present an imaging molecular line survey in the 3-mm band (85-114 GHz) focused on one of the nearest galaxies with an active galactic nucleus (AGN), NGC 1068, based on observations taken with the Atacama Large Millimeter/submillimeter Array (ALMA). Distributions of 23 molecular transitions are obtained in the central ~3 kpc region, including both the circumnuclear disk (CND) and starburst ring (SBR) with 60 and 350 pc resolution. The column densities and relative abundances of all the detected molecules are estimated under the assumption of local thermodynamic equilibrium in the CND and SBR. Then, we discuss the physical and chemical effects of the AGN on molecular abundance corresponding to the observation scale. We found that H13CN, SiO, HCN, and H13CO+ are abundant in the CND relative to the SBR. In contrast, 13CO is more abundant in the SBR. Based on the calculated column density ratios of N(HCN)/N(HCO+), N(HCN)/N(CN), and other molecular distributions, we conclude that the enhancement of HCN in the CND may be due to high-temperature environments resulting from strong shocks, which are traced by the SiO emission. Moreover, the abundance of CN in the CND is significantly lower than the expected value of the model calculations in the region affected by strong radiation. The expected strong X-ray irradiation from the AGN has a relatively lower impact on the molecular abundance in the CND than mechanical feedback. △ Less

Submitted 5 July, 2023; originally announced July 2023.

Comments: 33 pages, 20 figures, 4 tables, accepted for publication in ApJ

arXiv:2305.14079 [pdf, other]

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Abstract: Self-supervised learning general-purpose audio representations have demonstrated high performance in a variety of tasks. Although they can be optimized for application by fine-tuning, even higher performance can be expected if they can be specialized to pre-train for an application. This paper explores the challenges and solutions in specializing general-purpose audio representations for a specifi… ▽ More Self-supervised learning general-purpose audio representations have demonstrated high performance in a variety of tasks. Although they can be optimized for application by fine-tuning, even higher performance can be expected if they can be specialized to pre-train for an application. This paper explores the challenges and solutions in specializing general-purpose audio representations for a specific application using speech, a highly demanding field, as an example. We enhance Masked Modeling Duo (M2D), a general-purpose model, to close the performance gap with state-of-the-art (SOTA) speech models. To do so, we propose a new task, denoising distillation, to learn from fine-grained clustered features, and M2D for Speech (M2D-S), which jointly learns the denoising distillation task and M2D masked prediction task. Experimental results show that M2D-S performs comparably to or outperforms SOTA speech models on the SUPERB benchmark, demonstrating that M2D can specialize in a demanding field. Our code is available at: https://github.com/nttcslab/m2d/tree/master/speech △ Less

Submitted 3 August, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: Interspeech 2023; 5+2 pages, 2 figures, 6+6 tables, Code: https://github.com/nttcslab/m2d/tree/master/speech

MSC Class: 68T07

arXiv:2305.07828 [pdf, other]

Description and Discussion on DCASE 2023 Challenge Task 2: First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Authors: Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Ryo Tanabe, Takashi Endo, Yohei Kawaguchi

Abstract: We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2023 Challenge Task 2: ``First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring''. The main goal is to enable rapid deployment of ASD systems for new kinds of machines without the need for hyperparameter tuning. In the past ASD tasks, developed methods tuned h… ▽ More We present the task description of the Detection and Classification of Acoustic Scenes and Events (DCASE) 2023 Challenge Task 2: ``First-shot unsupervised anomalous sound detection (ASD) for machine condition monitoring''. The main goal is to enable rapid deployment of ASD systems for new kinds of machines without the need for hyperparameter tuning. In the past ASD tasks, developed methods tuned hyperparameters for each machine type, as the development and evaluation datasets had the same machine types. However, collecting normal and anomalous data as the development dataset can be infeasible in practice. In 2023 Task 2, we focus on solving the first-shot problem, which is the challenge of training a model on a completely novel machine type. Specifically, (i) each machine type has only one section (a subset of machine type) and (ii) machine types in the development and evaluation datasets are completely different. Analysis of 86 submissions from 23 teams revealed that the keys to outperform baselines were: 1) sampling techniques for dealing with class imbalances across different domains and attributes, 2) generation of synthetic samples for robust detection, and 3) use of multiple large pre-trained models to extract meaningful embeddings for the anomaly detector. △ Less

Submitted 2 November, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

Comments: anomaly detection, acoustic condition monitoring, domain shift, first-shot problem, DCASE Challenge, Accepted in DCASE2023 Workshop

arXiv:2304.14923 [pdf, ps, other]

Deep sound-field denoiser: optically-measured sound-field denoising using deep neural network

Authors: Kenji Ishikawa, Daiki Takeuchi, Noboru Harada, Takehiro Moriya

Abstract: This paper proposes a deep sound-field denoiser, a deep neural network (DNN) based denoising of optically measured sound-field images. Sound-field imaging using optical methods has gained considerable attention due to its ability to achieve high-spatial-resolution imaging of acoustic phenomena that conventional acoustic sensors cannot accomplish. However, the optically measured sound-field images… ▽ More This paper proposes a deep sound-field denoiser, a deep neural network (DNN) based denoising of optically measured sound-field images. Sound-field imaging using optical methods has gained considerable attention due to its ability to achieve high-spatial-resolution imaging of acoustic phenomena that conventional acoustic sensors cannot accomplish. However, the optically measured sound-field images are often heavily contaminated by noise because of the low sensitivity of optical interferometric measurements to airborne sound. Here, we propose a DNN-based sound-field denoising method. Time-varying sound-field image sequences are decomposed into harmonic complex-amplitude images by using a time-directional Fourier transform. The complex images are converted into two-channel images consisting of real and imaginary parts and denoised by a nonlinear-activation-free network. The network is trained on a sound-field dataset obtained from numerical acoustic simulations with randomized parameters. We compared the method with conventional ones, such as image filters, a spatiotemporal filter, and other DNN architectures, on numerical and experimental data. The experimental data were measured by parallel phase-shifting interferometry and holographic speckle interferometry. The proposed deep sound-field denoiser significantly outperformed the conventional methods on both the numerical and experimental data. Code is available on GitHub: https://github.com/nttcslab/deep-sound-field-denoiser. △ Less

Submitted 21 September, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

Comments: 16 pages, 10 figures, 2 tables

arXiv:2303.12685 [pdf, other]

doi 10.1051/0004-6361/202245659

Reconstructing the shock history in the CMZ of NGC 253 with ALCHEMI

Authors: K. -Y. Huang, S. Viti, J. Holdship, J. G. Mangum, S. Martín, N. Harada, S. Muller, K. Sakamoto, K. Tanaka, Y. Yoshimura, R. Herrero-Illana, D. S. Meier, E. Behrens, P. P. van der Werf, C. Henkel, S. García-Burillo, V. M. Rivilla, K. L. Emig, L. Colzi, P. K. Humire, R. Aladro, M. Bouvier

Abstract: HNCO and SiO are well known shock tracers and have been observed in nearby galaxies, including the nearby (D=3.5 Mpc) starburst galaxy NGC 253. The simultaneous detection of these two species in regions where the star formation rate is high may be used to study the shock history of the gas. We perform a multi-line molecular study using these two shock tracers (SiO and HNCO) with the aim of charact… ▽ More HNCO and SiO are well known shock tracers and have been observed in nearby galaxies, including the nearby (D=3.5 Mpc) starburst galaxy NGC 253. The simultaneous detection of these two species in regions where the star formation rate is high may be used to study the shock history of the gas. We perform a multi-line molecular study using these two shock tracers (SiO and HNCO) with the aim of characterizing the gas properties. We also explore the possibility of reconstructing the shock history in NGC 253's Central Molecular Zone (CMZ). Six SiO transitions and eleven HNCO transitions were imaged at high resolution $1''.6$ (28 pc) with the Atacama Large Millimeter/submillimeter Array (ALMA) as part of the ALCHEMI Large Programme. Both non-LTE radiative transfer analysis and chemical modelling were performed in order to characterize the gas properties, and to investigate the chemical origin of the emission. The non-LTE radiative transfer analysis coupled with Bayesian inference shows clear evidence that the gas traced by SiO has different densities and temperatures than that traced by HNCO, with an indication that shocks are needed to produce both species. Chemical modelling further confirms such a scenario and suggests that fast and slow shocks are responsible for SiO and HNCO production, respectively, in most GMCs. We are also able to infer the physical characteristics of the shocks traced by SiO and HNCO for each GMC. Radiative transfer and chemical analysis of the SiO and HNCO in the CMZ of NGC 253 reveal a complex picture whereby most of the GMCs are subjected to shocks. We speculate on the possible shock scenarios responsible for the observed emission and provide potential history and timescales for each shock scenario. Higher spatial resolution observations of these two species are required in order to quantitatively differentiate between scenarios. △ Less

Submitted 22 March, 2023; originally announced March 2023.

Journal ref: A&A 675, A151 (2023)

arXiv:2303.12108 [pdf, other]

doi 10.3847/1538-4357/acc65e

Diverse Molecular Structures Across The Whole Star-Forming Disk of M83: High fidelity Imaging at 40pc Resolution

Authors: ** Koda, Akihiko Hirota, Fumi Egusa, Kazushi Sakamoto, Tsuyoshi Sawada, Mark Heyer, Junichi Baba, Samuel Boissier, Daniela Calzetti, Jennifer Donovan Meyer, Bruce G. Elmegreen, Armando Gil de Paz, Nanase Harada, Luis C. Ho, Masato I. N. Kobayashi, Nario Kuno, Amanda M Lee, Barry F. Madore, Fumiya Maeda, Sergio Martin, Kazuyuki Muraoka, Kouichiro Nakanishi, Sachiko Onodera, Jorge L. Pineda, Nick Scoville , et al. (1 additional authors not shown)

Abstract: We present high-fidelity CO(1-0) imaging of molecular gas across the full star-forming disk of M83, using ALMA's 12m, 7m, and TP arrays and the MIRIAD package. The data have a mass sensitivity and resolution of 10^4Msun and 40 pc. The full disk coverage shows that the characteristics of molecular gas change radially from the center to outer disk. The molecular gas distribution shows coherent large… ▽ More We present high-fidelity CO(1-0) imaging of molecular gas across the full star-forming disk of M83, using ALMA's 12m, 7m, and TP arrays and the MIRIAD package. The data have a mass sensitivity and resolution of 10^4Msun and 40 pc. The full disk coverage shows that the characteristics of molecular gas change radially from the center to outer disk. The molecular gas distribution shows coherent large-scale structures in the inner part, including the central concentration, bar offset ridges, and prominent molecular spiral arms. In the outer disk, the spiral arms appear less spatially coherent, and even flocculent. Massive filamentary gas concentrations are abundant even in the interarm regions. Building up these structures in the interarm regions would require a very long time (~>100Myr). Instead, they must have formed within stellar spiral arms and been released into the interarm regions. For such structures to survive through the dynamical processes, the lifetimes of these structures and their constituent molecules and molecular clouds must be long (~>100Myr). These interarm structures host little or no star formation traced by Halpha. The new map also shows extended CO emission, which likely represents an ensemble of unresolved molecular clouds. △ Less

Submitted 21 March, 2023; originally announced March 2023.

Comments: Accepted for publication in ApJ

arXiv:2303.00455 [pdf, other]

First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline

Authors: Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi, Masahiro Yasuda

Abstract: This paper provides a baseline system for First-shot-compliant unsupervised anomaly detection (ASD) for machine condition monitoring. First-shot ASD does not allow systems to do machine-type dependent hyperparameter tuning or tool ensembling based on the performance metric calculated with the grand truth. To show benchmark performance for First-shot ASD, this paper proposes an anomaly sound detect… ▽ More This paper provides a baseline system for First-shot-compliant unsupervised anomaly detection (ASD) for machine condition monitoring. First-shot ASD does not allow systems to do machine-type dependent hyperparameter tuning or tool ensembling based on the performance metric calculated with the grand truth. To show benchmark performance for First-shot ASD, this paper proposes an anomaly sound detection system that works on the domain generalization task in the Detection and Classification of Acoustic Scenes and Events (DCASE) 2022 Challenge Task 2: "Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Technique" while complying with the First-shot requirements introduced in the DCASE 2023 Challenge Task 2 (DCASE2023T2). A simple autoencoder based implementation combined with selective Mahalanobis metric is implemented as a baseline system. The performance evaluation is conducted to set the target benchmark for the forthcoming DCASE2023T2. Source code of the baseline system will be available on GitHub: https://github.com/nttcslab/dcase2023_task2_baseline_ae . △ Less

Submitted 1 March, 2023; originally announced March 2023.

Comments: 5 pages, 2 figures

arXiv:2302.01612 [pdf, other]

doi 10.3847/1538-4357/acb930

Crescent-Shaped Molecular Outflow from the Intermediate-mass Protostar DK Cha Revealed by ALMA

Authors: Naoto Harada, Kazuki Tokuda, Hayao Yamasaki, Asako Sato, Mitsuki Omura, Shingo Hirano, Toshikazu Onishi, Kengo Tachihara, Masahiro N. Machida

Abstract: We report on an Atacama Large Millimeter/submillimeter Array (ALMA) study of the Class I or II intermediate-mass protostar DK Cha in the Chamaeleon II region. The 12CO (J=2-1) images have an angular resolution of ~1'' (~250 au) and show high-velocity blueshifted (>70 km s-1) and redshifted (>50 km s-1) emissions which have 3000 au scale crescent-shaped structures around the protostellar disk trace… ▽ More We report on an Atacama Large Millimeter/submillimeter Array (ALMA) study of the Class I or II intermediate-mass protostar DK Cha in the Chamaeleon II region. The 12CO (J=2-1) images have an angular resolution of ~1'' (~250 au) and show high-velocity blueshifted (>70 km s-1) and redshifted (>50 km s-1) emissions which have 3000 au scale crescent-shaped structures around the protostellar disk traced in the 1.3mm continuum. Because the high-velocity components of the CO emission are associated with the protostar, we concluded that the emission traces the pole-on outflow. The blueshifted outflow lobe has a clear layered velocity gradient with a higher velocity component located on the inner side of the crescent shape, which can be explained by a model of an outflow with a higher velocity in the inner radii. Based on the directly driven outflow scenario, we estimated the driving radii from the observed outflow velocities and found that the driving region extends over two orders of magnitude. The 13CO emission traces a complex envelope structure with arc-like substructures with lengths of ~1000au. We identified the arc-like structures as streamers because they appear to be connected to a rotating infalling envelope. DK Cha is useful for understanding characteristics that are visible by looking at nearly face-on configurations of young protostellar systems, providing an alternative perspective for studying the star-formation process. △ Less

Submitted 3 February, 2023; originally announced February 2023.

Comments: Accepted for publication in ApJ. 12 pages, 5 figures

arXiv:2211.08988 [pdf, other]

doi 10.1093/pasj/psac094

Twisted magnetic field in star formation processes of L1521 F revealed by submillimeter dual band polarimetry using James Clerk Maxwell Telescope

Authors: Sakiko Fukaya, Hiroko Shinnaga, Ray S. Furuya, Kohji Tomisaka, Masahiro N. Machida, Naoto Harada

Abstract: Understanding the initial conditions of star formation requires both observational studies and theoretical works taking into account the magnetic field, which plays an important role in star formation processes. Herein, we study the young nearby dense cloud core L1521 F ($n$(H$_2$) $\sim 10^{4-6}$ cm$^{-3}$) in the Taurus Molecular Cloud. This dense core hosts a 0.2 $M_\odot$ protostar, categorize… ▽ More Understanding the initial conditions of star formation requires both observational studies and theoretical works taking into account the magnetic field, which plays an important role in star formation processes. Herein, we study the young nearby dense cloud core L1521 F ($n$(H$_2$) $\sim 10^{4-6}$ cm$^{-3}$) in the Taurus Molecular Cloud. This dense core hosts a 0.2 $M_\odot$ protostar, categorized as a Very Low Luminosity Objects with complex velocity structures, particularly in the vicinity of the protostar. To trace the magnetic field within the dense core, we conducted high sensitivity submillimeter polarimetry of the dust continuum at $λ$= 850 $μ$m and 450 $μ$m using the POL-2 polarimeter situated in front of the SCUBA-2 submillimeter bolometer camera on James Clerk Maxwell Tetescope. This was compared with millimeter polarimetry taken at $λ$= 3.3 mm with ALMA. The magnetic field was detected at $λ$= 850 $μ$m in the peripheral region, which is threaded in a north-south direction, while the central region traced at $λ$= 450 $μ$m shows a magnetic field with an east-west direction, i.e., orthogonal to that of the peripheral region. Magnetic field strengths are estimated to be $\sim$70 $μ$G and 200 $μ$G in the peripheral- and central-regions, respectively, using the Davis-Chandrasekhar-Fermi method. The resulting mass-to-flux ratio of 3 times larger than that of magnetically critical state for both regions indicates that L1521 F is magnetically supercritical, i.e., gravitational forces dominate over magnetic turbulence forces. Combining observational data with MHD simulations, detailed parameters of the morphological properties of this puzzling object are derived for the first time. △ Less

Submitted 16 November, 2022; originally announced November 2022.

Comments: 9 pages, 7 figures

arXiv:2210.14648 [pdf, other]

Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Abstract: Masked Autoencoders is a simple yet powerful self-supervised learning method. However, it learns representations indirectly by reconstructing masked input patches. Several methods learn representations directly by predicting representations of masked patches; however, we think using all patches to encode training signal representations is suboptimal. We propose a new method, Masked Modeling Duo (M… ▽ More Masked Autoencoders is a simple yet powerful self-supervised learning method. However, it learns representations indirectly by reconstructing masked input patches. Several methods learn representations directly by predicting representations of masked patches; however, we think using all patches to encode training signal representations is suboptimal. We propose a new method, Masked Modeling Duo (M2D), that learns representations directly while obtaining training signals using only masked patches. In the M2D, the online network encodes visible patches and predicts masked patch representations, and the target network, a momentum encoder, encodes masked patches. To better predict target representations, the online network should model the input well, while the target network should also model it well to agree with online predictions. Then the learned representations should better model the input. We validated the M2D by learning general-purpose audio representations, and M2D set new state-of-the-art performance on tasks such as UrbanSound8K, VoxCeleb1, AudioSet20K, GTZAN, and SpeechCommandsV2. We additionally validate the effectiveness of M2D for images using ImageNet-1K in the appendix. △ Less

Submitted 2 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

Comments: 6 pages, 3 figures, and 6 tables. To appear at ICASSP2023

MSC Class: 68T07

arXiv:2209.06244 [pdf, other]

doi 10.3847/1538-4357/ac91ce

Tracing Interstellar Heating: An ALCHEMI Measurement of the HCN Isomers in NGC 253

Authors: Erica Behrens, Jeffrey G. Mangum, Jonathan Holdship, Serena Viti, Nanase Harada, Sergio Martin, Kazushi Sakamoto, Sebastien Muller, Kunihiko Tanaka, Kouichiro Nakanishi, Ruben Herrero-Illana, Yuki Yoshimura, Rebeca Aladro, Laura Colzi, Kimberly L. Emig, Christian Henkel, Ko-Yun Huang, P. K. Humire, David S. Meier, Victor M. Rivilla

Abstract: We analyze HCN and HNC emission in the nearby starburst galaxy NGC 253 to investigate its effectiveness in tracing heating processes associated with star formation. This study uses multiple HCN and HNC rotational transitions observed using ALMA via the ALCHEMI Large Program. To understand the conditions and associated heating mechanisms within NGC 253's dense gas, we employ Bayesian nested samplin… ▽ More We analyze HCN and HNC emission in the nearby starburst galaxy NGC 253 to investigate its effectiveness in tracing heating processes associated with star formation. This study uses multiple HCN and HNC rotational transitions observed using ALMA via the ALCHEMI Large Program. To understand the conditions and associated heating mechanisms within NGC 253's dense gas, we employ Bayesian nested sampling techniques applied to chemical and radiative transfer models which are constrained using our HCN and HNC measurements. We find that the volume density $n_{\text{H}_{2}}$ and cosmic ray ionization rate (CRIR) $ζ$ are enhanced by about an order of magnitude in the galaxy's central regions as compared to those further from the nucleus. In NGC 253's central GMCs, where observed HCN/HNC abundance ratios are lowest, $n \sim 10^{5.5}$ cm$^{-3}$ and $ζ\sim 10^{-12}$ s$^{-1}$ (greater than $10^4$ times the average Galactic rate). We find a positive correlation in the association of both density and CRIR with the number of star formation-related heating sources (supernova remnants, HII regions, and super hot cores) located in each GMC, as well as a correlation between CRIRs and supernova rates. Additionally, we see an anticorrelation between the HCN/HNC ratio and CRIR, indicating that this ratio will be lower in regions where $ζ$ is higher. Though previous studies suggested HCN and HNC may reveal strong mechanical heating processes in NGC 253's CMZ, we find cosmic ray heating dominates the heating budget, and mechanical heating does not play a significant role in the HCN and HNC chemistry. △ Less

Submitted 8 November, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

Comments: 33 pages, 23 figures, accepted for publication by the Astrophysical Journal

arXiv:2208.13983 [pdf, other]

doi 10.3847/1538-4357/ac8dfc

ALCHEMI finds a "shocking" carbon footprint in the starburst galaxy NGC~253

Authors: Nanase Harada, Sergio Martin, Jeff Mangum, Kazushi Sakamoto, Sebastian Muller, Victor Rivilla, Christian Henkel, David Meier, Laura Colzi, Mitsuyoshi Yamagishi, Kunihiko Tanaka, Kouichiro Nakanishi, Ruben Herrero-Illana, Yuki Yoshimura, Pedro Humire, Rebeca Aladro, Paul van der Werf, Kim Emig

Abstract: Centers of starburst galaxies may be characterized by a specific gas and ice chemistry due to their gas dynamics and the presence of various ice desorption mechanisms. This may result in a peculiar observable composition. We analyze abundances of $CO_2$, a reliable tracer of ice chemistry, from data collected as part of the ALMA large program ALCHEMI, a wide-frequency spectral scan toward the star… ▽ More Centers of starburst galaxies may be characterized by a specific gas and ice chemistry due to their gas dynamics and the presence of various ice desorption mechanisms. This may result in a peculiar observable composition. We analyze abundances of $CO_2$, a reliable tracer of ice chemistry, from data collected as part of the ALMA large program ALCHEMI, a wide-frequency spectral scan toward the starburst galaxy NGC~253 with an angular resolution of 1.6$''$. We constrain the $CO_2$ abundances in the gas phase using its protonated form $HOCO^+$. The distribution of $HOCO^+$ is similar to that of methanol, which suggests that $HOCO^+$ is indeed produced from the protonation of $CO_2$ sublimated from ice. The $HOCO^+$ fractional abundances are found to be $(1-2)\times10^{-9}$ at the outer part of the central molecular zone (CMZ), while they are lower ($\sim10^{-10}$) near the kinematic center. This peak fractional abundance at the outer CMZ is comparable to that in the Milky Way CMZ, and orders of magnitude higher than that in Galactic disk star-forming regions. From the range of $HOCO^+/CO_2$ ratios suggested from chemical models, the gas-phase $CO_2$ fractional abundance is estimated to be $(1-20)\times10^{-7}$ at the outer CMZ, and orders of magnitude lower near the center. We estimate the $CO_2$ ice fractional abundances at the outer CMZ to be $(2-5)\times10^{-6}$ from the literature. A comparison between the ice and gas $CO_2$ abundances suggests an efficient sublimation mechanism. This sublimation is attributed to large-scale shocks at the orbital intersections of the bar and CMZ. △ Less

Submitted 30 August, 2022; originally announced August 2022.

Comments: 22 pages, 9 figures. Accepted for publication in the Astrophysical Journal

arXiv:2207.11964 [pdf, other]

doi 10.1145/3503161.3548397

ConceptBeam: Concept Driven Target Speech Extraction

Authors: Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino

Abstract: We propose a novel framework for target speech extraction based on semantic information, called ConceptBeam. Target speech extraction means extracting the speech of a target speaker in a mixture. Typical approaches have been exploiting properties of audio signals, such as harmonic structure and direction of arrival. In contrast, ConceptBeam tackles the problem with semantic clues. Specifically, we… ▽ More We propose a novel framework for target speech extraction based on semantic information, called ConceptBeam. Target speech extraction means extracting the speech of a target speaker in a mixture. Typical approaches have been exploiting properties of audio signals, such as harmonic structure and direction of arrival. In contrast, ConceptBeam tackles the problem with semantic clues. Specifically, we extract the speech of speakers speaking about a concept, i.e., a topic of interest, using a concept specifier such as an image or speech. Solving this novel problem would open the door to innovative applications such as listening systems that focus on a particular topic discussed in a conversation. Unlike keywords, concepts are abstract notions, making it challenging to directly represent a target concept. In our scheme, a concept is encoded as a semantic embedding by map** the concept specifier to a shared embedding space. This modality-independent space can be built by means of deep metric learning using paired data consisting of images and their spoken captions. We use it to bridge modality-dependent information, i.e., the speech segments in the mixture, and the specified, modality-independent concept. As a proof of our scheme, we performed experiments using a set of images associated with spoken captions. That is, we generated speech mixtures from these spoken captions and used the images or speech signals as the concept specifiers. We then extracted the target speech using the acoustic characteristics of the identified segments. We compare ConceptBeam with two methods: one based on keywords obtained from recognition systems and another based on sound source separation. We show that ConceptBeam clearly outperforms the baseline methods and effectively extracts speech based on the semantic representation. △ Less

Submitted 25 July, 2022; originally announced July 2022.

Comments: Accepted to ACM Multimedia 2022

arXiv:2207.09732 [pdf, other]

Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval

Authors: Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada, Kunio Kashino

Abstract: The amount of audio data available on public websites is growing rapidly, and an efficient mechanism for accessing the desired data is necessary. We propose a content-based audio retrieval method that can retrieve a target audio that is similar to but slightly different from the query audio by introducing auxiliary textual information which describes the difference between the query and target aud… ▽ More The amount of audio data available on public websites is growing rapidly, and an efficient mechanism for accessing the desired data is necessary. We propose a content-based audio retrieval method that can retrieve a target audio that is similar to but slightly different from the query audio by introducing auxiliary textual information which describes the difference between the query and target audio. While the range of conventional content-based audio retrieval is limited to audio that is similar to the query audio, the proposed method can adjust the retrieval range by adding an embedding of the auxiliary text query-modifier to the embedding of the query sample audio in a shared latent space. To evaluate our method, we built a dataset comprising two different audio clips and the text that describes the difference. The experimental results show that the proposed method retrieves the paired audio more accurately than the baseline. We also confirmed based on visualization that the proposed method obtains the shared latent space in which the audio difference and the corresponding text are represented as similar embedding vectors. △ Less

Submitted 20 July, 2022; originally announced July 2022.

Comments: Accepted to Interspeech 2022

arXiv:2207.08396 [pdf, ps, other]

doi 10.3847/2041-8213/ac81c1

The First Detection of a Protostellar CO Outflow in the Small Magellanic Cloud with ALMA

Authors: Kazuki Tokuda, Sarolta Zahorecz, Yuri Kunitoshi, Kosuke Higashino, Kei E. I. Tanaka, Ayu Konishi, Taisei Suzuki, Naoya Kitano, Naoto Harada, Takashi Shimonishi, Naslim Neelamkodan, Yasuo Fukui, Akiko Kawamura, Toshikazu Onishi, Masahiro N. Machida

Abstract: Protostellar outflows are one of the most outstanding features of star formation. Observational studies over the last several decades have successfully demonstrated that outflows are ubiquitously associated with low- and high-mass protostars in the solar-metallicity Galactic conditions. However, the environmental dependence of protostellar outflow properties is still poorly understood, particularl… ▽ More Protostellar outflows are one of the most outstanding features of star formation. Observational studies over the last several decades have successfully demonstrated that outflows are ubiquitously associated with low- and high-mass protostars in the solar-metallicity Galactic conditions. However, the environmental dependence of protostellar outflow properties is still poorly understood, particularly in the low-metallicity regime. Here we report the first detection of a molecular outflow in the Small Magellanic Cloud with 0.2 $Z_{\odot}$, using Atacama Large Millimeter/submillimeter Array observations at a spatial resolution of 0.1 pc toward the massive protostar Y246. The bipolar outflow is nicely illustrated by high-velocity wings of CO(3-2) emission at $\gtrsim$15 km s$^{-1}$. The evaluated properties of the outflow (momentum, mechanical force, etc.) are consistent with those of the Galactic counterparts. Our results suggest that the molecular outflows, i.e., the guidepost of the disk accretion at the small scale, might be universally associated with protostars across the metallicity range of $\sim$0.2-1 $Z_{\odot}$. △ Less

Submitted 7 August, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

Comments: 7 pages, 2 figures, Accepted for publication in ApJL

arXiv:2207.06448 [pdf, other]

doi 10.3847/1538-4357/ac80ff

AGN-driven Cold Gas Outflow of NGC 1068 Characterized by Dissociation-Sensitive Molecules

Authors: Toshiki Saito, Shuro Takano, Nanase Harada, Taku Nakajima, Eva Schinnerer, Daizhong Liu, Akio Taniguchi, Takuma Izumi, Yumi Watanabe, Kazuharu Bamba, Kotaro Kohno, Yuri Nishimura, Sophia Stuber, Tomoka Tosaki

Abstract: Recent developments in (sub-)millimeter facilities have drastically changed the amount of information obtained from extragalactic spectral scans. In this paper, we present a feature extraction technique using principal component analysis (PCA) applied to arcsecond-resolution (1.0-2.0 arcsec = 72-144 pc) spectral scan datasets for the nearby type-2 Seyfert galaxy, NGC 1068, using Band 3 of the Atac… ▽ More Recent developments in (sub-)millimeter facilities have drastically changed the amount of information obtained from extragalactic spectral scans. In this paper, we present a feature extraction technique using principal component analysis (PCA) applied to arcsecond-resolution (1.0-2.0 arcsec = 72-144 pc) spectral scan datasets for the nearby type-2 Seyfert galaxy, NGC 1068, using Band 3 of the Atacama Large Millimeter/submillimeter Array. We apply PCA to 16 well-detected molecular line intensity maps convolved to a common 150 pc resolution. In addition, we include the [SIII]/[SII] line ratio and [CI] $^3P_1$-$^3P_0$ maps in the literature, both of whose distributions show remarkable resemblance with that of a kpc-scale biconical outflow from the central AGN. We identify two prominent features: (1) central concentration at the circumnuclear disk (CND) and (2) two peaks across the center that coincide with the biconical outflow peaks. The concentrated molecular lines in the CND are mostly high-dipole molecules (e.g., H$^{13}$CN, HC$_3$N, and HCN). Line emissions from molecules known to be enhanced in irradiated interstellar medium, CN, C$_2$H, and HNC, show similar concentrations and extended components along the bicone, suggesting that molecule dissociation is a dominant chemical effect of the cold molecular outflow of this galaxy. Although further investigation should be made, this scenario is consistent with the faintness or absence of the emission lines from CO isotopologues, CH$_3$OH, and N$_2$H$^+$, in the outflow, which are easily destroyed by dissociating photons and electrons. △ Less

Submitted 13 July, 2022; originally announced July 2022.

Comments: 15 pages, 6 figures, 2 tables, accepted for publication in ApJ

arXiv:2206.05876 [pdf, other]

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques

Authors: Kota Dohi, Keisuke Imoto, Noboru Harada, Daisuke Niizumi, Yuma Koizumi, Tomoya Nishida, Harsh Purohit, Takashi Endo, Masaaki Yamamoto, Yohei Kawaguchi

Abstract: We present the task description and discussion on the results of the DCASE 2022 Challenge Task 2: ``Unsupervised anomalous sound detection (ASD) for machine condition monitoring applying domain generalization techniques''. Domain shifts are a critical problem for the application of ASD systems. Because domain shifts can change the acoustic characteristics of data, a model trained in a source domai… ▽ More We present the task description and discussion on the results of the DCASE 2022 Challenge Task 2: ``Unsupervised anomalous sound detection (ASD) for machine condition monitoring applying domain generalization techniques''. Domain shifts are a critical problem for the application of ASD systems. Because domain shifts can change the acoustic characteristics of data, a model trained in a source domain performs poorly for a target domain. In DCASE 2021 Challenge Task 2, we organized an ASD task for handling domain shifts. In this task, it was assumed that the occurrences of domain shifts are known. However, in practice, the domain of each sample may not be given, and the domain shifts can occur implicitly. In 2022 Task 2, we focus on domain generalization techniques that detects anomalies regardless of the domain shifts. Specifically, the domain of each sample is not given in the test data and only one threshold is allowed for all domains. Analysis of 81 submissions from 31 teams revealed two remarkable types of domain generalization techniques: 1) domain-mixing-based approach that obtains generalized representations and 2) domain-classification-based approach that explicitly or implicitly classifies different domains to improve detection performance for each domain. △ Less

Submitted 21 November, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2106.04492

arXiv:2205.08138 [pdf, ps, other]

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Abstract: Many application studies rely on audio DNN models pre-trained on a large-scale dataset as essential feature extractors, and they extract features from the last layers. In this study, we focus on our finding that the middle layer features of existing supervised pre-trained models are more effective than the late layer features for some tasks. We propose a simple approach to compose features effecti… ▽ More Many application studies rely on audio DNN models pre-trained on a large-scale dataset as essential feature extractors, and they extract features from the last layers. In this study, we focus on our finding that the middle layer features of existing supervised pre-trained models are more effective than the late layer features for some tasks. We propose a simple approach to compose features effective for general-purpose applications, consisting of two steps: (1) calculating feature vectors along the time frame from middle/late layer outputs, and (2) fusing them. This approach improves the utility of frequency and channel information in downstream processes, and combines the effectiveness of middle and late layer features for different tasks. As a result, the feature vectors become effective for general purposes. In the experiments using VGGish, PANNs' CNN14, and AST on nine downstream tasks, we first show that each layer output of these models serves different tasks. Then, we demonstrate that the proposed approach significantly improves their performance and brings it to a level comparable to that of the state-of-the-art. In particular, the performance of the non-semantic speech (NOSS) tasks greatly improves, especially on Speech commands V2 with VGGish of +77.1 (14.3% to 91.4%). △ Less

Submitted 17 May, 2022; originally announced May 2022.

Comments: 5 pages, 4 figures and 4 tables. Accepted by EUSIPCO 2022

MSC Class: 68T07

arXiv:2205.03281 [pdf, other]

doi 10.1051/0004-6361/202243384

Methanol masers in NGC 253 with ALCHEMI

Authors: P. K. Humire, C. Henkel, A. Hernández-Gómez, S. Martín, J. Mangum, N. Harada, S. Muller, K. Sakamoto, K. Tanaka, Y. Yoshimura, K. Nakanishi, S. Mühle, R. Herrero-Illana, D. S. Meier, E. Caux, R. Aladro, R. Mauersberger, S. Viti, L. Colzi, V. M. Rivilla, M. Gorski, K. M. Menten, K. -Y. Huang, S. Aalto, P. P. van der Werf , et al. (1 additional authors not shown)

Abstract: Context: Methanol masers of Class I (collisionally-pumped) and Class II (radiatively-pumped) have been studied in great detail in our Galaxy in a variety of astrophysical environments such as shocks and star-forming regions and are helpful to analyze the properties of the dense interstellar medium. However, the study of methanol masers in external galaxies is still in its infancy. Aims: Our main g… ▽ More Context: Methanol masers of Class I (collisionally-pumped) and Class II (radiatively-pumped) have been studied in great detail in our Galaxy in a variety of astrophysical environments such as shocks and star-forming regions and are helpful to analyze the properties of the dense interstellar medium. However, the study of methanol masers in external galaxies is still in its infancy. Aims: Our main goal is to search for methanol masers in the central molecular zone (CMZ; inner 500 pc) of the nearby starburst galaxy NGC 253. Methods: Covering a frequency range between 84 and 373 GHz ($λ$ = 3.6 to 0.8 mm) at high angular (1.6"$\sim$27 pc) and spectral ($\sim$8--9 km s$^{-1}$) resolution with the ALMA large program ALCHEMI, we have probed different regions across the CMZ of NGC 253. In order to look for methanol maser candidates, we employed the rotation diagram method and a set of radiative transfer models. Results: We detect for the first time masers above 84 GHz in NGC 253, covering an ample portion of the $J_{-1}\rightarrow(J-$ 1)$_{0}-E$ line series (at 84, 132, 229, and 278 GHz) and the $J_{0}\rightarrow(J-$ 1)$_{1}-A$ series (at 95, 146, and 198 GHz). This confirms the presence of the Class I maser line at 84 GHz, already reported but now being detected in more than one location. For the $J_{-1}\rightarrow(J-$ 1)$_{0}-E$ line series, we observe a lack of Class I maser candidates in the central star-forming disk. Conclusions: The physical conditions for maser excitation in the $J_{-1}\rightarrow(J-$ 1)$_{0}-E$ line series can be weak shocks and cloud-cloud collisions as suggested by shock tracers (SiO and HNCO) in bi-symmetric shock/active regions located in the outskirts of the CMZ. On the other hand, the presence of photodissociation regions due to a high star-formation rate would be needed to explain the lack of Class I masers in the very central regions. △ Less

Submitted 6 May, 2022; originally announced May 2022.

Comments: Accepted for publication in A&A. 29 pages, 17 figures (4 in Appendix)

arXiv:2204.12260 [pdf, other]

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation

Authors: Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Abstract: Recent general-purpose audio representations show state-of-the-art performance on various audio tasks. These representations are pre-trained by self-supervised learning methods that create training signals from the input. For example, typical audio contrastive learning uses temporal relationships among input sounds to create training signals, whereas some methods use a difference among input views… ▽ More Recent general-purpose audio representations show state-of-the-art performance on various audio tasks. These representations are pre-trained by self-supervised learning methods that create training signals from the input. For example, typical audio contrastive learning uses temporal relationships among input sounds to create training signals, whereas some methods use a difference among input views created by data augmentations. However, these training signals do not provide information derived from the intact input sound, which we think is suboptimal for learning representation that describes the input as it is. In this paper, we seek to learn audio representations from the input itself as supervision using a pretext task of auto-encoding of masked spectrogram patches, Masked Spectrogram Modeling (MSM, a variant of Masked Image Modeling applied to audio spectrogram). To implement MSM, we use Masked Autoencoders (MAE), an image self-supervised learning method. MAE learns to efficiently encode the small number of visible patches into latent representations to carry essential information for reconstructing a large number of masked patches. While training, MAE minimizes the reconstruction error, which uses the input as training signal, consequently achieving our goal. We conducted experiments on our MSM using MAE (MSM-MAE) models under the evaluation benchmark of the HEAR 2021 NeurIPS Challenge. Our MSM-MAE models outperformed the HEAR 2021 Challenge results on seven out of 15 tasks (e.g., accuracies of 73.4% on CREMA-D and 85.8% on LibriCount), while showing top performance on other tasks where specialized models perform better. We also investigate how the design choices of MSM-MAE impact the performance and conduct qualitative analysis of visualization outcomes to gain an understanding of learned representations. We make our code available online. △ Less

Submitted 26 April, 2022; originally announced April 2022.

Comments: 22 pages, 8 figures. Under the review process

MSC Class: 68T07

Journal ref: HEAR: Holistic Evaluation of Audio Representations (NeurIPS 2021 Competition) PMLR 166 (2022) 1-24

Showing 1–50 of 117 results for author: Harada, N