-
A Dual-Stream Neural Network Explains the Functional Segregation of Dorsal and Ventral Visual Pathways in Human Brains
Authors:
Minkyu Choi,
Kuan Han,
Xiaokai Wang,
Yizhen Zhang,
Zhongming Liu
Abstract:
The human visual system uses two parallel pathways for spatial processing and object recognition. In contrast, computer vision systems tend to use a single feedforward pathway, rendering them less robust, adaptive, or efficient than human vision. To bridge this gap, we developed a dual-stream vision model inspired by the human eyes and brain. At the input level, the model samples two complementary…
▽ More
The human visual system uses two parallel pathways for spatial processing and object recognition. In contrast, computer vision systems tend to use a single feedforward pathway, rendering them less robust, adaptive, or efficient than human vision. To bridge this gap, we developed a dual-stream vision model inspired by the human eyes and brain. At the input level, the model samples two complementary visual patterns to mimic how the human eyes use magnocellular and parvocellular retinal ganglion cells to separate retinal inputs to the brain. At the backend, the model processes the separate input patterns through two branches of convolutional neural networks (CNN) to mimic how the human brain uses the dorsal and ventral cortical pathways for parallel visual processing. The first branch (WhereCNN) samples a global view to learn spatial attention and control eye movements. The second branch (WhatCNN) samples a local view to represent the object around the fixation. Over time, the two branches interact recurrently to build a scene representation from moving fixations. We compared this model with the human brains processing the same movie and evaluated their functional alignment by linear transformation. The WhereCNN and WhatCNN branches were found to differentially match the dorsal and ventral pathways of the visual cortex, respectively, primarily due to their different learning objectives. These model-based results lead us to speculate that the distinct responses and representations of the ventral and dorsal streams are more influenced by their distinct goals in visual attention and object recognition than by their specific bias or selectivity in retinal inputs. This dual-stream model takes a further step in brain-inspired computer vision, enabling parallel neural networks to actively explore and understand the visual surroundings.
△ Less
Submitted 20 November, 2023; v1 submitted 20 October, 2023;
originally announced October 2023.
-
SimCKP: Simple Contrastive Learning of Keyphrase Representations
Authors:
Minseok Choi,
Chaeheon Gwak,
Seho Kim,
Si Hyeong Kim,
Jaegul Choo
Abstract:
Keyphrase generation (KG) aims to generate a set of summarizing words or phrases given a source document, while keyphrase extraction (KE) aims to identify them from the text. Because the search space is much smaller in KE, it is often combined with KG to predict keyphrases that may or may not exist in the corresponding document. However, current unified approaches adopt sequence labeling and maxim…
▽ More
Keyphrase generation (KG) aims to generate a set of summarizing words or phrases given a source document, while keyphrase extraction (KE) aims to identify them from the text. Because the search space is much smaller in KE, it is often combined with KG to predict keyphrases that may or may not exist in the corresponding document. However, current unified approaches adopt sequence labeling and maximization-based generation that primarily operate at a token level, falling short in observing and scoring keyphrases as a whole. In this work, we propose SimCKP, a simple contrastive learning framework that consists of two stages: 1) An extractor-generator that extracts keyphrases by learning context-aware phrase-level representations in a contrastive manner while also generating keyphrases that do not appear in the document; 2) A reranker that adapts scores for each generated phrase by likewise aligning their representations with the corresponding document. Experimental results on multiple benchmark datasets demonstrate the effectiveness of our proposed approach, which outperforms the state-of-the-art models by a significant margin.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
Markov chain entropy games and the geometry of their Nash equilibria
Authors:
Michael C. H. Choi,
Geoffrey Wolfer
Abstract:
Consider the following two-person mixed strategy game of a probabilist against Nature with respect to the parameters $(f, \mathcal{B},π)$, where $f$ is a convex function satisfying certain regularity conditions, $\mathcal{B}$ is either the set $\{L_i\}_{i=1}^n$ or its convex hull with each $L_i$ being a Markov infinitesimal generator on a finite state space $\mathcal{X}$ and $π$ is a given positiv…
▽ More
Consider the following two-person mixed strategy game of a probabilist against Nature with respect to the parameters $(f, \mathcal{B},π)$, where $f$ is a convex function satisfying certain regularity conditions, $\mathcal{B}$ is either the set $\{L_i\}_{i=1}^n$ or its convex hull with each $L_i$ being a Markov infinitesimal generator on a finite state space $\mathcal{X}$ and $π$ is a given positive discrete distribution on $\mathcal{X}$. The probabilist chooses a prior measure $μ$ within the set of probability measures on $\mathcal{B}$ denoted by $\mathcal{P}(\mathcal{B})$ and picks a $L \in \mathcal{B}$ at random according to $μ$, whereas Nature follows a pure strategy to select $M \in \mathcal{L}(π)$, the set of $π$-reversible Markov generators on $\mathcal{X}$. Nature pays an amount $D_f(M||L)$, the $f$-divergence from $L$ to $M$, to the probabilist. We prove that a mixed strategy Nash equilibrium always exists, and establish a minimax result on the expected payoff of the game. This also contrasts with the pure strategy version of the game where we show a Nash equilibrium may not exist. To find approximately a mixed strategy Nash equilibrium, we propose and develop a simple projected subgradient algorithm that provably converges with a rate of $\mathcal{O}(1/\sqrt{t})$, where $t$ is the number of iterations. In addition, we elucidate the relationships of Nash equilibrium with other seemingly disparate notions such as weighted information centroid, Chebyshev center and Bayes risk. This article generalizes the two-person game of a statistician against Nature developed in the literature, and highlights the powerful interplay and synergy between modern Markov chains theory and geometry, information theory, game theory, optimization and mathematical statistics.
△ Less
Submitted 6 October, 2023;
originally announced October 2023.
-
PASTA: PArallel Spatio-Temporal Attention with spatial auto-correlation gating for fine-grained crowd flow prediction
Authors:
Chung Park,
Junui Hong,
Cheonbok Park,
Taesan Kim,
Minsung Choi,
Jaegul Choo
Abstract:
Understanding the movement patterns of objects (e.g., humans and vehicles) in a city is essential for many applications, including city planning and management. This paper proposes a method for predicting future city-wide crowd flows by modeling the spatio-temporal patterns of historical crowd flows in fine-grained city-wide maps. We introduce a novel neural network named PArallel Spatio-Temporal…
▽ More
Understanding the movement patterns of objects (e.g., humans and vehicles) in a city is essential for many applications, including city planning and management. This paper proposes a method for predicting future city-wide crowd flows by modeling the spatio-temporal patterns of historical crowd flows in fine-grained city-wide maps. We introduce a novel neural network named PArallel Spatio-Temporal Attention with spatial auto-correlation gating (PASTA) that effectively captures the irregular spatio-temporal patterns of fine-grained maps. The novel components in our approach include spatial auto-correlation gating, multi-scale residual block, and temporal attention gating module. The spatial auto-correlation gating employs the concept of spatial statistics to identify irregular spatial regions. The multi-scale residual block is responsible for handling multiple range spatial dependencies in the fine-grained map, and the temporal attention gating filters out irrelevant temporal information for the prediction. The experimental results demonstrate that our model outperforms other competing baselines, especially under challenging conditions that contain irregular spatial regions. We also provide a qualitative analysis to derive the critical time information where our model assigns high attention scores in prediction.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Pre-training Contextual Location Embeddings in Personal Trajectories via Efficient Hierarchical Location Representations
Authors:
Chung Park,
Taesan Kim,
Junui Hong,
Minsung Choi,
Jaegul Choo
Abstract:
Pre-training the embedding of a location generated from human mobility data has become a popular method for location based services. In practice, modeling the location embedding is too expensive, due to the large number of locations to be trained in situations with fine-grained resolution or extensive target regions. Previous studies have handled less than ten thousand distinct locations, which is…
▽ More
Pre-training the embedding of a location generated from human mobility data has become a popular method for location based services. In practice, modeling the location embedding is too expensive, due to the large number of locations to be trained in situations with fine-grained resolution or extensive target regions. Previous studies have handled less than ten thousand distinct locations, which is insufficient in the real-world applications. To tackle this problem, we propose a Geo-Tokenizer, designed to efficiently reduce the number of locations to be trained by representing a location as a combination of several grids at different scales. In the Geo-Tokenizer, a grid at a larger scale shares the common set of grids at smaller scales, which is a key factor in reducing the size of the location vocabulary. The sequences of locations preprocessed with the Geo-Tokenizer are utilized by a causal location embedding model to capture the temporal dependencies of locations. This model dynamically calculates the embedding vector of a target location, which varies depending on its trajectory. In addition, to efficiently pre-train the location embedding model, we propose the Hierarchical Auto-regressive Location Model objective to effectively train decomposed locations in the Geo-Tokenizer. We conducted experiments on two real-world user trajectory datasets using our pre-trained location model. The experimental results show that our model significantly improves the performance of downstream tasks with fewer model parameters compared to existing location embedding methods.
△ Less
Submitted 2 October, 2023;
originally announced October 2023.
-
Self-distilled Masked Attention guided masked image modeling with noise Regularized Teacher (SMART) for medical image analysis
Authors:
Jue Jiang,
Aneesh Rangnekar,
Chloe Min Seo Choi,
Harini Veeraraghavan
Abstract:
Pretraining vision transformers (ViT) with attention guided masked image modeling (MIM) has shown to increase downstream accuracy for natural image analysis. Hierarchical shifted window (Swin) transformer, often used in medical image analysis cannot use attention guided masking as it lacks an explicit [CLS] token, needed for computing attention maps for selective masking. We thus enhanced Swin wit…
▽ More
Pretraining vision transformers (ViT) with attention guided masked image modeling (MIM) has shown to increase downstream accuracy for natural image analysis. Hierarchical shifted window (Swin) transformer, often used in medical image analysis cannot use attention guided masking as it lacks an explicit [CLS] token, needed for computing attention maps for selective masking. We thus enhanced Swin with semantic class attention. We developed a co-distilled Swin transformer that combines a noisy momentum updated teacher to guide selective masking for MIM. Our approach called \textsc{s}e\textsc{m}antic \textsc{a}ttention guided co-distillation with noisy teacher \textsc{r}egularized Swin \textsc{T}rans\textsc{F}ormer (SMARTFormer) was applied for analyzing 3D computed tomography datasets with lung nodules and malignant lung cancers (LC). We also analyzed the impact of semantic attention and noisy teacher on pretraining and downstream accuracy. SMARTFormer classified lesions (malignant from benign) with a high accuracy of 0.895 of 1000 nodules, predicted LC treatment response with accuracy of 0.74, and achieved high accuracies even in limited data regimes. Pretraining with semantic attention and noisy teacher improved ability to distinguish semantically meaningful structures such as organs in a unsupervised clustering task and localize abnormal structures like tumors. Code, models will be made available through GitHub upon paper acceptance.
△ Less
Submitted 3 July, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration
Authors:
Minseok Choi,
Hyesu Lim,
Jaegul Choo
Abstract:
Document-level relation extraction (DocRE) aims to extract relations of all entity pairs in a document. A key challenge in DocRE is the cost of annotating such data which requires intensive human effort. Thus, we investigate the case of DocRE in a low-resource setting, and we find that existing models trained on low data overestimate the NA ("no relation") label, causing limited performance. In th…
▽ More
Document-level relation extraction (DocRE) aims to extract relations of all entity pairs in a document. A key challenge in DocRE is the cost of annotating such data which requires intensive human effort. Thus, we investigate the case of DocRE in a low-resource setting, and we find that existing models trained on low data overestimate the NA ("no relation") label, causing limited performance. In this work, we approach the problem from a calibration perspective and propose PRiSM, which learns to adapt logits based on relation semantic information. We evaluate our method on three DocRE datasets and demonstrate that integrating existing models with PRiSM improves performance by as much as 26.38 F1 score, while the calibration error drops as much as 36 times when trained with about 3% of data. The code is publicly available at https://github.com/brightjade/PRiSM.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Self-Purification and Entanglement Revival in Lambda Matter
Authors:
Dongni Chen,
Stefano Chesi,
Mahn-Soo Choi
Abstract:
In this study, we explore the dynamics of entanglement in an ensemble of three-level systems with a lambda-type level structure interacting with single-mode bosons. Our investigation focuses on zero-energy states within the subspace of totally symmetric wave functions. Remarkably, we observe a universal two-stage dynamics of entanglement with intriguing revival behavior. The revival of entanglemen…
▽ More
In this study, we explore the dynamics of entanglement in an ensemble of three-level systems with a lambda-type level structure interacting with single-mode bosons. Our investigation focuses on zero-energy states within the subspace of totally symmetric wave functions. Remarkably, we observe a universal two-stage dynamics of entanglement with intriguing revival behavior. The revival of entanglement is a consequence of the self-purification process, where the quantum state relaxes and converges universally to a special dark state within the system.
△ Less
Submitted 25 December, 2023; v1 submitted 2 September, 2023;
originally announced September 2023.
-
Velocity-gauge real-time time-dependent density functional tight-binding for large-scale condensed matter systems
Authors:
Qiang Xu,
Mauro Del Ben,
Mahmut Sait Okyay,
Min Choi,
Khaled Z. Ibrahim,
Bryan M. Wong
Abstract:
We present a new velocity-gauge real-time, time-dependent density functional tight-binding (VG-rtTDDFTB) implementation in the open-source DFTB+ software package (https://dftbplus.org) for probing electronic excitations in large, condensed matter systems. Our VG-rtTDDFTB approach enables real-time electron dynamics simulations of large, periodic, condensed matter systems containing thousands of at…
▽ More
We present a new velocity-gauge real-time, time-dependent density functional tight-binding (VG-rtTDDFTB) implementation in the open-source DFTB+ software package (https://dftbplus.org) for probing electronic excitations in large, condensed matter systems. Our VG-rtTDDFTB approach enables real-time electron dynamics simulations of large, periodic, condensed matter systems containing thousands of atoms with a favorable computational scaling as a function of system size. We provide computational details and benchmark calculations to demonstrate its accuracy and computational parallelizability on a variety of large material systems. As a representative example, we calculate laser-induced electron dynamics in a 512-atom amorphous silicon supercell to highlight the large periodic systems that can be examined with our implementation. Taken together, our VG-rtTDDFTB approach enables new electron dynamics simulations of complex systems that require large periodic supercells, such as crystal defects, complex surfaces, nanowires, and amorphous materials.
△ Less
Submitted 21 May, 2024; v1 submitted 18 August, 2023;
originally announced August 2023.
-
Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models
Authors:
Dohwan Ko,
Ji Soo Lee,
Miso Choi,
Jaewon Chu,
Jihwan Park,
Hyunwoo J. Kim
Abstract:
Video Question Answering (VideoQA) is a challenging task that entails complex multi-modal reasoning. In contrast to multiple-choice VideoQA which aims to predict the answer given several options, the goal of open-ended VideoQA is to answer questions without restricting candidate answers. However, the majority of previous VideoQA models formulate open-ended VideoQA as a classification task to class…
▽ More
Video Question Answering (VideoQA) is a challenging task that entails complex multi-modal reasoning. In contrast to multiple-choice VideoQA which aims to predict the answer given several options, the goal of open-ended VideoQA is to answer questions without restricting candidate answers. However, the majority of previous VideoQA models formulate open-ended VideoQA as a classification task to classify the video-question pairs into a fixed answer set, i.e., closed-vocabulary, which contains only frequent answers (e.g., top-1000 answers). This leads the model to be biased toward only frequent answers and fail to generalize on out-of-vocabulary answers. We hence propose a new benchmark, Open-vocabulary Video Question Answering (OVQA), to measure the generalizability of VideoQA models by considering rare and unseen answers. In addition, in order to improve the model's generalization power, we introduce a novel GNN-based soft verbalizer that enhances the prediction on rare and unseen answers by aggregating the information from their similar words. For evaluation, we introduce new baselines by modifying the existing (closed-vocabulary) open-ended VideoQA models and improve their performances by further taking into account rare and unseen answers. Our ablation studies and qualitative analyses demonstrate that our GNN-based soft verbalizer further improves the model performance, especially on rare and unseen answers. We hope that our benchmark OVQA can serve as a guide for evaluating the generalizability of VideoQA models and inspire future research. Code is available at https://github.com/mlvlab/OVQA.
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
Profile Update: The Effects of Identity Disclosure on Network Connections and Language
Authors:
Minje Choi,
Daniel M. Romero,
David Jurgens
Abstract:
Our social identities determine how we interact and engage with the world surrounding us. In online settings, individuals can make these identities explicit by including them in their public biography, possibly signaling a change to what is important to them and how they should be viewed. Here, we perform the first large-scale study on Twitter that examines behavioral changes following identity si…
▽ More
Our social identities determine how we interact and engage with the world surrounding us. In online settings, individuals can make these identities explicit by including them in their public biography, possibly signaling a change to what is important to them and how they should be viewed. Here, we perform the first large-scale study on Twitter that examines behavioral changes following identity signal addition on Twitter profiles. Combining social networks with NLP and quasi-experimental analyses, we discover that after disclosing an identity on their profiles, users (1) generate more tweets containing language that aligns with their identity and (2) connect more to same-identity users. We also examine whether adding an identity signal increases the number of offensive replies and find that (3) the combined effect of disclosing identity via both tweets and profiles is associated with a reduced number of offensive replies from others.
△ Less
Submitted 17 August, 2023;
originally announced August 2023.
-
Two Tales of Platoon Intelligence for Autonomous Mobility Control: Enabling Deep Learning Recipes
Authors:
Soohyun Park,
Haemin Lee,
Chanyoung Park,
Soyi Jung,
Minseok Choi,
Joongheon Kim
Abstract:
This paper presents the deep learning-based recent achievements to resolve the problem of autonomous mobility control and efficient resource management of autonomous vehicles and UAVs, i.e., (i) multi-agent reinforcement learning (MARL), and (ii) neural Myerson auction. Representatively, communication network (CommNet), which is one of the most popular MARL algorithms, is introduced to enable mult…
▽ More
This paper presents the deep learning-based recent achievements to resolve the problem of autonomous mobility control and efficient resource management of autonomous vehicles and UAVs, i.e., (i) multi-agent reinforcement learning (MARL), and (ii) neural Myerson auction. Representatively, communication network (CommNet), which is one of the most popular MARL algorithms, is introduced to enable multiple agents to take actions in a distributed manner for their shared goals by training all agents' states and actions in a single neural network. Moreover, the neural Myerson auction guarantees trustfulness among multiple agents as well as achieves the optimal revenue of highly dynamic systems. Therefore, we survey the recent studies on autonomous mobility control based on MARL and neural Myerson auction. Furthermore, we emphasize that integration of MARL and neural Myerson auction is expected to be critical for efficient and trustful autonomous mobility services.
△ Less
Submitted 18 July, 2023;
originally announced July 2023.
-
Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models
Authors:
Sanghyun Kim,
Seohyeon Jung,
Balhae Kim,
Moonseok Choi,
**woo Shin,
Juho Lee
Abstract:
Large-scale image generation models, with impressive quality made possible by the vast amount of data available on the Internet, raise social concerns that these models may generate harmful or copyrighted content. The biases and harmfulness arise throughout the entire training process and are hard to completely remove, which have become significant hurdles to the safe deployment of these models. I…
▽ More
Large-scale image generation models, with impressive quality made possible by the vast amount of data available on the Internet, raise social concerns that these models may generate harmful or copyrighted content. The biases and harmfulness arise throughout the entire training process and are hard to completely remove, which have become significant hurdles to the safe deployment of these models. In this paper, we propose a method called SDD to prevent problematic content generation in text-to-image diffusion models. We self-distill the diffusion model to guide the noise estimate conditioned on the target removal concept to match the unconditional one. Compared to the previous methods, our method eliminates a much greater proportion of harmful content from the generated images without degrading the overall image quality. Furthermore, our method allows the removal of multiple concepts at once, whereas previous works are limited to removing a single concept at a time.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
HistRED: A Historical Document-Level Relation Extraction Dataset
Authors:
Soyoung Yang,
Minseok Choi,
Youngwoo Cho,
Jaegul Choo
Abstract:
Despite the extensive applications of relation extraction (RE) tasks in various domains, little has been explored in the historical context, which contains promising data across hundreds and thousands of years. To promote the historical RE research, we present HistRED constructed from Yeonhaengnok. Yeonhaengnok is a collection of records originally written in Hanja, the classical Chinese writing,…
▽ More
Despite the extensive applications of relation extraction (RE) tasks in various domains, little has been explored in the historical context, which contains promising data across hundreds and thousands of years. To promote the historical RE research, we present HistRED constructed from Yeonhaengnok. Yeonhaengnok is a collection of records originally written in Hanja, the classical Chinese writing, which has later been translated into Korean. HistRED provides bilingual annotations such that RE can be performed on Korean and Hanja texts. In addition, HistRED supports various self-contained subtexts with different lengths, from a sentence level to a document level, supporting diverse context settings for researchers to evaluate the robustness of their RE models. To demonstrate the usefulness of our dataset, we propose a bilingual RE model that leverages both Korean and Hanja contexts to predict relations between entities. Our model outperforms monolingual baselines on HistRED, showing that employing multiple language contexts supplements the RE predictions. The dataset is publicly available at: https://huggingface.co/datasets/Soyoung/HistRED under CC BY-NC-ND 4.0 license.
△ Less
Submitted 9 July, 2023;
originally announced July 2023.
-
Exploring Linguistic Style Matching in Online Communities: The Role of Social Context and Conversation Dynamics
Authors:
Aparna Ananthasubramaniam,
Hong Chen,
Jason Yan,
Kenan Alkiek,
Jiaxin Pei,
Agrima Seth,
Lavinia Dunagan,
Minje Choi,
Benjamin Litterer,
David Jurgens
Abstract:
Linguistic style matching (LSM) in conversations can be reflective of several aspects of social influence such as power or persuasion. However, how LSM relates to the outcomes of online communication on platforms such as Reddit is an unknown question. In this study, we analyze a large corpus of two-party conversation threads in Reddit where we identify all occurrences of LSM using two types of sty…
▽ More
Linguistic style matching (LSM) in conversations can be reflective of several aspects of social influence such as power or persuasion. However, how LSM relates to the outcomes of online communication on platforms such as Reddit is an unknown question. In this study, we analyze a large corpus of two-party conversation threads in Reddit where we identify all occurrences of LSM using two types of style: the use of function words and formality. Using this framework, we examine how levels of LSM differ in conversations depending on several social factors within Reddit: post and subreddit features, conversation depth, user tenure, and the controversiality of a comment. Finally, we measure the change of LSM following loss of status after community banning. Our findings reveal the interplay of LSM in Reddit conversations with several community metrics, suggesting the importance of understanding conversation engagement when understanding community dynamics.
△ Less
Submitted 26 August, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
A selectively reduced degree basis for efficient mixed nonlinear isogeometric beam formulations with extensible directors
Authors:
Myung-** Choi,
Roger A. Sauer,
Sven Klinkel
Abstract:
The effect of higher order continuity in the solution field by using NURBS basis function in isogeometric analysis (IGA) is investigated for an efficient mixed finite element formulation for elastostatic beams. It is based on the Hu-Washizu variational principle considering geometrical and material nonlinearities. Here we present a reduced degree of basis functions for the additional fields of the…
▽ More
The effect of higher order continuity in the solution field by using NURBS basis function in isogeometric analysis (IGA) is investigated for an efficient mixed finite element formulation for elastostatic beams. It is based on the Hu-Washizu variational principle considering geometrical and material nonlinearities. Here we present a reduced degree of basis functions for the additional fields of the stress resultants and strains of the beam, which are allowed to be discontinuous across elements. This approach turns out to significantly improve the computational efficiency and the accuracy of the results. We consider a beam formulation with extensible directors, where cross-sectional strains are enriched to avoid Poisson locking by an enhanced assumed strain method. In numerical examples, we show the superior per degree-of-freedom accuracy of IGA over conventional finite element analysis, due to the higher order continuity in the displacement field. We further verify the efficient rotational coupling between beams, as well as the path-independence of the results.
△ Less
Submitted 7 September, 2023; v1 submitted 23 June, 2023;
originally announced June 2023.
-
Triple spiral arms of a triple protostar system imaged in molecular lines
Authors:
Jeong-Eun Lee,
Tomoaki Matsumoto,
Hyun-Jeong Kim,
Seokho Lee,
Daniel Harsono,
Jaehan Bae,
Neal J. Evans II,
Shu-ichiro Inutsuka,
Minho Choi,
Ken'ichi Tatematsu,
Jae-Joon Lee,
Dan Jaffe
Abstract:
Most stars form in multiple star systems. For a better understanding of their formation processes, it is important to resolve the individual protostellar components and the surrounding envelope and disk material at the earliest possible formation epoch because the formation history can be lost in a few orbital timescales. Here we present the ALMA observational results of a young multiple protostel…
▽ More
Most stars form in multiple star systems. For a better understanding of their formation processes, it is important to resolve the individual protostellar components and the surrounding envelope and disk material at the earliest possible formation epoch because the formation history can be lost in a few orbital timescales. Here we present the ALMA observational results of a young multiple protostellar system, IRAS 04239+2436, where three well-developed large spiral arms were detected in the shocked SO emission. Along the most conspicuous arm, the accretion streamer was also detected in the SO$_2$ emission. The observational results are complemented by numerical magneto-hydrodynamic simulations, where those large arms only appear in magnetically weakened clouds. The numerical simulations also suggest that the large triple spiral arms are the result of gravitational interactions between compact triple protostars and the turbulent infalling envelope.
△ Less
Submitted 10 June, 2023;
originally announced June 2023.
-
Bifacial near-field thermophotovoltaic converter with transparent intermediate substrate
Authors:
Minwoo Choi,
Jaeman Song,
Bong Jae Lee
Abstract:
Thermophotovoltaic (TPV) converters are capable of generating electrical energy from infrared radiation emitted by an emitter powered by waste heat or solar energy. Key performance metrics for TPV converters are power output density (POD), which represents the electrical energy per unit area of the photovoltaic (PV) cell, and converter efficiency (CE), which indicates the proportion of radiative e…
▽ More
Thermophotovoltaic (TPV) converters are capable of generating electrical energy from infrared radiation emitted by an emitter powered by waste heat or solar energy. Key performance metrics for TPV converters are power output density (POD), which represents the electrical energy per unit area of the photovoltaic (PV) cell, and converter efficiency (CE), which indicates the proportion of radiative energy converted into electrical energy. A common method to significantly enhance POD is maintaining a sub-micron vacuum gap between the emitter and PV cell to leverage the near-field thermal radiation. On the other hand, bifacial TPV conversion, operating in the far-field regime, has been proposed to enhance CE by efficiently recycling the sub-bandgap energy radiation. However, bifacial TPV converters face a challenge in cooling the PV cell because the excess heat should be transferred in the lateral direction to side-edge cooling channels. Therefore, careful thermal engineering and management are required when employing near-field thermal radiation effects on bifacial TPV converters. In this study, we propose a bifacial near-field TPV (NF-TPV) converter that incorporates intrinsic Si intermediate layers, aiming to enhance both POD and CE. Si layers cover both sides of the PV cell to play a crucial role in PV cell cooling while addressing surface mode photonic loss in NF-TPV converters. We comprehensively analyze the influences of design parameters for a practical design of the bifacial NF-TPV converter. Our results demonstrate that a single-junction InAs cell can harvest 4.38 W/cm$^2$ of electrical energy with 27.2\% CE from 1000 K graphite emitters at a 100 nm vacuum gap. Despite the challenge in the cooing, our bifacial NF-TPV converter demonstrates 2.4 times larger POD with 2.7\% larger CE compared to conventional NF-TPV converters.
△ Less
Submitted 4 June, 2023;
originally announced June 2023.
-
An empirical study on speech restoration guided by self supervised speech representation
Authors:
Jaeuk Byun,
Youna Ji,
Soo Whan Chung,
Soyeon Choe,
Min Seok Choi
Abstract:
Enhancing speech quality is an indispensable yet difficult task as it is often complicated by a range of degradation factors. In addition to additive noise, reverberation, clip**, and speech attenuation can all adversely affect speech quality. Speech restoration aims to recover speech components from these distortions. This paper focuses on exploring the impact of self-supervised speech represen…
▽ More
Enhancing speech quality is an indispensable yet difficult task as it is often complicated by a range of degradation factors. In addition to additive noise, reverberation, clip**, and speech attenuation can all adversely affect speech quality. Speech restoration aims to recover speech components from these distortions. This paper focuses on exploring the impact of self-supervised speech representation learning on the speech restoration task. Specifically, we employ speech representation in various speech restoration networks and evaluate their performance under complicated distortion scenarios. Our experiments demonstrate that the contextual information provided by the self-supervised speech representation can enhance speech restoration performance in various distortion scenarios, while also increasing robustness against the duration of speech attenuation and mismatched test conditions.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark
Authors:
Minje Choi,
Jiaxin Pei,
Sagar Kumar,
Chang Shu,
David Jurgens
Abstract:
Large language models (LLMs) have been shown to perform well at a variety of syntactic, discourse, and reasoning tasks. While LLMs are increasingly deployed in many forms including conversational agents that interact with humans, we lack a grounded benchmark to measure how well LLMs understand \textit{social} language. Here, we introduce a new theory-driven benchmark, SocKET, that contains 58 NLP…
▽ More
Large language models (LLMs) have been shown to perform well at a variety of syntactic, discourse, and reasoning tasks. While LLMs are increasingly deployed in many forms including conversational agents that interact with humans, we lack a grounded benchmark to measure how well LLMs understand \textit{social} language. Here, we introduce a new theory-driven benchmark, SocKET, that contains 58 NLP tasks testing social knowledge which we group into five categories: humor & sarcasm, offensiveness, sentiment & emotion, and trustworthiness. In tests on the benchmark, we demonstrate that current models attain only moderate performance but reveal significant potential for task transfer among different types and categories of tasks, which were predicted from theory. Through zero-shot evaluations, we show that pretrained models already possess some innate but limited capabilities of social language understanding and training on one category of tasks can improve zero-shot testing on others. Our benchmark provides a systematic way to analyze model performance on an important dimension of language and points to clear room for improvement to build more socially-aware LLMs. The associated resources are released at https://github.com/minjechoi/SOCKET.
△ Less
Submitted 7 December, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Sparse Weight Averaging with Multiple Particles for Iterative Magnitude Pruning
Authors:
Moonseok Choi,
Hyungi Lee,
Giung Nam,
Juho Lee
Abstract:
Given the ever-increasing size of modern neural networks, the significance of sparse architectures has surged due to their accelerated inference speeds and minimal memory demands. When it comes to global pruning techniques, Iterative Magnitude Pruning (IMP) still stands as a state-of-the-art algorithm despite its simple nature, particularly in extremely sparse regimes. In light of the recent findi…
▽ More
Given the ever-increasing size of modern neural networks, the significance of sparse architectures has surged due to their accelerated inference speeds and minimal memory demands. When it comes to global pruning techniques, Iterative Magnitude Pruning (IMP) still stands as a state-of-the-art algorithm despite its simple nature, particularly in extremely sparse regimes. In light of the recent finding that the two successive matching IMP solutions are linearly connected without a loss barrier, we propose Sparse Weight Averaging with Multiple Particles (SWAMP), a straightforward modification of IMP that achieves performance comparable to an ensemble of two IMP solutions. For every iteration, we concurrently train multiple sparse models, referred to as particles, using different batch orders yet the same matching ticket, and then weight average such models to produce a single mask. We demonstrate that our method consistently outperforms existing baselines across different sparsities through extensive experiments on various data and neural network structures.
△ Less
Submitted 26 April, 2024; v1 submitted 24 May, 2023;
originally announced May 2023.
-
ConQueR: Contextualized Query Reduction using Search Logs
Authors:
Hye-young Kim,
Min** Choi,
Sunkyung Lee,
Eunseong Choi,
Young-In Song,
Jongwuk Lee
Abstract:
Query reformulation is a key mechanism to alleviate the linguistic chasm of query in ad-hoc retrieval. Among various solutions, query reduction effectively removes extraneous terms and specifies concise user intent from long queries. However, it is challenging to capture hidden and diverse user intent. This paper proposes Contextualized Query Reduction (ConQueR) using a pre-trained language model…
▽ More
Query reformulation is a key mechanism to alleviate the linguistic chasm of query in ad-hoc retrieval. Among various solutions, query reduction effectively removes extraneous terms and specifies concise user intent from long queries. However, it is challenging to capture hidden and diverse user intent. This paper proposes Contextualized Query Reduction (ConQueR) using a pre-trained language model (PLM). Specifically, it reduces verbose queries with two different views: core term extraction and sub-query selection. One extracts core terms from an original query at the term level, and the other determines whether a sub-query is a suitable reduction for the original query at the sequence level. Since they operate at different levels of granularity and complement each other, they are finally aggregated in an ensemble manner. We evaluate the reduction quality of ConQueR on real-world search logs collected from a commercial web search engine. It achieves up to 8.45% gains in exact match scores over the best competing model.
△ Less
Submitted 21 May, 2023;
originally announced May 2023.
-
The JCMT BISTRO Survey: Studying the Complex Magnetic Field of L43
Authors:
Janik Karoly,
Derek Ward-Thompson,
Kate Pattle,
David Berry,
Anthony Whitworth,
Jason Kirk,
Pierre Bastien,
Tao-Chung Ching,
Simon Coude,
Jihye Hwang,
Woo** Kwon,
Archana Soam,
Jia-Wei Wang,
Tetsuo Hasegawa,
Shih-** Lai,
Ke** Qiu,
Doris Arzoumanian,
Tyler L. Bourke,
Do-Young Byun,
Huei-Ru Vivien Chen,
Wen ** Chen,
Mike Chen,
Zhiwei Chen,
Jungyeon Cho,
Minho Choi
, et al. (133 additional authors not shown)
Abstract:
We present observations of polarized dust emission at 850 $μ$m from the L43 molecular cloud which sits in the Ophiuchus cloud complex. The data were taken using SCUBA-2/POL-2 on the James Clerk Maxwell Telescope as a part of the BISTRO large program. L43 is a dense ($N_{\rm H_2}\sim 10^{22}$-10$^{23}$ cm$^{-2}$) complex molecular cloud with a submillimetre-bright starless core and two protostellar…
▽ More
We present observations of polarized dust emission at 850 $μ$m from the L43 molecular cloud which sits in the Ophiuchus cloud complex. The data were taken using SCUBA-2/POL-2 on the James Clerk Maxwell Telescope as a part of the BISTRO large program. L43 is a dense ($N_{\rm H_2}\sim 10^{22}$-10$^{23}$ cm$^{-2}$) complex molecular cloud with a submillimetre-bright starless core and two protostellar sources. There appears to be an evolutionary gradient along the isolated filament that L43 is embedded within, with the most evolved source closest to the Sco OB2 association. One of the protostars drives a CO outflow that has created a cavity to the southeast. We see a magnetic field that appears to be aligned with the cavity walls of the outflow, suggesting interaction with the outflow. We also find a magnetic field strength of up to $\sim$160$\pm$30 $μ$G in the main starless core and up to $\sim$90$\pm$40 $μ$G in the more diffuse, extended region. These field strengths give magnetically super- and sub-critical values respectively and both are found to be roughly trans-Alfvénic. We also present a new method of data reduction for these denser but fainter objects like starless cores.
△ Less
Submitted 22 May, 2023; v1 submitted 18 May, 2023;
originally announced May 2023.
-
A digraph version of the Friendship Theorem
Authors:
Myungho Choi,
Ho** Chu,
Suh-Ryung Kim
Abstract:
The Friendship Theorem states that if in a party any pair of persons has precisely one common friend, then there is always a person who is everybody's friend and the theorem has been proved by Paul Erdös, Alfréd Rényi, and Vera T. Sós in 1966. This paper was written in response to the question, ``What would happen if the hypothesis stating that any pair of persons has exactly one common friend wer…
▽ More
The Friendship Theorem states that if in a party any pair of persons has precisely one common friend, then there is always a person who is everybody's friend and the theorem has been proved by Paul Erdös, Alfréd Rényi, and Vera T. Sós in 1966. This paper was written in response to the question, ``What would happen if the hypothesis stating that any pair of persons has exactly one common friend were replaced with one stating that any pair of persons warms to exactly one person?". We call a digraph obtained in this way a friendship digraph. It is easy to check that a symmetric friendship digraph becomes a friendship graph if each directed cycle of length two is replaced with an edge. Based on this observation, one can say that a friendship digraph is a generalization of a friendship graph. In this paper, we provide a digraph formulation of the Friendship Theorem by defining friendship digraphs as those in which any two distinct vertices have precisely one common out-neighbor. We also establish a sufficient and necessary condition for the existence of friendship digraphs.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
Critical heat flux diagnosis using conditional generative adversarial networks
Authors:
Ung** Na,
Moonhee Choi,
Hang** Jo
Abstract:
The critical heat flux (CHF) is an essential safety boundary in boiling heat transfer processes employed in high heat flux thermal-hydraulic systems. Identifying CHF is vital for preventing equipment damage and ensuring overall system safety, yet it is challenging due to the complexity of the phenomena. For an in-depth understanding of the complicated phenomena, various methodologies have been dev…
▽ More
The critical heat flux (CHF) is an essential safety boundary in boiling heat transfer processes employed in high heat flux thermal-hydraulic systems. Identifying CHF is vital for preventing equipment damage and ensuring overall system safety, yet it is challenging due to the complexity of the phenomena. For an in-depth understanding of the complicated phenomena, various methodologies have been devised, but the acquisition of high-resolution data is limited by the substantial resource consumption required. This study presents a data-driven, image-to-image translation method for reconstructing thermal data of a boiling system at CHF using conditional generative adversarial networks (cGANs). The supervised learning process relies on paired images, which include total reflection visualizations and infrared thermometry measurements obtained from flow boiling experiments. Our proposed approach has the potential to not only provide evidence connecting phase interface dynamics with thermal distribution but also to simplify the laborious and time-consuming experimental setup and data-reduction procedures associated with infrared thermal imaging, thereby providing an effective solution for CHF diagnosis.
△ Less
Submitted 4 May, 2023;
originally announced May 2023.
-
A composite measurement scheme for efficient quantum observable estimation
Authors:
Zi-Jian Zhang,
Kouhei Nakaji,
Matthew Choi,
Alán Aspuru-Guzik
Abstract:
Estimation of the expectation value of observables is a key subroutine in quantum computing and is also the bottleneck of the performance of many near-term quantum algorithms. Many works have been proposed to reduce the number of measurements needed for this task and they provide different measurement schemes for generating the measurements to perform. In this paper, we propose a new approach, com…
▽ More
Estimation of the expectation value of observables is a key subroutine in quantum computing and is also the bottleneck of the performance of many near-term quantum algorithms. Many works have been proposed to reduce the number of measurements needed for this task and they provide different measurement schemes for generating the measurements to perform. In this paper, we propose a new approach, composite measurement scheme, which composes multiple measurement schemes by distributing shots to them with a trainable ratio. As an example of our method, we study the case where only Pauli measurements are allowed and propose Composite-LBCS (C-LBCS), a composite measurement scheme made by composing locally-biased classical shadows. We numerically demonstrate C-LBCS on molecular systems up to $\mathrm{CO}_2$ (30 qubits) and show that C-LBCS outperforms the previous state-of-the-art methods despite its simplicity. We also show that C-LBCS can be efficiently optimized by stochastic gradient descent and is trainable even when the observable contains a large number of terms. We believe our method opens up a reliable way toward efficient observable estimation on large quantum systems.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
Stability condition of the steady oscillations in aggregation models with shattering process and self-fragmentation
Authors:
Jean-Yves P Fortin,
MooYoung Choi
Abstract:
We consider a system of clusters of various sizes or masses, subject to aggregation and fragmentation by collision with monomers or by self-disintegration. The aggregation rate for the cluster of size or mass $k$ is given by a kernel proportional to $k^{a}$, whereas the collision and disintegration kernels are given by $λk^{b}$ and $μk^{a}$, respectively, with $0\le a,b\le 1$ and positive factors…
▽ More
We consider a system of clusters of various sizes or masses, subject to aggregation and fragmentation by collision with monomers or by self-disintegration. The aggregation rate for the cluster of size or mass $k$ is given by a kernel proportional to $k^{a}$, whereas the collision and disintegration kernels are given by $λk^{b}$ and $μk^{a}$, respectively, with $0\le a,b\le 1$ and positive factors $λ$ and $μ$. We study the emergence of oscillations in the phase diagram $(μ,λ)$ for two models: $(a,b)=(1,0)$ and $(1,1)$. It is shown that the monomer population satisfies a class of integral equations possessing oscillatory solutions in a finite domain in the plane $(μ,λ)$. We evaluate analytically this domain and give an estimate of the oscillation frequency. In particular, these oscillations are found to occur generally for small but nonzero values of the parameter $μ$, far smaller than $λ$.
△ Less
Submitted 30 September, 2023; v1 submitted 28 April, 2023;
originally announced April 2023.
-
Cross-domain Denoising for Low-dose Multi-frame Spiral Computed Tomography
Authors:
Yucheng Lu,
Zhixin Xu,
Moon Hyung Choi,
Jimin Kim,
Seung-Won Jung
Abstract:
Computed tomography (CT) has been used worldwide as a non-invasive test to assist in diagnosis. However, the ionizing nature of X-ray exposure raises concerns about potential health risks such as cancer. The desire for lower radiation doses has driven researchers to improve reconstruction quality. Although previous studies on low-dose computed tomography (LDCT) denoising have demonstrated the effe…
▽ More
Computed tomography (CT) has been used worldwide as a non-invasive test to assist in diagnosis. However, the ionizing nature of X-ray exposure raises concerns about potential health risks such as cancer. The desire for lower radiation doses has driven researchers to improve reconstruction quality. Although previous studies on low-dose computed tomography (LDCT) denoising have demonstrated the effectiveness of learning-based methods, most were developed on the simulated data. However, the real-world scenario differs significantly from the simulation domain, especially when using the multi-slice spiral scanner geometry. This paper proposes a two-stage method for the commercially available multi-slice spiral CT scanners that better exploits the complete reconstruction pipeline for LDCT denoising across different domains. Our approach makes good use of the high redundancy of multi-slice projections and the volumetric reconstructions while leveraging the over-smoothing problem in conventional cascaded frameworks caused by aggressive denoising. The dedicated design also provides a more explicit interpretation of the data flow. Extensive experiments on various datasets showed that the proposed method could remove up to 70\% of noise without compromised spatial resolution, and subjective evaluations by two experienced radiologists further supported its superior performance against state-of-the-art methods in clinical practice.
△ Less
Submitted 28 June, 2024; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Ultrafast and Bright Quantum Emitters from the Cavity Coupled Single Perovskite Nanocrystals
Authors:
Seongmoon Jun,
Joonyun Kim,
Minho Choi,
Byungsu Kim,
**u Park,
Daehan Kim,
Byungha Shin,
Yong-Hoon Cho
Abstract:
Perovskite nanocrystals (NCs) have attracted increasing interest for the realization of single-photon emitters, owing to their ease of chemical synthesis, wide spectral tunability, fast recombination rate, scalability, and high quantum yield. However, the integration of a single perovskite NC into a photonic structure is yet to be accomplished. We successfully coupled a highly stable individual zw…
▽ More
Perovskite nanocrystals (NCs) have attracted increasing interest for the realization of single-photon emitters, owing to their ease of chemical synthesis, wide spectral tunability, fast recombination rate, scalability, and high quantum yield. However, the integration of a single perovskite NC into a photonic structure is yet to be accomplished. We successfully coupled a highly stable individual zwitterionic ligand-based CsPbBr3 perovskite NC with a circular Bragg grating (CBG). The far-field radiation pattern of the NC inside the CBG exhibits high directionality toward a low azimuthal angle, which is consistent with the simulation results. We observed a 5.4-fold enhancement in brightness due to an increase in collection efficiency. Moreover, we achieved a 1.95-fold increase in the recombination rate. This study offers ultrafast (< 100 ps) single-photon emission and an improved brightness of perovskite NCs, which are critical factors for practical quantum optical applications.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
Counting statistics based on the analytic solutions of the differential-difference equation for birth-death processes
Authors:
Seong Jun Park,
M. Y. Choi
Abstract:
Birth-death processes take place ubiquitously throughout the universe. In general, birth and death rates depend on the system size (corresponding to the number of products or customers undergoing the birth-death process) and thus vary every time birth or death occurs, which makes fluctuations in the rates inevitable. The differential-difference equation governing the time evolution of such a birth…
▽ More
Birth-death processes take place ubiquitously throughout the universe. In general, birth and death rates depend on the system size (corresponding to the number of products or customers undergoing the birth-death process) and thus vary every time birth or death occurs, which makes fluctuations in the rates inevitable. The differential-difference equation governing the time evolution of such a birth-death process is well established, but it resists solving for a non-asymptotic solution. In this work, we present the analytic solution of the differential-difference equation for birth-death processes without approximation. The time-dependent solution we obtain leads to an analytical expression for counting statistics of products (or customers). We further examine the relationship between the system size fluctuations and the birth and death rates, and find that statistical properties (variance subtracted by mean) of the system size are determined by the mean death rate as well as the covariance of the system size and the net growth rate (i.e., the birth rate minus the death rate). This work suggests a promising new direction for quantitative investigations into birth-death processes.
△ Less
Submitted 4 April, 2023; v1 submitted 31 March, 2023;
originally announced March 2023.
-
Systematic approaches to generate reversiblizations of Markov chains
Authors:
Michael C. H. Choi,
Geoffrey Wolfer
Abstract:
Given a target distribution $π$ and an arbitrary Markov infinitesimal generator $L$ on a finite state space $\mathcal{X}$, we develop three structured and inter-related approaches to generate new reversiblizations from $L$. The first approach hinges on a geometric perspective, in which we view reversiblizations as projections onto the space of $π$-reversible generators under suitable information d…
▽ More
Given a target distribution $π$ and an arbitrary Markov infinitesimal generator $L$ on a finite state space $\mathcal{X}$, we develop three structured and inter-related approaches to generate new reversiblizations from $L$. The first approach hinges on a geometric perspective, in which we view reversiblizations as projections onto the space of $π$-reversible generators under suitable information divergences such as $f$-divergences. With different choices of functions $f$, we not only recover nearly all established reversiblizations but also unravel and generate new reversiblizations. Along the way, we unveil interesting geometric results such as bisection properties, Pythagorean identities, parallelogram laws and a Markov chain counterpart of the arithmetic-geometric-harmonic mean inequality governing these reversiblizations. This further serves as motivation for introducing the notion of information centroids of a sequence of Markov chains and to give conditions for their existence and uniqueness. Building upon the first approach, we view reversiblizations as generalized means. In this second approach, we construct new reversiblizations via different natural notions of generalized means such as the Cauchy mean or the dual mean. In the third approach, we combine the recently introduced locally-balanced Markov processes framework and the notion of convex $*$-conjugate in the study of $f$-divergence. The latter offers a rich source of balancing functions to generate new reversiblizations.
△ Less
Submitted 10 September, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
First BISTRO observations of the dark cloud Taurus L1495A-B10: the role of the magnetic field in the earliest stages of low-mass star formation
Authors:
Derek Ward-Thompson,
Janik Karoly,
Kate Pattle,
Anthony Whitworth,
Jason Kirk,
David Berry,
Pierre Bastien,
Tao-Chung Ching,
Simon Coude,
Jihye Hwang,
Woo** Kwon,
Archana Soam,
Jia-Wei Wang,
Tetsuo Hasegawa,
Shih-** Lai,
Ke** Qiu,
Doris Arzoumanian,
Tyler L. Bourke,
Do-Young Byun,
Huei-Ru Vivien Chen,
Wen ** Chen,
Mike Chen,
Zhiwei Chen,
Jungyeon Cho,
Minho Choi
, et al. (133 additional authors not shown)
Abstract:
We present BISTRO Survey 850 μm dust emission polarisation observations of the L1495A-B10 region of the Taurus molecular cloud, taken at the JCMT. We observe a roughly triangular network of dense filaments. We detect 9 of the dense starless cores embedded within these filaments in polarisation, finding that the plane-of-sky orientation of the core-scale magnetic field lies roughly perpendicular to…
▽ More
We present BISTRO Survey 850 μm dust emission polarisation observations of the L1495A-B10 region of the Taurus molecular cloud, taken at the JCMT. We observe a roughly triangular network of dense filaments. We detect 9 of the dense starless cores embedded within these filaments in polarisation, finding that the plane-of-sky orientation of the core-scale magnetic field lies roughly perpendicular to the filaments in almost all cases. We also find that the large-scale magnetic field orientation measured by Planck is not correlated with any of the core or filament structures, except in the case of the lowest-density core. We propose a scenario for early prestellar evolution that is both an extension to, and consistent with, previous models, introducing an additional evolutionary transitional stage between field-dominated and matter-dominated evolution, observed here for the first time. In this scenario, the cloud collapses first to a sheet-like structure. Uniquely, we appear to be seeing this sheet almost face-on. The sheet fragments into filaments, which in turn form cores. However, the material must reach a certain critical density before the evolution changes from being field-dominated to being matter-dominated. We measure the sheet surface density and the magnetic field strength at that transition for the first time and show consistency with an analytical prediction that had previously gone untested for over 50 years (Mestel 1965).
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Analyzing the Engagement of Social Relationships During Life Event Shocks in Social Media
Authors:
Minje Choi,
David Jurgens,
Daniel M. Romero
Abstract:
Individuals experiencing unexpected distressing events, shocks, often rely on their social network for support. While prior work has shown how social networks respond to shocks, these studies usually treat all ties equally, despite differences in the support provided by different social relationships. Here, we conduct a computational analysis on Twitter that examines how responses to online shocks…
▽ More
Individuals experiencing unexpected distressing events, shocks, often rely on their social network for support. While prior work has shown how social networks respond to shocks, these studies usually treat all ties equally, despite differences in the support provided by different social relationships. Here, we conduct a computational analysis on Twitter that examines how responses to online shocks differ by the relationship type of a user dyad. We introduce a new dataset of over 13K instances of individuals' self-reporting shock events on Twitter and construct networks of relationship-labeled dyadic interactions around these events. By examining behaviors across 110K replies to shocked users in a pseudo-causal analysis, we demonstrate relationship-specific patterns in response levels and topic shifts. We also show that while well-established social dimensions of closeness such as tie strength and structural embeddedness contribute to shock responsiveness, the degree of impact is highly dependent on relationship and shock types. Our findings indicate that social relationships contain highly distinctive characteristics in network interactions and that relationship-specific behaviors in online shock responses are unique from those of offline settings.
△ Less
Submitted 15 February, 2023;
originally announced February 2023.
-
Improved Langevin Monte Carlo for stochastic optimization via landscape modification
Authors:
Michael C. H. Choi,
Youjia Wang
Abstract:
Given a target function $H$ to minimize or a target Gibbs distribution $π_β^0 \propto e^{-βH}$ to sample from in the low temperature, in this paper we propose and analyze Langevin Monte Carlo (LMC) algorithms that run on an alternative landscape as specified by $H^f_{β,c,1}$ and target a modified Gibbs distribution $π^f_{β,c,1} \propto e^{-βH^f_{β,c,1}}$, where the landscape of $H^f_{β,c,1}$ is a…
▽ More
Given a target function $H$ to minimize or a target Gibbs distribution $π_β^0 \propto e^{-βH}$ to sample from in the low temperature, in this paper we propose and analyze Langevin Monte Carlo (LMC) algorithms that run on an alternative landscape as specified by $H^f_{β,c,1}$ and target a modified Gibbs distribution $π^f_{β,c,1} \propto e^{-βH^f_{β,c,1}}$, where the landscape of $H^f_{β,c,1}$ is a transformed version of that of $H$ which depends on the parameters $f,β$ and $c$. While the original Log-Sobolev constant affiliated with $π^0_β$ exhibits exponential dependence on both $β$ and the energy barrier $M$ in the low temperature regime, with appropriate tuning of these parameters and subject to assumptions on $H$, we prove that the energy barrier of the transformed landscape is reduced which consequently leads to polynomial dependence on both $β$ and $M$ in the modified Log-Sobolev constant associated with $π^f_{β,c,1}$. This yield improved total variation mixing time bounds and improved convergence toward a global minimum of $H$. We stress that the technique developed in this paper is not only limited to LMC and is broadly applicable to other gradient-based optimization or sampling algorithms.
△ Less
Submitted 8 February, 2023;
originally announced February 2023.
-
Visible Wavelength Flatband in a Gallium Phosphide Metasurface
Authors:
Christopher Munley,
Arnab Manna,
David Sharp,
Minho Choi,
Hao Nguyen,
Brandi M. Cossairt,
Mo Li,
Arthur Barnard,
Arka Majumdar
Abstract:
Engineering the dispersion of light in a metasurface allows for controlling the light-matter interaction strength between light confined in the metasurface and materials placed within its near-field. Specifically, engineering a flatband dispersion increases the photonic density of states thereby enhancing the light-matter interaction. Here, we experimentally demonstrate a metasurface with a flat d…
▽ More
Engineering the dispersion of light in a metasurface allows for controlling the light-matter interaction strength between light confined in the metasurface and materials placed within its near-field. Specifically, engineering a flatband dispersion increases the photonic density of states thereby enhancing the light-matter interaction. Here, we experimentally demonstrate a metasurface with a flat dispersion at visible wavelengths. We designed and fabricated a suspended one-dimensional gallium phosphide metasurface and measured the photonic band structure via energy-momentum spectroscopy, observing a photonic band that is flat over $10^o$ of half-angle at $\sim 580$nm. We integrated cadmium selenide nanoplatelets with the metasurface, and measured coupled photoluminescence into the flatband. Our demonstration of a photonic flatband will enable the possibility of integrating emerging quantum emitters to the metasurface with possible applications in nonlinear image processing, and topological photonics.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
Rethinking Soft Label in Label Distribution Learning Perspective
Authors:
Seungbum Hong,
Jihun Yoon,
Bogyu Park,
Min-Kook Choi
Abstract:
The primary goal of training in early convolutional neural networks (CNN) is the higher generalization performance of the model. However, as the expected calibration error (ECE), which quantifies the explanatory power of model inference, was recently introduced, research on training models that can be explained is in progress. We hypothesized that a gap in supervision criteria during training and…
▽ More
The primary goal of training in early convolutional neural networks (CNN) is the higher generalization performance of the model. However, as the expected calibration error (ECE), which quantifies the explanatory power of model inference, was recently introduced, research on training models that can be explained is in progress. We hypothesized that a gap in supervision criteria during training and inference leads to overconfidence, and investigated that performing label distribution learning (LDL) would enhance the model calibration in CNN training. To verify this assumption, we used a simple LDL setting with recent data augmentation techniques. Based on a series of experiments, the following results are obtained: 1) State-of-the-art KD methods significantly impede model calibration. 2) Training using LDL with recent data augmentation can have excellent effects on model calibration and even in generalization performance. 3) Online LDL brings additional improvements in model calibration and accuracy with long training, especially in large-size models. Using the proposed approach, we simultaneously achieved a lower ECE and higher generalization performance for the image classification datasets CIFAR10, 100, STL10, and ImageNet. We performed several visualizations and analyses and witnessed several interesting behaviors in CNN training with the LDL.
△ Less
Submitted 31 January, 2023;
originally announced January 2023.
-
DECISIVE Benchmarking Data Report: sUAS Performance Results from Phase I
Authors:
Adam Norton,
Reza Ahmadzadeh,
Kshitij Jerath,
Paul Robinette,
Jay Weitzen,
Thanuka Wickramarathne,
Holly Yanco,
Minseop Choi,
Ryan Donald,
Brendan Donoghue,
Christian Dumas,
Peter Gavriel,
Alden Giedraitis,
Brendan Hertel,
Jack Houle,
Nathan Letteri,
Edwin Meriaux,
Zahra Rezaei Khavas,
Rakshith Singh,
Gregg Willcox,
Naye Yoni
Abstract:
This report reviews all results derived from performance benchmarking conducted during Phase I of the Development and Execution of Comprehensive and Integrated Subterranean Intelligent Vehicle Evaluations (DECISIVE) project by the University of Massachusetts Lowell, using the test methods specified in the DECISIVE Test Methods Handbook v1.1 for evaluating small unmanned aerial systems (sUAS) perfo…
▽ More
This report reviews all results derived from performance benchmarking conducted during Phase I of the Development and Execution of Comprehensive and Integrated Subterranean Intelligent Vehicle Evaluations (DECISIVE) project by the University of Massachusetts Lowell, using the test methods specified in the DECISIVE Test Methods Handbook v1.1 for evaluating small unmanned aerial systems (sUAS) performance in subterranean and constrained indoor environments, spanning communications, field readiness, interface, obstacle avoidance, navigation, map**, autonomy, trust, and situation awareness. Using those 20 test methods, over 230 tests were conducted across 8 sUAS platforms: Cleo Robotics Dronut X1P (P = prototype), FLIR Black Hornet PRS, Flyability Elios 2 GOV, Lumenier Nighthawk V3, Parrot ANAFI USA GOV, Skydio X2D, Teal Golden Eagle, and Vantage Robotics Vesper. Best in class criteria is specified for each applicable test method and the sUAS that match this criteria are named for each test method, including a high-level executive summary of their performance.
△ Less
Submitted 20 January, 2023; v1 submitted 18 January, 2023;
originally announced January 2023.
-
On unital qubit channels
Authors:
Chi-Kwong Li,
Man-Duen Choi
Abstract:
A canonical form for unital qubit channels under local unitary transforms is obtained. In particular, it is shown that the eigenvalues of the Choi matrix of a unital quantum channel form a complete set of invariants of the canonical form. It follows immediately that every unital qubit channel is the average of four unitary channels. More generally, a unital qubit channel can be expressed as the co…
▽ More
A canonical form for unital qubit channels under local unitary transforms is obtained. In particular, it is shown that the eigenvalues of the Choi matrix of a unital quantum channel form a complete set of invariants of the canonical form. It follows immediately that every unital qubit channel is the average of four unitary channels. More generally, a unital qubit channel can be expressed as the convex combination of unitary channels with convex coefficients $p_1, \dots, p_m$ as long as $2(p_1, \dots, p_m)$ is majorized by the vector of eigenvalues of the Choi matrix of the channel. A unital qubit channel in the canonical form will transform the Bloch sphere onto an ellipsoid. We look into the detailed structure of the natural linear maps sending the Bloch sphere onto a corresponding ellipsoid.
△ Less
Submitted 20 April, 2023; v1 submitted 3 January, 2023;
originally announced January 2023.
-
JCMT BISTRO Observations: Magnetic Field Morphology of Bubbles Associated with NGC 6334
Authors:
Mehrnoosh Tahani,
Pierre Bastien,
Ray S. Furuya,
Kate Pattle,
Doug Johnstone,
Doris Arzoumanian,
Yasuo Doi,
Tetsuo Hasegawa,
Shu-ichiro Inutsuka,
Simon Coudé,
Laura Fissel,
Michael Chun-Yuan Chen,
Frédérick Poidevin,
Sarah Sadavoy,
Rachel Friesen,
Patrick M. Koch,
James Di Francesco,
Gerald H. Moriarty-Schieven,
Zhiwei Chen,
Eun Jung Chung,
Chakali Eswaraiah,
Lapo Fanciullo,
Tim Gledhill,
Valentin J. M. Le Gouellec,
Thiem Hoang
, et al. (120 additional authors not shown)
Abstract:
We study the HII regions associated with the NGC 6334 molecular cloud observed in the sub-millimeter and taken as part of the B-fields In STar-forming Region Observations (BISTRO) Survey. In particular, we investigate the polarization patterns and magnetic field morphologies associated with these HII regions. Through polarization pattern and pressure calculation analyses, several of these bubbles…
▽ More
We study the HII regions associated with the NGC 6334 molecular cloud observed in the sub-millimeter and taken as part of the B-fields In STar-forming Region Observations (BISTRO) Survey. In particular, we investigate the polarization patterns and magnetic field morphologies associated with these HII regions. Through polarization pattern and pressure calculation analyses, several of these bubbles indicate that the gas and magnetic field lines have been pushed away from the bubble, toward an almost tangential (to the bubble) magnetic field morphology. In the densest part of NGC 6334, where the magnetic field morphology is similar to an hourglass, the polarization observations do not exhibit observable impact from HII regions. We detect two nested radial polarization patterns in a bubble to the south of NGC 6334 that correspond to the previously observed bipolar structure in this bubble. Finally, using the results of this study, we present steps (incorporating computer vision; circular Hough Transform) that can be used in future studies to identify bubbles that have physically impacted magnetic field lines.
△ Less
Submitted 21 December, 2022;
originally announced December 2022.
-
Biomedical image analysis competitions: The state of current participation practice
Authors:
Matthias Eisenmann,
Annika Reinke,
Vivienn Weru,
Minu Dietlinde Tizabi,
Fabian Isensee,
Tim J. Adler,
Patrick Godau,
Veronika Cheplygina,
Michal Kozubek,
Sharib Ali,
Anubha Gupta,
Jan Kybic,
Alison Noble,
Carlos Ortiz de Solórzano,
Samiksha Pachade,
Caroline Petitjean,
Daniel Sage,
Donglai Wei,
Elizabeth Wilden,
Deepak Alapatt,
Vincent Andrearczyk,
Ujjwal Baid,
Spyridon Bakas,
Niranjan Balu,
Sophia Bano
, et al. (331 additional authors not shown)
Abstract:
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,…
▽ More
The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.
△ Less
Submitted 12 September, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
SplitGP: Achieving Both Generalization and Personalization in Federated Learning
Authors:
Dong-Jun Han,
Do-Yeon Kim,
Minseok Choi,
Christopher G. Brinton,
Jaekyun Moon
Abstract:
A fundamental challenge to providing edge-AI services is the need for a machine learning (ML) model that achieves personalization (i.e., to individual clients) and generalization (i.e., to unseen data) properties concurrently. Existing techniques in federated learning (FL) have encountered a steep tradeoff between these objectives and impose large computational requirements on edge devices during…
▽ More
A fundamental challenge to providing edge-AI services is the need for a machine learning (ML) model that achieves personalization (i.e., to individual clients) and generalization (i.e., to unseen data) properties concurrently. Existing techniques in federated learning (FL) have encountered a steep tradeoff between these objectives and impose large computational requirements on edge devices during training and inference. In this paper, we propose SplitGP, a new split learning solution that can simultaneously capture generalization and personalization capabilities for efficient inference across resource-constrained clients (e.g., mobile/IoT devices). Our key idea is to split the full ML model into client-side and server-side components, and impose different roles to them: the client-side model is trained to have strong personalization capability optimized to each client's main task, while the server-side model is trained to have strong generalization capability for handling all clients' out-of-distribution tasks. We analytically characterize the convergence behavior of SplitGP, revealing that all client models approach stationary points asymptotically. Further, we analyze the inference time in SplitGP and provide bounds for determining model split ratios. Experimental results show that SplitGP outperforms existing baselines by wide margins in inference time and test accuracy for varying amounts of out-of-distribution samples.
△ Less
Submitted 11 February, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
A forbidden subgraph characterization of phylogeny graphs of degree bounded digraphs
Authors:
Myungho Choi,
Suh-Ryung Kim
Abstract:
An acyclic digraph in which every vertex has indegree at most $i$ and outdegree at most $j$ is called an $(i,j)$ digraph for some positive integers $i$ and $j$. The phylogeny graph of a digraph $D$ has $V(D)$ as the vertex set and an edge $uv$ if and only if one of the following is true: $(u,v) \in A(D)$; $(v,u) \in A(D)$; $(u,w) \in A(D)$ and $(v,w) \in A(D)$ for some $w \in V(D)$. A graph $G$ is…
▽ More
An acyclic digraph in which every vertex has indegree at most $i$ and outdegree at most $j$ is called an $(i,j)$ digraph for some positive integers $i$ and $j$. The phylogeny graph of a digraph $D$ has $V(D)$ as the vertex set and an edge $uv$ if and only if one of the following is true: $(u,v) \in A(D)$; $(v,u) \in A(D)$; $(u,w) \in A(D)$ and $(v,w) \in A(D)$ for some $w \in V(D)$. A graph $G$ is a phylogeny graph (resp. an $(i,j)$ phylogeny graph) if there is an acyclic digraph $D$ (resp. an $(i,j)$ digraph $D$) such that the phylogeny graph of $D$ is isomorphic to $G$. Lee~{\em et al.} (2017) and Eoh and Kim (2021) studied the $(2,2)$ phylogeny graphs, $(1,j)$ phylogeny graphs, $(i,1)$ phylogeny graphs, and $(2,j)$ phylogeny graphs. Their work was motivated by problems related to evidence propagation in a Bayesian network for which it is useful to know which acyclic digraphs have chordal moral graphs (phylogeny graphs are called moral graphs in Bayesian network theory). In this paper, we extend their work by characterizing chordal $(i,2)$ phylogeny graphs. We go further to completely characterize $(i,j)$ phylogeny graphs by listing the forbidden induced subgraphs.
△ Less
Submitted 10 December, 2022;
originally announced December 2022.
-
Models of Rotating Infall for the B335 Protostar
Authors:
Neal J. Evans II,
Yao-Lun Yang,
Joel D. Green,
Bo Zhao,
James Di Francesco,
Jeong-Eun Lee,
Jes K. Jørgensen,
Minho Choi,
Philip C. Myers,
Diego Mardones
Abstract:
Models of the protostellar source, B335, are developed using axisymmetric three-dimensional models to resolve conflicts found in one-dimensional models. The models are constrained by a large number of observations, including ALMA, Herschel, and Spitzer data. Observations of the protostellar source B335 with ALMA show red-shifted absorption against a central continuum source indicative of infall in…
▽ More
Models of the protostellar source, B335, are developed using axisymmetric three-dimensional models to resolve conflicts found in one-dimensional models. The models are constrained by a large number of observations, including ALMA, Herschel, and Spitzer data. Observations of the protostellar source B335 with ALMA show red-shifted absorption against a central continuum source indicative of infall in the HCO$^+$ and HCN $J = 4\rightarrow 3$ transitions. The data are combined with a new estimate of the distance to provide strong constraints to three-dimensional radiative transfer models including a rotating, infalling envelope, outflow cavities, and a very small disk. The models favor ages since the initiation of collapse between $3 \times 10^4$ and $4 \times 10^4$ yr for both the continuum and the lines, resolving a conflict found in one-dimensional models. The models under-predict the continuum emission seen by ALMA, suggesting an additional component such as a pseudo-disk. The best-fitting model is used to convert variations in the 4.5 $μm$ flux in recent years into a model for a variation of a factor of 5-7 in luminosity over the last 8 years.
△ Less
Submitted 7 December, 2022;
originally announced December 2022.
-
The JCMT BISTRO-2 Survey: Magnetic Fields of the Massive DR21 Filament
Authors:
Tao-Chung Ching,
Ke** Qiu,
Di Li,
Zhiyuan Ren,
Shih-** Lai,
David Berry,
Kate Pattle,
Ray Furuya,
Derek Ward-Thompson,
Doug Johnstone,
Patrick M. Koch,
Chang Won Lee,
Thiem Hoang,
Tetsuo Hasegawa,
Woo** Kwon,
Pierre Bastien,
Chakali Eswaraiah,
Jia-Wei Wang,
Kyoung Hee Kim,
Jihye Hwang,
Archana Soam,
A-Ran Lyo,
Junhao Liu,
Valentin J. M. Le Gouellec,
Doris Arzoumanian
, et al. (132 additional authors not shown)
Abstract:
We present 850 $μ$m dust polarization observations of the massive DR21 filament from the B-fields In STar-forming Region Observations (BISTRO) survey, using the POL-2 polarimeter and the SCUBA-2 camera on the James Clerk Maxwell Telescope. We detect ordered magnetic fields perpendicular to the parsec-scale ridge of the DR21 main filament. In the sub-filaments, the magnetic fields are mainly parall…
▽ More
We present 850 $μ$m dust polarization observations of the massive DR21 filament from the B-fields In STar-forming Region Observations (BISTRO) survey, using the POL-2 polarimeter and the SCUBA-2 camera on the James Clerk Maxwell Telescope. We detect ordered magnetic fields perpendicular to the parsec-scale ridge of the DR21 main filament. In the sub-filaments, the magnetic fields are mainly parallel to the filamentary structures and smoothly connect to the magnetic fields of the main filament. We compare the POL-2 and Planck dust polarization observations to study the magnetic field structures of the DR21 filament on 0.1--10 pc scales. The magnetic fields revealed in the Planck data are well aligned with those of the POL-2 data, indicating a smooth variation of magnetic fields from large to small scales. The plane-of-sky magnetic field strengths derived from angular dispersion functions of dust polarization are 0.6--1.0 mG in the DR21 filament and $\sim$ 0.1 mG in the surrounding ambient gas. The mass-to-flux ratios are found to be magnetically supercritical in the filament and slightly subcritical to nearly critical in the ambient gas. The alignment between column density structures and magnetic fields changes from random alignment in the low-density ambient gas probed by Planck to mostly perpendicular in the high-density main filament probed by JCMT. The magnetic field structures of the DR21 filament are in agreement with MHD simulations of a strongly magnetized medium, suggesting that magnetic fields play an important role in sha** the DR21 main filament and sub-filaments.
△ Less
Submitted 4 December, 2022;
originally announced December 2022.
-
Genuine multipartite entanglement measures based on multi-party teleportation capability
Authors:
Min** Choi,
Eunok Bae,
Soojoon Lee
Abstract:
Quantifying entanglement is vital to understand entanglement as a resource in quantum information processing, and many entanglement measures have been suggested for this purpose. When mathematically defining an entanglement measure, we should consider the distinguishability between entangled and separable states, the invariance under local transformation, the monotonicity under local operations an…
▽ More
Quantifying entanglement is vital to understand entanglement as a resource in quantum information processing, and many entanglement measures have been suggested for this purpose. When mathematically defining an entanglement measure, we should consider the distinguishability between entangled and separable states, the invariance under local transformation, the monotonicity under local operations and classical communication, and the convexity. These are reasonable requirements but may be insufficient, in particular when taking into account the usefulness of quantum states in multi-party quantum information processing. Therefore, if we want to investigate multipartite entanglement as a resource, then it can be necessary to consider the usefulness of quantum states in multi-party quantum information processing when we define a multipartite entanglement measure. In this paper, we define new multipartite entanglement measures for three-qubit systems based on the three-party teleportation capability, and show that these entanglement measures satisfy the requirements for being genuine multipartite entanglement measures. We also generalize our entanglement measures for $N$-qubit systems, where $N \ge 4$, and discuss that these quantities may be good candidates to measure genuine multipartite entanglement.
△ Less
Submitted 29 August, 2023; v1 submitted 29 November, 2022;
originally announced November 2022.
-
DECISIVE Test Methods Handbook: Test Methods for Evaluating sUAS in Subterranean and Constrained Indoor Environments, Version 1.1
Authors:
Adam Norton,
Reza Ahmadzadeh,
Kshitij Jerath,
Paul Robinette,
Jay Weitzen,
Thanuka Wickramarathne,
Holly Yanco,
Minseop Choi,
Ryan Donald,
Brendan Donoghue,
Christian Dumas,
Peter Gavriel,
Alden Giedraitis,
Brendan Hertel,
Jack Houle,
Nathan Letteri,
Edwin Meriaux,
Zahra Rezaei Khavas,
Rakshith Singh,
Gregg Willcox,
Naye Yoni
Abstract:
This handbook outlines all test methods developed under the Development and Execution of Comprehensive and Integrated Subterranean Intelligent Vehicle Evaluations (DECISIVE) project by the University of Massachusetts Lowell for evaluating small unmanned aerial systems (sUAS) performance in subterranean and constrained indoor environments, spanning communications, field readiness, interface, obstac…
▽ More
This handbook outlines all test methods developed under the Development and Execution of Comprehensive and Integrated Subterranean Intelligent Vehicle Evaluations (DECISIVE) project by the University of Massachusetts Lowell for evaluating small unmanned aerial systems (sUAS) performance in subterranean and constrained indoor environments, spanning communications, field readiness, interface, obstacle avoidance, navigation, map**, autonomy, trust, and situation awareness. For sUAS deployment in subterranean and constrained indoor environments, this puts forth two assumptions about applicable sUAS to be evaluated using these test methods: (1) able to operate without access to GPS signal, and (2) width from prop top to prop tip does not exceed 91 cm (36 in) wide (i.e., can physically fit through a typical doorway, although successful navigation through is not guaranteed). All test methods are specified using a common format: Purpose, Summary of Test Method, Apparatus and Artifacts, Equipment, Metrics, Procedure, and Example Data. All test methods are designed to be run in real-world environments (e.g., MOUT sites) or using fabricated apparatuses (e.g., test bays built from wood, or contained inside of one or more ship** containers).
△ Less
Submitted 20 January, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Diffusion-based Generative Speech Source Separation
Authors:
Robin Scheibler,
Youna Ji,
Soo-Whan Chung,
Jaeuk Byun,
Soyeon Choe,
Min-Seok Choi
Abstract:
We propose DiffSep, a new single channel source separation method based on score-matching of a stochastic differential equation (SDE). We craft a tailored continuous time diffusion-mixing process starting from the separated sources and converging to a Gaussian distribution centered on their mixture. This formulation lets us apply the machinery of score-based generative modelling. First, we train a…
▽ More
We propose DiffSep, a new single channel source separation method based on score-matching of a stochastic differential equation (SDE). We craft a tailored continuous time diffusion-mixing process starting from the separated sources and converging to a Gaussian distribution centered on their mixture. This formulation lets us apply the machinery of score-based generative modelling. First, we train a neural network to approximate the score function of the marginal probabilities or the diffusion-mixing process. Then, we use it to solve the reverse time SDE that progressively separates the sources starting from their mixture. We propose a modified training strategy to handle model mismatch and source permutation ambiguity. Experiments on the WSJ0 2mix dataset demonstrate the potential of the method. Furthermore, the method is also suitable for speech enhancement and shows performance competitive with prior work on the VoiceBank-DEMAND dataset.
△ Less
Submitted 2 November, 2022; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Bayesian deep learning framework for uncertainty quantification in high dimensions
Authors:
Jeahan Jung,
Minseok Choi
Abstract:
We develop a novel deep learning method for uncertainty quantification in stochastic partial differential equations based on Bayesian neural network (BNN) and Hamiltonian Monte Carlo (HMC). A BNN efficiently learns the posterior distribution of the parameters in deep neural networks by performing Bayesian inference on the network parameters. The posterior distribution is efficiently sampled using…
▽ More
We develop a novel deep learning method for uncertainty quantification in stochastic partial differential equations based on Bayesian neural network (BNN) and Hamiltonian Monte Carlo (HMC). A BNN efficiently learns the posterior distribution of the parameters in deep neural networks by performing Bayesian inference on the network parameters. The posterior distribution is efficiently sampled using HMC to quantify uncertainties in the system. Several numerical examples are shown for both forward and inverse problems in high dimension to demonstrate the effectiveness of the proposed method for uncertainty quantification. These also show promising results that the computational cost is almost independent of the dimension of the problem demonstrating the potential of the method for tackling the so-called curse of dimensionality.
△ Less
Submitted 21 October, 2022;
originally announced October 2022.
-
The JCMT BISTRO Survey: A Spiral Magnetic Field in a Hub-filament Structure, Monoceros R2
Authors:
Jihye Hwang,
Jongsoo Kim,
Kate Pattle,
Chang Won Lee,
Patrick M. Koch,
Doug Johnstone,
Kohji Tomisaka,
Anthony Whitworth,
Ray S. Furuya,
Ji-hyun Kang,
A-Ran Lyo,
Eun Jung Chung,
Doris Arzoumanian,
Geumsook Park,
Woo** Kwon,
Shinyoung Kim,
Motohide Tamura,
Jungmi Kwon,
Archana Soam,
Ilseung Han,
Thiem Hoang,
Kyoung Hee Kim,
Takashi Onaka,
Eswaraiah Chakali,
Derek Ward-Thompson
, et al. (135 additional authors not shown)
Abstract:
We present and analyze observations of polarized dust emission at 850 $μ$m towards the central 1 pc $\times$ 1 pc hub-filament structure of Monoceros R2 (Mon R2). The data are obtained with SCUBA-2/POL-2 on the James Clerk Maxwell Telescope (JCMT) as part of the BISTRO (B-fields in Star-forming Region Observations) survey. The orientations of the magnetic field follow the spiral structure of Mon R…
▽ More
We present and analyze observations of polarized dust emission at 850 $μ$m towards the central 1 pc $\times$ 1 pc hub-filament structure of Monoceros R2 (Mon R2). The data are obtained with SCUBA-2/POL-2 on the James Clerk Maxwell Telescope (JCMT) as part of the BISTRO (B-fields in Star-forming Region Observations) survey. The orientations of the magnetic field follow the spiral structure of Mon R2, which are well-described by an axisymmetric magnetic field model. We estimate the turbulent component of the magnetic field using the angle difference between our observations and the best-fit model of the underlying large-scale mean magnetic field. This estimate is used to calculate the magnetic field strength using the Davis-Chandrasekhar-Fermi method, for which we also obtain the distribution of volume density and velocity dispersion using a column density map derived from $Herschel$ data and the C$^{18}$O ($J$ = 3-2) data taken with HARP on the JCMT, respectively. We make maps of magnetic field strengths and mass-to-flux ratios, finding that magnetic field strengths vary from 0.02 to 3.64 mG with a mean value of 1.0 $\pm$ 0.06 mG, and the mean critical mass-to-flux ratio is 0.47 $\pm$ 0.02. Additionally, the mean Alfvén Mach number is 0.35 $\pm$ 0.01. This suggests that in Mon R2, magnetic fields provide resistance against large-scale gravitational collapse, and magnetic pressure exceeds turbulent pressure. We also investigate the properties of each filament in Mon R2. Most of the filaments are aligned along the magnetic field direction and are magnetically sub-critical.
△ Less
Submitted 13 December, 2022; v1 submitted 12 October, 2022;
originally announced October 2022.
-
Equal Experience in Recommender Systems
Authors:
Jaewoong Cho,
Moonseok Choi,
Changho Suh
Abstract:
We explore the fairness issue that arises in recommender systems. Biased data due to inherent stereotypes of particular groups (e.g., male students' average rating on mathematics is often higher than that on humanities, and vice versa for females) may yield a limited scope of suggested items to a certain group of users. Our main contribution lies in the introduction of a novel fairness notion (tha…
▽ More
We explore the fairness issue that arises in recommender systems. Biased data due to inherent stereotypes of particular groups (e.g., male students' average rating on mathematics is often higher than that on humanities, and vice versa for females) may yield a limited scope of suggested items to a certain group of users. Our main contribution lies in the introduction of a novel fairness notion (that we call equal experience), which can serve to regulate such unfairness in the presence of biased data. The notion captures the degree of the equal experience of item recommendations across distinct groups. We propose an optimization framework that incorporates the fairness notion as a regularization term, as well as introduce computationally-efficient algorithms that solve the optimization. Experiments on synthetic and benchmark real datasets demonstrate that the proposed framework can indeed mitigate such unfairness while exhibiting a minor degradation of recommendation accuracy.
△ Less
Submitted 12 October, 2022;
originally announced October 2022.