Search | arXiv e-print repository

CFEVER: A Chinese Fact Extraction and VERification Dataset

Authors: Ying-Jia Lin, Chun-Yi Lin, Chia-Jen Yeh, Yi-Ting Li, Yun-Yu Hu, Chih-Hao Hsu, Mei-Feng Lee, Hung-Yu Kao

Abstract: We present CFEVER, a Chinese dataset designed for Fact Extraction and VERification. CFEVER comprises 30,012 manually created claims based on content in Chinese Wikipedia. Each claim in CFEVER is labeled as "Supports", "Refutes", or "Not Enough Info" to depict its degree of factualness. Similar to the FEVER dataset, claims in the "Supports" and "Refutes" categories are also annotated with correspon… ▽ More We present CFEVER, a Chinese dataset designed for Fact Extraction and VERification. CFEVER comprises 30,012 manually created claims based on content in Chinese Wikipedia. Each claim in CFEVER is labeled as "Supports", "Refutes", or "Not Enough Info" to depict its degree of factualness. Similar to the FEVER dataset, claims in the "Supports" and "Refutes" categories are also annotated with corresponding evidence sentences sourced from single or multiple pages in Chinese Wikipedia. Our labeled dataset holds a Fleiss' kappa value of 0.7934 for five-way inter-annotator agreement. In addition, through the experiments with the state-of-the-art approaches developed on the FEVER dataset and a simple baseline for CFEVER, we demonstrate that our dataset is a new rigorous benchmark for factual extraction and verification, which can be further used for develo** automated systems to alleviate human fact-checking efforts. CFEVER is available at https://ikmlab.github.io/CFEVER. △ Less

Submitted 20 February, 2024; originally announced February 2024.

Comments: AAAI-24

arXiv:2402.12892 [pdf, other]

Extensive search for axion dark matter over 1\,GHz with CAPP's Main Axion eXperiment

Authors: Saebyeok Ahn, **Myeong Kim, Boris I. Ivanov, Ohjoon Kwon, HeeSu Byun, Arjan F. van Loo, SeongTae Par, Junu Jeong, Soohyung Lee, **su Kim, Çağlar Kutlu, Andrew K. Yi, Yasunobu Nakamura, Seonjeong Oh, Danho Ahn, SungJae Bae, Hyoungsoon Choi, Jihoon Choi, Yonuk Chong, Woohyun Chung, Violeta Gkika, Jihn E. Kim, Younggeun Kim, Byeong Rok Ko, Lino Miceli , et al. (11 additional authors not shown)

Abstract: We report an extensive high-sensitivity search for axion dark matter above 1\,GHz at the Center for Axion and Precision Physics Research (CAPP). The cavity resonant search, exploiting the coupling between axions and photons, explored the frequency (mass) range of 1.025\,GHz (4.24\,$μ$eV) to 1.185\,GHz (4.91\,$μ$eV). We have introduced a number of innovations in this field, demonstrating the practi… ▽ More We report an extensive high-sensitivity search for axion dark matter above 1\,GHz at the Center for Axion and Precision Physics Research (CAPP). The cavity resonant search, exploiting the coupling between axions and photons, explored the frequency (mass) range of 1.025\,GHz (4.24\,$μ$eV) to 1.185\,GHz (4.91\,$μ$eV). We have introduced a number of innovations in this field, demonstrating the practical approach of optimizing all the relevant parameters of axion haloscopes, extending presently available technology. The CAPP 12\,T magnet with an aperture of 320\,mm made of Nb$_3$Sn and NbTi superconductors surrounding a 37-liter ultralight-weight copper cavity is expected to convert DFSZ axions into approximately $10^2$ microwave photons per second. A powerful dilution refrigerator, capable of kee** the core system below 40\,mK, combined with quantum-noise limited readout electronics, achieved a total system noise of about 200\,mK or below, which corresponds to a background of roughly $4\times 10^3$ photons per second within the axion bandwidth. The combination of all those improvements provides unprecedented search performance, imposing the most stringent exclusion limits on axion--photon coupling in this frequency range to date. These results also suggest an experimental capability suitable for highly-sensitive searches for axion dark matter above 1\,GHz. △ Less

Submitted 20 February, 2024; originally announced February 2024.

Comments: A detailed axion dark matter article with 27 pages, 22 figures

arXiv:2402.12202 [pdf, other]

doi 10.1109/ICDMW60847.2023.00191

Heterogeneity-aware Cross-school Electives Recommendation: a Hybrid Federated Approach

Authors: Chengyi Ju, Jiannong Cao, Yu Yang, Zhen-Qun Yang, Ho Man Lee

Abstract: In the era of modern education, addressing cross-school learner diversity is crucial, especially in personalized recommender systems for elective course selection. However, privacy concerns often limit cross-school data sharing, which hinders existing methods' ability to model sparse data and address heterogeneity effectively, ultimately leading to suboptimal recommendations. In response, we propo… ▽ More In the era of modern education, addressing cross-school learner diversity is crucial, especially in personalized recommender systems for elective course selection. However, privacy concerns often limit cross-school data sharing, which hinders existing methods' ability to model sparse data and address heterogeneity effectively, ultimately leading to suboptimal recommendations. In response, we propose HFRec, a heterogeneity-aware hybrid federated recommender system designed for cross-school elective course recommendations. The proposed model constructs heterogeneous graphs for each school, incorporating various interactions and historical behaviors between students to integrate context and content information. We design an attention mechanism to capture heterogeneity-aware representations. Moreover, under a federated scheme, we train individual school-based models with adaptive learning settings to recommend tailored electives. Our HFRec model demonstrates its effectiveness in providing personalized elective recommendations while maintaining privacy, as it outperforms state-of-the-art models on both open-source and real-world datasets. △ Less

Submitted 19 February, 2024; originally announced February 2024.

Journal ref: 2023 IEEE International Conference on Data Mining Workshops (ICDMW)

arXiv:2402.12071 [pdf, other]

EmoBench: Evaluating the Emotional Intelligence of Large Language Models

Authors: Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, **feng Zhou, Alvionna S. Sunaryo, Juanzi Li, Tatia M. C. Lee, Rada Mihalcea, Minlie Huang

Abstract: Recent advances in Large Language Models (LLMs) have highlighted the need for robust, comprehensive, and challenging benchmarks. Yet, research on evaluating their Emotional Intelligence (EI) is considerably limited. Existing benchmarks have two major shortcomings: first, they mainly focus on emotion recognition, neglecting essential EI capabilities such as emotion regulation and thought facilitati… ▽ More Recent advances in Large Language Models (LLMs) have highlighted the need for robust, comprehensive, and challenging benchmarks. Yet, research on evaluating their Emotional Intelligence (EI) is considerably limited. Existing benchmarks have two major shortcomings: first, they mainly focus on emotion recognition, neglecting essential EI capabilities such as emotion regulation and thought facilitation through emotion understanding; second, they are primarily constructed from existing datasets, which include frequent patterns, explicit information, and annotation errors, leading to unreliable evaluation. We propose EmoBench, a benchmark that draws upon established psychological theories and proposes a comprehensive definition for machine EI, including Emotional Understanding and Emotional Application. EmoBench includes a set of 400 hand-crafted questions in English and Chinese, which are meticulously designed to require thorough reasoning and understanding. Our findings reveal a considerable gap between the EI of existing LLMs and the average human, highlighting a promising direction for future research. Our code and data are publicly available at https://github.com/Sahandfer/EmoBench. △ Less

Submitted 7 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: ACL 2024 Main Conference

arXiv:2402.11761 [pdf, ps, other]

The number of automorphic representations of $\mathrm{GL}_2$ with exceptional eigenvalues

Authors: Dohoon Choi, Min Lee, Youngmin Lee, Subong Lim

Abstract: We obtain an upper bound for the dimension of the cuspidal automorphic forms for $\mathrm{GL}_2$ over a number field, whose archimedean local representations are not tempered. More precisely, we prove the following result. Let $F$ be a number field and $\mathbb{A}_{F}$ be the ring of adeles of $F$. Let $\mathcal{O}_{F}$ be the ring of integers of $F$. Let $\mathfrak{X}_{F,\mathrm{ex}}$ be the se… ▽ More We obtain an upper bound for the dimension of the cuspidal automorphic forms for $\mathrm{GL}_2$ over a number field, whose archimedean local representations are not tempered. More precisely, we prove the following result. Let $F$ be a number field and $\mathbb{A}_{F}$ be the ring of adeles of $F$. Let $\mathcal{O}_{F}$ be the ring of integers of $F$. Let $\mathfrak{X}_{F,\mathrm{ex}}$ be the set of irreducible cuspidal automorphic representations $π$ of $\mathrm{GL}_2(\mathbb{A}_{F})$ with the trivial central character such that for each archimedean place $v$ of $F$, the local representation of $π$ at $v$ is an unramified principal series and is not tempered. For an ideal $J$ of $\mathcal{O}_{F}$, let $\mathrm{K}_{0}(J)$ be the subgroup of $\mathrm{GL}_2(\mathbb{A}_{F})$ corresponding to $Γ_0(J) \subset \mathrm{SL}_2(\mathcal{O}_F)$. Let $r_1$ be the number of real embeddings of $F$ and $r_2$ be the number of conjugate pairs of complex embeddings of $F$. Using the Arthur-Selberg trace formula, we have \begin{equation*} \sum_{π\in \mathfrak{X}_{F,\mathrm{ex}}} \dim π^{\mathrm{K}_0(J)} \ll_{F} \frac{[\mathrm{SL}_2(\mathcal{O}_{F}) : Γ_0(J)]}{(\log (N_{F/\mathbb{Q}}(J)))^{2r_1+3r_2}} \quad \text{ as } \quad |N_{F/\mathbb{Q}}(J)|\to \infty. \end{equation*} From this result, we obtain the result on an upper bound for the number of Hecke-Maass cusp forms of weight $0$ on $Γ_0(N)$ which do not satisfy the Selberg eigenvalue conjecture. △ Less

Submitted 18 February, 2024; originally announced February 2024.

MSC Class: 11F72 (Primary); 11F12 (Secondary)

arXiv:2402.09463 [pdf]

Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results

Authors: Kelly Payette, Céline Steger, Roxane Licandro, Priscille de Dumast, Hongwei Bran Li, Matthew Barkovich, Liu Li, Maik Dannecker, Chen Chen, Cheng Ouyang, Niccolò McConnell, Alina Miron, Yongmin Li, Alena Uus, Irina Grigorescu, Paula Ramirez Gilliland, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Haoyu Wang, Ziyan Huang, ** Ye, Mireia Alenyà, Valentin Comte, Oscar Camara , et al. (42 additional authors not shown)

Abstract: Segmentation is a critical step in analyzing the develo** human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across dif… ▽ More Segmentation is a critical step in analyzing the develo** human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across different imaging centers remains unsolved, limiting real-world clinical applicability. The multi-center FeTA Challenge 2022 focuses on advancing the generalizability of fetal brain segmentation algorithms for magnetic resonance imaging (MRI). In FeTA 2022, the training dataset contained images and corresponding manually annotated multi-class labels from two imaging centers, and the testing data contained images from these two imaging centers as well as two additional unseen centers. The data from different centers varied in many aspects, including scanners used, imaging parameters, and fetal brain super-resolution algorithms applied. 16 teams participated in the challenge, and 17 algorithms were evaluated. Here, a detailed overview and analysis of the challenge results are provided, focusing on the generalizability of the submissions. Both in- and out of domain, the white matter and ventricles were segmented with the highest accuracy, while the most challenging structure remains the cerebral cortex due to anatomical complexity. The FeTA Challenge 2022 was able to successfully evaluate and advance generalizability of multi-class fetal brain tissue segmentation algorithms for MRI and it continues to benchmark new algorithms. The resulting new methods contribute to improving the analysis of brain development in utero. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: Results from FeTA Challenge 2022, held at MICCAI; Manuscript submitted. Supplementary Info (including submission methods descriptions) available here: https://zenodo.org/records/10628648

arXiv:2402.08971 [pdf, other]

Structured Language Generation Model for Robust Structure Prediction

Authors: Minho Lee, Junghyun Min, Woochul Lee, Yeonsoo Lee

Abstract: Previous work in structured prediction (e.g. NER, information extraction) using single model make use of explicit dataset information, which helps boost in-distribution performance but is orthogonal to robust generalization in real-world situations. To overcome this limitation, we propose the Structured Language Generation Model (SLGM), a framework that reduces sequence-to-sequence problems to cla… ▽ More Previous work in structured prediction (e.g. NER, information extraction) using single model make use of explicit dataset information, which helps boost in-distribution performance but is orthogonal to robust generalization in real-world situations. To overcome this limitation, we propose the Structured Language Generation Model (SLGM), a framework that reduces sequence-to-sequence problems to classification problems via methodologies in loss calibration and decoding method. Our experimental results show that SLGM is able to maintain performance without explicit dataset information, follow and potentially replace dataset-specific fine-tuning. △ Less

Submitted 18 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

Comments: 8 pages, 4 figures, 5 tables, 7 pages of appendix with 9 additional tables

arXiv:2402.08406 [pdf, other]

Transition Constrained Bayesian Optimization via Markov Decision Processes

Authors: Jose Pablo Folch, Calvin Tsay, Robert M Lee, Behrang Shafei, Weronika Ormaniec, Andreas Krause, Mark van der Wilk, Ruth Misener, Mojmír Mutný

Abstract: Bayesian optimization is a methodology to optimize black-box functions. Traditionally, it focuses on the setting where you can arbitrarily query the search space. However, many real-life problems do not offer this flexibility; in particular, the search space of the next query may depend on previous ones. Example challenges arise in the physical sciences in the form of local movement constraints, r… ▽ More Bayesian optimization is a methodology to optimize black-box functions. Traditionally, it focuses on the setting where you can arbitrarily query the search space. However, many real-life problems do not offer this flexibility; in particular, the search space of the next query may depend on previous ones. Example challenges arise in the physical sciences in the form of local movement constraints, required monotonicity in certain variables, and transitions influencing the accuracy of measurements. Altogether, such transition constraints necessitate a form of planning. This work extends classical Bayesian optimization via the framework of Markov Decision Processes. We iteratively solve a tractable linearization of our utility function using reinforcement learning to obtain a policy that plans ahead for the entire horizon. This is a parallel to the optimization of an acquisition function in policy space. The resulting policy is potentially history-dependent and non-Markovian. We showcase applications in chemical reactor optimization, informative path planning, machine calibration, and other synthetic examples. △ Less

Submitted 29 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: 10 pages main, 32 pages total, 16 figures, 2 tables, preprint

arXiv:2402.08382 [pdf, other]

Punctuation Restoration Improves Structure Understanding without Supervision

Authors: Junghyun Min, Minho Lee, Woochul Lee, Yeonsoo Lee

Abstract: Unsupervised learning objectives like language modeling and de-noising constitute a significant part in producing pre-trained models that perform various downstream applications from natural language understanding to conversational tasks. However, despite impressive generative capabilities of recent large language models, their abilities to capture syntactic or semantic structure within text lag b… ▽ More Unsupervised learning objectives like language modeling and de-noising constitute a significant part in producing pre-trained models that perform various downstream applications from natural language understanding to conversational tasks. However, despite impressive generative capabilities of recent large language models, their abilities to capture syntactic or semantic structure within text lag behind. We hypothesize that the mismatch between linguistic performance and competence in machines is attributable to insufficient transfer of linguistic structure knowledge to computational systems with currently popular pre-training objectives. We show that punctuation restoration as a learning objective improves in- and out-of-distribution performance on structure-related tasks like named entity recognition, open information extraction, chunking, and part-of-speech tagging. Punctuation restoration is an effective learning objective that can improve structure understanding and yield a more robust structure-aware representations of natural language. △ Less

Submitted 21 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Comments: 10 pages, 1 figure, 6 tables

arXiv:2402.07999 [pdf, other]

NetInfoF Framework: Measuring and Exploiting Network Usable Information

Authors: Meng-Chieh Lee, Haiyang Yu, Jian Zhang, Vassilis N. Ioannidis, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos

Abstract: Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well? More specifically, do the graph structure and the node features carry enough usable information for the task? Our goals are (1) to develop a fast tool to measure how much information is in the graph structure and in the node features, and (2) to e… ▽ More Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well? More specifically, do the graph structure and the node features carry enough usable information for the task? Our goals are (1) to develop a fast tool to measure how much information is in the graph structure and in the node features, and (2) to exploit the information to solve the task, if there is enough. We propose NetInfoF, a framework including NetInfoF_Probe and NetInfoF_Act, for the measurement and the exploitation of network usable information (NUI), respectively. Given a graph data, NetInfoF_Probe measures NUI without any model training, and NetInfoF_Act solves link prediction and node classification, while two modules share the same backbone. In summary, NetInfoF has following notable advantages: (a) General, handling both link prediction and node classification; (b) Principled, with theoretical guarantee and closed-form solution; (c) Effective, thanks to the proposed adjustment to node similarity; (d) Scalable, scaling linearly with the input size. In our carefully designed synthetic datasets, NetInfoF correctly identifies the ground truth of NUI and is the only method being robust to all graph scenarios. Applied on real-world datasets, NetInfoF wins in 11 out of 12 times on link prediction compared to general GNN baselines. △ Less

Submitted 20 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: Accepted to ICLR 2024 (Spotlight)

arXiv:2402.06900 [pdf, other]

Can LLMs Recognize Toxicity? Definition-Based Toxicity Metric

Authors: Hyukhun Koh, Dohyung Kim, Minwoo Lee, Kyomin Jung

Abstract: In the pursuit of develo** Large Language Models (LLMs) that adhere to societal standards, it is imperative to detect the toxicity in the generated text. The majority of existing toxicity metrics rely on encoder models trained on specific toxicity datasets, which are susceptible to out-of-distribution (OOD) problems and depend on the dataset's definition of toxicity. In this paper, we introduce… ▽ More In the pursuit of develo** Large Language Models (LLMs) that adhere to societal standards, it is imperative to detect the toxicity in the generated text. The majority of existing toxicity metrics rely on encoder models trained on specific toxicity datasets, which are susceptible to out-of-distribution (OOD) problems and depend on the dataset's definition of toxicity. In this paper, we introduce a robust metric grounded on LLMs to flexibly measure toxicity according to the given definition. We first analyze the toxicity factors, followed by an examination of the intrinsic toxic attributes of LLMs to ascertain their suitability as evaluators. Finally, we evaluate the performance of our metric with detailed analysis. Our empirical results demonstrate outstanding performance in measuring toxicity within verified factors, improving on conventional metrics by 12 points in the F1 score. Our findings also indicate that upstream toxicity significantly influences downstream metrics, suggesting that LLMs are unsuitable for toxicity evaluations within unverified factors. △ Less

Submitted 18 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

Comments: 8 page long

arXiv:2402.06788 [pdf, other]

Magnetic field-temperature phase diagram of spin-1/2 triangular lattice antiferromagnet KYbSe$_2$

Authors: Sangyun Lee, Andrew J. Woods, Minseong Lee, Shengzhi Zhang, Eun Sang Choi, A. O. Scheie, D. A. Tennant, J. Xing, A. S. Sefat, R. Movshovich

Abstract: A quantum spin liquid (QSL) is a state of matter characterized by fractionalized quasiparticle excitations, quantum entanglement, and a lack of long-range magnetic order. However, QSLs have evaded definitive experimental observation. Several Yb$^{3+}$-based triangular lattice antiferromagnets with effective $S$ = $\frac{1}{2}$ have been suggested to stabilize the QSL state as the ground state. Her… ▽ More A quantum spin liquid (QSL) is a state of matter characterized by fractionalized quasiparticle excitations, quantum entanglement, and a lack of long-range magnetic order. However, QSLs have evaded definitive experimental observation. Several Yb$^{3+}$-based triangular lattice antiferromagnets with effective $S$ = $\frac{1}{2}$ have been suggested to stabilize the QSL state as the ground state. Here, we build a comprehensive magnetic temperature phase diagram of a high-quality single crystalline KYbSe$_2$ via heat capacity and magnetocaloric effect down to 30 mK with magnetic field applied along the $a$-axis. At zero magnetic field, we observe the magnetic long-range order at $T_N$ = 0.29 K entering 120 degrees ordered state in heat capacity, consistent with neutron scattering studies. Analysis of the low-temperature ($T$) specific heat ($C$) at zero magnetic field indicates linear $T$-dependence of $C/T$ and a broad hump of $C/T$ in the proximate QSL region above $T_N$. By applying magnetic field, we observe the up-up-down phase with 1/3 magnetization plateau and oblique phases, in addition to two new phases. These observations strongly indicate that while KYbSe$_2$ closely exhibits characteristics resembling an ideal triangular lattice, deviations may exist, such as the effect of the next-nearest-neighbor exchange interaction, calling for careful consideration for spin Hamiltonian modeling. Further investigations into tuning parameters, such as chemical pressure, could potentially induce an intriguing QSL phase in the material. △ Less

Submitted 9 February, 2024; originally announced February 2024.

arXiv:2402.06087 [pdf, other]

Descriptive Kernel Convolution Network with Improved Random Walk Kernel

Authors: Meng-Chieh Lee, Lingxiao Zhao, Leman Akoglu

Abstract: Graph kernels used to be the dominant approach to feature engineering for structured data, which are superseded by modern GNNs as the former lacks learnability. Recently, a suite of Kernel Convolution Networks (KCNs) successfully revitalized graph kernels by introducing learnability, which convolves input with learnable hidden graphs using a certain graph kernel. The random walk kernel (RWK) has b… ▽ More Graph kernels used to be the dominant approach to feature engineering for structured data, which are superseded by modern GNNs as the former lacks learnability. Recently, a suite of Kernel Convolution Networks (KCNs) successfully revitalized graph kernels by introducing learnability, which convolves input with learnable hidden graphs using a certain graph kernel. The random walk kernel (RWK) has been used as the default kernel in many KCNs, gaining increasing attention. In this paper, we first revisit the RWK and its current usage in KCNs, revealing several shortcomings of the existing designs, and propose an improved graph kernel RWK+, by introducing color-matching random walks and deriving its efficient computation. We then propose RWK+CN, a KCN that uses RWK+ as the core kernel to learn descriptive graph features with an unsupervised objective, which can not be achieved by GNNs. Further, by unrolling RWK+, we discover its connection with a regular GCN layer, and propose a novel GNN layer RWK+Conv. In the first part of experiments, we demonstrate the descriptive learning ability of RWK+CN with the improved random walk kernel RWK+ on unsupervised pattern mining tasks; in the second part, we show the effectiveness of RWK+ for a variety of KCN architectures and supervised graph learning tasks, and demonstrate the expressiveness of RWK+Conv layer, especially on the graph-level tasks. RWK+ and RWK+Conv adapt to various real-world applications, including web applications such as bot detection in a web-scale Twitter social network, and community classification in Reddit social interaction networks. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: WWW 2024

arXiv:2402.05183 [pdf, other]

Resonant Chains and the Convergent Migration of Planets in Protoplanetary Disks

Authors: Ka Ho Wong, Man Hoi Lee

Abstract: An increasing number of compact planetary systems with multiple planets in a resonant chain have been detected. The resonant chain must be maintained by convergent migration of the planets due to planet-disk interactions if it is formed before the dispersal of the protoplanetary gas disk. For type I migration in an adiabatic disk, we show that an analytic criterion for convergent migration can be… ▽ More An increasing number of compact planetary systems with multiple planets in a resonant chain have been detected. The resonant chain must be maintained by convergent migration of the planets due to planet-disk interactions if it is formed before the dispersal of the protoplanetary gas disk. For type I migration in an adiabatic disk, we show that an analytic criterion for convergent migration can be developed by requiring that any part of the resonant chain should be convergently migrating toward the remaining part. The criterion depends primarily on the logarithmic gradients $α$ and $β$ of the surface density and temperature profiles of the disk, respectively, and it is independent of the absolute values of the surface density and temperature. The analytic criterion is applied to the Kepler-60, Kepler-80, Kepler-223, TOI-178, and TRAPPIST-1 systems. Due to the variation of planetary masses within the resonant chains, we find that convergent migration typically requires rather extreme values of $(α, β)$ that have little or no overlap with common disk models. Finally, we show that there is an empirical relationship between the distance of the innermost planet from the central star and the stellar mass for the observed resonant chain systems, which supports the idea that the resonant chains are formed and maintained by stalling the migration of the innermost planet near the inner edge of the disk truncated by the magnetic fields of the protostar. △ Less

Submitted 7 February, 2024; originally announced February 2024.

Comments: 21 pages, including 10 figures; accepted for publication in AJ

arXiv:2402.04850 [pdf, other]

Muon $g-2$ and Proton Lifetime in SUSY SU(5) GUTs with Split Superpartners

Authors: Seong-Sik Kim, Hyun Min Lee, Sung-Bo Sim

Abstract: We consider the interplay of the muon $g-2$ anomaly and the proton decay in the SUSY SU(5) GUTs with generation-independent scalar soft masses. In these scenarios, we introduce a number of $\bf 5+{\bar 5}$ messenger fields with doublet-triplet splitting in general gauge mediation to transmit SUSY breaking to the visible sector by gauge loops. As a result, squarks and sleptons receive generation-in… ▽ More We consider the interplay of the muon $g-2$ anomaly and the proton decay in the SUSY SU(5) GUTs with generation-independent scalar soft masses. In these scenarios, we introduce a number of $\bf 5+{\bar 5}$ messenger fields with doublet-triplet splitting in general gauge mediation to transmit SUSY breaking to the visible sector by gauge loops. As a result, squarks and sleptons receive generation-independent soft SUSY breaking masses, which are split already at the messenger scale. Taking into account the perturbative unification of gauge couplings as well as the bounds from electroweak precision and vacuum stability bounds, we showed the parameter space in general gauge mediation to explain the muon $g-2$ anomaly with smuon and sneutrino loops while evading the strong bounds on squarks and gluinos from the Large Hadron Collider. We also obtained the dominant Higgsino contributions to the proton decay mode, $p\to K^+{\barν}$, with general generation-independent sparticle masses for squarks and sleptons. Even for split scalar soft masses in our model, however, we found that the bounds from the proton decay are satisfied only if the effective Yukawa couplings of the colored Higgsinos are suppressed further by a factor of order $10^{-4}-10^{-3}$. We illustrated how such a suppression factor is realized in orbifold GUTs in the extra dimension where the colored Higgsinos in the bulk are not coupled to the matter fields localized at the orbifold fixed points at the leading order. △ Less

Submitted 29 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

Comments: 35 pages, 8 figures, v2: typos fixed and reference updated, v3: version to appear in Phys. Rev. D

arXiv:2402.03751 [pdf, ps, other]

The first robust evidence showing a dark matter density spike around the supermassive black hole in OJ 287

Authors: Man Ho Chan, Chak Man Lee

Abstract: Black hole dynamics suggests that dark matter would re-distribute near a supermassive black hole to form a density spike. However, no direct evidence of dark matter density spike around a supermassive black hole has been identified. In this letter, we present the first robust evidence showing a dark matter density spike around a supermassive black hole. We revisit the data of the well-known superm… ▽ More Black hole dynamics suggests that dark matter would re-distribute near a supermassive black hole to form a density spike. However, no direct evidence of dark matter density spike around a supermassive black hole has been identified. In this letter, we present the first robust evidence showing a dark matter density spike around a supermassive black hole. We revisit the data of the well-known supermassive black hole binary OJ 287 and show that the inclusion of the dynamical friction due to a dark matter density spike around the supermassive black hole can satisfactorily account for the observed orbital decay rate. The derived spike index $γ_{\rm sp}=2.351^{+0.032}_{-0.045}$ gives an excellent agreement with the value $γ_{\rm sp}=2.333$ predicted by the benchmark model assuming an adiabatically growing supermassive black hole. This provides a strong verification of the canonical theory suggested two decades ago modeling the gravitational interaction between collisionless dark matter and supermassive black holes. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: Accepted for publication in ApJL

Journal ref: ApJL 962, L40 (2024)

arXiv:2402.03713 [pdf, other]

Measurement of $CP$ asymmetries in $B^0\toη'K^0_s$ decays at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker, J. V. Bennett , et al. (377 additional authors not shown)

Abstract: We describe a measurement of charge-parity ($CP$) violation asymmetries in $B^0\toη'K^0_S$ decays using Belle II data. We consider $η'\toη(\toγγ)π^+π^-$ and $η'\toρ(\toπ^+π^-)γ$ decays. The data were collected at the SuperKEKB asymmetric-energy $e^+e^-$ collider between the years 2019 and 2022, and contain $(387\pm 6) \times 10^6$ bottom-antibottom meson pairs. We reconstruct $829\pm35$ signal dec… ▽ More We describe a measurement of charge-parity ($CP$) violation asymmetries in $B^0\toη'K^0_S$ decays using Belle II data. We consider $η'\toη(\toγγ)π^+π^-$ and $η'\toρ(\toπ^+π^-)γ$ decays. The data were collected at the SuperKEKB asymmetric-energy $e^+e^-$ collider between the years 2019 and 2022, and contain $(387\pm 6) \times 10^6$ bottom-antibottom meson pairs. We reconstruct $829\pm35$ signal decays and extract the $CP$ violating parameters from a fit to the distribution of the proper-decay-time difference between the two $B$ mesons. The measured direct and mixing-induced $CP$ asymmetries are $\text{C}_{η'K^0_S} = -0.19 \pm 0.08 \pm 0.03 $ and $\text{S}_{η'K^0_S} = +0.67 \pm 0.10 \pm 0.04 $, respectively, where the first uncertainties are statistical and the second are systematic. These results are in agreement with current world averages and standard model predictions. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Report number: Belle II Preprint 2024-003, KEK Preprint 2023-50

arXiv:2402.03399 [pdf, other]

Rethinking RGB Color Representation for Image Restoration Models

Authors: Jaerin Lee, JoonKyu Park, Sungyong Baik, Kyoung Mu Lee

Abstract: Image restoration models are typically trained with a pixel-wise distance loss defined over the RGB color representation space, which is well known to be a source of blurry and unrealistic textures in the restored images. The reason, we believe, is that the three-channel RGB space is insufficient for supervising the restoration models. To this end, we augment the representation to hold structural… ▽ More Image restoration models are typically trained with a pixel-wise distance loss defined over the RGB color representation space, which is well known to be a source of blurry and unrealistic textures in the restored images. The reason, we believe, is that the three-channel RGB space is insufficient for supervising the restoration models. To this end, we augment the representation to hold structural information of local neighborhoods at each pixel while kee** the color information and pixel-grainedness unharmed. The result is a new representation space, dubbed augmented RGB ($a$RGB) space. Substituting the underlying representation space for the per-pixel losses facilitates the training of image restoration models, thereby improving the performance without affecting the evaluation phase. Notably, when combined with auxiliary objectives such as adversarial or perceptual losses, our $a$RGB space consistently improves overall metrics by reconstructing both color and local structures, overcoming the conventional perception-distortion trade-off. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: 31 pages (11 pages main manuscript + 20 pages appendices), 22 figures

arXiv:2402.02959 [pdf, ps, other]

On the functional equation of twisted Ruelle zeta function and Fried's conjecture

Authors: Jay Jorgenson, Min Lee, Lejla Smajlovic

Abstract: Let $M$ be a finite volume hyperbolic Riemann surface with arbitrary signature, and let $χ$ be an arbitrary $m$-dimensional multiplier system of weight $k$. Let $R(s,χ)$ be the associated Ruelle zeta function, and $\varphi(s,χ)$ the determinant of the scattering matrix. We prove the functional equation that $R(s,χ)\varphi(s,χ) = R(-s,χ)\varphi(s,χ)H(s,χ)$ where $H(s,χ)$ is a meromorphic function o… ▽ More Let $M$ be a finite volume hyperbolic Riemann surface with arbitrary signature, and let $χ$ be an arbitrary $m$-dimensional multiplier system of weight $k$. Let $R(s,χ)$ be the associated Ruelle zeta function, and $\varphi(s,χ)$ the determinant of the scattering matrix. We prove the functional equation that $R(s,χ)\varphi(s,χ) = R(-s,χ)\varphi(s,χ)H(s,χ)$ where $H(s,χ)$ is a meromorphic function of order one explicitly determined using the topological data of $M$ and of $χ$, and the trigonometric function $\sin(s)$. From this, we determine the order of the divisor of $R(s,χ)$ at $s=0$ and compute the lead coefficient in its Laurent expansion at $s=0$. When combined with results by Kitano and by Yamaguchi, we prove further instances of the Fried conjecture, which states that the R-torsion of the above data is simply expressed in terms of $R(0,χ)$. △ Less

Submitted 5 February, 2024; originally announced February 2024.

MSC Class: 11M36 (primary); 11F72

arXiv:2402.02005 [pdf, other]

Topology-Informed Graph Transformer

Authors: Yun Young Choi, Sun Woo Park, Minho Lee, Youngho Woo

Abstract: Transformers have revolutionized performance in Natural Language Processing and Vision, paving the way for their integration with Graph Neural Networks (GNNs). One key challenge in enhancing graph transformers is strengthening the discriminative power of distinguishing isomorphisms of graphs, which plays a crucial role in boosting their predictive performances. To address this challenge, we introd… ▽ More Transformers have revolutionized performance in Natural Language Processing and Vision, paving the way for their integration with Graph Neural Networks (GNNs). One key challenge in enhancing graph transformers is strengthening the discriminative power of distinguishing isomorphisms of graphs, which plays a crucial role in boosting their predictive performances. To address this challenge, we introduce 'Topology-Informed Graph Transformer (TIGT)', a novel transformer enhancing both discriminative power in detecting graph isomorphisms and the overall performance of Graph Transformers. TIGT consists of four components: A topological positional embedding layer using non-isomorphic universal covers based on cyclic subgraphs of graphs to ensure unique graph representation: A dual-path message-passing layer to explicitly encode topological characteristics throughout the encoder layers: A global attention mechanism: And a graph information layer to recalibrate channel-wise graph features for better feature representation. TIGT outperforms previous Graph Transformers in classifying synthetic dataset aimed at distinguishing isomorphism classes of graphs. Additionally, mathematical analysis and empirical evaluations highlight our model's competitive edge over state-of-the-art Graph Transformers across various benchmark datasets. △ Less

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2402.01587 [pdf, other]

The Molecular Cloud Lifecycle II: Formation and Destruction of Molecular Clouds Diagnosed via H$_2$ Fluorescent Emission Emission

Authors: Blakesley Burkhart, Shmuel Bialy, Daniel Seifried, Stefanie Walch, Erika Hamden, Thomas J. Haworth, Keri Hoadley, Shuo Kong, Madisen Johnson, Sarah Jeffreson, Mark R. Krumholz, Min-Young Lee, Amiel Sternberg, Neal J. Turner

Abstract: Molecular hydrogen (H$_2$) formation and dissociation are key processes that drive the gas lifecycle in galaxies. Using the SImulating the LifeCycle of Molecular Clouds (SILCC) zoom-in simulation suite, we explore the utility of future observations of H$_2$ dissociation and formation for tracking the lifecycle of molecular clouds. The simulations used in this work include non-equilibrium H$_2$ for… ▽ More Molecular hydrogen (H$_2$) formation and dissociation are key processes that drive the gas lifecycle in galaxies. Using the SImulating the LifeCycle of Molecular Clouds (SILCC) zoom-in simulation suite, we explore the utility of future observations of H$_2$ dissociation and formation for tracking the lifecycle of molecular clouds. The simulations used in this work include non-equilibrium H$_2$ formation, stellar radiation, sink particles, and turbulence. We find that, at early times in the cloud evolution, H$_2$ formation rapidly outpaces dissociation and molecular clouds build their mass from the atomic reservoir in their environment. Rapid H$_2$ formation is also associated with a higher early star formation rate. For the clouds studied here, H$_2$ is strongly out of chemical equilibrium during the early stages of cloud formation but settles into a bursty chemical steady-state about 2 Myrs after the first stars form. At the latest stage of cloud evolution, dissociation outweighs formation and the clouds enter a dispersal phase. We discuss how theories for the molecular cloud lifecycle and the star formation efficiency may be distinguished with observational measurements of H$_2$ fluorescence with a space-based high-resolution FUV spectrometer, such as the proposed Hyperion and Eos NASA Explorer missions. Such missions would enable measurements of the H$_2$ dissociation and formation rates, which we demonstrate can be connected to different phases in a molecular cloud's star-forming life, including cloud building, rapidly star-forming, H$_2$ chemical equilibrium, and cloud destruction. △ Less

Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: Submitted to ApJ, comments welcome

arXiv:2402.01203 [pdf, other]

Neural Language of Thought Models

Authors: Yi-Fu Wu, Minseung Lee, Sung** Ahn

Abstract: The Language of Thought Hypothesis suggests that human cognition operates on a structured, language-like system of mental representations. While neural language models can naturally benefit from the compositional structure inherently and explicitly expressed in language data, learning such representations from non-linguistic general observations, like images, remains a challenge. In this work, we… ▽ More The Language of Thought Hypothesis suggests that human cognition operates on a structured, language-like system of mental representations. While neural language models can naturally benefit from the compositional structure inherently and explicitly expressed in language data, learning such representations from non-linguistic general observations, like images, remains a challenge. In this work, we introduce the Neural Language of Thought Model (NLoTM), a novel approach for unsupervised learning of LoTH-inspired representation and generation. NLoTM comprises two key components: (1) the Semantic Vector-Quantized Variational Autoencoder, which learns hierarchical, composable discrete representations aligned with objects and their properties, and (2) the Autoregressive LoT Prior, an autoregressive transformer that learns to generate semantic concept tokens compositionally, capturing the underlying data distribution. We evaluate NLoTM on several 2D and 3D image datasets, demonstrating superior performance in downstream tasks, out-of-distribution generalization, and image generation quality compared to patch-based VQ-VAE and continuous object-centric representations. Our work presents a significant step towards creating neural networks exhibiting more human-like understanding by develo** LoT-like representations and offers insights into the intersection of cognitive science and machine learning. △ Less

Submitted 16 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: Accepted in ICLR 2024

arXiv:2401.17651 [pdf, ps, other]

Relative entropy technique in terms of position and momentum and its application to Euler-Poisson system

Authors: Jan Giesselmann, Kiwoong Kwon, Min-Gi Lee

Abstract: This paper presents a systematic study of the relative entropy technique for compressible motions of continuum bodies described as Hamiltonian flows. While the description for the classical mechanics of $N$ particles involves a Hamiltonian in terms of position and momentum vectors, that for the continuum fluid involves a Hamiltonian in terms of density and momentum. For space dimension $d\ge 2$, t… ▽ More This paper presents a systematic study of the relative entropy technique for compressible motions of continuum bodies described as Hamiltonian flows. While the description for the classical mechanics of $N$ particles involves a Hamiltonian in terms of position and momentum vectors, that for the continuum fluid involves a Hamiltonian in terms of density and momentum. For space dimension $d\ge 2$, the Hamiltonian functional has a non-convex dependency on the deformation gradient or placement map due to material frame indifference. Because of this, the applicability of the relative entropy technique with respect to the deformation gradient or the placement map is inherently limited. Despite these limitations, we delineate the feasible applications and limitations of the technique by pushing it to its available extent. Specifically, we derive the relative Hamiltonian identity, where the Hamiltonian takes the position and momentum field as its primary and conjugate state variables, all within the context of the referential coordinate system that describes the motion. This approach, when applicable, turns out to yield rather strong stability statements. As instances, we consider Euler-Poisson systems in one space dimension. For a specific pressureless model, we verify non-increasing $L^2$ state differences before the formation of $δ$-shock. In addition, weak-strong uniqueness, stability of rarefaction waves, and convergence to the gradient flow in the singular limit of large friction are shown. Depending on the presence or absence of pressure, assumptions are made to suitably accommodate phenomena such as $δ$-shocks, vacuums, and shock discontinuities in the weak solutions. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: 35 pages

arXiv:2401.17528 [pdf, ps, other]

What if PSR J1910-5959A is an observable self-lensing binary?

Authors: Man Ho Chan, Chak Man Lee

Abstract: In a binary, when the orbital plane of the companion star is almost edge-on along the line-of-sight direction, this would produce an observable self-gravitational lensing effect, which would slightly increase the overall optical intensity of the binary. However, the probability of getting one observable self-lensing binary (SLB) is very low. There are only five observed SLBs so far and all of them… ▽ More In a binary, when the orbital plane of the companion star is almost edge-on along the line-of-sight direction, this would produce an observable self-gravitational lensing effect, which would slightly increase the overall optical intensity of the binary. However, the probability of getting one observable self-lensing binary (SLB) is very low. There are only five observed SLBs so far and all of them are eclipsing binaries. In this article, we theoretically show that the neutron star-white dwarf (NS-WD) binary PSR J1910-5959A could be an observable non-eclipsing SLB. It might be the first binary showing both periodic optical amplification and Shapiro time delay of radio signals, which is useful to verify our understanding about gravitational lensing in relativistic binaries. Moreover, we show that the observed peak amplification limit of the PSR J1910-5959A can help constrain the radius of the WD, which is a crucial parameter to examine the mass-radius and temperature-radius relationship for helium WD. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: Accepted for publication in Physical Review D

Journal ref: Phys. Rev. D 109, 044049 (2024)

arXiv:2401.17360 [pdf, other]

Bender--Knuth Billiards in Coxeter Groups

Authors: Grant Barkley, Colin Defant, Eliot Hodges, Noah Kravitz, Mitchell Lee

Abstract: Let $(W,S)$ be a Coxeter system, and write $S=\{s_i:i\in I\}$, where $I$ is a finite index set. Fix a nonempty convex subset $\mathscr{L}$ of $W$. If $W$ is of type $A$, then $\mathscr{L}$ is the set of linear extensions of a poset, and there are important Bender--Knuth involutions $\mathrm{BK}_i\colon\mathscr{L}\to\mathscr{L}$ indexed by elements of $I$. For arbitrary $W$ and for each $i\in I$, w… ▽ More Let $(W,S)$ be a Coxeter system, and write $S=\{s_i:i\in I\}$, where $I$ is a finite index set. Fix a nonempty convex subset $\mathscr{L}$ of $W$. If $W$ is of type $A$, then $\mathscr{L}$ is the set of linear extensions of a poset, and there are important Bender--Knuth involutions $\mathrm{BK}_i\colon\mathscr{L}\to\mathscr{L}$ indexed by elements of $I$. For arbitrary $W$ and for each $i\in I$, we introduce an operator $τ_i\colon W\to W$ (depending on $\mathscr{L}$) that we call a noninvertible Bender--Knuth toggle; this operator restricts to an involution on $\mathscr{L}$ that coincides with $\mathrm{BK}_i$ in type $A$. Given a Coxeter element $c=s_{i_n}\cdots s_{i_1}$, we consider the operator $\mathrm{Pro}_c=τ_{i_n}\cdotsτ_{i_1}$. We say $W$ is futuristic if for every nonempty finite convex set $\mathscr{L}$, every Coxeter element $c$, and every $u\in W$, there exists an integer $K\geq 0$ such that $\mathrm{Pro}_c^K(u)\in\mathscr{L}$. We prove that finite Coxeter groups, right-angled Coxeter groups, rank-3 Coxeter groups, affine Coxeter groups of types $\widetilde A$ and $\widetilde C$, and Coxeter groups whose Coxeter graphs are complete are all futuristic. When $W$ is finite, we actually prove that if $s_{i_N}\cdots s_{i_1}$ is a reduced expression for the long element of $W$, then $τ_{i_N}\cdotsτ_{i_1}(W)=\mathscr{L}$; this allows us to determine the smallest integer $\mathrm{M}(c)$ such that $\mathrm{Pro}_c^{\mathrm{M}(c)}(W)=\mathscr{L}$ for all $\mathscr{L}$. We also exhibit infinitely many non-futuristic Coxeter groups, including all irreducible affine Coxeter groups that are not of type $\widetilde A$, $\widetilde C$, or $\widetilde G_2$. △ Less

Submitted 27 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

Comments: 51 pages, 12 figures

MSC Class: 05E18; 20F55; 37B20

arXiv:2401.17343 [pdf, other]

YTCommentQA: Video Question Answerability in Instructional Videos

Authors: Saelyne Yang, Sunghyun Park, Yunseok Jang, Moontae Lee

Abstract: Instructional videos provide detailed how-to guides for various tasks, with viewers often posing questions regarding the content. Addressing these questions is vital for comprehending the content, yet receiving immediate answers is difficult. While numerous computational models have been developed for Video Question Answering (Video QA) tasks, they are primarily trained on questions generated base… ▽ More Instructional videos provide detailed how-to guides for various tasks, with viewers often posing questions regarding the content. Addressing these questions is vital for comprehending the content, yet receiving immediate answers is difficult. While numerous computational models have been developed for Video Question Answering (Video QA) tasks, they are primarily trained on questions generated based on video content, aiming to produce answers from within the content. However, in real-world situations, users may pose questions that go beyond the video's informational boundaries, highlighting the necessity to determine if a video can provide the answer. Discerning whether a question can be answered by video content is challenging due to the multi-modal nature of videos, where visual and verbal information are intertwined. To bridge this gap, we present the YTCommentQA dataset, which contains naturally-generated questions from YouTube, categorized by their answerability and required modality to answer -- visual, script, or both. Experiments with answerability classification tasks demonstrate the complexity of YTCommentQA and emphasize the need to comprehend the combined role of visual and script information in video reasoning. The dataset is available at https://github.com/lgresearch/YTCommentQA. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: AAAI 2024

arXiv:2401.16693 [pdf]

Synchronization Behavior of Newton's Cradle

Authors: Minseok Lee, Seokchan Hong

Abstract: A Newton's cradle is a device that demonstrates conservation of momentum using a series of identical colliding pendula. Despite being a famous example that demonstrates the concept of momentum conservation, extensive analysis of the system is rarely reported in literature. Here, we model the system as a collection of identical nonlinear spring pendulums performing viscoelastic collisions, which sh… ▽ More A Newton's cradle is a device that demonstrates conservation of momentum using a series of identical colliding pendula. Despite being a famous example that demonstrates the concept of momentum conservation, extensive analysis of the system is rarely reported in literature. Here, we model the system as a collection of identical nonlinear spring pendulums performing viscoelastic collisions, which shows excellent agreement with experiments performed at various conditions. Dependence of its synchronization rate on four key system parameters are studied in detail. Interestingly, the resonance between radial and angular motion was found to modulate the synchronization rate. The proposed theory with full consideration of two dimensional motion and string hysteresis provides an excellent long-term prediction of the synchronized cradle motion. △ Less

Submitted 12 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

arXiv:2401.15894 [pdf, other]

A Gated MLP Architecture for Learning Topological Dependencies in Spatio-Temporal Graphs

Authors: Yun Young Choi, Minho Lee, Sun Woo Park, Seunghwan Lee, Joohwan Ko

Abstract: Graph Neural Networks (GNNs) and Transformer have been increasingly adopted to learn the complex vector representations of spatio-temporal graphs, capturing intricate spatio-temporal dependencies crucial for applications such as traffic datasets. Although many existing methods utilize multi-head attention mechanisms and message-passing neural networks (MPNNs) to capture both spatial and temporal r… ▽ More Graph Neural Networks (GNNs) and Transformer have been increasingly adopted to learn the complex vector representations of spatio-temporal graphs, capturing intricate spatio-temporal dependencies crucial for applications such as traffic datasets. Although many existing methods utilize multi-head attention mechanisms and message-passing neural networks (MPNNs) to capture both spatial and temporal relations, these approaches encode temporal and spatial relations independently, and reflect the graph's topological characteristics in a limited manner. In this work, we introduce the Cycle to Mixer (Cy2Mixer), a novel spatio-temporal GNN based on topological non-trivial invariants of spatio-temporal graphs with gated multi-layer perceptrons (gMLP). The Cy2Mixer is composed of three blocks based on MLPs: A message-passing block for encapsulating spatial information, a cycle message-passing block for enriching topological information through cyclic subgraphs, and a temporal block for capturing temporal properties. We bolster the effectiveness of Cy2Mixer with mathematical evidence emphasizing that our cycle message-passing block is capable of offering differentiated information to the deep learning model compared to the message-passing block. Furthermore, empirical evaluations substantiate the efficacy of the Cy2Mixer, demonstrating state-of-the-art performances across various traffic benchmark datasets. △ Less

Submitted 29 January, 2024; originally announced January 2024.

arXiv:2401.15773 [pdf]

Evaluation of k-means time series clustering based on z-normalization and NP-Free

Authors: Ming-Chang Lee, Jia-Chun Lin, Volker Stolz

Abstract: Despite the widespread use of k-means time series clustering in various domains, there exists a gap in the literature regarding its comprehensive evaluation with different time series normalization approaches. This paper seeks to fill this gap by conducting a thorough performance evaluation of k-means time series clustering on real-world open-source time series datasets. The evaluation focuses on… ▽ More Despite the widespread use of k-means time series clustering in various domains, there exists a gap in the literature regarding its comprehensive evaluation with different time series normalization approaches. This paper seeks to fill this gap by conducting a thorough performance evaluation of k-means time series clustering on real-world open-source time series datasets. The evaluation focuses on two distinct normalization techniques: z-normalization and NP-Free. The former is one of the most commonly used normalization approach for time series. The latter is a real-time time series representation approach, which can serve as a time series normalization approach. The primary objective of this paper is to assess the impact of these two normalization techniques on k-means time series clustering in terms of its clustering quality. The experiments employ the silhouette score, a well-established metric for evaluating the quality of clusters in a dataset. By systematically investigating the performance of k-means time series clustering with these two normalization techniques, this paper addresses the current gap in k-means time series clustering evaluation and contributes valuable insights to the development of time series clustering. △ Less

Submitted 28 January, 2024; originally announced January 2024.

Comments: 12 pages, 6 figures, 8 tables, 13th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2024)

arXiv:2401.15413 [pdf]

doi 10.1021/cbmi.4c00016

Hyperphosphorylation-Induced Phase Transition in Vesicle Delivery Dynamics of Motor Proteins in Neuronal Cells

Authors: Eunsang Lee, Donghee Kim, Yo Han Song, Kyu** Shin, Sanggeun Song, Minho Lee, Yeongchang Goh, Mi Hee Lim, Ji-Hyun Kim, Jaeyoung Sung, Kang Taek Lee

Abstract: Synaptic vesicle transport by motor proteins along microtubules is a crucial active process underlying neuronal communication. It is known that microtubules are destabilized by tau-hyperphosphorylation, which causes tau proteins to detach from microtubules and form neurofibril tangles. However, how tau-phosphorylation affects transport dynamics of motor proteins on the microtubule remains unknown.… ▽ More Synaptic vesicle transport by motor proteins along microtubules is a crucial active process underlying neuronal communication. It is known that microtubules are destabilized by tau-hyperphosphorylation, which causes tau proteins to detach from microtubules and form neurofibril tangles. However, how tau-phosphorylation affects transport dynamics of motor proteins on the microtubule remains unknown. Here, we discover that long-distance unidirectional motion of vesicle-motor protein multiplexes (VMPMs) in living cells is suppressed under tau-hyperphosphorylation, with the consequent loss of fast vesicle-transport along the microtubule. The VMPMs in hyperphosphorylated cells exhibit seemingly bidirectional random motion, with dynamic properties far different from VMPM motion in normal cells. We establish a parsimonious physicochemical model of VMPM's active motion that provides a unified, quantitative explanation and predictions for our experimental results. Our analysis reveals that, under hyperphosphorylation conditions, motor-protein-multiplexes have both static and dynamic motility fluctuations. The loss of the fast vesicle-transport along the microtubule can be a mechanism of neurodegenerative disorders associated with tau-hyperphosphorylation. △ Less

Submitted 23 April, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

arXiv:2401.14698 [pdf, other]

Under the Surface: Tracking the Artifactuality of LLM-Generated Data

Authors: Debarati Das, Karin De Langis, Anna Martin-Boyle, Jaehyung Kim, Minhwa Lee, Zae Myung Kim, Shirley Anugrah Hayati, Risako Owan, Bin Hu, Ritik Parkar, Ryan Koo, Jonginn Park, Aahan Tyagi, Libby Ferland, Sanjali Roy, Vincent Liu, Dongyeop Kang

Abstract: This work delves into the expanding role of large language models (LLMs) in generating artificial data. LLMs are increasingly employed to create a variety of outputs, including annotations, preferences, instruction prompts, simulated dialogues, and free text. As these forms of LLM-generated data often intersect in their application, they exert mutual influence on each other and raise significant c… ▽ More This work delves into the expanding role of large language models (LLMs) in generating artificial data. LLMs are increasingly employed to create a variety of outputs, including annotations, preferences, instruction prompts, simulated dialogues, and free text. As these forms of LLM-generated data often intersect in their application, they exert mutual influence on each other and raise significant concerns about the quality and diversity of the artificial data incorporated into training cycles, leading to an artificial data ecosystem. To the best of our knowledge, this is the first study to aggregate various types of LLM-generated text data, from more tightly constrained data like "task labels" to more lightly constrained "free-form text". We then stress test the quality and implications of LLM-generated artificial data, comparing it with human data across various existing benchmarks. Despite artificial data's capability to match human performance, this paper reveals significant hidden disparities, especially in complex tasks where LLMs often miss the nuanced understanding of intrinsic human-generated content. This study critically examines diverse LLM-generated data and emphasizes the need for ethical practices in data creation and when using LLMs. It highlights the LLMs' shortcomings in replicating human traits and behaviors, underscoring the importance of addressing biases and artifacts produced in LLM-generated content for future research and development. All data and code are available on our project page. △ Less

Submitted 30 January, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: Core Authors: Debarati Das, Karin De Langis, Anna Martin-Boyle, Jaehyung Kim, Minhwa Lee and Zae Myung Kim | Project lead : Debarati Das | PI : Dongyeop Kang

arXiv:2401.13970 [pdf, ps, other]

doi 10.1145/3613905.3636287

CUI@CHI 2024: Building Trust in CUIs-From Design to Deployment

Authors: Smit Desai, Christina Wei, Jaisie Sin, Mateusz Dubiel, Nima Zargham, Shashank Ahire, Martin Porcheron, Anastasia Kuzminykh, Minha Lee, Heloisa Candello, Joel Fischer, Cosmin Munteanu, Benjamin R Cowan

Abstract: Conversational user interfaces (CUIs) have become an everyday technology for people the world over, as well as a booming area of research. Advances in voice synthesis and the emergence of chatbots powered by large language models (LLMs), notably ChatGPT, have pushed CUIs to the forefront of human-computer interaction (HCI) research and practice. Now that these technologies enable an elemental leve… ▽ More Conversational user interfaces (CUIs) have become an everyday technology for people the world over, as well as a booming area of research. Advances in voice synthesis and the emergence of chatbots powered by large language models (LLMs), notably ChatGPT, have pushed CUIs to the forefront of human-computer interaction (HCI) research and practice. Now that these technologies enable an elemental level of usability and user experience (UX), we must turn our attention to higher-order human factors: trust and reliance. In this workshop, we aim to bring together a multidisciplinary group of researchers and practitioners invested in the next phase of CUI design. Through keynotes, presentations, and breakout sessions, we will share our knowledge, identify cutting-edge resources, and fortify an international network of CUI scholars. In particular, we will engage with the complexity of trust and reliance as attitudes and behaviours that emerge when people interact with conversational agents. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.13899 [pdf, ps, other]

Differential Energy Equalities for Weak Solutions to the Navier-Stokes Equation

Authors: M. -C. Lee, J. Glimm, A. H. Rahimyar, T. Wallstrom

Abstract: We prove new results for general weak solutions of the incompressible Navier-Stokes equations on the three-dimensional torus, without using the strong energy inequality or other regularity assumptions. Specifically, we prove two types of differential energy equalities, in which $(d/dt)\lVert u\rVert^2$ is replaced by $2\langle \partial_t u,u\rangle$ and… ▽ More We prove new results for general weak solutions of the incompressible Navier-Stokes equations on the three-dimensional torus, without using the strong energy inequality or other regularity assumptions. Specifically, we prove two types of differential energy equalities, in which $(d/dt)\lVert u\rVert^2$ is replaced by $2\langle \partial_t u,u\rangle$ and $\lim_{N\rightarrow\infty}(d/dt)\lVert u_N\rVert^2$, where the $u_N$ are projections of the original weak solution $u$. We also provide a new characterization of Leray-Hopf solutions, as weak solutions having a version for which the energy is monotonically decreasing. In order to prove these results, we first prove that both $\partial_t u$ and $Δu$ are in $L^{4/3}(ε, T; L^{6/5})$, results that were previously known only for Leray-Hopf weak solutions. △ Less

Submitted 5 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

Comments: 3D decaying turbulence, general weak solutions, interior regularity, differential energy equality, Leray-Hopf weak solutions

arXiv:2401.13836 [pdf, other]

doi 10.1016/j.conengprac.2024.105841

Machine learning for industrial sensing and control: A survey and practical perspective

Authors: Nathan P. Lawrence, Seshu Kumar Damarla, Jong Woo Kim, Aditya Tulsyan, Faraz Amjad, Kai Wang, Benoit Chachuat, Jong Min Lee, Biao Huang, R. Bhushan Gopaluni

Abstract: With the rise of deep learning, there has been renewed interest within the process industries to utilize data on large-scale nonlinear sensing and control problems. We identify key statistical and machine learning techniques that have seen practical success in the process industries. To do so, we start with hybrid modeling to provide a methodological framework underlying core application areas: so… ▽ More With the rise of deep learning, there has been renewed interest within the process industries to utilize data on large-scale nonlinear sensing and control problems. We identify key statistical and machine learning techniques that have seen practical success in the process industries. To do so, we start with hybrid modeling to provide a methodological framework underlying core application areas: soft sensing, process optimization, and control. Soft sensing contains a wealth of industrial applications of statistical and machine learning methods. We quantitatively identify research trends, allowing insight into the most successful techniques in practice. We consider two distinct flavors for data-driven optimization and control: hybrid modeling in conjunction with mathematical programming techniques and reinforcement learning. Throughout these application areas, we discuss their respective industrial requirements and challenges. A common challenge is the interpretability and efficiency of purely data-driven methods. This suggests a need to carefully balance deep learning techniques with domain knowledge. As a result, we highlight ways prior knowledge may be integrated into industrial machine learning applications. The treatment of methods, problems, and applications presented here is poised to inform and inspire practitioners and researchers to develop impactful data-driven sensing, optimization, and control solutions in the process industries. △ Less

Submitted 24 January, 2024; originally announced January 2024.

Comments: 48 pages

Journal ref: Control Engineering Practice 2024

arXiv:2401.13253 [pdf]

doi 10.1021/acs.jpclett.4c00323

Transport Dynamics of Water Molecules Confined between Lipid Membranes

Authors: Minho Lee, Euihyun Lee, Ji-Hyun Kim, Hyonseok Hwang, Minhaeng Cho, Jaeyoung Sung

Abstract: Water molecules confined between biological membranes exhibit a distinctive non-Gaussian displacement distribution, far different from bulk water. Here, we introduce a new transport equation for water molecules in the intermembrane space, quantitatively explaining molecular dynamics simulation results. We find the unique transport dynamics of water molecules stems from the lateral diffusion coeffi… ▽ More Water molecules confined between biological membranes exhibit a distinctive non-Gaussian displacement distribution, far different from bulk water. Here, we introduce a new transport equation for water molecules in the intermembrane space, quantitatively explaining molecular dynamics simulation results. We find the unique transport dynamics of water molecules stems from the lateral diffusion coefficient fluctuation caused by their longitudinal motion. We also identify an interfacial region where water possesses distinct physical properties, unaffected by changes in the intermembrane separation. △ Less

Submitted 18 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

Comments: 4 figures in main text, 4 figures in Supplemental Material, 1 Supplemental Video

arXiv:2401.12021 [pdf, other]

Study of $Υ(10753)$ decays to $π^{+}π^{-}Υ(nS)$ final states at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker , et al. (371 additional authors not shown)

Abstract: We present an analysis of the process $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ (where $n$ = 1, 2, or 3) reconstructed in $19.6\rm$ $\rm fb^{-1}$ of Belle II data during a special run of the SuperKEKB collider at four energy points near the peak of the $Υ(10753)$ resonance. By analyzing the mass distribution of the $π^+π^-Υ(nS)$ system and the Born cross sections of the $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ process… ▽ More We present an analysis of the process $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ (where $n$ = 1, 2, or 3) reconstructed in $19.6\rm$ $\rm fb^{-1}$ of Belle II data during a special run of the SuperKEKB collider at four energy points near the peak of the $Υ(10753)$ resonance. By analyzing the mass distribution of the $π^+π^-Υ(nS)$ system and the Born cross sections of the $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ process, we report the first observation of $Υ(10753)$ decays to the $π^{+}π^{-}Υ(1S)$ and $π^{+}π^{-}Υ(2S)$ final states, and find no evidence for decays to $π^{+}π^{-}Υ(3S)$. Possible intermediate states in the $π^+π^-Υ(1S,2S)$ transitions are also investigated, and no evidence for decays proceeding via the $π^\mp Z_b^\pm$ or $f_0(980)Υ(nS)$ intermediate states is found. We measure Born cross sections for the $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ process that, combined with results from Belle, improve the precision of measurements of the $Υ(10753)$ mass and width by nearly a factor of two to $(10756.3\pm2.7\pm0.6)$ MeV/$c^2$ and $(29.7\pm8.5\pm1.1)$ MeV, respectively. The relative ratios of the Born cross sections at the $Υ(10753)$ resonance peak are also reported for the first time. △ Less

Submitted 18 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.11826 [pdf, other]

Tracing the rise of supermassive black holes: A panchromatic search for faint, unobscured quasars at z > 6 with COSMOS-Web and other surveys

Authors: Irham T. Andika, Knud Jahnke, Masafusa Onoue, John D. Silverman, Itsna K. Fitriana, Angela Bongiorno, Malte Brinch, Caitlin M. Casey, Andreas Faisst, Steven Gillman, Ghassem Gozaliasl, Christopher C. Hayward, Michaela Hirschmann, Dale Kocevski, Anton M. Koekemoer, Vasily Kokorev, Erini Lambrides, Minju M. Lee, R. Michael Rich, Benny Trakhtenbrot, C. Megan Urry, Stephen M. Wilkins, Aswin P. Vijayan

Abstract: We report the identification of 64 new candidates of compact galaxies, potentially hosting faint quasars with bolometric luminosities of $L_\mathrm{bol} = 10^{43}$--10$^{46}$ erg s$^{-1}$, residing in the reionization epoch within the redshift range of $6 \lesssim z \lesssim 8$. These candidates were selected by harnessing the rich multiband datasets provided by the emerging JWST-driven extragalac… ▽ More We report the identification of 64 new candidates of compact galaxies, potentially hosting faint quasars with bolometric luminosities of $L_\mathrm{bol} = 10^{43}$--10$^{46}$ erg s$^{-1}$, residing in the reionization epoch within the redshift range of $6 \lesssim z \lesssim 8$. These candidates were selected by harnessing the rich multiband datasets provided by the emerging JWST-driven extragalactic surveys, focusing on COSMOS-Web, as well as JADES, UNCOVER, CEERS, and PRIMER. Our search strategy includes two stages: applying stringent photometric cuts to catalog-level data and detailed spectral energy distribution fitting. These techniques effectively isolate the quasar candidates while mitigating contamination from low-redshift interlopers, such as brown dwarfs and nearby galaxies. The selected candidates indicate physical traits compatible with low-luminosity active galactic nuclei, likely hosting $\approx10^5$--$10^7~M_\odot$ supermassive black holes (SMBHs) living in galaxies with stellar masses of $\approx10^8$--$10^{10}~M_\odot$. The SMBHs selected in this study, on average, exhibit an elevated mass compared to their hosts, with the mass ratio distribution slightly higher than those of galaxies in the local Universe. As with other high-$z$ studies, this is at least in part due to the selection method for these quasars. An extensive Monte Carlo analysis provides compelling evidence that heavy black hole seeds from the direct collapse scenario appear to be the preferred pathway to mature this specific subset of SMBHs by $z\approx7$. This work underscores the significance of further spectroscopic observations, as the quasar candidates presented here offer exceptional opportunities to delve into the nature of the earliest galaxies and SMBHs that formed during cosmic infancy. △ Less

Submitted 2 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Comments: Accepted for publication in the Astronomy & Astrophysics journal. 19 pages, 10 figures, and 4 tables. We welcome comments from the reader

arXiv:2401.11668 [pdf, ps, other]

Constraining annihilating dark matter using the multi-frequency radio flux profiles of the M33 galaxy

Authors: Man Ho Chan, Chak Man Lee, Lang Cui, Ning Chang, Chun Sing Leung

Abstract: Radio data can give stringent constraints for annihilating dark matter. In general, radio observations can detect very accurate radio flux density with high resolution and different frequencies for nearby galaxies. We are able to obtain the radio flux density as a function of distance from the galactic center and frequencies $S(r,ν)$. In this article, we demonstrate a comprehensive radio analysis… ▽ More Radio data can give stringent constraints for annihilating dark matter. In general, radio observations can detect very accurate radio flux density with high resolution and different frequencies for nearby galaxies. We are able to obtain the radio flux density as a function of distance from the galactic center and frequencies $S(r,ν)$. In this article, we demonstrate a comprehensive radio analysis of the M33 galaxy, combining the radio flux density profile $S(r)$ and the frequency spectrum $S(ν)$ to get the constraints of dark matter annihilation parameters. By analyzing the archival radio data obtained from the Effelsberg telescope, we show that the dark matter annihilation contributing to the radio flux density might be insignificant in the disk region of the M33 galaxy. Moreover, by including the baryonic radio contribution, we constrain the $2σ$ conservative upper limits of the annihilation cross section, which can be complementary to the existing constraints based on neutrino, cosmic-ray, and gamma-ray observations. Our results indicate that analyzing the galactic multi-frequency radio flux profiles can give useful and authentic constraints on dark matter for the leptophilic annihilation channels. △ Less

Submitted 21 January, 2024; originally announced January 2024.

Comments: Accepted publication in ApJ

Journal ref: ApJ 962,141 (2024)

arXiv:2401.10838 [pdf, other]

doi 10.1145/3613904.3642217

Rambler: Supporting Writing With Speech via LLM-Assisted Gist Manipulation

Authors: Susan Lin, Jeremy Warner, J. D. Zamfirescu-Pereira, Matthew G. Lee, Sauhard Jain, Michael Xuelin Huang, Piyawat Lertvittayakumjorn, Shanqing Cai, Shumin Zhai, Björn Hartmann, Can Liu

Abstract: Dictation enables efficient text input on mobile devices. However, writing with speech can produce disfluent, wordy, and incoherent text and thus requires heavy post-processing. This paper presents Rambler, an LLM-powered graphical user interface that supports gist-level manipulation of dictated text with two main sets of functions: gist extraction and macro revision. Gist extraction generates key… ▽ More Dictation enables efficient text input on mobile devices. However, writing with speech can produce disfluent, wordy, and incoherent text and thus requires heavy post-processing. This paper presents Rambler, an LLM-powered graphical user interface that supports gist-level manipulation of dictated text with two main sets of functions: gist extraction and macro revision. Gist extraction generates keywords and summaries as anchors to support the review and interaction with spoken text. LLM-assisted macro revisions allow users to respeak, split, merge and transform dictated text without specifying precise editing locations. Together they pave the way for interactive dictation and revision that help close gaps between spontaneous spoken words and well-structured writing. In a comparative study with 12 participants performing verbal composition tasks, Rambler outperformed the baseline of a speech-to-text editor + ChatGPT, as it better facilitates iterative revisions with enhanced user control over the content while supporting surprisingly diverse user strategies. △ Less

Submitted 7 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

Comments: To appear at ACM CHI 2024

arXiv:2401.08495 [pdf, other]

doi 10.1145/3630106.3658975

Large Language Models Portray Socially Subordinate Groups as More Homogeneous, Consistent with a Bias Observed in Humans

Authors: Messi H. J. Lee, Jacob M. Montgomery, Calvin K. Lai

Abstract: Large language models (LLMs) are becoming pervasive in everyday life, yet their propensity to reproduce biases inherited from training data remains a pressing concern. Prior investigations into bias in LLMs have focused on the association of social groups with stereotypical attributes. However, this is only one form of human bias such systems may reproduce. We investigate a new form of bias in LLM… ▽ More Large language models (LLMs) are becoming pervasive in everyday life, yet their propensity to reproduce biases inherited from training data remains a pressing concern. Prior investigations into bias in LLMs have focused on the association of social groups with stereotypical attributes. However, this is only one form of human bias such systems may reproduce. We investigate a new form of bias in LLMs that resembles a social psychological phenomenon where socially subordinate groups are perceived as more homogeneous than socially dominant groups. We had ChatGPT, a state-of-the-art LLM, generate texts about intersectional group identities and compared those texts on measures of homogeneity. We consistently found that ChatGPT portrayed African, Asian, and Hispanic Americans as more homogeneous than White Americans, indicating that the model described racial minority groups with a narrower range of human experience. ChatGPT also portrayed women as more homogeneous than men, but these differences were small. Finally, we found that the effect of gender differed across racial/ethnic groups such that the effect of gender was consistent within African and Hispanic Americans but not within Asian and White Americans. We argue that the tendency of LLMs to describe groups as less diverse risks perpetuating stereotypes and discriminatory behavior. △ Less

Submitted 25 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

Comments: Forthcoming at ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2024

arXiv:2401.07716 [pdf, other]

Layerwise Quantum Convolutional Neural Networks Provide a Unified Way for Estimating Fundamental Properties of Quantum Information Theory

Authors: Myeong** Shin, Seungwoo Lee, Mingyu Lee, Donghwa Ji, Hyeonjun Yeo, Harrison J. Lee, Kabgyun Jeong

Abstract: The estimation of fundamental properties in quantum information theory, including von Neumann entropy, Rényi entropy, Tsallis entropy, quantum relative entropy, trace distance, and fidelity, has received significant attention. While various algorithms exist for individual property estimation, a unified approach is lacking. This paper proposes a unified methodology using Layerwise Quantum Convoluti… ▽ More The estimation of fundamental properties in quantum information theory, including von Neumann entropy, Rényi entropy, Tsallis entropy, quantum relative entropy, trace distance, and fidelity, has received significant attention. While various algorithms exist for individual property estimation, a unified approach is lacking. This paper proposes a unified methodology using Layerwise Quantum Convolutional Neural Networks (LQCNN). Recent studies exploring parameterized quantum circuits for property estimation face challenges such as barren plateaus and complexity issues in large qubit states. In contrast, our work overcomes these challenges, avoiding barren plateaus and providing a practical solution for large qubit states. Our first contribution offers a mathematical proof that the LQCNN structure preserves fundamental properties. Furthermore, our second contribution analyzes the algorithm's complexity, demonstrating its avoidance of barren plateaus through a structured local cost function. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: 9 pages, 1 figure

arXiv:2401.07476 [pdf, other]

Background study of the AMoRE-pilot experiment

Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Yu. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (83 additional authors not shown)

Abstract: We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay (\znbb) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of \CAMOO~ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental conf… ▽ More We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay (\znbb) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of \CAMOO~ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental configurations with the results of Monte Carlo simulations and identified the background sources in each configuration. We replaced several detector components and enhanced the neutron shielding to lower the background level between configurations. A limit on the half-life of $0νββ$ decay of $^{100}$Mo was found at $T_{1/2}^{0ν} \ge 3.0\times 10^{23}$ years at 90\% confidence level, based on the measured background and its modeling. Further reduction of the background rate in the AMoRE-I and AMoRE-II are discussed. △ Less

Submitted 7 April, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

arXiv:2401.07462 [pdf, other]

doi 10.1140/epjc/s10052-024-12770-1

Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments

Authors: S. M. Lee, G. Adhikari, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Fran. a, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, S. W. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim , et al. (37 additional authors not shown)

Abstract: We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced… ▽ More We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced by decays supported by both long and short-lived isotopes. Analyzing peaks from decays supported only by short-lived isotopes presented a unique challenge due to their limited statistics and overlap** energies, which was overcome by long-term data collection and a time-dependent analysis. A key achievement is the direct measurement of the 0.87 keV light yield, resulting from the cascade following electron capture decay of $^{22}$Na from internal contamination. This measurement, previously accessible only indirectly, deepens our understanding of NaI(Tl) scintillator behavior in the region of interest for dark matter searches. This study holds substantial implications for background modeling and the interpretation of dark matter signals in NaI(Tl) experiments. △ Less

Submitted 10 May, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

Comments: 12 pages, 7 figures

Journal ref: Eur. Phys. J. C 84 (2024) 484

arXiv:2401.07298 [pdf, other]

Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems

Authors: Yue Kang, Cho-Jui Hsieh, Thomas C. M. Lee

Abstract: In the stochastic contextual low-rank matrix bandit problem, the expected reward of an action is given by the inner product between the action's feature matrix and some fixed, but initially unknown $d_1$ by $d_2$ matrix $Θ^*$ with rank $r \ll \{d_1, d_2\}$, and an agent sequentially takes actions based on past experience to maximize the cumulative reward. In this paper, we study the generalized lo… ▽ More In the stochastic contextual low-rank matrix bandit problem, the expected reward of an action is given by the inner product between the action's feature matrix and some fixed, but initially unknown $d_1$ by $d_2$ matrix $Θ^*$ with rank $r \ll \{d_1, d_2\}$, and an agent sequentially takes actions based on past experience to maximize the cumulative reward. In this paper, we study the generalized low-rank matrix bandit problem, which has been recently proposed in \cite{lu2021low} under the Generalized Linear Model (GLM) framework. To overcome the computational infeasibility and theoretical restrain of existing algorithms on this problem, we first propose the G-ESTT framework that modifies the idea from \cite{jun2019bilinear} by using Stein's method on the subspace estimation and then leverage the estimated subspaces via a regularization idea. Furthermore, we remarkably improve the efficiency of G-ESTT by using a novel exclusion idea on the estimated subspace instead, and propose the G-ESTS framework. We also show that G-ESTT can achieve the $\tilde{O}(\sqrt{(d_1+d_2)MrT})$ bound of regret while G-ESTS can achineve the $\tilde{O}(\sqrt{(d_1+d_2)^{3/2}Mr^{3/2}T})$ bound of regret under mild assumption up to logarithm terms, where $M$ is some problem dependent value. Under a reasonable assumption that $M = O((d_1+d_2)^2)$ in our problem setting, the regret of G-ESTT is consistent with the current best regret of $\tilde{O}((d_1+d_2)^{3/2} \sqrt{rT}/D_{rr})$~\citep{lu2021low} ($D_{rr}$ will be defined later). For completeness, we conduct experiments to illustrate that our proposed algorithms, especially G-ESTS, are also computationally tractable and consistently outperform other state-of-the-art (generalized) linear matrix bandit methods based on a suite of simulations. △ Less

Submitted 14 January, 2024; originally announced January 2024.

Comments: Revision of the paper accepted by NeurIPS 2022

arXiv:2401.05036 [pdf, other]

doi 10.1051/0004-6361/202347863

Can the giant planets of the Solar System form via pebble accretion in a smooth protoplanetary disc?

Authors: Tommy Chi Ho Lau, Man Hoi Lee, Ramon Brasser, Soko Matsumura

Abstract: Prevailing $N$-body planet formation models typically start with lunar-mass embryos and show a general trend of rapid migration of massive planetary cores to the inner Solar System in the absence of a migration trap. This setup cannot capture the evolution from a planetesimal to embryo, which is crucial to the final architecture of the system. We aim to model planet formation with planet migration… ▽ More Prevailing $N$-body planet formation models typically start with lunar-mass embryos and show a general trend of rapid migration of massive planetary cores to the inner Solar System in the absence of a migration trap. This setup cannot capture the evolution from a planetesimal to embryo, which is crucial to the final architecture of the system. We aim to model planet formation with planet migration starting with planetesimals of $\sim10^{-6}$ -- $10^{-4}M_\oplus$ and reproduce the giant planets of the Solar System. We simulated a population of 1,000 -- 5,000 planetesimals in a smooth protoplanetary disc, which was evolved under the effects of their mutual gravity, pebble accretion, gas accretion, and planet migration, employing the parallelized $N$-body code SyMBAp. We find that the dynamical interactions among growing planetesimals are vigorous and can halt pebble accretion for excited bodies. While a set of results without planet migration produces one to two gas giants and one to two ice giants beyond 6 au, massive planetary cores readily move to the inner Solar System once planet migration is in effect. Dynamical heating is important in a planetesimal disc and the reduced pebble encounter time should be considered in similar models. Planet migration remains a challenge to form cold giant planets in a smooth protoplanetary disc, which suggests an alternative mechanism is required to stop them at wide orbits. △ Less

Submitted 25 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

Comments: 17 pages, 14 figures, replaced with published version on A&A

arXiv:2401.04143 [pdf, other]

RHOBIN Challenge: Reconstruction of Human Object Interaction

Authors: Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeong** Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

Abstract: Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear… ▽ More Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate research fields in computer vision for a long time. We hence proposed the first RHOBIN challenge: reconstruction of human-object interactions in conjunction with the RHOBIN workshop. It was aimed at bringing the research communities of human and object reconstruction as well as interaction modeling together to discuss techniques and exchange ideas. Our challenge consists of three tracks of 3D reconstruction from monocular RGB images with a focus on dealing with challenging interaction scenarios. Our challenge attracted more than 100 participants with more than 300 submissions, indicating the broad interest in the research communities. This paper describes the settings of our challenge and discusses the winning methods of each track in more detail. We observe that the human reconstruction task is becoming mature even under heavy occlusion settings while object pose estimation and joint reconstruction remain challenging tasks. With the growing interest in interaction modeling, we hope this report can provide useful insights and foster future research in this direction. Our workshop website can be found at \href{https://rhobin-challenge.github.io/}{https://rhobin-challenge.github.io/}. △ Less

Submitted 7 January, 2024; originally announced January 2024.

Comments: 14 pages, 5 tables, 7 figure. Technical report of the CVPR'23 workshop: RHOBIN challenge (https://rhobin-challenge.github.io/)

arXiv:2401.04035 [pdf, other]

doi 10.1021/acs.jpclett.4c00065

Unveiling multi-quantum excitonic correlations in push-pull polymer semiconductors

Authors: Yulong Zheng, Esteban Rojas-Gatjens, Myeongyeon Lee, Elsa Reichmanis, Carlos Silva-Acuña

Abstract: Bound and unbound Frenkel-exciton pairs are essential transient precursors for a variety of photophysical and biochemical processes. In this work, we identify bound and unbound {Frenkel}-exciton complexes in an electron push-pull polymer semiconductor using coherent two-dimensional spectroscopy. We find that the dominant $A_{0-1}$ peak of the absorption vibronic progression is accompanied by a sub… ▽ More Bound and unbound Frenkel-exciton pairs are essential transient precursors for a variety of photophysical and biochemical processes. In this work, we identify bound and unbound {Frenkel}-exciton complexes in an electron push-pull polymer semiconductor using coherent two-dimensional spectroscopy. We find that the dominant $A_{0-1}$ peak of the absorption vibronic progression is accompanied by a sub-peak, each dressed by distinct vibrational modes. By considering the Liouville pathways within a two-exciton model, the imbalanced cross peaks in one-quantum rephasing and non-rephasing spectra can be accounted for by the presence of pure biexcitons. The two-quantum non-rephasing spectra, on the other hand, provide direct evidence for unbound exciton pairs and biexcitons with dominantly attractive force. In addition, the spectral features of unbound exciton pairs show mixed absorptive and dispersive character, implying many-body interactions within the correlated {Frenkel}-exciton pairs. Our work offers novel perspectives on the rich photophysical processes in semiconductor polymers with the presence of Frenkel exciton complexes. △ Less

Submitted 22 February, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: Submitted for publication to The Journal of Physical Chemistry Letters

Journal ref: J. Phys. Chem. Lett. 2024, 15, 3705-3712

arXiv:2401.04007 [pdf, other]

Task-Oriented Active Learning of Model Preconditions for Inaccurate Dynamics Models

Authors: Alex LaGrassa, Moonyoung Lee, Oliver Kroemer

Abstract: When planning with an inaccurate dynamics model, a practical strategy is to restrict planning to regions of state-action space where the model is accurate: also known as a \textit{model precondition}. Empirical real-world trajectory data is valuable for defining data-driven model preconditions regardless of the model form (analytical, simulator, learned, etc...). However, real-world data is often… ▽ More When planning with an inaccurate dynamics model, a practical strategy is to restrict planning to regions of state-action space where the model is accurate: also known as a \textit{model precondition}. Empirical real-world trajectory data is valuable for defining data-driven model preconditions regardless of the model form (analytical, simulator, learned, etc...). However, real-world data is often expensive and dangerous to collect. In order to achieve data efficiency, this paper presents an algorithm for actively selecting trajectories to learn a model precondition for an inaccurate pre-specified dynamics model. Our proposed techniques address challenges arising from the sequential nature of trajectories, and potential benefit of prioritizing task-relevant data. The experimental analysis shows how algorithmic properties affect performance in three planning scenarios: icy gridworld, simulated plant watering, and real-world plant watering. Results demonstrate an improvement of approximately 80% after only four real-world trajectories when using our proposed techniques. △ Less

Submitted 23 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: Accepted to International Conference on Robotics and Automation 2024. Will be presented May 2024

arXiv:2401.02840 [pdf, other]

A test of lepton flavor universality with a measurement of $R(D^{*})$ using hadronic $B$ tagging at the Belle II experiment

Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, M. Bauer, A. Baur , et al. (412 additional authors not shown)

Abstract: The ratio of branching fractions $R(D^{*}) = \mathcal{B}(\overline{B} \rightarrow D^{*} τ^{-} \overlineν_τ)$/$\mathcal{B} (\overline{B} \rightarrow D^{*} \ell^{-} \overlineν_{\ell})$, where $\ell$ is an electron or muon, is measured using a Belle~II data sample with an integrated luminosity of $189~\mathrm{fb}^{-1}$ at the SuperKEKB asymmetric-energy $e^{+} e^{-}$ collider. Data is collected at th… ▽ More The ratio of branching fractions $R(D^{*}) = \mathcal{B}(\overline{B} \rightarrow D^{*} τ^{-} \overlineν_τ)$/$\mathcal{B} (\overline{B} \rightarrow D^{*} \ell^{-} \overlineν_{\ell})$, where $\ell$ is an electron or muon, is measured using a Belle~II data sample with an integrated luminosity of $189~\mathrm{fb}^{-1}$ at the SuperKEKB asymmetric-energy $e^{+} e^{-}$ collider. Data is collected at the $Υ(\mathrm{4S})$ resonance, and one $B$ meson in the $Υ(\mathrm{4S})\rightarrow B\overline{B}$ decay is fully reconstructed in hadronic decay modes. The accompanying signal $B$ meson is reconstructed as $\overline{B}\rightarrow D^{*} τ^{-}\overlineν_τ$ using leptonic $τ$ decays. The normalization decay, $\overline{B}\rightarrow D^{*} \ell^{-} \overlineν_{\ell}$, where $\ell$ is an electron or muon, produces the same observable final state particles. The ratio of branching fractions is extracted in a simultaneous fit to two signal-discriminating variables in both channels and yields $R(D^{*}) = 0.262~_{-0.039}^{+0.041}(\mathrm{stat})~_{-0.032}^{+0.035}(\mathrm{syst})$. This result is consistent with the current world average and with standard model predictions. △ Less

Submitted 5 January, 2024; originally announced January 2024.

Comments: 16 pages, 17 figures, submitted to PRD

arXiv:2401.02624 [pdf, other]

Correlation-enhanced viable core in metabolic networks

Authors: Mi ** Lee, Sudo Yi, Deok-Sun Lee

Abstract: Cellular ingredient concentrations can be stabilized by adjusting generation and consumption rates through multiple pathways. To explore the portion of cellular metabolism equipped with multiple pathways, we categorize individual metabolic reactions and compounds as viable or inviable: A compound is viable if processed by two or more reactions, and a reaction is viable if all of its substrates and… ▽ More Cellular ingredient concentrations can be stabilized by adjusting generation and consumption rates through multiple pathways. To explore the portion of cellular metabolism equipped with multiple pathways, we categorize individual metabolic reactions and compounds as viable or inviable: A compound is viable if processed by two or more reactions, and a reaction is viable if all of its substrates and products are viable. Using this classification, we identify the maximal subnetwork of viable nodes, referred to as the {\it viable core}, in bipartite metabolic networks across thousands of species. The obtained viable cores are remarkably larger than those in degree-preserving randomized networks, while their broad degree distributions commonly enable the viable cores to shrink gradually as reaction nodes are deleted. We demonstrate that the positive degree-degree correlations of the empirical networks may underlie the enlarged viable cores compared to the randomized networks. By investigating the relation between degree and cross-species frequency of metabolic compounds and reactions, we elucidate the evolutionary origin of the correlations. △ Less

Submitted 4 January, 2024; originally announced January 2024.

Comments: 8 pages, 4 figures

Showing 151–200 of 2,834 results for author: Lee, M