Search | arXiv e-print repository

Reward Generalization in RLHF: A Topological Perspective

Authors: Tianyi Qiu, Fanzhi Zeng, Jiaming Ji, Dong Yan, Kaile Wang, Jiayi Zhou, Yang Han, Josef Dai, Xuehai Pan, Yaodong Yang

Abstract: Existing alignment methods share a common topology of information flow, where reward information is collected from humans, modeled with preference learning, and used to tune language models. However, this shared topology has not been systematically characterized, nor have its alternatives been thoroughly explored, leaving the problems of low data efficiency and unreliable generalization unaddressed. As a solution, we introduce a theoretical framework for investigating reward generalization in reinforcement learning from human feedback (RLHF), focusing on the topology of information flow at both macro and micro levels. At the macro level, we portray the RLHF information flow as an autoencoding process over behavior distributions, formalizing the RLHF objective of distributional consistency between human preference and model behavior. At the micro level, we present induced Bayesian networks as a theory of reward generalization in RLHF, introducing fine-grained dataset topologies into generalization bounds. Combining analysis on both levels, we propose reward modeling from tree-structured preference information. It is shown to reduce reward uncertainty by up to $Θ(\log n/\log\log n)$ times compared to baselines, where $n$ is the dataset size. Validation on three NLP tasks shows that our tree-based reward model achieves an average win rate of 65% against baseline methods, thus improving reward generalization for free via topology design.

Submitted 16 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

arXiv:2402.09325 [pdf, other]

PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments

Authors: Xiuzhong Hu, Guangming Xiong, Zheng Zang, Peng Jia, Yuxuan Han, Junyi Ma

Abstract: Large-scale 3D scene reconstruction and novel view synthesis are vital for autonomous vehicles, especially utilizing temporally sparse LiDAR frames. However, conventional explicit representations remain a significant bottleneck towards representing the reconstructed and synthetic scenes at unlimited resolution. Although the recently developed neural radiance fields (NeRF) have shown compelling results in implicit representations, the problem of large-scale 3D scene reconstruction and novel view synthesis using sparse LiDAR frames remains unexplored. To bridge this gap, we propose a 3D scene reconstruction and novel view synthesis framework called parent-child neural radiance field (PC-NeRF). Based on its two modules, parent NeRF and child NeRF, the framework implements hierarchical spatial partitioning and multi-level scene representation, including scene, segment, and point levels. The multi-level scene representation enhances the efficient utilization of sparse LiDAR point cloud data and enables the rapid acquisition of an approximate volumetric scene representation. With extensive experiments, PC-NeRF is proven to achieve high-precision novel LiDAR view synthesis and 3D reconstruction in large-scale scenes. Moreover, PC-NeRF can effectively handle situations with sparse LiDAR frames and demonstrate high deployment efficiency with limited training epochs. Our approach implementation and the pre-trained models are available at https://github.com/biter0088/pc-nerf.

Submitted 14 February, 2024; originally announced February 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2310.00874

arXiv:2402.08980 [pdf]

OmniBOR: A System for Automatic, Verifiable Artifact Resolution across Software Supply Chains

Authors: Bharathi Seshadri, Yongkui Han, Chris Olson, David Pollak, Vojislav Tomasevic

Abstract: Software supply chain attacks, which exploit the build process or artifacts used in the process of building a software product, are increasingly of concern. To combat these attacks, one must be able to check that every artifact that a software product depends on does not contain vulnerabilities. In this paper, we introduce OmniBOR (Universal Bill of Receipts), a minimalistic scheme for build tools to create an artifact dependency graph which can be used to track every software artifact incorporated into a built software product. We present the architecture of OmniBOR, the underlying data representations, and two implementations that produce OmniBOR data and embed an OmniBOR Identifier into built software, including a compiler-based approach and one based on tracing the build process. We demonstrate the efficacy of this approach on benchmarks including a Linux distribution for applications such as Common Vulnerabilities and Exposures (CVE) detection and software bill of materials (SBOM) computation.

Submitted 14 February, 2024; originally announced February 2024.

arXiv:2402.07523 [pdf, other]

doi 10.1109/IWSC60764.2023.00010

Using Ensemble Inference to Improve Recall of Clone Detection

Authors: Gul Aftab Ahmed, James Vincent Patten, Yuanhua Han, Guoxian Lu, David Gregg, Jim Buckley, Muslim Chochlov

Abstract: Large-scale source-code clone detection is a challenging task. In our previous work, we proposed an approach (SSCD) that leverages artificial neural networks and approximate nearest neighbour search to effectively and efficiently locate clones in large-scale bodies of code, in a time-efficient manner. However, our literature review suggests that the relative efficacy of differing neural network models has not been assessed in the context of large-scale clone detection approaches. In this work, we aim to assess several such models individually, in terms of their potential to maximize recall, while preserving a high level of precision during clone detection. We investigate if ensemble inference (in this case, using the results of more than one of these neural network models in combination) can further assist in this task. To assess this, we employed four state-of-the-art neural network models and evaluated them individually/in combination. The results, on an illustrative dataset of approximately 500K lines of C/C++ code, suggest that ensemble inference outperforms individual models in all trialled cases, when recall is concerned. Of individual models, the ADA model (belonging to the ChatGPT family of models) has the best performance. However, commercial companies may not be prepared to hand their proprietary source code over to the cloud, as required by that approach. Consequently, they may be more interested in an ensemble-combination of CodeBERT-based and CodeT5 models, resulting in similar (if slightly lesser) recall and precision results.

Submitted 12 February, 2024; originally announced February 2024.

Journal ref: 2023 IEEE 17th International Workshop on Software Clones (IWSC)

arXiv:2402.07123 [pdf, other]

Empirical Analysis of Quantum Approximate Optimization Algorithm for Knapsack-based Financial Portfolio Optimization

Authors: Chansreynich Huot, Kimleang Kea, Tae-Kyung Kim, Youngsun Han

Abstract: Portfolio optimization is a primary component of the decision-making process in finance, aiming to tactfully allocate assets to achieve optimal returns while considering various constraints. Herein, we propose a method that uses the knapsack-based portfolio optimization problem and incorporates the quantum computing capabilities of the quantum walk mixer with the quantum approximate optimization algorithm (QAOA) to address the challenges presented by the NP-hard problem. Additionally, we present the sequential procedure of our suggested approach and demonstrate empirical proof to illustrate the effectiveness of the proposed method in finding the optimal asset allocations across various constraints and asset choices. Moreover, we discuss the effectiveness of the QAOA components in relation to our proposed method. Consequently, our study successfully achieves the approximate ratio of the portfolio optimization technique using a circuit layer of p >= 3, compared to the classical best-known solution of the knapsack problem. Our proposed methods potentially contribute to the growing field of quantum finance by offering insights into the potential benefits of employing quantum algorithms for complex optimization tasks in financial portfolio management.

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2402.06952 [pdf, other]

Estimating the Effect of Crosstalk Error on Circuit Fidelity Using Noisy Intermediate-Scale Quantum Devices

Authors: Sovanmonynuth Heng, Myeongseong Go, Youngsun Han

Abstract: Current advancements in technology have focused the attention of the quantum computing community toward exploring the potential of near-term devices whose computing power surpasses that of classical computers in practical applications. An unresolved central question revolves around whether the inherent noise in these devices can be overcome or whether any potential quantum advantage would be limited. There is no doubt that crosstalk is one of the main sources of noise in noisy intermediate-scale quantum (NISQ) systems, and it poses a fundamental challenge to hardware designs. Crosstalk between parallel instructions can corrupt quantum states and cause incorrect program execution. In this study, we present a necessary analysis of the crosstalk error effect on NISQ devices. Our approach is extremely straightforward and practical to estimate the crosstalk error of various multi-qubit devices. In particular, we combine the randomized benchmarking (RB) and simultaneous randomized benchmarking (SRB) protocol to estimate the crosstalk error from the correlation controlled-NOT (CNOT) gate. We demonstrate this protocol experimentally on 5-, 7-, and 16-qubit devices. Our results demonstrate the crosstalk error model of three different IBM quantum devices over the experimental week and compare the error variation against the machine, number of qubits, quantum volume, processor, and topology. We then confirm the improvement in the circuit fidelity on different benchmarks by up to 3.06x via inserting an instruction barrier, as compared with an IBM quantum noisy device which offers near-optimal crosstalk mitigation in practice. Finally, we discuss the current system limitation, its tradeoff on fidelity and depth, noise beyond the NISQ system, and mitigation opportunities to ensure that the quantum operation can perform its quantum magic undisturbed.

Submitted 17 May, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

arXiv:2402.06289 [pdf, other]

Evaluating Membership Inference Attacks and Defenses in Federated Learning

Authors: Gongxi Zhu, Donghao Li, Hanlin Gu, Yuxing Han, Yuan Yao, Lixin Fan, Qiang Yang

Abstract: Membership Inference Attacks (MIAs) pose a growing threat to privacy preservation in federated learning. The semi-honest attacker, e.g., the server, may determine whether a particular sample belongs to a target client according to the observed model information. This paper conducts an evaluation of existing MIAs and corresponding defense strategies. Our evaluation on MIAs reveals two important findings about the trend of MIAs. Firstly, combining model information from multiple communication rounds (Multi-temporal) enhances the overall effectiveness of MIAs compared to utilizing model information from a single epoch. Secondly, incorporating models from non-target clients (Multi-spatial) significantly improves the effectiveness of MIAs, particularly when the clients' data is homogeneous. This highlights the importance of considering the temporal and spatial model information in MIAs. Next, we assess the effectiveness via privacy-utility tradeoff for two types of defense mechanisms against MIAs: Gradient Perturbation and Data Replacement. Our results demonstrate that Data Replacement mechanisms achieve a more optimal balance between preserving privacy and maintaining model utility. Therefore, we recommend the adoption of Data Replacement methods as a defense strategy against MIAs. Our code is available at https://github.com/Liar-Mask/FedMIA.

Submitted 9 February, 2024; originally announced February 2024.

Comments: 11 pages, 4 figures

arXiv:2402.05590 [pdf, ps, other]

Deformed Fréchet law for Wigner and sample covariance matrices with tail in crossover regime

Authors: Yi Han

Abstract: Given $A_n:=\frac{1}{\sqrt{n}}(a_{ij})$ an $n\times n$ symmetric random matrix, with elements above the diagonal given by i.i.d. random variables having mean zero and unit variance. It is known that when $\lim_{x\to\infty}x^4\mathbb{P}(|a_{ij}|>x)=0$, then fluctuation of the largest eigenvalue of $A_n$ follows a Tracy-Widom distribution. When the law of $a_{ij}$ is regularly varying with index $α\in(0,4)$, then the largest eigenvalue has a Fréchet distribution. An intermediate regime is recently uncovered in \cite{diaconu2023more}: when $\lim_{x\to\infty}x^4\mathbb{P}(|a_{ij}|>x)=c\in(0,\infty)$, then the law of the largest eigenvalue follows a deformed Fréchet distribution. In this work we vastly extend the scope where the latter distribution may arise. We show that the same deformed Fréchet distribution arises (1) for sparse Wigner matrices with an average of $n^{O(1)}$ nonzero entries on each row; (2) for periodically banded Wigner matrices with bandwidth $d_n=n^{O(1)}$; and more generally for weighted adjacency matrices of any $k_n$-regular graphs with $k_n=n^{O(1)}$. In all these cases, we further prove that the joint distribution of the finitely many largest eigenvalues of $A_n$ form a deformed Poisson process, and that eigenvectors of the outlying eigenvalues of $A_n$ are localized, implying a mobility edge phenomenon at the spectral edge $2$. The sparser case with average degree $n^{o(1)}$ is also explored. Our technique extends to sample covariance matrices, proving for the first time that its largest eigenvalue still follows a deformed Fréchet distribution, assuming the matrix entries satisfy $\lim_{x\to\infty}x^4\mathbb{P}(|a_{ij}|>x)=c\in(0,\infty)$.

Submitted 3 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 22 pages

arXiv:2402.05383 [pdf, other]

First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546$\pm$0.076 for $^8$He, and 6.73$\pm$0.73, 6.75$\pm$0.70, and 13.74$\pm$0.82 for $^9$Li at average muon energies of 63.9~GeV, 64.7~GeV, and 143.0~GeV, respectively. The measured production rate of $^8$He isotopes is more than an order of magnitude lower than any other measurement of cosmogenic isotope production. It replaces the results of previous attempts to determine the ratio of $^8$He to $^9$Li production that yielded a wide range of limits from 0 to 30\%. The results provide future liquid-scintillator-based experiments with improved ability to predict cosmogenic backgrounds.

Submitted 7 February, 2024; originally announced February 2024.

arXiv:2402.05076 [pdf, ps, other]

Markovian Analysis of Information Cascades with Fake Agents

Authors: Yuming Han

Abstract: People often learn from others' actions when they make decisions while doing online shopping. This kind of observational learning may lead to information cascades, which means agents might ignore their own signals and follow the 'trend' created collectively by the actions of their predecessors. It is well-known that with rational agents, such a cascade model can result in either correct or incorrect cascades. In this paper, we additionally consider the presence of fake agents who always take fixed actions and we investigate their influence on the outcome of these cascades. We propose an infinite Markov Chain sequence structure and a tree structure to analyze how the fraction and the type of such fake agents impact the behavior of the upcoming agents. We show that an increase in the fraction of fake agents may reduce the chances of their preferred outcome, and also that there is a certain lower bound for the probability of a wrong cascade. In particular, we discuss the case where the probability of an agent being fake tends to 1 and the effect of a constant portion of fake agents.

Submitted 7 February, 2024; originally announced February 2024.

arXiv:2402.04616 [pdf, other]

TinyLLM: Learning a Small Student from Multiple Large Language Models

Authors: Yijun Tian, Yikun Han, Xiusi Chen, Wei Wang, Nitesh V. Chawla

Abstract: Transferring the reasoning capability from stronger large language models (LLMs) to smaller ones has been quite appealing, as smaller LLMs are more flexible to deploy with less expense. Among the existing solutions, knowledge distillation stands out due to its outstanding efficiency and generalization. However, existing methods suffer from several drawbacks, including limited knowledge diversity and the lack of rich contextual information. To solve the problems and facilitate the learning of compact language models, we propose TinyLLM, a new knowledge distillation paradigm to learn a small student LLM from multiple large teacher LLMs. In particular, we encourage the student LLM to not only generate the correct answers but also understand the rationales behind these answers. Given that different LLMs possess diverse reasoning skills, we guide the student model to assimilate knowledge from various teacher LLMs. We further introduce an in-context example generator and a teacher-forcing Chain-of-Thought strategy to ensure that the rationales are accurate and grounded in contextually appropriate scenarios. Extensive experiments on six datasets across two reasoning tasks demonstrate the superiority of our method. Results show that TinyLLM can outperform large teacher LLMs significantly, despite a considerably smaller model size.

Submitted 31 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

arXiv:2402.04279 [pdf]

Electrokinetic origin of swirling flow on nanoscale interface

Authors: Shuangshuang Meng, Yu Han, Wei Zhao, Yueqiang Zhu, Chen Zhang, Xiaoqiang Feng, Ce Zhang, Duyang Zang, Guangyin Jing, Kaige Wang

Abstract: The zeta ($ζ$) potential is a pivotal metric for characterizing the electric field topology within an electric double layer - an important phenomenon on phase interface. It underpins critical processes in diverse realms such as chemistry, biomedical engineering, and micro/nanofluidics. Yet, local measurement of $ζ$ potential at the interface has historically presented challenges, leading researchers to simplify a chemically homogenized surface with a uniform $ζ$ potential. In the current investigation, we present evidence that, within a microchannel, the spatial distribution of $ζ$ potential across a chemically homogeneous solid-liquid interface can become two-dimensional (2D) under an imposed flow regime, as disclosed by a state-of-the-art fluorescence photobleaching electrochemistry analyzer (FLEA) technique. The $ζ$ potential's propensity to become increasingly negative downstream presents an approximately symmetric, V-shaped pattern in the spanwise orientation. Intriguingly, and of notable significance to chemistry and engineering, this 2D $ζ$ potential framework was found to electrokinetically induce swirling flows in tens of nanometers, aligning with the streamwise axis, bearing a remarkable resemblance to the well-documented hairpin vortices in turbulent boundary layers. Our findings gesture towards a novel perspective on the genesis of vortex structures in nanoscale. Additionally, the FLEA technique emerges as a potent tool for discerning $ζ$ potential at a local scale with high resolution, potentially accelerating the evolution and applications of novel surface material.

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.03829 [pdf, ps, other]

Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ and Observation of One Structure between $\sqrt{s} = 3.80-4.95$ GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections for both processes, one possible new structure around 3.9 GeV/$c^2$ is observed for the first time, in addition to seven known resonances $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $Y(4230)$, $Y(4360)$, $ψ(4415)$, and $Y(4660)$. These results offer crucial experimental insights into the nature of hadron production in the open charm region.

Submitted 6 February, 2024; originally announced February 2024.

Comments: 9 pages, 4 figures, 1 tables, 1 Supplemental_Material

arXiv:2402.03713 [pdf, other]

Measurement of $CP$ asymmetries in $B^0\toη'K^0_S$ decays at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker, J. V. Bennett , et al. (377 additional authors not shown)

Abstract: We describe a measurement of charge-parity ($CP$) violation asymmetries in $B^0\toη'K^0_S$ decays using Belle II data. We consider $η'\toη(\toγγ)π^+π^-$ and $η'\toρ(\toπ^+π^-)γ$ decays. The data were collected at the SuperKEKB asymmetric-energy $e^+e^-$ collider between the years 2019 and 2022, and contain $(387\pm 6) \times 10^6$ bottom-antibottom meson pairs. We reconstruct $829\pm35$ signal decays and extract the $CP$ violating parameters from a fit to the distribution of the proper-decay-time difference between the two $B$ mesons. The measured direct and mixing-induced $CP$ asymmetries are $\text{C}_{η'K^0_S} = -0.19 \pm 0.08 \pm 0.03 $ and $\text{S}_{η'K^0_S} = +0.67 \pm 0.10 \pm 0.04 $, respectively, where the first uncertainties are statistical and the second are systematic. These results are in agreement with current world averages and standard model predictions.

Submitted 6 February, 2024; originally announced February 2024.

Report number: Belle II Preprint 2024-003, KEK Preprint 2023-50

arXiv:2402.02995 [pdf, other]

XiHe: A Data-Driven Model for Global Ocean Eddy-Resolving Forecasting

Authors: Xiang Wang, Renzhi Wang, Ningzi Hu, Pinqiang Wang, Peng Huo, Guihua Wang, Huizan Wang, Senzhang Wang, Junxing Zhu, Jianbo Xu, Jun Yin, Senliang Bao, Ciqiang Luo, Ziqing Zu, Yi Han, Weimin Zhang, Kaijun Ren, Kefeng Deng, Junqiang Song

Abstract: The leading operational Global Ocean Forecasting Systems (GOFSs) use physics-driven numerical forecasting models that solve the partial differential equations with expensive computation. Recently, specifically in atmosphere weather forecasting, data-driven models have demonstrated significant potential for speeding up environmental forecasting by orders of magnitude, but there is still no data-driven GOFS that matches the forecasting accuracy of the numerical GOFSs. In this paper, we propose the first data-driven 1/12° resolution global ocean eddy-resolving forecasting model named XiHe, which is established from the 25-year France Mercator Ocean International's daily GLORYS12 reanalysis data. XiHe is a hierarchical transformer-based framework coupled with two special designs. One is the land-ocean mask mechanism for focusing exclusively on the global ocean circulation. The other is the ocean-specific block for effectively capturing both local ocean information and global teleconnection. Extensive experiments are conducted under satellite observations, in situ observations, and the IV-TT Class 4 evaluation framework of the world's leading operational GOFSs from January 2019 to December 2020. The results demonstrate that XiHe achieves stronger forecast performance in all testing variables than existing leading operational numerical GOFSs including Mercator Ocean Physical SYstem (PSY4), Global Ice Ocean Prediction System (GIOPS), BLUElinK OceanMAPS (BLK), and Forecast Ocean Assimilation Model (FOAM). Particularly, the accuracy of ocean current forecasting of XiHe out to 60 days is even better than that of PSY4 in just 10 days. Additionally, XiHe is able to forecast the large-scale circulation and the mesoscale eddies. Furthermore, it can make a 10-day forecast in only 0.36 seconds, which accelerates the forecast speed by thousands of times compared to the traditional numerical GOFSs.

Submitted 8 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.02505 [pdf]

doi 10.1016/J.APSUSC.2022.155502

Multi-functional oxidase-like activity of praseodymia nanorods and nanoparticles

Authors: Jiang Lei, Yaning Han, Susana Fernández-García, Miguel Tinoco Rivas, Zhuang Li, Pengli Nan, Jingtao Sun, Juan José Delgado Jaén, Huiyan Pan, Ginesa Francisco Martínez-López Blanco, Ana Belén Hungría, José Juan Calvino, Xiaowei Chen

Abstract: The ability to mimic protein-based oxidase with multi-functional inorganic nanozymes would greatly advance biomedical and clinical practices. Praseodymia (PrOx) nanorods (NRs) and nanoparticles (NPs) have been synthesized using hydrothermal and precipitation methods. Both PrOx catalysts with different morphologies exhibit significantly higher oxidase-like activities (Michaelis-Menten constant Km < 0.026 mM) than commercial PrOx and most so-far-reported artificial enzymes. One of the substrates, dopamine, can be oxidized and further polymerized to generate polydopamine in acidic conditions. Akin to CeO2, which is a well-studied nanozyme, a different mechanism involving holes+, oxygen vacancies and oxygen mobility over PrOx catalysts has been proposed in this work. However, fluoride ions were found to impose opposite effects on the oxidase-mimicking activity of PrOx and CeO2, implying a promising path for the exploration of new nanozymes. In support of this, PrOx was further applied in colorimetric sensing of L-cysteine and fluoride with high sensitivity.

Submitted 4 February, 2024; originally announced February 2024.

Comments: 13 pages, 12 figures

Journal ref: Applied Surface Science 2023, 610, 155502

arXiv:2402.01763 [pdf, other]

When Large Language Models Meet Vector Databases: A Survey

Authors: Zhi Jing, Yongye Su, Yikun Han, Bo Yuan, Haiyun Xu, Chunjiang Liu, Kehai Chen, Min Zhang

Abstract: This survey explores the synergistic potential of Large Language Models (LLMs) and Vector Databases (VecDBs), a burgeoning but rapidly evolving research area. With the proliferation of LLMs comes a host of challenges, including hallucinations, outdated knowledge, prohibitive commercial application costs, and memory issues. VecDBs emerge as a compelling solution to these issues by offering an efficient means to store, retrieve, and manage the high-dimensional vector representations intrinsic to LLM operations. Through this nuanced review, we delineate the foundational principles of LLMs and VecDBs and critically analyze their integration's impact on enhancing LLM functionalities. This discourse extends into a discussion on the speculative future developments in this domain, aiming to catalyze further research into optimizing the confluence of LLMs and VecDBs for advanced data handling and knowledge extraction capabilities.

Submitted 5 February, 2024; v1 submitted 30 January, 2024; originally announced February 2024.

arXiv:2402.00552 [pdf, other]

Topological Defects as Nucleation Points of the Nematic--Isotropic Phase Transition in Liquid Crystal Shells

Authors: Yucen Han, Jan Lagerwall, Apala Majumdar

Abstract: The transition from a nematic to an isotropic state in a self-closing spherical liquid crystal shell with tangential alignment is a stimulating phenomenon to investigate, as the topology dictates that the shell exhibits local isotropic points at all temperatures in the nematic phase range, in the form of topological defects. The defects may thus be expected to act as nucleation points for the phase transition upon heating beyond the bulk nematic stability range. Here we study this peculiar transition, theoretically and experimentally, for shells with two different configurations of four +1/2 defects, finding that the defects act as the primary nucleation points if they are co-localized in each other's vicinity. If the defects are instead spread out across the shell, they again act as nucleation points, albeit not necessarily the primary ones. Beyond adding to our understanding of how the orientational order--disorder transition can take place in the shell geometry, our results have practical relevance for, e.g., the use of curved liquid crystals in sensing applications or for liquid crystal elastomer actuators in shell shape, undergoing a shape change as a result of the nematic--isotropic transition.

Submitted 1 February, 2024; originally announced February 2024.

arXiv:2401.17865 [pdf, other]

Manipulating Predictions over Discrete Inputs in Machine Teaching

Authors: Xiaodong Wu, Yufei Han, Hayssam Dahrouj, Jianbing Ni, Zhenwen Liang, Xiangliang Zhang

Abstract: Machine teaching often involves the creation of an optimal (typically minimal) dataset to help a model (referred to as the `student') achieve specific goals given by a teacher. While abundant in the continuous domain, the studies on the effectiveness of machine teaching in the discrete domain are relatively limited. This paper focuses on machine teaching in the discrete domain, specifically on manipulating student models' predictions based on the goals of teachers via changing the training data efficiently. We formulate this task as a combinatorial optimization problem and solve it by proposing an iterative searching algorithm. Our algorithm demonstrates significant numerical merit in the scenarios where a teacher attempts to correct erroneous predictions to improve the student's model, or to maliciously manipulate the model into misclassifying some specific samples into a target class aligned with the teacher's personal profit. Experimental results show that our proposed algorithm can have superior performance in effectively and efficiently manipulating the predictions of the model, surpassing conventional baselines.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 8 pages, 2 figures

ACM Class: I.2.6

arXiv:2401.17467 [pdf]

An entropy-based measurement for understanding origin-destination trip distributions: a case study of New York City taxis

Authors: Yuqin Jiang, Yihong Yuan, Su Yeon Han

Abstract: A comprehensive understanding of human mobility patterns in urban areas is essential for urban development and transportation planning. In this study, we create entropy-based measurements to capture the geographical distribution diversity of trip origins and destinations. Specifically, we develop origin-entropy and destination-entropy based on taxi and ride-sharing trip records. The origin-entropy for a given zone accounts for all the trips that originate from this zone and calculates the level of geographical distribution diversity of these trips' destinations. Likewise, the destination-entropy for a given zone considers all the trips that end in this zone and calculates the level of geographical distribution diversity of these trips' origins. Furthermore, we have created an interactive geovisualization that enables researchers to delve into and juxtapose the spatial and temporal dynamics of origin and destination entropy, in conjunction with trip counts for both origins and destinations. Results indicate that entropy-based measurements effectively capture shifts in the diversity of trips' geographical origins and destinations, reflecting changes in travel decisions due to major events like the COVID-19 pandemic. These measurements, alongside trip counts, offer a more comprehensive understanding of urban human flows.

Submitted 16 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

arXiv:2401.17457 [pdf, other]

Socially Aware V2X Localized QoS

Authors: Rafael Kaliski, Yue-hua Han

Abstract: Vehicle-to-everything (V2X) is a core 5G technology. V2X and its enabler, Device-to-Device (D2D), are essential for the Internet of Things (IoT) and the Internet of Vehicles (IoV). V2X enables vehicles to communicate with other vehicles (V2V), networks (V2N), and infrastructure (V2I). While V2X enables ubiquitous vehicular connectivity, the impact of bursty data on the network's overall Quality of Service (QoS), such as when a vehicle accident occurs, is often ignored. In this work, we study both 4G and 5G V2X utilizing Evolved Universal Terrestrial Radio Access New Radio (E-UTRA-NR) and propose the use of socially aware 5G NR Dual Connectivity (en-DC) for traffic differentiation. We also propose localized QoS, wherein high-priority QoS flows traverse 5G road side units (RSUs) and normal-priority QoS flows traverse 4G base stations (BSs). We formulate a max-min fair QoS-aware Non-Orthogonal Multiple Access (NOMA) resource allocation scheme, QoS reclassify. QoS reclassify enables localized QoS and traffic steering to mitigate bursty network traffic's impact on the network's overall QoS. We then solve QoS reclassify via Integer Linear Programming (ILP) and derive its approximation. We demonstrate that both optimal and approximation QoS reclassify resource allocation schemes in our socially aware QoS management methodology outperform socially unaware legacy 4G V2X algorithms (no localized QoS support, no traffic steering) and socially aware 5G V2X (no localized QoS support, yet utilizes traffic steering). Our proposed QoS reclassify scheme's QoS flow end-to-end latency requires only $\approx 15\%$ of the time legacy 4G V2X requires.

Submitted 30 January, 2024; originally announced January 2024.

Comments: This work has been submitted to IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible. Under review by IEEE Internet of Things journal

arXiv:2401.15604 [pdf, ps, other]

Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization

Authors: Yinbin Han, Meisam Razaviyayn, Renyuan Xu

Abstract: Diffusion models have emerged as a powerful tool rivaling GANs in generating high-quality samples with improved fidelity, flexibility, and robustness. A key component of these models is to learn the score function through score matching. Despite empirical success on various tasks, it remains unclear whether gradient-based algorithms can learn the score function with a provable accuracy. As a first step toward answering this question, this paper establishes a mathematical framework for analyzing score estimation using neural networks trained by gradient descent. Our analysis covers both the optimization and the generalization aspects of the learning procedure. In particular, we propose a parametric form to formulate the denoising score-matching problem as a regression with noisy labels. Compared to the standard supervised learning setup, the score-matching problem introduces distinct challenges, including unbounded input, vector-valued output, and an additional time variable, preventing existing techniques from being applied directly. In this paper, we show that with proper designs, the evolution of neural networks during training can be accurately modeled by a series of kernel regression tasks. Furthermore, by applying an early-stopping rule for gradient descent and leveraging recent developments in neural tangent kernels, we establish the first generalization error (sample complexity) bounds for learning the score function with neural networks, despite the presence of noise in the observations. Our analysis is grounded in a novel parametric form of the neural network and an innovative connection between score matching and regression analysis, facilitating the application of advanced statistical and optimization techniques.

Submitted 12 March, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

Comments: 39 pages

arXiv:2401.14720 [pdf, ps, other]

Observation of structures in the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

Abstract: We present measurements of the Born cross sections for the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$ at center-of-mass energies $\sqrt{s}$ from 4.308 to 4.951 GeV. The measurements are performed with data samples corresponding to an integrated luminosity of 11.0 $\rm{fb}^{-1}$ collected with the BESIII detector operating at the BEPCII storage ring. Assuming the $e^+e^-\rightarrowωχ_{c2}$ signals come from a single resonance, the mass and width are determined to be $M=(4413.6\pm9.0\pm0.8)$ MeV/$c^2$ and $Γ=(110.5\pm15.0\pm2.9)$ MeV, respectively, which is consistent with the parameters of the well-established resonance $ψ(4415)$. In addition, we also use one single resonance to describe the $e^+e^-\rightarrowωχ_{c1}$ lineshape, and determine the mass and width to be $M=(4544.2\pm18.7\pm1.7)$ MeV/$c^2$ and $Γ=(116.1\pm33.5\pm1.7)$ MeV, respectively. The structure of this lineshape, observed for the first time, requires further understanding.

Submitted 24 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: 11 pages, 8 figures, with Supplemental Material

arXiv:2401.14711 [pdf, other]

Study of $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ at $\sqrt{s}$ from 2.00 to 3.08 GeV at BESIII

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

Abstract: With the data samples taken at center-of-mass energies from 2.00 to 3.08 GeV with the BESIII detector at the BEPCII collider, a partial wave analysis on the $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ process is performed. The Born cross sections for $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ and its intermediate processes $e^{+}e^{-}\rightarrowρπ$ and $ρ(1450)π$ are measured as functions of $\sqrt{s}$. The results for $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ are consistent with previous results measured with the initial state radiation method within one standard deviation, and improve the uncertainty by a factor of ten. By fitting the line shapes of the Born cross sections for the $e^{+}e^{-}\rightarrowρπ$ and $ρ(1450)π$, a structure with mass $M = 2119\pm11\pm15\ {\rm MeV}/c^2$ and width $Γ=69\pm30\pm5\ {\rm MeV}$ is observed with a significance of $5.9σ$, where the first uncertainties are statistical and the second ones are systematic. This structure can be interpreted as an excited $ω$ state.

Submitted 26 January, 2024; originally announced January 2024.

arXiv:2401.13285 [pdf, other]

Small Object Tracking in LiDAR Point Cloud: Learning the Target-awareness Prototype and Fine-grained Search Region

Authors: Shengjing Tian, Yinan Han, ** Liu, Xiantong Zhao

Abstract: Single Object Tracking in LiDAR point cloud is one of the most essential parts of environmental perception, in which small objects are inevitable in real-world scenarios and will bring a significant barrier to the accurate location. However, the existing methods concentrate more on exploring universal architectures for common categories and overlook the fact that small objects have long been a thorny challenge due to the relative deficiency of foreground points and a low tolerance for disturbances. To this end, we propose a Siamese network-based method for small object tracking in the LiDAR point cloud, which is composed of the target-awareness prototype mining (TAPM) module and the regional grid subdivision (RGS) module. The TAPM module adopts the reconstruction mechanism of the masked decoder to learn the prototype in the feature space, aiming to highlight the presence of foreground points that will facilitate the subsequent location of small objects. Though the above prototype is capable of accentuating the small object of interest, the positioning deviation in feature maps still leads to high tracking errors. To alleviate this issue, the RGS module is proposed to recover the fine-grained features of the search region based on ViT and pixel shuffle layers. In addition, apart from the normal settings, we elaborately design a scaling experiment to evaluate the robustness of the different trackers on small objects. Extensive experiments on KITTI and nuScenes demonstrate that our method can effectively improve the tracking performance of small targets without affecting normal-sized objects.

Submitted 24 January, 2024; originally announced January 2024.

arXiv:2401.12495 [pdf, other]

Improving Zero-noise Extrapolation for Quantum-gate Error Mitigation using a Noise-aware Folding Method

Authors: Leanghok Hour, Myeongseong Go, Youngsun Han

Abstract: Recent thousand-qubit processors represent a significant hardware advancement, but current limitations prevent effective quantum error correction (QEC), necessitating reliance on quantum error mitigation (QEM) to enhance result fidelity from quantum computers. Our paper introduces a noise-aware folding technique that enhances Zero-Noise Extrapolation (ZNE) by leveraging the noise characteristics of target quantum hardware to fold circuits more efficiently. Unlike traditional ZNE approaches assuming uniform error distribution, our method redistributes noise using calibration data based on hardware noise models. By employing a noise-adaptive compilation method combined with our proposed folding mechanism, we enhance the ZNE accuracy of quantum gate-based computing using superconducting quantum computers. This paper highlights the uniqueness of our method, summarizes noise accumulation, presents the scaling algorithm, and compares the reliability of our method with those of existing models using a linear extrapolation model. Experimental results show that compared to existing folding methods, our approach achieved a 35% improvement on quantum computer simulators and a 31% improvement on real quantum computers, demonstrating the effectiveness of our proposed approach.

Submitted 14 May, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

Comments: 22 pages, 6 figures

arXiv:2401.12021 [pdf, other]

Study of $Υ(10753)$ decays to $π^{+}π^{-}Υ(nS)$ final states at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker , et al. (371 additional authors not shown)

Abstract: We present an analysis of the process $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ (where $n$ = 1, 2, or 3) reconstructed in $19.6\ \rm fb^{-1}$ of Belle II data during a special run of the SuperKEKB collider at four energy points near the peak of the $Υ(10753)$ resonance. By analyzing the mass distribution of the $π^+π^-Υ(nS)$ system and the Born cross sections of the $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ process, we report the first observation of $Υ(10753)$ decays to the $π^{+}π^{-}Υ(1S)$ and $π^{+}π^{-}Υ(2S)$ final states, and find no evidence for decays to $π^{+}π^{-}Υ(3S)$. Possible intermediate states in the $π^+π^-Υ(1S,2S)$ transitions are also investigated, and no evidence for decays proceeding via the $π^\mp Z_b^\pm$ or $f_0(980)Υ(nS)$ intermediate states is found. We measure Born cross sections for the $e^{+}e^{-}\toπ^{+}π^{-}Υ(nS)$ process that, combined with results from Belle, improve the precision of measurements of the $Υ(10753)$ mass and width by nearly a factor of two to $(10756.3\pm2.7\pm0.6)$ MeV/$c^2$ and $(29.7\pm8.5\pm1.1)$ MeV, respectively. The relative ratios of the Born cross sections at the $Υ(10753)$ resonance peak are also reported for the first time.

Submitted 18 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.11294 [pdf, other]

Exponentially slow thermalization and the robustness of Hilbert space fragmentation

Authors: Yiqiu Han, Xiao Chen, Ethan Lake

Abstract: The phenomenon of Hilbert space fragmentation, whereby dynamical constraints fragment Hilbert space into many disconnected sectors, provides a simple mechanism by which thermalization can be arrested. However, little is known about how thermalization occurs in situations where the constraints are not exact. To study this, we consider a situation in which a fragmented 1d chain with pair-flip constraints is coupled to a thermal bath at its boundary. For product states quenched under Hamiltonian dynamics, we numerically observe an exponentially long thermalization time, manifested in both entanglement dynamics and the relaxation of local observables. To understand this, we study an analogous model of random unitary circuit dynamics, where we rigorously prove that the thermalization time scales exponentially with system size. Slow thermalization in this model is shown to be a consequence of strong bottlenecks in configuration space, demonstrating a new way of producing anomalously slow thermalization dynamics.

Submitted 20 January, 2024; originally announced January 2024.

Comments: 4.5 + 26 pages

arXiv:2401.10273 [pdf]

Revolutionizing Pharma: Unveiling the AI and LLM Trends in the Pharmaceutical Industry

Authors: Yu Han, Jingwen Tao

Abstract: This document offers a critical overview of the emerging trends and significant advancements in artificial intelligence (AI) within the pharmaceutical industry. Detailing its application across key operational areas, including research and development, animal testing, clinical trials, hospital clinical stages, production, regulatory affairs, quality control and other supporting areas, the paper categorically examines AI's role in each sector. Special emphasis is placed on cutting-edge AI technologies like machine learning algorithms and their contributions to various aspects of pharmaceutical operations. Through this comprehensive analysis, the paper highlights the transformative potential of AI in reshaping the pharmaceutical industry's future.

Submitted 21 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

arXiv:2401.10272 [pdf, other]

Multi-Source Collaborative Gradient Discrepancy Minimization for Federated Domain Generalization

Authors: Yikang Wei, Yahong Han

Abstract: Federated Domain Generalization aims to learn a domain-invariant model from multiple decentralized source domains for deployment on an unseen target domain. Due to privacy concerns, the data from different source domains are kept isolated, which poses challenges in bridging the domain gap. To address this issue, we propose a Multi-source Collaborative Gradient Discrepancy Minimization (MCGDM) method for federated domain generalization. Specifically, we propose intra-domain gradient matching between the original images and augmented images to avoid overfitting the domain-specific information within isolated domains. Additionally, we propose inter-domain gradient matching with the collaboration of other domains, which can further reduce the domain shift across decentralized domains. Combining intra-domain and inter-domain gradient matching, our method enables the learned model to generalize well on unseen domains. Furthermore, our method can be extended to the federated domain adaptation task by fine-tuning the target model on the pseudo-labeled target domain. Extensive experiments on federated domain generalization and adaptation indicate that our method significantly outperforms the state-of-the-art methods.

Submitted 4 January, 2024; originally announced January 2024.

Comments: Accepted by AAAI 2024

arXiv:2401.09225 [pdf, other]

First measurements of the absolute branching fraction of $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^+π^-$ and upper limit on $Λ_{c}(2595)^{+}\to Λ^{+}_{c}π^+π^-$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (603 additional authors not shown)

Abstract: The absolute branching fraction of the decay $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^+π^-$ is measured for the first time to be $(50.7 \pm 5.0_{\rm{stat.}} \pm 4.9_{\rm{syst.}} )\%$ with 368.48 pb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector at the center-of-mass energies of $\sqrt{s} = 4.918$ and $4.950$ GeV. This result is lower than the naive prediction of 67\%, obtained from isospin symmetry, by more than $2σ$, thereby indicating that the novel mechanism referred to as the \textit{threshold effect}, proposed for the strong decays of $Λ_{c}(2595)^{+}$, also applies to $Λ_{c}(2625)^{+}$. This measurement is necessary to obtain the coupling constants for the transitions between $s$-wave and $p$-wave charmed baryons in heavy hadron chiral perturbation theory. In addition, we search for the decay $Λ_{c}(2595)^{+}\to Λ^{+}_{c}π^+π^-$. No significant signal is observed, and the upper limit on its branching fraction is determined to be 80.8\% at the 90\% confidence level.

Submitted 17 January, 2024; originally announced January 2024.

Comments: 8 pages, 6 figures

arXiv:2401.08719 [pdf, other]

CodeComplex: A Time-Complexity Dataset for Bilingual Source Codes

Authors: Seung-Yeop Baik, Mingi Jeon, Joonghyuk Hahn, Jungin Kim, Yo-Sub Han, Sang-Ki Ko

Abstract: Analyzing the worst-case time complexity of a code is a crucial task in computer science and software engineering for ensuring the efficiency, reliability, and robustness of software systems. However, it is well-known that the problem of determining the worst-case time complexity of a given code written in general-purpose programming language is theoretically undecidable by the famous Halting problem proven by Alan Turing. Thus, we move towards more realistic scenarios where the inputs and outputs of a program exist. This allows us to discern the correctness of given codes, challenging to analyze their time complexity exhaustively. In response to this challenge, we introduce CodeComplex, a novel source code dataset where each code is manually annotated with a corresponding worst-case time complexity. CodeComplex comprises 4,900 Java codes and an equivalent number of Python codes, all sourced from programming competitions and annotated with complexity labels by a panel of algorithmic experts. To the best of our knowledge, CodeComplex stands as the most extensive code dataset tailored for predicting complexity. Subsequently, we present the outcomes of our experiments employing various baseline models, leveraging state-of-the-art neural models in code comprehension like CodeBERT, GraphCodeBERT, UniXcoder, PLBART, CodeT5, CodeT5+, and ChatGPT. We analyze how the dataset impacts the model's learning in predicting time complexity.

Submitted 16 January, 2024; originally announced January 2024.

arXiv:2401.08121 [pdf, other]

CycLight: learning traffic signal cooperation with a cycle-level strategy

Authors: Gengyue Han, Xiaohan Liu, Xianyue Peng, Hao Wang, Yu Han

Abstract: This study introduces CycLight, a novel cycle-level deep reinforcement learning (RL) approach for network-level adaptive traffic signal control (NATSC) systems. Unlike most traditional RL-based traffic controllers that focus on step-by-step decision making, CycLight adopts a cycle-level strategy, optimizing cycle length and splits simultaneously using Parameterized Deep Q-Networks (PDQN) algorithm. This cycle-level approach effectively reduces the computational burden associated with frequent data communication, meanwhile enhancing the practicality and safety of real-world applications. A decentralized framework is formulated for multi-agent cooperation, while attention mechanism is integrated to accurately assess the impact of the surroundings on the current intersection. CycLight is tested in a large synthetic traffic grid using the microscopic traffic simulation tool, SUMO. Experimental results not only demonstrate the superiority of CycLight over other state-of-the-art approaches but also showcase its robustness against information transmission delays.

Submitted 16 January, 2024; originally announced January 2024.

arXiv:2401.08075 [pdf, ps, other]

Maximum principle for optimal control of interacting particle system: stochastic flow model

Authors: Andrey A. Dorogovtsev, Yuecai Han, Kateryna Hlyniana, Yuhang Li

Abstract: In this paper, we consider the stochastic optimal control problem for the interacting particle system. We obtain the stochastic maximum principle of the optimal control system by introducing a generalized backward stochastic differential equation with interaction. The existence and uniqueness of the solution of this type of equation is proved. We derive the necessary condition that the optimal control should satisfy. As an application, the linear quadratic case is investigated to illustrate the main results.

Submitted 15 January, 2024; originally announced January 2024.

arXiv:2401.07520 [pdf, ps, other]

Stochastic Maximum Principle for Control System with Time-varying delay

Authors: Yuhang Li, Yuecai Han

Abstract: In this paper, we study the stochastic optimal control problem for control system with time-varying delay. The corresponding stochastic differential equation is a kind of stochastic differential delay equation. We prove the existence and uniqueness of the solution of this equation. We obtain the stochastic maximum principle of the control system with time-varying delay by introducing a kind of generalized anticipated backward stochastic differential equations. We prove the existence and uniqueness of the solution of this adjoint equation. As an application, the linear quadratic moving average control problem is investigated to illustrate the main result.

Submitted 15 January, 2024; originally announced January 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2312.13516

arXiv:2401.06472 [pdf, ps, other]

doi 10.1088/1367-2630/ad19fe

Quantifying the intrinsic randomness in sequential measurements

Authors: Xinjian Liu, Yukun Wang, Yunguang Han, Xia Wu

Abstract: In the standard Bell scenario, when making a local projective measurement on each system component, the amount of randomness generated is restricted. However, this limitation can be surpassed through the implementation of sequential measurements. Nonetheless, a rigorous definition of random numbers in the context of sequential measurements is yet to be established, except for the lower quantification in device-independent scenarios. In this paper, we define quantum intrinsic randomness in sequential measurements and quantify the randomness in the Collins-Gisin-Linden-Massar-Popescu (CGLMP) inequality sequential scenario. Initially, we investigate the quantum intrinsic randomness of the mixed states under sequential projective measurements and the intrinsic randomness of the sequential positive-operator-valued measure (POVM) under pure states. Naturally, we rigorously define quantum intrinsic randomness under sequential POVM for arbitrary quantum states. Furthermore, we apply our method to one-Alice and two-Bobs sequential measurement scenarios, and quantify the quantum intrinsic randomness of the maximally entangled state and maximally violated state by giving an extremal decomposition. Finally, using the sequential Navascues-Pironio-Acin (NPA) hierarchy in the device-independent scenario, we derive lower bounds on the quantum intrinsic randomness of the maximally entangled state and maximally violated state.

Submitted 12 January, 2024; originally announced January 2024.

Comments: 26 pages 5 figures

arXiv:2401.06367 [pdf, other]

Enhancing a Convolutional Autoencoder with a Quantum Approximate Optimization Algorithm for Image Noise Reduction

Authors: Kimleang Kea, Won-Du Chang, Hee Chul Park, Youngsun Han

Abstract: Image denoising is essential for removing noise in images caused by electric device malfunctions or other factors during image acquisition. It helps preserve image quality and interpretation. Many convolutional autoencoder algorithms have proven effective in image denoising. Owing to their promising efficiency, quantum computers have gained popularity. This study introduces a quantum convolutional autoencoder (QCAE) method for improved image denoising. This method was developed by substituting the representative latent space of the autoencoder with a quantum circuit. To enhance efficiency, we leveraged the advantages of the quantum approximate optimization algorithm (QAOA)-incorporated parameter-shift rule to identify an optimized cost function, facilitating effective learning from data and gradient computation on an actual quantum computer. The proposed QCAE method outperformed its classical counterpart as it exhibited lower training loss and a higher structural similarity index (SSIM) value. QCAE also outperformed its classical counterpart in denoising the MNIST dataset by up to 40% in terms of SSIM value, confirming its enhanced capabilities in real-world applications. Evaluation of QAOA performance across different circuit configurations and layer variations showed that our technique outperformed other circuit designs by 25% on average.

Submitted 11 January, 2024; originally announced January 2024.

Comments: 11 pages, 12 figures and 1 table

arXiv:2401.06255 [pdf, other]

doi 10.1098/rsos.231792

Modelling and Predicting Online Vaccination Views using Bow-tie Decomposition

Authors: Yueting Han, Marya Bazzi, Paolo Turrini

Abstract: Social media has become increasingly important in shaping public vaccination views, especially since the COVID-19 outbreak. This paper uses bow-tie structure to analyse a temporal dataset of directed online social networks that represent the information exchange among anti-vaccination, pro-vaccination, and neutral Facebook pages. Bow-tie structure decomposes a network into seven components, with two components "SCC" and "OUT" emphasised in this paper: SCC is the largest strongly connected component, acting as an "information magnifier", and OUT contains all nodes with a directed path from a node in SCC, acting as an "information creator". We consistently observe statistically significant bow-tie structures with different dominant components for each vaccination group over time. In particular, the anti-vaccination group has a large OUT, and the pro-vaccination group has a large SCC. We further investigate changes in opinions over time, as measured by fan count variations, using agent-based simulations and machine learning models. Across both methods, accounting for bow-tie decomposition better reflects information flow differences among vaccination groups and improves our opinion dynamics prediction results. The modelling frameworks we consider can be applied to any multi-stance temporal network and could form a basis for exploring opinion dynamics using bow-tie structure in a wide range of applications.

Submitted 20 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

Comments: SM update

Journal ref: Royal Society Open Science, 11, 231792 (2024)

arXiv:2401.05530 [pdf, other]

Consensus Focus for Object Detection and minority classes

Authors: Erik Isai Valle Salgado, Chen Li, Yaqi Han, Linchao Shi, Xinghui Li

Abstract: Ensemble methods exploit the availability of a given number of classifiers or detectors trained in single or multiple source domains and tasks to address machine learning problems such as domain adaptation or multi-source transfer learning. Existing research measures the domain distance between the sources and the target dataset, trains multiple networks on the same data with different samples per class, or combines predictions from models trained under varied hyperparameters and settings. Their solutions enhanced the performance on small or tail categories but hurt the rest. To this end, we propose a modified consensus focus for semi-supervised and long-tailed object detection. We introduce a voting system based on source confidence that spots the contribution of each model in a consensus, lets the user choose the relevance of each class in the target label space so that it relaxes minority bounding boxes suppression, and combines multiple models' results without discarding the poisonous networks. Our tests on synthetic driving datasets retrieved higher confidence and more accurate bounding boxes than the NMS, soft-NMS, and WBF. The code used to generate the results is available in our GitHub repository: http://github.com/ErikValle/Consensus-focus-for-object-detection.

Submitted 31 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

arXiv:2401.05011 [pdf, other]

Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection

Authors: Yucheng Han, Na Zhao, Weiling Chen, Keng Teck Ma, Hanwang Zhang

Abstract: Semi-supervised 3D object detection is a promising yet under-explored direction to reduce data annotation costs, especially for cluttered indoor scenes. A few prior works, such as SESS and 3DIoUMatch, attempt to solve this task by utilizing a teacher model to generate pseudo-labels for unlabeled samples. However, the availability of unlabeled samples in the 3D domain is relatively limited compared to its 2D counterpart due to the greater effort required to collect 3D data. Moreover, the loose consistency regularization in SESS and restricted pseudo-label selection strategy in 3DIoUMatch lead to either low-quality supervision or a limited amount of pseudo labels. To address these issues, we present a novel Dual-Perspective Knowledge Enrichment approach named DPKE for semi-supervised 3D object detection. Our DPKE enriches the knowledge of limited training data, particularly unlabeled data, from two perspectives: data-perspective and feature-perspective. Specifically, from the data-perspective, we propose a class-probabilistic data augmentation method that augments the input data with additional instances based on the varying distribution of class probabilities. Our DPKE achieves feature-perspective knowledge enrichment by designing a geometry-aware feature matching method that regularizes feature-level similarity between object proposals from the student and teacher models. Extensive experiments on the two benchmark datasets demonstrate that our DPKE achieves superior performance over existing state-of-the-art approaches under various label ratio conditions. The source code will be made available to the public.

Submitted 10 January, 2024; originally announced January 2024.

Comments: Code is available at https://github.com/tingxueronghua/DPKE

arXiv:2401.04171 [pdf, other]

doi 10.1093/mnras/stae157

CSST Large-scale Structure Analysis Pipeline: II. the CSST Emulator for Slitless Spectroscopy (CESS)

Authors: Run Wen, Xian Zhong Zheng, Yunkun Han, Xiaohu Yang, Xin Wang, Hu Zou, Fengshan Liu, Xin Zhang, Ying Zu, Dong Dong Shi, Yizhou Gu, Yirong Wang

Abstract: The Chinese Space Station Telescope (CSST) slitless spectroscopic survey will observe objects to a limiting magnitude of ~ 23 mag (5$σ$, point sources) in U, V, and I over 17500 deg$^2$. The spectroscopic observations are expected to be highly efficient and complete for mapping galaxies over 0 < z < 1 with secure redshift measurements at spectral resolutions of R ~ 200, providing unprecedented data sets for cosmological studies. To quantitatively examine the survey potential, we develop a software tool, namely the CSST Emulator for Slitless Spectroscopy (CESS), to quickly generate simulated 1D slitless spectra with limited computing resources. We introduce the architecture of CESS and the detailed process of creating simulated CSST slitless spectra. The extended light distribution of a galaxy induces the self-broadening effect on the 1D slitless spectrum. We quantify the effect using morphological parameters: Sérsic index, effective radius, position angle, and axis ratio. Moreover, we also develop a module for CESS to estimate the overlap contamination rate for CSST grating observations of galaxies in galaxy clusters. Applying CESS to the high-resolution model spectra of a sample of ~ 140 million galaxies with m_z < 21 mag selected from the Dark Energy Spectroscopic Instrument LS DR9 catalogue, we obtain the simulated CSST slitless spectra. We examine the dependence of measurement errors on different types of galaxies due to instrumental and observational effects and quantitatively investigate the redshift completeness for different environments out to z ~ 1. Our results show that the CSST spectroscopy is able to provide secure redshifts for about one-quarter of the sample galaxies.

Submitted 26 January, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: 14 pages, 15 figures, 2 tables, accepted for publication in MNRAS

arXiv:2401.03893 [pdf, other]

Finite-Time Decoupled Convergence in Nonlinear Two-Time-Scale Stochastic Approximation

Authors: Yuze Han, Xiang Li, Zhihua Zhang

Abstract: In two-time-scale stochastic approximation (SA), two iterates are updated at varying speeds using different step sizes, with each update influencing the other. Previous studies in linear two-time-scale SA have found that the convergence rates of the mean-square errors for these updates are dependent solely on their respective step sizes, leading to what is referred to as decoupled convergence. However, the possibility of achieving this decoupled convergence in nonlinear SA remains less understood. Our research explores the potential for finite-time decoupled convergence in nonlinear two-time-scale SA. We find that under a weaker Lipschitz condition, traditional analyses are insufficient for achieving decoupled convergence. This finding is further numerically supported by a counterexample. But by introducing an additional condition of nested local linearity, we show that decoupled convergence is still feasible, contingent on the appropriate choice of step sizes associated with smoothness parameters. Our analysis depends on a refined characterization of the matrix cross term between the two iterates and utilizes fourth-order moments to control higher-order approximation errors induced by the local linearity assumption.

Submitted 8 January, 2024; originally announced January 2024.

arXiv:2401.03817 [pdf, other]

Context-Aware Coupler Reconfiguration for Tunable Coupler-Based Superconducting Quantum Computers

Authors: Leanghok Hour, Sovanmonynuth Heng, Sengthai Heng, Myeongseong Go, Youngsun Han

Abstract: We address interconnection challenges in limited-qubit superconducting quantum computers (SQC), which often face crosstalk errors due to expanded qubit interactions during operations. Existing mitigation methods carry trade-offs, like hardware couplers or software-based gate scheduling. Our innovation, the Context-Aware COupler REconfiguration (CA-CORE) compilation method, aligns with application-specific design principles. It optimizes the qubit connections for improved SQC performance, leveraging tunable couplers. Through contextual analysis of qubit correlations, we configure an efficient coupling map considering SQC constraints. Our method reduces depth and SWAP operations by up to 18.84% and 42.47%, respectively. It also enhances circuit fidelity by 40% compared to IBM and Google's topologies. Notably, our method compiles a 33-qubit circuit in less than 1 second.

Submitted 31 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: 20 pages, 8 figures

arXiv:2401.03153 [pdf, other]

An Event-Oriented Diffusion-Refinement Method for Sparse Events Completion

Authors: Bo Zhang, Yuqi Han, Jinli Suo, Qionghai Dai

Abstract: Event cameras or dynamic vision sensors (DVS) record asynchronous response to brightness changes instead of conventional intensity frames, and feature ultra-high sensitivity at low bandwidth. The new mechanism demonstrates great advantages in challenging scenarios with fast motion and large dynamic range. However, the recorded events might be highly sparse due to either limited hardware bandwidth or extreme photon starvation in harsh environments. To unlock the full potential of event cameras, we propose an inventive event sequence completion approach conforming to the unique characteristics of event data in both the processing stage and the output form. Specifically, we treat event streams as 3D event clouds in the spatiotemporal domain, develop a diffusion-based generative model to generate dense clouds in a coarse-to-fine manner, and recover exact timestamps to maintain the temporal resolution of raw data successfully. To validate the effectiveness of our method comprehensively, we perform extensive experiments on three widely used public datasets with different spatial resolutions, and additionally collect a novel event dataset covering diverse scenarios with highly dynamic motions and under harsh illumination. Besides generating high-quality dense events, our method can benefit downstream applications such as object classification and intensity frame reconstruction.

Submitted 6 January, 2024; originally announced January 2024.

arXiv:2401.02975 [pdf]

Uncovering Regulatory Affairs Complexity in Medical Products: A Qualitative Assessment Utilizing Open Coding and Natural Language Processing (NLP)

Authors: Yu Han, Aaron Ceross, Jeroen H. M. Bergmann

Abstract: This study investigates the complexity of regulatory affairs in the medical device industry, a critical factor influencing market access and patient care. Through qualitative research, we sought expert insights to understand the factors contributing to this complexity. The study involved semi-structured interviews with 28 professionals from medical device companies, specializing in various aspects of regulatory affairs. These interviews were analyzed using open coding and Natural Language Processing (NLP) techniques. The findings reveal key sources of complexity within the regulatory landscape, divided into five domains: (A) Regulatory language complexity, (B) Intricacies within the regulatory process, (C) Global-level complexities, (D) Database-related considerations, and (E) Product-level issues. The participants highlighted the need for strategies to streamline regulatory compliance, enhance interactions between regulatory bodies and industry players, and develop adaptable frameworks for rapid technological advancements. Emphasizing interdisciplinary collaboration and increased transparency, the study concludes that these elements are vital for establishing coherent and effective regulatory procedures in the medical device sector.

Submitted 29 December, 2023; originally announced January 2024.

arXiv:2401.02901 [pdf, other]

Charged-current non-standard neutrino interactions at Daya Bay

Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, et al. (177 additional authors not shown)

Abstract: The full data set of the Daya Bay reactor neutrino experiment is used to probe the effect of the charged current non-standard interactions (CC-NSI) on neutrino oscillation experiments. Two different approaches are applied and constraints on the corresponding CC-NSI parameters are obtained with the neutrino flux taken from the Huber-Mueller model with a $5\%$ uncertainty. For the quantum mechanics-based approach (QM-NSI), the constraints on the CC-NSI parameters $ε_{eα}$ and $ε_{eα}^{s}$ are extracted with and without the assumption that the effects of the new physics are the same in the production and detection processes, respectively. The approach based on the weak effective field theory (WEFT-NSI) deals with four types of CC-NSI represented by the parameters $[\varepsilon_{X}]_{eα}$. For both approaches, the results for the CC-NSI parameters are shown for cases with various fixed values of the CC-NSI and the Dirac CP-violating phases, and when they are allowed to vary freely. We find that constraints on the QM-NSI parameters $ε_{eα}$ and $ε_{eα}^{s}$ from the Daya Bay experiment alone can reach the order $\mathcal{O}(0.01)$ for the former and $\mathcal{O}(0.1)$ for the latter, while for WEFT-NSI parameters $[\varepsilon_{X}]_{eα}$, we obtain $\mathcal{O}(0.1)$ for both cases.

Submitted 19 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

Comments: 25 pages, 16 figures, 6 tables; 36 pages, format changed, references added

arXiv:2401.02840 [pdf, other]

A test of lepton flavor universality with a measurement of $R(D^{*})$ using hadronic $B$ tagging at the Belle II experiment

Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, M. Bauer, A. Baur, et al. (412 additional authors not shown)

Abstract: The ratio of branching fractions $R(D^{*}) = \mathcal{B}(\overline{B} \rightarrow D^{*} τ^{-} \overlineν_τ)$/$\mathcal{B} (\overline{B} \rightarrow D^{*} \ell^{-} \overlineν_{\ell})$, where $\ell$ is an electron or muon, is measured using a Belle~II data sample with an integrated luminosity of $189~\mathrm{fb}^{-1}$ at the SuperKEKB asymmetric-energy $e^{+} e^{-}$ collider. Data is collected at the $Υ(\mathrm{4S})$ resonance, and one $B$ meson in the $Υ(\mathrm{4S})\rightarrow B\overline{B}$ decay is fully reconstructed in hadronic decay modes. The accompanying signal $B$ meson is reconstructed as $\overline{B}\rightarrow D^{*} τ^{-}\overlineν_τ$ using leptonic $τ$ decays. The normalization decay, $\overline{B}\rightarrow D^{*} \ell^{-} \overlineν_{\ell}$, where $\ell$ is an electron or muon, produces the same observable final state particles. The ratio of branching fractions is extracted in a simultaneous fit to two signal-discriminating variables in both channels and yields $R(D^{*}) = 0.262~_{-0.039}^{+0.041}(\mathrm{stat})~_{-0.032}^{+0.035}(\mathrm{syst})$. This result is consistent with the current world average and with standard model predictions.

Submitted 5 January, 2024; originally announced January 2024.

Comments: 16 pages, 17 figures, submitted to PRD

arXiv:2401.02542 [pdf, other]

A Community Detection and Graph Neural Network Based Link Prediction Approach for Scientific Literature

Authors: Chunjiang Liu, Yikun Han, Haiyun Xu, Shihan Yang, Kaidi Wang, Yongye Su

Abstract: This study presents a novel approach that synergizes community detection algorithms with various Graph Neural Network (GNN) models to bolster link prediction in scientific literature networks. By integrating the Louvain community detection algorithm into our GNN frameworks, we consistently enhance performance across all models tested. For example, integrating Louvain with the GAT model resulted in an AUC score increase from 0.777 to 0.823, exemplifying the typical improvements observed. Similar gains are noted when Louvain is paired with other GNN architectures, confirming the robustness and effectiveness of incorporating community-level insights. This consistent uplift in performance, reflected in our extensive experimentation on bipartite graphs of scientific collaborations and citations, highlights the synergistic potential of combining community detection with GNNs to overcome common link prediction challenges such as scalability and resolution limits. Our findings advocate for the integration of community structures as a significant step forward in the predictive accuracy of network science models, offering a comprehensive understanding of scientific collaboration patterns through the lens of advanced machine learning techniques.

Submitted 18 January, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

arXiv:2401.02440 [pdf]

Point Location in Constant Time

Authors: Sairam Chaganti, Yijie Han

Abstract: We preprocess the input subdivision with $n$ points on the plane in $O(n\sqrt{\log n})$ time to facilitate point location in constant time. Previously, the preprocessing time was $O(n\log n)$ and point location took $O(\log n)$ time.

Submitted 21 December, 2023; originally announced January 2024.

Comments: Sairam Chaganti is currently a senior software engineer at Southwest Airlines

MSC Class: 68W05; 68W40; ACM Class: F.2.2

arXiv:2401.02295 [pdf]

Tuning the number of chiral edge channels in a fixed quantum anomalous Hall system

Authors: Peng Deng, Yulei Han, Peng Zhang, Su Kong Chong, Zhenhua Qiao, Kang L. Wang

Abstract: Quantum anomalous Hall (QAH) insulators exhibit chiral edge channels characterized by vanishing longitudinal conductance and quantized Hall conductance of $Ce^2/h$, wherein the Chern number $C$ is an integer equal to the number of parallel chiral edge channels. These chiral edge channels conduct dissipationless transport in QAH insulators, making them pivotal for applications in low-consumption electronics and topological quantum computing. While the QAH effect with multiple chiral edge channels (i.e., $C > 1$) has been demonstrated in multilayers consisting of magnetic topological insulators and normal insulators, the channel number remains fixed for a given sample. Here, we unveil the tunability of the number of chiral edge channels within a single QAH insulator device. By tuning the magnetization of individual layers within the multilayer system, Chern insulating states with different Chern numbers are unveiled. The tunable Chern number was corroborated by our theoretical calculations. Furthermore, we conducted layer-dependent calculations to elucidate the contribution of the Chern number from different layers in the multilayer. Our findings demonstrate an extra degree of freedom in manipulating the chiral edge channels in QAH insulators. This newfound tunability offers an extra dimension for the implementation of QAH-based multi-channel dissipationless transport.

Submitted 4 January, 2024; originally announced January 2024.

Comments: The findings and content of this manuscript were also presented in a talk at the CPS meeting in December 2022. The video recording of the talk can be accessed at the following link: https://www.koushare.com/video/videodetail/39429

Showing 151–200 of 2,005 results for author: han, y