Search | arXiv e-print repository

arXiv:2407.01901 [pdf, ps, other]

The Compressible Navier-Stokes Equations on the Multi-Connected Domains

Abstract: This paper investigates the isentropic compressible Navier-Stokes equations on k-connected domains under Navier-slip boundary conditions. We study the multi-solvability of the stationary systems on general domains, which is closely related with the Cauchy-Riemann systems and critical points of harmonic functions on the domain. Then based on the structure of Green's functions, the commutator estima… ▽ More This paper investigates the isentropic compressible Navier-Stokes equations on k-connected domains under Navier-slip boundary conditions. We study the multi-solvability of the stationary systems on general domains, which is closely related with the Cauchy-Riemann systems and critical points of harmonic functions on the domain. Then based on the structure of Green's functions, the commutator estimates are obtained on the circular domains and extended to general domains with the help of conformal map**s. Moreover, we will utilize these assertions to discuss the global well-posedness and large time behaviours of the non-stationary systems on general domains with large initial values containing vacuum. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2407.01886 [pdf, other]

Core Knowledge Learning Framework for Graph Adaptation and Scalability Learning

Authors: Bowen Zhang, Zhichao Huang, Genan Dai, Guangning Xu, Xiaomao Fan, Hu Huang

Abstract: Graph classification is a pivotal challenge in machine learning, especially within the realm of graph-based data, given its importance in numerous real-world applications such as social network analysis, recommendation systems, and bioinformatics. Despite its significance, graph classification faces several hurdles, including adapting to diverse prediction tasks, training across multiple target do… ▽ More Graph classification is a pivotal challenge in machine learning, especially within the realm of graph-based data, given its importance in numerous real-world applications such as social network analysis, recommendation systems, and bioinformatics. Despite its significance, graph classification faces several hurdles, including adapting to diverse prediction tasks, training across multiple target domains, and handling small-sample prediction scenarios. Current methods often tackle these challenges individually, leading to fragmented solutions that lack a holistic approach to the overarching problem. In this paper, we propose an algorithm aimed at addressing the aforementioned challenges. By incorporating insights from various types of tasks, our method aims to enhance adaptability, scalability, and generalizability in graph classification. Motivated by the recognition that the underlying subgraph plays a crucial role in GNN prediction, while the remainder is task-irrelevant, we introduce the Core Knowledge Learning (\method{}) framework for graph adaptation and scalability learning. \method{} comprises several key modules, including the core subgraph knowledge submodule, graph domain adaptation module, and few-shot learning module for downstream tasks. Each module is tailored to tackle specific challenges in graph classification, such as domain shift, label inconsistencies, and data scarcity. By learning the core subgraph of the entire graph, we focus on the most pertinent features for task relevance. Consequently, our method offers benefits such as improved model performance, increased domain adaptability, and enhanced robustness to domain variations. Experimental results demonstrate significant performance enhancements achieved by our method compared to state-of-the-art approaches. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2407.00363 [pdf, other]

Goos-H{ä}nchen Shift for Relativistic Particles Based on Dirac's Equation

Authors: Jiang-Lin Zhou, Zhen-Xiao Zhang, Xing-Yan Fan, **g-Ling Chen

Abstract: The Goos-H{ä}nchen (GH) shift is a specifical optical phenomenon that describes a shift parallel to the reflected light inside the plane of incidence, when a finite-width light undergoes total internal reflection at the interface of medium. Although the GH shift in optics has been widely observed experimentally, its generalization remains uncovered completely in relativistic quantum mechanics for… ▽ More The Goos-H{ä}nchen (GH) shift is a specifical optical phenomenon that describes a shift parallel to the reflected light inside the plane of incidence, when a finite-width light undergoes total internal reflection at the interface of medium. Although the GH shift in optics has been widely observed experimentally, its generalization remains uncovered completely in relativistic quantum mechanics for the existence of Klein's paradox. Recently, Wang has solved Klein's paradox based on the different solutions adpoted for Dirac's equation with step potential in corresponding energy regions \href{https://dx.doi.org/10.1088/2399-6528/abd340}{[J. Phys. Commun. {\bf 4}, 125010 (2020)]}. In the light of Wang's method, we calculate the GH shift for Dirac fermions under relativistic conditions when they are incident obliquely on a three-dimensional infinite potential barrier. Furthermore, we find that the relativistic quantum GH shift can be negative, which is different from the non-relativistic case. △ Less

Submitted 29 June, 2024; originally announced July 2024.

Comments: 9 pages, 2 figures

arXiv:2406.20047 [pdf, other]

Neural Differentiable Modeling with Diffusion-Based Super-resolution for Two-Dimensional Spatiotemporal Turbulence

Authors: Xiantao Fan, Deepak Akhare, Jian-Xun Wang

Abstract: Simulating spatiotemporal turbulence with high fidelity remains a cornerstone challenge in computational fluid dynamics (CFD) due to its intricate multiscale nature and prohibitive computational demands. Traditional approaches typically employ closure models, which attempt to represent small-scale features in an unresolved manner. However, these methods often sacrifice accuracy and lose high-frequ… ▽ More Simulating spatiotemporal turbulence with high fidelity remains a cornerstone challenge in computational fluid dynamics (CFD) due to its intricate multiscale nature and prohibitive computational demands. Traditional approaches typically employ closure models, which attempt to represent small-scale features in an unresolved manner. However, these methods often sacrifice accuracy and lose high-frequency/wavenumber information, especially in scenarios involving complex flow physics. In this paper, we introduce an innovative neural differentiable modeling framework designed to enhance the predictability and efficiency of spatiotemporal turbulence simulations. Our approach features differentiable hybrid modeling techniques that seamlessly integrate deep neural networks with numerical PDE solvers within a differentiable programming framework, synergizing deep learning with physics-based CFD modeling. Specifically, a hybrid differentiable neural solver is constructed on a coarser grid to capture large-scale turbulent phenomena, followed by the application of a Bayesian conditional diffusion model that generates small-scale turbulence conditioned on large-scale flow predictions. Two innovative hybrid architecture designs are studied, and their performance is evaluated through comparative analysis against conventional large eddy simulation techniques with physics-based subgrid-scale closures and purely data-driven neural solvers. The findings underscore the potential of the neural differentiable modeling framework to significantly enhance the accuracy and computational efficiency of turbulence simulations. This study not only demonstrates the efficacy of merging deep learning with physics-based numerical solvers but also sets a new precedent for advanced CFD modeling techniques, highlighting the transformative impact of differentiable programming in scientific computing. △ Less

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2406.19042 [pdf, other]

Towards Credential-based Device Registration in DApps for DePINs with ZKPs

Authors: Jonathan Heiss, Fernando Castillo, Xinxin Fan

Abstract: Decentralized Physical Infrastructure Networks (DePINS) are secured and governed by blockchains but beyond crypto-economic incentives, they lack measures to establish trust in participating devices and their services. The verification of relevant device credentials during device registration helps to overcome this problem. However, on-chain verification in decentralized applications (dApp) disclos… ▽ More Decentralized Physical Infrastructure Networks (DePINS) are secured and governed by blockchains but beyond crypto-economic incentives, they lack measures to establish trust in participating devices and their services. The verification of relevant device credentials during device registration helps to overcome this problem. However, on-chain verification in decentralized applications (dApp) discloses potentially confidential device attributes whereas off-chain verification introduces undesirable trust assumptions. In this paper, we propose a credential-based device registration (CDR) mechanism that verifies device credentials on the blockchain and leverages zero-knowledge proofs (ZKP) to protect confidential device attributes from being disclosed. We characterize CDR for DePINs, present a general system model, and technically evaluate CDR using zkSNARKs with Groth16 and Marlin. Our experiments give first insights into performance impacts and reveal a tradeoff between the applied proof systems. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.18228 [pdf, other]

Detection, sky localization and early warning for binary neutron star mergers by detectors located in China of different configurations in third generation detector network

Authors: Yufeng Li, Ik Siong Heng, Man Leong Chan, Xilong Fan, Lijun Gou

Abstract: This work shows the results of an evaluation of the impact that a detector located in China, with a noise budget comparable to that of a proposed high-frequency detector with a 20 km arm length, an Einstein Telescope (ET) or a Cosmic Explorer (CE), could have on the network of ET-CE in terms of detection rate, localization, and providing early warning alert for simulated binary neutron star (BNS)s… ▽ More This work shows the results of an evaluation of the impact that a detector located in China, with a noise budget comparable to that of a proposed high-frequency detector with a 20 km arm length, an Einstein Telescope (ET) or a Cosmic Explorer (CE), could have on the network of ET-CE in terms of detection rate, localization, and providing early warning alert for simulated binary neutron star (BNS)s. The results indicate that a three-detector network including a Chinese detector could identify at least 4.4% more BNS mergers than an ET-CE network alone. The localization uncertainty could be reduced by a factor of more than 5 on average compared to the ET-CE network. With a three-detector network involving a Chinese detector, up to 89% of BNS mergers could be located within 10 square degrees of the sky 10 minutes prior to the merger. The assessment suggests that the potential for early warning signals is highest when the Chinese detector is similar to ET, whereas the sources are detected with the highest signal-to-noise ratio and localized to the smallest regions when the detector is more akin to CE. Interestingly, the C20N network (comprising ET+CE+C20) can achieve comparable localization performance as the ET network while outperforming the ETCN network (featuring the ET+CE+ an ET-like detector in China) in terms of detection capabilities, especially at large distances, indicating that adding a 20 km kilohertz detector in China to ET-CE network would make significant contributions at least as adding an ET-like detector in China to multi-messenger astronomy for almost all BNS observations. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: accepted by PRD

arXiv:2406.17247 [pdf, ps, other]

doi 10.1142/S0217732324500305

Einstein-Podolsky-Rosen steering paradox "2=1'' for $N$ qubits

Authors: Zhi-Jie Liu, Jie Zhou, Hui-Xian Meng, Xing-Yan Fan, Mi Xie, Fu-lin Zhang, **g-Ling Chen

Abstract: Einstein-Podolsky-Rosen (EPR) paradox highlights the absence of a local realistic explanation for quantum mechanics, and shows the incompatibility of the local-hidden-state models with quantum theory. For $N$-qubit states, or more importantly, the $N$-qubit mixed states, we present the EPR steering paradox in the form of the contradictory equality "2=1". We show that the contradiction holds for an… ▽ More Einstein-Podolsky-Rosen (EPR) paradox highlights the absence of a local realistic explanation for quantum mechanics, and shows the incompatibility of the local-hidden-state models with quantum theory. For $N$-qubit states, or more importantly, the $N$-qubit mixed states, we present the EPR steering paradox in the form of the contradictory equality "2=1". We show that the contradiction holds for any $N$-qubit state as long as both the pure state requirement and the measurement requirement are satisfied. This also indicates that the EPR steering paradox exists in more general cases. Finally, we give specific examples to demonstrate and analyze our arguments. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: 12 pages, 0 figure

Journal ref: Modern Physics Letters A Vol. 39, No. 9, 2450030 (2024)

arXiv:2406.13290 [pdf, other]

EPR Steering Criterion and Monogamy Relation via Correlation Matrices in Tripartite Systems

Authors: Li-Juan Li, Xiao-Gang Fan, Xue-Ke Song, Liu Ye, Dong Wang

Abstract: Quantum steering is considered as one of the most well-known nonlocal phenomena in quantum mechanics. Unlike entanglement and Bell non-locality, the asymmetry of quantum steering makes it vital for one-sided device-independent quantum information processing. Although there has been much progress on steering detection for bipartite systems, the criterion for EPR steering in tripartite systems remai… ▽ More Quantum steering is considered as one of the most well-known nonlocal phenomena in quantum mechanics. Unlike entanglement and Bell non-locality, the asymmetry of quantum steering makes it vital for one-sided device-independent quantum information processing. Although there has been much progress on steering detection for bipartite systems, the criterion for EPR steering in tripartite systems remains challenging and inadequate. In this paper, we firstly derive a novel and promising steering criterion for any three-qubit states via correlation matrix. Furthermore, we propose the monogamy relation between the tripartite steering of system and the bipartite steering of subsystems based on the derived criterion. Finally, as illustrations, we demonstrate the performance of the steering criterion and the monogamy relation by means of several representative examples. We believe that the results and methods presented in this work could be beneficial to capture genuine multipartite steering in the near future. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 10 pages, 4 figures, comments are welcomed. Accepted by Physical Review A

arXiv:2406.07921 [pdf, other]

A Two-Stage Online Algorithm for EV Charging Station Energy Management and Carbon Trading

Authors: Dongxiang Yan, Shihan Huang, Sen Li, Xiaoyi Fan, Yue Chen

Abstract: The increasing electric vehicle (EV) adoption challenges the energy management of charging stations (CSs) due to the large number of EVs and the underlying uncertainties. Moreover, the carbon footprint of CSs is growing significantly due to the rising charging power demand. This makes it important for CSs to properly manage their energy usage and ensure their carbon footprint stay within their car… ▽ More The increasing electric vehicle (EV) adoption challenges the energy management of charging stations (CSs) due to the large number of EVs and the underlying uncertainties. Moreover, the carbon footprint of CSs is growing significantly due to the rising charging power demand. This makes it important for CSs to properly manage their energy usage and ensure their carbon footprint stay within their carbon emission quotas. This paper proposes a two-stage online algorithm for this purpose, considering the different time scales of energy management and carbon trading. In the first stage, the CS characterizes the real-time aggregate EV power flexibility, in terms of upper and lower bounds on the total charging power, by a Lyapunov optimization-based online algorithm. In the second stage, the CS co-optimizes energy management and carbon trading, with EV charging power chosen within the aggregate flexibility region provided by the first stage. A generalized battery model is proposed to capture the dynamic carbon footprint changes and carbon trading. A virtual carbon queue is designed to develop an online algorithm for the second stage, which can ensure the carbon footprint of CS be within its carbon emission quota and its total operation cost is nearly offline optimal. Case studies validate the effectiveness and advantages of the proposed algorithm. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 12 pages, 13 figures

arXiv:2406.07612 [pdf, other]

Optical and near-infrared spectroscopy of quasars at $z>6.5$: public data release and composite spectrum

Authors: Silvia Onorato, Joseph F. Hennawi, Jan-Torge Schindler, **yi Yang, Feige Wang, Aaron J. Barth, Eduardo Bañados, Anna-Christina Eilers, Sarah E. I. Bosman, Frederick B. Davies, Bram P. Venemans, Chiara Mazzucchelli, Silvia Belladitta, Fabio Vito, Emanuele Paolo Farina, Irham T. Andika, Xiaohui Fan, Fabian Walter, Roberto Decarli, Masafusa Onoue, Riccardo Nanni

Abstract: We present optical and near-infrared (NIR) spectroscopic observations for a sample of $45$ quasars at $6.50 < z \leq 7.64$ with absolute magnitudes at $1450$ Å in the range $-28.82 \leq M_{1450} \leq -24.13$ and their composite spectrum. The median redshift and $M_{1450}$ of the quasars in the sample are $z_{\rm{median}}=6.71$ and $M_{1450,\rm{median}} \simeq -26.1$, respectively. The NIR spectra… ▽ More We present optical and near-infrared (NIR) spectroscopic observations for a sample of $45$ quasars at $6.50 < z \leq 7.64$ with absolute magnitudes at $1450$ Å in the range $-28.82 \leq M_{1450} \leq -24.13$ and their composite spectrum. The median redshift and $M_{1450}$ of the quasars in the sample are $z_{\rm{median}}=6.71$ and $M_{1450,\rm{median}} \simeq -26.1$, respectively. The NIR spectra are taken with echelle spectrographs, complemented with additional data from optical long slit instruments, and then reduced consistently using the open-source Python-based spectroscopic data reduction pipeline PypeIt. The median value of the mean signal-to-noise ratios of the spectra in J, H, and K band (median $\langle \rm{SNR}_λ \rangle$) is: median $\langle \rm{SNR}_{J} \rangle=9.7$, median $\langle \rm{SNR}_{H} \rangle=10.3$, and median $\langle \rm{SNR}_{K} \rangle=11.7$; demonstrating the good data quality. This work presents the largest medium/moderate-resolution sample of quasars at $z>6.5$ from ground-based instruments. Its homogeneity and reproducibility make it ideally suited for several scientific goals, i.e., the study of the quasar proximity zones and dam** wings, the Ly$α$ forest, the intergalactic medium's metal content, as well as other properties such as the distribution of SMBH masses and Eddington ratios. Our composite spectrum is compared to others at both high and low-$z$ from the literature, showing differences in the strengths of many emission lines, probably due to differences in luminosity among the samples, but a consistent continuum slope, which proves that the same spectral features are preserved in quasars at different redshift ranges. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 18 pages, 10 figures, submitted to MNRAS

arXiv:2406.06697 [pdf, other]

A quasar-galaxy merger at $z\sim 6.2$: rapid host growth via accretion of two massive satellite galaxies

Authors: Roberto Decarli, Federica Loiacono, Emanuele Paolo Farina, Massimo Dotti, Alessandro Lupi, Romain A. Meyer, Marco Mignoli, Antonio Pensabene, Michael A. Strauss, Bram Venemans, **yi Yang, Fabian Walter, Julien Wolf, Eduardo Bañados, Laura Blecha, Sarah Bosman, Chris L. Carilli, Andrea Comastri, Thomas Connor, Tiago Costa, Anna-Christina Eilers, Xiaohui Fan, Roberto Gilli, Hyunsung D. Jun, Weizhe Liu , et al. (16 additional authors not shown)

Abstract: We present JWST/NIRSpec Integral Field Spectroscopy in the rest-frame optical bands of the system PJ308-21, a quasar at $z=6.2342$ caught as its host galaxy interacts with companion galaxies. We detect spatially extended emission of several emission lines (H$α$, H$β$, [OIII], [NII], [SII], HeII), which we use to study the properties of the ionized phase of the interstellar medium: the source and h… ▽ More We present JWST/NIRSpec Integral Field Spectroscopy in the rest-frame optical bands of the system PJ308-21, a quasar at $z=6.2342$ caught as its host galaxy interacts with companion galaxies. We detect spatially extended emission of several emission lines (H$α$, H$β$, [OIII], [NII], [SII], HeII), which we use to study the properties of the ionized phase of the interstellar medium: the source and hardness of the photoionizing radiation field, metallicity, dust reddening, electron density and temperature, and star formation. We also marginally detect continuum starlight emission associated with the companion sources. We find that at least two independent satellite galaxies are part of the system. While the quasar host appears highly enriched and obscured, with AGN-like photoionization conditions, the western companion shows minimal dust extinction, low metallicity ($Z\sim0.4$ Z$_\odot$), and star-formation driven photoionization. The eastern companion shows higher extinction and metallicity ($Z\sim0.8$ Z$_\odot$) compared to the western companion, and it is at least partially photoionized by the nearby quasar. We do not find any indication of AGN in the companion sources. Our study shows that while the quasar host galaxy is already very massive ($M_{\rm dyn}>10^{11}$ M$_\odot$), it is still rapidly building up by accreting two relatively massive ($M_{\rm star}\sim 10^{10}$ M$_\odot$) companion sources. This dataset showcases the power of JWST in exposing the build-up of massive galaxies in the first Gyr of the Universe. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: 15 pages, 16 figures. Accepted for publication in A&A

arXiv:2406.02064 [pdf, other]

Advancing Generalized Transfer Attack with Initialization Derived Bilevel Optimization and Dynamic Sequence Truncation

Authors: Yaohua Liu, Jiaxin Gao, Xuan Liu, Xianghao Jiao, Xin Fan, Risheng Liu

Abstract: Transfer attacks generate significant interest for real-world black-box applications by crafting transferable adversarial examples through surrogate models. Whereas, existing works essentially directly optimize the single-level objective w.r.t. the surrogate model, which always leads to poor interpretability of attack mechanism and limited generalization performance over unknown victim models. In… ▽ More Transfer attacks generate significant interest for real-world black-box applications by crafting transferable adversarial examples through surrogate models. Whereas, existing works essentially directly optimize the single-level objective w.r.t. the surrogate model, which always leads to poor interpretability of attack mechanism and limited generalization performance over unknown victim models. In this work, we propose the \textbf{B}il\textbf{E}vel \textbf{T}ransfer \textbf{A}ttac\textbf{K} (BETAK) framework by establishing an initialization derived bilevel optimization paradigm, which explicitly reformulates the nested constraint relationship between the Upper-Level (UL) pseudo-victim attacker and the Lower-Level (LL) surrogate attacker. Algorithmically, we introduce the Hyper Gradient Response (HGR) estimation as an effective feedback for the transferability over pseudo-victim attackers, and propose the Dynamic Sequence Truncation (DST) technique to dynamically adjust the back-propagation path for HGR and reduce computational overhead simultaneously. Meanwhile, we conduct detailed algorithmic analysis and provide convergence guarantee to support non-convexity of the LL surrogate attacker. Extensive evaluations demonstrate substantial improvement of BETAK (e.g., $\mathbf{53.41}$\% increase of attack success rates against IncRes-v$2_{ens}$) against different victims and defense methods in targeted and untargeted attack scenarios. The source code is available at https://github.com/callous-youth/BETAK. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: Accepted by IJCAI 2024. 10 pages

arXiv:2406.01631 [pdf, other]

An LLM-based Recommender System Environment

Authors: Nathan Corecco, Giorgio Piatti, Luca A. Lanzendörfer, Flint Xiaofeng Fan, Roger Wattenhofer

Abstract: Reinforcement learning (RL) has gained popularity in the realm of recommender systems due to its ability to optimize long-term rewards and guide users in discovering relevant content. However, the successful implementation of RL in recommender systems is challenging because of several factors, including the limited availability of online data for training on-policy methods. This scarcity requires… ▽ More Reinforcement learning (RL) has gained popularity in the realm of recommender systems due to its ability to optimize long-term rewards and guide users in discovering relevant content. However, the successful implementation of RL in recommender systems is challenging because of several factors, including the limited availability of online data for training on-policy methods. This scarcity requires expensive human interaction for online model training. Furthermore, the development of effective evaluation frameworks that accurately reflect the quality of models remains a fundamental challenge in recommender systems. To address these challenges, we propose a comprehensive framework for synthetic environments that simulate human behavior by harnessing the capabilities of large language models (LLMs). We complement our framework with in-depth ablation studies and demonstrate its effectiveness with experiments on movie and book recommendations. By utilizing LLMs as synthetic users, this work introduces a modular and novel framework for training RL-based recommender systems. The software, including the RL environment, is publicly available. △ Less

Submitted 1 June, 2024; originally announced June 2024.

arXiv:2406.01060 [pdf, other]

Mechanical dynamics around higher-order exceptional point in magno-optomechanics

Authors: Wen-Di He, Xiao-Hong Fan, Ming-Yue Liu, Guo-Qiang Zhang, Hai-Chao Li, Wei Xiong

Abstract: We theoretically study diverse exceptional points (EPs) in an experimentally feasible magno-optomechanics consisting of an optomechanical subsystem coupled to a magnomechanical subsystem via physically direct contact. By adiabatically eliminating both the cavity and the Kittel mode, dissipative and parity-time symmetric exceptional points can be observed. When only the cavity mode is eliminated, a… ▽ More We theoretically study diverse exceptional points (EPs) in an experimentally feasible magno-optomechanics consisting of an optomechanical subsystem coupled to a magnomechanical subsystem via physically direct contact. By adiabatically eliminating both the cavity and the Kittel mode, dissipative and parity-time symmetric exceptional points can be observed. When only the cavity mode is eliminated, a second (third) -order pseudo-Hermitian EP emerges for nondegenerate (degenerate) mechanical modes. The distinct dynamical behavior of two mechanical modes around these EPs are further studied. Our proposal provides a promising way to engineer diverse EPs and quantify non-Hermitian phase transition with exceptional dynamical behavior in magno-optomechanics. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 6 pages,5 figures

arXiv:2405.18291 [pdf, other]

FedSAC: Dynamic Submodel Allocation for Collaborative Fairness in Federated Learning

Authors: Zihui Wang, Zheng Wang, Lingjuan Lyu, Zhaopeng Peng, Zhicheng Yang, Chenglu Wen, Rongshan Yu, Cheng Wang, Xiaoliang Fan

Abstract: Collaborative fairness stands as an essential element in federated learning to encourage client participation by equitably distributing rewards based on individual contributions. Existing methods primarily focus on adjusting gradient allocations among clients to achieve collaborative fairness. However, they frequently overlook crucial factors such as maintaining consistency across local models and… ▽ More Collaborative fairness stands as an essential element in federated learning to encourage client participation by equitably distributing rewards based on individual contributions. Existing methods primarily focus on adjusting gradient allocations among clients to achieve collaborative fairness. However, they frequently overlook crucial factors such as maintaining consistency across local models and catering to the diverse requirements of high-contributing clients. This oversight inevitably decreases both fairness and model accuracy in practice. To address these issues, we propose FedSAC, a novel Federated learning framework with dynamic Submodel Allocation for Collaborative fairness, backed by a theoretical convergence guarantee. First, we present the concept of "bounded collaborative fairness (BCF)", which ensures fairness by tailoring rewards to individual clients based on their contributions. Second, to implement the BCF, we design a submodel allocation module with a theoretical guarantee of fairness. This module incentivizes high-contributing clients with high-performance submodels containing a diverse range of crucial neurons, thereby preserving consistency across local models. Third, we further develop a dynamic aggregation module to adaptively aggregate submodels, ensuring the equitable treatment of low-frequency neurons and consequently enhancing overall model accuracy. Extensive experiments conducted on three public benchmarks demonstrate that FedSAC outperforms all baseline methods in both fairness and model accuracy. We see this work as a significant step towards incentivizing broader client participation in federated learning. The source code is available at https://github.com/wangzihuixmu/FedSAC. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: Accepted by KDD'24

arXiv:2405.17141 [pdf, other]

MVMS-RCN: A Dual-Domain Unfolding CT Reconstruction with Multi-sparse-view and Multi-scale Refinement-correction

Authors: Xiaohong Fan, Ke Chen, Huaming Yi, Yin Yang, Jian** Zhang

Abstract: X-ray Computed Tomography (CT) is one of the most important diagnostic imaging techniques in clinical applications. Sparse-view CT imaging reduces the number of projection views to a lower radiation dose and alleviates the potential risk of radiation exposure. Most existing deep learning (DL) and deep unfolding sparse-view CT reconstruction methods: 1) do not fully use the projection data; 2) do n… ▽ More X-ray Computed Tomography (CT) is one of the most important diagnostic imaging techniques in clinical applications. Sparse-view CT imaging reduces the number of projection views to a lower radiation dose and alleviates the potential risk of radiation exposure. Most existing deep learning (DL) and deep unfolding sparse-view CT reconstruction methods: 1) do not fully use the projection data; 2) do not always link their architecture designs to a mathematical theory; 3) do not flexibly deal with multi-sparse-view reconstruction assignments. This paper aims to use mathematical ideas and design optimal DL imaging algorithms for sparse-view tomography reconstructions. We propose a novel dual-domain deep unfolding unified framework that offers a great deal of flexibility for multi-sparse-view CT reconstruction with different sampling views through a single model. This framework combines the theoretical advantages of model-based methods with the superior reconstruction performance of DL-based methods, resulting in the expected generalizability of DL. We propose a refinement module that utilizes unfolding projection domain to refine full-sparse-view projection errors, as well as an image domain correction module that distills multi-scale geometric error corrections to reconstruct sparse-view CT. This provides us with a new way to explore the potential of projection information and a new perspective on designing network architectures. All parameters of our proposed framework are learnable end to end, and our method possesses the potential to be applied to plug-and-play reconstruction. Extensive experiments demonstrate that our framework is superior to other existing state-of-the-art methods. Our source codes are available at https://github.com/fanxiaohong/MVMS-RCN. △ Less

Submitted 27 May, 2024; originally announced May 2024.

Comments: 12 pages, submitted

arXiv:2405.15268 [pdf, other]

ParamReL: Learning Parameter Space Representation via Progressively Encoding Bayesian Flow Networks

Authors: Zhangkai Wu, Xuhui Fan, ** Li, Zhilin Zhao, Hui Chen, Longbing Cao

Abstract: The recently proposed Bayesian Flow Networks~(BFNs) show great potential in modeling parameter spaces, offering a unified strategy for handling continuous, discretized, and discrete data. However, BFNs cannot learn high-level semantic representation from the parameter space since {common encoders, which encode data into one static representation, cannot capture semantic changes in parameters.} Thi… ▽ More The recently proposed Bayesian Flow Networks~(BFNs) show great potential in modeling parameter spaces, offering a unified strategy for handling continuous, discretized, and discrete data. However, BFNs cannot learn high-level semantic representation from the parameter space since {common encoders, which encode data into one static representation, cannot capture semantic changes in parameters.} This motivates a new direction: learning semantic representations hidden in the parameter spaces to characterize mixed-typed noisy data. {Accordingly, we propose a representation learning framework named ParamReL, which operates in the parameter space to obtain parameter-wise latent semantics that exhibit progressive structures. Specifically, ParamReL proposes a \emph{self-}encoder to learn latent semantics directly from parameters, rather than from observations. The encoder is then integrated into BFNs, enabling representation learning with various formats of observations. Mutual information terms further promote the disentanglement of latent semantics and capture meaningful semantics simultaneously.} We illustrate {conditional generation and reconstruction} in ParamReL via expanding BFNs, and extensive {quantitative} experimental results demonstrate the {superior effectiveness} of ParamReL in learning parameter representation. △ Less

Submitted 5 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

arXiv:2405.15193 [pdf, other]

CuckooGraph: A Scalable and Space-Time Efficient Data Structure for Large-Scale Dynamic Graphs

Authors: Zhuochen Fan, Yalun Cai, Zirui Liu, Jiarui Guo, Xin Fan, Tong Yang, Bin Cui

Abstract: Graphs play an increasingly important role in various big data applications. However, existing graph data structures cannot simultaneously address the performance bottlenecks caused by the dynamic updates, large scale, and high query complexity of current graphs. This paper proposes a novel data structure for large-scale dynamic graphs called CuckooGraph. It does not need to know the amount of gra… ▽ More Graphs play an increasingly important role in various big data applications. However, existing graph data structures cannot simultaneously address the performance bottlenecks caused by the dynamic updates, large scale, and high query complexity of current graphs. This paper proposes a novel data structure for large-scale dynamic graphs called CuckooGraph. It does not need to know the amount of graph data in advance, and can adaptively resize to the most memory-efficient form according to the data scale, realizing multiple graph analytic tasks faster. The key techniques of CuckooGraph include TRANSFORMATION and DENYLIST. TRANSFORMATION fully utilizes the limited memory by designing related data structures that allow flexible space transformations to smoothly expand/tighten the required space depending on the number of incoming items. DENYLIST efficiently handles item insertion failures and further improves processing speed. We conduct extensive experiments, and the results show that CuckooGraph significantly reduces query time by four orders of magnitude on 1-hop successor and precursor queries compared to the state-of-the-art. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.14139 [pdf, other]

Contribute to balance, wire in accordance: Emergence of backpropagation from a simple, bio-plausible neuroplasticity rule

Authors: Xinhao Fan, Shreesh P Mysore

Abstract: Backpropagation (BP) has been pivotal in advancing machine learning and remains essential in computational applications and comparative studies of biological and artificial neural networks. Despite its widespread use, the implementation of BP in the brain remains elusive, and its biological plausibility is often questioned due to inherent issues such as the need for symmetry of weights between for… ▽ More Backpropagation (BP) has been pivotal in advancing machine learning and remains essential in computational applications and comparative studies of biological and artificial neural networks. Despite its widespread use, the implementation of BP in the brain remains elusive, and its biological plausibility is often questioned due to inherent issues such as the need for symmetry of weights between forward and backward connections, and the requirement of distinct forward and backward phases of computation. Here, we introduce a novel neuroplasticity rule that offers a potential mechanism for implementing BP in the brain. Similar in general form to the classical Hebbian rule, this rule is based on the core principles of maintaining the balance of excitatory and inhibitory inputs as well as on retrograde signaling, and operates over three progressively slower timescales: neural firing, retrograde signaling, and neural plasticity. We hypothesize that each neuron possesses an internal state, termed credit, in addition to its firing rate. After achieving equilibrium in firing rates, neurons receive credits based on their contribution to the E-I balance of postsynaptic neurons through retrograde signaling. As the network's credit distribution stabilizes, connections from those presynaptic neurons are strengthened that significantly contribute to the balance of postsynaptic neurons. We demonstrate mathematically that our learning rule precisely replicates BP in layered neural networks without any approximations. Simulations on artificial neural networks reveal that this rule induces varying community structures in networks, depending on the learning rate. This simple theoretical framework presents a biologically plausible implementation of BP, with testable assumptions and predictions that may be evaluated through biological experiments. △ Less

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.13113 [pdf, other]

MAMMOTH-Subaru. II. Diverse Populations of Circumgalactic Ly$α$ Nebulae at Cosmic Noon

Authors: Mingyu Li, Haibin Zhang, Zheng Cai, Yongming Liang, Nobunari Kashikawa, Ke Ma, Xiaohui Fan, J. Xavier Prochaska, Bjorn H. C. Emonts, Xin Wang, Yun**g Wu, Shiwu Zhang, Qiong Li, Sean D. Johnson, Minghao Yue, Fabrizio Arrigoni Battaia, Sebastiano Cantalupo, Joseph F. Hennawi, Satoshi Kikuta, Yuanhang Ning, Masami Ouchi, Rhythm Shimakawa, Ben Wang, Weichen Wang, Zheng Zheng , et al. (1 additional authors not shown)

Abstract: Circumgalactic Lyman-alpha (Ly$α$) nebulae are gaseous halos around galaxies exhibiting luminous extended Ly$α$ emission. This work investigates Ly$α$ nebulae from deep imaging of $\sim12~\mathrm{deg}^2$ sky, targeted by the MAMMOTH-Subaru survey. Utilizing the wide-field capability of Hyper Suprime-Cam (HSC), we present one of the largest blind Ly$α$ nebula selections, including QSO nebulae, Ly… ▽ More Circumgalactic Lyman-alpha (Ly$α$) nebulae are gaseous halos around galaxies exhibiting luminous extended Ly$α$ emission. This work investigates Ly$α$ nebulae from deep imaging of $\sim12~\mathrm{deg}^2$ sky, targeted by the MAMMOTH-Subaru survey. Utilizing the wide-field capability of Hyper Suprime-Cam (HSC), we present one of the largest blind Ly$α$ nebula selections, including QSO nebulae, Ly$α$ blobs, and radio galaxy nebulae down to typical $2σ$ Ly$α$ surface brightness of $(5-10)\times10^{-18}\mathrm{~erg~s^{-1}~cm^{-2}~arcsec^{-2}}$. The sample contains 117 nebulae with Ly$α$ sizes of 40 - 400 kpc, and the most gigantic one spans about 365 kpc, referred to as the Ivory Nebula. Combining with multiwavelength data, we investigate diverse nebula populations and associated galaxies. We find a small fraction of Ly$α$ nebulae have QSOs ($\sim7\%$), luminous infrared galaxies ($\sim1\%$), and radio galaxies ($\sim 2\%$). Remarkably, among the 28 enormous Ly$α$ nebulae (ELANe) exceeding 100 kpc, about $80\%$ are associated with UV-faint galaxies ($M_\mathrm{UV} > -22$), categorized as Type II ELANe. We underscore that Type II ELANe constitute the majority but remain largely hidden in current galaxy and QSO surveys. Dusty starburst and obscured AGN activity are proposed to explain the nature of Type II ELANe. The SED of stacking all Ly$α$ nebulae also reveals signs of massive dusty star-forming galaxies with obscured AGNs. We propose a model to explain the dusty nature where the diverse populations of Ly$α$ nebula capture massive galaxies at different evolutionary stages undergoing violent assembling. Ly$α$ nebulae provide critical insights into the formation and evolution of today's massive cluster galaxies at cosmic noon. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 26 pages, 10 figures, 3 tables, submitted to ApJS, comments welcome

arXiv:2405.12275 [pdf, other]

doi 10.1093/mnrasl/slae061

Dam** Wing-Like Features in the Stacked Ly$α$ Forest: Potential Neutral Hydrogen Islands at $z<6$

Authors: Yongda Zhu, George D. Becker, Sarah E. I. Bosman, Christopher Cain, Laura C. Keating, Fahad Nasir, Valentina D'Odorico, Eduardo Bañados, Fuyan Bian, Manuela Bischetti, James S. Bolton, Huanqing Chen, Anson D'Aloisio, Frederick B. Davies, Rebecca L. Davies, Anna-Christina Eilers, Xiaohui Fan, Prakash Gaikwad, Bradley Greig, Martin G. Haehnelt, Girish Kulkarni, Samuel Lai, Ewald Puchwein, Yuxiang Qin, Emma V. Ryan-Weber , et al. (6 additional authors not shown)

Abstract: Recent quasar absorption line observations suggest that reionization may end as late as $z \approx 5.3$. As a means to search for large neutral hydrogen islands at $z<6$, we revisit long dark gaps in the Ly$β$ forest in VLT/X-Shooter and Keck/ESI quasar spectra. We stack the Ly$α$ forest corresponding to both edges of these Ly$β$ dark gaps and identify a dam** wing-like extended absorption profi… ▽ More Recent quasar absorption line observations suggest that reionization may end as late as $z \approx 5.3$. As a means to search for large neutral hydrogen islands at $z<6$, we revisit long dark gaps in the Ly$β$ forest in VLT/X-Shooter and Keck/ESI quasar spectra. We stack the Ly$α$ forest corresponding to both edges of these Ly$β$ dark gaps and identify a dam** wing-like extended absorption profile. The average redshift of the stacked forest is $z=5.8$. By comparing these observations with reionization simulations, we infer that such a dam** wing-like feature can be naturally explained if these gaps are at least partially created by neutral islands. Conversely, simulated dark gaps lacking neutral hydrogen struggle to replicate the observed dam** wing features. Furthermore, this dam** wing-like profile implies that the volume-averaged neutral hydrogen fraction must be $\langle x_{\rm HI} \rangle \geq 6.1 \pm 3.9\%$ at $z = 5.8$. Our results offer robust evidence that reionization extends below $z=6$. △ Less

Submitted 28 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

Comments: 8 pages, 5 figures, 1 table; accepted for publication in MNRAS Letters

arXiv:2405.11879 [pdf, ps, other]

Probing $χ_{cJ}(J=0,1,2)$ decay into baryon and anti-baryon with SU(3) flavor analysis

Authors: Bo Lan, Xiu-** Fan, Ru-Min Wang

Abstract: With the accurate measurements of $χ_{cJ}(J=0,1,2)$ charmonium decays, we explore $χ_{cJ}\to \mathcal{B}_{8}\bar{\mathcal{B}}_{8}$ and $\mathcal{B}_{10}\bar{\mathcal{B}}_{10}$ decays based on the SU(3) flavor symmetry model, where $\mathcal{B}_{8}$ and $\mathcal{B}_{10}$ are light octet and decuplet baryons, respectively. The decay amplitude relations are determined by an effective interaction Ham… ▽ More With the accurate measurements of $χ_{cJ}(J=0,1,2)$ charmonium decays, we explore $χ_{cJ}\to \mathcal{B}_{8}\bar{\mathcal{B}}_{8}$ and $\mathcal{B}_{10}\bar{\mathcal{B}}_{10}$ decays based on the SU(3) flavor symmetry model, where $\mathcal{B}_{8}$ and $\mathcal{B}_{10}$ are light octet and decuplet baryons, respectively. The decay amplitude relations are determined by an effective interaction Hamiltonian. Then using experimental data and the amplitude relations of $χ_{cJ}\to \mathcal{B}_{8}\bar{\mathcal{B}}_{8}$, $\mathcal{B}_{10}\bar{\mathcal{B}}_{10}$ decays, relative nonperturbative coefficients are constrained, and the branching ratios of unmeasured $χ_{cJ}\to \mathcal{B}_{8}\bar{\mathcal{B}}_{8},\mathcal{B}_{10}\bar{\mathcal{B}}_{10}$, for examples, $χ_{cJ}\to n\bar{n}$ and $χ_{cJ}\toΛ\barΣ^{0}+\barΛ Σ^{0}$ channels, are predicted. Moreover, we discuss the case of adding a mixing angle between $Λ$ and $Σ^{0}$, which is determined by the quark mass differences. Our results provide valuable insights that will aid in advancing our understanding of the mechanisms and characteristics of $χ_{cJ}$ decays. △ Less

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.11683 [pdf, other]

Conditionally-Conjugate Gaussian Process Factor Analysis for Spike Count Data via Data Augmentation

Authors: Yididiya Y. Nadew, Xuhui Fan, Christopher J. Quinn

Abstract: Gaussian process factor analysis (GPFA) is a latent variable modeling technique commonly used to identify smooth, low-dimensional latent trajectories underlying high-dimensional neural recordings. Specifically, researchers model spiking rates as Gaussian observations, resulting in tractable inference. Recently, GPFA has been extended to model spike count data. However, due to the non-conjugacy of… ▽ More Gaussian process factor analysis (GPFA) is a latent variable modeling technique commonly used to identify smooth, low-dimensional latent trajectories underlying high-dimensional neural recordings. Specifically, researchers model spiking rates as Gaussian observations, resulting in tractable inference. Recently, GPFA has been extended to model spike count data. However, due to the non-conjugacy of the likelihood, the inference becomes intractable. Prior works rely on either black-box inference techniques, numerical integration or polynomial approximations of the likelihood to handle intractability. To overcome this challenge, we propose a conditionally-conjugate Gaussian process factor analysis (ccGPFA) resulting in both analytically and computationally tractable inference for modeling neural activity from spike count data. In particular, we develop a novel data augmentation based method that renders the model conditionally conjugate. Consequently, our model enjoys the advantage of simple closed-form updates using a variational EM algorithm. Furthermore, due to its conditional conjugacy, we show our model can be readily scaled using sparse Gaussian Processes and accelerated inference via natural gradients. To validate our method, we empirically demonstrate its efficacy through experiments. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: 23 pages, 2 figures, ICML

arXiv:2405.11416 [pdf, other]

Discrete-state Continuous-time Diffusion for Graph Generation

Authors: Zhe Xu, Ruizhong Qiu, Yuzhong Chen, Huiyuan Chen, Xiran Fan, Menghai Pan, Zhichen Zeng, Mahashweta Das, Hanghang Tong

Abstract: Graph is a prevalent discrete data structure, whose generation has wide applications such as drug discovery and circuit design. Diffusion generative models, as an emerging research focus, have been applied to graph generation tasks. Overall, according to the space of states and time steps, diffusion generative models can be categorized into discrete-/continuous-state discrete-/continuous-time fash… ▽ More Graph is a prevalent discrete data structure, whose generation has wide applications such as drug discovery and circuit design. Diffusion generative models, as an emerging research focus, have been applied to graph generation tasks. Overall, according to the space of states and time steps, diffusion generative models can be categorized into discrete-/continuous-state discrete-/continuous-time fashions. In this paper, we formulate the graph diffusion generation in a discrete-state continuous-time setting, which has never been studied in previous graph diffusion models. The rationale of such a formulation is to preserve the discrete nature of graph-structured data and meanwhile provide flexible sampling trade-offs between sample quality and efficiency. Analysis shows that our training objective is closely related to generation quality, and our proposed generation framework enjoys ideal invariant/equivariant properties concerning the permutation of node ordering. Our proposed model shows competitive empirical performance against state-of-the-art graph generation solutions on various benchmarks and, at the same time, can flexibly trade off the generation quality and efficiency in the sampling phase. △ Less

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.10959 [pdf, other]

Foundation Models for Education: Promises and Prospects

Authors: Tianlong Xu, Richard Tong, **g Liang, Xing Fan, Haoyang Li, Qingsong Wen

Abstract: With the advent of foundation models like ChatGPT, educators are excited about the transformative role that AI might play in propelling the next education revolution. The develo** speed and the profound impact of foundation models in various industries force us to think deeply about the changes they will make to education, a domain that is critically important for the future of humans. In this p… ▽ More With the advent of foundation models like ChatGPT, educators are excited about the transformative role that AI might play in propelling the next education revolution. The develo** speed and the profound impact of foundation models in various industries force us to think deeply about the changes they will make to education, a domain that is critically important for the future of humans. In this paper, we discuss the strengths of foundation models, such as personalized learning, education inequality, and reasoning capabilities, as well as the development of agent architecture tailored for education, which integrates AI agents with pedagogical frameworks to create adaptive learning environments. Furthermore, we highlight the risks and opportunities of AI overreliance and creativity. Lastly, we envision a future where foundation models in education harmonize human and AI capabilities, fostering a dynamic, inclusive, and adaptive educational ecosystem. △ Less

Submitted 8 April, 2024; originally announced May 2024.

Comments: Accepted by IEEE Intelligent Systems

arXiv:2405.10793 [pdf, other]

CCTNet: A Circular Convolutional Transformer Network for LiDAR-based Place Recognition Handling Movable Objects Occlusion

Authors: Gang Wang, Chaoran Zhu, Qian Xu, Tongzhou Zhang, Hai Zhang, XiaoPeng Fan, Jue Hu

Abstract: Place recognition is a fundamental task for robotic application, allowing robots to perform loop closure detection within simultaneous localization and map** (SLAM), and achieve relocalization on prior maps. Current range image-based networks use single-column convolution to maintain feature invariance to shifts in image columns caused by LiDAR viewpoint change.However, this raises the issues su… ▽ More Place recognition is a fundamental task for robotic application, allowing robots to perform loop closure detection within simultaneous localization and map** (SLAM), and achieve relocalization on prior maps. Current range image-based networks use single-column convolution to maintain feature invariance to shifts in image columns caused by LiDAR viewpoint change.However, this raises the issues such as "restricted receptive fields" and "excessive focus on local regions", degrading the performance of networks. To address the aforementioned issues, we propose a lightweight circular convolutional Transformer network denoted as CCTNet, which boosts performance by capturing structural information in point clouds and facilitating crossdimensional interaction of spatial and channel information. Initially, a Circular Convolution Module (CCM) is introduced, expanding the network's perceptual field while maintaining feature consistency across varying LiDAR perspectives. Then, a Range Transformer Module (RTM) is proposed, which enhances place recognition accuracy in scenarios with movable objects by employing a combination of channel and spatial attention mechanisms. Furthermore, we propose an Overlap-based loss function, transforming the place recognition task from a binary loop closure classification into a regression problem linked to the overlap between LiDAR frames. Through extensive experiments on the KITTI and Ford Campus datasets, CCTNet surpasses comparable methods, achieving Recall@1 of 0.924 and 0.965, and Recall@1% of 0.990 and 0.993 on the test set, showcasing a superior performance. Results on the selfcollected dataset further demonstrate the proposed method's potential for practical implementation in complex scenarios to handle movable objects, showing improved generalization in various datasets. △ Less

Submitted 26 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

arXiv:2405.10721 [pdf, other]

Supermassive black holes are growing slowly by $z\sim5$

Authors: Samuel Lai, Christopher A. Onken, Christian Wolf, Fuyan Bian, Xiaohui Fan

Abstract: We investigate the black hole mass function at $z\sim5$ using XQz5, our recent sample of the most luminous quasars between the redshifts $4.5 < z < 5.3$. We include 72 quasars with black hole masses estimated from velocity-broadened emission-line measurements and single-epoch virial prescriptions in the footprint of a highly complete parent survey. The sample mean Eddington ratio and standard devi… ▽ More We investigate the black hole mass function at $z\sim5$ using XQz5, our recent sample of the most luminous quasars between the redshifts $4.5 < z < 5.3$. We include 72 quasars with black hole masses estimated from velocity-broadened emission-line measurements and single-epoch virial prescriptions in the footprint of a highly complete parent survey. The sample mean Eddington ratio and standard deviation is $\logλ\approx -0.20\pm0.24$. The completeness-corrected mass function is modelled as a double power-law, and we constrain its evolution across redshift assuming accretion-dominated mass growth. We estimate the evolution of the mass function from $z=5-4$, presenting joint constraints on accretion properties through a measured dimensionless e-folding parameter, $k_{\rm{ef}} \equiv \langleλ\rangle U (1-ε)/ε= 1.79\pm0.06$, where $\langleλ\rangle$ is the mean Eddington ratio, $U$ is the duty cycle, and $ε$ is the radiative efficiency. If these supermassive black holes were to form from seeds smaller than $10^8\,M_{\odot}$, the growth rate must have been considerably faster at $z\gg5$ than observed from $z=5-4$. A growth rate exceeding $3\times$ the observed rate would reduce the initial heavy seed mass to $10^{5-6}\,M_{\odot}$, aligning with supermassive star and/or direct collapse seed masses. Stellar mass ($10^2\,M_{\odot}$) black hole seeds would require $\gtrsim4.5\times$ the observed growth rate at $z\gg5$ to reproduce the measured active black hole mass function. A possible pathway to produce the most extreme quasars is radiatively inefficient accretion flow, suggesting black holes with low angular momentum or photon trap** in supercritically accreting thick discs. △ Less

Submitted 17 May, 2024; originally announced May 2024.

Comments: 17 pages, 8 figures, 4 tables, accepted for publication by MNRAS

arXiv:2405.07113 [pdf, other]

Distinguish the EMRI and B-EMRI system by gravitational waves

Authors: Kun Meng, Hongsheng Zhang, Xi-Long Fan, Yuan Yong

Abstract: Binary stars are as frequency as single stars in the Universe, and at least 70\% of the massive stars located in our Galaxy belong to a binary system. For the first time we produce the gravitational waveforms for the extreme mass ratio inspiral systems (EMRIs) of binary stars moving around central supermassive black hole (SBH). We calculate the orbits of such systems via the Hamilton-Jacobi approa… ▽ More Binary stars are as frequency as single stars in the Universe, and at least 70\% of the massive stars located in our Galaxy belong to a binary system. For the first time we produce the gravitational waveforms for the extreme mass ratio inspiral systems (EMRIs) of binary stars moving around central supermassive black hole (SBH). We calculate the orbits of such systems via the Hamilton-Jacobi approach. To improve accuracy we adopt the quadrupole-octupole expression of gravitational wave (GW) and consider the contribution of radiation reaction. Compared to the waveforms of single star, double or multiple spikes emerge in the waveforms of binary stars. We calculate the mismatch of the EMRI waveforms of binary stars and those of single stars, and find the mismatch increases as the binary internal semi-latus rectum $\tilde{p}$ decreases. According to the distinguishable criterion of two signals, it's inferred that the two types of waveforms are distinguishable for small enough $\tilde{p}$. Further, we demonstrate that the most favourable position to distinguish the waveforms of EMRIs of single star from the waveforms of EMRIs of binary stars is that revolutions of the binary stars and the central SBH takes the same direction, i.e., the rotation axis of the binary is parallel to that of the central SBH. △ Less

Submitted 11 May, 2024; originally announced May 2024.

Comments: 16 pages, 10 figures

arXiv:2405.06814 [pdf, other]

Dual-Task Vision Transformer for Rapid and Accurate Intracerebral Hemorrhage Classification on CT Images

Authors: Jialiang Fan, Guoyu Lu, Xinhui Fan, Lucan Li

Abstract: Intracerebral hemorrhage (ICH) is a severe and sudden medical condition caused by the rupture of blood vessels in the brain, leading to permanent damage to brain tissue and often resulting in functional disabilities or death in patients. Diagnosis and analysis of ICH typically rely on brain CT imaging. Given the urgency of ICH conditions, early treatment is crucial, necessitating rapid analysis of… ▽ More Intracerebral hemorrhage (ICH) is a severe and sudden medical condition caused by the rupture of blood vessels in the brain, leading to permanent damage to brain tissue and often resulting in functional disabilities or death in patients. Diagnosis and analysis of ICH typically rely on brain CT imaging. Given the urgency of ICH conditions, early treatment is crucial, necessitating rapid analysis of CT images to formulate tailored treatment plans. However, the complexity of ICH CT images and the frequent scarcity of specialist radiologists pose significant challenges. Therefore, we built a dataset for ICH and normal classification and three types of ICH image classification based on the hemorrhage location, i.e., Deep, Subcortical, and Lobar. In addition, we propose a dual-task vision transformer (DTViT) for the automated classification and diagnosis of ICH images. This neural network utilizes the encoder from ViT, employing attention mechanisms for feature extraction from CT images. We incorporated two multilayer perception (MLP)-based decoders within the network to simultaneously identify the presence of ICH and classify three types of hemorrhage locations. Experimental results demonstrate that our proposed multi-classification network performs well on the built real-world test dataset. The code and dataset for this study will be made publicly available upon paper acceptance at: https://github.com/Jialiangfan/ICH-classification. △ Less

Submitted 18 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

Comments: 9 pages, 4 figure3

arXiv:2405.04289 [pdf, ps, other]

Direct Training High-Performance Deep Spiking Neural Networks: A Review of Theories and Methods

Authors: Chenlin Zhou, Han Zhang, Liutao Yu, Yumin Ye, Zhaokun Zhou, Liwei Huang, Zhengyu Ma, Xiaopeng Fan, Huihui Zhou, Yonghong Tian

Abstract: Spiking neural networks (SNNs) offer a promising energy-efficient alternative to artificial neural networks (ANNs), in virtue of their high biological plausibility, rich spatial-temporal dynamics, and event-driven computation. The direct training algorithms based on the surrogate gradient method provide sufficient flexibility to design novel SNN architectures and explore the spatial-temporal dynam… ▽ More Spiking neural networks (SNNs) offer a promising energy-efficient alternative to artificial neural networks (ANNs), in virtue of their high biological plausibility, rich spatial-temporal dynamics, and event-driven computation. The direct training algorithms based on the surrogate gradient method provide sufficient flexibility to design novel SNN architectures and explore the spatial-temporal dynamics of SNNs. According to previous studies, the performance of models is highly dependent on their sizes. Recently, direct training deep SNNs have achieved great progress on both neuromorphic datasets and large-scale static datasets. Notably, transformer-based SNNs show comparable performance with their ANN counterparts. In this paper, we provide a new perspective to summarize the theories and methods for training deep SNNs with high performance in a systematic and comprehensive way, including theory fundamentals, spiking neuron models, advanced SNN models and residual architectures, software frameworks and neuromorphic hardware, applications, and future trends. The reviewed papers are collected at https://github.com/zhouchenlin2096/Awesome-Spiking-Neural-Networks △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 29 pages

arXiv:2405.03781 [pdf, other]

doi 10.3847/1538-4357/ad488a

Large Scale Overdensity of Lyman Break Galaxies Around the z=6.3 Ultraluminous Quasar J0100+2802

Authors: Maria Pudoka, Feige Wang, Xiaohui Fan, **yi Yang, Jaclyn Champagne, Victoria Jones, Fuyan Bian, Zheng Cai, Linhua Jiang, Dezi Liu, Xue-Bing Wu

Abstract: We study the environment of the z=6.33 ultraluminous quasar SDSS J010013.02+280225.8 (J0100) to understand its association with large-scale structure. Theoretical models propose high-redshift quasars as markers of galaxy overdensities residing in the most massive dark matter halos (DMHs) in the early universe. J0100 is an ultraluminous quasar with the most massive black hole known at z>6, suggesti… ▽ More We study the environment of the z=6.33 ultraluminous quasar SDSS J010013.02+280225.8 (J0100) to understand its association with large-scale structure. Theoretical models propose high-redshift quasars as markers of galaxy overdensities residing in the most massive dark matter halos (DMHs) in the early universe. J0100 is an ultraluminous quasar with the most massive black hole known at z>6, suggesting a high likelihood of residing in a massive DMH. We present wide-field ($\sim$522 square arcminute) imaging in the r-, i-, and z-bands from the Large Binocular Camera on the Large Binocular Telescope, with Y- and J-band imaging from the Wide-field Infrared Camera on the Canada-France-Hawaii Telescope, centered on J0100. Applying color selections, we identify 23 objects as i-droput Lyman Break Galaxy (LBG) candidates in the J0100 field. We use the deep photometric catalog in the 1.27 square degree COSMOS field to calculate the density of LBGs in a blank field, and to estimate the selection completeness and purity. The observed surface density of LBG candidates in the J0100 field corresponds to a galaxy overdensity of $δ$=4 (at 8.4$σ$). This large-scale overdensity suggests that the $\sim$ 22 square arcminute overdensity found by Kashino et al. using JWST data extends out to much larger scales. We calculate the angular auto-correlation function of the candidates and find a positive correlation on $\lesssim$ 10 arcminute scales as well as evidence of asymmetries in their spatial distribution, further suggesting a direct detection of large-scale structure around the ultra-luminous quasar J0100. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 21 pages, 11 figures, 3 tables, to be published in The Astrophysical Journal (ApJ)

arXiv:2405.03100 [pdf, ps, other]

Generalized Einstein-Podolsky-Rosen Steering Paradox

Authors: Zhi-Jie Liu, Xing-Yan Fan, Jie Zhou, Mi Xie, **g-Ling Chen

Abstract: Quantum paradoxes are essential means to reveal the incompatibility between quantum and classical theories, among which the Einstein-Podolsky-Rosen (EPR) steering paradox offers a sharper criterion for the contradiction between local-hidden-state model and quantum mechanics than the usual inequality-based method. In this work, we present a generalized EPR steering paradox, which predicts a contrad… ▽ More Quantum paradoxes are essential means to reveal the incompatibility between quantum and classical theories, among which the Einstein-Podolsky-Rosen (EPR) steering paradox offers a sharper criterion for the contradiction between local-hidden-state model and quantum mechanics than the usual inequality-based method. In this work, we present a generalized EPR steering paradox, which predicts a contradictory equality $2_{Q}=\left( 1+δ\right)_{C}$ ($0\leqδ<1$) given by the quantum ($Q$) and classical ($C$) theories. For any $N$-qubit state in which the conditional state of the steered party is pure, we test the paradox through a two-setting steering protocol, and find that the state is steerable if some specific measurement requirements are satisfied. Moreover, our construction also enlightens the building of EPR steering inequality, which may contribute to some schemes for typical quantum teleportation and quantum key distributions. △ Less

Submitted 5 June, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

Comments: Main: 8 pages, 0 Figure; SM: 35 pages, 0 figure. Revised version

arXiv:2405.02476 [pdf, other]

SSI4IoT: Unlocking the Potential of IoT Tailored Self-Sovereign Identity

Authors: Thusitha Dayaratne, Xinxin Fan, Yuhong Liu, Carsten Rudolph

Abstract: The emerging Self-Sovereign Identity (SSI) techniques, such as Decentralized Identifiers (DIDs) and Verifiable Credentials (VCs), move control of digital identity from conventional identity providers to individuals and lay down the foundation for people, organizations, and things establishing rich digital relationship. The existing applications of SSI mainly focus on creating person-to-person and… ▽ More The emerging Self-Sovereign Identity (SSI) techniques, such as Decentralized Identifiers (DIDs) and Verifiable Credentials (VCs), move control of digital identity from conventional identity providers to individuals and lay down the foundation for people, organizations, and things establishing rich digital relationship. The existing applications of SSI mainly focus on creating person-to-person and person-to-service relationships, whereas person-to-device and device-to-device interactions have been largely overlooked. In this paper, we close this gap by identifying a number of key challenges of applying SSI to the Internet of Things (IoT) and providing a comprehensive taxonomy and usage of VCs in the IoT context with respect to their validity period, trust and interoperability level, and scope of usage. The life-cycle management of VCs as well as various optimization techniques for realizing SSI in IoT environments are also addressed in great detail. This work is a noteworthy step towards massive adoption of SSI for securing existing and future IoT applications in practice. △ Less

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2405.00222 [pdf, other]

Optimized Distribution of Entanglement Graph States in Quantum Networks

Authors: Xiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

Abstract: Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantu… ▽ More Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantum networks lie at the heart of the success of future quantum information technologies. In quantum networks, multipartite entangled states distributed over the network help implement and support many quantum network applications for communications, sensing, and computing. Our work focuses on develo** optimal techniques to generate and distribute multipartite entanglement states efficiently. Prior works on generating general multipartite entanglement states have focused on the objective of minimizing the number of maximally entangled pairs (EPs) while ignoring the heterogeneity of the network nodes and links as well as the stochastic nature of underlying processes. In this work, we develop a hypergraph based linear programming framework that delivers optimal (under certain assumptions) generation schemes for general multipartite entanglement represented by graph states, under the network resources, decoherence, and fidelity constraints, while considering the stochasticity of the underlying processes. We illustrate our technique by develo** generation schemes for the special cases of path and tree graph states, and discuss optimized generation schemes for more general classes of graph states. Using extensive simulations over a quantum network simulator (NetSquid), we demonstrate the effectiveness of our developed techniques and show that they outperform prior known schemes by up to orders of magnitude. △ Less

Submitted 30 April, 2024; originally announced May 2024.

Comments: 11 pages, 13 figures

arXiv:2404.18390 [pdf, other]

Critical grid method: An extensible Smoothed Particle Hydrodynamics fluid general interpolation method for Fluid-Structure Interaction surface coupling based on preCICE

Authors: Sifan Long, Xiaowei Guo, Xiaokang Fan, Canqun Yang

Abstract: Solving Fluid-Structure Interaction (FSI) problems using traditional methods is a big challenge in the field of numerical simulation. As a powerful multi-physical field coupled library, preCICE has a bright application prospect for solving FSI, which supports many open/closed source software and commercial CFD solvers to solve FSI problems in the form of a black box. However, this library currentl… ▽ More Solving Fluid-Structure Interaction (FSI) problems using traditional methods is a big challenge in the field of numerical simulation. As a powerful multi-physical field coupled library, preCICE has a bright application prospect for solving FSI, which supports many open/closed source software and commercial CFD solvers to solve FSI problems in the form of a black box. However, this library currently only supports mesh-based coupling schemes. This paper proposes a critical grid (mesh) as an intermediate medium for the particle method to connect a bidirectional coupling tool named preCICE. The particle and critical mesh are used to interpolate the displacement and force so that the pure Lagrangian Smoothed Particle Hydrodynamic (SPH) method can also solve the FSI problem. This method is called the particle mesh coupling (PMC) method, which theoretically solves the mesh mismatch problem based on the particle method to connect preCICE. In addition, we conduct experiments to verify the performance of the PMC method, in which the fluid and the structure is discretized by SPH and the Finite Element Method (FEM), respectively. The results show that the PMC method given in this paper is effective for solving FSI problems. Finally, our source code for the SPH fluid adapter is open-source and available on GitHub for further develo** preCICE compatibility with more meshless methods. △ Less

Submitted 28 April, 2024; originally announced April 2024.

arXiv:2404.17675 [pdf, other]

Ideal noncrystals: A possible new class of ordered matter without apparent broken symmetry

Authors: Xinyu Fan, Ding Xu, Jianhua Zhang, Hao Hu, Peng Tan, Ning Xu, Hajime Tanaka, Hua Tong

Abstract: Order and disorder constitute two fundamental and opposite themes in condensed matter physics and materials science. Crystals are considered the epitome of order characterized by long-range translational order. The discovery of quasicrystals, with no periodicity but rotational symmetries forbidden for crystals, leads to a paradigm shift in solid-state physics. Moving one step forward, it is intrig… ▽ More Order and disorder constitute two fundamental and opposite themes in condensed matter physics and materials science. Crystals are considered the epitome of order characterized by long-range translational order. The discovery of quasicrystals, with no periodicity but rotational symmetries forbidden for crystals, leads to a paradigm shift in solid-state physics. Moving one step forward, it is intriguing to ask whether ordered matter exists without apparent symmetry breaking. The same question may arise in the pursuit of how ordered amorphous (noncrystalline) solids can be. Here we report the finding of ideal noncrystals in two dimensions, which are disordered in the conventional sense without Bragg peaks but highly ordered according to the steric order. We find that such ideal noncrystals have vibrational modes the same as phonons following the Debye law. The elastic responses are fully affine, which is again characteristic of crystals, and the spatial fluctuations of local volume fractions approach hyperuniformity. Therefore, ideal noncrystals represent an anomalous form of matter with a mixed nature of noncrystalline structure but crystal-like properties. Since such states are found to be thermodynamically favorable, we identify them as a possible new class of ordered matter without apparent broken symmetry. Our results thus extend the scope of the ordered state of matter and may impact the understanding of entropy-driving ordering also in generic amorphous materials. △ Less

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2404.16371 [pdf, other]

Multimodal Information Interaction for Medical Image Segmentation

Authors: Xinxin Fan, Lin Liu, Haoran Zhang

Abstract: The use of multimodal data in assisted diagnosis and segmentation has emerged as a prominent area of interest in current research. However, one of the primary challenges is how to effectively fuse multimodal features. Most of the current approaches focus on the integration of multimodal features while ignoring the correlation and consistency between different modal features, leading to the inclusi… ▽ More The use of multimodal data in assisted diagnosis and segmentation has emerged as a prominent area of interest in current research. However, one of the primary challenges is how to effectively fuse multimodal features. Most of the current approaches focus on the integration of multimodal features while ignoring the correlation and consistency between different modal features, leading to the inclusion of potentially irrelevant information. To address this issue, we introduce an innovative Multimodal Information Cross Transformer (MicFormer), which employs a dual-stream architecture to simultaneously extract features from each modality. Leveraging the Cross Transformer, it queries features from one modality and retrieves corresponding responses from another, facilitating effective communication between bimodal features. Additionally, we incorporate a deformable Transformer architecture to expand the search space. We conducted experiments on the MM-WHS dataset, and in the CT-MRI multimodal image segmentation task, we successfully improved the whole-heart segmentation DICE score to 85.57 and MIoU to 75.51. Compared to other multimodal segmentation techniques, our method outperforms by margins of 2.83 and 4.23, respectively. This demonstrates the efficacy of MicFormer in integrating relevant information between different modalities in multimodal tasks. These findings hold significant implications for multimodal image tasks, and we believe that MicFormer possesses extensive potential for broader applications across various domains. Access to our method is available at https://github.com/fxxJuses/MICFormer △ Less

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16271 [pdf]

True random number generation using metastable 1T' molybdenum ditelluride

Authors: Yang Liu, Pengyu Liu, Yingyi Wen, Zihan Liang, Songwei Liu, Lekai Song, **gfang Pei, Xiaoyue Fan, Teng Ma, Gang Wang, Shuo Gao, Kong-Pang Pun, Xiaolong Chen, Guohua Hu

Abstract: True random numbers play a critical role in secure cryptography. The generation relies on a stable and readily extractable entropy source. Here, from solution-processed structurally metastable 1T' MoTe2, we prove stable output of featureless, stochastic, and yet stable conductance noise at a broad temperature (down to 15 K) with minimal power consumption (down to 0.05 micro-W). Our characterizatio… ▽ More True random numbers play a critical role in secure cryptography. The generation relies on a stable and readily extractable entropy source. Here, from solution-processed structurally metastable 1T' MoTe2, we prove stable output of featureless, stochastic, and yet stable conductance noise at a broad temperature (down to 15 K) with minimal power consumption (down to 0.05 micro-W). Our characterizations and statistical analysis of the characteristics of the conductance noise suggest that the noise arises from the volatility of the stochastic polarization of the underlying ferroelectric dipoles in the 1T' MoTe2. Further, as proved in our experiments and indicated by our Monte Carlo simulation, the ferroelectric dipole polarization is a reliable entropy source with the stochastic polarization persistent and stable over time. Exploiting the conductance noise, we achieve the generation of true random numbers and demonstrate their use in common cryptographic applications, for example, password generation and data encryption. Besides, particularly, we show a privacy safeguarding approach to sensitive data that can be critical for the cryptography of neural networks. We believe our work will bring insights into the understanding of the metastable 1T' MoTe2 and, more importantly, underpin its great potential in secure cryptography. △ Less

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.15963 [pdf, other]

Cosmic Himalayas: The Highest Quasar Density Peak Identified in a 10,000 deg$^2$ Sky with Spatial Discrepancies between Galaxies, Quasars, and IGM HI

Authors: Yongming Liang, Masami Ouchi, Dongsheng Sun, Nobunari Kashikawa, Zheng Cai, Sebastiano Cantalupo, Kentaro Nagamine, Hidenobu Yajima, Takanobu Kirihara, Haibin Zhang, Mingyu Li, Rhythm Shimakawa, Xiaohui Fan, Kei Ito, Masayuki Tanaka, Yuichi Harikane, J. Xavier Prochaska, Andrea Travascio, Weichen Wang, Martin Elvis, Giuseppina Fabbiano, Junya Arita, Masafusa Onoue, John D. Silverman, Dongdong Shi , et al. (5 additional authors not shown)

Abstract: We report the identification of a quasar overdensity in the BOSSJ0210 field, dubbed Cosmic Himalayas, consisting of 11 quasars at $z=2.16-2.20$, the densest overdensity of quasars ($17σ$) in the $\sim$10,000 deg$^2$ of the Sloan Digital Sky Survey. We present the spatial distributions of galaxies and quasars and an HI absorption map of the intergalactic medium (IGM). On the map of 465 galaxies sel… ▽ More We report the identification of a quasar overdensity in the BOSSJ0210 field, dubbed Cosmic Himalayas, consisting of 11 quasars at $z=2.16-2.20$, the densest overdensity of quasars ($17σ$) in the $\sim$10,000 deg$^2$ of the Sloan Digital Sky Survey. We present the spatial distributions of galaxies and quasars and an HI absorption map of the intergalactic medium (IGM). On the map of 465 galaxies selected from the MAMMOTH-Subaru survey, we find two galaxy density peaks that do not fall on the quasar overdensity but instead exist at the northwest and southeast sides, approximately 25 $h^{-1}$ comoving-Mpc apart from the quasar overdensity. With a spatial resolution of 15 $h^{-1}$ comoving Mpc in projection, we produce a three-dimensional HI tomography map by the IGM Ly$α$ forest in the spectra of 23 SDSS/eBOSS quasars behind the quasar overdensity. Surprisingly, the quasar overdensity coincides with neither an absorption peak nor a transmission peak of IGM HI but lies near the border separating opaque and transparent volumes, with the more luminous quasars located in an environment with lesser IGM HI. Hence remarkably, the overdensity region traced by the 11 quasars, albeit all in coherently active states, has no clear coincidence with peaks of galaxies or HI absorption densities. Current physical scenarios with mixtures of HI overdensities and quasar photoionization cannot fully interpret the emergence of Cosmic Himalayas, suggesting this peculiar structure is an excellent laboratory to unveil the interplay between galaxies, quasars, and the IGM. △ Less

Submitted 24 April, 2024; originally announced April 2024.

Comments: 19 pages, 11 figures, submitted to ApJ, comments are welcome

arXiv:2404.15657 [pdf, other]

FedSI: Federated Subnetwork Inference for Efficient Uncertainty Quantification

Authors: Hui Chen, Hengyu Liu, Zhangkai Wu, Xuhui Fan, Longbing Cao

Abstract: While deep neural networks (DNNs) based personalized federated learning (PFL) is demanding for addressing data heterogeneity and shows promising performance, existing methods for federated learning (FL) suffer from efficient systematic uncertainty quantification. The Bayesian DNNs-based PFL is usually questioned of either over-simplified model structures or high computational and memory costs. In… ▽ More While deep neural networks (DNNs) based personalized federated learning (PFL) is demanding for addressing data heterogeneity and shows promising performance, existing methods for federated learning (FL) suffer from efficient systematic uncertainty quantification. The Bayesian DNNs-based PFL is usually questioned of either over-simplified model structures or high computational and memory costs. In this paper, we introduce FedSI, a novel Bayesian DNNs-based subnetwork inference PFL framework. FedSI is simple and scalable by leveraging Bayesian methods to incorporate systematic uncertainties effectively. It implements a client-specific subnetwork inference mechanism, selects network parameters with large variance to be inferred through posterior distributions, and fixes the rest as deterministic ones. FedSI achieves fast and scalable inference while preserving the systematic uncertainties to the fullest extent. Extensive experiments on three different benchmark datasets demonstrate that FedSI outperforms existing Bayesian and non-Bayesian FL baselines in heterogeneous FL scenarios. △ Less

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.15413 [pdf, other]

A Spatially Resolved [CII] Survey of 31 $z\sim7$ Massive Galaxies Hosting Luminous Quasars

Authors: Feige Wang, **yi Yang, Xiaohui Fan, Bram Venemans, Roberto Decarli, Eduardo Bañados, Fabian Walter, Aaron J. Barth, Fuyan Bian, Frederick B. Davies, Anna-Christina Eilers, Emanuele Paolo Farina, Joseph F. Hennawi, Jiang-Tao Li, Chiara Mazzucchelli, Ran Wang, Xue-Bing Wu, Minghao Yue

Abstract: The [CII] 158 $μ$m emission line and the underlying far-infrared (FIR) dust continuum are important tracers for studying star formation and kinematic properties of early galaxies. We present a survey of the [CII] emission lines and FIR continua of 31 luminous quasars at $z>6.5$ using the Atacama Large Millimeter Array (ALMA) and the NOrthern Extended Millimeter Array (NOEMA) at sub-arcsec resoluti… ▽ More The [CII] 158 $μ$m emission line and the underlying far-infrared (FIR) dust continuum are important tracers for studying star formation and kinematic properties of early galaxies. We present a survey of the [CII] emission lines and FIR continua of 31 luminous quasars at $z>6.5$ using the Atacama Large Millimeter Array (ALMA) and the NOrthern Extended Millimeter Array (NOEMA) at sub-arcsec resolution. This survey more than doubles the number of quasars with [CII] and FIR observations at these redshifts and enables statistical studies of quasar host galaxies deep into the epoch of reionization. We detect [CII] emission in 27 quasar hosts with a luminosity range of $L_{\rm [CII]}=(0.3-5.5)\times10^9~L_\odot$ and detect the FIR continuum of 28 quasar hosts with a luminosity range of $L_{\rm FIR}=(0.5-13.0)\times10^{12}~L_\odot$. Both $L_{\rm [CII]}$ and $L_{\rm FIR}$ are correlated ($ρ\simeq0.4$) with the quasar bolometric luminosity, albeit with substantial scatter. The quasar hosts detected by ALMA are clearly resolved with a median diameter of $\sim$5 kpc. About 40% of the quasar host galaxies show a velocity gradient in [CII] emission, while the rest show either dispersion-dominated or disturbed kinematics. Basic estimates of the dynamical masses of the rotation-dominated host galaxies yield $M_{\rm dyn}=(0.1-7.5)\times10^{11}~M_\odot$. Considering our findings alongside those of literature studies, we found that the ratio between $M_{\rm BH}$ and $M_{\rm dyn}$ is about ten times higher than that of local $M_{\rm BH}-M_{\rm dyn}$ relation on average but with substantial scatter (the ratio difference ranging from $\sim$0.6 to 60) and large uncertainties. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: accepted for publication in ApJ

arXiv:2404.13419 [pdf, other]

On Modeling Multi-Criteria Decision Making with Uncertain Information using Probabilistic Rules

Authors: Shengxin Hong, Xiuyi Fan

Abstract: Decision-making processes often involve dealing with uncertainty, which is traditionally addressed through probabilistic models. However, in practical scenarios, assessing probabilities reliably can be challenging, compounded by diverse perceptions of probabilistic information among decision makers. To address this variability and accommodate diverse preferences regarding uncertainty, we introduce… ▽ More Decision-making processes often involve dealing with uncertainty, which is traditionally addressed through probabilistic models. However, in practical scenarios, assessing probabilities reliably can be challenging, compounded by diverse perceptions of probabilistic information among decision makers. To address this variability and accommodate diverse preferences regarding uncertainty, we introduce the Probabilistic Abstract Decision Framework (PADF). PADF offers a structured approach for reasoning across different decision criteria, encompassing the optimistic, pessimistic, and Laplace perspectives, each tailored to distinct perceptions of uncertainty. We illustrate how PADF facilitates the computation of optimal decisions aligned with these criteria by leveraging probabilistic rules. Furthermore, we present strategies for optimizing the computational efficiency of these rules, leveraging appropriate independence assumptions to navigate the extensive search space inherent in PADF. Through these contributions, our framework provides a robust and adaptable tool for effectively navigating the complexities of decision-making under uncertainty. △ Less

Submitted 20 April, 2024; originally announced April 2024.

arXiv:2404.11536 [pdf, other]

FedPFT: Federated Proxy Fine-Tuning of Foundation Models

Authors: Zhaopeng Peng, Xiaoliang Fan, Yufan Chen, Zheng Wang, Shirui Pan, Chenglu Wen, Ruisheng Zhang, Cheng Wang

Abstract: Adapting Foundation Models (FMs) for downstream tasks through Federated Learning (FL) emerges a promising strategy for protecting data privacy and valuable FMs. Existing methods fine-tune FM by allocating sub-FM to clients in FL, however, leading to suboptimal performance due to insufficient tuning and inevitable error accumulations of gradients. In this paper, we propose Federated Proxy Fine-Tuni… ▽ More Adapting Foundation Models (FMs) for downstream tasks through Federated Learning (FL) emerges a promising strategy for protecting data privacy and valuable FMs. Existing methods fine-tune FM by allocating sub-FM to clients in FL, however, leading to suboptimal performance due to insufficient tuning and inevitable error accumulations of gradients. In this paper, we propose Federated Proxy Fine-Tuning (FedPFT), a novel method enhancing FMs adaptation in downstream tasks through FL by two key modules. First, the sub-FM construction module employs a layer-wise compression approach, facilitating comprehensive FM fine-tuning across all layers by emphasizing those crucial neurons. Second, the sub-FM alignment module conducts a two-step distillations-layer-level and neuron-level-before and during FL fine-tuning respectively, to reduce error of gradient by accurately aligning sub-FM with FM under theoretical guarantees. Experimental results on seven commonly used datasets (i.e., four text and three vision) demonstrate the superiority of FedPFT. △ Less

Submitted 28 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

Comments: Accepted by IJCAI'24

arXiv:2404.11437 [pdf, ps, other]

$SO(4)$ Symmetry in Hydrogen Atom with Spin

Authors: Xing-Yan Fan, Xiang-Ru Xie, Sheng-Ming Li, **g-Ling Chen

Abstract: As the simplest atom in nature, the hydrogen atom has been explored thoroughly from the perspective of non-relativistic quantum mechanics to relativistic quantum mechanics. Among the research on hydrogen atom, its energy level is the most basic, which can be obtained more conveniently predicated on the $SO(4)$ symmetry than the wave-equation resolution. Moreover, ``spin'' is another indispensable… ▽ More As the simplest atom in nature, the hydrogen atom has been explored thoroughly from the perspective of non-relativistic quantum mechanics to relativistic quantum mechanics. Among the research on hydrogen atom, its energy level is the most basic, which can be obtained more conveniently predicated on the $SO(4)$ symmetry than the wave-equation resolution. Moreover, ``spin'' is another indispensable topic in quantum mechanics, appearing as an intrinsic degree of freedom. In this work, we generalize the quantum Runge-Lenz vector to a spin-dependent one, and then extract a novel Hamiltonian of hydrogen atom with spin based on the requirement of $SO(4)$ symmetry. Furthermore, the energy spectrum of hydrogen atom with spin potentials is also determined by the remarkable approach of $SO(4)$ symmetry. Our findings extend the ground of hydrogen atom, and may contribute to other complicated models based on hydrogen atom. △ Less

Submitted 17 April, 2024; originally announced April 2024.

Comments: 7 pages, 0 figure

arXiv:2404.10253 [pdf, other]

Kilometer-Level Coupled Modeling Using 40 Million Cores: An Eight-Year Journey of Model Development

Authors: Xiaohui Duan, Yuxuan Li, Zhao Liu, Bin Yang, Juepeng Zheng, Haohuan Fu, Shaoqing Zhang, Shiming Xu, Yang Gao, Wei Xue, Di Wei, Xiao**g Lv, Lifeng Yan, Haopeng Huang, Haitian Lu, Lingfeng Wan, Haoran Lin, Qixin Chang, Chenlin Li, Quanjie He, Zeyu Song, Xuantong Wang, Yangyang Yu, Xilong Fan, Zhaopeng Qu , et al. (16 additional authors not shown)

Abstract: With current and future leading systems adopting heterogeneous architectures, adapting existing models for heterogeneous supercomputers is of urgent need for improving model resolution and reducing modeling uncertainty. This paper presents our three-week effort on porting a complex earth system model, CESM 2.2, to a 40-million-core Sunway supercomputer. Taking a non-intrusive approach that tries t… ▽ More With current and future leading systems adopting heterogeneous architectures, adapting existing models for heterogeneous supercomputers is of urgent need for improving model resolution and reducing modeling uncertainty. This paper presents our three-week effort on porting a complex earth system model, CESM 2.2, to a 40-million-core Sunway supercomputer. Taking a non-intrusive approach that tries to minimizes manual code modifications, our project tries to achieve both improvement of performance and consistency of the model code. By using a hierarchical grid system and an OpenMP-based offloading toolkit, our porting and parallelization effort covers over 80% of the code, and achieves a simulation speed of 340 SDPD (simulated days per day) for 5-km atmosphere, 265 SDPD for 3-km ocean, and 222 SDPD for a coupled model, thus making multi-year or even multi-decadal experiments at such high resolution possible. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 18 pages, 13 figures

arXiv:2404.06276 [pdf]

Extending the Defect Tolerance of Halide Perovskite Nanocrystals to Hot Carrier Cooling Dynamics

Authors: Junzhi Ye, Navendu Mondal, Ben P. Carwithen, Yunwei Zhang, Linjie Dai, Xiangbin Fan, Jian Mao, Zhiqiang Cui, Pratyush Ghosh, Clara Otero Martinez, Lars van Turnhout, Zhongzheng Yu, Ziming Chen, Neil C. Greenham, Samuel D. Stranks, Lakshminarayana Polavarapu, Artem Bakulin, Akshay Rao, Robert L. Z. Hoye

Abstract: Defect tolerance is a critical enabling factor for efficient lead-halide perovskite materials, but the current understanding is primarily on band-edge (cold) carriers, with significant debate over whether hot carriers (HCs) can also exhibit defect tolerance. Here, this important gap in the field is addressed by investigating how internationally-introduced traps affect HC relaxation in CsPbX3 nanoc… ▽ More Defect tolerance is a critical enabling factor for efficient lead-halide perovskite materials, but the current understanding is primarily on band-edge (cold) carriers, with significant debate over whether hot carriers (HCs) can also exhibit defect tolerance. Here, this important gap in the field is addressed by investigating how internationally-introduced traps affect HC relaxation in CsPbX3 nanocrystals (X = Br, I, or mixture). Using femtosecond interband and intraband spectroscopy, along with energy-dependent photoluminescence measurements and kinetic modelling, it is found that HCs are not universally defect tolerant in CsPbX3, but are strongly correlated to the defect tolerance of cold carriers, requiring shallow traps to be present (as in CsPbI3). It is found that HCs are directly captured by traps, instead of going through an intermediate cold carrier, and deeper traps cause faster HC cooling, reducing the effects of the hot phonon bottleneck and Auger reheating. This work provides important insights into how defects influence HCs, which will be important for designing materials for hot carrier solar cells, multiexciton generation, and optical gain media. △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: 18 pages, 5 figures

arXiv:2404.04048 [pdf, other]

Revealing the Boundary between Quantum Mechanics and Classical Model by EPR-Steering Inequality

Authors: Ruo-Chen Wang, Zhuo-Chen Li, Xing-Yan Fan, Xiang-Ru Xie, Hong-Hao Wei, Choo Hiap Oh, **g-Ling Chen

Abstract: In quantum information, the Werner state is a benchmark to test the boundary between quantum mechanics and classical models. There have been three well-known critical values for the two-qubit Werner state, i.e., $V_{\rm c}^{\rm E}=1/3$ characterizing the boundary between entanglement and separable model, $V_{\rm c}^{\rm B}=1/K_G(3)$ characterizing the boundary between Bell's nonlocality and the lo… ▽ More In quantum information, the Werner state is a benchmark to test the boundary between quantum mechanics and classical models. There have been three well-known critical values for the two-qubit Werner state, i.e., $V_{\rm c}^{\rm E}=1/3$ characterizing the boundary between entanglement and separable model, $V_{\rm c}^{\rm B}=1/K_G(3)$ characterizing the boundary between Bell's nonlocality and the local-hidden-variable model, while $V_{\rm c}^{\rm S}=1/2$ characterizing the boundary between Einstein-Podolsky-Rosen (EPR) steering and the local-hidden-state model. So far, the problem of $V_{\rm c}^{\rm E}=1/3$ has been completely solved by an inequality involving in the positive-partial-transpose criterion, while how to reveal the other two critical values by the inequality approach are still open. In this work, we focus on EPR steering, which is a form of quantum nonlocality intermediate between entanglement and Bell's nonlocality. By proposing the optimal $N$-setting linear EPR-steering inequalities, we have successfully obtained the desired value $V_{\rm c}^{\rm S}=1/2$ for the two-qubit Werner state, thus resolving the long-standing problem. △ Less

Submitted 9 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

Comments: Main text: 6 pages, 2 figures; SM: 8 pages, 5 figures. Adding some references

arXiv:2404.01174 [pdf, other]

SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding

Authors: Wenrui Li, Xiaopeng Hong, Ruiqin Xiong, Xiaopeng Fan

Abstract: Temporal video grounding (TVG) is a critical task in video content understanding, requiring precise alignment between video content and natural language instructions. Despite significant advancements, existing methods face challenges in managing confidence bias towards salient objects and capturing long-term dependencies in video sequences. To address these issues, we introduce SpikeMba: a multi-m… ▽ More Temporal video grounding (TVG) is a critical task in video content understanding, requiring precise alignment between video content and natural language instructions. Despite significant advancements, existing methods face challenges in managing confidence bias towards salient objects and capturing long-term dependencies in video sequences. To address these issues, we introduce SpikeMba: a multi-modal spiking saliency mamba for temporal video grounding. Our approach integrates Spiking Neural Networks (SNNs) with state space models (SSMs) to leverage their unique advantages in handling different aspects of the task. Specifically, we use SNNs to develop a spiking saliency detector that generates the proposal set. The detector emits spike signals when the input signal exceeds a predefined threshold, resulting in a dynamic and binary saliency proposal set. To enhance the model's capability to retain and infer contextual information, we introduce relevant slots which learnable tensors that encode prior knowledge. These slots work with the contextual moment reasoner to maintain a balance between preserving contextual information and exploring semantic relevance dynamically. The SSMs facilitate selective information propagation, addressing the challenge of long-term dependency in video content. By combining SNNs for proposal generation and SSMs for effective contextual reasoning, SpikeMba addresses confidence bias and long-term dependencies, thereby significantly enhancing fine-grained multimodal relationship capture. Our experiments demonstrate the effectiveness of SpikeMba, which consistently outperforms state-of-the-art methods across mainstream benchmarks. △ Less

Submitted 23 May, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

arXiv:2404.00937 [pdf, ps, other]

The Interaction Between Stars and Past AGN Disk: Possible Explanation for the Kinematic Distributions of S-stars in the Galactic Center

Authors: Xiao Fan, Qingwen Wu, Jiancheng Wu, Xiangli Lei, Mengye Wang, Fulin Li

Abstract: The presence of young stars, aged around several million years and situated within the range of $\sim 0.04-1$ pc from our Galactic center raises a question about their origins and dynamical evolutions. Their kinematics provide an opportunity to explore their formation or possible subsequent dynamical evolution. If Sagittarius A* was active in the past as suggested by several observations, the accr… ▽ More The presence of young stars, aged around several million years and situated within the range of $\sim 0.04-1$ pc from our Galactic center raises a question about their origins and dynamical evolutions. Their kinematics provide an opportunity to explore their formation or possible subsequent dynamical evolution. If Sagittarius A* was active in the past as suggested by several observations, the accretion disk may have a significant impact on the dynamics of stars in the Galactic center. The drag force exerted on stars during star-disk interaction could lead some of them to sink into the accretion disk, and these embedded stars will rapidly migrate inward and eventually be disrupted within $\sim10^5$ yr. This could roughly explain the absence of stars within $2.5 \times 10^4 R_{\rm g}$ ($\sim$ 1000 au). Additionally, Kozai-Lidov oscillations, induced by the gravitational perturbation of the disk, could contribute to the bimodal distribution of S-star inclinations and drive a majority of stars into high eccentricity orbits. △ Less

Submitted 26 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: accepted by ApJ, comments are welcome

arXiv:2403.20156 [pdf, other]

CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

Authors: Hei Yi Mak, Flint Xiaofeng Fan, Luca A. Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer

Abstract: In this study, we delve into Federated Reinforcement Learning (FedRL) in the context of value-based agents operating across diverse Markov Decision Processes (MDPs). Existing FedRL methods typically aggregate agents' learning by averaging the value functions across them to improve their performance. However, this aggregation strategy is suboptimal in heterogeneous environments where agents converg… ▽ More In this study, we delve into Federated Reinforcement Learning (FedRL) in the context of value-based agents operating across diverse Markov Decision Processes (MDPs). Existing FedRL methods typically aggregate agents' learning by averaging the value functions across them to improve their performance. However, this aggregation strategy is suboptimal in heterogeneous environments where agents converge to diverse optimal value functions. To address this problem, we introduce the Convergence-AwarE SAmpling with scReening (CAESAR) aggregation scheme designed to enhance the learning of individual agents across varied MDPs. CAESAR is an aggregation strategy used by the server that combines convergence-aware sampling with a screening mechanism. By exploiting the fact that agents learning in identical MDPs are converging to the same optimal value function, CAESAR enables the selective assimilation of knowledge from more proficient counterparts, thereby significantly enhancing the overall learning efficiency. We empirically validate our hypothesis and demonstrate the effectiveness of CAESAR in enhancing the learning efficiency of agents, using both a custom-built GridWorld environment and the classical FrozenLake-v1 task, each presenting varying levels of environmental heterogeneity. △ Less

Submitted 16 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

Showing 1–50 of 1,411 results for author: Fan, X