Search | arXiv e-print repository

arXiv:2405.20727 [pdf, other]

GANcrop: A Contrastive Defense Against Backdoor Attacks in Federated Learning

Authors: Xiaoyun Gan, Shanyu Gan, Taizhi Su, Peng Liu

Abstract: With heightened awareness of data privacy protection, Federated Learning (FL) has attracted widespread attention as a privacy-preserving distributed machine learning method. However, the distributed nature of federated learning also provides opportunities for backdoor attacks, where attackers can guide the model to produce incorrect predictions without affecting the global model training process.… ▽ More With heightened awareness of data privacy protection, Federated Learning (FL) has attracted widespread attention as a privacy-preserving distributed machine learning method. However, the distributed nature of federated learning also provides opportunities for backdoor attacks, where attackers can guide the model to produce incorrect predictions without affecting the global model training process. This paper introduces a novel defense mechanism against backdoor attacks in federated learning, named GANcrop. This approach leverages contrastive learning to deeply explore the disparities between malicious and benign models for attack identification, followed by the utilization of Generative Adversarial Networks (GAN) to recover backdoor triggers and implement targeted mitigation strategies. Experimental findings demonstrate that GANcrop effectively safeguards against backdoor attacks, particularly in non-IID scenarios, while maintaining satisfactory model accuracy, showcasing its remarkable defensive efficacy and practical utility. △ Less

Submitted 31 May, 2024; originally announced May 2024.

arXiv:2405.07233 [pdf, other]

OXYGENERATOR: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning

Authors: Bin Lu, Ze Zhao, Luyu Han, Xiaoying Gan, Yuntao Zhou, Lei Zhou, Luoyi Fu, Xinbing Wang, Chenghu Zhou, **g Zhang

Abstract: Accurately reconstructing the global ocean deoxygenation over a century is crucial for assessing and protecting marine ecosystem. Existing expert-dominated numerical simulations fail to catch up with the dynamic variation caused by global warming and human activities. Besides, due to the high-cost data collection, the historical observations are severely sparse, leading to big challenge for precis… ▽ More Accurately reconstructing the global ocean deoxygenation over a century is crucial for assessing and protecting marine ecosystem. Existing expert-dominated numerical simulations fail to catch up with the dynamic variation caused by global warming and human activities. Besides, due to the high-cost data collection, the historical observations are severely sparse, leading to big challenge for precise reconstruction. In this work, we propose OxyGenerator, the first deep learning based model, to reconstruct the global ocean deoxygenation from 1920 to 2023. Specifically, to address the heterogeneity across large temporal and spatial scales, we propose zoning-varying graph message-passing to capture the complex oceanographic correlations between missing values and sparse observations. Additionally, to further calibrate the uncertainty, we incorporate inductive bias from dissolved oxygen (DO) variations and chemical effects. Compared with in-situ DO observations, OxyGenerator significantly outperforms CMIP6 numerical simulations, reducing MAPE by 38.77%, demonstrating a promising potential to understand the "breathless ocean" in data-driven manner. △ Less

Submitted 12 May, 2024; originally announced May 2024.

Comments: Accepted to ICML 2024

arXiv:2404.08584 [pdf, other]

Pathological Primitive Segmentation Based on Visual Foundation Model with Zero-Shot Mask Generation

Authors: Abu Bakor Hayat Arnob, Xiangxue Wang, Yi** Jiao, Xiao Gan, Wenlong Ming, Jun Xu

Abstract: Medical image processing usually requires a model trained with carefully crafted datasets due to unique image characteristics and domain-specific challenges, especially in pathology. Primitive detection and segmentation in digitized tissue samples are essential for objective and automated diagnosis and prognosis of cancer. SAM (Segment Anything Model) has recently been developed to segment general… ▽ More Medical image processing usually requires a model trained with carefully crafted datasets due to unique image characteristics and domain-specific challenges, especially in pathology. Primitive detection and segmentation in digitized tissue samples are essential for objective and automated diagnosis and prognosis of cancer. SAM (Segment Anything Model) has recently been developed to segment general objects from natural images with high accuracy, but it requires human prompts to generate masks. In this work, we present a novel approach that adapts pre-trained natural image encoders of SAM for detection-based region proposals. Regions proposed by a pre-trained encoder are sent to cascaded feature propagation layers for projection. Then, local semantic and global context is aggregated from multi-scale for bounding box localization and classification. Finally, the SAM decoder uses the identified bounding boxes as essential prompts to generate a comprehensive primitive segmentation map. The entire base framework, SAM, requires no additional training or fine-tuning but could produce an end-to-end result for two fundamental segmentation tasks in pathology. Our method compares with state-of-the-art models in F1 score for nuclei detection and binary/multiclass panoptic(bPQ/mPQ) and mask quality(dice) for segmentation quality on the PanNuke dataset while offering end-to-end efficiency. Our model also achieves remarkable Average Precision (+4.5%) on the secondary dataset (HuBMAP Kidney) compared to Faster RCNN. The code is publicly available at https://github.com/learner-codec/autoprom_sam. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: 2024 IEEE International Symposium on Biomedical Imaging

ACM Class: I.4.6; I.2

arXiv:2404.04969 [pdf, other]

Temporal Generalization Estimation in Evolving Graphs

Authors: Bin Lu, Tingyan Ma, Xiaoying Gan, Xinbing Wang, Yunqiang Zhu, Chenghu Zhou, Shiyu Liang

Abstract: Graph Neural Networks (GNNs) are widely deployed in vast fields, but they often struggle to maintain accurate representations as graphs evolve. We theoretically establish a lower bound, proving that under mild conditions, representation distortion inevitably occurs over time. To estimate the temporal distortion without human annotation after deployment, one naive approach is to pre-train a recurre… ▽ More Graph Neural Networks (GNNs) are widely deployed in vast fields, but they often struggle to maintain accurate representations as graphs evolve. We theoretically establish a lower bound, proving that under mild conditions, representation distortion inevitably occurs over time. To estimate the temporal distortion without human annotation after deployment, one naive approach is to pre-train a recurrent model (e.g., RNN) before deployment and use this model afterwards, but the estimation is far from satisfactory. In this paper, we analyze the representation distortion from an information theory perspective, and attribute it primarily to inaccurate feature extraction during evolution. Consequently, we introduce Smart, a straightforward and effective baseline enhanced by an adaptive feature extractor through self-supervised graph reconstruction. In synthetic random graphs, we further refine the former lower bound to show the inevitable distortion over time and empirically observe that Smart achieves good estimation performance. Moreover, we observe that Smart consistently shows outstanding generalization estimation on four real-world evolving graphs. The ablation studies underscore the necessity of graph reconstruction. For example, on OGB-arXiv dataset, the estimation metric MAPE deteriorates from 2.19% to 8.00% without reconstruction. △ Less

Submitted 7 April, 2024; originally announced April 2024.

Comments: Published as a conference paper at ICLR 2024

arXiv:2403.08343 [pdf, ps, other]

Coverage and Rate Analysis for Integrated Sensing and Communication Networks

Authors: Xu Gan, Chongwen Huang, Zhaohui Yang, Xiaoming Chen, Jiguang He, Zhaoyang Zhang, Chau Yuen, Yong Liang Guan, Mérouane Debbah

Abstract: Integrated sensing and communication (ISAC) is increasingly recognized as a pivotal technology for next-generation cellular networks, offering mutual benefits in both sensing and communication capabilities. This advancement necessitates a re-examination of the fundamental limits within networks where these two functions coexist via shared spectrum and infrastructures. However, traditional stochast… ▽ More Integrated sensing and communication (ISAC) is increasingly recognized as a pivotal technology for next-generation cellular networks, offering mutual benefits in both sensing and communication capabilities. This advancement necessitates a re-examination of the fundamental limits within networks where these two functions coexist via shared spectrum and infrastructures. However, traditional stochastic geometry-based performance analyses are confined to either communication or sensing networks separately. This paper bridges this gap by introducing a generalized stochastic geometry framework in ISAC networks. Based on this framework, we define and calculate the coverage and ergodic rate of sensing and communication performance under resource constraints. Then, we shed light on the fundamental limits of ISAC networks by presenting theoretical results for the coverage rate of the unified performance, taking into account the coupling effects of dual functions in coexistence networks. Further, we obtain the analytical formulations for evaluating the ergodic sensing rate constrained by the maximum communication rate, and the ergodic communication rate constrained by the maximum sensing rate. Extensive numerical results validate the accuracy of all theoretical derivations, and also indicate that denser networks significantly enhance ISAC coverage. Specifically, increasing the base station density from $1$ $\text{km}^{-2}$ to $10$ $\text{km}^{-2}$ can boost the ISAC coverage rate from $1.4\%$ to $39.8\%$. Further, results also reveal that with the increase of the constrained sensing rate, the ergodic communication rate improves significantly, but the reverse is not obvious. △ Less

Submitted 22 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

arXiv:2403.02576 [pdf, other]

AceMap: Knowledge Discovery through Academic Graph

Authors: Xinbing Wang, Luoyi Fu, Xiaoying Gan, Ying Wen, Guanjie Zheng, Jiaxin Ding, Liyao Xiang, Nanyang Ye, Meng **, Shiyu Liang, Bin Lu, Haiwen Wang, Yi Xu, Cheng Deng, Shao Zhang, Huquan Kang, Xingli Wang, Qi Li, Zhixin Guo, Jiexing Qi, Pan Liu, Yuyang Ren, Lyuwen Wu, Jungang Yang, Jian** Zhou , et al. (1 additional authors not shown)

Abstract: The exponential growth of scientific literature requires effective management and extraction of valuable insights. While existing scientific search engines excel at delivering search results based on relational databases, they often neglect the analysis of collaborations between scientific entities and the evolution of ideas, as well as the in-depth analysis of content within scientific publicatio… ▽ More The exponential growth of scientific literature requires effective management and extraction of valuable insights. While existing scientific search engines excel at delivering search results based on relational databases, they often neglect the analysis of collaborations between scientific entities and the evolution of ideas, as well as the in-depth analysis of content within scientific publications. The representation of heterogeneous graphs and the effective measurement, analysis, and mining of such graphs pose significant challenges. To address these challenges, we present AceMap, an academic system designed for knowledge discovery through academic graph. We present advanced database construction techniques to build the comprehensive AceMap database with large-scale academic entities that contain rich visual, textual, and numerical information. AceMap also employs innovative visualization, quantification, and analysis methods to explore associations and logical relationships among academic entities. AceMap introduces large-scale academic network visualization techniques centered on nebular graphs, providing a comprehensive view of academic networks from multiple perspectives. In addition, AceMap proposes a unified metric based on structural entropy to quantitatively measure the knowledge content of different academic entities. Moreover, AceMap provides advanced analysis capabilities, including tracing the evolution of academic ideas through citation relationships and concept co-occurrence, and generating concise summaries informed by this evolutionary process. In addition, AceMap uses machine reading methods to generate potential new ideas at the intersection of different fields. Exploring the integration of large language models and knowledge graphs is a promising direction for future research in idea evolution. Please visit \url{https://www.acemap.info} for further exploration. △ Less

Submitted 14 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Comments: Technical Report for AceMap (https://www.acemap.info)

arXiv:2403.01434 [pdf]

Microcavity induced by few-layer GaSe crystal on silicon photonic crystal waveguide for efficient optical frequency conversion

Authors: Xiaoqing Chen, Yanyan Zhang, Yingke Ji, Yu Zhang, Jianguo Wang, Xianghu Wu, Chenyang Zhao, Liang Fang, Biqiang Jiang, Jianlin Zhao, Xuetao Gan

Abstract: We demonstrate the post-induction of high-quality microcavity on silicon photonic crystal (PC) waveguide by integrating few-layer GaSe crystal, which promises highly efficient on-chip optical frequency conversions. The integration of GaSe shifts the dispersion bands of the PC waveguide mode into the bandgap, resulting in localized modes confined by the bare PC waveguides. Thanks to the small contr… ▽ More We demonstrate the post-induction of high-quality microcavity on silicon photonic crystal (PC) waveguide by integrating few-layer GaSe crystal, which promises highly efficient on-chip optical frequency conversions. The integration of GaSe shifts the dispersion bands of the PC waveguide mode into the bandgap, resulting in localized modes confined by the bare PC waveguides. Thanks to the small contrast of refractive index at the boundaries of microcavity, it is reliably to obtain quality (Q) factors exceeding 10^4. With the enhanced light-GaSe interaction by the microcavity modes and high second-order nonlinearity of GaSe, remarkable second-harmonic generation (SHG) and sum-frequency generation (SFG) are achieved. A record-high on-chip SHG conversion efficiency of 131100% W^-1 is obtained, enabling the clear SHG imaging of the resonant modes with the pump of sub-milliwatts continuous-wave (CW) laser. Driven by a pump of on-resonance CW laser, strong SFGs are successfully carried out with the other pump of a CW laser spanning over the broad telecom-band. Broadband frequency conversion of an incoherent superluminescent light-emitting diode with low spectral power density is also realized in the integrated GaSe-PC waveguide. Our results are expected to provide new strategies for high-efficiency light-matter interactions, nonlinear photonics and light source generation in silicon photonic integrated circuits. △ Less

Submitted 3 March, 2024; originally announced March 2024.

Comments: 5 figures

arXiv:2402.15975 [pdf]

Nonlinear photodetector based on InSe p-n homojunction for improving spatial imaging resolution

Authors: Yu Zhang, Xiaoqing Chen, Mingwen Zhang, Xianghu Wu, Jianguo Wang, Ruijuan Tian, Liang Fang, Yanyan Zhang, Jianlin Zhao, Xuetao Gan

Abstract: We demonstrate an efficient nonlinear photodetector (NLPD) with quadratic response based on a few-layer InSe p-n homojunction, which is beneficial from the strong second harmonic generation (SHG) process in InSe and effective harvest of photocarriers actuated by the high-quality homojunction. The NLPD can sense light with photon energy smaller than InSe electronic bandgap because the SHG process i… ▽ More We demonstrate an efficient nonlinear photodetector (NLPD) with quadratic response based on a few-layer InSe p-n homojunction, which is beneficial from the strong second harmonic generation (SHG) process in InSe and effective harvest of photocarriers actuated by the high-quality homojunction. The NLPD can sense light with photon energy smaller than InSe electronic bandgap because the SHG process in InSe doubles the frequency of incident light, extending InSe photodetection wavelength range to 1750 nm. The InSe p-n homojunction, which is electrostatically doped by two split back gates, presents a rectification ratio exceeding 106 with a dark current down to 2 pA and a high normalized responsivity of 0.534 A/W2 for the telecom-band pulsed light at 1550 nm. The photocurrents of the SHG-assisted photodetection have a quadratic dependence on the optical powers, making the NLPD highly sensitive to light intensity variation with improved spatial resolution. As examples, the NLPD is employed to precisely determine the localization point of a focused laser beam waist and implement spatial imaging with an improved resolution compared with the linear photodetector. These features highlight the potential of the proposed NLPD in develo** advanced optical sensing and imaging systems. △ Less

Submitted 24 February, 2024; originally announced February 2024.

arXiv:2402.15196 [pdf]

Compact on-chip power splitter based on topological photonic crystal

Authors: Puhui Zhang, Jiacheng Zhang, Linpeng Gu, Liang Fang, Yanyan Zhang, Jianlin ZHao, Xuetao Gan

Abstract: We propose and demonstrate an on-chip 1*N power splitter based on topological photonic crystal (TPC) on a monolithic silicon photonic platform. Benefiting from the valley-locked propagation mode at the interface of TPCs with different topological phases, the proposed power splitter has negligible backscattering around the sharp bendings and good robustness to fabrication defects, which therefore e… ▽ More We propose and demonstrate an on-chip 1*N power splitter based on topological photonic crystal (TPC) on a monolithic silicon photonic platform. Benefiting from the valley-locked propagation mode at the interface of TPCs with different topological phases, the proposed power splitter has negligible backscattering around the sharp bendings and good robustness to fabrication defects, which therefore enable lower insertion loss, better uniformity, and more compact footprint than the conventional designs. For the fabricated 1*2 (8) power splitter, the uniformity among the output ports is below 0.35 (0.65) dB and the maximum insertion loss is 0.38 (0.58) dB with compact footprint of 5*5 um2 (10*12 um2) within a bandwidth of 70 nm. In addition, the topological power splitter only requires simple configurations of TPCs with different topological phases, which is more reliable in design and fabrication compared with the conventional designs. △ Less

Submitted 23 February, 2024; originally announced February 2024.

Comments: 8 pages,4 figures

arXiv:2402.10593 [pdf, other]

Bayesian Learning for Double-RIS Aided ISAC Systems with Superimposed Pilots and Data

Authors: Xu Gan, Chongwen Huang, Zhaohui Yang, Caijun Zhong, Xiaoming Chen, Zhaoyang Zhang, Qinghua Guo, Chau Yuen, Merouane Debbah

Abstract: Reconfigurable intelligent surface (RIS) has great potential to improve the performance of integrated sensing and communication (ISAC) systems, especially in scenarios where line-of-sight paths between the base station and users are blocked. However, the spectral efficiency (SE) of RIS-aided ISAC uplink transmissions may be drastically reduced by the heavy burden of pilot overhead for realizing se… ▽ More Reconfigurable intelligent surface (RIS) has great potential to improve the performance of integrated sensing and communication (ISAC) systems, especially in scenarios where line-of-sight paths between the base station and users are blocked. However, the spectral efficiency (SE) of RIS-aided ISAC uplink transmissions may be drastically reduced by the heavy burden of pilot overhead for realizing sensing capabilities. In this paper, we tackle this bottleneck by proposing a superimposed symbol scheme, which superimposes sensing pilots onto data symbols over the same time-frequency resources. Specifically, we develop a structure-aware sparse Bayesian learning framework, where decoded data symbols serve as side information to enhance sensing performance and increase SE. To meet the low-latency requirements of emerging ISAC applications, we further propose a low-complexity simultaneous communication and localization algorithm for multiple users. This algorithm employs the unitary approximate message passing in the Bayesian learning framework for initial angle estimate, followed by iterative refinements through reduced-dimension matrix calculations. Moreover, the sparse code multiple access technology is incorporated into this iterative framework for accurate data detection which also facilitates localization. Numerical results show that the proposed superimposed symbol-based scheme empowered by the developed algorithm can achieve centimeter-level localization while attaining up to $96\%$ of the SE of conventional communications without sensing capabilities. Moreover, compared to other typical ISAC schemes, the proposed superimposed symbol scheme can provide an effective throughput improvement over $133\%$. △ Less

Submitted 16 February, 2024; originally announced February 2024.

arXiv:2312.13975 [pdf, other]

A Joint Communication and Computation Design for Semantic Wireless Communication with Probability Graph

Authors: Zhouxiang Zhao, Zhaohui Yang, Xu Gan, Quoc-Viet Pham, Chongwen Huang, Wei Xu, Zhaoyang Zhang

Abstract: In this paper, we delve into the challenge of optimizing joint communication and computation for semantic communication over wireless networks using a probability graph framework. In the considered model, the base station (BS) extracts the small-sized compressed semantic information through removing redundant messages based on the stored knowledge base. Specifically, the knowledge base is encapsul… ▽ More In this paper, we delve into the challenge of optimizing joint communication and computation for semantic communication over wireless networks using a probability graph framework. In the considered model, the base station (BS) extracts the small-sized compressed semantic information through removing redundant messages based on the stored knowledge base. Specifically, the knowledge base is encapsulated in a probability graph that encapsulates statistical relations. At the user side, the compressed information is accurately deduced using the same probability graph employed by the BS. While this approach introduces an additional computational overhead for semantic information extraction, it significantly curtails communication resource consumption by transmitting concise data. We derive both communication and computation cost models based on the inference process of the probability graph. Building upon these models, we introduce a joint communication and computation resource allocation problem aimed at minimizing the overall energy consumption of the network, while accounting for latency, power, and semantic constraints. To address this problem, we obtain a closed-form solution for transmission power under a fixed semantic compression ratio. Subsequently, we propose an efficient linear search-based algorithm to attain the optimal solution for the considered problem with low computational complexity. Simulation results underscore the effectiveness of our proposed system, showcasing notable improvements compared to conventional non-semantic schemes. △ Less

Submitted 22 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2310.00015

arXiv:2312.06157 [pdf]

doi 10.1002/adfm.202310811

Approaching the robust linearity in dual-floating van der Waals photodiode

Authors: **peng Xu, Xiaoguang Luo, Xi Lin, Xi Zhang, Fan Liu, Yuting Yan, Siqi Hu, Mingwen Zhang, Nannan Han, Xuetao Gan, Yingchun Cheng, Wei Huang

Abstract: Two-dimensional (2D) material photodetectors have gained great attention as potential elements for optoelectronic applications. However, the linearity of the photoresponse is often compromised by the carrier interaction, even in 2D photodiodes. In this study, we present a new device concept of dual-floating van der Waals heterostructures (vdWHs) photodiode by employing ambipolar MoTe2 and n-type M… ▽ More Two-dimensional (2D) material photodetectors have gained great attention as potential elements for optoelectronic applications. However, the linearity of the photoresponse is often compromised by the carrier interaction, even in 2D photodiodes. In this study, we present a new device concept of dual-floating van der Waals heterostructures (vdWHs) photodiode by employing ambipolar MoTe2 and n-type MoS2 2D semiconductors. The presence of type II heterojunctions on both sides of channel layers effectively deplete carriers and restrict the photocarrier trap** within the channel layers. As a result, the device exhibits robust linear photoresponse under photovoltaic mode from the visible (405 nm) to near-infrared (1600 nm) band. With the built-in electric field of the vdWHs, we achieve a linear dynamic range of ~ 100 dB, responsivity of ~ 1.57 A/W, detectivity of ~ 4.28 * 10^11 Jones, and response speed of ~ 30 μs. Our results showcase a promising device concept with excellent linearity towards fast and low-loss detection, high-resolution imaging, and logic optoelectronics. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: 29 pages, 5 figures in the main text, 12 figures and 2 tables in the supporting information

arXiv:2312.06142 [pdf]

doi 10.1021/acs.nanolett.3c03500

Self-powered programmable van der Waals photodetectors with nonvolatile semi-floating gate

Authors: Fan Liu, Xi Lin, Yuting Yan, Xuetao Gan, Yingchun Cheng, Xiaoguang Luo

Abstract: Tunable photovoltaic photodetectors are of significant relevance in the fields of programmable and neuromorphic optoelectronics. However, their widespread adoption is hindered by intricate architectural design and energy consumption challenges. This study employs a nonvolatile MoTe2/hBN/graphene semi-floating photodetector to address these issues. Programed with pulsed gate voltage, the MoTe2 chan… ▽ More Tunable photovoltaic photodetectors are of significant relevance in the fields of programmable and neuromorphic optoelectronics. However, their widespread adoption is hindered by intricate architectural design and energy consumption challenges. This study employs a nonvolatile MoTe2/hBN/graphene semi-floating photodetector to address these issues. Programed with pulsed gate voltage, the MoTe2 channel can be reconfigured from an n+-n to a p-n homojunction, and the photocurrent transition changes from negative to positive values. Scanning photocurrent map** reveals that the negative and positive photocurrents are attributed to Schottky junction and p-n homojunction, respectively. In the p-n configuration, the device demonstrates self-driven, linear, rapid response (~3 ms), and broadband sensitivity (from 405 to 1500 nm) for photodetection, with typical performances of responsivity at ~0.5 A/W and detectivity ~1.6*10^12 Jones under 635 nm illumination. These outstanding photodetection capabilities emphasize the potential of the semi-floating photodetector as a pioneering approach for advancing logical and nonvolatile optoelectronics. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: 34 pages, 5 figures in the main text, 12 figures and 1 table in the supporting information

arXiv:2311.02149 [pdf, other]

Detecting Axion Dark Matter with Black Hole Polarimetry

Authors: Xucheng Gan, Lian-Tao Wang, Huangyu Xiao

Abstract: The axion, as a leading dark matter candidate, is the target of many on-going and proposed experimental searches based on its coupling to photons. Ultralight axions that couple to photons can also cause polarization rotation of light which can be probed by cosmic microwave background. In this work, we show that a large axion field is inevitably developed around black holes due to the Bose-Einstein… ▽ More The axion, as a leading dark matter candidate, is the target of many on-going and proposed experimental searches based on its coupling to photons. Ultralight axions that couple to photons can also cause polarization rotation of light which can be probed by cosmic microwave background. In this work, we show that a large axion field is inevitably developed around black holes due to the Bose-Einstein condensation of axions, enhancing the induced birefringence effects. Therefore, we propose measuring the modulation of supermassive black hole imaging polarization angles as a new probe to the axion-photon coupling of axion dark matter. The oscillating axion field around black holes induces polarization rotation on the black hole image, which is detectable and distinguishable from astrophysical effects on the polarization angle, as it exhibits distinctive temporal variability and frequency invariability. We present the range of axion-photon couplings within the axion mass range $10^{-21}-10^{-16}\text{eV}$ that can be probed by the Event Horizon Telescope. The axion parameter space probed by black hole polarimetry will expand with the improvement in sensitivity on the polarization measurement and more black hole polarimetry targets with determined black hole masses. △ Less

Submitted 3 November, 2023; originally announced November 2023.

Comments: 13 pages, 4 appendices, 3 figures

Report number: FERMILAB-PUB-23-635-T

arXiv:2310.01806 [pdf]

Improvement and Enhancement of YOLOv5 Small Target Recognition Based on Multi-module Optimization

Authors: Qingyang Li, Yuchen Li, Hongyi Duan, JiaLiang Kang, Jianan Zhang, Xueqian Gan, Ruotong Xu

Abstract: In this paper, the limitations of YOLOv5s model on small target detection task are deeply studied and improved. The performance of the model is successfully enhanced by introducing GhostNet-based convolutional module, RepGFPN-based Neck module optimization, CA and Transformer's attention mechanism, and loss function improvement using NWD. The experimental results validate the positive impact of th… ▽ More In this paper, the limitations of YOLOv5s model on small target detection task are deeply studied and improved. The performance of the model is successfully enhanced by introducing GhostNet-based convolutional module, RepGFPN-based Neck module optimization, CA and Transformer's attention mechanism, and loss function improvement using NWD. The experimental results validate the positive impact of these improvement strategies on model precision, recall and mAP. In particular, the improved model shows significant superiority in dealing with complex backgrounds and tiny targets in real-world application tests. This study provides an effective optimization strategy for the YOLOv5s model on small target detection, and lays a solid foundation for future related research and applications. △ Less

Submitted 3 October, 2023; originally announced October 2023.

Comments: 8 pages 10 figures

arXiv:2308.08344 [pdf, other]

Graph Out-of-Distribution Generalization with Controllable Data Augmentation

Authors: Bin Lu, Xiaoying Gan, Ze Zhao, Shiyu Liang, Luoyi Fu, Xinbing Wang, Chenghu Zhou

Abstract: Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe \emph{hybrid structure distribution shif… ▽ More Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe \emph{hybrid structure distribution shift} of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose \texttt{OOD-GMixup} to jointly manipulate the training distribution with \emph{controllable data augmentation} in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to irrelevant information. Secondly, we generate virtual samples with perturbation on graph rationale representation domain to obtain potential OOD training samples. Finally, we propose OOD calibration to measure the distribution deviation of virtual samples by leveraging Extreme Value Theory, and further actively control the training distribution by emphasizing the impact of virtual OOD samples. Extensive studies on several real-world datasets on graph classification demonstrate the superiority of our proposed method over state-of-the-art baselines. △ Less

Submitted 16 August, 2023; originally announced August 2023.

Comments: Under review

arXiv:2308.07951 [pdf, other]

Cosmic Millicharge Background and Reheating Probes

Authors: Xucheng Gan, Yu-Dai Tsai

Abstract: We demonstrate that the searches for dark sector particles can provide probes of reheating scenarios, focusing on the cosmic millicharge background produced in the early universe. We discuss two types of millicharge particles (mCPs): either with, or without, an accompanying dark photon. These two types of mCPs have distinct theoretical motivations and cosmological signatures. We discuss constraint… ▽ More We demonstrate that the searches for dark sector particles can provide probes of reheating scenarios, focusing on the cosmic millicharge background produced in the early universe. We discuss two types of millicharge particles (mCPs): either with, or without, an accompanying dark photon. These two types of mCPs have distinct theoretical motivations and cosmological signatures. We discuss constraints from the overproduction and mCP-baryon interactions of the mCP without an accompanying dark photon, with different reheating temperatures. We also consider the $ΔN_{\rm eff}$ constraints on the mCPs from kinetic mixing, varying the reheating temperature. The regions of interest in which the accelerator and other experiments can probe the reheating scenarios are identified in this paper for both scenarios. These probes can potentially allow us to set an upper bound on the reheating temperature down to $\sim 10$ MeV, much lower than the previously considered upper bound from inflationary cosmology at around $\sim 10^{16}$ GeV. In addition, we find parameter regions in which the two mCP scenarios may be differentiated by cosmological considerations. Finally, we discuss the implications of dedicated mCP searches and future CMB-S4 observations. △ Less

Submitted 15 August, 2023; originally announced August 2023.

Comments: 10 pages plus references, 5 figures

Report number: UCI-HEP-TR-2023-05; FERMILAB-PUB-23-428-T-V

arXiv:2308.06974 [pdf]

A One Stop 3D Target Reconstruction and multilevel Segmentation Method

Authors: Jiexiong Xu, Weikun Zhao, Zhiyan Tang, Xiangchao Gan

Abstract: 3D object reconstruction and multilevel segmentation are fundamental to computer vision research. Existing algorithms usually perform 3D scene reconstruction and target objects segmentation independently, and the performance is not fully guaranteed due to the challenge of the 3D segmentation. Here we propose an open-source one stop 3D target reconstruction and multilevel segmentation framework (OS… ▽ More 3D object reconstruction and multilevel segmentation are fundamental to computer vision research. Existing algorithms usually perform 3D scene reconstruction and target objects segmentation independently, and the performance is not fully guaranteed due to the challenge of the 3D segmentation. Here we propose an open-source one stop 3D target reconstruction and multilevel segmentation framework (OSTRA), which performs segmentation on 2D images, tracks multiple instances with segmentation labels in the image sequence, and then reconstructs labelled 3D objects or multiple parts with Multi-View Stereo (MVS) or RGBD-based 3D reconstruction methods. We extend object tracking and 3D reconstruction algorithms to support continuous segmentation labels to leverage the advances in the 2D image segmentation, especially the Segment-Anything Model (SAM) which uses the pretrained neural network without additional training for new scenes, for 3D object segmentation. OSTRA supports most popular 3D object models including point cloud, mesh and voxel, and achieves high performance for semantic segmentation, instance segmentation and part segmentation on several 3D datasets. It even surpasses the manual segmentation in scenes with complex structures and occlusions. Our method opens up a new avenue for reconstructing 3D targets embedded with rich multi-scale segmentation information in complex scenes. OSTRA is available from https://github.com/ganlab/OSTRA. △ Less

Submitted 14 August, 2023; originally announced August 2023.

arXiv:2307.04420 [pdf, ps, other]

FedDCT: A Dynamic Cross-Tier Federated Learning Scheme in Wireless Communication Networks

Authors: Peng Liu, Youquan Xian, Chuanjian Yao, Xiaoyun Gan, Lianghaojie Zhou, Jianyong Jiang, Dongcheng Li

Abstract: With the rapid proliferation of Internet of Things (IoT) devices and the growing concern for data privacy among the public, Federated Learning (FL) has gained significant attention as a privacy-preserving machine learning paradigm. FL enables the training of a global model among clients without exposing local data. However, when a federated learning system runs on wireless communication networks,… ▽ More With the rapid proliferation of Internet of Things (IoT) devices and the growing concern for data privacy among the public, Federated Learning (FL) has gained significant attention as a privacy-preserving machine learning paradigm. FL enables the training of a global model among clients without exposing local data. However, when a federated learning system runs on wireless communication networks, limited wireless resources, heterogeneity of clients, and network transmission failures affect its performance and accuracy. In this study, we propose a novel dynamic cross-tier FL scheme, named FedDCT to increase training accuracy and performance in wireless communication networks. We utilize a tiering algorithm that dynamically divides clients into different tiers according to specific indicators and assigns specific timeout thresholds to each tier to reduce the training time required. To improve the accuracy of the model without increasing the training time, we introduce a cross-tier client selection algorithm that can effectively select the tiers and participants. Simulation experiments show that our scheme can make the model converge faster and achieve a higher accuracy in wireless communication networks. △ Less

Submitted 10 July, 2023; originally announced July 2023.

arXiv:2306.15750 [pdf, ps, other]

On the recursive and explicit form of the general J.C.P. Miller formula with applications

Authors: Dariusz Bugajewski, Dawid Bugajewski, Xiao-Xiong Gan, Piotr Maćkowiak

Abstract: The famous J.C.P. Miller formula provides a recurrence algorithm for the composition $B_a \circ f$, where $B_a$ is the formal binomial series and $f$ is a formal power series, however it requires that $f$ has to be a nonunit. In this paper we provide the general J.C.P. Miller formula which eliminates the requirement of nonunitness of $f$ and, instead, we establish a necessary and sufficient condit… ▽ More The famous J.C.P. Miller formula provides a recurrence algorithm for the composition $B_a \circ f$, where $B_a$ is the formal binomial series and $f$ is a formal power series, however it requires that $f$ has to be a nonunit. In this paper we provide the general J.C.P. Miller formula which eliminates the requirement of nonunitness of $f$ and, instead, we establish a necessary and sufficient condition for the existence of the composition $B_a \circ f$. We also provide the general J.C.P. Miller recurrence algorithm for computing the coefficients of that composition, if $ B_a\circ f$ is well defined, obviously. Our generalizations cover both the case in which $f$ is a one--variable formal power series and the case in which $f$ is a multivariable formal power series. In the central part of this article we state, using some combinatorial techniques, the explicit form of the general J.C.P. Miller formula for one-variable case. As applications of these results we provide an explicit formula for the inverses of polynomials and formal power series for which the inverses exist, obviously. We also use our results to investigation of approximate solution to a differential equation which cannot be solved in an explicit way. △ Less

Submitted 27 June, 2023; originally announced June 2023.

MSC Class: Primary: 05A10; 13F25; 13J05; Secondary: 40A30

arXiv:2305.12144 [pdf, other]

DiffCap: Exploring Continuous Diffusion on Image Captioning

Authors: Yufeng He, Zefan Cai, Xu Gan, Baobao Chang

Abstract: Current image captioning works usually focus on generating descriptions in an autoregressive manner. However, there are limited works that focus on generating descriptions non-autoregressively, which brings more decoding diversity. Inspired by the success of diffusion models on generating natural-looking images, we propose a novel method DiffCap to apply continuous diffusions on image captioning.… ▽ More Current image captioning works usually focus on generating descriptions in an autoregressive manner. However, there are limited works that focus on generating descriptions non-autoregressively, which brings more decoding diversity. Inspired by the success of diffusion models on generating natural-looking images, we propose a novel method DiffCap to apply continuous diffusions on image captioning. Unlike image generation where the output is fixed-size and continuous, image description length varies with discrete tokens. Our method transforms discrete tokens in a natural way and applies continuous diffusion on them to successfully fuse extracted image features for diffusion caption generation. Our experiments on COCO dataset demonstrate that our method uses a much simpler structure to achieve comparable results to the previous non-autoregressive works. Apart from quality, an intriguing property of DiffCap is its high diversity during generation, which is missing from many autoregressive models. We believe our method on fusing multimodal features in diffusion language generation will inspire more researches on multimodal language generation tasks for its simplicity and decoding flexibility. △ Less

Submitted 20 May, 2023; originally announced May 2023.

arXiv:2302.07491 [pdf, other]

doi 10.1109/TNNLS.2024.3386168

Self-Supervised Temporal Graph learning with Temporal and Structural Intensity Alignment

Authors: Meng Liu, Ke Liang, Yawei Zhao, Wenxuan Tu, Sihang Zhou, Xinbiao Gan, Xinwang Liu, Kunlun He

Abstract: Temporal graph learning aims to generate high-quality representations for graph-based tasks with dynamic information, which has recently garnered increasing attention. In contrast to static graphs, temporal graphs are typically organized as node interaction sequences over continuous time rather than an adjacency matrix. Most temporal graph learning methods model current interactions by incorporati… ▽ More Temporal graph learning aims to generate high-quality representations for graph-based tasks with dynamic information, which has recently garnered increasing attention. In contrast to static graphs, temporal graphs are typically organized as node interaction sequences over continuous time rather than an adjacency matrix. Most temporal graph learning methods model current interactions by incorporating historical neighborhood. However, such methods only consider first-order temporal information while disregarding crucial high-order structural information, resulting in suboptimal performance. To address this issue, we propose a self-supervised method called S2T for temporal graph learning, which extracts both temporal and structural information to learn more informative node representations. Notably, the initial node representations combine first-order temporal and high-order structural information differently to calculate two conditional intensities. An alignment loss is then introduced to optimize the node representations, narrowing the gap between the two intensities and making them more informative. Concretely, in addition to modeling temporal information using historical neighbor sequences, we further consider structural knowledge at both local and global levels. At the local level, we generate structural intensity by aggregating features from high-order neighbor sequences. At the global level, a global representation is generated based on all nodes to adjust the structural intensity according to the active statuses on different nodes. Extensive experiments demonstrate that the proposed model S2T achieves at most 10.13% performance improvement compared with the state-of-the-art competitors on several datasets. △ Less

Submitted 28 April, 2024; v1 submitted 15 February, 2023; originally announced February 2023.

arXiv:2302.03056 [pdf, other]

doi 10.1007/JHEP11(2023)031

Cosmologically Varying Kinetic Mixing

Authors: Xucheng Gan, Di Liu

Abstract: The portal connecting the invisible and visible sectors is one of the most natural explanations of the dark world. However, the early-time dark matter production via the portal faces extremely stringent late-time constraints. To solve such tension, we construct the scalar-controlled kinetic mixing varying with the ultralight CP-even scalar's cosmological evolution. To realize this and eliminate th… ▽ More The portal connecting the invisible and visible sectors is one of the most natural explanations of the dark world. However, the early-time dark matter production via the portal faces extremely stringent late-time constraints. To solve such tension, we construct the scalar-controlled kinetic mixing varying with the ultralight CP-even scalar's cosmological evolution. To realize this and eliminate the constant mixing, we couple the ultralight scalar within $10^{-33}\text{eV} \lesssim m_0 \ll \text{eV}$ with the heavy doubly charged messengers and impose the $\mathbb{Z}_2$ symmetry under the dark charge conjugation. Via the varying mixing, the $\text{keV}-\text{MeV}$ dark photon dark matter is produced through the early-time freeze-in when the scalar is misaligned from the origin and free from the late-time exclusions when the scalar does the damped oscillation and dynamically sets the kinetic mixing. We also find that the scalar-photon coupling emerges from the underlying physics, which changes the cosmological history and provides the experimental targets based on the fine-structure constant variation and the equivalence principle violation. To ensure the scalar naturalness, we discretely re-establish the broken shift symmetry by embedding the minimal model into the $\mathbb{Z}_N$-protected model. When $N \sim 10$, the scalar's mass quantum correction can be suppressed much below $10^{-33}\text{eV}$. △ Less

Submitted 8 November, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

Comments: v2: 26 pages, 3 appendices, 10 figures, main text streamlined, references added, results and conclusions unchanged

Report number: DESY-23-013, LAPTH-003/23

Journal ref: JHEP11(2023)031

arXiv:2211.05139 [pdf, other]

doi 10.1007/JHEP05(2023)046

Millicharged Relics Reveal Massless Dark Photons

Authors: Asher Berlin, Jeff A. Dror, Xucheng Gan, Joshua T. Ruderman

Abstract: The detection of massless kinetically-mixed dark photons is notoriously difficult, as the effect of this mixing can be removed by a field redefinition in vacuum. In this work, we study the prospect of detecting massless dark photons in the presence of a cosmic relic directly charged under this dark electromagnetism. Such millicharged particles, in the form of dark matter or dark radiation, generat… ▽ More The detection of massless kinetically-mixed dark photons is notoriously difficult, as the effect of this mixing can be removed by a field redefinition in vacuum. In this work, we study the prospect of detecting massless dark photons in the presence of a cosmic relic directly charged under this dark electromagnetism. Such millicharged particles, in the form of dark matter or dark radiation, generate an effective dark photon mass that drives photon-to-dark photon oscillations in the early universe. We also study the prospect for such models to alleviate existing cosmological constraints on massive dark photons, enlarging the motivation for direct tests of this parameter space using precision terrestrial probes. △ Less

Submitted 7 May, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

Comments: 16 pages, 3 appendices, 6 figures; v2, minor improvements to the text, references added, results and conclusions unchanged

Report number: FERMILAB-PUB-21-724-SQMS-T

Journal ref: JHEP 05 (2023) 046

arXiv:2208.06072 [pdf, ps, other]

Multiple RISs Assisted Cell-Free Networks With Two-timescale CSI: Performance Analysis and System Design

Authors: Xu Gan, Caijun Zhong, Chongwen Huang, Zhaohui Yang, Zhaoyang Zhang

Abstract: Reconfigurable intelligent surface (RIS) can be employed in a cell-free system to create favorable propagation conditions from base stations (BSs) to users via configurable elements. However, prior works on RIS-aided cell-free system designs mainly rely on the instantaneous channel state information (CSI), which may incur substantial overhead due to extremely high dimensions of estimated channels.… ▽ More Reconfigurable intelligent surface (RIS) can be employed in a cell-free system to create favorable propagation conditions from base stations (BSs) to users via configurable elements. However, prior works on RIS-aided cell-free system designs mainly rely on the instantaneous channel state information (CSI), which may incur substantial overhead due to extremely high dimensions of estimated channels. To mitigate this issue, a low-complexity algorithm via the two-timescale transmission protocol is proposed in this paper, where the joint beamforming at BSs and RISs is facilitated via alternating optimization framework to maximize the average weighted sum-rate. Specifically, the passive beamformers at RISs are optimized through the statistical CSI, and the transmit beamformers at BSs are based on the instantaneous CSI of effective channels. In this manner, a closed-form expression for the achievable weighted sum-rate is derived, which enables the evaluation of the impact of key parameters on system performance. To gain more insights, a special case without line-of-sight (LoS) components is further investigated, where a power gain on the order of $\mathcal{O}(M)$ is achieved, with $M$ being the BS antennas number. Numerical results validate the tightness of our derived analytical expression and show the fast convergence of the proposed algorithm. Findings illustrate that the performance of the proposed algorithm with two-timescale CSI is comparable to that with instantaneous CSI in low or moderate SNR regime. The impact of key system parameters such as the number of RIS elements, CSI settings and Rician factor is also evaluated. Moreover, the remarkable advantages from the adoption of the cell-free paradigm and the deployment of RISs are demonstrated intuitively. △ Less

Submitted 11 August, 2022; originally announced August 2022.

Comments: 31 pages, 9 figures

arXiv:2207.08731 [pdf]

Network medicine framework reveals generic herb-symptom effectiveness of Traditional Chinese Medicine

Authors: Xiao Gan, Zixin Shu, Xinyan Wang, Dengying Yan, Jun Li, Shany ofaim, Réka Albert, Xiaodong Li, Baoyan Liu, Xuezhong Zhou, Albert-László Barabási

Abstract: Traditional Chinese medicine (TCM) relies on natural medical products to treat symptoms and diseases. While clinical data have demonstrated the effectiveness of selected TCM-based treatments, the mechanistic root of how TCM herbs treat diseases remains largely unknown. More importantly, current approaches focus on single herbs or prescriptions, missing the high-level general principles of TCM. To… ▽ More Traditional Chinese medicine (TCM) relies on natural medical products to treat symptoms and diseases. While clinical data have demonstrated the effectiveness of selected TCM-based treatments, the mechanistic root of how TCM herbs treat diseases remains largely unknown. More importantly, current approaches focus on single herbs or prescriptions, missing the high-level general principles of TCM. To uncover the mechanistic nature of TCM on a system level, in this work we establish a generic network medicine framework for TCM from the human protein interactome. Applying our framework reveals a network pattern between symptoms (diseases) and herbs in TCM. We first observe that genes associated with a symptom are not distributed randomly in the interactome, but cluster into localized modules; furthermore, a short network distance between two symptom modules is indicative of the symptoms' co-occurrence and similarity. Next, we show that the network proximity of a herb's targets to a symptom module is predictive of the herb's effectiveness in treating the symptom. We validate our framework with real-world hospital patient data by showing that (1) shorter network distance between symptoms of inpatients correlates with higher relative risk (co-occurrence), and (2) herb-symptom network proximity is indicative of patients' symptom recovery rate after herbal treatment. Finally, we identified novel herb-symptom pairs in which the herb's effectiveness in treating the symptom is predicted by network and confirmed in hospital data, but previously unknown to the TCM community. These predictions highlight our framework's potential in creating herb discovery or repurposing opportunities. In conclusion, network medicine offers a powerful novel platform to understand the mechanism of traditional medicine and to predict novel herbal treatment against diseases. △ Less

Submitted 18 July, 2022; originally announced July 2022.

Comments: 25 pages, 4 figures plus 1 table

arXiv:2207.00723 [pdf]

doi 10.1063/5.0093147

High-responsivity MoS$_2$ hot-electron telecom-band photodetector integrated with microring resonator

Authors: Qiao Zhang, Yingke Ji, Siqi Hu, Zhiwen Li, Chen Li, Linpeng Gu, Ruijuan Tian, Jiachen Zhang, Liang Fang, Bijun Zhao, Jianlin Zhao, Xuetao Gan

Abstract: We report a high-responsive hot-electron photodetector based on the integration of an Au-MoS$_2$ junction with a silicon nitride microring resonator (MRR) for detecting telecom-band light. The coupling of the evanescent field of the silicon nitride MRR with the Au-MoS$_2$ Schottky junction region enhances the hot-electron injection efficiency. The device exhibits a high responsivity of 154.6 mA W-… ▽ More We report a high-responsive hot-electron photodetector based on the integration of an Au-MoS$_2$ junction with a silicon nitride microring resonator (MRR) for detecting telecom-band light. The coupling of the evanescent field of the silicon nitride MRR with the Au-MoS$_2$ Schottky junction region enhances the hot-electron injection efficiency. The device exhibits a high responsivity of 154.6 mA W-1 at the wavelength of 1516 nm, and the moderately uniform responsivities are obtained over the wavelength range of 1500 nm-1630 nm. This MRR-enhanced MoS2 hot-electron photodetector offers possibilities for integrated optoelectronic systems. △ Less

Submitted 1 July, 2022; originally announced July 2022.

Comments: 6 pages, 3 figures

arXiv:2206.07971 [pdf]

doi 10.1002/lpor.202100498

Efficient Second Harmonic Generation from Silicon Slotted Nanocubes with Bound States in the Continuum

Authors: C. Fang, Q. Yang, Q. Yuan, L. Gu, X. Gan, Y. Shao, Y. Liu, G. Han, Y. Hao

Abstract: Optical materials with centrosymmetry, such as silicon and germanium, are unfortunately absent of second-order nonlinear optical responses, hindering their developments in efficient nonlinear optical devices. Here, a design with an array of slotted nanocubes is proposed to realize remarkable second harmonic generation (SHG) from the centrosymmetric silicon, which takes advantage of enlarged surfac… ▽ More Optical materials with centrosymmetry, such as silicon and germanium, are unfortunately absent of second-order nonlinear optical responses, hindering their developments in efficient nonlinear optical devices. Here, a design with an array of slotted nanocubes is proposed to realize remarkable second harmonic generation (SHG) from the centrosymmetric silicon, which takes advantage of enlarged surface second-order nonlinearity, strengthened electric field over the surface of the air-slot, as well as the resonance enhancement by the bound states in the continuum. Compared with that from the array of silicon nanocubes without air-slots, SHG from the slotted nanocube array is improved by more than two orders of magnitude. The experimentally measured SHG efficiency of the silicon slotted nanocube array is high as 1.8*10^-4 W^-1, which is expected to be further engineered by modifying the air-slot geometries. Our result could provide a new strategy to expand nonlinear optical effects and devices of centrosymmetric materials. △ Less

Submitted 16 June, 2022; originally announced June 2022.

Comments: 31 pages, 9 figures, 1 Table, 1 TOC

Journal ref: Laser Photonics Reviews 16(5) 2022, 2100498

arXiv:2206.07935 [pdf]

doi 10.29026/oea.2021.200030

High-Q Resonances Governed by the Quasi-Bound States in the Continuum in All-Dielectric Metasurfaces

Authors: C. Fang, Q. Yang, Q. Yuan, X. Gan, J. Zhao, Y. Shao, Y. Liu, G. Han, Y. Hao

Abstract: The realization of high-Q resonances in a silicon metasurface with various broken-symmetry blocks is reported. Theoretical analysis reveals that the sharp resonances in the metasurfaces originate from symmetry-protected bound states in the continuum (BIC) and the magnetic dipole dominates these peculiar states. A smaller size of the defect in the broken-symmetry block gives rise to the resonance w… ▽ More The realization of high-Q resonances in a silicon metasurface with various broken-symmetry blocks is reported. Theoretical analysis reveals that the sharp resonances in the metasurfaces originate from symmetry-protected bound states in the continuum (BIC) and the magnetic dipole dominates these peculiar states. A smaller size of the defect in the broken-symmetry block gives rise to the resonance with a larger Q factor. Importantly, this relationship can be tuned by changing the structural parameter, resulting from the modulation of the topological configuration of BICs. Consequently, a Q factor of more than 3,000 can be easily achieved by optimizing dimensions of the nanostructure. At this sharp resonance, the intensity of the third harmonic generation signal in the patterned structure can be 368 times larger than that of the flat silicon film. The proposed strategy and underlying theory can open up new avenues to realize ultrasharp resonances, which may promote the development of the potential meta-devices for nonlinearity, lasing action, and sensing. △ Less

Submitted 16 June, 2022; originally announced June 2022.

Comments: 12 pages,5 figures

Journal ref: Opto-Electronic Advances 4 (2021) 200030

arXiv:2206.05559 [pdf]

doi 10.1002/adma.202008080

Tunable linearity of high-performance vertical dual-gate vdW phototransistor

Authors: **peng Xu, Xiaoguang Luo, Siqi Hu, Xi Zhang, Dong Mei, Fan Liu, Nannan Han, Dan Liu, Xuetao Gan, Yingchun Cheng, Wei Huang

Abstract: Layered two-dimensional (2D) semiconductors have been widely exploited in photodetectors due to their excellent electronic and optoelectronic properties. To improve their performance, photogating, photoconductive, photovoltaic, photothermoelectric, and other effects have been used in phototransistors and photodiodes made with 2D semiconductors or hybrid structures. However, it is difficult to achi… ▽ More Layered two-dimensional (2D) semiconductors have been widely exploited in photodetectors due to their excellent electronic and optoelectronic properties. To improve their performance, photogating, photoconductive, photovoltaic, photothermoelectric, and other effects have been used in phototransistors and photodiodes made with 2D semiconductors or hybrid structures. However, it is difficult to achieve the desired high responsivity and linear photoresponse simultaneously in a monopolar conduction channel or a p-n junction. Here we present dual-channel conduction with ambipolar multilayer WSe2 by employing the device concept of dual-gate phototransistor, where p-type and n-type channels are produced in the same semiconductor using opposite dual-gating. It is possible to tune the photoconductive gain using a vertical electric field, so that the gain is constant with respect to the light intensity-a linear photoresponse, with a high responsivity of ~2.5*10^4 A/W. Additionally, the 1/f noise of the device is kept at a low level under the opposite dual-gating due to the reduction of current and carrier fluctuation, resulting in a high detectivity of ~2*10^13 Jones in the linear photoresponse regime. The linear photoresponse and high performance of our dual-gate WSe2 phototransistor offer the possibility of achieving high-resolution and quantitative light detection with layered 2D semiconductors. △ Less

Submitted 11 June, 2022; originally announced June 2022.

Comments: 29 pages, 4 figures

Journal ref: Adv. Mater. 33(15), 2008080, 2021

arXiv:2206.03308 [pdf]

Chip-integrated van der Waals PN heterojunction photodetector with low dark current and high responsivity

Authors: Ruijuan Tian, Xuetao Gan, Chen Li, Xiaoqing Chen, Siqi Hu, Linpeng Gu, Dries Van Thourhout, Andres Castellanos-Gomez, Zhipei Sun, Jianlin Zhao

Abstract: Two-dimensional materials are attractive for constructing high-performance photonic chip-integrated photodetectors because of their remarkable electronic and optical properties and dangling-bond-free surfaces. However, the reported chip-integrated two-dimensional material photodetectors were mainly implemented with the configuration of metal-semiconductor-metal, suffering from high dark currents a… ▽ More Two-dimensional materials are attractive for constructing high-performance photonic chip-integrated photodetectors because of their remarkable electronic and optical properties and dangling-bond-free surfaces. However, the reported chip-integrated two-dimensional material photodetectors were mainly implemented with the configuration of metal-semiconductor-metal, suffering from high dark currents and low responsivities at high operation speed. Here, we report a van der Waals PN heterojunction photodetector, composed of p-type black phosphorous and n-type molybdenum telluride, integrated on a silicon nitride waveguide. The built-in electric field of the PN heterojunction significantly suppresses the dark current and improves the responsivity. Under a bias of 1 V pointing from n-type molybdenum telluride to p-type black phosphorous, the dark current is lower than 7 nA, which is more than two orders of magnitude lower than those reported in other waveguide-integrated black phosphorus photodetectors. An intrinsic responsivity up to 577 mA/W is obtained. Remarkably, the van der Waals PN heterojunction is tunable by the electrostatic do** to further engineer its rectification and improve the photodetection, enabling an increased responsivity of 709 mA/W. Besides, the heterojunction photodetector exhibits a response bandwidth of ~1.0 GHz and a uniform photodetection over a wide spectral range, as experimentally measured from 1500 to 1630 nm. The demonstrated chip-integrated van der Waals PN heterojunction photodetector with low dark current, high responsivity and fast response has great potentials to develop high-performance on-chip photodetectors for various photonic integrated circuits based on silicon, lithium niobate, polymer, etc. △ Less

Submitted 7 June, 2022; originally announced June 2022.

arXiv:2206.03240 [pdf]

doi 10.1021/acsnano.2c00514

Electrically tunable second harmonic generation in atomically thin ReS2

Authors: **g Wang, Nannan Han, Zheng-Dong Luo, Mingwen Zhang, Xiaoqing Chen, Yan Liu, Yue Hao, Jianlin Zhao, Xuetao Gan

Abstract: Electrical tuning of second-order nonlinearity in optical materials is attractive to strengthen and expand the functionalities of nonlinear optical technologies, though its implementation remains elusive. Here, we report the electrically tunable second-order nonlinearity in atomically thin ReS2 flakes benefiting from their distorted 1T crystal structure and interlayer charge transfer. Enabled by t… ▽ More Electrical tuning of second-order nonlinearity in optical materials is attractive to strengthen and expand the functionalities of nonlinear optical technologies, though its implementation remains elusive. Here, we report the electrically tunable second-order nonlinearity in atomically thin ReS2 flakes benefiting from their distorted 1T crystal structure and interlayer charge transfer. Enabled by the efficient electrostatic control of the few-atomic-layer ReS2, we show that second harmonic generation (SHG) can be induced in odd-number-layered ReS2 flakes which are centrosymmetric and thus without intrinsic SHG. Moreover, the SHG can be precisely modulated by the electric field, reversibly switching from almost zero to an amplitude more than one order of magnitude stronger than that of the monolayer MoS2. For the even-number-layered ReS2 flakes with the intrinsic SHG, the external electric field could be leveraged to enhance the SHG. We further perform the first-principles calculations which suggest that the modification of in-plane second-order hyperpolarizability by the redistributed interlayer-transferring charges in the distorted 1T crystal structure underlies the electrically tunable SHG in ReS2. With its active SHG tunability while using the facile electrostatic control, our work may further expand the nonlinear optoelectronic functions of two-dimensional materials for develo** electrically controllable nonlinear optoelectronic devices. △ Less

Submitted 7 June, 2022; originally announced June 2022.

arXiv:2206.03143 [pdf, other]

doi 10.1021/acsphotonics.2c00038

High-efficiency second-harmonic and sum-frequency generation in a silicon nitride microring integrated with few-layer GaSe

Authors: Binbin Wang, Yafei Ji, Linpeng Gu, Liang Fang, Xuetao Gan, Jianlin Zhao

Abstract: Silicon nitride (SiN) photonics platform has attributes of ultra-low linear and nonlinear propagation losses and CMOS-compatible fabrication process, promising large-scale multifunctional photonic circuits. However, the centrosymmetric nature of SiN inhibits second-order nonlinear optical responses in its photonics platform, which is desirable for develo** efficient nonlinear active devices. Her… ▽ More Silicon nitride (SiN) photonics platform has attributes of ultra-low linear and nonlinear propagation losses and CMOS-compatible fabrication process, promising large-scale multifunctional photonic circuits. However, the centrosymmetric nature of SiN inhibits second-order nonlinear optical responses in its photonics platform, which is desirable for develo** efficient nonlinear active devices. Here, we demonstrate high-efficiency second-order nonlinear processes in SiN photonics platform by integrating a few-layer GaSe flake on a SiN microring resonator. With the pump of microwatts continuous-wave lasers, second-harmonic generation and sum-frequency generation with the conversion efficiencies of 849%/W and 123%/W, respectively, are achieved, which benefit from the ultrahigh second-order nonlinear susceptibility of GaSe, resonance enhanced GaSe-light interaction, and phase-matching condition satisfied by the mode engineering. Combining with the easy integration, the GaSe-assisted high-efficiency second-order nonlinear processes offer a new route to enriching already strong functionality of SiN photonics platform in nonlinear optics. △ Less

Submitted 7 June, 2022; originally announced June 2022.

Comments: 23 pages, 4 figures

Journal ref: ACS Photonics 2022, 9, 1671-1678

arXiv:2206.03078 [pdf]

doi 10.1021/acs.nanolett.1c04359

Strong Second Harmonic Generation from Bilayer Graphene with Symmetry Breaking by Redox-Governed Charge Do**

Authors: Mingwen Zhang, Nannan Han, **g Wang, Zhihong Zhang, Kaihui Liu, Zhipei Sun, Jianlin Zhao, Xuetao Gan

Abstract: Missing second-order nonlinearity in centrosymmetric graphene overshadows its intriguing optical attribute. Here, we report redox-governed charge do** could effectively break the centrosymmetry of bilayer graphene (BLG), enabling a strong second harmonic generation (SHG) with a strength close to that of the well-known monolayer MoS2. Verified from control experiments with in situ electrical curr… ▽ More Missing second-order nonlinearity in centrosymmetric graphene overshadows its intriguing optical attribute. Here, we report redox-governed charge do** could effectively break the centrosymmetry of bilayer graphene (BLG), enabling a strong second harmonic generation (SHG) with a strength close to that of the well-known monolayer MoS2. Verified from control experiments with in situ electrical current annealing and electrically gate-controlled SHG, the required centrosymmetry breaking of the emerging SHG arises from the charge-do** on the bottom layer of BLG by the oxygen/water redox couple. Our results not only reveal that charge do** is an effective way to break the inversion symmetry of BLG despite its strong interlayer coupling but also indicate that SHG spectroscopy is a valid technique to probe molecular do** on two-dimensional materials. △ Less

Submitted 7 June, 2022; originally announced June 2022.

arXiv:2205.13954 [pdf, other]

doi 10.1145/3534678.3539280

Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation

Authors: Bin Lu, Xiaoying Gan, Lina Yang, Weinan Zhang, Luoyi Fu, Xinbing Wang

Abstract: With the tremendous expansion of graphs data, node classification shows its great importance in many real-world applications. Existing graph neural network based methods mainly focus on classifying unlabeled nodes within fixed classes with abundant labeling. However, in many practical scenarios, graph evolves with emergence of new nodes and edges. Novel classes appear incrementally along with few… ▽ More With the tremendous expansion of graphs data, node classification shows its great importance in many real-world applications. Existing graph neural network based methods mainly focus on classifying unlabeled nodes within fixed classes with abundant labeling. However, in many practical scenarios, graph evolves with emergence of new nodes and edges. Novel classes appear incrementally along with few labeling due to its newly emergence or lack of exploration. In this paper, we focus on this challenging but practical graph few-shot class-incremental learning (GFSCIL) problem and propose a novel method called Geometer. Instead of replacing and retraining the fully connected neural network classifer, Geometer predicts the label of a node by finding the nearest class prototype. Prototype is a vector representing a class in the metric space. With the pop-up of novel classes, Geometer learns and adjusts the attention-based prototypes by observing the geometric proximity, uniformity and separability. Teacher-student knowledge distillation and biased sampling are further introduced to mitigate catastrophic forgetting and unbalanced labeling problem respectively. Experimental results on four public datasets demonstrate that Geometer achieves a substantial improvement of 9.46% to 27.60% over state-of-the-art methods. △ Less

Submitted 3 June, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

Comments: Accepted to KDD2022

arXiv:2205.13947 [pdf, other]

doi 10.1145/3534678.3539281

Spatio-Temporal Graph Few-Shot Learning with Cross-City Knowledge Transfer

Authors: Bin Lu, Xiaoying Gan, Weinan Zhang, Huaxiu Yao, Luoyi Fu, Xinbing Wang

Abstract: Spatio-temporal graph learning is a key method for urban computing tasks, such as traffic flow, taxi demand and air quality forecasting. Due to the high cost of data collection, some develo** cities have few available data, which makes it infeasible to train a well-performed model. To address this challenge, cross-city knowledge transfer has shown its promise, where the model learned from data-s… ▽ More Spatio-temporal graph learning is a key method for urban computing tasks, such as traffic flow, taxi demand and air quality forecasting. Due to the high cost of data collection, some develo** cities have few available data, which makes it infeasible to train a well-performed model. To address this challenge, cross-city knowledge transfer has shown its promise, where the model learned from data-sufficient cities is leveraged to benefit the learning process of data-scarce cities. However, the spatio-temporal graphs among different cities show irregular structures and varied features, which limits the feasibility of existing Few-Shot Learning (\emph{FSL}) methods. Therefore, we propose a model-agnostic few-shot learning framework for spatio-temporal graph called ST-GFSL. Specifically, to enhance feature extraction by transfering cross-city knowledge, ST-GFSL proposes to generate non-shared parameters based on node-level meta knowledge. The nodes in target city transfer the knowledge via parameter matching, retrieving from similar spatio-temporal characteristics. Furthermore, we propose to reconstruct the graph structure during meta-learning. The graph reconstruction loss is defined to guide structure-aware learning, avoiding structure deviation among different datasets. We conduct comprehensive experiments on four traffic speed prediction benchmarks and the results demonstrate the effectiveness of ST-GFSL compared with state-of-the-art methods. △ Less

Submitted 3 June, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

Comments: Accepted to KDD2022

arXiv:2205.12144 [pdf, other]

doi 10.1145/3531013

Optimizing Performance of Federated Person Re-identification: Benchmarking and Analysis

Authors: Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang

Abstract: The increasingly stringent data privacy regulations limit the development of person re-identification (ReID) because person ReID training requires centralizing an enormous amount of data that contains sensitive personal information. To address this problem, we introduce federated person re-identification (FedReID) -- implementing federated learning, an emerging distributed training method, to pers… ▽ More The increasingly stringent data privacy regulations limit the development of person re-identification (ReID) because person ReID training requires centralizing an enormous amount of data that contains sensitive personal information. To address this problem, we introduce federated person re-identification (FedReID) -- implementing federated learning, an emerging distributed training method, to person ReID. FedReID preserves data privacy by aggregating model updates, instead of raw data, from clients to a central server. Furthermore, we optimize the performance of FedReID under statistical heterogeneity via benchmark analysis. We first construct a benchmark with an enhanced algorithm, two architectures, and nine person ReID datasets with large variances to simulate the real-world statistical heterogeneity. The benchmark results present insights and bottlenecks of FedReID under statistical heterogeneity, including challenges in convergence and poor performance on datasets with large volumes. Based on these insights, we propose three optimization approaches: (1) We adopt knowledge distillation to facilitate the convergence of FedReID by better transferring knowledge from clients to the server; (2) We introduce client clustering to improve the performance of large datasets by aggregating clients with similar data distributions; (3) We propose cosine distance weight to elevate performance by dynamically updating the weights for aggregation depending on how well models are trained in clients. Extensive experiments demonstrate that these approaches achieve satisfying convergence with much better performance on all datasets. We believe that FedReID will shed light on implementing and optimizing federated learning on more computer vision applications. △ Less

Submitted 24 May, 2022; originally announced May 2022.

Comments: TOMM

arXiv:2204.04382 [pdf, other]

Federated Unsupervised Domain Adaptation for Face Recognition

Authors: Weiming Zhuang, Xin Gan, Yonggang Wen, Xuesen Zhang, Shuai Zhang, Shuai Yi

Abstract: Given labeled data in a source domain, unsupervised domain adaptation has been widely adopted to generalize models for unlabeled data in a target domain, whose data distributions are different. However, existing works are inapplicable to face recognition under privacy constraints because they require sharing of sensitive face images between domains. To address this problem, we propose federated un… ▽ More Given labeled data in a source domain, unsupervised domain adaptation has been widely adopted to generalize models for unlabeled data in a target domain, whose data distributions are different. However, existing works are inapplicable to face recognition under privacy constraints because they require sharing of sensitive face images between domains. To address this problem, we propose federated unsupervised domain adaptation for face recognition, FedFR. FedFR jointly optimizes clustering-based domain adaptation and federated learning to elevate performance on the target domain. Specifically, for unlabeled data in the target domain, we enhance a clustering algorithm with distance constrain to improve the quality of predicted pseudo labels. Besides, we propose a new domain constraint loss (DCL) to regularize source domain training in federated learning. Extensive experiments on a newly constructed benchmark demonstrate that FedFR outperforms the baseline and classic methods on the target domain by 3% to 14% on different evaluation metrics. △ Less

Submitted 9 April, 2022; originally announced April 2022.

Comments: ICME'22. arXiv admin note: substantial text overlap with arXiv:2105.07606

arXiv:2112.01162 [pdf]

doi 10.3390/ma14237330

Exciting Magnetic Dipole Mode of Split Ring Plasmonic Nano Resonator by Photonic Crystal Nanocavity

Authors: Yingke Ji, Binbin Wang, Liang Fang, Qiang Zhao, Fajun Xiao, Xuetao Gan

Abstract: On chip exciting electric modes in individual plasmonic nanostructures are realized widely; nevertheless, the excitation of their magnetic counterparts is seldom reported. Here, we propose a highly efficient on chip excitation approach of the magnetic dipole mode of an individual split ring resonator (SRR) by integrating it onto a photonic crystal nanocavity (PCNC). A high excitation efficiency of… ▽ More On chip exciting electric modes in individual plasmonic nanostructures are realized widely; nevertheless, the excitation of their magnetic counterparts is seldom reported. Here, we propose a highly efficient on chip excitation approach of the magnetic dipole mode of an individual split ring resonator (SRR) by integrating it onto a photonic crystal nanocavity (PCNC). A high excitation efficiency of up to 58% is realized through the resonant coupling between the modes of the SRR and PCNC. A further fine adjustment of the excited magnetic dipole mode is demonstrated by tuning the relative position and twist angle between the SRR and PCNC. Finally, a structure with a photonic crystal waveguide side coupled with the hybrid SRR PCNC is illustrated, which could excite the magnetic dipole mode with an in plane coupling geometry and potentially facilitate the future device application. Our result may open a way for develo** chip integrated photonic devices employing a magnetic field component in the optical field. △ Less

Submitted 2 December, 2021; originally announced December 2021.

arXiv:2111.10576 [pdf]

doi 10.1021/acs.nanolett.2c01852

Contact conductance governs metallicity in conducting metal oxide nanocrystal films

Authors: Corey M. Staller, Stephen L. Gibbs, Xing Yee Gan, Jay T. Bender, Karalee Jarvis, Gary K. Ong, Delia J. Milliron

Abstract: In bulk semiconductor materials, the insulator-metal transition (IMT) is governed by the concentration of conduction electrons. Meanwhile, even when fabricated from metallic building blocks, nanocrystal films are often insulating with inter-nanocrystal contacts acting as electron transport bottlenecks. Using a library of transparent conducting tin-doped indium oxide nanocrystal films with varied e… ▽ More In bulk semiconductor materials, the insulator-metal transition (IMT) is governed by the concentration of conduction electrons. Meanwhile, even when fabricated from metallic building blocks, nanocrystal films are often insulating with inter-nanocrystal contacts acting as electron transport bottlenecks. Using a library of transparent conducting tin-doped indium oxide nanocrystal films with varied electron concentration, size, and contact area, we test candidate criteria for the IMT and establish a phase diagram for electron transport behavior. From variable temperature conductivity measurements, we learn that both the IMT and a subsequent crossover to conventional metallic behavior near room temperature are governed by the conductance of the inter-nanocrystal contacts. To cross the IMT, inter-nanocrystal coupling must be sufficient to overcome the charging energy of a nanocrystal, while conventional metallic behavior requires contact conductance to reach the conductance of a nanocrystal. This understanding can enable the design and fabrication of metallic conducting materials from nanocrystal building blocks. △ Less

Submitted 15 February, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

Comments: 11 pages, 4 figures

arXiv:2111.05670 [pdf, other]

DeCOM: Decomposed Policy for Constrained Cooperative Multi-Agent Reinforcement Learning

Authors: Zhaoxing Yang, Rong Ding, Haiming **, Yifei Wei, Haoyi You, Guiyun Fan, Xiaoying Gan, Xinbing Wang

Abstract: In recent years, multi-agent reinforcement learning (MARL) has presented impressive performance in various applications. However, physical limitations, budget restrictions, and many other factors usually impose \textit{constraints} on a multi-agent system (MAS), which cannot be handled by traditional MARL frameworks. Specifically, this paper focuses on constrained MASes where agents work \textit{c… ▽ More In recent years, multi-agent reinforcement learning (MARL) has presented impressive performance in various applications. However, physical limitations, budget restrictions, and many other factors usually impose \textit{constraints} on a multi-agent system (MAS), which cannot be handled by traditional MARL frameworks. Specifically, this paper focuses on constrained MASes where agents work \textit{cooperatively} to maximize the expected team-average return under various constraints on expected team-average costs, and develops a \textit{constrained cooperative MARL} framework, named DeCOM, for such MASes. In particular, DeCOM decomposes the policy of each agent into two modules, which empowers information sharing among agents to achieve better cooperation. In addition, with such modularization, the training algorithm of DeCOM separates the original constrained optimization into an unconstrained optimization on reward and a constraints satisfaction problem on costs. DeCOM then iteratively solves these problems in a computationally efficient manner, which makes DeCOM highly scalable. We also provide theoretical guarantees on the convergence of DeCOM's policy update algorithm. Finally, we validate the effectiveness of DeCOM with various types of costs in both toy and large-scale (with 500 agents) environments. △ Less

Submitted 10 November, 2021; originally announced November 2021.

Comments: 25 pages

arXiv:2111.03459 [pdf, other]

ProSTformer: Pre-trained Progressive Space-Time Self-attention Model for Traffic Flow Forecasting

Authors: Xiao Yan, Xianghua Gan, **g**g Tang, Rui Wang

Abstract: Traffic flow forecasting is essential and challenging to intelligent city management and public safety. Recent studies have shown the potential of convolution-free Transformer approach to extract the dynamic dependencies among complex influencing factors. However, two issues prevent the approach from being effectively applied in traffic flow forecasting. First, it ignores the spatiotemporal struct… ▽ More Traffic flow forecasting is essential and challenging to intelligent city management and public safety. Recent studies have shown the potential of convolution-free Transformer approach to extract the dynamic dependencies among complex influencing factors. However, two issues prevent the approach from being effectively applied in traffic flow forecasting. First, it ignores the spatiotemporal structure of the traffic flow videos. Second, for a long sequence, it is hard to focus on crucial attention due to the quadratic times dot-product computation. To address the two issues, we first factorize the dependencies and then design a progressive space-time self-attention mechanism named ProSTformer. It has two distinctive characteristics: (1) corresponding to the factorization, the self-attention mechanism progressively focuses on spatial dependence from local to global regions, on temporal dependence from inside to outside fragment (i.e., closeness, period, and trend), and finally on external dependence such as weather, temperature, and day-of-week; (2) by incorporating the spatiotemporal structure into the self-attention mechanism, each block in ProSTformer highlights the unique dependence by aggregating the regions with spatiotemporal positions to significantly decrease the computation. We evaluate ProSTformer on two traffic datasets, and each dataset includes three separate datasets with big, medium, and small scales. Despite the radically different design compared to the convolutional architectures for traffic flow forecasting, ProSTformer performs better or the same on the big scale datasets than six state-of-the-art baseline methods by RMSE. When pre-trained on the big scale datasets and transferred to the medium and small scale datasets, ProSTformer achieves a significant enhancement and behaves best. △ Less

Submitted 3 November, 2021; originally announced November 2021.

arXiv:2108.06492 [pdf, other]

Collaborative Unsupervised Visual Representation Learning from Decentralized Data

Authors: Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang, Shuai Yi

Abstract: Unsupervised representation learning has achieved outstanding performances using centralized data available on the Internet. However, the increasing awareness of privacy protection limits sharing of decentralized unlabeled image data that grows explosively in multiple parties (e.g., mobile phones and cameras). As such, a natural problem is how to leverage these data to learn visual representations… ▽ More Unsupervised representation learning has achieved outstanding performances using centralized data available on the Internet. However, the increasing awareness of privacy protection limits sharing of decentralized unlabeled image data that grows explosively in multiple parties (e.g., mobile phones and cameras). As such, a natural problem is how to leverage these data to learn visual representations for downstream tasks while preserving data privacy. To address this problem, we propose a novel federated unsupervised learning framework, FedU. In this framework, each party trains models from unlabeled data independently using contrastive learning with an online network and a target network. Then, a central server aggregates trained models and updates clients' models with the aggregated model. It preserves data privacy as each party only has access to its raw data. Decentralized data among multiple parties are normally non-independent and identically distributed (non-IID), leading to performance degradation. To tackle this challenge, we propose two simple but effective methods: 1) We design the communication protocol to upload only the encoders of online networks for server aggregation and update them with the aggregated encoder; 2) We introduce a new module to dynamically decide how to update predictors based on the divergence caused by non-IID. The predictor is the other component of the online network. Extensive experiments and ablations demonstrate the effectiveness and significance of FedU. It outperforms training with only one party by over 5% and other methods by over 14% in linear and semi-supervised evaluation on non-IID data. △ Less

Submitted 14 August, 2021; originally announced August 2021.

Comments: ICCV'21

arXiv:2108.01002 [pdf]

Wood-leaf classification of tree point cloud based on intensity and geometrical information

Authors: **gqian Sun, Pei Wang, Zhiyong Gao, Zichu Liu, Yaxin Li, Xiaozheng Gan

Abstract: Terrestrial laser scanning (TLS) can obtain tree point cloud with high precision and high density. Efficient classification of wood points and leaf points is essential to study tree structural parameters and ecological characteristics. By using both the intensity and spatial information, a three-step classification and verification method was proposed to achieve automated wood-leaf classification.… ▽ More Terrestrial laser scanning (TLS) can obtain tree point cloud with high precision and high density. Efficient classification of wood points and leaf points is essential to study tree structural parameters and ecological characteristics. By using both the intensity and spatial information, a three-step classification and verification method was proposed to achieve automated wood-leaf classification. Tree point cloud was classified into wood points and leaf points by using intensity threshold, neighborhood density and voxelization successively. Experiment was carried in Haidian Park, Bei**g, and 24 trees were scanned by using the RIEGL VZ-400 scanner. The tree point clouds were processed by using the proposed method, whose classification results were compared with the manual classification results which were used as standard results. To evaluate the classification accuracy, three indicators were used in the experiment, which are Overall Accuracy (OA), Kappa coefficient (Kappa) and Matthews correlation coefficient (MCC). The ranges of OA, Kappa and MCC of the proposed method are from 0.9167 to 0.9872, from 0.7276 to 0.9191, and from 0.7544 to 0.9211 respectively. The average values of OA, Kappa and MCC are 0.9550, 0.8547 and 0.8627 respectively. Time cost of wood-leaf classification was also recorded to evaluate the algorithm efficiency. The average processing time are 1.4 seconds per million points. The results showed that the proposed method performed well automatically and quickly on wood-leaf classification based on the experimental dataset. △ Less

Submitted 2 August, 2021; originally announced August 2021.

arXiv:2108.00181 [pdf, other]

doi 10.1063/5.0060007

Fano resonance from a one-dimensional topological photonic crystal

Authors: Linpeng Gu, Binbin Wang, Qingchen Yuan, Liang Fang, Qiang Zhao, Xuetao Gan, Jianlin Zhao

Abstract: An ultra-compact one-dimensional topological photonic crystal (1D-TPC) is designed in a single mode silicon bus-waveguide to generate Fano resonance lineshape. The Fano resonance comes from the interference between the discrete topological boundary state of the 1D-TPC and the continuum high-order leaky mode of the bus-waveguide. Standalone asymmetric Fano resonance lineshapes are obtained experime… ▽ More An ultra-compact one-dimensional topological photonic crystal (1D-TPC) is designed in a single mode silicon bus-waveguide to generate Fano resonance lineshape. The Fano resonance comes from the interference between the discrete topological boundary state of the 1D-TPC and the continuum high-order leaky mode of the bus-waveguide. Standalone asymmetric Fano resonance lineshapes are obtained experimentally in the waveguide transmission spectrum with a maximum extinction ratio of 33 dB and a slope ratio of 10 dB/nm over a broadband flat background. △ Less

Submitted 31 July, 2021; originally announced August 2021.

arXiv:2105.12323 [pdf]

doi 10.1021/acsnano.1c02425

Ultralow Threshold, Single-Mode InGaAs/GaAs Multi-Quantum Disk Nanowire Lasers

Authors: Xutao Zhang, Ruixuan Yi, Nikita Gagrani, Ziyuan Li, Fanlu Zhang, Xuetao Gan, Xiaomei Yao, Xiaoming Yuan, Naiyin Wang, Jianlin Zhao, **** Chen, Wei Lu, Lan Fu, Hark Hoe Tan, Chennupati Jagadish

Abstract: We present single-mode nanowire (NW) lasers with ultralow threshold in the near-infrared spectral range. To ensure the single-mode operation, the NW diameter and length are reduced specifically to minimize the longitudinal and transverse modes of the NW cavity. Increased optical losses and reduced gain volume by the dimension reduction are compensated by excellent NW morphology and InGaAs/GaAs mul… ▽ More We present single-mode nanowire (NW) lasers with ultralow threshold in the near-infrared spectral range. To ensure the single-mode operation, the NW diameter and length are reduced specifically to minimize the longitudinal and transverse modes of the NW cavity. Increased optical losses and reduced gain volume by the dimension reduction are compensated by excellent NW morphology and InGaAs/GaAs multi-quantum disks. At 5 K, a threshold low as 1.6 μJ/cm2 per pulse is achieved with a resulting quality factor exceeding 6400. By further passivating the NW with an AlGaAs shell to suppress surface non-radiative recombination, single-mode lasing operation is obtained with a threshold of only 48 μJ/cm2 per pulse at room temperature with a high characteristic temperature of 223 K and power output of ~ 0.9 μW. These single-mode, ultralow threshold, high power output NW lasers are promising for the development of near-infrared nanoscale coherent light sources for integrated photonic circuits, sensing, and spectroscopy. △ Less

Submitted 26 May, 2021; originally announced May 2021.

Journal ref: ACS Nano, 2021, 5, 5, 9126-9133

arXiv:2105.09735 [pdf]

Giant Enhancement of Nonlinear Harmonic Generation in a Silicon Topological Photonic Crystal Nanocavity Chain

Authors: Qingchen Yuan, Linpeng Gu, Liang Fang, Xuetao Gan, Zhigang Chen, Jianlin Zhao

Abstract: Strongly enhanced third-harmonic generation (THG) by the topological localization of an edge mode in a Su-Schrieffer-Heeger (SSH) chain of silicon photonic crystal nanocavities is demonstrated. The edge mode of the nanocavity chain not only naturally inherits resonant properties of the single nanocavity, but also exhibits the topological feature with mode robustness extending well beyond individua… ▽ More Strongly enhanced third-harmonic generation (THG) by the topological localization of an edge mode in a Su-Schrieffer-Heeger (SSH) chain of silicon photonic crystal nanocavities is demonstrated. The edge mode of the nanocavity chain not only naturally inherits resonant properties of the single nanocavity, but also exhibits the topological feature with mode robustness extending well beyond individual nanocavity. By engineering the SSH nanocavities with alternating strong and weak coupling strengths on a silicon slab, we observe the edge mode formation that entails a THG signal with three orders of magnitude enhancement compared with that in a trivial SSH structure. Our results indicate that the photonic crystal nanocavity chain could provide a promising on-chip platform for topology-driven nonlinear photonics. △ Less

Submitted 20 May, 2021; originally announced May 2021.

Comments: 18 pages, 4 figures

arXiv:2105.07606 [pdf, other]

Towards Unsupervised Domain Adaptation for Deep Face Recognition under Privacy Constraints via Federated Learning

Authors: Weiming Zhuang, Xin Gan, Yonggang Wen, Xuesen Zhang, Shuai Zhang, Shuai Yi

Abstract: Unsupervised domain adaptation has been widely adopted to generalize models for unlabeled data in a target domain, given labeled data in a source domain, whose data distributions differ from the target domain. However, existing works are inapplicable to face recognition under privacy constraints because they require sharing sensitive face images between two domains. To address this problem, we pro… ▽ More Unsupervised domain adaptation has been widely adopted to generalize models for unlabeled data in a target domain, given labeled data in a source domain, whose data distributions differ from the target domain. However, existing works are inapplicable to face recognition under privacy constraints because they require sharing sensitive face images between two domains. To address this problem, we propose a novel unsupervised federated face recognition approach (FedFR). FedFR improves the performance in the target domain by iteratively aggregating knowledge from the source domain through federated learning. It protects data privacy by transferring models instead of raw data between domains. Besides, we propose a new domain constraint loss (DCL) to regularize source domain training. DCL suppresses the data volume dominance of the source domain. We also enhance a hierarchical clustering algorithm to predict pseudo labels for the unlabeled target domain accurately. To this end, FedFR forms an end-to-end training pipeline: (1) pre-train in the source domain; (2) predict pseudo labels by clustering in the target domain; (3) conduct domain-constrained federated learning across two domains. Extensive experiments and analysis on two newly constructed benchmarks demonstrate the effectiveness of FedFR. It outperforms the baseline and classic methods in the target domain by over 4% on the more realistic benchmark. We believe that FedFR will shed light on applying federated learning to more computer vision tasks under privacy constraints. △ Less

Submitted 17 May, 2021; originally announced May 2021.

arXiv:2105.07603 [pdf, other]

doi 10.1109/JIOT.2022.3143842

EasyFL: A Low-code Federated Learning Platform For Dummies

Authors: Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang

Abstract: Academia and industry have developed several platforms to support the popular privacy-preserving distributed learning method -- Federated Learning (FL). However, these platforms are complex to use and require a deep understanding of FL, which imposes high barriers to entry for beginners, limits the productivity of researchers, and compromises deployment efficiency. In this paper, we propose the fi… ▽ More Academia and industry have developed several platforms to support the popular privacy-preserving distributed learning method -- Federated Learning (FL). However, these platforms are complex to use and require a deep understanding of FL, which imposes high barriers to entry for beginners, limits the productivity of researchers, and compromises deployment efficiency. In this paper, we propose the first low-code FL platform, EasyFL, to enable users with various levels of expertise to experiment and prototype FL applications with little coding. We achieve this goal while ensuring great flexibility and extensibility for customization by unifying simple API design, modular design, and granular training flow abstraction. With only a few lines of code, EasyFL empowers them with many out-of-the-box functionalities to accelerate experimentation and deployment. These practical functionalities are heterogeneity simulation, comprehensive tracking, distributed training optimization, and seamless deployment. They are proposed based on challenges identified in the proposed FL life cycle. Compared with other platforms, EasyFL not only requires just three lines of code (at least 10x lesser) to build a vanilla FL application but also incurs lower training overhead. Besides, our evaluations demonstrate that EasyFL expedites distributed training by 1.5x. It also improves the efficiency of deployment. We believe that EasyFL will increase the productivity of researchers and democratize FL to wider audiences. △ Less

Submitted 19 January, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

Journal ref: IEEE Internet of Things Journal (Early Access) 2022

arXiv:2105.07171 [pdf, other]

doi 10.1109/JLT.2021.3082558

A topological photonic ring-resonator for on-chip channel filters

Authors: Linpeng Gu, Qingchen Yuan, Qiang Zhao, Yafei Ji, Ziyu Liu, Liang Fang, Xuetao Gan, Jianlin Zhao

Abstract: A topologically protected ring-resonator formed in valley photonic crystals is proposed and fabricated on a silicon slab. The unidirectional transmission and robustness against structure defects of its resonant modes are illustrated. Coupled with topological waveguides, the topological ring is functioned as notch and channel-drop filters. The work opens up a new avenue for develo** advanced chip… ▽ More A topologically protected ring-resonator formed in valley photonic crystals is proposed and fabricated on a silicon slab. The unidirectional transmission and robustness against structure defects of its resonant modes are illustrated. Coupled with topological waveguides, the topological ring is functioned as notch and channel-drop filters. The work opens up a new avenue for develo** advanced chip-integrated photonic circuits with attributes of topological photonics. △ Less

Submitted 15 May, 2021; originally announced May 2021.

Showing 1–50 of 84 results for author: Gan, X