Skip to main content

Showing 1–50 of 492 results for author: Fan, S

.
  1. arXiv:2407.01364  [pdf

    econ.GN

    Co-benefits of Agricultural Diversification and Technology for Food and Nutrition Security in China

    Authors: Thomas Cherico Wanger, Estelle Raveloaritiana, Siyan Zeng, Haixiu Gao, Xueqing He, Yiwen Shao, Panlong Wu, Kris A. G. Wyckhuys, Wenwu Zhou, Yi Zou, Zengrong Zhu, Ling Li, Haiyan Cen, Yunhui Liu, Shenggen Fan

    Abstract: China is the leading crop producer and has successfully implemented sustainable development programs related to agriculture. Sustainable agriculture has been promoted to achieve national food security targets such as food self-sufficiency through the well-facilitated farmland construction (WFFC) approach. The WFFC is introduced in Chinas current national 10-year plan to consolidate farmlands into… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.18824  [pdf, other

    physics.optics cond-mat.mes-hall math-ph

    Topological winding guaranteed coherent orthogonal scattering

    Authors: Cheng Guo, Shanhui Fan

    Abstract: Coherent control has enabled various novel phenomena in wave scattering. We introduce an effect called coherent orthogonal scattering, where the output wave becomes orthogonal to the reference output state without scatterers. This effect leads to a unity extinction coefficient and complete mode conversion. We examine the conditions for this effect and reveal its topological nature by relating it t… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 13 pages, 7 figures. In press

  3. arXiv:2406.11546  [pdf, other

    eess.AS cs.CL cs.SD

    GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

    Authors: Yifan Yang, Zheshu Song, Jianheng Zhuo, Mingyu Cui, **peng Li, Bo Yang, Yexing Du, Ziyang Ma, Xunying Liu, Ziyuan Wang, Ke Li, Shuai Fan, Kai Yu, Wei-Qiang Zhang, Guoguo Chen, Xie Chen

    Abstract: The evolution of speech technology has been spurred by the rapid increase in dataset sizes. Traditional speech models generally depend on a large amount of labeled training data, which is scarce for low-resource languages. This paper presents GigaSpeech 2, a large-scale, multi-domain, multilingual speech recognition corpus. It is designed for low-resource languages and does not rely on paired spee… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2406.05217  [pdf, ps, other

    math.NT

    The shifted prime-divisor function over shifted primes

    Authors: Steve Fan

    Abstract: Let $a,b\in\mathbb{Z}\setminus\{0\}$. For every $n\in\mathbb{N}$, we denote by $ω_a^*(n)$ the number of shifted-prime divisors $p-a$ of $n$, where $p>a$ is prime. In this paper, we study the moments of $ω_a^*$ over shifted primes $p-b$. Specifically, we prove an asymptotic formula for the first moment and upper and lower bounds of the correct order of magnitude for the second moment. These results… ▽ More

    Submitted 14 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: 39 pages

    MSC Class: Primary: 11N36; 11N37; Secondary: 11B05

  5. arXiv:2406.03865  [pdf, other

    cs.CV cs.AI

    Semantic Similarity Score for Measuring Visual Similarity at Semantic Level

    Authors: Senran Fan, Zhicheng Bao, Chen Dong, Haotai Liang, Xiaodong Xu, ** Zhang

    Abstract: Semantic communication, as a revolutionary communication architecture, is considered a promising novel communication paradigm. Unlike traditional symbol-based error-free communication systems, semantic-based visual communication systems extract, compress, transmit, and reconstruct images at the semantic level. However, widely used image similarity evaluation metrics, whether pixel-based MSE or PSN… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2406.03287  [pdf, other

    cs.NE cs.CL cs.LG

    SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

    Authors: Xingrun Xing, Zheng Zhang, Ziyi Ni, Shitao Xiao, Yiming Ju, Siqi Fan, Yequan Wang, Jiajun Zhang, Guoqi Li

    Abstract: Towards energy-efficient artificial intelligence similar to the human brain, the bio-inspired spiking neural networks (SNNs) have advantages of biological plausibility, event-driven sparsity, and binary activation. Recently, large-scale language models exhibit promising generalization capability, making it a valuable issue to explore more general spike-driven models. However, the binary spikes in… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  7. arXiv:2406.02191  [pdf, other

    stat.ML cs.LG

    On the Recoverability of Causal Relations from Temporally Aggregated I.I.D. Data

    Authors: Shunxing Fan, Mingming Gong, Kun Zhang

    Abstract: We consider the effect of temporal aggregation on instantaneous (non-temporal) causal discovery in general setting. This is motivated by the observation that the true causal time lag is often considerably shorter than the observational interval. This discrepancy leads to high aggregation, causing time-delay causality to vanish and instantaneous dependence to manifest. Although we expect such insta… ▽ More

    Submitted 11 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  8. arXiv:2406.02002  [pdf, other

    cs.CL cs.AI

    Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue

    Authors: Shixuan Fan, Wei Wei, Wendi Li, Xian-Ling Mao, Wenfeng Xie, Dangyang Chen

    Abstract: The core of the dialogue system is to generate relevant, informative, and human-like responses based on extensive dialogue history. Recently, dialogue generation domain has seen mainstream adoption of large language models (LLMs), due to its powerful capability in generating utterances. However, there is a natural deficiency for such models, that is, inherent position bias, which may lead them to… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to IJCAI 2024

  9. arXiv:2406.01988  [pdf, other

    cs.CL cs.AI

    Personalized Topic Selection Model for Topic-Grounded Dialogue

    Authors: Shixuan Fan, Wei Wei, Xiaofei Wen, Xianling Mao, Jixiong Chen, Dangyang Chen

    Abstract: Recently, the topic-grounded dialogue (TGD) system has become increasingly popular as its powerful capability to actively guide users to accomplish specific tasks through topic-guided conversations. Most existing works utilize side information (\eg topics or personas) in isolation to enhance the topic selection ability. However, due to disregarding the noise within these auxiliary information sour… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  10. arXiv:2406.01392  [pdf, other

    cs.CL

    Sparsity-Accelerated Training for Large Language Models

    Authors: Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li, Liangtai Sun, Su Zhu, Shuai Fan, Kai Yu

    Abstract: Large language models (LLMs) have demonstrated proficiency across various natural language processing (NLP) tasks but often require additional training, such as continual pre-training and supervised fine-tuning. However, the costs associated with this, primarily due to their large parameter count, remain high. This paper proposes leveraging \emph{sparsity} in pre-trained LLMs to expedite this trai… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Findings

  11. arXiv:2406.00491  [pdf, other

    cs.NI

    Optimizing Age of Information in Random Access Networks: A Second-Order Approach for Active/Passive Users

    Authors: Siqi Fan, Yuxin Zhong, I-Hong Hou, Clement K Kam

    Abstract: In this paper, we study the moments of the Age of Information (AoI) for both active and passive users in a random access network. In this network, active users broadcast sensing data, while passive users detect in-band radio activities from out-of-network devices, such as jammers. Collisions occur when multiple active users transmit simultaneously. Passive users can detect radio activities only wh… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Transaction on Communications. arXiv admin note: text overlap with arXiv:2305.05137

  12. arXiv:2406.00321  [pdf, other

    physics.optics cond-mat.other quant-ph

    Non-Abelian lattice gauge fields in the photonic synthetic frequency dimension

    Authors: Dali Cheng, Kai Wang, Charles Roques-Carmes, Eran Lustig, Olivia Y. Long, Heming Wang, Shanhui Fan

    Abstract: Non-Abelian gauge fields provide a conceptual framework for the description of particles having spins. The theoretical importance of non-Abelian gauge fields motivates their experimental synthesis and explorations. Here, we demonstrate non-Abelian lattice gauge fields for photons. In the study of gauge fields, lattice models are essential for the understanding of their implications in extended sys… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  13. arXiv:2405.20241  [pdf, other

    quant-ph

    Decoherence-free many-body Hamiltonians in nonlinear waveguide quantum electrodynamics

    Authors: Aviv Karnieli, Offek Tziperman, Charles Roques-Carmes, Shanhui Fan

    Abstract: Enhancing interactions in many-body quantum systems, while protecting them from environmental decoherence, is at the heart of many quantum technologies. Waveguide quantum electrodynamics is a promising platform for achieving this, as it hosts infinite-range interactions and decoherence-free subspaces of quantum emitters. However, as coherent interactions between emitters are typically washed out i… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  14. arXiv:2405.19765  [pdf, other

    cs.CV cs.AI

    Towards Unified Multi-granularity Text Detection with Interactive Attention

    Authors: Xingyu Wan, Chengquan Zhang, Pengyuan Lyu, Sen Fan, Zihan Ni, Kun Yao, Errui Ding, **gdong Wang

    Abstract: Existing OCR engines or document image analysis systems typically rely on training separate models for text detection in varying scenarios and granularities, leading to significant computational complexity and resource demands. In this paper, we introduce "Detect Any Text" (DAT), an advanced paradigm that seamlessly unifies scene text detection, layout analysis, and document page detection into a… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  15. arXiv:2405.19665  [pdf

    eess.SY cs.AI cs.LG

    A novel fault localization with data refinement for hydroelectric units

    Authors: Jialong Huang, Junlin Song, Penglong Lian, Mengjie Gan, Zhiheng Su, Benhao Wang, Wenji Zhu, Xiaomin Pu, Jianxiao Zou, Shicai Fan

    Abstract: Due to the scarcity of fault samples and the complexity of non-linear and non-smooth characteristics data in hydroelectric units, most of the traditional hydroelectric unit fault localization methods are difficult to carry out accurate localization. To address these problems, a sparse autoencoder (SAE)-generative adversarial network (GAN)-wavelet noise reduction (WNR)- manifold-boosted deep learni… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6pages,4 figures,Conference on Decision and Control(CDC) conference

  16. arXiv:2405.19642  [pdf

    cs.AI

    Few-shot fault diagnosis based on multi-scale graph convolution filtering for industry

    Authors: Mengjie Gan, Penglong Lian, Zhiheng Su, Jiyang Zhang, Jialong Huang, Benhao Wang, Jianxiao Zou, Shicai Fan

    Abstract: Industrial equipment fault diagnosis often encounter challenges such as the scarcity of fault data, complex operating conditions, and varied types of failures. Signal analysis, data statistical learning, and conventional deep learning techniques face constraints under these conditions due to their substantial data requirements and the necessity for transfer learning to accommodate new failure mode… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6 pages, 2 figures, 2 tables, 63rd IEEE Conference on Decision and Control

  17. arXiv:2405.19454  [pdf, other

    cs.LG stat.ML

    Deep Grokking: Would Deep Neural Networks Generalize Better?

    Authors: Simin Fan, Razvan Pascanu, Martin Jaggi

    Abstract: Recent research on the grokking phenomenon has illuminated the intricacies of neural networks' training dynamics and their generalization behaviors. Grokking refers to a sharp rise of the network's generalization accuracy on the test set, which occurs long after an extended overfitting phase, during which the network perfectly fits the training set. While the existing research primarily focus on s… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  18. arXiv:2405.09418  [pdf, other

    cond-mat.str-el

    Highly Tunable Ru-dimer Molecular Orbital State in 6H-perovskite Ba$_3$MRu$_2$O$_9$

    Authors: Bo Yuan, Beom Hyun Kim, Qiang Chen, Daniel Dobrowolski, Monika Azmanska, G. M. Luke, Shiyu Fan, Valentina Bisogni, Jonathan Pelliciari, J. P. Clancy

    Abstract: Molecular orbital (MO) systems with clusters of heavy transition metal (TM) ions are one of the most important classes of model materials for studying the interplay between local physics and effects of itinerancy. Despite a large number of candidates identified in the family of 4d TM materials, an understanding of their physics from competing \textit{microscopic} energy scales is still missing. We… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures, Supplemental Materials available upon request

  19. arXiv:2405.05288  [pdf, other

    cs.SI cs.IR cs.LG

    Learning Social Graph for Inactive User Recommendation

    Authors: Nian Liu, Shen Fan, Ting Bai, Peng Wang, Mingwei Sun, Yanhu Mo, Xiaoxiao Xu, Hong Liu, Chuan Shi

    Abstract: Social relations have been widely incorporated into recommender systems to alleviate data sparsity problem. However, raw social relations don't always benefit recommendation due to their inferior quality and insufficient quantity, especially for inactive users, whose interacted items are limited. In this paper, we propose a novel social recommendation method called LSIR (\textbf{L}earning \textbf{… ▽ More

    Submitted 22 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: This paper has been received by DASFAA 2024

  20. arXiv:2405.04768  [pdf, other

    cond-mat.mtrl-sci

    Circularly polarized light irradiated ferromagnetic MnBi$_2$Te$_4$: the long-sought ideal Weyl semimetal

    Authors: Shuai Fan, Shengpu Huang, Zhuo Chen, Fangyang Zhan, Xian-Yong Ding, Da-Shuai Ma, Rui Wang

    Abstract: The interaction between light and non-trivial energy band topology allows for the precise manipulation of topological quantum states, which has attracted intensive interest in condensed matter physics. In this work, using first-principles calculations, we studied the topological transition of ferromagnetic (FM) MnBi$_2$Te$_4$ upon irradiation with circularly polarized light (CPL). We revealed that… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  21. arXiv:2405.03121  [pdf, other

    cs.CV cs.AI

    AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding

    Authors: Tao Liu, Feilong Chen, Shuai Fan, Chenpeng Du, Qi Chen, Xie Chen, Kai Yu

    Abstract: The paper introduces AniTalker, an innovative framework designed to generate lifelike talking faces from a single portrait. Unlike existing models that primarily focus on verbal cues such as lip synchronization and fail to capture the complex dynamics of facial expressions and nonverbal cues, AniTalker employs a universal motion representation. This innovative representation effectively captures a… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 14 pages, 7 figures

  22. arXiv:2404.17900  [pdf, other

    cs.CV

    Unsupervised Anomaly Detection via Masked Diffusion Posterior Sampling

    Authors: Di Wu, Shicai Fan, Xue Zhou, Li Yu, Yuzhong Deng, Jianxiao Zou, Baihong Lin

    Abstract: Reconstruction-based methods have been commonly used for unsupervised anomaly detection, in which a normal image is reconstructed and compared with the given test image to detect and locate anomalies. Recently, diffusion models have shown promising applications for anomaly detection due to their powerful generative ability. However, these models lack strict mathematical support for normal image re… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Journal ref: International Joint Conference on Artificial Intelligence 2024

  23. arXiv:2404.15339  [pdf, other

    eess.IV

    Efficient EndoNeRF Reconstruction and Its Application for Data-driven Surgical Simulation

    Authors: Yuehao Wang, Bingchen Gong, Yonghao Long, Siu Hin Fan, Qi Dou

    Abstract: The healthcare industry has a growing need for realistic modeling and efficient simulation of surgical scenes. With effective models of deformable surgical scenes, clinicians are able to conduct surgical planning and surgery training on scenarios close to real-world cases. However, a significant challenge in achieving such a goal is the scarcity of high-quality soft tissue models with accurate sha… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 figures. Accepted by International Journal of Computer Assisted Radiology and Surgery

  24. arXiv:2404.15218  [pdf

    physics.optics physics.ins-det quant-ph

    Highly sensitive and efficient 1550 nm photodetector for room temperature operation

    Authors: Rituraj, Zhi Gang Yu, R. M. E. B. Kandegedara, Shanhui Fan, Srini Krishnamurthy

    Abstract: Photonic quantum technologies such as effective quantum communication require room temperature (RT) operating single- or few- photon sensors with high external quantum efficiency (EQE) at 1550 nm wavelength. The leading class of devices in this segment is avalanche photodetectors operating particularly in the Geiger mode. Often the requirements for RT operation and for a high EQE are in conflict,… ▽ More

    Submitted 12 May, 2024; v1 submitted 20 March, 2024; originally announced April 2024.

  25. arXiv:2404.14059  [pdf, ps, other

    math.PR

    Dual Representation of Unbounded Dynamic Concave Utilities

    Authors: Shengjun Fan, Ying Hu, Shanjian Tang

    Abstract: In several linear spaces of possibly unbounded endowments, we represent the dynamic concave utilities (hence the dynamic convex risk measures) as the solutions of backward stochastic differential equations (BSDEs) with unbounded terminal values, with the help of our recent existence and uniqueness results on unbounded solutions of scalar BSDEs whose generators have a linear, super-linear, sub-quad… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 35 pages

  26. arXiv:2404.12130  [pdf, other

    cs.LG cs.CV cs.DC

    One-Shot Sequential Federated Learning for Non-IID Data by Enhancing Local Model Diversity

    Authors: Naibo Wang, Yuchen Deng, Wenjie Feng, Shichen Fan, Jianwei Yin, See-Kiong Ng

    Abstract: Traditional federated learning mainly focuses on parallel settings (PFL), which can suffer significant communication and computation costs. In contrast, one-shot and sequential federated learning (SFL) have emerged as innovative paradigms to alleviate these costs. However, the issue of non-IID (Independent and Identically Distributed) data persists as a significant challenge in one-shot and SFL se… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  27. arXiv:2404.06079  [pdf, other

    eess.AS cs.AI

    The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

    Authors: Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu

    Abstract: Discrete speech tokens have been more and more popular in multiple speech processing fields, including automatic speech recognition (ASR), text-to-speech (TTS) and singing voice synthesis (SVS). In this paper, we describe the systems developed by the SJTU X-LANCE group for the TTS (acoustic + vocoder), SVS, and ASR tracks in the Interspeech 2024 Speech Processing Using Discrete Speech Unit Challen… ▽ More

    Submitted 9 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: 5 pages, 3 figures. Report of a challenge

  28. arXiv:2404.02438  [pdf, other

    cs.CL cs.LG stat.ML

    From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsy Narratives

    Authors: Shuxian Fan, Adam Visokay, Kentaro Hoffman, Stephen Salerno, Li Liu, Jeffrey T. Leek, Tyler H. McCormick

    Abstract: In settings where most deaths occur outside the healthcare system, verbal autopsies (VAs) are a common tool to monitor trends in causes of death (COD). VAs are interviews with a surviving caregiver or relative that are used to predict the decedent's COD. Turning VAs into actionable insights for researchers and policymakers requires two steps (i) predicting likely COD using the VA interview and (ii… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 7 figures

  29. arXiv:2404.00717  [pdf, other

    cs.RO cs.CV cs.MA

    End-to-End Autonomous Driving through V2X Cooperation

    Authors: Haibao Yu, Wenxian Yang, Jiaru Zhong, Zhenwei Yang, Siqi Fan, ** Luo, Zaiqing Nie

    Abstract: Cooperatively utilizing both ego-vehicle and infrastructure sensor data via V2X communication has emerged as a promising approach for advanced autonomous driving. However, current research mainly focuses on improving individual modules, rather than taking end-to-end learning to optimize final planning performance, resulting in underutilized data potential. In this paper, we introduce UniV2X, a pio… ▽ More

    Submitted 19 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  30. arXiv:2403.19501  [pdf, other

    cs.CV

    RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

    Authors: Ming Yan, Yan Zhang, Shuqiang Cai, Shuqi Fan, Xincheng Lin, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang

    Abstract: Comprehensive capturing of human motions requires both accurate captures of complex poses and precise localization of the human within scenes. Most of the HPE datasets and methods primarily rely on RGB, LiDAR, or IMU data. However, solely using these modalities or a combination of them may not be adequate for HPE, particularly for complex and fast movements. For holistic human motion understanding… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR2024, Project website: http://www.lidarhumanmotion.net/reli11d/

  31. arXiv:2403.19185  [pdf, other

    cs.IT eess.SP

    Deep CSI Compression for Dual-Polarized Massive MIMO Channels with Disentangled Representation Learning

    Authors: Suhang Fan, Wei Xu, Renjie Xie, Shi **, Derrick Wing Kwan Ng, Naofal Al-Dhahir

    Abstract: Channel state information (CSI) feedback is critical for achieving the promised advantages of enhancing spectral and energy efficiencies in massive multiple-input multiple-output (MIMO) wireless communication systems. Deep learning (DL)-based methods have been proven effective in reducing the required signaling overhead for CSI feedback. In practical dual-polarized MIMO scenarios, channels in the… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  32. arXiv:2403.18349  [pdf, other

    cs.CL

    Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

    Authors: Hongshen Xu, Zichen Zhu, Situo Zhang, Da Ma, Shuai Fan, Lu Chen, Kai Yu

    Abstract: Large Language Models (LLMs) often generate erroneous outputs, known as hallucinations, due to their limitations in discerning questions beyond their knowledge scope. While addressing hallucination has been a focal point in research, previous efforts primarily concentrate on enhancing correctness without giving due consideration to the significance of rejection mechanisms. In this paper, we conduc… ▽ More

    Submitted 7 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  33. arXiv:2403.13071  [pdf, other

    quant-ph

    Strong coupling and single-photon nonlinearity in free-electron quantum optics

    Authors: Aviv Karnieli, Charles Roques-Carmes, Nicholas Rivera, Shanhui Fan

    Abstract: The observation that free electrons can interact coherently with quantized electromagnetic fields and matter systems has led to a plethora of proposals leveraging the unique quantum properties of free electrons. At the heart of these proposals lies the assumption of a strong quantum interaction between a flying free electron and a photonic mode. However, existing schemes are intrinsically limited… ▽ More

    Submitted 1 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: References updated in version 2

  34. arXiv:2403.10188  [pdf, other

    cs.CR cs.AR

    Taiyi: A high-performance CKKS accelerator for Practical Fully Homomorphic Encryption

    Authors: Shengyu Fan, Xianglong Deng, Zhuoyu Tian, Zhicheng Hu, Liang Chang, Rui Hou, Dan Meng, Mingzhe Zhang

    Abstract: Fully Homomorphic Encryption (FHE), a novel cryptographic theory enabling computation directly on ciphertext data, offers significant security benefits but is hampered by substantial performance overhead. In recent years, a series of accelerator designs have significantly enhanced the performance of FHE applications, bringing them closer to real-world applicability. However, these accelerators fac… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 14 pages, 15 figures

  35. arXiv:2403.10145  [pdf, other

    cs.CV cs.RO

    RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception

    Authors: Ruiyang Hao, Siqi Fan, Yingru Dai, Zhenlin Zhang, Chenxi Li, Yuntian Wang, Haibao Yu, Wenxian Yang, Jirui Yuan, Zaiqing Nie

    Abstract: The value of roadside perception, which could extend the boundaries of autonomous driving and traffic management, has gradually become more prominent and acknowledged in recent years. However, existing roadside perception approaches only focus on the single-infrastructure sensor system, which cannot realize a comprehensive understanding of a traffic area because of the limited sensing range and bl… ▽ More

    Submitted 31 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024. 10 pages with 6 figures

    ACM Class: I.4.8; I.5.4

  36. arXiv:2403.02181  [pdf, other

    cs.CL cs.AI cs.LG

    Not all Layers of LLMs are Necessary during Inference

    Authors: Siqi Fan, Xin Jiang, Xiang Li, Xuying Meng, Peng Han, Shuo Shang, Aixin Sun, Yequan Wang, Zhongyuan Wang

    Abstract: The inference phase of Large Language Models (LLMs) is very expensive. An ideal inference stage of LLMs could utilize fewer computational resources while still maintaining its capabilities (e.g., generalization and in-context learning ability). In this paper, we try to answer the question, "During LLM inference, can we use shallow layers for easy instances; and deep layers for hard ones?" To answe… ▽ More

    Submitted 14 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  37. arXiv:2402.15272  [pdf, other

    cs.CV cs.AI

    EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection

    Authors: Zhe Wang, Siqi Fan, Xiaoliang Huo, Tongda Xu, Yan Wang, **g**g Liu, Yilun Chen, Ya-Qin Zhang

    Abstract: In autonomous driving, cooperative perception makes use of multi-view cameras from both vehicles and infrastructure, providing a global vantage point with rich semantic context of road conditions beyond a single vehicle viewpoint. Currently, two major challenges persist in vehicle-infrastructure cooperative 3D (VIC3D) object detection: $1)$ inherent pose errors when fusing multi-view images, cause… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 7 pages, 8 figures. Accepted by ICRA 2024. arXiv admin note: text overlap with arXiv:arXiv:2303.10975

  38. arXiv:2402.14435  [pdf, ps, other

    math.PR

    Random time horizon BSDEs with stochastic monotonicity and general growth generators and related PDEs

    Authors: Xinying Li, Yaqi Zhang, Shengjun Fan

    Abstract: This paper is devoted to solving a multidimensional backward stochastic differential equation (BSDE) with a general random terminal time, which may take values in [0,+infinity]. The generator g satisfies a stochastic monotonicity condition in the first unknown variable y and a stochastic Lipschitz continuity condition in the second unknown variable z, and it can have a more general growth with res… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  39. arXiv:2402.09678  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Elementary excitations of single-photon emitters in hexagonal Boron Nitride

    Authors: Jonathan Pelliciari, Enrique Mejia, John M. Woods, Yanhong Gu, Jiemin Li, Saroj B. Chand, Shiyu Fan, Kenji Watanabe, Takashi Taniguchi, Valentina Bisogni, Gabriele Grosso

    Abstract: Single-photon emitters serve as building blocks for many emerging concepts in quantum photonics. The recent identification of bright, tunable, and stable emitters in hexagonal boron nitride (hBN) has opened the door to quantum platforms operating across the infrared to ultraviolet spectrum. While it is widely acknowledged that defects are responsible for single-photon emitters in hBN, crucial deta… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  40. arXiv:2402.07197  [pdf, other

    cs.AI

    GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks

    Authors: Mengmei Zhang, Mingwei Sun, Peng Wang, Shen Fan, Yanhu Mo, Xiaoxiao Xu, Hong Liu, Cheng Yang, Chuan Shi

    Abstract: Large language models (LLMs) like ChatGPT, exhibit powerful zero-shot and instruction-following capabilities, have catalyzed a revolutionary transformation across diverse fields, especially for open-ended tasks. While the idea is less explored in the graph domain, despite the availability of numerous powerful graph models (GMs), they are restricted to tasks in a pre-defined form. Although several… ▽ More

    Submitted 27 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  41. arXiv:2402.05728  [pdf, other

    cs.CV

    CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes

    Authors: Yi-Ting Pan, Chai-Rong Lee, Shu-Ho Fan, Jheng-Wei Su, Jia-Bin Huang, Yung-Yu Chuang, Hung-Kuo Chu

    Abstract: The entertainment industry relies on 3D visual content to create immersive experiences, but traditional methods for creating textured 3D models can be time-consuming and subjective. Generative networks such as StyleGAN have advanced image synthesis, but generating 3D objects with high-fidelity textures is still not well explored, and existing methods have limitations. We propose the Semantic-guide… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  42. arXiv:2402.00704  [pdf, other

    physics.optics quant-ph

    Measuring, processing, and generating partially coherent light with self-configuring optics

    Authors: Charles Roques-Carmes, Shanhui Fan, David Miller

    Abstract: Optical phenomena always display some degree of partial coherence between their respective degrees of freedom. Partial coherence is of particular interest in multimodal systems, where classical and quantum correlations between spatial, polarization, and spectral degrees of freedom can lead to fascinating phenomena (e.g., entanglement) and be leveraged for advanced imaging and sensing modalities (e… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  43. arXiv:2402.00510  [pdf

    astro-ph.EP

    Deciphering Pluto's Haze: How a Solar-Powered Vapor-Pressure Plume Shapes Its Bimodal Particle Size Distribution

    Authors: Sihe Chen, Danica Adams, Siteng Fan, Peter Gao, Eliot Young, Yuk Yung

    Abstract: Combining findings from New Horizons' suite of instruments reveals a bimodal haze particle distribution within Pluto's atmosphere, which haze models have not been able to reproduce. We employ the photochemical and microphysics KINAERO model to simulate seasonal cycles and their impact on the haze distribution. We find that the smaller spherical particle mode can be generated through photochemistry… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  44. arXiv:2401.17576  [pdf, other

    math.PR

    Existence, uniqueness and comparison theorem on unbounded solutions of general time interval BSDEs with sub-quadratic generators

    Authors: Chuang Gu, Yan Wang, Shengjun Fan

    Abstract: This paper is devoted to the existence, uniqueness and comparison theorem on unbounded solutions of one-dimensional backward stochastic differential equations (BSDEs) with sub-quadratic generators, where the terminal time is allowed to be finite or infinite. We first establish existence of the unbounded solutions for this kind of BSDEs with generator $g$ satisfying a time-varying one-sided linear… ▽ More

    Submitted 9 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 28 pages

  45. arXiv:2401.17560  [pdf, ps, other

    math.PR

    On the existence and uniqueness of unbounded solutions to quadratic BSDEs with monotonic-convex generators

    Authors: Yan Wang, Xinying Li, Chuang Gu, Shengjun Fan

    Abstract: With the terminal value $ξ^-$ admitting a certain exponential moment and $ξ^+$ admitting every exponential moments or being bounded, we establish several existence and uniqueness results for unbounded solutions of backward stochastic differential equations (BSDEs) whose generator $g$ satisfies a monotonicity condition with general growth in the first unknown variable $y$ and a convexity condition… ▽ More

    Submitted 5 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 23 pages

  46. arXiv:2401.16784  [pdf, other

    cs.LG cs.AI cs.SI

    Graph Fairness Learning under Distribution Shifts

    Authors: Yibo Li, Xiao Wang, Yujie Xing, Shaohua Fan, Ruijia Wang, Yaoqi Liu, Chuan Shi

    Abstract: Graph neural networks (GNNs) have achieved remarkable performance on graph-structured data. However, GNNs may inherit prejudice from the training data and make discriminatory predictions based on sensitive attributes, such as gender and race. Recently, there has been an increasing interest in ensuring fairness on GNNs, but all of them are under the assumption that the training and testing data are… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by WWW 2024

  47. arXiv:2401.14818  [pdf, other

    cs.CL cs.DL

    ChemDFM: Dialogue Foundation Model for Chemistry

    Authors: Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li, Hongshen Xu, Zichen Zhu, Su Zhu, Shuai Fan, Guodong Shen, Xin Chen, Kai Yu

    Abstract: Large language models (LLMs) have established great success in the general domain of natural language processing. Their emerging task generalization and free-form dialogue capabilities can greatly help to design Chemical General Intelligence (CGI) to assist real-world research in chemistry. However, the existence of specialized language and knowledge in the field of chemistry, such as the highly i… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 10 pages, 12 figures, 13 tables. Under Review

  48. arXiv:2401.12564  [pdf, other

    cs.LG cs.SI

    Graph Contrastive Invariant Learning from the Causal Perspective

    Authors: Yanhu Mo, Xiao Wang, Shaohua Fan, Chuan Shi

    Abstract: Graph contrastive learning (GCL), learning the node representation by contrasting two augmented graphs in a self-supervised way, has attracted considerable attention. GCL is usually believed to learn the invariant representation. However, does this understanding always hold in practice? In this paper, we first study GCL from the perspective of causality. By analyzing GCL with the structural causal… ▽ More

    Submitted 7 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  49. arXiv:2401.11661  [pdf, other

    math-ph cond-mat.other

    One-dimensional non-Hermitian band structures as Riemann surfaces

    Authors: Heming Wang, Lingling Fan, Shanhui Fan

    Abstract: We present the viewpoint of treating one-dimensional band structures as Riemann surfaces, linking the unique properties of non-Hermiticity to the geometry and topology of the Riemann surface. Branch cuts and branch points play a significant role when this viewpoint is applied to both the open-boundary spectrum and the braiding structure. An open-boundary spectrum is interpreted as branch cuts conn… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 13 pages, 6 figures

  50. arXiv:2401.10427  [pdf, ps, other

    math.NT

    Shifted-prime divisors

    Authors: Steve Fan, Carl Pomerance

    Abstract: Let $ω^*(n)$ denote the number of divisors of $n$ that are shifted primes, that is, the number of divisors of $n$ of the form $p-1$, with $p$ prime. Studied by Prachar in an influential paper from 70 years ago, the higher moments of $ω^*(n)$ are still somewhat a mystery. This paper addresses these higher moments and considers other related problems.

    Submitted 19 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 24 pages, 3 tables

    MSC Class: 11N25; 11N37; 11B05