Showing 1–50 of 758 results for author: Fang, Z

  1. arXiv:2407.00952  [pdf, other]

    cs.LG cs.CL cs.DC

    SplitLoRA: A Split Parameter-Efficient Fine-Tuning Framework for Large Language Models

    Authors: Zheng Lin, Xuanjie Hu, Yuxin Zhang, Zhe Chen, Zihan Fang, Xianhao Chen, Ang Li, Praneeth Vepakomma, Yue Gao

    Abstract: The scalability of large language models (LLMs) in handling high-complexity models and large-scale datasets has led to tremendous successes in pivotal domains. While there is an urgent need to acquire more training data for LLMs, a concerning reality is the depletion of high-quality public datasets within a few years. In view of this, the federated learning (FL) LLM fine-tuning paradigm recently h…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 9 pages, 3 figures

  2. arXiv:2406.19311  [pdf, other]

    cs.CR cs.SD eess.AS

    Zero-Query Adversarial Attack on Black-box Automatic Speech Recognition Systems

    Authors: Zheng Fang, Tao Wang, Lingchen Zhao, Shenyi Zhang, Bowen Li, Yunjie Ge, Qi Li, Chao Shen, Qian Wang

    Abstract: In recent years, extensive research has been conducted on the vulnerability of ASR systems, revealing that black-box adversarial example attacks pose significant threats to real-world ASR systems. However, most existing black-box attacks rely on queries to the target ASRs, which is impractical when queries are not permitted. In this paper, we propose ZQ-Attack, a transfer-based adversarial attack…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: To appear in the Proceedings of The ACM Conference on Computer and Communications Security (CCS), 2024

  3. arXiv:2406.18140  [pdf, other]

    cs.CV cs.AI

    Exclusive Style Removal for Cross Domain Novel Class Discovery

    Authors: Yicheng Wang, Feng Liu, Junmin Liu, Zhen Fang, Kai Sun

    Abstract: As a promising field in open-world learning, Novel Class Discovery (NCD) is usually a task to cluster unseen novel classes in an unlabeled set based on the prior knowledge of labeled data within the same domain. However, the performance of existing NCD methods could be severely compromised when novel classes are sampled from a different distribution with the labeled ones. In this paper, w…

    Submitted 26 June, 2024; originally announced June 2024.

  4. arXiv:2406.16527  [pdf]

    cs.CL cs.CY cs.DL cs.LG

    SyROCCo: Enhancing Systematic Reviews using Machine Learning

    Authors: Zheng Fang, Miguel Arana-Catania, Felix-Anselm van Lier, Juliana Outes Velarde, Harry Bregazzi, Mara Airoldi, Eleanor Carter, Rob Procter

    Abstract: The sheer number of research outputs published every year makes systematic reviewing increasingly time- and resource-intensive. This paper explores the use of machine learning techniques to help navigate the systematic review process. ML has previously been used to reliably 'screen' articles for review - that is, identify relevant articles based on reviewers' inclusion criteria. The application of…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 28 pages, 5 figures. To appear in Data & Policy journal

  5. arXiv:2406.15774  [pdf, other]

    cs.RO

    Observation Time Difference: an Online Dynamic Objects Removal Method for Ground Vehicles

    Authors: Rongguang Wu, Chenglin Pang, Xuankang Wu, Zheng Fang

    Abstract: In the process of urban environment mapping, the sequential accumulations of dynamic objects will leave a large number of traces in the map. These traces will usually have bad influences on the localization accuracy and navigation performance of the robot. Therefore, dynamic objects removal plays an important role for creating clean map. However, conventional dynamic objects removal methods usuall…

    Submitted 22 June, 2024; originally announced June 2024.

  6. arXiv:2406.10948  [pdf]

    cs.LG cs.AI

    Incorporating uncertainty quantification into travel mode choice modeling: a Bayesian neural network (BNN) approach and an uncertainty-guided active survey framework

    Authors: Shuwen Zheng, Zhou Fang, Liang Zhao

    Abstract: Existing deep learning approaches for travel mode choice modeling fail to inform modelers about their prediction uncertainty. Even when facing scenarios that are out of the distribution of training data, which implies high prediction uncertainty, these approaches still provide deterministic answers, potentially leading to misguidance. To address this limitation, this study introduces the concept o…

    Submitted 16 June, 2024; originally announced June 2024.

  7. arXiv:2406.10054  [pdf, other]

    cs.SE cs.CR

    SmartOracle: Generating Smart Contract Oracle via Fine-Grained Invariant Detection

    Authors: Jianzhong Su, Jiachi Chen, Zhiyuan Fang, Xingwei Lin, Yutian Tang, Zibin Zheng

    Abstract: As decentralized applications (DApps) proliferate, the increased complexity and usage of smart contracts have heightened their susceptibility to security incidents and financial losses. Although various vulnerability detection tools have been developed to mitigate these issues, they often suffer poor performance in detecting vulnerabilities, as they either rely on simplistic and general-purpose or…

    Submitted 14 June, 2024; originally announced June 2024.

  8. arXiv:2406.09931  [pdf, other]

    eess.IV cs.CV cs.LG

    SCKansformer: Fine-Grained Classification of Bone Marrow Cells via Kansformer Backbone and Hierarchical Attention Mechanisms

    Authors: Yifei Chen, Zhu Zhu, Shenghao Zhu, Linwei Qiu, Binfeng Zou, Fan Jia, Yunpeng Zhu, Chenyan Zhang, Zhaojie Fang, Feiwei Qin, Jin Fan, Changmiao Wang, Yu Gao, Gang Yu

    Abstract: The incidence and mortality rates of malignant tumors, such as acute leukemia, have risen significantly. Clinically, hospitals rely on cytological examination of peripheral blood and bone marrow smears to diagnose malignant tumors, with accurate blood cell counting being crucial. Existing automated methods face challenges such as low feature expression capability, poor interpretability, and redund…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures

  9. arXiv:2406.07626  [pdf, other]

    cond-mat.mes-hall cond-mat.dis-nn cond-mat.quant-gas quant-ph

    Tailoring Bound State Geometry in High-Dimensional Non-Hermitian Systems

    Authors: Ao Yang, Zixi Fang, Kai Zhang, Chen Fang

    Abstract: It is generally believed that the non-Hermitian skin effect (NHSE), due to its non-reciprocal nature, creates barriers for the appearance of impurity bound states. In this paper, we find that in two and higher dimensions, the presence of geometry-dependent skin effect eliminates this barrier such that even an infinitesimal impurity potential can confine bound states in this type of non-Hermitian system…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures

  10. arXiv:2406.07057  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

    Authors: Yichi Zhang, Yao Huang, Yitong Sun, Chang Liu, Zhe Zhao, Zhengwei Fang, Yifan Wang, Huanran Chen, Xiao Yang, Xingxing Wei, Hang Su, Yinpeng Dong, Jun Zhu

    Abstract: Despite the superior capabilities of Multimodal Large Language Models (MLLMs) across diverse tasks, they still face significant trustworthiness challenges. Yet, current literature on the assessment of trustworthy MLLMs remains limited, lacking a holistic evaluation to offer thorough insights into future improvements. In this work, we establish MultiTrust, the first comprehensive and unified benchm…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 100 pages, 84 figures, 33 tables

  11. arXiv:2406.06512  [pdf, other]

    cs.CV cs.AI

    Merlin: A Vision Language Foundation Model for 3D Computed Tomography

    Authors: Louis Blankemeier, Joseph Paul Cohen, Ashwin Kumar, Dave Van Veen, Syed Jamal Safdar Gardezi, Magdalini Paschali, Zhihong Chen, Jean-Benoit Delbrouck, Eduardo Reis, Cesar Truyts, Christian Bluethgen, Malte Engmann Kjeldskov Jensen, Sophie Ostmeier, Maya Varma, Jeya Maria Jose Valanarasu, Zhongnan Fang, Zepeng Huo, Zaid Nabulsi, Diego Ardila, Wei-Hung Weng, Edson Amaro Junior, Neera Ahuja, Jason Fries, Nigam H. Shah, Andrew Johnston, et al. (6 additional authors not shown)

    Abstract: Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision la…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  12. arXiv:2406.04737  [pdf, other]

    eess.SP eess.SY

    Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network

    Authors: Honglei Ma, Erwu Liu, Zhijun Fang, Rui Wang, Yongbin Gao, Wenjun Yu, Dongming Zhang

    Abstract: The cellular network of magnetic Induction (MI) communication holds promise in long-distance underground environments. In the traditional MI communication, there is no fast-fading channel since the MI channel is treated as a quasi-static channel. However, for the vehicle (mobile) MI (VMI) communication, the unpredictable antenna vibration brings the remarkable fast-fading. As such fast-fading cann…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE TWC for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  13. arXiv:2406.02399  [pdf, other]

    math.PR

    How often does a critical elephant random walk return to origin

    Authors: Zheng Fang

    Abstract: We study the asymptotic behaviour of the number of times the elephant random walk in the critical regime visits the origin. Our result entails that most zeros of the critical elephant random walk occur shortly before its last passage time at zero. Additionally, we derive the tail estimate of the first return time of the random walk.

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 13 pages, 1 figure

  14. arXiv:2406.01240  [pdf, other]

    eess.IV

    Arctic Sea Ice Image Super-Resolution Based on Multi-Scale Convolution and Dual-Gating Mechanism

    Authors: Zhaomin Fang, Wankun Chen, Feng Gao, Yanhai Gan, Junyu Dong, Yang Zhou

    Abstract: Arctic Sea Ice Concentration (SIC) is the ratio of ice-covered area to the total sea area of the Arctic Ocean, which is a key indicator for maritime activities. Nowadays, we often use passive microwave images to display SIC, but it has low spatial resolution, and most of the existing super-resolution methods of Arctic SIC don't take the integration of spatial and channel features into account and…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE IGARSS 2024

  15. arXiv:2406.01031  [pdf, ps, other]

    cs.IT

    Geometric constellation shaping for wireless optical intensity channels

    Authors: Suhua Zhou, Tianqi Li, Zhaoxi Fang, Jing Zhou, Wenyi Zhang

    Abstract: A simple geometric shaping method is proposed for optical wireless communication systems based on intensity modulation and direct detection (IM/DD). Constellations consisting of equiprobable levels with exponential-like distribution are obtained. The method possesses asymptotic optimality in the sense that the high-SNR channel capacity can be approached by such constellations with increasing size.…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  16. arXiv:2406.00452  [pdf, other]

    cs.LG cs.AI

    Towards a Unified Framework of Clustering-based Anomaly Detection

    Authors: Zeyu Fang, Ming Gu, Sheng Zhou, Jiawei Chen, Qiaoyu Tan, Haishuai Wang, Jiajun Bu

    Abstract: Unsupervised Anomaly Detection (UAD) plays a crucial role in identifying abnormal patterns within data without labeled examples, holding significant practical implications across various domains. Although the individual contributions of representation learning and clustering to anomaly detection are well-established, their interdependencies remain under-explored due to the absence of a unified the…

    Submitted 1 June, 2024; originally announced June 2024.

  17. arXiv:2405.19878  [pdf, other]

    cs.LG cs.GT

    Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models

    Authors: Zeyu Fang, Tian Lan

    Abstract: Generative models such as diffusion have been employed as world models in offline reinforcement learning to generate synthetic data for more effective learning. Existing work either generates diffusion models one-time prior to training or requires additional interaction data to update it. In this paper, we propose a novel approach for offline reinforcement learning with closed-loop policy evaluati…

    Submitted 30 May, 2024; originally announced May 2024.

  18. arXiv:2405.19856  [pdf, other]

    cs.CL cs.SE

    DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

    Authors: Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yuqi Zhu, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

    Abstract: How to evaluate the coding abilities of Large Language Models (LLMs) remains an open question. We find that existing benchmarks are poorly aligned with real-world code repositories and are insufficient to evaluate the coding abilities of LLMs. To address the knowledge gap, we propose a new benchmark named DevEval, which has three advances. (1) DevEval aligns with real-world repositories in multi…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401

  19. arXiv:2405.19668  [pdf, other]

    cs.CV

    AutoBreach: Universal and Adaptive Jailbreaking with Efficient Wordplay-Guided Optimization

    Authors: Jiawei Chen, Xiao Yang, Zhengwei Fang, Yu Tian, Yinpeng Dong, Zhaoxia Yin, Hang Su

    Abstract: Despite the widespread application of large language models (LLMs) across various tasks, recent studies indicate that they are susceptible to jailbreak attacks, which can render their defense mechanisms ineffective. However, previous jailbreak research has frequently been constrained by limited universality, suboptimal efficiency, and a reliance on manual crafting. In response, we rethink the appr…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Under review

  20. arXiv:2405.16583  [pdf]

    physics.optics

    An erbium-doped waveguide amplifier on thin film lithium niobate with an output power exceeding 100 mW

    Authors: Rui Bao, Zhiwei Fang, Jian Liu, Zhaoxiang Liu, Jinming Chen, Min Wang, Rongbo Wu, Haisu Zhang, Ya Cheng

    Abstract: We demonstrate high-power thin film lithium niobate (TFLN) erbium-doped waveguide amplifier (EDWA) with a maximum on-chip output power of 113 mW and a gain of 16 dB. The on-chip integrated EDWA is composed of large mode area (LMA) waveguide structures with a total length of 7 cm and a footprint of 1x1 cm2. Particularly, we connect segmented LMA waveguides with waveguide tapers to achieve on-chip m…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 13 pages, 4 figures

  21. arXiv:2405.15705  [pdf, other]

    cs.AR eess.SY

    Sums: Sniffing Unknown Multiband Signals under Low Sampling Rates

    Authors: Jinbo Peng, Zhe Chen, Zheng Lin, Haoxuan Yuan, Zihan Fang, Lingzhong Bao, Zihang Song, Ying Li, Jing Ren, Yue Gao

    Abstract: Due to sophisticated deployments of all kinds of wireless networks (e.g., 5G, Wi-Fi, Bluetooth, LEO satellite, etc.), multiband signals distribute in a large bandwidth (e.g., from 70 MHz to 8 GHz). Consequently, for network monitoring and spectrum sharing applications, a sniffer for extracting physical layer information, such as structure of packet, with low sampling rate (especially, sub-Nyquist…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 12 pages, 9 figures

  22. arXiv:2405.15542  [pdf, other]

    cs.NI cs.DC cs.LG eess.SP

    SATSense: Multi-Satellite Collaborative Framework for Spectrum Sensing

    Authors: Haoxuan Yuan, Zhe Chen, Zheng Lin, Jinbo Peng, Zihan Fang, Yuhang Zhong, Zihang Song, Yue Gao

    Abstract: Low Earth Orbit satellite Internet has recently been deployed, providing worldwide service with non-terrestrial networks. With the large-scale deployment of both non-terrestrial and terrestrial networks, limited spectrum resources will not be allocated enough. Consequently, dynamic spectrum sharing is crucial for their coexistence in the same spectrum, where accurate spectrum sensing is essential.…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 13 pages, 16 figures

  23. arXiv:2405.14578  [pdf, other]

    cs.LG

    Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling

    Authors: Shuaipeng Li, Penghao Zhao, Hailin Zhang, Xingwu Sun, Hao Wu, Dian Jiao, Weiyan Wang, Chengjun Liu, Zheng Fang, Jinbao Xue, Yangyu Tao, Bin Cui, Di Wang

    Abstract: In current deep learning tasks, Adam style optimizers such as Adam, Adagrad, RMSProp, Adafactor, and Lion have been widely used as alternatives to SGD style optimizers. These optimizers typically update model parameters using the sign of gradients, resulting in more stable convergence curves. The learning rate and the batch size are the most critical hyperparameters for optimizers, which require c…

    Submitted 4 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  24. arXiv:2405.14153  [pdf, other]

    cs.LG

    A Neighbor-Searching Discrepancy-based Drift Detection Scheme for Learning Evolving Data

    Authors: Feng Gu, Jie Lu, Zhen Fang, Kun Wang, Guangquan Zhang

    Abstract: Uncertain changes in data streams present challenges for machine learning models to dynamically adapt and uphold performance in real-time. Particularly, classification boundary change, also known as real concept drift, is the major cause of classification performance deterioration. However, accurately detecting real concept drift remains challenging because the theoretical foundations of existing…

    Submitted 23 May, 2024; originally announced May 2024.

  25. arXiv:2405.12601  [pdf, other]

    cs.CV

    FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors

    Authors: Shuai Liu, Boyang Li, Zhiyu Fang, Mingyue Cui, Kai Huang

    Abstract: LiDAR-based 3D object detection has made impressive progress recently, yet most existing models are black-box, lacking interpretability. Previous explanation approaches primarily focus on analyzing image-based models and are not readily applicable to LiDAR-based 3D detectors. In this paper, we propose a feature factorization activation map (FFAM) to generate high-quality visual explanations for 3D…

    Submitted 21 May, 2024; originally announced May 2024.

  26. arXiv:2405.11794  [pdf, other]

    cs.CV

    ViViD: Video Virtual Try-on using Diffusion Models

    Authors: Zixun Fang, Wei Zhai, Aimin Su, Hongliang Song, Kai Zhu, Mao Wang, Yu Chen, Zhiheng Liu, Yang Cao, Zheng-Jun Zha

    Abstract: Video virtual try-on aims to transfer a clothing item onto the video of a target person. Directly applying the technique of image-based try-on to the video domain in a frame-wise manner will cause temporal-inconsistent outcomes while previous video-based try-on solutions can only generate low visual quality and blurring results. In this work, we present ViViD, a novel framework employing powerful…

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  27. arXiv:2405.09245  [pdf, other]

    eess.SP

    A Robust UAV-Based Approach for Power-Modulated Jammer Localization Using DoA

    Authors: Zexin Fang, Bin Han, Hans D. Schotten

    Abstract: Unmanned aerial vehicles (UAVs) are well-suited to localize jammers, particularly when jammers are at non-terrestrial locations, where conventional detection methods face challenges. In this work we propose a novel localization method, sample pruning gradient descend (SPGD), which offers robust performance against multiple power-modulated jammers with low computational complexity.

    Submitted 21 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: Submitted to the 2024 IEEE 100th Vehicular Technology Conference (VTC2024-Fall)

  28. arXiv:2405.08748  [pdf, other]

    cs.CV

    Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

    Authors: Zhimin Li, Jianwei Zhang, Qin Lin, Jiangfeng Xiong, Yanxin Long, Xinchi Deng, Yingfang Zhang, Xingchao Liu, Minbin Huang, Zedong Xiao, Dayou Chen, Jiajun He, Jiahao Li, Wenyue Li, Chen Zhang, Rongwei Quan, Jianxiang Lu, Jiabin Huang, Xiaoyan Yuan, Xiaoxiao Zheng, Yixuan Li, Jihong Zhang, Chao Zhang, Meng Chen, Jie Liu, et al. (20 additional authors not shown)

    Abstract: We present Hunyuan-DiT, a text-to-image diffusion transformer with fine-grained understanding of both English and Chinese. To construct Hunyuan-DiT, we carefully design the transformer structure, text encoder, and positional encoding. We also build from scratch a whole data pipeline to update and evaluate data for iterative model optimization. For fine-grained language understanding, we train a Mu…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Project Page: https://dit.hunyuan.tencent.com/

  29. arXiv:2405.08638  [pdf, other]

    cs.LG

    vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement

    Authors: Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan

    Abstract: Reinforcement Learning (RL) is a widely employed technique in decision-making problems, encompassing two fundamental operations -- policy evaluation and policy improvement. Enhancing learning efficiency remains a key challenge in RL, with many efforts focused on using ensemble critics to boost policy evaluation efficiency. However, when using multiple critics, the actor in the policy improvement p…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024, with appendix

  30. arXiv:2405.04834  [pdf, other]

    cs.CV

    FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation

    Authors: Xuehai He, Jian Zheng, Jacob Zhiyuan Fang, Robinson Piramuthu, Mohit Bansal, Vicente Ordonez, Gunnar A Sigurdsson, Nanyun Peng, Xin Eric Wang

    Abstract: Controllable text-to-image (T2I) diffusion models generate images conditioned on both text prompts and semantic inputs of other modalities like edge maps. Nevertheless, current controllable T2I methods commonly face challenges related to efficiency and faithfulness, especially when conditioning on multiple inputs from either the same or diverse modalities. In this paper, we propose a novel Flexibl…

    Submitted 21 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  31. arXiv:2405.03323  [pdf, other]

    cond-mat.mes-hall

    Distance between two manifolds, topological phase transitions and scaling laws

    Authors: ZhaoXiang Fang, Ming Gong, Guang-Can Guo, Yongxu Fu, Long Xiong

    Abstract: Topological phases are generally characterized by topological invariants denoted by integer numbers. However, different topological systems often require different topological invariants to measure, such as geometric phases, topological orders, winding numbers, etc. Moreover, geometric phases and its associated definitions usually fail at critical points. Therefore, it's challenging to predict wha…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 7+36 pages, 4+31 figures

  32. arXiv:2405.00542  [pdf, other]

    eess.IV cs.CV

    UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement

    Authors: Ruiquan Ge, Zhaojie Fang, Pengxue Wei, Zhanghao Chen, Hongyang Jiang, Ahmed Elazab, Wangting Li, Xiang Wan, Shaochong Zhang, Changmiao Wang

    Abstract: Fundus photography, in combination with the ultra-wide-angle fundus (UWF) techniques, becomes an indispensable diagnostic tool in clinical settings by offering a more comprehensive view of the retina. Nonetheless, UWF fluorescein angiography (UWF-FA) necessitates the administration of a fluorescent dye via injection into the patient's hand or elbow unlike UWF scanning laser ophthalmoscopy (UWF-SLO…

    Submitted 1 May, 2024; originally announced May 2024.

  33. arXiv:2405.00358  [pdf, other]

    cs.AI cs.LG

    Arbitrary Time Information Modeling via Polynomial Approximation for Temporal Knowledge Graph Embedding

    Authors: Zhiyu Fang, Jingyan Qin, Xiaobin Zhu, Chun Yang, Xu-Cheng Yin

    Abstract: Distinguished from traditional knowledge graphs (KGs), temporal knowledge graphs (TKGs) must explore and reason over temporally evolving facts adequately. However, existing TKG approaches still face two main challenges, i.e., the limited capability to model arbitrary timestamps continuously and the lack of rich inference patterns under temporal constraints. In this paper, we propose an innovative…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted by LREC-COLING 2024 (long paper, camera-ready version)

  34. Transformer-based Reasoning for Learning Evolutionary Chain of Events on Temporal Knowledge Graph

    Authors: Zhiyu Fang, Shuai-Long Lei, Xiaobin Zhu, Chun Yang, Shi-Xue Zhang, Xu-Cheng Yin, Jingyan Qin

    Abstract: Temporal Knowledge Graph (TKG) reasoning often involves completing missing factual elements along the timeline. Although existing methods can learn good embeddings for each factual element in quadruples by integrating temporal information, they often fail to infer the evolution of temporal facts. This is mainly because of (1) insufficiently exploring the internal structure and semantic relationshi…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted by SIGIR 2024 (the Full paper track, camera ready version)

  35. arXiv:2404.19455  [pdf, ps, other]

    math.DS

    Bifurcations and explicit unfoldings of grazing loops connecting one high multiplicity tangent point

    Authors: Zhihao Fang, Xingwu Chen

    Abstract: For piecewise-smooth differential systems, in this paper we focus on crossing limit cycles and sliding loops bifurcating from a grazing loop connecting one high multiplicity tangent point. For the low multiplicity cases considered in previous publications, the method is to define and analyze return maps following the classic idea of Poincaré. However, high multiplicity leads to that either domains…

    Submitted 30 April, 2024; originally announced April 2024.

  36. arXiv:2404.19450  [pdf, ps, other]

    math.DS

    Classifications and bifurcations of tangent points and their loops of planar piecewise-smooth systems

    Authors: Zhihao Fang, Xingwu Chen

    Abstract: Tangent points, especial dynamics existing only in piecewise-smooth systems, usually have dynamical properties like equilibria of smooth systems. Loops connecting tangent points own partly properties of limit cycles and homoclinic loops of smooth systems. In this paper we give classifications for tangent points by tangency degree and for loops connecting them by configuration, and investigate thei…

    Submitted 30 April, 2024; originally announced April 2024.

  37. arXiv:2404.15992  [pdf, other]

    cs.CV eess.IV

    HDDGAN: A Heterogeneous Dual-Discriminator Generative Adversarial Network for Infrared and Visible Image Fusion

    Authors: Guosheng Lu, Zile Fang, Chunming He, Zhigang Zhao

    Abstract: Infrared and visible image fusion (IVIF) aims to preserve thermal radiation information from infrared images while integrating texture details from visible images, enabling the capture of important features and hidden details of subjects in complex scenes and disturbed environments. Consequently, IVIF offers distinct advantages in practical applications such as video surveillance, night navigation…

    Submitted 24 April, 2024; originally announced April 2024.

  38. arXiv:2404.14956  [pdf, other]

    eess.IV cs.CV

    DAWN: Domain-Adaptive Weakly Supervised Nuclei Segmentation via Cross-Task Interactions

    Authors: Ye Zhang, Yifeng Wang, Zijie Fang, Hao Bian, Linghan Cai, Ziyue Wang, Yongbing Zhang

    Abstract: Weakly supervised segmentation methods have gained significant attention due to their ability to reduce the reliance on costly pixel-level annotations during model training. However, the current weakly supervised nuclei segmentation approaches typically follow a two-stage pseudo-label generation and network training process. The performance of the nuclei segmentation heavily relies on the quality…

    Submitted 24 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 13 pages, 11 figures, 8 tables

  39. arXiv:2404.12352  [pdf, other]

    cs.CV

    Point-In-Context: Understanding Point Cloud via In-Context Learning

    Authors: Mengyuan Liu, Zhongbin Fang, Xia Li, Joachim M. Buhmann, Xiangtai Li, Chen Change Loy

    Abstract: With the emergence of large-scale models trained on diverse datasets, in-context learning has emerged as a promising paradigm for multitasking, notably in natural language processing and image processing. However, its application in 3D point cloud tasks remains largely unexplored. In this work, we introduce Point-In-Context (PIC), a novel framework for 3D point cloud understanding via in-context l…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Project page: https://fanglaosi.github.io/Point-In-Context_Pages. arXiv admin note: text overlap with arXiv:2306.08659

  40. arXiv:2404.11450  [pdf, other]

    cs.DB cs.CR

    Real-Time Trajectory Synthesis with Local Differential Privacy

    Authors: Yujia Hu, Yuntao Du, Zhikun Zhang, Ziquan Fang, Lu Chen, Kai Zheng, Yunjun Gao

    Abstract: Trajectory streams are being generated from location-aware devices, such as smartphones and in-vehicle navigation systems. Due to the sensitive nature of the location data, directly sharing user trajectories suffers from privacy leakage issues. Local differential privacy (LDP), which perturbs sensitive data on the user side before it is shared or analyzed, emerges as a promising solution for priva…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by ICDE 2024. Code is available at: https://github.com/ZJU-DAILY/RetraSyn

  41. arXiv:2404.10714  [pdf, other]

    eess.IV cs.CV

    AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation

    Authors: Zexin Li, Yiyang Lin, Zijie Fang, Shuyan Li, Xiu Li

    Abstract: Different types of staining highlight different structures in organs, thereby assisting in diagnosis. However, due to the impossibility of repeated staining, we cannot obtain different types of stained slides of the same tissue area. Translating the slide that is easy to obtain (e.g., H&E) to slides of staining types difficult to obtain (e.g., MT, PAS) is a promising way to solve this problem. How…

    Submitted 16 April, 2024; originally announced April 2024.

  42. arXiv:2404.09005  [pdf, ps, other]

    cs.CR cs.AI cs.ET cs.GT cs.LG

    Proof-of-Learning with Incentive Security

    Authors: Zishuo Zhao, Zhixuan Fang, Xuechao Wang, Xi Chen, Yuan Zhou

    Abstract: Most concurrent blockchain systems rely heavily on the Proof-of-Work (PoW) or Proof-of-Stake (PoS) mechanisms for decentralized consensus and security assurance. However, the substantial energy expenditure stemming from computationally intensive yet meaningless tasks has raised considerable concerns surrounding traditional PoW approaches. The PoS mechanism, while free of energy consumption, is sub…

    Submitted 4 June, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

    Comments: 16 pages

  43. arXiv:2404.06891  [pdf, other]

    cs.NI

    PACP: Priority-Aware Collaborative Perception for Connected and Autonomous Vehicles

    Authors: Zhengru Fang, Senkang Hu, Haonan An, Yuang Zhang, Jingjing Wang, Hangcheng Cao, Xianhao Chen, Yuguang Fang

    Abstract: Surrounding perceptions are quintessential for safe driving for connected and autonomous vehicles (CAVs), where the Bird's Eye View (BEV) has been employed to accurately capture spatial relationships among vehicles. However, severe inherent limitations of BEV, like blind spots, have been identified. Collaborative perception has emerged as an effective solution to overcoming these limitations through dat…

    Submitted 1 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  44. arXiv:2404.06448  [pdf, other]

    cs.LG cs.AI

    Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models

    Authors: Zihan Fang, Zheng Lin, Zhe Chen, Xianhao Chen, Yue Gao, Yuguang Fang

    Abstract: Recently, there has been a surge in the development of advanced intelligent generative content (AIGC), especially large language models (LLMs). However, for many downstream tasks, it is necessary to fine-tune LLMs using private data. While federated learning offers a promising privacy-preserving solution to LLM fine-tuning, the substantial size of an LLM, combined with high computational and commu…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 15 pages, 16 figures

  45. arXiv:2404.06345  [pdf, other]

    cs.AI cs.RO

    AgentsCoDriver: Large Language Model Empowered Collaborative Driving with Lifelong Learning

    Authors: Senkang Hu, Zhengru Fang, Zihan Fang, Yiqin Deng, Xianhao Chen, Yuguang Fang

    Abstract: Connected and autonomous driving has been developing rapidly in recent years. However, current autonomous driving systems, which are primarily based on data-driven approaches, exhibit deficiencies in interpretability, generalization, and continuing learning capabilities. In addition, single-vehicle autonomous driving systems lack the ability to collaborate and negotiate with other vehicles, w…

    Submitted 21 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  46. arXiv:2404.04882  [pdf]

    physics.flu-dyn

    Turbulence-induced oscillation on particle detachment from a wall

    Authors: Zhikai You, Yiyang Zhang, Zhu Fang, Shuiqing Li

    Abstract: Particle resuspension is a ubiquitous phenomenon with pivotal relevance in numerous natural and industrial contexts. In this study, we present findings on the resuspension of individual micro-sized particles, captured through high-speed camera experiments. Our observations reveal a universal behavior whereby a particle undergoes oscillatory motion due to turbulent excitation prior to its detachmen…

    Submitted 7 April, 2024; originally announced April 2024.

  47. arXiv:2404.04865  [pdf, other]

    cs.LG cs.CV stat.ML

    On the Learnability of Out-of-distribution Detection

    Authors: Zhen Fang, Yixuan Li, Feng Liu, Bo Han, Jie Lu

    Abstract: Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good general…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by JMLR on 7 April 2024. This is a journal extension of the previous NeurIPS 2022 Outstanding Paper "Is Out-of-distribution Detection Learnable?" [arXiv:2210.14707]

  48. arXiv:2404.03088  [pdf, other]

    cs.LG cs.AI cs.NI eess.SP

    Robust Federated Learning for Wireless Networks: A Demonstration with Channel Estimation

    Authors: Zexin Fang, Bin Han, Hans D. Schotten

    Abstract: Federated learning (FL) offers a privacy-preserving collaborative approach for training models in wireless networks, with channel estimation emerging as a promising application. Despite extensive studies on FL-empowered channel estimation, the security concerns associated with FL require meticulous attention. In a scenario where small base stations (SBSs) serve as local models trained on cached da…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE GLOBECOM 2024

  49. arXiv:2404.01664  [pdf, other]

    physics.soc-ph nlin.AO nlin.PS physics.bio-ph

    Nonreciprocal interactions in crowd dynamics: investigating the impact of moving threats on pedestrian speed preferences

    Authors: Shaocong Xie, Rui Ye, Xiaolian Li, Zhongyi Huang, Shuchao Cao, Wei Lv, Hong He, ** Zhang, Zhiming Fang, Jun Zhang, Weiguo Song

    Abstract: Nonreciprocal interaction crowd systems, such as human-human, human-vehicle, and human-robot systems, often have serious impacts on pedestrian safety and social order. A more comprehensive understanding of these systems is needed to optimize system stability and efficiency. Despite the importance of these interactions, empirical research in this area remains limited. Thus, in our study we explore…

    Submitted 2 April, 2024; originally announced April 2024.

  50. arXiv:2403.20078  [pdf, other]

    cs.CV cs.LG

    Negative Label Guided OOD Detection with Pretrained Vision-Language Models

    Authors: Xue Jiang, Feng Liu, Zhen Fang, Hong Chen, Tongliang Liu, Feng Zheng, Bo Han

    Abstract: Out-of-distribution (OOD) detection aims at identifying samples from unknown classes, playing a crucial role in trustworthy models against errors on unexpected inputs. Extensive research has been dedicated to exploring OOD detection in the vision modality. Vision-language models (VLMs) can leverage both textual and visual information for various multi-modal applications, whereas few OOD detection…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 Spotlight