Skip to main content

Showing 51–100 of 758 results for author: Fang, Z

.
  1. arXiv:2403.17828  [pdf, other

    astro-ph.HE

    The Relativistic Spin Precession in the Compact Double Neutron Star System PSR~J1946+2052

    Authors: Lingqi Meng, Weiwei Zhu, Michael Kramer, Xueli Miao, Gregory Desvignes, Li**g Shao, Huanchen Hu, Paulo C. C. Freire, Yongkun Zhang, Mengyao Xue, Ziyao Fang, David J. Champion, Mao Yuan, Chenchen Miao, Jiarui Niu, Qiuyang Fu, Jumei Yao, Yanjun Guo, Chengmin Zhang

    Abstract: We observe systematic profile changes in the visible pulsar of the compact double neutron star system PSR~J1946+2052 using observations with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). The interpulse of PSR~J1946+2052 changed from single-peak to double-peak shape from 2018 to 2021. We attribute this evolution as the result of the relativistic spin precession of the pulsar. Wi… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures, accepted for publication in ApJ

  2. arXiv:2403.17525  [pdf, other

    cs.CV cs.AI

    Equip** Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch Representation

    Authors: Sicong Zang, Zhijun Fang

    Abstract: The drawing order of a sketch records how it is created stroke-by-stroke by a human being. For graphic sketch representation learning, recent studies have injected sketch drawing orders into graph edge construction by linking each patch to another in accordance to a temporal-based nearest neighboring strategy. However, such constructed graph edges may be unreliable, since a sketch could have varia… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  3. arXiv:2403.16851  [pdf

    cs.DL cs.AI cs.CL cs.LG

    Can ChatGPT predict article retraction based on Twitter mentions?

    Authors: Er-Te Zheng, Hui-Zhen Fu, Zhichao Fang

    Abstract: Detecting problematic research articles timely is a vital task. This study explores whether Twitter mentions of retracted articles can signal potential problems with the articles prior to retraction, thereby playing a role in predicting future retraction of problematic articles. A dataset comprising 3,505 retracted articles and their associated Twitter mentions is analyzed, alongside 3,505 non-ret… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2403.16394  [pdf, other

    cs.CL cs.AI

    Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

    Authors: Yingshan Chang, Yasi Zhang, Zhiyuan Fang, Yingnian Wu, Yonatan Bisk, Feng Gao

    Abstract: The literature on text-to-image generation is plagued by issues of faithfully composing entities with relations. But there lacks a formal understanding of how entity-relation compositions can be effectively learned. Moreover, the underlying phenomenon space that meaningfully reflects the problem structure is not well-defined, leading to an arms race for larger quantities of data in the hope that g… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  5. arXiv:2403.15393  [pdf, other

    cs.CL cs.LG cs.SI

    Detection of Opioid Users from Reddit Posts via an Attention-based Bidirectional Recurrent Neural Network

    Authors: Yuchen Wang, Zhengyu Fang, Wei Du, Shuai Xu, Rong Xu, **g Li

    Abstract: The opioid epidemic, referring to the growing hospitalizations and deaths because of overdose of opioid usage and addiction, has become a severe health problem in the United States. Many strategies have been developed by the federal and local governments and health communities to combat this crisis. Among them, improving our understanding of the epidemic through better health surveillance is one o… ▽ More

    Submitted 9 February, 2024; originally announced March 2024.

  6. arXiv:2403.11058  [pdf, ps, other

    math.AP

    Formal derivations from Boltzmann equation to three stationary equations

    Authors: Zhendong Fang

    Abstract: In this paper, we concentrate on the connection between Boltzmann equation and stationary equations. To our knowledge, the stationary Navier-Stokes-Fourier system, the stationary Euler equations and the stationary Stokes equations are formally derived by moment estimate in the first time and extend the results of Bardos, Golse, and Levermore in J. Statist. Phys. 63(1-2), 323-344, 1991.

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Hydrodynamic limit, Moment estimate, The stationary Navier-Stokes-Fourier system, The stationary Euler equations, The stationary Stokes equations

  7. arXiv:2403.10082  [pdf, other

    cs.CV

    CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner

    Authors: Tingbing Yan, Wenzheng Zeng, Yang Xiao, Xingyu Tong, Bo Tan, Zhiwen Fang, Zhiguo Cao, Joey Tianyi Zhou

    Abstract: Most existing one-shot skeleton-based action recognition focuses on raw low-level information (e.g., joint location), and may suffer from local information loss and low generalization ability. To alleviate these, we propose to leverage text description generated from large language models (LLM) that contain high-level human knowledge, to guide feature learning, in a global-local-global way. Partic… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  8. arXiv:2403.10006  [pdf, other

    cs.CY cs.HC cs.LG cs.SI

    Graph Enhanced Reinforcement Learning for Effective Group Formation in Collaborative Problem Solving

    Authors: Zheng Fang, Fucai Ke, Jae Young Han, Zhijie Feng, Toby Cai

    Abstract: This study addresses the challenge of forming effective groups in collaborative problem-solving environments. Recognizing the complexity of human interactions and the necessity for efficient collaboration, we propose a novel approach leveraging graph theory and reinforcement learning. Our methodology involves constructing a graph from a dataset where nodes represent participants, and edges signify… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  9. arXiv:2403.09274  [pdf, other

    cs.CV

    EventRPG: Event Data Augmentation with Relevance Propagation Guidance

    Authors: Mingyuan Sun, Donghao Zhang, Zongyuan Ge, Jiaxu Wang, Jia Li, Zheng Fang, Ren**g Xu

    Abstract: Event camera, a novel bio-inspired vision sensor, has drawn a lot of attention for its low latency, low power consumption, and high dynamic range. Currently, overfitting remains a critical problem in event-based classification tasks for Spiking Neural Network (SNN) due to its relatively weak spatial representation capability. Data augmentation is a simple but efficient method to alleviate overfitt… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted by ICLR 2024

  10. arXiv:2403.08840  [pdf, other

    cs.CV cs.AI

    NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation

    Authors: PengFei Zheng, Yonggang Zhang, Zhen Fang, Tongliang Liu, Defu Lian, Bo Han

    Abstract: Image interpolation based on diffusion models is promising in creating fresh and interesting images. Advanced interpolation methods mainly focus on spherical linear interpolation, where images are encoded into the noise space and then interpolated for denoising to images. However, existing methods face challenges in effectively interpolating natural images (not generated by diffusion models), ther… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  11. arXiv:2403.05388  [pdf, other

    cs.CV

    Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation

    Authors: Yu Han, Ziwei Long, Yanting Zhang, ** Wu, Zhijun Fang, Rui Fan

    Abstract: Correspondence matching plays a crucial role in numerous robotics applications. In comparison to conventional hand-crafted methods and recent data-driven approaches, there is significant interest in plug-and-play algorithms that make full use of pre-trained backbone networks for multi-scale feature extraction and leverage hierarchical refinement strategies to generate matched correspondences. The… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2403.05160  [pdf, other

    cs.CV

    MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models

    Authors: Zijie Fang, Yifeng Wang, Zhi Wang, Jian Zhang, Xiangyang Ji, Yongbing Zhang

    Abstract: Recently, pathological diagnosis, the gold standard for cancer diagnosis, has achieved superior performance by combining the Transformer with the multiple instance learning (MIL) framework using whole slide images (WSIs). However, the giga-pixel nature of WSIs poses a great challenge for the quadratic-complexity self-attention mechanism in Transformer to be applied in MIL. Existing studies usually… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 11 pages, 2 figures

  13. arXiv:2403.04344  [pdf, other

    cs.GT cs.LG

    RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

    Authors: Boning Li, Zhixuan Fang, Longbo Huang

    Abstract: Effective action abstraction is crucial in tackling challenges associated with large action spaces in Imperfect Information Extensive-Form Games (IIEFGs). However, due to the vast state space and computational complexity in IIEFGs, existing methods often rely on fixed abstractions, resulting in sub-optimal performance. In response, we introduce RL-CFR, a novel reinforcement learning (RL) approach… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  14. arXiv:2402.18601  [pdf, other

    physics.plasm-ph astro-ph.HE gr-qc hep-ph nucl-th

    Analytic solutions for the linearized first-order magnetohydrodynamics and implications for causality and stability

    Authors: Zhe Fang, Koichi Hattori, ** Hu

    Abstract: We solve the first-order relativistic magnetohydrodynamics (MHD) within the linear-mode analysis performed near an equilibrium configuration in the fluid rest frame. We find two complete sets of analytic solutions for the four and two coupled modes with seven dissipative transport coefficients. The former set has been missing in the literature for a long time. Our method provides a simple and gene… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 31 pages, 2 figures

  15. arXiv:2402.18060  [pdf, other

    cs.CL

    Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions

    Authors: Hanjie Chen, Zhouxiang Fang, Yash Singla, Mark Dredze

    Abstract: LLMs have demonstrated impressive performance in answering medical questions, such as achieving passing scores on medical licensing examinations. However, medical board exam or general clinical questions do not capture the complexity of realistic clinical cases. Moreover, the lack of reference explanations means we cannot easily evaluate the reasoning of model decisions, a crucial component of sup… ▽ More

    Submitted 25 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  16. arXiv:2402.17888  [pdf, other

    cs.LG cs.AI

    ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection

    Authors: Bo Peng, Yadan Luo, Yonggang Zhang, Yixuan Li, Zhen Fang

    Abstract: Post-hoc out-of-distribution (OOD) detection has garnered intensive attention in reliable machine learning. Many efforts have been dedicated to deriving score functions based on logits, distances, or rigorous data distribution assumptions to identify low-scoring OOD samples. Nevertheless, these estimate scores may fail to accurately reflect the true data density or impose impractical constraints.… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: ICLR24 poster

  17. arXiv:2402.17356  [pdf, ps, other

    cond-mat.str-el

    Modulation of chiral anomaly and bilinear magnetoconductivity in Weyl semimetals by impurity-resonance states

    Authors: Mei-Wei Hu, Zhuo-Yan Fang, Hou-Jian Duan, Mou Yang, Ming-Xun Deng, Rui-Qiang Wang

    Abstract: The phenomenon of nonlinear transport has attracted tremendous interest within the condensed matter community. We present a theoretical framework for nonlinear transport based on the nonequilibrium retarded Green's function, and examine the impact of disorder on nonlinear magnetotransport in Weyl semimetals (WSMs). It is demonstrated that bilinear magnetoconductivity can be induced in disordered W… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 5 figures

  18. arXiv:2402.16249  [pdf, other

    cs.CV

    SeqTrack3D: Exploring Sequence Information for Robust 3D Point Cloud Tracking

    Authors: Yu Lin, Zhiheng Li, Yubo Cui, Zheng Fang

    Abstract: 3D single object tracking (SOT) is an important and challenging task for the autonomous driving and mobile robotics. Most existing methods perform tracking between two consecutive frames while ignoring the motion patterns of the target over a series of frames, which would cause performance degradation in the scenes with sparse points. To break through this limitation, we introduce Sequence-to-Sequ… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted by ICRA2024

  19. Inductive Graph Alignment Prompt: Bridging the Gap between Graph Pre-training and Inductive Fine-tuning From Spectral Perspective

    Authors: Yuchen Yan, Peiyan Zhang, Zheng Fang, Qingqing Long

    Abstract: The "Graph pre-training and fine-tuning" paradigm has significantly improved Graph Neural Networks(GNNs) by capturing general knowledge without manual annotations for downstream tasks. However, due to the immense gap of data and tasks between the pre-training and fine-tuning stages, the model performance is still limited. Inspired by prompt fine-tuning in Natural Language Processing(NLP), many end… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    ACM Class: E.2

  20. arXiv:2402.12750  [pdf, other

    cs.CV cs.AI cs.CL

    Model Composition for Multimodal Large Language Models

    Authors: Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu

    Abstract: Recent developments in Multimodal Large Language Models (MLLMs) have shown rapid progress, moving towards the goal of creating versatile MLLMs that understand inputs from various modalities. However, existing methods typically rely on joint training with paired multimodal instruction data, which is resource-intensive and challenging to extend to new modalities. In this paper, we propose a new para… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Code will be available at https://github.com/THUNLP-MT/ModelCompose

  21. arXiv:2402.12164  [pdf, other

    cs.GT

    Integrating Dynamic Weighted Approach with Fictitious Play and Pure Counterfactual Regret Minimization for Equilibrium Finding

    Authors: Qi Ju, Falin Hei, Zhemei Fang, Yunfeng Luo

    Abstract: Develo** efficient algorithms to converge to Nash Equilibrium is a key focus in game theory. The use of dynamic weighting has been especially advantageous in normal-form games, enhancing the rate of convergence. For instance, the Greedy Regret Minimization (RM) algorithm has markedly outperformed earlier techniques. Nonetheless, its dependency on mixed strategies throughout the iterative process… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  22. arXiv:2402.10476  [pdf, other

    cs.CV

    Spike-EVPR: Deep Spiking Residual Network with Cross-Representation Aggregation for Event-Based Visual Place Recognition

    Authors: Chenming Hu, Zheng Fang, Kuanxu Hou, Delei Kong, Junjie Jiang, Hao Zhuang, Mingyuan Sun, Xinjie Huang

    Abstract: Event cameras have been successfully applied to visual place recognition (VPR) tasks by using deep artificial neural networks (ANNs) in recent years. However, previously proposed deep ANN architectures are often unable to harness the abundant temporal information presented in event streams. In contrast, deep spiking networks exhibit more intricate spatiotemporal dynamics and are inherently well-su… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 14 pages, 10 figures

  23. arXiv:2402.09810  [pdf, other

    eess.SP

    3D Cooperative Localization in UAV Systems: CRLB Analysis and Security Solutions

    Authors: Zexin Fang, Bin Han, Hans D. Schotten

    Abstract: This paper presents a robust and secure framework for achieving accurate and reliable cooperative localization in multiple unmanned aerial vehicle (UAV) systems. The Cramer-Rao low bound (CRLB) for the three-dimensional (3D) cooperative localization network is derived, with particular attention given to the non-uniform spatial distribution of anchor nodes. Challenges of mobility and security threa… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Submitted to IEEE Transactions on Wireless Communications

  24. arXiv:2402.08803  [pdf

    physics.optics physics.app-ph

    Low-loss multilevel operation using lossy PCM-integrated silicon photonics

    Authors: Rui Chen, Virat Tara, Jayita Dutta, Zhuoran Fang, Jiajiu Zheng, Arka Majumdar

    Abstract: Chalcogenide phase-change materials (PCMs) offer new paradigms for programmable photonic integrated circuits (PICs) thanks to their zero static energy and significant refractive index contrast. However, prototypical PCMs, such as GeSbTe (GST), are lossy in their crystalline phase, albeit transparent in the amorphous state. Moreover, electrically switching PCMs to intermediate states is a stochasti… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  25. arXiv:2402.05660  [pdf, other

    cs.LG cs.AI

    Rethinking Propagation for Unsupervised Graph Domain Adaptation

    Authors: Meihan Liu, Zeyu Fang, Zhen Zhang, Ming Gu, Sheng Zhou, Xin Wang, Jiajun Bu

    Abstract: Unsupervised Graph Domain Adaptation (UGDA) aims to transfer knowledge from a labelled source graph to an unlabelled target graph in order to address the distribution shifts between graph domains. Previous works have primarily focused on aligning data from the source and target graph in the representation space learned by graph neural networks (GNNs). However, the inherent generalization capabilit… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI-24

  26. arXiv:2402.03899  [pdf, other

    physics.ins-det

    Characterisation of resistive MPGDs with 2D readout

    Authors: L. Scharenberg, F. Brunbauer, H. Danielson, Z. Fang, K. J. Flöthner, F. Garcia, D. Janssens, M. Lisowska, J. Liu, Y. Lyu, B. Mehl, H. Muller, R. de Oliveira, E. Oliveri, G. Orlandini, D. Pfeiffer, O. Pizzirusso, L. Ropelewski, J. Samarati, M. Shao, A. Teixeira, M. Van Stenis, R. Veenhof, Z. Zhang, Y. Zhou

    Abstract: Micro-Pattern Gaseous Detectors (MPGDs) with resistive anode planes provide intrinsic discharge robustness while maintaining good spatial and time resolutions. Typically read out with 1D strips or pad structures, here the characterisation results of resistive anode plane MPGDs with 2D strip readout are presented. A uRWELL prototype is investigated in view of its use as a reference tracking detecto… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  27. arXiv:2402.03502  [pdf, other

    cs.LG stat.ML

    How Does Unlabeled Data Provably Help Out-of-Distribution Detection?

    Authors: Xuefeng Du, Zhen Fang, Ilias Diakonikolas, Yixuan Li

    Abstract: Using unlabeled data to regularize the machine learning models has demonstrated promise for improving safety and reliability in detecting out-of-distribution (OOD) data. Harnessing the power of unlabeled in-the-wild data is non-trivial due to the heterogeneity of both in-distribution (ID) and OOD data. This lack of a clean set of OOD samples poses significant challenges in learning an optimal OOD… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  28. arXiv:2402.01231  [pdf, other

    cs.LG

    Unveiling Delay Effects in Traffic Forecasting: A Perspective from Spatial-Temporal Delay Differential Equations

    Authors: Qingqing Long, Zheng Fang, Chen Fang, Chong Chen, Pengfei Wang, Yuanchun Zhou

    Abstract: Traffic flow forecasting is a fundamental research issue for transportation planning and management, which serves as a canonical and typical example of spatial-temporal predictions. In recent years, Graph Neural Networks (GNNs) and Recurrent Neural Networks (RNNs) have achieved great success in capturing spatial-temporal correlations for traffic flow forecasting. Yet, two non-ignorable issues have… ▽ More

    Submitted 25 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 11 pages, 7 figures

  29. arXiv:2402.00541  [pdf, other

    cs.CV

    Masked Conditional Diffusion Model for Enhancing Deepfake Detection

    Authors: Tiewen Chen, Shanmin Yang, Shu Hu, Zhenghan Fang, Ying Fu, Xi Wu, Xin Wang

    Abstract: Recent studies on deepfake detection have achieved promising results when training and testing faces are from the same dataset. However, their results severely degrade when confronted with forged samples that the model has not yet seen during training. In this paper, deepfake data to help detect deepfakes. this paper present we put a new insight into diffusion model-based data augmentation, and pr… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  30. arXiv:2402.00321  [pdf, other

    cs.CV

    SmartCooper: Vehicular Collaborative Perception with Adaptive Fusion and Judger Mechanism

    Authors: Yuang Zhang, Haonan An, Zhengru Fang, Guowen Xu, Yuan Zhou, Xianhao Chen, Yuguang Fang

    Abstract: In recent years, autonomous driving has garnered significant attention due to its potential for improving road safety through collaborative perception among connected and autonomous vehicles (CAVs). However, time-varying channel variations in vehicular transmission environments demand dynamic allocation of communication resources. Moreover, in the context of collaborative perception, it is importa… ▽ More

    Submitted 4 March, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

  31. arXiv:2401.16565  [pdf, other

    cond-mat.mtrl-sci

    Towards Accurate Prediction of Configurational Disorder Properties in Materials using Graph Neural Networks

    Authors: Zhenyao Fang, Qimin Yan

    Abstract: The prediction of configurational disorder properties, such as configurational entropy and order-disorder phase transition temperature, of compound materials relies on efficient and accurate evaluations of configurational energies. Previous cluster expansion methods are not applicable to configurationally-complex material systems, including those with atomic distortions and long-range orders. In t… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  32. arXiv:2401.15151  [pdf, other

    cond-mat.mtrl-sci

    First-principles methodology for studying magnetotransport in narrow-gap semiconductors: an application to Zirconium Pentatelluride ZrTe5

    Authors: Hanqi Pi, Shengnan Zhang, Yang Xu, Zhong Fang, Hongming Weng, Quansheng Wu

    Abstract: The origin of anomalous resistivity peak and accompanied sign reversal of Hall resistivity of ZrTe$_5$ has been under debate for a long time. Although various theoretical models have been proposed to account for these intriguing transport properties, a systematic study from first principles view is still lacking. In this work, we present a first principles calculation combined with Boltzmann trans… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures

  33. arXiv:2401.15150  [pdf, other

    cond-mat.mtrl-sci

    New perspectives of Hall effects from first-principles calculations

    Authors: ShengNan Zhang, Hanqi Pi, Zhong Fang, Hongming Weng, QuanSheng Wu

    Abstract: The Hall effect has been a fascinating topic ever since its discovery, resulting in exploration of entire family of this intriguing phenomena. As the field of topology develops and novel materials emerge endlessly over the past few decades, researchers have been passionately debating the origins of various Hall effects. Differentiating between the ordinary Hall effect and extraordinary transport p… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 12 pages, 4 figures

  34. arXiv:2401.15146  [pdf, other

    cond-mat.mtrl-sci

    First-principles Methodology for studying magnetotransport in magnetic materials

    Authors: Zhihao Liu, Shengnan Zhang, Zhong Fang, Hongming Weng, Quansheng Wu

    Abstract: Unusual magnetotransport behaviors such as temperature dependent negative magnetoresistance(MR) and bowtie-shaped MR have puzzled us for a long time. Although several mechanisms have been proposed to explain them, the absence of comprehensive quantitative calculations has made these explanations less convincing. In our work, we introduce a methodology to study the magnetotransport behaviors in mag… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 6 pages, 4 figures

  35. arXiv:2401.11531  [pdf, other

    cs.CR cs.LG

    Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training

    Authors: Rongwu Xu, Zhixuan Fang

    Abstract: Cloud deep learning platforms provide cost-effective deep neural network (DNN) training for customers who lack computation resources. However, cloud systems are often untrustworthy and vulnerable to attackers, leading to growing concerns about model privacy. Recently, researchers have sought to protect data privacy in deep learning by leveraging CPU trusted execution environments (TEEs), which min… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  36. arXiv:2401.11378  [pdf, other

    cs.RO cs.LG

    Multi-Agent Generative Adversarial Interactive Self-Imitation Learning for AUV Formation Control and Obstacle Avoidance

    Authors: Zheng Fang, Tianhao Chen, Dong Jiang, Zheng Zhang, Guangliang Li

    Abstract: Multiple autonomous underwater vehicles (multi-AUV) can cooperatively accomplish tasks that a single AUV cannot complete. Recently, multi-agent reinforcement learning has been introduced to control of multi-AUV. However, designing efficient reward functions for various tasks of multi-AUV control is difficult or even impractical. Multi-agent generative adversarial imitation learning (MAGAIL) allows… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 8pages,10figures,Published to RA-L

  37. arXiv:2401.11093  [pdf, other

    stat.AP

    Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding

    Authors: Haisheng Fu, Feng Liang, Jie Liang, Zhenman Fang, Guohe Zhang, **gning Han

    Abstract: Recent advancements in deep learning-based image compression are notable. However, prevalent schemes that employ a serial context-adaptive entropy model to enhance rate-distortion (R-D) performance are markedly slow. Furthermore, the complexities of the encoding and decoding networks are substantially high, rendering them unsuitable for some practical applications. In this paper, we propose two te… ▽ More

    Submitted 21 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted by DCC2024

  38. A multi-dimensional analysis of usage counts, Mendeley readership, and citations for journal and conference papers

    Authors: Wencan Tian, Zhichao Fang, Xianwen Wang, Rodrigo Costas

    Abstract: This study analyzed 16,799 journal papers and 98,773 conference papers published by IEEE Xplore in 2016 to investigate the relationships among usage counts, Mendeley readership, and citations through descriptive, regression, and mediation analyses. Differences in the relationship among these metrics between journal and conference papers are also studied. Results showed that there is no significant… ▽ More

    Submitted 26 January, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 23 pages, 7 figures

  39. arXiv:2401.09740  [pdf, other

    cs.CR

    Hijacking Attacks against Neural Networks by Analyzing Training Data

    Authors: Yunjie Ge, Qian Wang, Huayang Huang, Qi Li, Cong Wang, Chao Shen, Lingchen Zhao, Peipei Jiang, Zheng Fang, Shenyi Zhang

    Abstract: Backdoors and adversarial examples are the two primary threats currently faced by deep neural networks (DNNs). Both attacks attempt to hijack the model behaviors with unintended outputs by introducing (small) perturbations to the inputs. Backdoor attacks, despite the high success rates, often require a strong assumption, which is not always easy to achieve in reality. Adversarial example attacks,… ▽ More

    Submitted 19 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Full version with major polishing, compared to the Usenix Security 2024 edition

  40. arXiv:2401.08695  [pdf, other

    cs.AI cs.CV cs.HC

    Enabling Collaborative Clinical Diagnosis of Infectious Keratitis by Integrating Expert Knowledge and Interpretable Data-driven Intelligence

    Authors: Zhengqing Fang, Shuowen Zhou, Zhouhang Yuan, Yuxuan Si, Mengze Li, **xu Li, Yesheng Xu, Wenjia Xie, Kun Kuang, Yingming Li, Fei Wu, Yu-Feng Yao

    Abstract: Although data-driven artificial intelligence (AI) in medical image diagnosis has shown impressive performance in silico, the lack of interpretability makes it difficult to incorporate the "black box" into clinicians' workflows. To make the diagnostic patterns learned from data understandable by clinicians, we develop an interpretable model, knowledge-guided diagnosis model (KGDM), that provides a… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 33 pages

  41. arXiv:2401.08210  [pdf, other

    cs.CV

    ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification

    Authors: Zhongbin Fang, Xia Li, Xiangtai Li, Shen Zhao, Mengyuan Liu

    Abstract: Recently, 3D point cloud classification has made significant progress with the help of many datasets. However, these datasets do not reflect the incomplete nature of real-world point clouds caused by occlusion, which limits the practical application of current methods. To bridge this gap, we propose ModelNet-O, a large-scale synthetic dataset of 123,041 samples that emulate real-world point clouds… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Project page: https://github.com/fanglaosi/PointMLS

  42. arXiv:2401.07240  [pdf, other

    cs.CV

    DCDet: Dynamic Cross-based 3D Object Detector

    Authors: Shuai Liu, Boyang Li, Zhiyu Fang, Kai Huang

    Abstract: Recently, significant progress has been made in the research of 3D object detection. However, most prior studies have focused on the utilization of center-based or anchor-based label assignment schemes. Alternative label assignment strategies remain unexplored in 3D object detection. We find that the center-based label assignment often fails to generate sufficient positive samples for training, wh… ▽ More

    Submitted 22 May, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  43. arXiv:2401.06401   

    cs.SE cs.AI cs.CL

    DevEval: Evaluating Code Generation in Practical Software Projects

    Authors: Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Zhi **, Hao Zhu, Huanyu Liu, Kaibo Liu, Lecheng Wang, Zheng Fang, Lanshen Wang, Jiazheng Ding, Xuanming Zhang, Yihong Dong, Yuqi Zhu, Bin Gu, Mengfei Yang

    Abstract: How to evaluate Large Language Models (LLMs) in code generation is an open question. Many benchmarks have been proposed but are inconsistent with practical software projects, e.g., unreal program distributions, insufficient dependencies, and small-scale project contexts. Thus, the capabilities of LLMs in practical projects are still unclear. In this paper, we propose a new benchmark named DevEval,… ▽ More

    Submitted 5 March, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: We are re-checking this benchmark and repeating related experiments. New versions of DevEval will be released later

  44. arXiv:2401.05970  [pdf

    physics.optics physics.app-ph

    On-chip wavelength division multiplexing by angled multimode interferometer fabricated on erbium-doped thin film lithium niobate on insulator

    Authors: **li Han, Rui Bao, Rongbo Wu, Zhaoxiang Liu, Zhe Wang, Chao Sun, Zhihao Zhang, Mengqi Li, Zhiwei Fang, Min Wang, Haisu Zhang, Ya Cheng

    Abstract: Photonic integrated circuits based on erbium doped thin film lithium niobate on insulator has attracted broad interests with insofar various waveguide amplifiers and microlasers demonstrated. Wideband operation facilitated by the broadband absorption and emission of erbium ions necessitates the functional integration of wavelength filter and multiplexer on the same chip. Here a low-loss wavelength… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 11 pages, 5 figures

  45. arXiv:2401.01544  [pdf, other

    cs.CV eess.SP

    Collaborative Perception for Connected and Autonomous Driving: Challenges, Possible Solutions and Opportunities

    Authors: Senkang Hu, Zhengru Fang, Yiqin Deng, Xianhao Chen, Yuguang Fang

    Abstract: Autonomous driving has attracted significant attention from both academia and industries, which is expected to offer a safer and more efficient driving system. However, current autonomous driving systems are mostly based on a single vehicle, which has significant limitations which still poses threats to driving safety. Collaborative perception with connected and autonomous vehicles (CAVs) shows a… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  46. arXiv:2401.01222  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Excitonic Instability in Ta2Pd3Te5 monolayer

    Authors: **gyu Yao, Haohao Sheng, Ruihan Zhang, Rongtian Pang, **-Jian Zhou, Quansheng Wu, Hongming Weng, Xi Dai, Zhong Fang, Zhijun Wang

    Abstract: By systematic theoretical calculations, we have revealed an excitonic insulator (EI) in a van der Waals (vdW) layered compound Ta2Pd3Te5. The interlayer binding energy in the vdW layered compound is 19.6 meV/$\unicode{x212B}$$^2$. The computed phonon spectrum suggests that the monolayer is dynamically stable without lattice distortion. The monolayer can be obtained by exfoliation or molecular-beam… ▽ More

    Submitted 8 May, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: 6 pages, 4 figures

  47. arXiv:2312.15903  [pdf, other

    cs.IR

    An Incremental Update Framework for Online Recommenders with Data-Driven Prior

    Authors: Chen Yang, ** Chen, Qian Yu, Xiangdong Wu, Kui Ma, Zihao Zhao, Zhiwei Fang, Wenlong Chen, Chaosheng Fan, Jie He, Chang** Peng, Zhangang Lin, **g** Shao

    Abstract: Online recommenders have attained growing interest and created great revenue for businesses. Given numerous users and items, incremental update becomes a mainstream paradigm for learning large-scale models in industrial scenarios, where only newly arrived data within a sliding window is fed into the model, meeting the strict requirements of quick response. However, this strategy would be prone to… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  48. arXiv:2312.15130  [pdf, other

    cs.CV

    PACE: A Large-Scale Dataset with Pose Annotations in Cluttered Environments

    Authors: Yang You, Kai Xiong, Zhening Yang, Zhengxiang Huang, Junwei Zhou, Ruoxi Shi, Zhou Fang, Adam W. Harley, Leonidas Guibas, Cewu Lu

    Abstract: Pose estimation is a crucial task in computer vision and robotics, enabling the tracking and manipulation of objects in images or videos. While several datasets exist for pose estimation, there is a lack of large-scale datasets specifically focusing on cluttered scenes with occlusions. We introduce PACE (Pose Annotations in Cluttered Environments), a large-scale benchmark designed to advance the d… ▽ More

    Submitted 31 March, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  49. arXiv:2312.11396  [pdf, other

    cs.CV cs.AI

    MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance

    Authors: Qi Mao, Lan Chen, Yuchao Gu, Zhen Fang, Mike Zheng Shou

    Abstract: Recent diffusion-based image editing approaches have exhibited impressive editing capabilities in images with simple compositions. However, localized editing in complex scenarios has not been well-studied in the literature, despite its growing real-world demands. Existing mask-based inpainting methods fall short of retaining the underlying structure within the edit region. Meanwhile, mask-free att… ▽ More

    Submitted 21 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: for project page, see https://mag-edit.github.io/

  50. arXiv:2312.10987  [pdf, other

    cs.CL

    Cross-Subject Data Splitting for Brain-to-Text Decoding

    Authors: Congchi Yin, Qian Yu, Zhiwei Fang, Jie He, Chang** Peng, Zhangang Lin, **g** Shao, Piji Li

    Abstract: Recent major milestones have successfully decoded non-invasive brain signals (e.g. functional Magnetic Resonance Imaging (fMRI) and electroencephalogram (EEG)) into natural language. Despite the progress in model design, how to split the datasets for training, validating, and testing still remains a matter of debate. Most of the prior researches applied subject-specific data splitting, where the d… ▽ More

    Submitted 14 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.