Skip to main content

Showing 1–50 of 280 results for author: Cui, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17215  [pdf, other

    eess.SY cs.AI

    Enabling Large Language Models to Perform Power System Simulations with Previously Unseen Tools: A Case of Daline

    Authors: Mengshuo Jia, Zeyu Cui, Gabriela Hug

    Abstract: The integration of experiment technologies with large language models (LLMs) is transforming scientific research, offering AI capabilities beyond specialized problem-solving to becoming research assistants for human scientists. In power systems, simulations are essential for research. However, LLMs face significant challenges in power system simulations due to limited pre-existing knowledge and th… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: All the supplementary files mentioned in the manuscript will be open-source upon acceptance

  2. arXiv:2406.12577  [pdf, other

    cs.CV

    Cephalometric Landmark Detection across Ages with Prototypical Network

    Authors: Han Wu, Chong Wang, Lanzhuju Mei, Tong Yang, Min Zhu, Dingggang Shen, Zhiming Cui

    Abstract: Automated cephalometric landmark detection is crucial in real-world orthodontic diagnosis. Current studies mainly focus on only adult subjects, neglecting the clinically crucial scenario presented by adolescents whose landmarks often exhibit significantly different appearances compared to adults. Hence, an open question arises about how to develop a unified and effective detection algorithm across… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024

  3. arXiv:2406.03882  [pdf, other

    cs.CL cs.SD eess.AS

    Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models

    Authors: Ziyun Cui, Chang Lei, Wen Wu, Yinan Duan, Diyang Qu, Ji Wu, Runsen Chen, Chao Zhang

    Abstract: The early detection of suicide risk is important since it enables the intervention to prevent potential suicide attempts. This paper studies the automatic detection of suicide risk based on spontaneous speech from adolescents, and collects a Mandarin dataset with 15 hours of suicide speech from more than a thousand adolescents aged from ten to eighteen for our experiments. To leverage the diverse… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  4. arXiv:2406.03879  [pdf, other

    cs.LG cs.CV

    Decay Pruning Method: Smooth Pruning With a Self-Rectifying Procedure

    Authors: Minghao Yang, Linlin Gao, Pengyuan Li, Wenbo Li, Yihong Dong, Zhiying Cui

    Abstract: Current structured pruning methods often result in considerable accuracy drops due to abrupt network changes and loss of information from pruned structures. To address these issues, we introduce the Decay Pruning Method (DPM), a novel smooth pruning approach with a self-rectifying mechanism. DPM consists of two key components: (i) Smooth Pruning: It converts conventional single-step pruning into m… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2406.03199  [pdf, other

    cs.CL cs.AI cs.LG

    Bayesian WeakS-to-Strong from Text Classification to Generation

    Authors: Ziyun Cui, Ziyang Zhang, Wen Wu, Guangzhi Sun, Chao Zhang

    Abstract: Advances in large language models raise the question of how alignment techniques will adapt as models become increasingly complex and humans will only be able to supervise them weakly. Weak-to-Strong mimics such a scenario where weak model supervision attempts to harness the full capabilities of a much stronger model. This work extends Weak-to-Strong to WeakS-to-Strong by exploring an ensemble of… ▽ More

    Submitted 24 May, 2024; originally announced June 2024.

  6. arXiv:2405.20337  [pdf, other

    cs.CV cs.AI

    OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving

    Authors: Lening Wang, Wenzhao Zheng, Yilong Ren, Han Jiang, Zhiyong Cui, Haiyang Yu, Jiwen Lu

    Abstract: Understanding the evolution of 3D scenes is important for effective autonomous driving. While conventional methods mode scene development with the motion of individual instances, world models emerge as a generative framework to describe the general scene dynamics. However, most existing methods adopt an autoregressive framework to perform next-token prediction, which suffer from inefficiency in mo… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Code is available at: https://github.com/wzzheng/OccSora

  7. GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis

    Authors: Boming Zhao, Yuan Li, Ziyu Sun, Lin Zeng, Yujun Shen, Rui Ma, Yinda Zhang, Hujun Bao, Zhaopeng Cui

    Abstract: Forecasting future scenarios in dynamic environments is essential for intelligent decision-making and navigation, a challenge yet to be fully realized in computer vision and robotics. Traditional approaches like video prediction and novel-view synthesis either lack the ability to forecast from arbitrary viewpoints or to predict temporal dynamics. In this paper, we introduce GaussianPrediction, a n… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted to SIGGRAPH 2024 Conference. Project Page: https://zju3dv.github.io/gaussian-prediction/

  8. arXiv:2405.15301  [pdf, other

    cs.LG

    Rankability-enhanced Revenue Uplift Modeling Framework for Online Marketing

    Authors: Bowei He, Yunpeng Weng, Xing Tang, Ziqiang Cui, Zexu Sun, Liang Chen, Xiuqiang He, Chen Ma

    Abstract: Uplift modeling has been widely employed in online marketing by predicting the response difference between the treatment and control groups, so as to identify the sensitive individuals toward interventions like coupons or discounts. Compared with traditional \textit{conversion uplift modeling}, \textit{revenue uplift modeling} exhibits higher potential due to its direct connection with the corpora… ▽ More

    Submitted 12 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024

  9. arXiv:2405.14371  [pdf, other

    cs.DC

    EdgeShard: Efficient LLM Inference via Collaborative Edge Computing

    Authors: Ming** Zhang, Jiannong Cao, Xiaoming Shen, Zeyang Cui

    Abstract: Large language models (LLMs) have shown great potential in natural language processing and content generation. However, current LLMs heavily rely on cloud computing, leading to prolonged latency, high bandwidth cost, and privacy concerns. Edge computing is promising to address such concerns by deploying LLMs on edge devices, closer to data sources. Some works try to leverage model quantization to… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Under review

  10. arXiv:2405.11257  [pdf, other

    cs.RO

    PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking

    Authors: Yifan Yang, Zhihao Cui, Qianyi Zhang, **gtai Liu

    Abstract: 6D object pose estimation holds essential roles in various fields, particularly in the gras** of industrial workpieces. Given challenges like rust, high reflectivity, and absent textures, this paper introduces a point cloud based pose estimation framework (PS6D). PS6D centers on slender and multi-symmetric objects. It extracts multi-scale features through an attention-guided feature extraction m… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  11. arXiv:2405.10705  [pdf, other

    eess.IV cs.CV

    3D Vessel Reconstruction from Sparse-View Dynamic DSA Images via Vessel Probability Guided Attenuation Learning

    Authors: Zhentao Liu, Huangxuan Zhao, Wenhui Qin, Zhenghong Zhou, Xinggang Wang, Wen** Wang, Xiaochun Lai, Chuansheng Zheng, Dinggang Shen, Zhiming Cui

    Abstract: Digital Subtraction Angiography (DSA) is one of the gold standards in vascular disease diagnosing. With the help of contrast agent, time-resolved 2D DSA images deliver comprehensive insights into blood flow information and can be utilized to reconstruct 3D vessel structures. Current commercial DSA systems typically demand hundreds of scanning views to perform reconstruction, resulting in substanti… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 12 pages, 13 figures, 5 tables

  12. arXiv:2405.09369  [pdf, other

    cs.IR

    Diffusion-based Contrastive Learning for Sequential Recommendation

    Authors: Ziqiang Cui, Haolun Wu, Bowei He, Ji Cheng, Chen Ma

    Abstract: Self-supervised contrastive learning, which directly extracts inherent data correlations from unlabeled data, has been widely utilized to mitigate the data sparsity issue in sequential recommendation. The majority of existing methods create different augmented views of the same user sequence via random augmentation, and subsequently minimize their distance in the embedding space to enhance the qua… ▽ More

    Submitted 7 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  13. arXiv:2405.08589  [pdf, other

    cs.CV

    Variable Substitution and Bilinear Programming for Aligning Partially Overlap** Point Sets

    Authors: Wei Lian, Zhesen Cui, Fei Ma, Hang Pan, Wangmeng Zuo

    Abstract: In many applications, the demand arises for algorithms capable of aligning partially overlap** point sets while remaining invariant to the corresponding transformations. This research presents a method designed to meet such requirements through minimization of the objective function of the robust point matching (RPM) algorithm. First, we show that the RPM objective is a cubic polynomial. Then, t… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  14. arXiv:2405.08054  [pdf, other

    cs.GR cs.CV

    Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

    Authors: Wenqi Dong, Bangbang Yang, Lin Ma, Xiao Liu, Liyuan Cui, Hujun Bao, Yuewen Ma, Zhaopeng Cui

    Abstract: As humans, we aspire to create media content that is both freely willed and readily controlled. Thanks to the prominent development of generative techniques, we now can easily utilize 2D diffusion methods to synthesize images controlled by raw sketch or designated human poses, and even progressively edit/regenerate local regions with masked inpainting. However, similar workflows in 3D modeling tas… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Project webpage: https://zju3dv.github.io/coin3d

  15. arXiv:2405.06059  [pdf, other

    cs.CL cs.AI

    A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds

    Authors: Christopher Z. Cui, Xiangyu Peng, Mark O. Riedl

    Abstract: Open-ended worlds are those in which there are no pre-specified goals or environmental reward signal. As a consequence, an agent must know how to perform a multitude of tasks. However, when a new task is presented to an agent, we expect it to be able to reuse some of what it knows from previous tasks to rapidly learn that new task. We introduce a novel technique whereby policies for different a pr… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  16. arXiv:2405.04909  [pdf, other

    cs.CV cs.AI

    Traj-LLM: A New Exploration for Empowering Trajectory Prediction with Pre-trained Large Language Models

    Authors: Zhengxing Lan, Hongbo Li, Lingshan Liu, Bo Fan, Yisheng Lv, Yilong Ren, Zhiyong Cui

    Abstract: Predicting the future trajectories of dynamic traffic actors is a cornerstone task in autonomous driving. Though existing notable efforts have resulted in impressive performance improvements, a gap persists in scene cognitive and understanding of the complex traffic semantics. This paper proposes Traj-LLM, the first to investigate the potential of using Large Language Models (LLMs) without explici… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  17. arXiv:2404.17520  [pdf, other

    cs.RO

    A Cognitive-Driven Trajectory Prediction Model for Autonomous Driving in Mixed Autonomy Environment

    Authors: Haicheng Liao, Zhenning Li, Chengyue Wang, Bonan Wang, Hanlin Kong, Yanchen Guan, Guofa Li, Zhiyong Cui, Chengzhong Xu

    Abstract: As autonomous driving technology progresses, the need for precise trajectory prediction models becomes paramount. This paper introduces an innovative model that infuses cognitive insights into trajectory prediction, focusing on perceived safety and dynamic decision-making. Distinct from traditional approaches, our model excels in analyzing interactions and behavior patterns in mixed autonomy traff… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  18. arXiv:2404.12031  [pdf, other

    cs.CV

    MLS-Track: Multilevel Semantic Interaction in RMOT

    Authors: Zeliang Ma, Song Yang, Zhe Cui, Zhicheng Zhao, Fei Su, Delong Liu, **gyu Wang

    Abstract: The new trend in multi-object tracking task is to track objects of interest using natural language. However, the scarcity of paired prompt-instance data hinders its progress. To address this challenge, we propose a high-quality yet low-cost data generation method base on Unreal Engine 5 and construct a brand-new benchmark dataset, named Refer-UE-City, which primarily includes scenes from intersect… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 17 pages 8 figures

  19. arXiv:2404.06153  [pdf, other

    cs.LG q-bio.GN

    scRDiT: Generating single-cell RNA-seq data by diffusion transformers and accelerating sampling

    Authors: Shengze Dong, Zhuorui Cui, Ding Liu, **zhi Lei

    Abstract: Motivation: Single-cell RNA sequencing (scRNA-seq) is a groundbreaking technology extensively utilized in biological research, facilitating the examination of gene expression at the individual cell level within a given tissue sample. While numerous tools have been developed for scRNA-seq data analysis, the challenge persists in capturing the distinct features of such data and replicating virtual d… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 11 pages, 4 figures,

  20. arXiv:2404.04430  [pdf, other

    cs.CV

    PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos

    Authors: Yufei Zhang, Jeffrey O. Kephart, Zijun Cui, Qiang Ji

    Abstract: While current methods have shown promising progress on estimating 3D human motion from monocular videos, their motion estimates are often physically unrealistic because they mainly consider kinematics. In this paper, we introduce Physics-aware Pretrained Transformer (PhysPT), which improves kinematics-based motion estimates and infers motion forces. PhysPT exploits a Transformer encoder-decoder ba… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  21. arXiv:2404.02152  [pdf, other

    cs.CV

    GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

    Authors: Chong Bao, Yinda Zhang, Yuan Li, Xiyu Zhang, Bangbang Yang, Hujun Bao, Marc Pollefeys, Guofeng Zhang, Zhaopeng Cui

    Abstract: Recently, we have witnessed the explosive growth of various volumetric representations in modeling animatable head avatars. However, due to the diversity of frameworks, there is no practical method to support high-level applications like 3D head avatar editing across different representations. In this paper, we propose a generic avatar editing approach that can be universally applied to various 3D… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024. Project page: https://zju3dv.github.io/geneavatar/

  22. arXiv:2403.19881  [pdf, other

    cs.AI

    IME: Integrating Multi-curvature Shared and Specific Embedding for Temporal Knowledge Graph Completion

    Authors: Jiapu Wang, Zheng Cui, Boyue Wang, Shirui Pan, Junbin Gao, Baocai Yin, Wen Gao

    Abstract: Temporal Knowledge Graphs (TKGs) incorporate a temporal dimension, allowing for a precise capture of the evolution of knowledge and reflecting the dynamic nature of the real world. Typically, TKGs contain complex geometric structures, with various geometric structures interwoven. However, existing Temporal Knowledge Graph Completion (TKGC) methods either model TKGs in a single space or neglect the… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  23. arXiv:2403.18282  [pdf, other

    cs.CV

    SGDM: Static-Guided Dynamic Module Make Stronger Visual Models

    Authors: Wenjie Xing, Zhenchao Cui, **g Qi

    Abstract: The spatial attention mechanism has been widely used to improve object detection performance. However, its operation is currently limited to static convolutions lacking content-adaptive features. This paper innovatively approaches from the perspective of dynamic convolution. We propose Razor Dynamic Convolution (RDConv) to address thetwo flaws in dynamic weight convolution, making it hard to imple… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 16 pages, 4 figures

  24. arXiv:2403.16406  [pdf

    cs.HC

    Development of a Chinese Human-Automation Trust Scale

    Authors: Zixin Cui, Xiangling Zhuang, Seul Chan Lee, Jieun Lee, Xintong Li, Makoto Itoh

    Abstract: The development of a reliable and valid assessment tool of human-automation trust is an important topic. This study aimed to develop a Chinese version of human-automation trust scale (C-HATS) with reasonable reliability and validity based on Lee and See (2004)'s trust model. After three phases of assessments including exploratory factor analysis, item analysis, and confirmatory factor analysis, di… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 26 pages with 3 figures

  25. arXiv:2403.16095  [pdf, other

    cs.CV cs.RO

    CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field

    Authors: Jiarui Hu, Xianhao Chen, Boyin Feng, Guanglin Li, Liang**g Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

    Abstract: Recently neural radiance fields (NeRF) have been widely exploited as 3D representations for dense simultaneous localization and map** (SLAM). Despite their notable successes in surface modeling and novel view synthesis, existing NeRF-based methods are hindered by their computationally intensive and time-consuming volume rendering pipeline. This paper presents an efficient dense RGB-D SLAM system… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Project Page: https://zju3dv.github.io/cg-slam

  26. arXiv:2403.08251  [pdf, other

    cs.MA cs.AI cs.CY

    Emergence of Social Norms in Generative Agent Societies: Principles and Architecture

    Authors: Siyue Ren, Zhiyao Cui, Ruiqi Song, Zhen Wang, Shuyue Hu

    Abstract: Social norms play a crucial role in guiding agents towards understanding and adhering to standards of behavior, thus reducing social conflicts within multi-agent systems (MASs). However, current LLM-based (or generative) MASs lack the capability to be normative. In this paper, we propose a novel architecture, named CRSEC, to empower the emergence of social norms within generative MASs. Our archite… ▽ More

    Submitted 20 May, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at IJCAI 2024

  27. arXiv:2403.02221  [pdf, other

    cs.LG

    TPLLM: A Traffic Prediction Framework Based on Pretrained Large Language Models

    Authors: Yilong Ren, Yue Chen, Shuai Liu, Boyue Wang, Haiyang Yu, Zhiyong Cui

    Abstract: Traffic prediction constitutes a pivotal facet within the purview of Intelligent Transportation Systems (ITS), and the attainment of highly precise predictions holds profound significance for efficacious traffic management. The precision of prevailing deep learning-driven traffic prediction models typically sees an upward trend with a rise in the volume of training data. However, the procurement o… ▽ More

    Submitted 18 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  28. arXiv:2402.19251  [pdf, other

    cs.AI cs.RO

    A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving

    Authors: Haicheng Liao, Yongkang Li, Zhenning Li, Chengyue Wang, Zhiyong Cui, Shengbo Eben Li, Chengzhong Xu

    Abstract: In autonomous vehicle (AV) technology, the ability to accurately predict the movements of surrounding vehicles is paramount for ensuring safety and operational efficiency. Incorporating human decision-making insights enables AVs to more effectively anticipate the potential actions of other vehicles, significantly improving prediction accuracy and responsiveness in dynamic environments. This paper… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  29. arXiv:2402.17487  [pdf, other

    cs.CV cs.LG eess.IV

    Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model

    Authors: Panqi Jia, A. Burakhan Koyuncu, Jue Mao, Ze Cui, Yi Ma, Tiansheng Guo, Timofey Solovyev, Alexander Karabutov, Yin Zhao, **g Wang, Elena Alshina, Andre Kaup

    Abstract: The research on neural network (NN) based image compression has shown superior performance compared to classical compression frameworks. Unlike the hand-engineered transforms in the classical frameworks, NN-based models learn the non-linear transforms providing more compact bit representations, and achieve faster coding speed on parallel devices over their classical counterparts. Those properties… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted at (IEEE) PCS 2024; 6 pages

  30. arXiv:2402.16718  [pdf

    physics.med-ph cs.AI

    An Overview of the Development of Stereotactic Body Radiation Therapy

    Authors: Yanqi Zong, Zhengrong Cui, Luqi Lin, Sihao Wang, Yizhi Chen

    Abstract: Stereotactic body radiation therapy (SBRT) refers to focusing high-energy rays in three-dimensional space on the tumor lesion area, reducing the dose received by surrounding normal tissues, which can effectively improve the local control rate of the tumor and reduce the probability of complications. With the comprehensive development of medical imaging, radiation biology and other disciplines, thi… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  31. arXiv:2402.12746  [pdf, ps, other

    eess.AS cs.SD

    Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network

    Authors: Yanan Chen, Zihao Cui, Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

    Abstract: The expectation to deploy a universal neural network for speech enhancement, with the aim of improving noise robustness across diverse speech processing tasks, faces challenges due to the existing lack of awareness within static speech enhancement frameworks regarding the expected speech in downstream modules. These limitations impede the effectiveness of static speech enhancement approaches in ac… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  32. arXiv:2401.10786  [pdf, other

    cs.CV

    Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion

    Authors: Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Marc Pollefeys, Martin R. Oswald

    Abstract: Directly generating scenes from satellite imagery offers exciting possibilities for integration into applications like games and map services. However, challenges arise from significant view changes and scene scale. Previous efforts mainly focused on image or video generation, lacking exploration into the adaptability of scene generation for arbitrary views. Existing 3D generation works either ope… ▽ More

    Submitted 1 April, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Journal ref: CVPR 2024

  33. arXiv:2401.06557  [pdf, other

    cs.LG cs.AI cs.SI stat.ME

    Treatment-Aware Hyperbolic Representation Learning for Causal Effect Estimation with Social Networks

    Authors: Ziqiang Cui, Xing Tang, Yang Qiao, Bowei He, Liang Chen, Xiuqiang He, Chen Ma

    Abstract: Estimating the individual treatment effect (ITE) from observational data is a crucial research topic that holds significant value across multiple domains. How to identify hidden confounders poses a key challenge in ITE estimation. Recent studies have incorporated the structural information of social networks to tackle this challenge, achieving notable advancements. However, these methods utilize g… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted by SIAM SDM'24

  34. arXiv:2401.01491   

    cs.CE

    A Hybrid Neural Network Model For Predicting The Nitrate Concentration In The Recirculating Aquaculture System

    Authors: Xiangyu Fan, Jiaxin Lia, Yingzhe Wang, Yingsha Qu, Hao Li, Keming Qu, Zhengguo Cui

    Abstract: This study was groundbreaking in its application of neural network models for nitrate management in the Recirculating Aquaculture System (RAS). A hybrid neural network model was proposed, which accurately predicted daily nitrate concentration and its trends using six water quality parameters. We conducted a 105-day aquaculture experiment, during which we collected 450 samples from five sets of RAS… ▽ More

    Submitted 15 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: The content of this paper needs to be further filled and improved

  35. arXiv:2401.00781  [pdf

    cs.LG stat.ML

    Inferring Heterogeneous Treatment Effects of Crashes on Highway Traffic: A Doubly Robust Causal Machine Learning Approach

    Authors: Shuang Li, Ziyuan Pu, Zhiyong Cui, Seunghyeon Lee, Xiucheng Guo, Dong Ngoduy

    Abstract: Highway traffic crashes exert a considerable impact on both transportation systems and the economy. In this context, accurate and dependable emergency responses are crucial for effective traffic management. However, the influence of crashes on traffic status varies across diverse factors and may be biased due to selection bias. Therefore, there arises a necessity to accurately estimate the heterog… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 38 pages, 13 figures, 8 tables

  36. arXiv:2312.17050  [pdf, other

    cs.CV

    KeDuSR: Real-World Dual-Lens Super-Resolution via Kernel-Free Matching

    Authors: Huan**g Yue, Zifan Cui, Kun Li, **gyu Yang

    Abstract: Dual-lens super-resolution (SR) is a practical scenario for reference (Ref) based SR by utilizing the telephoto image (Ref) to assist the super-resolution of the low-resolution wide-angle image (LR input). Different from general RefSR, the Ref in dual-lens SR only covers the overlapped field of view (FoV) area. However, current dual-lens SR methods rarely utilize these specific characteristics and… ▽ More

    Submitted 2 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 14 pages, 10 figures. Accepted by AAAI-2024

  37. arXiv:2312.13156  [pdf, other

    cs.CE cs.AI

    AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model

    Authors: Lening Wang, Yilong Ren, Han Jiang, Pinlong Cai, Daocheng Fu, Tianqi Wang, Zhiyong Cui, Haiyang Yu, Xuesong Wang, Hanchu Zhou, Helai Huang, Yinhai Wang

    Abstract: Traffic accidents, being a significant contributor to both human casualties and property damage, have long been a focal point of research for many scholars in the field of traffic safety. However, previous studies, whether focusing on static environmental assessments or dynamic driving analyses, as well as pre-accident predictions or post-accident rule analyses, have typically been conducted in is… ▽ More

    Submitted 28 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 21 pages, 19 figures

  38. arXiv:2312.10649  [pdf, other

    cs.CV

    PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields

    Authors: Boming Zhao, Luwei Yang, Mao Mao, Hujun Bao, Zhaopeng Cui

    Abstract: Due to the ability to synthesize high-quality novel views, Neural Radiance Fields (NeRF) have been recently exploited to improve visual localization in a known environment. However, the existing methods mostly utilize NeRFs for data augmentation to improve the regression model training, and the performance on novel viewpoints and appearances is still limited due to the lack of geometric constraint… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  39. arXiv:2312.09093  [pdf, other

    cs.CV

    Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption

    Authors: Ziteng Cui, Lin Gu, Xiao Sun, Xianzheng Ma, Yu Qiao, Tatsuya Harada

    Abstract: The standard Neural Radiance Fields (NeRF) paradigm employs a viewer-centered methodology, entangling the aspects of illumination and material reflectance into emission solely from 3D points. This simplified rendering approach presents challenges in accurately modeling images captured under adverse lighting conditions, such as low light or over-exposure. Motivated by the ancient Greek emission the… ▽ More

    Submitted 24 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: AAAI 2024, code available at https://cuiziteng.github.io/Aleth_NeRF_web/ Modified version of previous paper arXiv:2303.05807

  40. arXiv:2312.07353  [pdf, other

    cs.CV

    CLIP in Medical Imaging: A Comprehensive Survey

    Authors: Zihao Zhao, Yuxiao Liu, Han Wu, Yonghao Li, Sheng Wang, Lin Teng, Disheng Liu, Zhiming Cui, Qian Wang, Dinggang Shen

    Abstract: Contrastive Language-Image Pre-training (CLIP), a simple yet effective pre-training paradigm, successfully introduces text supervision to vision models. It has shown promising results across various tasks, attributable to its generalizability and interpretability. The use of CLIP has recently gained increasing interest in the medical imaging domain, serving both as a pre-training paradigm for alig… ▽ More

    Submitted 21 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Project page available at https://github.com/zhaozh10/Awesome-CLIP-in-Medical-Imaging

  41. arXiv:2312.05704  [pdf, other

    cs.NI eess.SP

    On the Ground and in the Sky: A Tutorial on Radio Localization in Ground-Air-Space Networks

    Authors: Hazem Sallouha, Sharief Saleh, Sibren De Bast, Zhuangzhuang Cui, Sofie Pollin, Henk Wymeersch

    Abstract: The inherent limitations in scaling up ground infrastructure for future wireless networks, combined with decreasing operational costs of aerial and space networks, are driving considerable research interest in multisegment ground-air-space (GAS) networks. In GAS networks, where ground and aerial users share network resources, ubiquitous and accurate user localization becomes indispensable, not onl… ▽ More

    Submitted 17 June, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in IEEE Communications Surveys & Tutorials

  42. arXiv:2312.01747  [pdf, other

    cs.RO

    Autonomous and Adaptive Role Selection for Multi-robot Collaborative Area Search Based on Deep Reinforcement Learning

    Authors: Lina Zhu, Jiyu Cheng, Hao Zhang, Zhichao Cui, Wei Zhang, Yuehu Liu

    Abstract: In the tasks of multi-robot collaborative area search, we propose the unified approach for simultaneous map** for sensing more targets (exploration) while searching and locating the targets (coverage). Specifically, we implement a hierarchical multi-agent reinforcement learning algorithm to decouple task planning from task execution. The role concept is integrated into the upper-level task plann… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  43. arXiv:2311.17318  [pdf

    cs.CY

    Impact of Indoor Mobility Behavior on the Respiratory Infectious Diseases Transmission Trends

    Authors: Ziwei Cui, Ming Cai, Zheng Zhu, Gongbo Chen, Yao Xiao

    Abstract: The importance of indoor human mobility in the transmission dynamics of respiratory infectious diseases has been acknowledged. Previous studies have predominantly addressed a single type of mobility behavior such as queueing and a series of behaviors under specific scenarios. However, these studies ignore the abstraction of mobility behavior in various scenes and the critical examination of how th… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  44. arXiv:2311.14473  [pdf, other

    eess.IV cs.CV

    Joint Diffusion: Mutual Consistency-Driven Diffusion Model for PET-MRI Co-Reconstruction

    Authors: Taofeng Xie, Zhuo-Xu Cui, Chen Luo, Huayu Wang, Congcong Liu, Yuanzhi Zhang, Xuemei Wang, Yanjie Zhu, Qiyu **, Guoqing Chen, Yihang Zhou, Dong Liang, Haifeng Wang

    Abstract: Positron Emission Tomography and Magnetic Resonance Imaging (PET-MRI) systems can obtain functional and anatomical scans. PET suffers from a low signal-to-noise ratio. Meanwhile, the k-space data acquisition process in MRI is time-consuming. The study aims to accelerate MRI and enhance PET image quality. Conventional approaches involve the separate reconstruction of each modality within PET-MRI sy… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  45. arXiv:2311.10292  [pdf, other

    quant-ph cs.ET physics.optics

    Realization of a programmable multi-purpose photonic quantum memory with over-thousand qubit manipulations

    Authors: Sheng Zhang, Jixuan Shi, Zhaibin Cui, Ye Wang, Yukai Wu, Luming Duan, Yunfei Pu

    Abstract: Quantum networks can enable various applications such as distributed quantum computing, long-distance quantum communication, and network-based quantum sensing with unprecedented performances. One of the most important building blocks for a quantum network is a photonic quantum memory which serves as the interface between the communication channel and the local functional unit. A programmable quant… ▽ More

    Submitted 29 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 17 pages, 19 figures

    Journal ref: Phys. Rev. X 14, 021018 (2024)

  46. arXiv:2311.08013  [pdf, other

    cs.CV cs.GR cs.RO

    CP-SLAM: Collaborative Neural Point-based SLAM System

    Authors: Jiarui Hu, Mao Mao, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

    Abstract: This paper presents a collaborative implicit neural simultaneous localization and map** (SLAM) system with RGB-D image sequences, which consists of complete front-end and back-end modules including odometry, loop detection, sub-map fusion, and global refinement. In order to enable all these modules in a unified framework, we propose a novel neural point based 3D scene representation in which eac… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023

  47. arXiv:2311.03074  [pdf, other

    eess.IV cs.CV

    A Two-Stage Generative Model with CycleGAN and Joint Diffusion for MRI-based Brain Tumor Detection

    Authors: Wenxin Wang, Zhuo-Xu Cui, Guanxun Cheng, Chentao Cao, Xi Xu, Ziwei Liu, Haifeng Wang, Yulong Qi, Dong Liang, Yanjie Zhu

    Abstract: Accurate detection and segmentation of brain tumors is critical for medical diagnosis. However, current supervised learning methods require extensively annotated images and the state-of-the-art generative models used in unsupervised methods often have limitations in covering the whole data distribution. In this paper, we propose a novel framework Two-Stage Generative Model (TSGM) that combines Cyc… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 11 pages,9 figures,3 tables

  48. arXiv:2310.17664  [pdf, other

    cs.LG eess.AS eess.SP

    Cascaded Multi-task Adaptive Learning Based on Neural Architecture Search

    Authors: Yingying Gao, Shilei Zhang, Zihao Cui, Chao Deng, Junlan Feng

    Abstract: Cascading multiple pre-trained models is an effective way to compose an end-to-end system. However, fine-tuning the full cascaded model is parameter and memory inefficient and our observations reveal that only applying adapter modules on cascaded model can not achieve considerable performance as fine-tuning. We propose an automatic and effective adaptive learning method to optimize end-to-end casc… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  49. arXiv:2310.13119  [pdf, other

    cs.CV cs.GR

    DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation

    Authors: Bangbang Yang, Wenqi Dong, Lin Ma, Wenbo Hu, Xiao Liu, Zhaopeng Cui, Yuewen Ma

    Abstract: Diffusion-based methods have achieved prominent success in generating 2D media. However, accomplishing similar proficiencies for scene-level mesh texturing in 3D spatial applications, e.g., XR/VR, remains constrained, primarily due to the intricate nature of 3D geometry and the necessity for immersive free-viewpoint rendering. In this paper, we propose a novel indoor scene texturing framework, whi… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Project webpage: https://ybbbbt.com/publication/dreamspace

  50. arXiv:2310.11106  [pdf, other

    cs.CV

    3D Structure-guided Network for Tooth Alignment in 2D Photograph

    Authors: Yulong Dou, Lanzhuju Mei, Dinggang Shen, Zhiming Cui

    Abstract: Orthodontics focuses on rectifying misaligned teeth (i.e., malocclusions), affecting both masticatory function and aesthetics. However, orthodontic treatment often involves complex, lengthy procedures. As such, generating a 2D photograph depicting aligned teeth prior to orthodontic treatment is crucial for effective dentist-patient communication and, more importantly, for encouraging patients to a… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.