Skip to main content

Showing 51–100 of 756 results for author: Wenbo

.
  1. arXiv:2405.11286  [pdf, other

    cs.CV

    Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion

    Authors: Zeyu Zhang, Yiran Wang, Biao Wu, Shuo Chen, Zhiyuan Zhang, Shiya Huang, Wenbo Zhang, Meng Fang, Ling Chen, Yang Zhao

    Abstract: In recent years, there has been significant interest in creating 3D avatars and motions, driven by their diverse applications in areas like film-making, video games, AR/VR, and human-robot interaction. However, current efforts primarily concentrate on either generating the 3D avatar mesh alone or producing motion sequences, with integrating these two aspects proving to be a persistent challenge. A… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  2. arXiv:2405.11135  [pdf, other

    cs.CR

    AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA

    Authors: Weitao Feng, Wenbo Zhou, Jiyan He, Jie Zhang, Tianyi Wei, Guanlin Li, Tianwei Zhang, Weiming Zhang, Nenghai Yu

    Abstract: Diffusion models have achieved remarkable success in generating high-quality images. Recently, the open-source models represented by Stable Diffusion (SD) are thriving and are accessible for customization, giving rise to a vibrant community of creators and enthusiasts. However, the widespread availability of customized SD models has led to copyright concerns, like unauthorized model distribution a… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Code is available at https://github.com/Georgefwt/AquaLoRA

  3. arXiv:2405.08816  [pdf, other

    cs.CV cs.RO

    The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition

    Authors: Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingxing Wei, Yinpeng Dong, Bo Yang, Shengyin Jiang , et al. (66 additional authors not shown)

    Abstract: In the realm of autonomous driving, robust perception under out-of-distribution conditions is paramount for the safe deployment of vehicles. Challenges such as adverse weather, sensor malfunctions, and environmental unpredictability can severely impact the performance of autonomous systems. The 2024 RoboDrive Challenge was crafted to propel the development of driving perception technologies that c… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICRA 2024; 32 pages, 24 figures, 5 tables; Code at https://robodrive-24.github.io/

  4. arXiv:2405.07685  [pdf, other

    eess.SY

    Comprehensive Analysis of Access Control Models in Edge Computing: Challenges, Solutions, and Future Directions

    Authors: Tao Xue, Ying Zhang, Yanbin Wang, Wenbo Wang, Shuailou Li, Haibin Zhang

    Abstract: Many contemporary applications, including smart homes and autonomous vehicles, rely on the Internet of Things technology. While cloud computing provides a multitude of valuable services for these applications, it generally imposes constraints on latency-sensitive applications due to the significant propagation delays. As a complementary technique to cloud computing, edge computing situates computi… ▽ More

    Submitted 22 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  5. arXiv:2405.07023  [pdf, other

    eess.IV cs.CV

    Efficient Real-world Image Super-Resolution Via Adaptive Directional Gradient Convolution

    Authors: Long Peng, Yang Cao, Ren**g Pei, Wenbo Li, Jiaming Guo, Xueyang Fu, Yang Wang, Zheng-Jun Zha

    Abstract: Real-SR endeavors to produce high-resolution images with rich details while mitigating the impact of multiple degradation factors. Although existing methods have achieved impressive achievements in detail recovery, they still fall short when addressing regions with complex gradient arrangements due to the intensity-based linear weighting feature extraction manner. Moreover, the stochastic artifact… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  6. arXiv:2405.06536  [pdf, other

    cs.CV

    Mesh Denoising Transformer

    Authors: Wenbo Zhao, Xianming Liu, Deming Zhai, Junjun Jiang, Xiangyang Ji

    Abstract: Mesh denoising, aimed at removing noise from input meshes while preserving their feature structures, is a practical yet challenging task. Despite the remarkable progress in learning-based mesh denoising methodologies in recent years, their network designs often encounter two principal drawbacks: a dependence on single-modal geometric representations, which fall short in capturing the multifaceted… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  7. arXiv:2405.04781  [pdf, other

    cs.CL

    CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization

    Authors: Zheyan Qu, Lu Yin, Zitong Yu, Wenbo Wang, Xing zhang

    Abstract: Large language models (LLMs) have demonstrated astonishing capabilities in natural language processing (NLP) tasks, sparking interest in their application to professional domains with higher specialized requirements. However, restricted access to closed-source LLMs via APIs and the difficulty in collecting massive high-quality datasets pose obstacles to the development of large language models in… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  8. arXiv:2405.02622  [pdf

    physics.flu-dyn

    New Interpretation for error propagation of data-driven Reynolds stress closures via global stability analysis

    Authors: Xianglin Shan, Wenbo Cao, Weiwei Zhang

    Abstract: In light of the challenges surrounding convergence and error propagation encountered in Reynolds-averaged Navier-Stokes (RANS) equations with data-driven Reynolds stress closures, researchers commonly attribute these issues to ill-conditioning through conditional number analysis. This paper delves into an additional factor, numerical instability, contributing to these challenges. We conduct global… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  9. Rip-NeRF: Anti-aliasing Radiance Fields with Ripmap-Encoded Platonic Solids

    Authors: Junchen Liu, Wenbo Hu, Zhuo Yang, Jianteng Chen, Guoliang Wang, Xiaoxue Chen, Yantong Cai, Huan-ang Gao, Hao Zhao

    Abstract: Despite significant advancements in Neural Radiance Fields (NeRFs), the renderings may still suffer from aliasing and blurring artifacts, since it remains a fundamental challenge to effectively and efficiently characterize anisotropic areas induced by the cone-casting procedure. This paper introduces a Ripmap-Encoded Platonic Solid representation to precisely and efficiently featurize 3D anisotrop… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: SIGGRAPH 2024, Project page: https://junchenliu77.github.io/Rip-NeRF , Code: https://github.com/JunchenLiu77/Rip-NeRF

  10. arXiv:2405.01957  [pdf

    physics.flu-dyn

    An analysis and solution of ill-conditioning in physics-informed neural networks

    Authors: Wenbo Cao, Weiwei Zhang

    Abstract: Physics-informed neural networks (PINNs) have recently emerged as a novel and popular approach for solving forward and inverse problems involving partial differential equations (PDEs). However, achieving stable training and obtaining correct results remain a challenge in many cases, often attributed to the ill-conditioning of PINNs. Nonetheless, further analysis is still lacking, severely limiting… ▽ More

    Submitted 24 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  11. arXiv:2405.01830  [pdf, other

    quant-ph physics.comp-ph physics.optics

    Computational Electromagnetics Meets Spin Qubits: Controlling Noise Effects in Quantum Sensing and Computing

    Authors: Wenbo Sun, Sathwik Bharadwaj, Runwei Zhou, Dan Jiao, Zubin Jacob

    Abstract: Solid-state spin qubits have emerged as promising quantum information platforms but are susceptible to magnetic noise. Despite extensive efforts in controlling noise in spin qubit quantum applications, one important but less controlled noise source is near-field electromagnetic fluctuations. Low-frequency (MHz and GHz) electromagnetic fluctuations are significantly enhanced near nanostructured los… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures

  12. arXiv:2405.00637  [pdf, ps, other

    eess.SY

    A Distributed Model Identification Algorithm for Multi-Agent Systems

    Authors: Vivek Khatana, Chin-Yao Chang, Wenbo Wang

    Abstract: In this study, we investigate agent-based approach for system model identification with an emphasis on power distribution system applications. Departing from conventional practices of relying on historical data for offline model identification, we adopt an online update approach utilizing real-time data by employing the latest data points for gradient computation. This methodology offers advantage… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  13. arXiv:2404.18630  [pdf, other

    cs.CV

    4D-DRESS: A 4D Dataset of Real-world Human Clothing with Semantic Annotations

    Authors: Wenbo Wang, Hsuan-I Ho, Chen Guo, Boxiang Rong, Artur Grigorev, Jie Song, Juan Jose Zarate, Otmar Hilliges

    Abstract: The studies of human clothing for digital avatars have predominantly relied on synthetic datasets. While easy to collect, synthetic data often fall short in realism and fail to capture authentic clothing dynamics. Addressing this gap, we introduce 4D-DRESS, the first real-world 4D dataset advancing human clothing research with its high-quality 4D textured scans and garment meshes. 4D-DRESS capture… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 paper, 21 figures, 9 tables

  14. arXiv:2404.16147  [pdf, other

    cs.RO

    Chat2Scenario: Scenario Extraction From Dataset Through Utilization of Large Language Model

    Authors: Yongqi Zhao, Wenbo Xiao, Tomislav Mihalj, Jia Hu, Arno Eichberger

    Abstract: The advent of Large Language Models (LLM) provides new insights to validate Automated Driving Systems (ADS). In the herein-introduced work, a novel approach to extracting scenarios from naturalistic driving datasets is presented. A framework called Chat2Scenario is proposed leveraging the advanced Natural Language Processing (NLP) capabilities of LLM to understand and identify different driving sc… ▽ More

    Submitted 26 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: IEEE Intelligent Vehicles Symposium (IV 2024)

  15. arXiv:2404.16006  [pdf, other

    cs.CV

    MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

    Authors: Kaining Ying, Fanqing Meng, ** Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, ** Luo, Kaipeng Zhang, Wenqi Shao

    Abstract: Large Vision-Language Models (LVLMs) show significant strides in general-purpose multimodal applications such as visual dialogue and embodied navigation. However, existing multimodal evaluation benchmarks cover a limited number of multimodal tasks testing rudimentary capabilities, falling short in tracking LVLM development. In this study, we present MMT-Bench, a comprehensive benchmark designed to… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 77 pages, 41 figures

  16. arXiv:2404.15384  [pdf, other

    cs.LG cs.AI

    FL-TAC: Enhanced Fine-Tuning in Federated Learning via Low-Rank, Task-Specific Adapter Clustering

    Authors: Siqi **, Yuzhu Mao, Yang Liu, Xiao-** Zhang, Wenbo Ding

    Abstract: Although large-scale pre-trained models hold great potential for adapting to downstream tasks through fine-tuning, the performance of such fine-tuned models is often limited by the difficulty of collecting sufficient high-quality, task-specific data. Federated Learning (FL) offers a promising solution by enabling fine-tuning across large-scale clients with a variety of task data, but it is bottlen… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  17. arXiv:2404.15209  [pdf, other

    cs.LG stat.ME stat.ML

    Data-Driven Knowledge Transfer in Batch $Q^*$ Learning

    Authors: Elynn Chen, Xi Chen, Wenbo **g

    Abstract: In data-driven decision-making in marketing, healthcare, and education, it is desirable to utilize a large amount of data from existing ventures to navigate high-dimensional feature spaces and address data scarcity in new ventures. We explore knowledge transfer in dynamic decision-making by concentrating on batch stationary environments and formally defining task discrepancies through the lens of… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  18. arXiv:2404.14809  [pdf, other

    cs.CL cs.AI cs.DB

    A Survey of Large Language Models on Generative Graph Analytics: Query, Learning, and Applications

    Authors: Wenbo Shang, Xin Huang

    Abstract: A graph is a fundamental data model to represent various entities and their complex relationships in society and nature, such as social networks, transportation networks, financial networks, and biomedical systems. Recently, large language models (LLMs) have showcased a strong generalization ability to handle various NLP and multi-mode tasks to answer users' arbitrary questions and specific-domain… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 31 pages including references, 22 figures

  19. arXiv:2404.14795  [pdf, other

    cs.CL cs.CR cs.LG

    Talk Too Much: Poisoning Large Language Models under Token Limit

    Authors: Jiaming He, Wenbo Jiang, Guanyu Hou, Wenshu Fan, Rui Zhang, Hongwei Li

    Abstract: Mainstream poisoning attacks on large language models (LLMs) typically set a fixed trigger in the input instance and specific responses for triggered queries. However, the fixed trigger setting (e.g., unusual words) may be easily detected by human detection, limiting the effectiveness and practicality in real-world scenarios. To enhance the stealthiness of the trigger, we present a poisoning attac… ▽ More

    Submitted 11 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  20. arXiv:2404.13874  [pdf, other

    cs.CL cs.CV

    VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models

    Authors: Haoyi Qiu, Wenbo Hu, Zi-Yi Dou, Nanyun Peng

    Abstract: Large Vision-Language Models (LVLMs) suffer from hallucination issues, wherein the models generate plausible-sounding but factually incorrect outputs, undermining their reliability. A comprehensive quantitative evaluation is necessary to identify and understand the extent of hallucinations in these models. However, existing benchmarks are often limited in scope, focusing mainly on object hallucina… ▽ More

    Submitted 5 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: ACL 2024 Findings

  21. arXiv:2404.13396  [pdf

    cond-mat.mtrl-sci

    Angle-Resolved Magneto-Chiral Anisotropy in a Non-Centrosymmetric Atomic Layer Superlattice

    Authors: Long Cheng, Mingrui Bao, **gxian Zhang, Xue Zhang, Qun Yang, Qiang Li, Hui Cao, Dawei Qiu, Jia Liu, Fei Ye, Qing Wang, Genhao Liang, Hui Li, Guanglei Cheng, Hua Zhou, Jian-Min Zuo, Xiaodong Zhou, Jian Shen, Zhifeng Zhu, Sai Mu, Wenbo Wang, Xiaofang Zhai

    Abstract: Chirality in solid-state materials has sparked significant interest due to potential applications of topologically-protected chiral states in next-generation information technology. The electrical magneto-chiral effect (eMChE), arising from relativistic spin-orbit interactions, shows great promise for develo** chiral materials and devices for electronic integration. Here we demonstrate an angle-… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  22. arXiv:2404.11249  [pdf, other

    cs.CV

    A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene

    Authors: Wenbo Zhang, Yifan Zhang, Jianfeng Lin, Binqiang Huang, **lu Zhang, Wenhao Yu

    Abstract: Pre-trained vision-language (V-L) models such as CLIP have shown excellent performance in many downstream cross-modal tasks. However, most of them are only applicable to the English context. Subsequent research has focused on this problem and proposed improved models, such as CN-CLIP and AltCLIP, to facilitate their applicability to Chinese and even other languages. Nevertheless, these models suff… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  23. arXiv:2404.09630  [pdf, other

    physics.soc-ph

    Optimal design of ride-pooling as on-demand feeder services

    Authors: Wenbo Fan, Weihua Gu, Meng Xu

    Abstract: The technology-enabled ride-pooling (RP) is designed as an on-demand feeder service to connect remote areas to transit terminals (or activity centers). We propose the so-called ``hold-dispatch'' operation strategy, which imposes a target number of shared rides (termed the ride-pooling size) for each vehicle to enhance RP's transportation efficiency. Analytical models are formulated at the planning… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  24. arXiv:2404.08501  [pdf, ps, other

    cs.NE cs.AI

    Analyzing and Overcoming Local Optima in Complex Multi-Objective Optimization by Decomposition-Based Evolutionary Algorithms

    Authors: Ting Dong, Haoxin Wang, Hengxi Zhang, Wenbo Ding

    Abstract: When addressing the challenge of complex multi-objective optimization problems, particularly those with non-convex and non-uniform Pareto fronts, Decomposition-based Multi-Objective Evolutionary Algorithms (MOEADs) often converge to local optima, thereby limiting solution diversity. Despite its significance, this issue has received limited theoretical exploration. Through a comprehensive geometric… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  25. arXiv:2404.08355  [pdf, other

    math.ST

    On testing mean of high dimensional compositional data

    Authors: Qianqian Jiang, Wenbo Li, Zeng Li

    Abstract: We investigate one/two-sample mean tests for high-dimensional compositional data when the number of variables is comparable with the sample size, as commonly encountered in microbiome research. Existing methods mainly focus on max-type test statistics which are suitable for detecting sparse signals. However, in this paper, we introduce a novel approach using sum-type test statistics which are capa… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  26. arXiv:2404.07876  [pdf, other

    math.DS

    Joint transitivity for linear iterates

    Authors: Sebastián Donoso, Andreas Koutsogiannis, Wenbo Sun

    Abstract: We establish sufficient and necessary conditions for the joint transitivity of linear iterates in a minimal topological dynamical system with commuting transformations. This result provides the first topological analogue of the classical Berend and Bergelson joint ergodicity criterion in measure-preserving systems.

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Comments welcome!

    MSC Class: Primary: 37B05; Secondary: 37B02; 37B20

  27. arXiv:2404.02863  [pdf

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.supr-con

    Discovery of universal phonon thermal Hall effect in crystals

    Authors: Xiaobo **, Xu Zhang, Wenbo Wan, Hanru Wang, Yihan Jiao, Shiyan Li

    Abstract: Thermal Hall effect (THE) in insulator is a remarkable phenomenon that arises from the motion of chargeless quasi-particles under a magnetic field. While magnons or exotic spin excitations were considered as the origin of THE in some magnetic materials, there are more and more evidences suggesting that phonons play a significant role. However, the mechanism behind phonon THE is still unknown. Here… ▽ More

    Submitted 2 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 33 pages

  28. arXiv:2404.02663  [pdf

    eess.SP cs.IT

    Ground-to-UAV sub-Terahertz channel measurement and modeling

    Authors: Da Li, Peian Li, Jiabiao Zhao, Jianjian Liang, Jiacheng Liu, Guohao Liu, Yuanshuai Lei, Wenbo Liu, Jianqin Deng, Fuyong Liu, Jianjun Ma

    Abstract: Unmanned Aerial Vehicle (UAV) assisted terahertz (THz) wireless communications have been expected to play a vital role in the next generation of wireless networks. UAVs can serve as either repeaters or data collectors within the communication link, thereby potentially augmenting the efficacy of communication systems. Despite their promise, the channel analysis and modeling specific to THz wireless… ▽ More

    Submitted 28 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Submitted to Optics Express

  29. arXiv:2403.18957  [pdf, other

    cs.CY cs.CL cs.LG cs.SI

    Moderating Illicit Online Image Promotion for Unsafe User-Generated Content Games Using Large Vision-Language Models

    Authors: Keyan Guo, Ayush Utkarsh, Wenbo Ding, Isabelle Ondracek, Ziming Zhao, Guo Freeman, Nishant Vishwamitra, Hongxin Hu

    Abstract: Online user-generated content games (UGCGs) are increasingly popular among children and adolescents for social interaction and more creative online entertainment. However, they pose a heightened risk of exposure to explicit content, raising growing concerns for the online safety of children and adolescents. Despite these concerns, few studies have addressed the issue of illicit image-based promoti… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: To Appear in the 33rd USENIX Security Symposium, August 14-16, 2024

  30. arXiv:2403.17552  [pdf, other

    cs.CL

    Naive Bayes-based Context Extension for Large Language Models

    Authors: Jianlin Su, Murtadha Ahmed, Wenbo, Luo Ao, Mingren Zhu, Yunfeng Liu

    Abstract: Large Language Models (LLMs) have shown promising in-context learning abilities. However, conventional In-Context Learning (ICL) approaches are often impeded by length limitations of transformer architecture, which pose challenges when attempting to effectively integrate supervision from a substantial number of demonstration examples. In this paper, we introduce a novel framework, called Naive Bay… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to main NAACL 2024

  31. arXiv:2403.16560  [pdf, other

    cs.RO

    Active Admittance Control with Iterative Learning for General-Purpose Contact-Rich Manipulation

    Authors: Bo Zhou, Yuyao Sun, Wenbo Liu, Ruixuan Jiao, Fang Fang, Shihua Li

    Abstract: Force interaction is inevitable when robots face multiple operation scenarios. How to make the robot competent in force control for generalized operations such as multi-tasks still remains a challenging problem. Aiming at the reproducibility of interaction tasks and the lack of a generalized force control framework for multi-task scenarios, this paper proposes a novel hybrid control framework base… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  32. arXiv:2403.16224  [pdf, other

    cs.CV

    Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields

    Authors: Haoyuan Wang, Wenbo Hu, Lei Zhu, Rynson W. H. Lau

    Abstract: Inverse rendering aims at recovering both geometry and materials of objects. It provides a more compatible reconstruction for conventional rendering engines, compared with the neural radiance fields (NeRFs). On the other hand, existing NeRF-based inverse rendering methods cannot handle glossy objects with local light interactions well, as they typically oversimplify the illumination as a 2D enviro… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 paper. Project webpage https://whyy.site/paper/nep

  33. arXiv:2403.15530  [pdf, other

    cs.CV

    Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting

    Authors: Zheng Zhang, Wenbo Hu, Yixing Lao, Tong He, Hengshuang Zhao

    Abstract: 3D Gaussian Splatting (3DGS) has demonstrated impressive novel view synthesis results while advancing real-time rendering performance. However, it relies heavily on the quality of the initial point cloud, resulting in blurring and needle-like artifacts in areas with insufficient initializing points. This is mainly attributed to the point cloud growth condition in 3DGS that only considers the avera… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  34. arXiv:2403.11056  [pdf, other

    cs.CV

    Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration

    Authors: Zhihao Liang, Qi Zhang, Wenbo Hu, Ying Feng, Lei Zhu, Kui Jia

    Abstract: The 3D Gaussian Splatting (3DGS) gained its popularity recently by combining the advantages of both primitive-based and volumetric 3D representations, resulting in improved quality and efficiency for 3D scene rendering. However, 3DGS is not alias-free, and its rendering at varying resolutions could produce severe blurring or jaggies. This is because 3DGS treats each pixel as an isolated, single po… ▽ More

    Submitted 3 April, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: 29 pages

  35. arXiv:2403.10050  [pdf, other

    cs.CV

    Texture-GS: Disentangling the Geometry and Texture for 3D Gaussian Splatting Editing

    Authors: Tian-Xing Xu, Wenbo Hu, Yu-Kun Lai, Ying Shan, Song-Hai Zhang

    Abstract: 3D Gaussian splatting, emerging as a groundbreaking approach, has drawn increasing attention for its capabilities of high-fidelity reconstruction and real-time rendering. However, it couples the appearance and geometry of the scene within the Gaussian attributes, which hinders the flexibility of editing operations, such as texture swap**. To address this issue, we propose a novel approach, namel… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  36. arXiv:2403.09323  [pdf, other

    cs.CV

    E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection

    Authors: Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Wenbo Huang, Yunsong Li

    Abstract: Multimodal image fusion and object detection are crucial for autonomous driving. While current methods have advanced the fusion of texture details and semantic information, their complex training processes hinder broader applications. Addressing this challenge, we introduce E2E-MFD, a novel end-to-end algorithm for multimodal fusion detection. E2E-MFD streamlines the process, achieving high perfor… ▽ More

    Submitted 23 May, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  37. arXiv:2403.08361  [pdf, other

    hep-ex hep-ph

    Search for cosmic-ray boosted sub-MeV dark matter-electron scatterings in PandaX-4T

    Authors: Xiaofeng Shang, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Lisheng Geng, Karl Giboni, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Junting Huang, Zhou Huang, Ruquan Hou, Yu Hou, Xiangdong Ji, Yonglin Ju, Chenxiang Li , et al. (67 additional authors not shown)

    Abstract: We report the first search for the elastic scatterings between cosmic-ray boosted sub-MeV dark matter and electrons in the PandaX-4T liquid xenon experiment. Sub-MeV dark matter particles can be accelerated by scattering with electrons in the cosmic rays and produce detectable electron recoil signals in the detector. Using the commissioning data from PandaX-4T of 0.63~tonne$\cdot$year exposure, we… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures

  38. arXiv:2403.08219  [pdf, other

    cs.RO

    SpaceOctopus: An Octopus-inspired Motion Planning Framework for Multi-arm Space Robot

    Authors: Wenbo Zhao, Shengjie Wang, Yixuan Fan, Yang Gao, Tao Zhang

    Abstract: Space robots have played a critical role in autonomous maintenance and space junk removal. Multi-arm space robots can efficiently complete the target capture and base reorientation tasks due to their flexibility and the collaborative capabilities between the arms. However, the complex coupling properties arising from both the multiple arms and the free-floating base present challenges to the motio… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 8 pages, 9 figures

  39. arXiv:2403.06340  [pdf, other

    quant-ph

    Error-Mitigated Quantum Random Access Memory

    Authors: Wenbo Shi, Neel Kanth Kundu, Matthew R. McKay, Robert Malaney

    Abstract: As an alternative to quantum error correction, quantum error mitigation methods, including Zero-Noise Extrapolation (ZNE), have been proposed to alleviate run-time errors in current noisy quantum devices. In this work, we propose a modified version of ZNE that provides for a significant performance enhancement on current noisy devices. Our modified ZNE method extrapolates to zero-noise data by eva… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  40. arXiv:2403.06220  [pdf, other

    hep-ex physics.ins-det

    Detecting Neutrinos from Supernova Bursts in PandaX-4T

    Authors: Binyu Pang, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Yanlin Huang, Junting Huang, Zhou Huang, Ruquan Hou , et al. (71 additional authors not shown)

    Abstract: Neutrinos from core-collapse supernovae are essential for the understanding of neutrino physics and stellar evolution. The dual-phase xenon dark matter detectors can provide a way to track explosions of galactic supernovae by detecting neutrinos through coherent elastic neutrino-nucleus scatterings. In this study, a variation of progenitor masses as well as explosion models are assumed to predict… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 9 pages,6 figures

  41. arXiv:2403.06070  [pdf, other

    cs.CV cs.HC

    Reframe Anything: LLM Agent for Open World Video Reframing

    Authors: Jiawang Cao, Yongliang Wu, Weiheng Chi, Wenbo Zhu, Ziyue Su, Jay Wu

    Abstract: The proliferation of mobile devices and social media has revolutionized content dissemination, with short-form video becoming increasingly prevalent. This shift has introduced the challenge of video reframing to fit various screen aspect ratios, a process that highlights the most compelling parts of a video. Traditionally, video reframing is a manual, time-consuming task requiring professional exp… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 14 pages, 6 figures

  42. arXiv:2403.05047  [pdf, other

    cs.CV

    REPS: Reconstruction-based Point Cloud Sampling

    Authors: Guoqing Zhang, Wenbo Zhao, Jian Liu, Xianming Liu

    Abstract: Sampling is widely used in various point cloud tasks as it can effectively reduce resource consumption. Recently, some methods have proposed utilizing neural networks to optimize the sampling process for various task requirements. Currently, deep downsampling methods can be categorized into two main types: generative-based and score-based. Generative-based methods directly generate sampled point c… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: project page: https://github.com/hitcslj/REPS

  43. arXiv:2403.04239  [pdf, other

    physics.ins-det hep-ex

    Signal Response Model in PandaX-4T

    Authors: Yunyang Luo, Zihao Bo, Shibo Zhang, Abdusalam Abdukerim, Chen Cheng, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Yanlin Huang, Zhou Huang , et al. (66 additional authors not shown)

    Abstract: PandaX-4T experiment is a deep-underground dark matter direct search experiment that employs a dual-phase time projection chamber with a sensitive volume containing 3.7 tonne of liquid xenon. The detector of PandaX-4T is capable of simultaneously collecting the primary scintillation and ionization signals, utilizing their ratio to discriminate dark matter signals from background sources such as ga… ▽ More

    Submitted 14 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  44. arXiv:2403.02767  [pdf, other

    cs.CV

    DeconfuseTrack:Dealing with Confusion for Multi-Object Tracking

    Authors: Cheng Huang, Shoudong Han, Mengyu He, Wenbo Zheng, Yuhao Wei

    Abstract: Accurate data association is crucial in reducing confusion, such as ID switches and assignment errors, in multi-object tracking (MOT). However, existing advanced methods often overlook the diversity among trajectories and the ambiguity and conflicts present in motion and appearance cues, leading to confusion among detections, trajectories, and associations when performing simple global data associ… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR2024

  45. arXiv:2403.02601  [pdf, other

    eess.IV cs.CV

    Low-Res Leads the Way: Improving Generalization for Super-Resolution by Self-Supervised Learning

    Authors: Haoyu Chen, Wenbo Li, **** Gu, **g**g Ren, Haoze Sun, Xueyi Zou, Zhensong Zhang, Youliang Yan, Lei Zhu

    Abstract: For image super-resolution (SR), bridging the gap between the performance on synthetic datasets and real-world degradation scenarios remains a challenge. This work introduces a novel "Low-Res Leads the Way" (LWay) training framework, merging Supervised Pre-training with Self-supervised Learning to enhance the adaptability of SR models to real-world images. Our approach utilizes a low-resolution (L… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  46. arXiv:2403.02297  [pdf, other

    cs.RO

    Uncertainty-Aware Prediction and Application in Planning for Autonomous Driving: Definitions, Methods, and Comparison

    Authors: Wenbo Shao, Jiahui Xu, Zhong Cao, Hong Wang, Jun Li

    Abstract: Autonomous driving systems face the formidable challenge of navigating intricate and dynamic environments with uncertainty. This study presents a unified prediction and planning framework that concurrently models short-term aleatoric uncertainty (SAU), long-term aleatoric uncertainty (LAU), and epistemic uncertainty (EU) to predict and establish a robust foundation for planning in dynamic contexts… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 14 pages, 7 figures

  47. arXiv:2403.01826  [pdf, other

    cs.CE

    A Novel Shortest Path Query Algorithm Based on Optimized Adaptive Topology Structure

    Authors: Xiao Fang, Xuyang Song, Jiyuan Ma, Guanhua Liu, Shurong Pang, Wenbo Zhao, Cong Cao, Ling Fan

    Abstract: Urban rail transit is a fundamental component of public transportation, however, commonly station-based path search algorithms often overlook the impact of transfer times on search results, leading to decreased accuracy. To solve this problem, this paper proposes a novel shortest path query algorithm based on adaptive topology optimization called the Adaptive Topology Extension Road Network Struct… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  48. arXiv:2403.01733  [pdf, other

    cs.CV

    3D Hand Reconstruction via Aggregating Intra and Inter Graphs Guided by Prior Knowledge for Hand-Object Interaction Scenario

    Authors: Feng Shuang, Wenbo He, Shaodong Li

    Abstract: Recently, 3D hand reconstruction has gained more attention in human-computer cooperation, especially for hand-object interaction scenario. However, it still remains huge challenge due to severe hand-occlusion caused by interaction, which contain the balance of accuracy and physical plausibility, highly nonlinear map** of model parameters and occlusion feature enhancement. To overcome these issue… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  49. arXiv:2403.01074  [pdf

    physics.app-ph

    Eavesdrop** risk evaluation for wavy-surface-assisted terahertz channel in emulated rain

    Authors: Peian Li, Wenbo Liu, Da Li, Mingxia Zhang, Xiaopeng Wang, Houjun Sun, Jianjun Ma

    Abstract: The advancement of non-line-of-sight (NLOS) data transmission through reflective methods plays a pivotal role in enhancing communication efficiency and expanding user reach. However, this innovation introduces significant eavesdrop** risks, particularly magnified by the complex scattering effects encountered under adverse weather conditions. This study delves into the assessment of eavesdrop**… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Submitted to Optics Express

  50. arXiv:2403.00585  [pdf, other

    cs.IT

    Decentralized Uncoded Storage Elastic Computing with Heterogeneous Computation Speeds

    Authors: Wenbo Huang, Xudong You, Kai Wan, Robert Caiming Qiu, Mingyue Ji

    Abstract: Elasticity plays an important role in modern cloud computing systems. Elastic computing allows virtual machines (i.e., computing nodes) to be preempted when high-priority jobs arise, and also allows new virtual machines to participate in the computation. In 2018, Yang et al. introduced Coded Storage Elastic Computing (CSEC) to address the elasticity using coding technology, with lower storage and… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 10 pages, 8 figures, submitted to ISIT2024