Skip to main content

Showing 1–50 of 211 results for author: Wen, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11334  [pdf, other

    cs.AI

    Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment

    Authors: Chao Wen, Jacqueline Staub, Adish Singla

    Abstract: Large language and multimodal models have shown remarkable successes on various benchmarks focused on specific skills such as general-purpose programming, natural language understanding, math word problem-solving, and visual question answering. However, it is unclear how well these models perform on tasks that require a combination of these skills. In this paper, we curate a novel program synthesi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2405.18291  [pdf, other

    cs.LG cs.AI cs.DC

    FedSAC: Dynamic Submodel Allocation for Collaborative Fairness in Federated Learning

    Authors: Zihui Wang, Zheng Wang, Lingjuan Lyu, Zhaopeng Peng, Zhicheng Yang, Chenglu Wen, Rongshan Yu, Cheng Wang, Xiaoliang Fan

    Abstract: Collaborative fairness stands as an essential element in federated learning to encourage client participation by equitably distributing rewards based on individual contributions. Existing methods primarily focus on adjusting gradient allocations among clients to achieve collaborative fairness. However, they frequently overlook crucial factors such as maintaining consistency across local models and… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD'24

  3. arXiv:2405.13403  [pdf, other

    eess.IV cs.MM

    Adaptive Wireless Image Semantic Transmission and Over-The-Air Testing

    Authors: Jiarun Ding, Peiwen Jiang, Chao-Kai Wen, Shi **

    Abstract: Semantic communication has undergone considerable evolution due to the recent rapid development of artificial intelligence (AI), significantly enhancing both communication robustness and efficiency. Despite these advancements, most current semantic communication methods for image transmission pay little attention to the differing importance of objects and backgrounds in images. To address this iss… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2405.02173  [pdf, other

    cs.HC cs.CY

    Task Synthesis for Elementary Visual Programming in XLogoOnline Environment

    Authors: Chao Wen, Ahana Ghosh, Jacqueline Staub, Adish Singla

    Abstract: In recent years, the XLogoOnline programming platform has gained popularity among novice learners. It integrates the Logo programming language with visual programming, providing a visual interface for learning computing concepts. However, XLogoOnline offers only a limited set of tasks, which are inadequate for learners to master the computing concepts that require sufficient practice. To address t… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted as a paper at the AIED'24 conference in the late-breaking results track

  5. arXiv:2404.19134  [pdf, other

    cs.CV

    Evaluating Deep Clustering Algorithms on Non-Categorical 3D CAD Models

    Authors: Siyuan Xiang, Chin Tseng, Congcong Wen, Deshana Desai, Yifeng Kou, Binil Starly, Daniele Panozzo, Chen Feng

    Abstract: We introduce the first work on benchmarking and evaluating deep clustering algorithms on large-scale non-categorical 3D CAD models. We first propose a workflow to allow expert mechanical engineers to efficiently annotate 252,648 carefully sampled pairwise CAD model similarities, from a subset of the ABC dataset with 22,968 shapes. Using seven baseline deep clustering methods, we then investigate t… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  6. arXiv:2404.16493  [pdf, other

    cs.CV

    Commonsense Prototype for Outdoor Unsupervised 3D Object Detection

    Authors: Hai Wu, Shijia Zhao, Xun Huang, Chenglu Wen, Xin Li, Cheng Wang

    Abstract: The prevalent approaches of unsupervised 3D object detection follow cluster-based pseudo-label generation and iterative self-training processes. However, the challenge arises due to the sparsity of LiDAR scans, which leads to pseudo-labels with erroneous size and position, resulting in subpar detection performance. To tackle this problem, this paper introduces a Commonsense Prototype-based Detecto… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  7. arXiv:2404.15131  [pdf, other

    cs.RO

    Optimizing Multi-Touch Textile and Tactile Skin Sensing Through Circuit Parameter Estimation

    Authors: Bo Ying Su, Yuchen Wu, Chengtao Wen, Changliu Liu

    Abstract: Tactile and textile skin technologies have become increasingly important for enhancing human-robot interaction and allowing robots to adapt to different environments. Despite notable advancements, there are ongoing challenges in skin signal processing, particularly in achieving both accuracy and speed in dynamic touch sensing. This paper introduces a new framework that poses the touch sensing prob… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  8. arXiv:2404.11536  [pdf, other

    cs.LG cs.AI

    FedPFT: Federated Proxy Fine-Tuning of Foundation Models

    Authors: Zhaopeng Peng, Xiaoliang Fan, Yufan Chen, Zheng Wang, Shirui Pan, Chenglu Wen, Ruisheng Zhang, Cheng Wang

    Abstract: Adapting Foundation Models (FMs) for downstream tasks through Federated Learning (FL) emerges a promising strategy for protecting data privacy and valuable FMs. Existing methods fine-tune FM by allocating sub-FM to clients in FL, however, leading to suboptimal performance due to insufficient tuning and inevitable error accumulations of gradients. In this paper, we propose Federated Proxy Fine-Tuni… ▽ More

    Submitted 28 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI'24

  9. arXiv:2404.04783  [pdf, other

    cs.IT eess.SP

    Fourier Transform-based Wavenumber Domain 3D Imaging in RIS-aided Communication Systems

    Authors: Yixuan Huang, Jie Yang, Wankai Tang, Chao-Kai Wen, Shi **

    Abstract: Radio imaging is rapidly gaining prominence in the design of future communication systems, with the potential to utilize reconfigurable intelligent surfaces (RISs) as imaging apertures. Although the sparsity of targets in three-dimensional (3D) space has led most research to adopt compressed sensing (CS)-based imaging algorithms, these often require substantial computational and memory burdens. Dr… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 16 pages, 11 figures, submitted to IEEE for possible publication

  10. arXiv:2404.00795  [pdf, other

    cs.SE

    Towards Practical Requirement Analysis and Verification: A Case Study on Software IP Components in Aerospace Embedded Systems

    Authors: Zhi Ma, Cheng Wen, Jie Su, Ming Zhao, Bin Yu, Xu Lu, Cong Tian

    Abstract: IP-based software design is a crucial research field that aims to improve efficiency and reliability by reusing complex software components known as intellectual property (IP) components. To ensure the reusability of these components, particularly in security-sensitive software systems, it is necessary to analyze the requirements and perform formal verification for each IP component. However, conv… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  11. arXiv:2404.00762  [pdf, other

    cs.SE

    Enchanting Program Specification Synthesis by Large Language Models using Static Analysis and Program Verification

    Authors: Cheng Wen, Jialun Cao, Jie Su, Zhiwu Xu, Shengchao Qin, Mengda He, Haokun Li, Shing-Chi Cheung, Cong Tian

    Abstract: Formal verification provides a rigorous and systematic approach to ensure the correctness and reliability of software systems. Yet, constructing specifications for the full proof relies on domain expertise and non-trivial manpower. In view of such needs, an automated approach for specification synthesis is desired. While existing automated approaches are limited in their versatility, i.e., they ei… ▽ More

    Submitted 2 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  12. arXiv:2403.19501  [pdf, other

    cs.CV

    RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method

    Authors: Ming Yan, Yan Zhang, Shuqiang Cai, Shuqi Fan, Xincheng Lin, Yudi Dai, Siqi Shen, Chenglu Wen, Lan Xu, Yuexin Ma, Cheng Wang

    Abstract: Comprehensive capturing of human motions requires both accurate captures of complex poses and precise localization of the human within scenes. Most of the HPE datasets and methods primarily rely on RGB, LiDAR, or IMU data. However, solely using these modalities or a combination of them may not be adequate for HPE, particularly for complex and fast movements. For holistic human motion understanding… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR2024, Project website: http://www.lidarhumanmotion.net/reli11d/

  13. arXiv:2403.11764  [pdf, other

    cs.IT eess.SP

    RIS-aided Single-frequency 3D Imaging by Exploiting Multi-view Image Correlations

    Authors: Yixuan Huang, Jie Yang, Chao-Kai Wen, Shi **

    Abstract: Retrieving range information in three-dimensional (3D) radio imaging is particularly challenging due to the limited communication bandwidth and pilot resources. To address this issue, we consider a reconfigurable intelligent surface (RIS)-aided uplink communication scenario, generating multiple measurements through RIS phase adjustment. This study successfully realizes 3D single-frequency imaging… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 16 pages, 12 figures, accepted by IEEE Transactions on Communications

  14. arXiv:2403.00729  [pdf, other

    cs.CV cs.RO

    Can Transformers Capture Spatial Relations between Objects?

    Authors: Chuan Wen, Dinesh Jayaraman, Yang Gao

    Abstract: Spatial relationships between objects represent key scene information for humans to understand and interact with the world. To study the capability of current computer vision systems to recognize physically grounded spatial relations, we start by proposing precise relation definitions that permit consistently annotating a benchmark dataset. Despite the apparent simplicity of this task relative to… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 21 pages, 8 figures, ICLR 2024

  15. arXiv:2402.18969  [pdf, other

    cs.CV

    OHTA: One-shot Hand Avatar via Data-driven Implicit Priors

    Authors: Xiaozheng Zheng, Chao Wen, Zhuo Su, Zeran Xu, Zhaohu Li, Yang Zhao, Zhou Xue

    Abstract: In this paper, we delve into the creation of one-shot hand avatars, attaining high-fidelity and drivable hand representations swiftly from a single image. With the burgeoning domains of the digital human, the need for quick and personalized hand avatar creation has become increasingly critical. Existing techniques typically require extensive input data and may prove cumbersome or even impractical… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024. Project page: https://zxz267.github.io/OHTA

  16. arXiv:2402.18493  [pdf, other

    cs.CV

    Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection

    Authors: Xun Huang, Hai Wu, Xin Li, Xiaoliang Fan, Chenglu Wen, Cheng Wang

    Abstract: LiDAR-based 3D object detection models have traditionally struggled under rainy conditions due to the degraded and noisy scanning signals. Previous research has attempted to address this by simulating the noise from rain to improve the robustness of detection models. However, significant disparities exist between simulated and actual rain-impacted data points. In this work, we propose a novel rain… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI2024

  17. arXiv:2402.09546  [pdf, other

    cs.RO cs.AI

    How Secure Are Large Language Models (LLMs) for Navigation in Urban Environments?

    Authors: Congcong Wen, Jiazhao Liang, Shuaihang Yuan, Hao Huang, Yi Fang

    Abstract: In the field of robotics and automation, navigation systems based on Large Language Models (LLMs) have recently shown impressive performance. However, the security aspects of these systems have received relatively less attention. This paper pioneers the exploration of vulnerabilities in LLM-based navigation models in urban outdoor environments, a critical area given the technology's widespread app… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  18. arXiv:2401.11445  [pdf, other

    cs.RO eess.SY

    Towards Non-Robocentric Dynamic Landing of Quadrotor UAVs

    Authors: Li-Yu Lo, Boyang Li, Chih-Yung Wen, Ching-Wei Chang

    Abstract: In this work, we propose a dynamic landing solution without the need for onboard exteroceptive sensors and an expensive computation unit, where all localization and control modules are carried out on the ground in a non-inertial frame. Our system starts with a relative state estimator of the aerial robot from the perspective of the landing platform, where the state tracking of the UAV is done thro… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  19. arXiv:2401.11439  [pdf, other

    cs.RO cs.AI cs.CV

    General Flow as Foundation Affordance for Scalable Robot Learning

    Authors: Chengbo Yuan, Chuan Wen, Tong Zhang, Yang Gao

    Abstract: We address the challenge of acquiring real-world manipulation skills with a scalable framework.Inspired by the success of large-scale auto-regressive prediction in Large Language Models (LLMs), we hold the belief that identifying an appropriate prediction target capable of leveraging large-scale datasets is crucial for achieving efficient and universal learning. Therefore, we propose to utilize fl… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  20. arXiv:2401.00025  [pdf, other

    cs.RO cs.CV

    Any-point Trajectory Modeling for Policy Learning

    Authors: Chuan Wen, Xingyu Lin, John So, Kai Chen, Qi Dou, Yang Gao, Pieter Abbeel

    Abstract: Learning from demonstration is a powerful method for teaching robots new skills, and having more demonstration data often improves policy learning. However, the high cost of collecting demonstration data is a significant bottleneck. Videos, as a rich data source, contain knowledge of behaviors, physics, and semantics, but extracting control-specific information from them is challenging due to the… ▽ More

    Submitted 16 February, 2024; v1 submitted 28 December, 2023; originally announced January 2024.

    Comments: 16 pages, 13 figures

  21. arXiv:2312.14495  [pdf, other

    cs.SI cs.IT eess.SP

    Beam Foreseeing in Millimeter-Wave Systems with Situational Awareness: Fundamental Limits via Cramér-Rao Lower Bound

    Authors: Wan-Ting Shih, Chao-Kai Wen, Shang-Ho Tsai, Shi **, Chau Yuen

    Abstract: Millimeter-wave (mmWave) networks offer the potential for high-speed data transfer and precise localization, leveraging large antenna arrays and extensive bandwidths. However, these networks are challenged by significant path loss and susceptibility to blockages. In this study, we delve into the use of situational awareness for beam prediction within the 5G NR beam management framework. We introdu… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 16 pages, 10 figures; IEEE Transactions on Wireless Communications

  22. arXiv:2312.14453  [pdf, other

    cs.RO eess.SY

    Hybrid Aerodynamics-Based Model Predictive Control for a Tail-Sitter UAV

    Authors: Bailun Jiang, Boyang Li, Ching-Wei Chang, Chih-Yung Wen

    Abstract: It is challenging to model and control a tail-sitter unmanned aerial vehicle (UAV) because its blended wing body generates complicated nonlinear aerodynamic effects, such as wing lift, fuselage drag, and propeller-wing interactions. We therefore devised a hybrid aerodynamic modeling method and model predictive control (MPC) design for a quadrotor tail-sitter UAV. The hybrid model consists of the N… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  23. arXiv:2312.08664  [pdf, other

    cs.CV

    SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration

    Authors: Kezheng Xiong, Maoji Zheng, Qingshan Xu, Chenglu Wen, Siqi Shen, Cheng Wang

    Abstract: Point cloud registration, a fundamental task in 3D computer vision, has remained largely unexplored in cross-source point clouds and unstructured scenes. The primary challenges arise from noise, outliers, and variations in scale and density. However, neglected geometric natures of point clouds restricts the performance of current methods. In this paper, we propose a novel method termed SPEAL to le… ▽ More

    Submitted 3 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  24. arXiv:2312.08591  [pdf, other

    cs.CV

    Joint2Human: High-quality 3D Human Generation via Compact Spherical Embedding of 3D Joints

    Authors: Muxin Zhang, Qiao Feng, Zhuo Su, Chao Wen, Zhou Xue, Kun Li

    Abstract: 3D human generation is increasingly significant in various applications. However, the direct use of 2D generative methods in 3D generation often results in losing local details, while methods that reconstruct geometry from generated images struggle with global view consistency. In this work, we introduce Joint2Human, a novel method that leverages 2D diffusion models to generate detailed 3D human g… ▽ More

    Submitted 6 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  25. arXiv:2311.15950  [pdf, other

    cs.IT cs.AI

    Auto-CsiNet: Scenario-customized Automatic Neural Network Architecture Generation for Massive MIMO CSI Feedback

    Authors: Xiangyi Li, Jiajia Guo, Chao-Kai Wen, Shi **

    Abstract: Deep learning has revolutionized the design of the channel state information (CSI) feedback module in wireless communications. However, designing the optimal neural network (NN) architecture for CSI feedback can be a laborious and time-consuming process. Manual design can be prohibitively expensive for customizing NNs to different scenarios. This paper proposes using neural architecture search (NA… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 16 pages, 10 figures, 6 tables

  26. arXiv:2311.15313  [pdf, ps, other

    eess.SP cs.IT

    Low-Complexity Joint Beamforming for RIS-Assisted MU-MISO Systems Based on Model-Driven Deep Learning

    Authors: Weijie **, **g Zhang, Chao-Kai Wen, Shi **, Xiao Li, Shuangfeng Han

    Abstract: Reconfigurable intelligent surfaces (RIS) can improve signal propagation environments by adjusting the phase of the incident signal. However, optimizing the phase shifts jointly with the beamforming vector at the access point is challenging due to the non-convex objective function and constraints. In this study, we propose an algorithm based on weighted minimum mean square error optimization and p… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 14 pages, 9 figures, 2 tables. This paper has been accepted for publication by the IEEE Transactions on Wireless Communications. Copyright may be transferred without notice, after which this version may no longer be accessible

  27. arXiv:2311.06916  [pdf

    eess.SY cs.AI

    TSViT: A Time Series Vision Transformer for Fault Diagnosis

    Authors: Shouhua Zhang, Jiehan Zhou, Xue Ma, Chenglin Wen, Susanna Pirttikangas, Chen Yu, Weishan Zhang, Chunsheng Yang

    Abstract: Traditional fault diagnosis methods using Convolutional Neural Networks (CNNs) face limitations in capturing temporal features (i.e., the variation of vibration signals over time). To address this issue, this paper introduces a novel model, the Time Series Vision Transformer (TSViT), specifically designed for fault diagnosis. On one hand, TSViT model integrates a convolutional layer to segment vib… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  28. On Finding Bi-objective Pareto-optimal Fraud Prevention Rule Sets for Fintech Applications

    Authors: Chengyao Wen, Yin Lou

    Abstract: Rules are widely used in Fintech institutions to make fraud prevention decisions, since rules are highly interpretable thanks to their intuitive if-then structure. In practice, a two-stage framework of fraud prevention decision rule set mining is usually employed in large Fintech institutions; Stage 1 generates a potentially large pool of rules and Stage 2 aims to produce a refined rule subset acc… ▽ More

    Submitted 27 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  29. arXiv:2311.00390  [pdf, other

    cs.RO

    A Modular Pneumatic Soft Gripper Design for Aerial Gras** and Landing

    Authors: Hiu Ching Cheung, Ching-Wei Chang, Bailun Jiang, Chih-Yung Wen, Henry K. Chu

    Abstract: Aerial robots have garnered significant attention due to their potential applications in various industries, such as inspection, search and rescue, and drone delivery. Successful missions often depend on the ability of these robots to grasp and land effectively. This paper presents a novel modular soft gripper design tailored explicitly for aerial gras** and landing operations. The proposed modu… ▽ More

    Submitted 25 March, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 7 pages, 13 figures, accepted by IEEE RoboSoft 2024

  30. arXiv:2310.07433  [pdf, other

    cs.RO cs.AI cs.LG

    Imitation Learning from Observation with Automatic Discount Scheduling

    Authors: Yuyang Liu, Weijun Dong, Yingdong Hu, Chuan Wen, Zhao-Heng Yin, Chongjie Zhang, Yang Gao

    Abstract: Humans often acquire new skills through observation and imitation. For robotic agents, learning from the plethora of unlabeled video demonstration data available on the Internet necessitates imitating the expert without access to its action, presenting a challenge known as Imitation Learning from Observations (ILfO). A common approach to tackle ILfO problems is to convert them into inverse reinfor… ▽ More

    Submitted 7 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024

  31. arXiv:2309.15941  [pdf, other

    cs.CV

    AutoEncoding Tree for City Generation and Applications

    Authors: Wenyu Han, Congcong Wen, Lazarus Chok, Yan Liang Tan, Sheung Lung Chan, Hang Zhao, Chen Feng

    Abstract: City modeling and generation have attracted an increased interest in various applications, including gaming, urban planning, and autonomous driving. Unlike previous works focused on the generation of single objects or indoor scenes, the huge volumes of spatial data in cities pose a challenge to the generative models. Furthermore, few publicly available 3D real-world city datasets also hinder the d… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  32. arXiv:2309.04590  [pdf, other

    cs.RO eess.SY

    Robotic Defect Inspection with Visual and Tactile Perception for Large-scale Components

    Authors: Arpit Agarwal, Abhiroop Ajith, Chengtao Wen, Veniamin Stryzheus, Brian Miller, Matthew Chen, Micah K. Johnson, Jose Luis Susa Rincon, Justinian Rosca, Wenzhen Yuan

    Abstract: In manufacturing processes, surface inspection is a key requirement for quality assessment and damage localization. Due to this, automated surface anomaly detection has become a promising area of research in various industrial inspection systems. A particular challenge in industries with large-scale components, like aircraft and heavy machinery, is inspecting large parts with very small defect dim… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: This is a pre-print for International Conference on Intelligent Robots and Systems 2023 publication

  33. arXiv:2308.11335  [pdf, other

    cs.IT eess.SP

    Graph Neural Network-Enhanced Expectation Propagation Algorithm for MIMO Turbo Receivers

    Authors: Xingyu Zhou, **g Zhang, Chao-Kai Wen, Shi **, Shuangfeng Han

    Abstract: Deep neural networks (NNs) are considered a powerful tool for balancing the performance and complexity of multiple-input multiple-output (MIMO) receivers due to their accurate feature extraction, high parallelism, and excellent inference ability. Graph NNs (GNNs) have recently demonstrated outstanding capability in learning enhanced message passing rules and have shown success in overcoming the dr… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 15 pages, 12 figures, 2 tables. This paper has been accepted for publication by the IEEE Transactions on Signal Processing. Copyright may be transferred without notice, after which this version may no longer be accessible

  34. arXiv:2308.08855  [pdf, other

    cs.CV

    Realistic Full-Body Tracking from Sparse Observations via Joint-Level Modeling

    Authors: Xiaozheng Zheng, Zhuo Su, Chao Wen, Zhou Xue, Xiaojie **

    Abstract: To bridge the physical and virtual worlds for rapidly developed VR/AR applications, the ability to realistically drive 3D full-body avatars is of great significance. Although real-time body tracking with only the head-mounted displays (HMDs) and hand controllers is heavily under-constrained, a carefully designed end-to-end neural network is of great potential to solve the problem by learning from… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023. Project page: https://zxz267.github.io/AvatarJLM

  35. arXiv:2308.06562  [pdf, other

    eess.SP cs.IT

    Gradient-Based Markov Chain Monte Carlo for MIMO Detection

    Authors: Xingyu Zhou, Le Liang, **g Zhang, Chao-Kai Wen, Shi **

    Abstract: Accurately detecting symbols transmitted over multiple-input multiple-output (MIMO) wireless channels is crucial in realizing the benefits of MIMO techniques. However, optimal MIMO detection is associated with a complexity that grows exponentially with the MIMO dimensions and quickly becomes impractical. Recently, stochastic sampling-based Bayesian inference techniques, such as Markov chain Monte… ▽ More

    Submitted 5 December, 2023; v1 submitted 12 August, 2023; originally announced August 2023.

    Comments: 16 pages, 12 figures, 2 tables. This paper has been accepted for publication by the IEEE Transactions on Wireless Communications. Copyright may be transferred without notice, after which this version may no longer be accessible

  36. arXiv:2308.03016  [pdf, other

    cs.IT

    Sha** a Smarter Electromagnetic Landscape: IAB, NCR, and RIS in 5G Standard and Future 6G

    Authors: Chao-Kai Wen, Lung-Sheng Tsai, Arman Shojaeifard, Pei-Kai Liao, Kai-Kit Wong, Chan-Byoung Chae

    Abstract: The main objective of 5G and beyond networks is to provide an optimal user experience in terms of throughput and reliability, irrespective of location and time. To achieve this, traditional fixed macro base station deployments are being replaced by more innovative and flexible solutions, such as wireless backhaul and relays. This article focuses on the evolution and standardization of these advanc… ▽ More

    Submitted 18 January, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures, 1 table. This work has been accepted to publish in IEEE Communications Standards Magazine

  37. Data-Driven Modeling with Experimental Augmentation for the Modulation Strategy of the Dual-Active-Bridge Converter

    Authors: Xinze Li, Josep Pou, Jiaxin Dong, Fanfan Lin, Changyun Wen, Suvajit Mukherjee, Xin Zhang

    Abstract: For the performance modeling of power converters, the mainstream approaches are essentially knowledge-based, suffering from heavy manpower burden and low modeling accuracy. Recent emerging data-driven techniques greatly relieve human reliance by automatic modeling from simulation data. However, model discrepancy may occur due to unmodeled parasitics, deficient thermal and magnetic models, unpredic… ▽ More

    Submitted 2 August, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: 11 pages

    Journal ref: IEEE.Trans.Ind.Electron. Early Access (2023) 1-11

  38. arXiv:2307.15290  [pdf, other

    cs.CL

    ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation

    Authors: Cheng Wen, Xianghui Sun, Shuaijiang Zhao, Xiaoquan Fang, Liangyu Chen, Wei Zou

    Abstract: This paper presents the development and evaluation of ChatHome, a domain-specific language model (DSLM) designed for the intricate field of home renovation. Considering the proven competencies of large language models (LLMs) like GPT-4 and the escalating fascination with home renovation, this study endeavors to reconcile these aspects by generating a dedicated model that can yield high-fidelity, p… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: ChatHome,DSLM for home renovation

  39. arXiv:2307.15280  [pdf, other

    cs.IT eess.SP

    Active RIS-Assisted MIMO-OFDM System: Analyses and Prototype Measurements

    Authors: De-Ming Chian, Feng-Ji Chen, Yu-Chen Chang, Chao-Kai Wen, Chi-Hung Wu, Fu-Kang Wang, Kai-Kit Wong, Chan-Byoung Chae

    Abstract: In this study, we develop an active reconfigurable intelligent surface (RIS)-assisted multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) prototype compliant with the 5G New Radio standard at 3.5~GHz. The experimental results clearly indicate that active RIS plays a vital role in enhancing MIMO performance, surpassing passive RIS. Furthermore, when considering fac… ▽ More

    Submitted 14 November, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: 5 pages, 5 figures, 1 table, accepted by IEEE Communications Letters, for demo video see: https://www.youtube.com/watch?v=3R6eZXizwns

  40. arXiv:2307.15266  [pdf, other

    cs.CV

    RSGPT: A Remote Sensing Vision Language Model and Benchmark

    Authors: Yuan Hu, Jianlong Yuan, Congcong Wen, Xiaonan Lu, Xiang Li

    Abstract: The emergence of large-scale large language models, with GPT-4 as a prominent example, has significantly propelled the rapid advancement of artificial general intelligence and sparked the revolution of Artificial Intelligence 2.0. In the realm of remote sensing (RS), there is a growing interest in develo** large vision language models (VLMs) specifically tailored for data analysis in this domain… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  41. arXiv:2307.12049  [pdf, other

    cs.CV

    Patch-Wise Point Cloud Generation: A Divide-and-Conquer Approach

    Authors: Cheng Wen, Baosheng Yu, Rao Fu, Dacheng Tao

    Abstract: A generative model for high-fidelity point clouds is of great importance in synthesizing 3d environments for applications such as autonomous driving and robotics. Despite the recent success of deep generative models for 2d images, it is non-trivial to generate 3d point clouds without a comprehensive understanding of both local and global geometric structures. In this paper, we devise a new 3d poin… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  42. Joint Beam Management and SLAM for mmWave Communication Systems

    Authors: Hang Que, Jie Yang, Chao-Kai Wen, Shuqiang Xia, Xiao Li, Shi **

    Abstract: The millimeter-wave (mmWave) communication technology, which employs large-scale antenna arrays, enables inherent sensing capabilities. Simultaneous localization and map** (SLAM) can utilize channel multipath angle estimates to realize integrated sensing and communication design in 6G communication systems. However, existing works have ignored the significant overhead required by the mmWave beam… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Journal ref: IEEE Transactions on Communications, early access, July 2023

  43. BPNet: Bézier Primitive Segmentation on 3D Point Clouds

    Authors: Rao Fu, Cheng Wen, Qian Li, Xiao Xiao, Pierre Alliez

    Abstract: This paper proposes BPNet, a novel end-to-end deep learning framework to learn Bézier primitive segmentation on 3D point clouds. The existing works treat different primitive types separately, thus limiting them to finite shape categories. To address this issue, we seek a generalized primitive segmentation on point clouds. Taking inspiration from Bézier decomposition on NURBS models, we transfer it… ▽ More

    Submitted 15 October, 2023; v1 submitted 8 July, 2023; originally announced July 2023.

  44. arXiv:2305.12669  [pdf, other

    cs.IT eess.SP

    Angle-based SLAM on 5G mmWave Systems: Design, Implementation, and Measurement

    Authors: Jie Yang, Chao-Kai Wen, **g Xu, Hang Que, Haikun Wei, Shi **

    Abstract: Simultaneous localization and map** (SLAM) is a key technology that provides user equipment (UE) tracking and environment map** services, enabling the deep integration of sensing and communication. The millimeter-wave (mmWave) communication, with its larger bandwidths and antenna arrays, inherently facilitates more accurate delay and angle measurements than sub-6 GHz communication, thereby pro… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted by the IEEE Internet of Things Journal

  45. Joint Localization and Environment Sensing by Harnessing NLOS Components in RIS-aided mmWave Communication Systems

    Authors: Yixuan Huang, Jie Yang, Wankai Tang, Chao-Kai Wen, Shuqiang Xia, Shi **

    Abstract: This study explores the use of non-line-of-sight (NLOS) components in millimeter-wave (mmWave) communication systems for joint localization and environment sensing. The radar cross section (RCS) of a reconfigurable intelligent surface (RIS) is calculated to develop a general path gain model for RISs and traditional scatterers. The results show that RISs have a greater potential to assist in locali… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: 32 pages, 12 figures, accepted by IEEE Transactions on Wireless Communications

    Journal ref: IEEE Transactions on Wireless Communications, early access, April 2023

  46. arXiv:2305.12308  [pdf, other

    cs.IT

    MIMO Evolution toward 6G: End-User-Centric Collaborative MIMO

    Authors: Lung-Sheng Tsai, Shang-Ling Shih, Pei-Kai Liao, Chao-Kai Wen

    Abstract: In 6G, the trend of transitioning from massive antenna elements to even more massive ones is continued. However, installing additional antennas in the limited space of user equipment (UE) is challenging, resulting in limited capacity scaling gain for end users, despite network side support for increasing numbers of antennas. To address this issue, we propose an end-user-centric collaborative MIMO… ▽ More

    Submitted 14 November, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: 7 pages, 5 figures, 1 table. This work has been accepted in IEEE Communications Magazine

  47. arXiv:2305.05726  [pdf, other

    cs.CV cs.AI

    Vision-Language Models in Remote Sensing: Current Progress and Future Trends

    Authors: Xiang Li, Congcong Wen, Yuan Hu, Zhenghang Yuan, Xiao Xiang Zhu

    Abstract: The remarkable achievements of ChatGPT and GPT-4 have sparked a wave of interest and research in the field of large language models for Artificial General Intelligence (AGI). These models provide intelligent solutions close to human thinking, enabling us to use general artificial intelligence to solve problems in various applications. However, in remote sensing (RS), the scientific literature on t… ▽ More

    Submitted 2 April, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted by IEEE Geoscience and Remote Sensing Magazine

  48. arXiv:2305.02464  [pdf, ps, other

    cs.IT eess.SP

    Multi-timescale Channel Customization for Transmission Design in RIS-assisted MIMO Systems

    Authors: Weicong Chen, Chao-Kai Wen, Xiao Li, Shi **

    Abstract: The performance of transmission schemes is heavily influenced by the wireless channel, which is typically considered an uncontrollable factor. However, the introduction of reconfigurable intelligent surfaces (RISs) to wireless communications enables the customization of a preferred channel for adopted transmissions by resha** electromagnetic waves. In this study, we propose multi-timescale chann… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted by IEEE JSAC special issue on Beyond Shannon Communications: A Paradigm Shift to Catalyze 6G

  49. Channel Customization for Limited Feedback in RIS-assisted FDD Systems

    Authors: Weicong Chen, Chao-Kai Wen, Xiao Li, Michail Matthaiou, Shi **

    Abstract: Reconfigurable intelligent surfaces (RISs) represent a pioneering technology to realize smart electromagnetic environments by resha** the wireless channel. \textcolor[rgb]{0,0,0}{Jointly designing the transceiver and RIS relies on the channel state information (CSI), whose feedback has not been investigated in multi-RIS-assisted frequency division duplexing systems.} In this study, the limited f… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted by IEEE Transactions on Wireless Communications(https://ieeexplore.ieee.org/document/9976945)

  50. arXiv:2304.03713  [pdf, other

    cs.IT eess.SP

    A Novel Channel Model for Reconfigurable Intelligent Surfaces with Consideration of Polarization and Switch Impairments

    Authors: De-Ming Chian, Chao-Kai Wen, Chi-Hung Wu, Fu-Kang Wang, Kai-Kit Wong

    Abstract: Future wireless networks require the ability to actively adjust the wireless environment to meet strict performance indicators. Reconfigurable Intelligent Surface (RIS) technology is gaining attention for its advantages of low power consumption, cost-effectiveness, and ease of deployment. However, existing channel models for RIS often ignore important properties, such as the impairment in the RIS'… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 14 pages, 12 figures, 1 table. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible