Showing 151–200 of 2,227 results for author: Chen, K

  1. arXiv:2404.06480  [pdf, other]

    cs.CL cs.AI

    Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks

    Authors: Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen

    Abstract: Recently, the large language model (LLM) community has shown increasing interest in enhancing LLMs' capability to handle extremely long documents. As various long-text techniques and model architectures emerge, the precise and detailed evaluation of models' long-text capabilities has become increasingly important. Existing long-text evaluation benchmarks, such as L-Eval and LongBench, construct lo…

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  2. arXiv:2404.06221  [pdf, other]

    hep-ph hep-ex

    Polarization and quantum entanglement effects in $B^\pm_c\to J/ψ+π^\pm +π^0$ process

    Authors: Kaiwen Chen, Yiqi Geng, Yichao **, Zhicheng Yan, Ruilin Zhu

    Abstract: Motivated by the very recent observation of the $B^+_c\to J/ψ+π^+ +π^0$ decay using proton-proton collision data by the LHCb collaboration, we study the four-body angular distributions and the quantum entanglement effects in the $B^+_c\to J/ψ+π^+ +π^0$ associated with $J/ψ\to μ^++μ^-$. The helicity angular distributions are given in the QCD effective theory and the von Neumann entropy is obtained…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 5 pages and 5 figures

  3. arXiv:2404.05242  [pdf, other]

    cs.RO

    Collision-Free Trajectory Optimization in Cluttered Environments with Sums-of-Squares Programming

    Authors: Yulin Li, Chunxin Zheng, Kai Chen, Yusen Xie, Xindong Tang, Michael Yu Wang, Jun Ma

    Abstract: In this work, we propose a trajectory optimization approach for robot navigation in cluttered 3D environments. We represent the robot's geometry as a semialgebraic set defined by polynomial inequalities such that robots with general shapes can be suitably characterized. To address the robot navigation task in obstacle-dense environments, we exploit the free space directly to construct a sequence o…

    Submitted 8 April, 2024; originally announced April 2024.

  4. arXiv:2404.04956  [pdf, other]

    cs.CV cs.CR

    Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models

    Authors: Zi** Yang, Kai Zeng, Kejiang Chen, Han Fang, Weiming Zhang, Nenghai Yu

    Abstract: Ethical concerns surrounding copyright protection and inappropriate content generation pose challenges for the practical implementation of diffusion models. One effective solution involves watermarking the generated images. However, existing methods often compromise the model performance or require additional training, which is undesirable for operators and users. To address this issue, we propose…

    Submitted 6 May, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: 17 pages, 11 figures, accepted by CVPR 2024

  5. arXiv:2404.04935  [pdf, other]

    cs.CV

    Anomaly Detection in Electrocardiograms: Advancing Clinical Diagnosis Through Self-Supervised Learning

    Authors: Aofan Jiang, Chaoqin Huang, Qing Cao, Yuchen Xu, Zi Zeng, Kang Chen, Ya Zhang, Yanfeng Wang

    Abstract: The electrocardiogram (ECG) is an essential tool for diagnosing heart disease, with computer-aided systems improving diagnostic accuracy and reducing healthcare costs. Despite advancements, existing systems often miss rare cardiac anomalies that could be precursors to serious, life-threatening issues or alterations in the cardiac macro/microstructure. We address this gap by focusing on self-superv…

    Submitted 7 April, 2024; originally announced April 2024.

  6. arXiv:2404.04619  [pdf, other]

    cs.AI cs.CV

    Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model

    Authors: Zhonghan Zhao, Ke Ma, Wenhao Chai, Xuan Wang, Kewei Chen, Dongxu Guo, Yanting Zhang, Hongwei Wang, Gaoang Wang

    Abstract: With the power of large language models (LLMs), open-ended embodied agents can flexibly understand human instructions, generate interpretable guidance strategies, and output executable actions. Nowadays, Multi-modal Language Models (MLMs) integrate multi-modal signals into LLMs, further bringing richer perception to entity agents and allowing embodied agents to perceive world-understanding tasks m…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.08282

  7. arXiv:2404.04599  [pdf, ps, other]

    quant-ph cs.CC

    Local Test for Unitarily Invariant Properties of Bipartite Quantum States

    Authors: Kean Chen, Qisheng Wang, Zhicheng Zhang

    Abstract: We study the power of local test for bipartite quantum states. Our central result is that, for properties of bipartite pure states, unitary invariance on one part implies an optimal (over all global testers) local tester acting only on the other part. This suggests a canonical local tester for entanglement spectra (i.e., Schmidt coefficients), and reveals that purified samples offer no advantage i…

    Submitted 29 April, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: 51 pages. Compared to [v1], we (i) extended testers with parameterized completeness and soundness, (ii) added new lower bounds for testing the bond dimension of matrix product states (MPS), and (iii) improved the lower bounds for testing Schmidt rank

  8. arXiv:2404.04155  [pdf, other]

    cs.CV

    MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector

    Authors: Junbo Li, Keyan Chen, Gengju Tian, Lu Li, Zhenwei Shi

    Abstract: The segmentation and interpretation of the Martian surface play a pivotal role in Mars exploration, providing essential data for the trajectory planning and obstacle avoidance of rovers. However, the complex topography, similar surface features, and the lack of extensive annotated data pose significant challenges to the high-precision semantic segmentation of the Martian surface. To address these…

    Submitted 5 April, 2024; originally announced April 2024.

  9. Which Experimental Design is Better Suited for VQA Tasks? Eye Tracking Study on Cognitive Load, Performance, and Gaze Allocations

    Authors: Sita A. Vriend, Sandeep Vidyapu, Amer Rama, Kun-Ting Chen, Daniel Weiskopf

    Abstract: We conducted an eye-tracking user study with 13 participants to investigate the influence of stimulus-question ordering and question modality on participants using visual question-answering (VQA) tasks. We examined cognitive load, task performance, and gaze allocations across five distinct experimental designs, aiming to identify setups that minimize the cognitive burden on participants. The colle…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted at ETVIS 2024

  10. arXiv:2404.04016  [pdf, ps, other]

    hep-ph hep-ex

    Flavor-spin symmetry of the $P^N_ψ/H_{Ω_{ccc}}^N$ and $P^Λ_{ψs}/H^Λ_{Ω_{ccc}s}$ molecular states

    Authors: Kan Chen, Bo Wang

    Abstract: Based on a contact lagrangian that incorporates the SU(3) flavor and SU(2) spin symmetries, we discuss the symmetry properties of the interactions among the heavy flavor meson-baryon $P_ψ^N$, $P_{ψs}^Λ$ (with quark components [$n\bar{c}$][$nnc$], [$s\bar{c}$][$nnc$], or [$n\bar{c}$][$nsc$]) systems and di-baryon $H_{Ω_{ccc}}^N$, $H^Λ_{Ω_{ccc}s}$ (with quark components [$nnc$][$ncc$], [$nnc$][…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 17 pages, 10 tables, 4 figures

  11. arXiv:2404.03659  [pdf, other]

    cs.LG cs.CR

    Federated Unlearning for Human Activity Recognition

    Authors: Kongyang Chen, Dong** zhang, Ya** Chai, Weibin Zhang, Shaowei Wang, Jiaxing Shen

    Abstract: The rapid evolution of Internet of Things (IoT) technology has spurred the widespread adoption of Human Activity Recognition (HAR) in various daily life domains. Federated Learning (FL) is frequently utilized to build a global HAR model by aggregating user contributions without transmitting raw individual data. Despite substantial progress in user privacy protection with FL, challenges persist. Re…

    Submitted 17 January, 2024; originally announced April 2024.

  12. arXiv:2404.02041  [pdf, other]

    cs.CV

    SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation

    Authors: Vinkle Srivastav, Keqi Chen, Nicolas Padoy

    Abstract: We present a new self-supervised approach, SelfPose3d, for estimating 3d poses of multiple persons from multiple camera views. Unlike current state-of-the-art fully-supervised methods, our approach does not require any 2d or 3d ground-truth poses and uses only the multi-view input images from a calibrated camera setup and 2d pseudo poses generated from an off-the-shelf 2d human pose estimator. We…

    Submitted 8 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted for CVPR 2024. Code: https://github.com/CAMMA-public/SelfPose3D. Video demo: https://youtu.be/GAqhmUIr2E8

  13. arXiv:2404.01977  [pdf, other]

    stat.ME math.ST

    Least Squares Inference for Data with Network Dependency

    Authors: **g Lei, Kehui Chen, Haeun Moon

    Abstract: We address the inference problem concerning regression coefficients in a classical linear regression model using least squares estimates. The analysis is conducted under circumstances where network dependency exists across units in the sample. Neglecting the dependency among observations may lead to biased estimation of the asymptotic variance and often inflates the Type I error in coefficient inf…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 27 pages, 1 figure

  14. arXiv:2404.01138  [pdf, other]

    quant-ph

    Protocols and Trade-Offs of Quantum State Purification

    Authors: Hongshun Yao, Yu-Ao Chen, Erdong Huang, Kaichu Chen, Xin Wang

    Abstract: Quantum state purification plays a pivotal role in quantum communication and quantum computation, aiming to recover the purified state from multiple copies of an unknown noisy state. This work introduces a general state purification framework designed to achieve the highest fidelity with a specified probability and characterize the associated trade-offs. In particular, for i.i.d. quantum states un…

    Submitted 18 May, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 20 pages including appendix, v2 updated the main results

  15. arXiv:2404.00906  [pdf, other]

    cs.CV

    From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

    Authors: Rongjie Li, Songyang Zhang, Dahua Lin, Kai Chen, Xuming He

    Abstract: Scene graph generation (SGG) aims to parse a visual scene into an intermediate graph representation for downstream reasoning tasks. Despite recent advancements, existing methods struggle to generate scene graphs with novel visual relation concepts. To address this challenge, we introduce a new open-vocabulary SGG framework based on sequence generation. Our framework leverages vision-language pre-t…

    Submitted 24 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  16. arXiv:2404.00834  [pdf, other]

    cs.CV

    Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach

    Authors: Guoqiang Liang, Kanghao Chen, Hangyu Li, Yunfan Lu, Lin Wang

    Abstract: Event camera has recently received much attention for low-light image enhancement (LIE) thanks to their distinct advantages, such as high dynamic range. However, current research is prohibitively restricted by the lack of large-scale, real-world, and spatial-temporally aligned event-image datasets. To this end, we propose a real-world (indoor and outdoor) dataset comprising over 30K pairs of image…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  17. arXiv:2404.00242  [pdf, other]

    cs.CL cs.AI

    DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference

    Authors: **wei Yao, Kaiqi Chen, Kexun Zhang, Jiaxuan You, Binhang Yuan, Zeke Wang, Tao Lin

    Abstract: Given the increasing demand for tree-structured interactions with LLMs, we introduce DeFT (Decoding with Flash Tree-Attention), an IO-aware tree attention algorithm tailored for tree-structured inference. Unlike traditional sequence-based decoding, tree-structured decoding better accommodates modern task requirements, including self-consistency, few-shot prompting, multi-step reasoning, and multi-…

    Submitted 29 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Update DeFT-v2. DeFT-v1 was accepted by ICLR'24 AGI Workshop ( https://openreview.net/forum?id=HqfLHoX8bR ). Code will be released soon

  18. arXiv:2403.19654  [pdf, other]

    cs.CV

    RSMamba: Remote Sensing Image Classification with State Space Model

    Authors: Keyan Chen, Bowen Chen, Chenyang Liu, Wenyuan Li, Zhengxia Zou, Zhenwei Shi

    Abstract: Remote sensing image classification forms the foundation of various understanding tasks, serving a crucial function in remote sensing image interpretation. The recent advancements of Convolutional Neural Networks (CNNs) and Transformers have markedly enhanced classification accuracy. Nonetheless, remote sensing scene classification remains a significant challenge, especially given the complexity a…

    Submitted 28 March, 2024; originally announced March 2024.

  19. arXiv:2403.19646  [pdf, other]

    cs.CV

    Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis

    Authors: Chenyang Liu, Keyan Chen, Haotian Zhang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi

    Abstract: Monitoring changes in the Earth's surface is crucial for understanding natural processes and human impacts, necessitating precise and comprehensive interpretation methodologies. Remote sensing satellite imagery offers a unique perspective for monitoring these changes, leading to the emergence of remote sensing image change interpretation (RSICI) as a significant research focus. Current RSICI techn…

    Submitted 1 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  20. arXiv:2403.18840  [pdf, other]

    hep-th cond-mat.str-el cs.LG hep-ph physics.comp-ph

    Feynman Diagrams as Computational Graphs

    Authors: Pengcheng Hou, Tao Wang, Daniel Cerkoney, Xiansheng Cai, Zhiyi Li, You** Deng, Lei Wang, Kun Chen

    Abstract: We propose a computational graph representation of high-order Feynman diagrams in Quantum Field Theory (QFT), applicable to any combination of spatial, temporal, momentum, and frequency domains. Utilizing the Dyson-Schwinger and parquet equations, our approach effectively organizes these diagrams into a fractal structure of tensor operations, significantly reducing computational redundancy. This a…

    Submitted 27 February, 2024; originally announced March 2024.

  21. arXiv:2403.18776  [pdf, other]

    physics.optics eess.IV

    Breaking the Limitations with Sparse Inputs by Variational Frameworks (BLIss) in Terahertz Super-Resolution 3D Reconstruction

    Authors: Yiyao Zhang, Ke Chen, Shang-Hua Yang

    Abstract: Data acquisition, image processing, and image quality are the long-lasting issues for terahertz (THz) 3D reconstructed imaging. Existing methods are primarily designed for 2D scenarios, given the challenges associated with obtaining super-resolution (SR) data and the absence of an efficient SR 3D reconstruction framework in conventional computed tomography (CT). Here, we demonstrate BLIss, a new a…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 15 pages, 7 figures. Supplemental Document: https://doi.org/10.6084/m9.figshare.24455206

    Journal ref: Optics Express (OE) 2024

  22. arXiv:2403.18344  [pdf, other]

    cs.AI

    LC-LLM: Explainable Lane-Change Intention and Trajectory Predictions with Large Language Models

    Authors: Mingxing Peng, Xusen Guo, Xianda Chen, Meixin Zhu, Kehua Chen, Hao, Yang, Xuesong Wang, Yinhai Wang

    Abstract: To ensure safe driving in dynamic environments, autonomous vehicles should possess the capability to accurately predict the lane change intentions of surrounding vehicles in advance and forecast their future trajectories. Existing motion prediction approaches have ample room for improvement, particularly in terms of long-term prediction accuracy and interpretability. In this paper, we address thes…

    Submitted 27 March, 2024; originally announced March 2024.

  23. Linear Hybrid Asymmetrical Load-Modulated Balanced Amplifier with Multi-Band Reconfigurability and Antenna-VSWR Resilience

    Authors: Jiachen Guo, Yuchen Cao, Kenle Chen

    Abstract: This paper presents the first-ever highly linear and load-insensitive three-way load-modulation power amplifier (PA) based on reconfigurable hybrid asymmetrical load modulated balanced amplifier (H-ALMBA). Through proper amplitude and phase controls, the carrier, control amplifier (CA), and two peaking balanced amplifiers (BA1 and BA2) can form a linear high-order load modulation over wide bandwid…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  24. arXiv:2403.17524  [pdf, other]

    cs.CR cs.CL

    Provably Secure Disambiguating Neural Linguistic Steganography

    Authors: Yuang Qi, Kejiang Chen, Kai Zeng, Weiming Zhang, Nenghai Yu

    Abstract: Recent research in provably secure neural linguistic steganography has overlooked a crucial aspect: the sender must detokenize stegotexts to avoid raising suspicion from the eavesdropper. The segmentation ambiguity problem, which arises when using language models based on subwords, leads to occasional decoding failures in all neural language steganography implementations based on these models. Cur…

    Submitted 26 March, 2024; originally announced March 2024.

  25. arXiv:2403.17297  [pdf, other]

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang, et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m…

    Submitted 25 March, 2024; originally announced March 2024.

  26. arXiv:2403.17010  [pdf, other]

    cs.CV cs.LG cs.RO

    Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding

    Authors: Lingdong Kong, Xiang Xu, Jun Cen, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu

    Abstract: Safety-critical 3D scene understanding tasks necessitate not only accurate but also confident predictions from 3D perception models. This study introduces Calib3D, a pioneering effort to benchmark and scrutinize the reliability of 3D scene understanding models from an uncertainty estimation viewpoint. We comprehensively evaluate 28 state-of-the-art models across 10 diverse 3D datasets, uncovering…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Preprint; 37 pages, 8 figures, 11 tables; Code at https://github.com/ldkong1205/Calib3D

  27. arXiv:2403.16897  [pdf, other]

    cs.CV

    Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

    Authors: Junshu Tang, Yanhong Zeng, Ke Fan, Xuheng Wang, Bo Dai, Kai Chen, Lizhuang Ma

    Abstract: Creating and animating 3D biped cartoon characters is crucial and valuable in various applications. Compared with geometry, the diverse texture design plays an important role in making 3D biped cartoon characters vivid and charming. Therefore, we focus on automatic texture design for cartoon characters based on input instructions. This is challenging for domain-specific requirements and a lack of…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Project page: https://make-it-vivid.github.io/

  28. arXiv:2403.14164  [pdf, ps, other]

    gr-qc hep-th

    Motion of spinning particles around a polymer black hole in loop quantum gravity

    Authors: Ke Chen, Shao-Wen Wei

    Abstract: In the curved spacetime background, the trajectory of a spinning test particle will deviate from the geodesic. Using the effective potential method, we study the motion of a spinning test particle on the equatorial plane of a polymer black hole in loop quantum gravity described by the Mathisson-Papapetrou-Dixon equations with minimal spin-gravity interaction. We find that for the bounded orbits in…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 11 pages, 12 figures

  29. arXiv:2403.13355  [pdf, other]

    cs.CR cs.AI

    BadEdit: Backdooring large language models by model editing

    Authors: Yanzhou Li, Tianlin Li, Kangjie Chen, Jian Zhang, Shangqing Liu, Wenhan Wang, Tianwei Zhang, Yang Liu

    Abstract: Mainstream backdoor attack methods typically demand substantial tuning data for poisoning, limiting their practicality and potentially degrading the overall performance when applied to Large Language Models (LLMs). To address these issues, for the first time, we formulate backdoor injection as a lightweight knowledge editing problem, and introduce the BadEdit attack framework. BadEdit directly alt…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: ICLR 2024

  30. arXiv:2403.13304  [pdf, other]

    cs.CV

    DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

    Authors: Yibo Wang, Ruiyuan Gao, Kai Chen, Kaiqiang Zhou, Yingjie Cai, Lanqing Hong, Zhenguo Li, Lihui Jiang, Dit-Yan Yeung, Qiang Xu, Kai Zhang

    Abstract: Current perceptive models heavily depend on resource-intensive datasets, prompting the need for innovative solutions. Leveraging recent advances in diffusion models, synthetic data, by constructing image inputs from various annotations, proves beneficial for downstream tasks. While prior methods have separately addressed generative and perceptive models, DetDiffusion, for the first time, harmonize…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  31. arXiv:2403.12881  [pdf, other]

    cs.CL

    Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

    Authors: Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao

    Abstract: Open-sourced Large Language Models (LLMs) have achieved great success in various NLP tasks, however, they are still far inferior to API-based models when acting as agents. How to integrate agent ability into general LLMs becomes a crucial and urgent problem. This paper first delivers three key observations: (1) the current agent training corpus is entangled with both formats following and agent re…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Technical Report

  32. arXiv:2403.11662  [pdf, other]

    cs.RO

    FE-DeTr: Keypoint Detection and Tracking in Low-quality Image Frames with Events

    Authors: Xiangyuan Wang, Kuangyi Chen, Wen Yang, Lei Yu, Yannan Xing, Huai Yu

    Abstract: Keypoint detection and tracking in traditional image frames are often compromised by image quality issues such as motion blur and extreme lighting conditions. Event cameras offer potential solutions to these challenges by virtue of their high temporal resolution and high dynamic range. However, they have limited performance in practical applications due to their inherent noise in event data. This…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 7 pages, Accepted by ICRA 2024

  33. arXiv:2403.11484  [pdf, other]

    cs.RO

    Robot Navigation in Unknown and Cluttered Workspace with Dynamical System Modulation in Starshaped Roadmap

    Authors: Kai Chen, Haichao Liu, Yulin Li, Jianghua Duan, Lei Zhu, Jun Ma

    Abstract: This paper presents a novel reactive motion planning framework for navigating robots in unknown and cluttered 2D workspace. Typical existing methods are developed by enforcing the robot staying in free regions represented by the locally extracted ellipse or polygon. Instead, we navigate the robot in free space with an alternate starshaped decomposition, which is calculated directly from real-time…

    Submitted 18 March, 2024; originally announced March 2024.

  34. arXiv:2403.10494  [pdf, other]

    cs.RO

    Lifelong LERF: Local 3D Semantic Inventory Monitoring Using FogROS2

    Authors: Adam Rashid, Chung Min Kim, Justin Kerr, Letian Fu, Kush Hari, Ayah Ahmad, Kaiyuan Chen, Huang Huang, Marcus Gualtieri, Michael Wang, Christian Juette, Nan Tian, Liu Ren, Ken Goldberg

    Abstract: Inventory monitoring in homes, factories, and retail stores relies on maintaining data despite objects being swapped, added, removed, or moved. We introduce Lifelong LERF, a method that allows a mobile robot with minimal compute to jointly optimize a dense language and geometric representation of its surroundings. Lifelong LERF maintains this representation over time by detecting semantic changes…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: See project webpage at: https://sites.google.com/berkeley.edu/lifelonglerf/home

  35. arXiv:2403.09572  [pdf, other]

    cs.CV

    Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

    Authors: Yunhao Gou, Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James T. Kwok, Yu Zhang

    Abstract: Multimodal large language models (MLLMs) have shown impressive reasoning abilities, which, however, are also more vulnerable to jailbreak attacks than their LLM predecessors. Although still capable of detecting unsafe responses, we observe that safety mechanisms of the pre-aligned LLMs in MLLMs can be easily bypassed due to the introduction of image features. To construct robust MLLMs, we propose…

    Submitted 22 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Project Page: https://gyhdog99.github.io/projects/ecso/

  36. arXiv:2403.09486  [pdf, other]

    cs.CV

    SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams

    Authors: Kang Chen, Shiyan Chen, Jiyuan Zhang, Baoyue Zhang, Ya**g Zheng, Tiejun Huang, Zhaofei Yu

    Abstract: Reconstructing a sequence of sharp images from the blurry input is crucial for enhancing our insights into the captured scene and poses a significant challenge due to the limited temporal features embedded in the image. Spike cameras, sampling at rates up to 40,000 Hz, have proven effective in capturing motion features and beneficial for solving this ill-posed problem. Nonetheless, existing method…

    Submitted 1 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 14 pages

  37. arXiv:2403.08604  [pdf, other]

    cs.CL cs.SE

    DevBench: A Comprehensive Benchmark for Software Development

    Authors: Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, **yang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, ** Yang, Dahua Lin, Chao Peng, Kai Chen

    Abstract: Recent advancements in large language models (LLMs) have significantly enhanced their coding capabilities. However, existing benchmarks predominantly focused on simplified or isolated aspects of programming, such as single-file code generation or repository issue debugging, falling short of measuring the full spectrum of challenges raised by real-world programming activities. To this end, we propo…

    Submitted 15 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Our data and code are available at https://github.com/open-compass/DevBench

  38. arXiv:2403.08282  [pdf, other]

    cs.CV

    Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation

    Authors: Zhonghan Zhao, Kewei Chen, Dongxu Guo, Wenhao Chai, Tian Ye, Yanting Zhang, Gaoang Wang

    Abstract: Due to the dynamic and unpredictable open-world setting, navigating complex environments in Minecraft poses significant challenges for multi-agent systems. Agents must interact with the environment and coordinate their actions with other agents to achieve common objectives. However, traditional approaches often struggle to efficiently manage inter-agent communication and task distribution, crucial…

    Submitted 18 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 Workshop on LLM Agents

  39. arXiv:2403.08048  [pdf, other]

    astro-ph.SR astro-ph.GA

    A Spectroscopic Hunt for Post-Red Supergiants in the Large Magellanic Cloud I: Preliminary Results

    Authors: Kaitlyn M. Chen, Trevor Z. Dorn-Wallenstein

    Abstract: Yellow supergiants (YSGs) are rare and poorly understood, and studying them is critical to constraining massive star evolution. We obtained flux-calibrated Magellan Inamori Kyocera Echelle (MIKE) high-resolution spectra of 40 YSGs in the Large Magellanic Cloud (LMC); this sample likely contains post-red supergiants (RSGs). Fitting these data with ATLAS9 model atmospheres, we determined fundamental…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in RNAAS. 4 pages, 1 Table. Comments welcome

  40. arXiv:2403.07901  [pdf, other]

    cs.CV cs.LG

    MIP: CLIP-based Image Reconstruction from PEFT Gradients

    Authors: Peiheng Zhou, Ming Hu, Xiaofei Xie, Yihao Huang, Kangjie Chen, Mingsong Chen

    Abstract: Contrastive Language-Image Pre-training (CLIP) model, as an effective pre-trained multimodal neural network, has been widely used in distributed machine learning tasks, especially Federated Learning (FL). Typically, CLIP-based FL adopts Parameter-Efficient Fine-Tuning (PEFT) for model training, which only fine-tunes adapter parameters or soft prompts rather than the full parameters. Although PEFT…

    Submitted 25 February, 2024; originally announced March 2024.

  41. arXiv:2403.07262  [pdf, other]

    cs.LG cs.AI

    A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective

    Authors: Yunpeng Qing, Shunyu liu, **gyuan Cong, Kaixuan Chen, Yihe Zhou, Mingli Song

    Abstract: Offline reinforcement learning endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the out-of-distribution problem. However, existing works often suffer from the constraint conflict issue when offline datasets are collected from multiple behavior policies, i.…

    Submitted 30 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  42. arXiv:2403.07225  [pdf, other]

    cs.RO

    Stereo-NEC: Enhancing Stereo Visual-Inertial SLAM Initialization with Normal Epipolar Constraints

    Authors: Weihan Wang, Chieh Chou, Ganesh Sevagamoorthy, Kevin Chen, Zheng Chen, Ziyue Feng, Youjie Xia, Feiyang Cai, Yi Xu, Philippos Mordohai

    Abstract: We propose an accurate and robust initialization approach for stereo visual-inertial SLAM systems. Unlike the current state-of-the-art method, which heavily relies on the accuracy of a pure visual SLAM system to estimate inertial variables without updating camera poses, potentially compromising accuracy and robustness, our approach offers a different solution. We realize the crucial impact of prec…

    Submitted 11 March, 2024; originally announced March 2024.

  43. arXiv:2403.06596  [pdf]

    physics.optics

    Ultra-broadband Optical Switching Plasmons Waveguide in Ge Nanowires

    Authors: Xinghui Liu, Kaili Chang, Jiarong Guo, Mengfei Xue, Ran Zhou, Ke Chen, Jianing Chen

    Abstract: Plasmonic devices, with their ultra-high integration density and data-carrying capacity comparable to optical devices, are currently a hot topic in the field of nanophotonic devices. Photodetectors, non-volatile memories, and ultra-compact lasers based on plasmons in low-dimensional materials are emerging at a rapid pace. However, the narrow optical response band and limited of convenient tunable…

    Submitted 11 March, 2024; originally announced March 2024.

  44. arXiv:2403.06504  [pdf, other]

    cs.DC

    Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

    Authors: Changyue Liao, Mo Sun, Zihan Yang, Kaiqi Chen, Binhang Yuan, Fei Wu, Zeke Wang

    Abstract: Recent advances in large language models have brought immense value to the world, with their superior capabilities stemming from the massive number of parameters they utilize. However, even the GPUs with the highest memory capacities, currently peaking at 80GB, are far from sufficient to accommodate these vast parameters and their associated optimizer states when conducting stochastic gradient des…

    Submitted 11 March, 2024; originally announced March 2024.

  45. arXiv:2403.06232  [pdf, other]

    cond-mat.supr-con

    Emergence of Surface Superconductivity through Interference in Superconducting-proximity Topological Insulators

    Authors: Yajiang Chen, Ke-Ji Chen, Jia-Ji Zhu, A. A. Shanenko

    Abstract: Superconducting-proximity topological insulators (STIs) have garnered significant research attention over the past two decades. In this Letter, we demonstrate that a low-dimensional STI in the topological-nontrivial phase (TP) exhibits an interference-induced surface (boundary) superconductivity with the surface critical temperature $T_{cs}$ significantly higher than the bulk one $T_{cb}$. Such a…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures

  46. arXiv:2403.06133  [pdf, other]

    hep-ph hep-ex nucl-th

    Transverse polarization of Lambda hyperons in hadronic collisions

    Authors: Ying Gao, Kai-Bao Chen, Yu-Kun Song, Shu-Yi Wei

    Abstract: The transverse polarization of $Λ$ hyperon within reconstructed jets in hadronic collisions offers a complementary platform to probe the polarized fragmentation function $D_{1T}^\perp$. We illustrate that by performing a global analysis of the transverse polarization of $Λ$ hyperons produced in different kinematic regions and in different hadronic collisions, such as $pp$, $p\bar p$, $pA$, and…

    Submitted 20 June, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 13 pages, 15 figures

  47. arXiv:2403.05832  [pdf, other]

    cs.CE

    Research progress on intelligent optimization techniques for energy-efficient design of ship hull forms

    Authors: Shuwei Zhu, Siying Lv, Kaifeng Chen, Wei Fang, Leilei Cao

    Abstract: The design optimization of ship hull form based on hydrodynamics theory and simulation-based design (SBD) technologies generally considers ship performance and energy efficiency performance as the design objective, which plays an important role in smart design and manufacturing of green ship. An optimal design of sustainable energy system requires multidisciplinary tools to build ships with the le…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 30 pages, 8 figures

    MSC Class: 41C99 ACM Class: J.6; I.2.8

  48. arXiv:2403.05828  [pdf, other]

    quant-ph cs.AI cs.AR cs.DC

    Multi-GPU-Enabled Hybrid Quantum-Classical Workflow in Quantum-HPC Middleware: Applications in Quantum Simulations

    Authors: Kuan-Cheng Chen, Xiaoren Li, Xiaotian Xu, Yun-Yuan Wang, Chen-Yu Liu

    Abstract: Achieving high-performance computation on quantum systems presents a formidable challenge that necessitates bridging the capabilities between quantum hardware and classical computing resources. This study introduces an innovative distribution-aware Quantum-Classical-Quantum (QCQ) architecture, which integrates cutting-edge quantum software framework works with high-performance classical computing…

    Submitted 18 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

  49. arXiv:2403.04990  [pdf, other]

    hep-ph cs.LG quant-ph

    Jet Discrimination with Quantum Complete Graph Neural Network

    Authors: Yi-An Chen, Kai-Feng Chen

    Abstract: Machine learning, particularly deep neural networks, has been widely utilized in high energy physics and has shown remarkable results in various applications. Moreover, the concept of machine learning has been extended to quantum computers, giving rise to a new research area known as quantum machine learning. In this paper, we propose a novel variational quantum circuit model, Quantum Complete Gra…

    Submitted 12 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  50. arXiv:2403.04475  [pdf, other]

    quant-ph

    Critical quantum metrology robust against dissipation and non-adiabaticity

    Authors: Jia-Hao Lü, Wen Ning, Fan Wu, Ri-Hua Zheng, Ken Chen, Xin Zhu, Zhen-Biao Yang, Huai-Zhi Wu, Shi-Biao Zheng

    Abstract: Critical systems near quantum phase transitions were predicted to be useful for improvement of metrological precision, thanks to their ultra-sensitive response to a tiny variation of the control Hamiltonian. Despite the promising perspective, realization of criticality-enhanced quantum metrology is an experimentally challenging task, mainly owing to the extremely long time needed to encode the sig…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 13 pages, 11 figures