Skip to main content

Showing 151–200 of 13,686 results for author: Chen, Y

.
  1. arXiv:2406.07882  [pdf, other

    cs.CL cs.AI cs.HC

    Designing a Dashboard for Transparency and Control of Conversational AI

    Authors: Yida Chen, Aoyu Wu, Trevor DePodesta, Catherine Yeh, Kenneth Li, Nicholas Castillo Marin, Oam Patel, Jan Riecke, Shivam Raval, Olivia Seow, Martin Wattenberg, Fernanda ViƩgas

    Abstract: Conversational LLMs function as black box systems, leaving users guessing about why they see the output they do. This lack of transparency is potentially problematic, especially given concerns around bias and truthfulness. To address this issue, we present an end-to-end prototype-connecting interpretability techniques with user experience design-that seeks to make chatbots more transparent. We beg… ▽ More

    Submitted 15 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Project page: https://bit.ly/talktuner-project-page 38 pages, 23 figures

  2. arXiv:2406.07651  [pdf, ps, other

    stat.ME stat.CO

    surveygenmod2: A SAS macro for estimating complex survey adjusted generalized linear models and Wald-type tests

    Authors: R. Noah Padgett, Ying Chen

    Abstract: surveygenmod2 builds on the macro written by da Silva (2017) for generalized linear models under complex survey designs. The updated macro fixed several minor bugs we encountered while updating the macro for use in SAS\textregistered. We added additional features for conducting basic Wald-type tests on groups of parameters based on the estimated regression coefficients and parameter variance-covar… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2406.07529  [pdf, other

    cs.LG

    MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation

    Authors: Lu Li, Tianyu Zhang, Zhiqi Bu, Suyuchen Wang, Huan He, Jie Fu, Yonghui Wu, Jiang Bian, Yong Chen, Yoshua Bengio

    Abstract: Model merging has emerged as an effective approach to combine multiple single-task models, fine-tuned from the same pre-trained model, into a multitask model. This process typically involves computing a weighted average of the model parameters without any additional training. Existing model-merging methods focus on enhancing average task accuracy. However, interference and conflicts between the ob… ▽ More

    Submitted 18 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.07480  [pdf, other

    cs.CV

    Image Neural Field Diffusion Models

    Authors: Yinbo Chen, Oliver Wang, Richard Zhang, Eli Shechtman, Xiaolong Wang, Michael Gharbi

    Abstract: Diffusion models have shown an impressive ability to model complex data distributions, with several key advantages over GANs, such as stable training, better coverage of the training distribution's modes, and the ability to solve inverse problems without extra training. However, most diffusion models learn the distribution of fixed-resolution images. We propose to learn the distribution of continu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Project page: https://yinboc.github.io/infd/

  5. arXiv:2406.07462  [pdf

    physics.class-ph cond-mat.mtrl-sci

    Rayleigh surface waves of extremal elastic materials

    Authors: Yu Wei, Yi Chen, Wen Cheng, Xiaoning Liu, Gengkai Hu

    Abstract: Extremal elastic materials here refer to a specific class of elastic materials whose elastic matrices exhibit one or more zero eigenvalues, resulting in soft deformation modes that, in principle, cost no energy. They can be approximated through artificially designed solid microstructures. Extremal elastic materials have exotic bulk wave properties unavailable with conventional solids due to the so… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 8 figures

  6. arXiv:2406.07422  [pdf, other

    eess.AS

    Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation

    Authors: Hanzhao Li, Liumeng Xue, Haohan Guo, Xinfa Zhu, Yuanjun Lv, Lei Xie, Yunlin Chen, Hao Yin, Zhifei Li

    Abstract: The multi-codebook speech codec enables the application of large language models (LLM) in TTS but bottlenecks efficiency and robustness due to multi-sequence prediction. To avoid this obstacle, we propose Single-Codec, a single-codebook single-sequence codec, which employs a disentangled VQ-VAE to decouple speech into a time-invariant embedding and a phonetically-rich discrete sequence. Furthermor… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  7. arXiv:2406.07225  [pdf, other

    quant-ph

    A generic and robust quantum agent inspired by deep meta-reinforcement learning

    Authors: Zibo Miao, Shihui Zhang, Yu Pan, Sibo Tao, Yu Chen

    Abstract: Deep reinforcement learning (deep RL) has enabled human- or superhuman- performances in various applications. Recently, deep RL has also been adopted to improve the performance of quantum control. However, a large volume of data is typically required to train the neural network in deep RL, making it inefficient compared with the traditional optimal quantum control method. Here, we thus develop a n… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  8. arXiv:2406.07147  [pdf

    cs.HC cs.AI cs.CY

    Wearable Device-Based Physiological Signal Monitoring: An Assessment Study of Cognitive Load Across Tasks

    Authors: Ling He, Yanxin Chen, Wenqi Wang, Shuting He, Xiaoqiang Hu

    Abstract: This study employs cutting-edge wearable monitoring technology to conduct high-precision, high-temporal-resolution cognitive load assessment on EEG data from the FP1 channel and heart rate variability (HRV) data of secondary vocational students(SVS). By jointly analyzing these two critical physiological indicators, the research delves into their application value in assessing cognitive load among… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  9. A Neck Orthosis with Multi-Directional Variable Stiffness for Persons with Dropped Head Syndrome

    Authors: Santiago Price Torrendell, Hideki Kadone, Modar Hassan, Yang Chen, Kousei Miura, Kenji Suzuki

    Abstract: Dropped Head Syndrome (DHS) causes a passively correctable neck deformation. Currently, there is no wearable orthopedic neck brace to fulfill the needs of persons suffering from DHS. Related works have made progress in this area by creating mobile neck braces that provide head support to mitigate deformation while permitting neck mobility, which enhances user-perceived comfort and quality of life.… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted Manuscript

    Journal ref: IEEE Robotics and Automation Letters, vol. 9, no. 7, pp. 6224-6231, July 2024

  10. arXiv:2406.07011  [pdf, ps, other

    cs.CR

    Breaking Free: Efficient Multi-Party Private Set Union Without Non-Collusion Assumptions

    Authors: Minglang Dong, Yu Chen, Cong Zhang, Yujie Bai

    Abstract: Multi-party private set union (MPSU) protocol enables $m$ $(m > 2)$ parties, each holding a set, to collectively compute the union of their sets without revealing any additional information to other parties. There are two main categories of MPSU protocols: The first builds on public-key techniques. All existing works in this category involve a super-linear number of public-key operations, resultin… ▽ More

    Submitted 1 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  11. arXiv:2406.06567  [pdf, other

    cs.LG cs.AI cs.CL

    DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion

    Authors: Yilong Chen, Linhao Zhang, Junyuan Shang, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun

    Abstract: Large language models (LLMs) with billions of parameters demonstrate impressive performance. However, the widely used Multi-Head Attention (MHA) in LLMs incurs substantial computational and memory costs during inference. While some efforts have optimized attention mechanisms by pruning heads or sharing parameters among heads, these methods often lead to performance degradation or necessitate subst… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 9 figures, 3 tables

  12. arXiv:2406.06541  [pdf, other

    cs.AR

    Global and Local Attention-based Inception U-Net for Static IR Drop Estimation

    Authors: Yilu Chen, Zhijie Cai, Min Wei, Zhifeng Lin, Jianli Chen

    Abstract: Static IR drop analysis is a fundamental and critical task in chip design since the IR drop will significantly affect the design's functionality, performance, and reliability. However, the process of IR drop analysis can be time-consuming, potentially taking several hours. Furthermore, in the process of fixing violations, it is frequently imperative to do IR drop analysis iteratively, hence exacer… ▽ More

    Submitted 27 April, 2024; originally announced June 2024.

    Comments: 7 pages, 8 figures

  13. arXiv:2406.06388  [pdf, ps, other

    math.RT

    Simple smooth modules over the Ramond algebra and applications to vertex operator superalgebras

    Authors: Yulu Chen, Ran Shen, Yufeng Yao, Kaiming Zhao

    Abstract: Simple smooth modules over the Virasoro algebra and one of the super-Virasoro algebra named the Neveu-Schwarz algebra were classified. This problem remained unsolved for the other super-Virasoro algebra called the Ramond algebra. In this paper, all simple smooth modules over the Ramond algebra are classified. More precisely, a simple smooth module over the Ramond algebra is either a simple highest… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages

  14. arXiv:2406.06327  [pdf

    q-bio.NC

    Leveraging Hyperscanning EEG and VR Omnidirectional Treadmill to Explore Inter-Brain Synchrony in Collaborative Spatial Navigation

    Authors: Chun-Hsiang Chuang, Po-Hsun Peng, Yi-Chieh Chen

    Abstract: Navigating through a physical environment to reach a desired location involves a complex interplay of cognitive, sensory, and motor functions. When navigating with others, experiencing a degree of behavioral and cognitive synchronization is both natural and ubiquitous. This synchronization facilitates a harmonious effort toward achieving a common goal, reflecting how individuals instinctively alig… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  15. arXiv:2406.06118  [pdf, other

    hep-ex

    Strong and weak $CP$ tests in sequential decays of polarized $Ī£^0$ hyperons

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: The $J/Ļˆ, Ļˆ(3686) \to Ī£^0 \barĪ£^{0}$ processes and subsequent decays are studied using the world's largest $J/Ļˆ$ and $Ļˆ(3686)$ data samples collected with the BESIII detector. The strong-$CP$ symmetry is tested in the decays of the $Ī£^0$ hyperons for the first time by measuring the decay parameters, $Ī±_{Ī£^0} = -0.0017 \pm 0.0021 \pm 0.0018$ and $\barĪ±_{Ī£^0} = 0.0021 \pm 0.0020 \pm 0.0022$. The wea… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  16. arXiv:2406.06086  [pdf, other

    cs.SD eess.AS

    RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

    Authors: Yujie Chen, Jiangyan Yi, Jun Xue, Chenglong Wang, Xiaohui Zhang, Shunbo Dong, Siding Zeng, Jianhua Tao, Lv Zhao, Cunhang Fan

    Abstract: Fake artefacts for discriminating between bonafide and fake audio can exist in both short- and long-range segments. Therefore, combining local and global feature information can effectively discriminate between bonafide and fake audio. This paper proposes an end-to-end bidirectional state space model, named RawBMamba, to capture both short- and long-range discriminative information for audio deepf… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  17. arXiv:2406.06068  [pdf, other

    cs.NI

    Instability of Self-Driving Satellite Mega-Constellation: From Theory to Practical Impacts on Network Lifetime and Capacity

    Authors: Yimei Chen, Yuanjie Li, Hewu Li, Lixin Liu, Li Ouyang, Jiabo Yang, Junyi Li, Jian** Wu, Qian Wu, Jun Liu, Zeqi Lai

    Abstract: Low Earth Orbit (LEO) satellite mega-constellations aim to enable high-speed Internet for numerous users anywhere on Earth. To safeguard their network infrastructure in congested outer space, they perform automatic orbital maneuvers to avoid collisions with external debris and satellites. However, our control-theoretic analysis and empirical validation using Starlink's space situational awareness… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  18. arXiv:2406.06063  [pdf, other

    physics.comp-ph quant-ph

    Enabling Large-Scale and High-Precision Fluid Simulations on Near-Term Quantum Computers

    Authors: Zhao-Yun Chen, Teng-Yang Ma, Chuang-Chao Ye, Liang Xu, Ming-Yang Tan, Xi-Ning Zhuang, Xiao-Fan Xu, Yun-Jie Wang, Tai-** Sun, Yong Chen, Lei Du, Liang-Liang Guo, Hai-Feng Zhang, Hao-Ran Tao, Tian-Le Wang, Xiao-Yan Yang, Ze-An Zhao, Peng Wang, Sheng Zhang, Chi Zhang, Ren-Ze Zhao, Zhi-Long Jia, Wei-Cheng Kong, Meng-Han Dou, Jun-Chao Wang , et al. (7 additional authors not shown)

    Abstract: Quantum computational fluid dynamics (QCFD) offers a promising alternative to classical computational fluid dynamics (CFD) by leveraging quantum algorithms for higher efficiency. This paper introduces a comprehensive QCFD method, including an iterative method "Iterative-QLS" that suppresses error in quantum linear solver, and a subspace method to scale the solution to a larger size. We implement o… ▽ More

    Submitted 19 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 31 pages, 10 figures

  19. arXiv:2406.05931  [pdf, other

    cs.RO

    Differentiable Discrete Elastic Rods for Real-Time Modeling of Deformable Linear Objects

    Authors: Yizhou Chen, Yiting Zhang, Zachary Brei, Tiancheng Zhang, Yuzhen Chen, Julie Wu, Ram Vasudevan

    Abstract: This paper addresses the task of modeling Deformable Linear Objects (DLOs), such as ropes and cables, during dynamic motion over long time horizons. This task presents significant challenges due to the complex dynamics of DLOs. To address these challenges, this paper proposes differentiable Discrete Elastic Rods For deformable linear Objects with Real-time Modeling (DEFORM), a novel framework that… ▽ More

    Submitted 14 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  20. arXiv:2406.05901  [pdf, other

    physics.plasm-ph astro-ph.SR physics.space-ph

    Simulation Models for Exploring Magnetic Reconnection

    Authors: Michael Shay, Subash Adhikari, Naoki Beesho, Joachim Birn, Jorg Buechner, Paul Cassak, Li-Jen Chen, Yuxi Chen, Giulia Cozzani, Jim Drake, Fan Guo, Michael Hesse, Neeraj Jain, Yann Pfau-Kempf, Yu Lin, Yi-Hsin Liu, Mitsuo Oka, Yuri A. Omelchenko, Minna Palmroth, Oreste Pezzi, Patricia H. Reiff, Marc Swisdak, Frank Toffoletto, Gabor Toth, Richard A. Wolf

    Abstract: Simulations have played a critical role in the advancement of our knowledge of magnetic reconnection. However, due to the inherently multiscale nature of reconnection, it is impossible to simulate all physics at all scales. For this reason, a wide range of simulation methods have been crafted to study particular aspects and consequences of magnetic reconnection. This chapter reviews many of these… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Chapter 5.2 of ISSI Book on Magnetic Reconnection, submitted to Space Science Reviews

  21. arXiv:2406.05893  [pdf, other

    cs.LG

    Event prediction and causality inference despite incomplete information

    Authors: Harrison Lam, Yuanjie Chen, Noboru Kanazawa, Mohammad Chowdhury, Anna Battista, Stephan Waldert

    Abstract: We explored the challenge of predicting and explaining the occurrence of events within sequences of data points. Our focus was particularly on scenarios in which unknown triggers causing the occurrence of events may consist of non-consecutive, masked, noisy data points. This scenario is akin to an agent tasked with learning to predict and explain the occurrence of events without understanding the… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures, 1 table

  22. arXiv:2406.05827  [pdf, ps, other

    hep-ex

    Measurement of the integrated luminosity of the data collected at 3.773 GeV by BESIII from 2021 to 2024

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: We present a measurement of the integrated luminosity of $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at a center-of-mass energy of $E_{\rm cm} = 3.773$~GeV. The integrated luminosities of the data sets taken from December 2021 to June 2022, from November 2022 to June 2023, and from October 2023 to February 2024 are determined to be $4.995 \pm 0.019$~fb$^{-1}$,… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  23. arXiv:2406.05682  [pdf, other

    cs.LG cs.AI

    From Basic to Extra Features: Hypergraph Transformer Pretrain-then-Finetuning for Balanced Clinical Predictions on EHR

    Authors: Ran Xu, Yiwen Lu, Chang Liu, Yong Chen, Yan Sun, Xiao Hu, Joyce C Ho, Carl Yang

    Abstract: Electronic Health Records (EHRs) contain rich patient information and are crucial for clinical research and practice. In recent years, deep learning models have been applied to EHRs, but they often rely on massive features, which may not be readily available for all patients. We propose HTP-Star, which leverages hypergraph structures with a pretrain-then-finetune framework for modeling EHR data, e… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: CHIL 2024

  24. arXiv:2406.05676  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Chern insulator phase realized in dual-gate-tuned MnBi2Te4 thin films grown by molecular beam epitaxy

    Authors: Yunhe Bai, Yuanzhao Li, Ruixuan Liu, Jianli Luan, Yang Chen, Wenyu Song, Peng-Fei Ji, Cui Ding, Zongwei Gao, Qinghua Zhang, Fanqi Meng, Bingbing Tong, Lin Li, Tianchen Zhu, Lin Gu, Lili Wang, **song Zhang, Yayu Wang, Qi-Kun Xue, Ke He, Yang Feng, Xiao Feng

    Abstract: The intrinsic magnetic order, large topological-magnetic gap and rich topological phases make MnBi2Te4 a wonderful platform to study exotic topological quantum states such as axion insulator and Chern insulator. To realize and manipulate these topological phases in a MnBi2Te4 thin film, precise manipulation of the electric field across the film is essential, which requires a dual-gate structure. I… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 24 pages, 4 figures

  25. arXiv:2406.05502  [pdf

    physics.optics

    Characterization of Recirculating Waveguide Meshes Based on an Optimization Method with a Parameter Space Reduction Technology

    Authors: Ran Tao, Jifang Qiu, Yuchen Chen, Bowen Zhang, Yan Li, Hongxiang Guo, Jian Wu

    Abstract: Fabrication imperfections must be considered during configuration to ensure that the setup is suitable for the actual fabricated programmable photonic integrated circuits (PPICs). Therefore, characterization of imperfections is crucial but difficult, especially for PPICs made from recirculating waveguide meshes. The flexibility required by these meshes demands a more complex topology and compact T… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  26. arXiv:2406.05467  [pdf, other

    astro-ph.SR physics.plasm-ph physics.space-ph

    Prevalence of non-standard collapsing of strong Langmuir turbulence in solar corona plasmas

    Authors: Yaokun Li, Haomin Sun, Hao Ning, Sulan Ni, Xiangliang Kong, Jiansen He, Yao Chen

    Abstract: We present a fully-kinetic simulation of the full life cycle of strong Langmuir turbulence (SLT) excited by electron beams that are accelerated under the solar corona conditions. We find that (1) most packets ($\sim$80%) are affected by their neighbors during their collapse, as a result, their spatial scale variations present non-standard evolutionary features, i.e., deviating away from what was p… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  27. arXiv:2406.05397  [pdf, other

    cs.SE

    Metamorphic Relation Generation: State of the Art and Visions for Future Research

    Authors: Rui Li, Huai Liu, Pak-Lok Poon, Dave Towey, Chang-Ai Sun, Zheng Zheng, Zhi Quan Zhou, Tsong Yueh Chen

    Abstract: Metamorphic testing has become one mainstream technique to address the notorious oracle problem in software testing, thanks to its great successes in revealing real-life bugs in a wide variety of software systems. Metamorphic relations, the core component of metamorphic testing, have continuously attracted research interests from both academia and industry. In the last decade, a rapidly increasing… ▽ More

    Submitted 10 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted by International Workshop on Software Engineering in 2030

  28. arXiv:2406.05232  [pdf, other

    cs.CL cs.LG

    Improving Logits-based Detector without Logits from Black-box LLMs

    Authors: Cong Zeng, Shengkun Tang, Xianjun Yang, Yuanzhou Chen, Yiyou Sun, zhiqiang xu, Yao Li, Haifeng Chen, Wei Cheng, Dongkuan Xu

    Abstract: The advent of Large Language Models (LLMs) has revolutionized text generation, producing outputs that closely mimic human writing. This blurring of lines between machine- and human-written text presents new challenges in distinguishing one from the other a task further complicated by the frequent updates and closed nature of leading proprietary LLMs. Traditional logits-based detection methods leve… ▽ More

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  29. arXiv:2406.05007  [pdf, other

    quant-ph

    Slow and Stored Light via Electromagnetically Induced Transparency Using A $Ī›$-type Superconducting Artificial Atom

    Authors: Kai-I Chu, Xiao-Cheng Lu, Kuan-Hsun Chiang, Yen-Hsiang Lin, Chii-Dong Chen, Ite A. Yu, Wen-Te Liao, Yung-Fu Chen

    Abstract: Recent progresses in Josephson-junction-based superconducting circuits have propelled quantum information processing forward. However, the lack of a metastable state in most superconducting artificial atoms hinders the development of photonic quantum memory in this platform. Here, we use a single superconducting qubit-resonator system to realize a desired $Ī›$-type artificial atom, and to demonstra… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 10 pages, 6 figures

  30. arXiv:2406.04999  [pdf, other

    cs.CV

    ProMotion: Prototypes As Motion Learners

    Authors: Yawen Lu, Dongfang Liu, Qifan Wang, Cheng Han, Yiming Cui, Zhiwen Cao, Xueling Zhang, Yingjie Victor Chen, Heng Fan

    Abstract: In this work, we introduce ProMotion, a unified prototypical framework engineered to model fundamental motion tasks. ProMotion offers a range of compelling attributes that set it apart from current task-specific paradigms. We adopt a prototypical perspective, establishing a unified paradigm that harmonizes disparate motion learning approaches. This novel paradigm streamlines the architectural desi… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 11 pages

  31. arXiv:2406.04801  [pdf, other

    cs.CV

    MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks

    Authors: Xingkui Zhu, Yiran Guan, Dingkang Liang, Yuchao Chen, Yuliang Liu, Xiang Bai

    Abstract: The sparsely activated mixture of experts (MoE) model presents a promising alternative to traditional densely activated (dense) models, enhancing both quality and computational efficiency. However, training MoE models from scratch demands extensive data and computational resources. Moreover, public repositories like timm mainly provide pre-trained dense checkpoints, lacking similar resources for M… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 9 pages, 6 figures

    ACM Class: I.2

  32. arXiv:2406.04743  [pdf, other

    cs.LG cs.CR cs.DC stat.AP

    When Swarm Learning meets energy series data: A decentralized collaborative learning design based on blockchain

    Authors: Lei Xu, Yulong Chen, Yuntian Chen, Longfeng Nie, Xuetao Wei, Liang Xue, Dongxiao Zhang

    Abstract: Machine learning models offer the capability to forecast future energy production or consumption and infer essential unknown variables from existing data. However, legal and policy constraints within specific energy sectors render the data sensitive, presenting technical hurdles in utilizing data from diverse sources. Therefore, we propose adopting a Swarm Learning (SL) scheme, which replaces the… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  33. arXiv:2406.04712  [pdf, other

    cs.CL

    AICoderEval: Improving AI Domain Code Generation of Large Language Models

    Authors: Yinghui Xia, Yuyan Chen, Tianyu Shi, Jun Wang, **song Yang

    Abstract: Automated code generation is a pivotal capability of large language models (LLMs). However, assessing this capability in real-world scenarios remains challenging. Previous methods focus more on low-level code generation, such as model loading, instead of generating high-level codes catering for real-world tasks, such as image-to-text, text classification, in various domains. Therefore, we construc… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  34. arXiv:2406.04583  [pdf, other

    cs.CL

    Extroversion or Introversion? Controlling The Personality of Your Large Language Models

    Authors: Yanquan Chen, Zhen Wu, Junjie Guo, Shujian Huang, Xinyu Dai

    Abstract: Large language models (LLMs) exhibit robust capabilities in text generation and comprehension, mimicking human behavior and exhibiting synthetic personalities. However, some LLMs have displayed offensive personality, propagating toxic discourse. Existing literature neglects the origin and evolution of LLM personalities, as well as the effective personality control. To fill these gaps, our study em… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  35. arXiv:2406.04575  [pdf, other

    cs.LG cs.AI stat.AP stat.ML

    Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning

    Authors: Zhongzheng Wang, Yuntian Chen, Guodong Chen, Dongxiao Zhang

    Abstract: Maximizing storage performance in geological carbon storage (GCS) is crucial for commercial deployment, but traditional optimization demands resource-intensive simulations, posing computational challenges. This study introduces the multimodal latent dynamic (MLD) model, a deep learning framework for fast flow prediction and well control optimization in GCS. The MLD model includes a representation… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  36. arXiv:2406.04530  [pdf, ps, other

    math.NA math.OC

    A general framework for floating point error analysis of simplex derivatives

    Authors: Yiwen Chen, Warren Hare, Amy Wiebe

    Abstract: Gradient approximations are a class of numerical approximation techniques that are of central importance in numerical optimization. In derivative-free optimization, most of the gradient approximations, including the simplex gradient, centred simplex gradient, and adapted centred simplex gradient, are in the form of simplex derivatives. Owing to machine precision, the approximation accuracy of any… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    MSC Class: 65D25; 90C56

  37. arXiv:2406.04469  [pdf, other

    physics.optics

    Resolving the Orientations of and Separation between an Overlap** Pair of Dipole Emitters

    Authors: Yiyang Chen, Yuanxin Qiu, Matthew D. Lew

    Abstract: We prove that it is impossible to distinguish two spatially overlap** fluorescent molecules from a single rotating molecule, even if one modulates the polarization of pum** light or the detection dipole-spread function (DSF). If the target is known to be a dipole pair, existing imaging methods perform poorly for measuring their angular separation. We propose simultaneously modulating the excit… ▽ More

    Submitted 26 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures

  38. arXiv:2406.04451  [pdf, other

    cs.RO

    RiskMap: A Unified Driving Context Representation for Autonomous Motion Planning in Urban Driving Environment

    Authors: Ren Xin, Sheng Wang, Yingbing Chen, Jie Cheng, Ming Liu

    Abstract: Planning is complicated by the combination of perception and map information, particularly when driving in heavy traffic. Develo** an extendable and efficient representation that visualizes sensor noise and provides constraints to real-time planning tasks is desirable. We aim to develop an extendable map representation offering prior to cost in planning tasks to simplify the planning process of… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Submission to ICRA 2023 was not accepted. This paper is now available just for public reference

  39. arXiv:2406.04428  [pdf, other

    cs.CL cs.AI

    MoralBench: Moral Evaluation of LLMs

    Authors: Jianchao Ji, Yutong Chen, Mingyu **, Wujiang Xu, Wenyue Hua, Yongfeng Zhang

    Abstract: In the rapidly evolving field of artificial intelligence, large language models (LLMs) have emerged as powerful tools for a myriad of applications, from natural language processing to decision-making support systems. However, as these models become increasingly integrated into societal frameworks, the imperative to ensure they operate within ethical and moral boundaries has never been more critica… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  40. arXiv:2406.04358  [pdf, other

    quant-ph physics.optics

    Quantum erasure based on phase structure

    Authors: Ye Yang, Chengyuan Wang, Yun Chen, Jianyi Xv, Xin Yang, **wen Wang, Shuwei Qiu, Hong Gao, Fuli Li

    Abstract: The quantum eraser effect exemplifies the distinct properties of quantum mechanics that challenge classical intuition and expose the wave-particle duality of light. This effect has been extensively explored in various experiments; most of these investigations use polarisation to distinguish which path information, and less attention has been paid to the phase structure which is related wavefront o… ▽ More

    Submitted 18 May, 2024; originally announced June 2024.

  41. arXiv:2406.04101  [pdf, other

    cs.CV

    How Far Can We Compress Instant-NGP-Based NeRF?

    Authors: Yihang Chen, Qianyi Wu, Mehrtash Harandi, Jianfei Cai

    Abstract: In recent years, Neural Radiance Field (NeRF) has demonstrated remarkable capabilities in representing 3D scenes. To expedite the rendering process, learnable explicit representations have been introduced for combination with implicit NeRF representation, which however results in a large storage space requirement. In this paper, we introduce the Context-based NeRF Compression (CNC) framework, whic… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://yihangchen-ee.github.io/project_cnc/ Code: https://github.com/yihangchen-ee/cnc/. We further propose a 3DGS compression method HAC, which is based on CNC: https://yihangchen-ee.github.io/project_hac/

    Journal ref: CVPR 2024

  42. arXiv:2406.04038  [pdf, other

    cs.LG

    Road Network Representation Learning with the Third Law of Geography

    Authors: Haicang Zhou, Weiming Huang, Yile Chen, Tiantian He, Gao Cong, Yew-Soon Ong

    Abstract: Road network representation learning aims to learn compressed and effective vectorized representations for road segments that are applicable to numerous tasks. In this paper, we identify the limitations of existing methods, particularly their overemphasis on the distance effect as outlined in the First Law of Geography. In response, we propose to endow road network representation with the principl… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  43. arXiv:2406.03849  [pdf

    cs.LG stat.AP stat.ML

    A Noise-robust Multi-head Attention Mechanism for Formation Resistivity Prediction: Frequency Aware LSTM

    Authors: Yongan Zhang, Junfeng Zhao, Jian Li, Xuanran Wang, Youzhuang Sun, Yuntian Chen, Dongxiao Zhang

    Abstract: The prediction of formation resistivity plays a crucial role in the evaluation of oil and gas reservoirs, identification and assessment of geothermal energy resources, groundwater detection and monitoring, and carbon capture and storage. However, traditional well logging techniques fail to measure accurate resistivity in cased boreholes, and the transient electromagnetic method for cased borehole… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  44. arXiv:2406.03808  [pdf

    cs.LG cs.AI stat.AP

    Cross-variable Linear Integrated ENhanced Transformer for Photovoltaic power forecasting

    Authors: Jiaxin Gao, Qinglong Cao, Yuntian Chen, Dongxiao Zhang

    Abstract: Photovoltaic (PV) power forecasting plays a crucial role in optimizing the operation and planning of PV systems, thereby enabling efficient energy management and grid integration. However, un certainties caused by fluctuating weather conditions and complex interactions between different variables pose significant challenges to accurate PV power forecasting. In this study, we propose PV-Client (Cro… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  45. arXiv:2406.03740  [pdf

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.supr-con

    Correlated Electronic Structure and Incipient Flat Bands of the Kagome Superconductor CsCr3Sb5

    Authors: Yidian Li, Yi Liu, Xian Du, Siqi Wu, Wenxuan Zhao, Kaiyi Zhai, Yinqi Hu, Senyao Zhang, Houke Chen, Jieyi Liu, Yiheng Yang, Cheng Peng, Makoto Hashimoto, Donghui Lu, Zhongkai Liu, Yilin Wang, Yulin Chen, Guanghan Cao, Lexian Yang

    Abstract: Kagome materials exhibit many novel phenomena emerging from the interplay between lattice geometry, electronic structure, and topology. A prime example is the vanadium-based kagome materials AV3Sb5 (A = K, Rb, and Cs) with superconductivity and unconventional charge-density wave (CDW). More interestingly, the substitution of vanadium by chromium further introduces magnetism and enhances the correl… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  46. arXiv:2406.03692  [pdf, other

    gr-qc hep-th

    New Taub-NUT Black Holes with Massive Spin-2 Hair

    Authors: Yu-Qi Chen, Hai-Shan Liu

    Abstract: We consider Einstein gravity extended with quadratic curvature invariants, where the well-known Ricci-flat Taub-NUT black hole remains a solution. An analysis of the unstable Lichnerowicz modes in the Taub-NUT background enables us to identify the mass and NUT parameters (m,n) where new Taub-NUT black holes can emerge. We then adopt numerical technique to construct these new Taub-NUT black holes t… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 7 pages, 6 figures

  47. arXiv:2406.03689  [pdf, other

    cs.CL cs.AI

    Evaluating the World Model Implicit in a Generative Model

    Authors: Keyon Vafa, Justin Y. Chen, Jon Kleinberg, Sendhil Mullainathan, Ashesh Rambachan

    Abstract: Recent work suggests that large language models may implicitly learn world models. How should we assess this possibility? We formalize this question for the case where the underlying reality is governed by a deterministic finite automaton. This includes problems as diverse as simple logical reasoning, geographic navigation, game-playing, and chemistry. We propose new evaluation metrics for world m… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  48. arXiv:2406.03582  [pdf, other

    cs.CV cs.AI

    Understanding the Limitations of Diffusion Concept Algebra Through Food

    Authors: E. Zhixuan Zeng, Yuhao Chen, Alexander Wong

    Abstract: Image generation techniques, particularly latent diffusion models, have exploded in popularity in recent years. Many techniques have been developed to manipulate and clarify the semantic concepts these large-scale models learn, offering crucial insights into biases and concept relationships. However, these techniques are often only validated in conventional realms of human or animal faces and arti… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  49. arXiv:2406.03554  [pdf

    cond-mat.mtrl-sci

    Magnetic ground state and strain-mediated chiral-like atomic distortions behavior in two-dimensional rectangular spin lattice

    Authors: Yu Liao, Yueqiao Qu, Zuo Li, Yu Chen, Liang Liu, Jun-Zhong Wang, Gang Yao

    Abstract: Due to the large perpendicular magnetic anisotropy originating from spin-orbit coupling, magnetoelastic coupling is generally reported in easy-plane magnets with rectangular lattice where the easy magnetization is coupled with the lattice direction, while the acquisition of a novel coupling, beyond the easy-plane ferromagnets, in two-dimensional (2D) materials remains unknown. Here, by employing t… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  50. arXiv:2406.03510  [pdf, other

    cs.SD cs.AI eess.AS

    Speech-based Clinical Depression Screening: An Empirical Study

    Authors: Yangbin Chen, Chenyang Xu, Chunfeng Liang, Yanbao Tao, Chuan Shi

    Abstract: This study investigates the utility of speech signals for AI-based depression screening across varied interaction scenarios, including psychiatric interviews, chatbot conversations, and text readings. Participants include depressed patients recruited from the outpatient clinics of Peking University Sixth Hospital and control group members from the community, all diagnosed by psychiatrists followin… ▽ More

    Submitted 12 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures