Skip to main content

Showing 251–300 of 3,416 results for author: Wu, Z

.
  1. arXiv:2403.12744  [pdf, other

    cs.CL

    Instructing Large Language Models to Identify and Ignore Irrelevant Conditions

    Authors: Zhenyu Wu, Chao Shen, Meng Jiang

    Abstract: Math word problem (MWP) solving requires generating a reasoning path based on a given problem description that often contains irrelevant conditions. Existing chain-of-thought (CoT) prompting methods elicited multi-step reasoning abilities of large language models (LLMs) to solve MWPs. However, they were seriously confused by the irrelevant conditions, resulting in low accuracy. In this paper, we p… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: NAACL 2024 - Camera Ready

  2. arXiv:2403.12440  [pdf, other

    cs.CV

    Self-learning Canonical Space for Multi-view 3D Human Pose Estimation

    Authors: Xiaoben Li, Mancheng Meng, Ziyan Wu, Terrence Chen, Fan Yang, Dinggang Shen

    Abstract: Multi-view 3D human pose estimation is naturally superior to single view one, benefiting from more comprehensive information provided by images of multiple views. The information includes camera poses, 2D/3D human poses, and 3D geometry. However, the accurate annotation of these information is hard to obtain, making it challenging to predict accurate 3D human pose from multi-view images. To deal w… ▽ More

    Submitted 29 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  3. arXiv:2403.12434  [pdf, other

    cs.CV

    Human Mesh Recovery from Arbitrary Multi-view Images

    Authors: Xiaoben Li, Mancheng Meng, Ziyan Wu, Terrence Chen, Fan Yang, Dinggang Shen

    Abstract: Human mesh recovery from arbitrary multi-view images involves two characteristics: the arbitrary camera poses and arbitrary number of camera views. Because of the variability, designing a unified framework to tackle this task is challenging. The challenges can be summarized as the dilemma of being able to simultaneously estimate arbitrary camera poses and recover human mesh from arbitrary multi-vi… ▽ More

    Submitted 17 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2403.12416  [pdf, other

    cs.CV cs.CL

    Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning

    Authors: Chong Ma, Hanqi Jiang, Wenting Chen, Yiwei Li, Zihao Wu, Xiaowei Yu, Zhengliang Liu, Lei Guo, Dajiang Zhu, Tuo Zhang, Dinggang Shen, Tianming Liu, Xiang Li

    Abstract: In the medical multi-modal frameworks, the alignment of cross-modality features presents a significant challenge. However, existing works have learned features that are implicitly aligned from the data, without considering the explicit relationships in the medical context. This data-reliance may lead to low generalization of the learned alignment relationships. In this work, we propose the Eye-gaz… ▽ More

    Submitted 13 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 12 pages, 6 figures

    MSC Class: 68T07 ACM Class: I.2.0; I.4.0; I.5.4; I.7.0

  5. arXiv:2403.12393  [pdf, other

    cs.CL

    Dr3: Ask Large Language Models Not to Give Off-Topic Answers in Open Domain Multi-Hop Question Answering

    Authors: Yuan Gao, Yiheng Zhu, Yuanbin Cao, Yinzhi Zhou, Zhen Wu, Yujie Chen, Shenglan Wu, Haoyuan Hu, Xinyu Dai

    Abstract: Open Domain Multi-Hop Question Answering (ODMHQA) plays a crucial role in Natural Language Processing (NLP) by aiming to answer complex questions through multi-step reasoning over retrieved information from external knowledge sources. Recently, Large Language Models (LLMs) have demonstrated remarkable performance in solving ODMHQA owing to their capabilities including planning, reasoning, and util… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024, Long Paper

  6. arXiv:2403.11868  [pdf, other

    cs.GR cs.CV

    View-Consistent 3D Editing with Gaussian Splatting

    Authors: Yuxuan Wang, Xuanyu Yi, Zike Wu, Na Zhao, Long Chen, Hanwang Zhang

    Abstract: The advent of 3D Gaussian Splatting (3DGS) has revolutionized 3D editing, offering efficient, high-fidelity rendering and enabling precise local manipulations. Currently, diffusion-based 2D editing models are harnessed to modify multi-view rendered images, which then guide the editing of 3DGS models. However, this approach faces a critical issue of multi-view inconsistency, where the guidance imag… ▽ More

    Submitted 20 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 25 pages

  7. arXiv:2403.11845  [pdf

    eess.SP

    Simplified Self-homodyne Coherent System Based on Alamouti Coding and Digital Subcarrier Multiplexing

    Authors: Wei Wang, Dongdong Zou, Zhenpeng Wu, Qi Sui, Xingwen Yi, Fan Li, Chao Lu, Zhaohui Li

    Abstract: Coherent technology inherent with more availabledegrees of freedom is deemed a competitive solution for nextgeneration ultra-high-speed short-reach optical interconnects.However, the fatal barriers to implementing the conventiona.coherent system in short-reach optical interconnect are the costfootprint, and power consumption. Self-homodyne coherentsystem exhibits its potential to reduce the power… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  8. arXiv:2403.11459  [pdf, other

    cs.RO

    ALDM-Gras**: Diffusion-aided Zero-Shot Sim-to-Real Transfer for Robot Gras**

    Authors: Yiwei Li, Zihao Wu, Huaqin Zhao, Tianze Yang, Zhengliang Liu, Peng Shu, ** Sun, Ramviyas Parasuraman, Tianming Liu

    Abstract: To tackle the "reality gap" encountered in Sim-to-Real transfer, this study proposes a diffusion-based framework that minimizes inconsistencies in gras** actions between the simulation settings and realistic environments. The process begins by training an adversarial supervision layout-to-image diffusion model(ALDM). Then, leverage the ALDM approach to enhance the simulation environment, renderi… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  9. arXiv:2403.11416  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Surface region band enhancement in noble gas adsorption assisted ARPES on kagome superconductor RbV3Sb5

    Authors: Cao Peng, Yiwei Li, Xu Chen, Shenghao Dai, Zewen Wu, Chunlong Wu, Qiang Wan, Keming Zhao, Renzhe Li, Shangkun Mo, Dingkun Qin, Shuming Yu, Hao Zhong, Shengjun Yuan, Jiangang Guo, Nan Xu

    Abstract: Electronic states near surface regions can be distinct from bulk states, which are paramount in understanding various physical phenomena occurring at surfaces and in applications in semiconductors, energy, and catalysis. Here, we report an abnormal surface region band enhancement effect in angle-resolved photoemission spectroscopy on kagome superconductor RbV3Sb5, by depositing noble gases with fi… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 17 pages,4 figures

    Journal ref: Phys. Rev. B 109, 115415 (2024)

  10. arXiv:2403.11311  [pdf, other

    cs.CL cs.MM

    Mixture-of-Prompt-Experts for Multi-modal Semantic Understanding

    Authors: Zichen Wu, Hsiu-Yuan Huang, Fanyi Qu, Yunfang Wu

    Abstract: Deep multimodal semantic understanding that goes beyond the mere superficial content relation mining has received increasing attention in the realm of artificial intelligence. The challenges of collecting and annotating high-quality multi-modal data have underscored the significance of few-shot learning. In this paper, we focus on two critical tasks under this context: few-shot multi-modal sarcasm… ▽ More

    Submitted 24 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024, Long Paper

  11. arXiv:2403.11126  [pdf, ps, other

    cond-mat.supr-con

    Observation of diamagnetic strange-metal phase in sulfur-copper codoped lead apatite

    Authors: Hongyang Wang, Hao Wu, Ning Chen, Xianfeng Qiao, Ling Wang, Zhixing Wu, Zhihui Geng, Weiwei Xue, Shufeng Ye, Yao Yao

    Abstract: By codo** sulfur and copper into lead apatite, the crystal grains are directionally stacked and the room-temperature resistivity is reduced from insulating to $2\times10^{-5}~Ω\cdot$m. The resistance-temperature curve exhibits a nearly linear relationship at low temperature suggesting the presence of strange-metal phase, and a second-order phase transition is then observed at around 230~K during… ▽ More

    Submitted 6 May, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: 12 pages, 4 figures

  12. arXiv:2403.11122  [pdf, other

    cs.CV

    LERENet: Eliminating Intra-class Differences for Metal Surface Defect Few-shot Semantic Segmentation

    Authors: Hanze Ding, Zhangkai Wu, Jiyan Zhang, Ming **, Yanfang Liu

    Abstract: Few-shot segmentation models excel in metal defect detection due to their rapid generalization ability to new classes and pixel-level segmentation, rendering them ideal for addressing data scarcity issues and achieving refined object delineation in industrial applications. Existing works neglect the \textit{Intra-Class Differences}, inherent in metal surface defect data, which hinders the model fr… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  13. arXiv:2403.10877  [pdf, ps, other

    hep-ex hep-ph

    Test of lepton universality and measurement of the form factors of $D^0\to K^{*}(892)^-μ^+ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: We report a first study of the semileptonic decay $D^0\rightarrow K^-π^0μ^{+}ν_μ$ by analyzing an $e^+e^-$ annihilation data sample of $7.9~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The absolute branching fraction of $D^0\to K^-π^0μ^{+}ν_μ$ is measured for the first time to be $(0.729 \pm 0.014_{\rm stat} \pm 0.011_{\rm syst})\%$. Based on an a… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  14. arXiv:2403.10794  [pdf, other

    cs.RO cs.LG cs.MA

    Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games

    Authors: Zixuan Wu, Sean Ye, Manisha Natarajan, Matthew C. Gombolay

    Abstract: Reinforcement Learning- (RL-)based motion planning has recently shown the potential to outperform traditional approaches from autonomous navigation to robot manipulation. In this work, we focus on a motion planning task for an evasive target in a partially observable multi-agent adversarial pursuit-evasion games (PEG). These pursuit-evasion problems are relevant to various applications, such as se… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: This work has been submitted to the IEEE Robotics and Automation Letters (RA-L) for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  15. arXiv:2403.10376  [pdf, other

    cs.CV

    PASTA: Towards Flexible and Efficient HDR Imaging Via Progressively Aggregated Spatio-Temporal Alignment

    Authors: Xiaoning Liu, Ao Li, Zongwei Wu, Yapeng Du, Le Zhang, Yulun Zhang, Radu Timofte, Ce Zhu

    Abstract: Leveraging Transformer attention has led to great advancements in HDR deghosting. However, the intricate nature of self-attention introduces practical challenges, as existing state-of-the-art methods often demand high-end GPUs or exhibit slow inference speeds, especially for high-resolution images like 2K. Striking an optimal balance between performance and latency remains a critical concern. In r… ▽ More

    Submitted 9 April, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  16. arXiv:2403.10242  [pdf, other

    cs.CV

    FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model

    Authors: Qijun Feng, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang

    Abstract: Reconstructing detailed 3D objects from single-view images remains a challenging task due to the limited information available. In this paper, we introduce FDGaussian, a novel two-stage framework for single-image 3D reconstruction. Recent methods typically utilize pre-trained 2D diffusion models to generate plausible novel views from the input image, yet they encounter issues with either multi-vie… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  17. arXiv:2403.10081  [pdf, other

    cs.CL cs.IR

    DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models

    Authors: Weihang Su, Yichen Tang, Qingyao Ai, Zhi**g Wu, Yiqun Liu

    Abstract: Dynamic retrieval augmented generation (RAG) paradigm actively decides when and what to retrieve during the text generation process of Large Language Models (LLMs). There are two key elements of this paradigm: identifying the optimal moment to activate the retrieval module (deciding when to retrieve) and crafting the appropriate query once retrieval is triggered (determining what to retrieve). How… ▽ More

    Submitted 5 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  18. arXiv:2403.09171  [pdf, other

    cs.LG cs.AI

    ADEdgeDrop: Adversarial Edge Drop** for Robust Graph Neural Networks

    Authors: Zhaoliang Chen, Zhihao Wu, Ylli Sadikaj, Claudia Plant, Hong-Ning Dai, Shi** Wang, Wenzhong Guo

    Abstract: Although Graph Neural Networks (GNNs) have exhibited the powerful ability to gather graph-structured information from neighborhood nodes via various message-passing mechanisms, the performance of GNNs is limited by poor generalization and fragile robustness caused by noisy and redundant graph data. As a prominent solution, Graph Augmentation Learning (GAL) has recently received increasing attentio… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  19. arXiv:2403.08644  [pdf, other

    cond-mat.mtrl-sci cond-mat.stat-mech

    Thermodynamic Integration for Dynamically Unstable Systems Using Interatomic Force Constants without Molecular Dynamics

    Authors: Junsoo Park, Zhigang Wu, John W. Lawson

    Abstract: We demonstrate an efficient and accurate, general-purpose first-principles blueprint for calculating anharmonic vibrational free energy and predicting structural phase transition temperatures of solids. Thermodynamic integration is performed without molecular dynamics using only interatomic force constants to model analogues of the true potential and generate their thermal ensembles. By replacing… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  20. arXiv:2403.08364  [pdf

    cs.LG cs.AI

    Decoupled Federated Learning on Long-Tailed and Non-IID data with Feature Statistics

    Authors: Zhuoxin Chen, Zhenyu Wu, Yang Ji

    Abstract: Federated learning is designed to enhance data security and privacy, but faces challenges when dealing with heterogeneous data in long-tailed and non-IID distributions. This paper explores an overlooked scenario where tail classes are sparsely distributed over a few clients, causing the models trained with these classes to have a lower probability of being selected during client aggregation, leadi… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  21. arXiv:2403.08215  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving

    Authors: Sicen Guo, Zhiyuan Wu, Qijun Chen, Ioannis Pitas, Rui Fan

    Abstract: Despite the impressive performance achieved by data-fusion networks with duplex encoders for visual semantic segmentation, they become ineffective when spatial geometric data are not available. Implicitly infusing the spatial geometric prior knowledge acquired by a duplex-encoder teacher model into a single-encoder student model is a practical, albeit less explored research avenue. This paper delv… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 13 pages, 4 figures, 5 tables

  22. arXiv:2403.08003  [pdf, ps, other

    cs.CV

    Real-time Surgical Instrument Segmentation in Video Using Point Tracking and Segment Anything

    Authors: Zijian Wu, Adam Schmidt, Peter Kazanzides, Septimiu E. Salcudean

    Abstract: The Segment Anything Model (SAM) is a powerful vision foundation model that is revolutionizing the traditional paradigm of segmentation. Despite this, a reliance on prompting each frame and large computational cost limit its usage in robotically assisted surgery. Applications, such as augmented reality guidance, require little user intervention along with efficient inference to be usable clinicall… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures

  23. arXiv:2403.07809  [pdf, other

    cs.LG cs.CL

    pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

    Authors: Zhengxuan Wu, Atticus Geiger, Aryaman Arora, **g Huang, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts

    Abstract: Interventions on model-internal states are fundamental operations in many areas of AI, including model editing, steering, robustness, and interpretability. To facilitate such research, we introduce $\textbf{pyvene}$, an open-source Python library that supports customizable interventions on a range of different PyTorch modules. $\textbf{pyvene}$ supports complex intervention schemes with an intuiti… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

  24. arXiv:2403.06766  [pdf, other

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be… ▽ More

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  25. arXiv:2403.06650  [pdf, other

    cond-mat.supr-con

    Magnetic signatures of multicomponent superconductivity in pressurized UTe2

    Authors: Zheyu Wu, Jiasheng Chen, Theodore. I. Weinberger, Andrej Cabala, Vladimir Sechovsky, Michal Valiska, Patricia L. Alireza, Alexander G. Eaton, F. Malte Grosche

    Abstract: The heavy fermion material UTe$_2$ possesses a rich phase diagram with multiple superconducting phases, several of which exhibit characteristics of odd-parity pairing. Here, we report on the pressure dependence of signatures of the superconducting transition in the temperature dependent ac magnetic susceptibility $χ(T)$ in high quality UTe$_2$ single crystals. We resolve a single superconducting t… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  26. arXiv:2403.06448  [pdf, other

    cs.CL cs.AI

    Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models

    Authors: Weihang Su, Changyue Wang, Qingyao Ai, Yiran HU, Zhi**g Wu, Yujia Zhou, Yiqun Liu

    Abstract: Hallucinations in large language models (LLMs) refer to the phenomenon of LLMs producing responses that are coherent yet factually inaccurate. This issue undermines the effectiveness of LLMs in practical applications, necessitating research into detecting and mitigating hallucinations of LLMs. Previous studies have mainly concentrated on post-processing techniques for hallucination detection, whic… ▽ More

    Submitted 10 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  27. arXiv:2403.06185  [pdf, other

    cs.IT eess.SP math.OC

    Quantized Constant-Envelope Waveform Design for Massive MIMO DFRC Systems

    Authors: Zheyu Wu, Ya-Feng Liu, Wei-Kun Chen, Christos Masouros

    Abstract: Both dual-functional radar-communication (DFRC) and massive multiple-input multiple-output (MIMO) have been recognized as enabling technologies for 6G wireless networks. This paper considers the advanced waveform design for hardware-efficient massive MIMO DFRC systems. Specifically, the transmit waveform is imposed with the quantized constant-envelope (QCE) constraint, which facilitates the employ… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 17 pages, 11 figures, submitted for possible publication

  28. arXiv:2403.05834  [pdf, other

    cs.MM cs.SD eess.AS

    Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information

    Authors: Qiaochu Huang, Xu He, Boshi Tang, Haolin Zhuang, Liyang Chen, Shuochen Gao, Zhiyong Wu, Haozhi Huang, Helen Meng

    Abstract: Dance generation, as a branch of human motion generation, has attracted increasing attention. Recently, a few works attempt to enhance dance expressiveness, which includes genre matching, beat alignment, and dance dynamics, from certain aspects. However, the enhancement is quite limited as they lack comprehensive consideration of the aforementioned three factors. In this paper, we propose Expressi… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  29. arXiv:2403.05758  [pdf, other

    cs.CV

    Automating Catheterization Labs with Real-Time Perception

    Authors: Fan Yang, Benjamin Planche, Meng Zheng, Cheng Chen, Terrence Chen, Ziyan Wu

    Abstract: For decades, three-dimensional C-arm Cone-Beam Computed Tomography (CBCT) imaging system has been a critical component for complex vascular and nonvascular interventional procedures. While it can significantly improve multiplanar soft tissue imaging and provide pre-treatment target lesion roadmap** and guidance, the traditional workflow can be cumbersome and time-consuming, especially for less e… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  30. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  31. arXiv:2403.05010  [pdf, other

    cs.SD cs.AI eess.AS

    RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction

    Authors: Peng Liu, Dongyang Dai, Zhiyong Wu

    Abstract: Recent advancements in generative modeling have significantly enhanced the reconstruction of audio waveforms from various representations. While diffusion models are adept at this task, they are hindered by latency issues due to their operation at the individual sample point level and the need for numerous sampling steps. In this study, we introduce RFWave, a cutting-edge multi-band Rectified Flow… ▽ More

    Submitted 2 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  32. arXiv:2403.05006  [pdf, ps, other

    cs.LG cs.AI stat.ME stat.ML

    Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

    Authors: Huiying Zhong, Zhun Deng, Weijie J. Su, Zhiwei Steven Wu, Linjun Zhang

    Abstract: Reinforcement learning with human feedback (RLHF) is an emerging paradigm to align models with human preferences. Typically, RLHF aggregates preferences from multiple individuals who have diverse viewpoints that may conflict with each other. Our work \textit{initiates} the theoretical study of multi-party RLHF that explicitly models the diverse preferences of multiple individuals. We show how trad… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  33. arXiv:2403.04374  [pdf, other

    eess.SY cs.AI

    Model-Free Load Frequency Control of Nonlinear Power Systems Based on Deep Reinforcement Learning

    Authors: Xiaodi Chen, Meng Zhang, Zhengguang Wu, Ligang Wu, Xiaohong Guan

    Abstract: Load frequency control (LFC) is widely employed in power systems to stabilize frequency fluctuation and guarantee power quality. However, most existing LFC methods rely on accurate power system modeling and usually ignore the nonlinear characteristics of the system, limiting controllers' performance. To solve these problems, this paper proposes a model-free LFC method for nonlinear power systems b… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  34. arXiv:2403.04293  [pdf, other

    cs.AI cs.CR

    MKF-ADS: Multi-Knowledge Fusion Based Self-supervised Anomaly Detection System for Control Area Network

    Authors: Pengzhou Cheng, Zongru Wu, Gongshen Liu

    Abstract: Control Area Network (CAN) is an essential communication protocol that interacts between Electronic Control Units (ECUs) in the vehicular network. However, CAN is facing stringent security challenges due to innate security risks. Intrusion detection systems (IDSs) are a crucial safety component in remediating Vehicular Electronics and Systems vulnerabilities. However, existing IDSs fail to identif… ▽ More

    Submitted 14 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 14 figures, 5 tables

  35. arXiv:2403.04184  [pdf, other

    cs.SI cs.CY

    Exploring the Impact of Opinion Polarization on Short Video Consumption

    Authors: Bangde Du, Ziyi Ye, Zhi**g Wu, Qingyao Ai, Yiqun Liu

    Abstract: Investigating the increasingly popular domain of short video consumption, this study focuses on the impact of Opinion Polarization (OP), a significant factor in the digital landscape influencing public opinions and social interactions. We analyze OP's effect on viewers' perceptions and behaviors, finding that traditional feedback metrics like likes and watch time fail to fully capture and measure… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 9 pages, 8 figures

    MSC Class: 92C55 ACM Class: H.5.2; K.4.2; J.4

  36. arXiv:2403.03946  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Pressure-enhanced $f$-electron orbital weighting in UTe2 mapped by quantum interferometry

    Authors: T. I. Weinberger, Z. Wu, A. J. Hickey, D. E. Graf, G. Li, P. Wang, R. Zhou, A. Cabala, J. Pu, V. Sechovsky, M. Valiska, G. G. Lonzarich, F. M. Grosche, A. G. Eaton

    Abstract: The phase landscape of UTe$_2$ features a remarkable diversity of superconducting phases under applied pressure and magnetic field. Recent quantum oscillation studies at ambient pressure have revealed the quasi-2D Fermi surface of this material. However, the pressure-dependence of the Fermi surface remains an open question. Here we track the evolution of the UTe$_2$ Fermi surface as a function of… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  37. arXiv:2403.03815  [pdf, ps, other

    astro-ph.GA

    GMRT observations of OHM candidates from the ALFALFA survey

    Authors: Shouzhi Wang, Zhongzu Wu, Bo Zhang, Yu. Sotnikova, T. Mufakharov, Zhiqiang Shen, Yongjun Chen, Jianfeng Wu

    Abstract: We present the results of our observations using the Giant Meterwave Radio Telescope (GMRT) to investigate the radio continuum and OH line emission of 10 OHM candidates from the Arecibo Legacy Fast ALFA (ALFALFA) survey. Among these candidates, we have identified two sources, AGC115713 and AGC249507, which display compact OH line emission that are spatially associated with radio continuum emission… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by A&A

  38. arXiv:2403.03552  [pdf, other

    cs.GT cs.LG cs.MA eess.SY

    Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning

    Authors: Zida Wu, Mathieu Lauriere, Samuel Jia Cong Chua, Matthieu Geist, Olivier Pietquin, Ankur Mehta

    Abstract: Mean Field Games (MFGs) have the ability to handle large-scale multi-agent systems, but learning Nash equilibria in MFGs remains a challenging task. In this paper, we propose a deep reinforcement learning (DRL) algorithm that achieves population-dependent Nash equilibrium without the need for averaging or sampling from history, inspired by Munchausen RL and Online Mirror Descent. Through the desig… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  39. arXiv:2403.03500  [pdf, other

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  40. arXiv:2403.03329  [pdf, other

    cs.CL

    Guardrail Baselines for Unlearning in LLMs

    Authors: Pratiksha Thaker, Yash Maurya, Shengyuan Hu, Zhiwei Steven Wu, Virginia Smith

    Abstract: Recent work has demonstrated that finetuning is a promising approach to 'unlearn' concepts from large language models. However, finetuning can be expensive, as it requires both generating a set of examples and running iterations of finetuning to update the model. In this work, we show that simple guardrail-based approaches such as prompting and filtering can achieve unlearning results comparable t… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Preliminary work, accepted to ICLR workshop SeT-LLM 2024

  41. arXiv:2403.03217  [pdf, other

    cs.CV

    Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion

    Authors: Meng Zheng, Benjamin Planche, Xuan Gong, Fan Yang, Terrence Chen, Ziyan Wu

    Abstract: 3D patient body modeling is critical to the success of automated patient positioning for smart medical scanning and operating rooms. Existing CNN-based end-to-end patient modeling solutions typically require a) customized network designs demanding large amount of relevant training data, covering extensive realistic clinical scenarios (e.g., patient covered by sheets), which leads to suboptimal gen… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: MICCAI 2022

  42. arXiv:2403.03100  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

    Authors: Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, **yu Li, Sheng Zhao

    Abstract: While recent large-scale text-to-speech (TTS) models have achieved significant progress, they still fall short in speech quality, similarity, and prosody. Considering speech intricately encompasses various attributes (e.g., content, prosody, timbre, and acoustic details) that pose significant challenges for generation, a natural idea is to factorize speech into individual subspaces representing di… ▽ More

    Submitted 23 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Achieving human-level quality and naturalness on multi-speaker datasets (e.g., LibriSpeech) in a zero-shot way

  43. arXiv:2403.02993  [pdf, other

    cs.AI

    Localized Zeroth-Order Prompt Optimization

    Authors: Wenyang Hu, Yao Shu, Zongmin Yu, Zhaoxuan Wu, Xiangqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low

    Abstract: The efficacy of large language models (LLMs) in understanding and generating natural language has aroused a wide interest in develo** prompt-based methods to harness the power of black-box LLMs. Existing methodologies usually prioritize a global optimization for finding the global optimum, which however will perform poorly in certain tasks. This thus motivates us to re-think the necessity of fin… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  44. arXiv:2403.02535  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Quantum critical fluctuations generate intensely magnetic field-resilient superconductivity in UTe2

    Authors: Z. Wu, T. I. Weinberger, A. J. Hickey, D. V. Chichinadze, D. Shaffer, A. Cabala, H. Chen, M. Long, T. J. Brumm, W. Xie, Y. Lin, Y. Skourski, Z. Zengwei, D. E. Graf, V. Sechovsky, G. G. Lonzarich, 1 M. Valiska, F. M. Grosche, A. G. Eaton

    Abstract: Quantum critical phase boundaries (QCPBs) -- where a continuous phase transition occurs at zero temperature -- have been found to nucleate novel electronic states in a number of strongly correlated materials. Emergent electronic phases, such as unconventional superconductivity, frequently occur in close proximity to a QCPB. However, the antagonism between magnetic field and superconductivity gener… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  45. Ivie: Lightweight Anchored Explanations of Just-Generated Code

    Authors: Litao Yan, Alyssa Hwang, Zhiyuan Wu, Andrew Head

    Abstract: Programming assistants have reshaped the experience of programming into one where programmers spend less time writing and more time critically examining code. In this paper, we explore how programming assistants can be extended to accelerate the inspection of generated code. We introduce an extension to the programming assistant called Ivie, or instantly visible in-situ explanations. When using Iv… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages, 10 figures, to be published in the CHI Conference on Human Factors in Computing Systems (CHI 24)

  46. arXiv:2403.02286  [pdf, other

    cs.DB

    Stage: Query Execution Time Prediction in Amazon Redshift

    Authors: Ziniu Wu, Ryan Marcus, Zhengchun Liu, Parimarjan Negi, Vikram Nathan, Pascal Pfeil, Gaurav Saxena, Mohammad Rahman, Balakrishnan Narayanaswamy, Tim Kraska

    Abstract: Query performance (e.g., execution time) prediction is a critical component of modern DBMSes. As a pioneering cloud data warehouse, Amazon Redshift relies on an accurate execution time prediction for many downstream tasks, ranging from high-level optimizations, such as automatically creating materialized views, to low-level tasks on the critical path of query execution, such as admission, scheduli… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 15 pages

  47. arXiv:2403.02265  [pdf, other

    cs.CV cs.GR

    DaReNeRF: Direction-aware Representation for Dynamic Scenes

    Authors: Ange Lou, Benjamin Planche, Zhongpai Gao, Yamin Li, Tianyu Luan, Hao Ding, Terrence Chen, Jack Noble, Ziyan Wu

    Abstract: Addressing the intricate challenge of modeling and re-rendering dynamic scenes, most recent approaches have sought to simplify these complexities using plane-based explicit representations, overcoming the slow training time issues associated with methods like Neural Radiance Fields (NeRF) and implicit representations. However, the straightforward decomposition of 4D dynamic scenes into multiple 2D… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024. Paper + supplementary material

  48. arXiv:2403.02177  [pdf, other

    cs.CL

    ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context

    Authors: Zirui Wu, Yansong Feng

    Abstract: Tables play a crucial role in conveying information in various domains. We propose a Plan-then-Reason framework to answer different types of user queries over tables with sentence context. The framework first plans the reasoning paths over the context, then assigns each step to program-based or textual reasoning to reach the final answer. This framework enhances the table reasoning abilities for b… ▽ More

    Submitted 1 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  49. arXiv:2403.01937  [pdf, other

    hep-ph

    Examining the critical phenomenon of pion parton distribution: Insights from the Moment Problem

    Authors: Xiaobin Wang, Zexin Wu, Minghui Ding, Lei Chang

    Abstract: A recent study by Wang {\it et al.}(arXiv:2309.01417) proposed a novel connection between the nature of the parton distribution function (PDF) and the characteristics of its moments. In this study, we apply these findings to analyze the evolution of the pion valence quark PDF, garnering valuable qualitative insights. Firstly, we validate the non-negativity and continuity of the PDF across a wide r… ▽ More

    Submitted 7 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures

  50. arXiv:2403.01867  [pdf, other

    cs.LO

    Deciding Separation Logic with Pointer Arithmetic and Inductive Definitions

    Authors: Wanyun Su, Zhilin Wu, Mihaela Sighireanu

    Abstract: Pointer arithmetic is widely used in low-level programs, e.g. memory allocators. The specification of such programs usually requires using pointer arithmetic inside inductive definitions to define the common data structures, e.g. heap lists in memory allocators. In this work, we investigate decision problems for SLAH, a separation logic fragment that allows pointer arithmetic inside inductive defi… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.