Showing 201–250 of 5,199 results for author: Xue, Z

  1. arXiv:2405.02341  [pdf, other]

    cs.CR cs.LG

    Improved Communication-Privacy Trade-offs in $L_2$ Mean Estimation under Streaming Differential Privacy

    Authors: Wei-Ning Chen, Berivan Isik, Peter Kairouz, Albert No, Sewoong Oh, Zheng Xu

    Abstract: We study $L_2$ mean estimation under central differential privacy and communication constraints, and address two key challenges: firstly, existing mean estimation schemes that simultaneously handle both constraints are usually optimized for $L_\infty$ geometry and rely on random rotation or Kashin's representation to adapt to $L_2$ geometry, resulting in suboptimal leading constants in mean square…

    Submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2405.01615  [pdf, other]

    cs.NE cs.LG

    Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning

    Authors: Chengqian Gao, William de Vazelhes, Hualin Zhang, Bin Gu, Zhiqiang Xu

    Abstract: Evolution Strategies (ES) have emerged as a competitive alternative for model-free reinforcement learning, showcasing exemplary performance in tasks like MuJoCo and Atari. Notably, they shine in scenarios with imperfect reward functions, making them invaluable for real-world applications where dense reward signals may be elusive. Yet, an inherent assumption in ES, that all input features are task-…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 16 pages, including proofs in the appendix

  3. arXiv:2405.01607  [pdf, other]

    cs.LG cs.CV

    Wildfire Risk Prediction: A Review

    Authors: Zhengsen Xu, Jonathan Li, Linlin Xu

    Abstract: Wildfires have significant impacts on global vegetation, wildlife, and humans. They destroy plant communities and wildlife habitats and contribute to increased emissions of carbon dioxide, nitrogen oxides, methane, and other pollutants. The prediction of wildfires relies on various independent variables combined with regression or machine learning methods. In this technical review, we describe the…

    Submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2405.01319  [pdf, other]

    cs.LG cs.CE

    Data Scoping: Effectively Learning the Evolution of Generic Transport PDEs

    Authors: Jiangce Chen, Wenzhuo Xu, Zeda Xu, Noelia Grande Gutiérrez, Sneha Prabha Narra, Christopher McComb

    Abstract: Transport phenomena (e.g., fluid flows) are governed by time-dependent partial differential equations (PDEs) describing mass, momentum, and energy conservation, and are ubiquitous in many engineering applications. However, deep learning architectures are fundamentally incompatible with the simulation of these PDEs. This paper clearly articulates and then solves this incompatibility. The local-depe…

    Submitted 2 May, 2024; originally announced May 2024.

  5. arXiv:2405.01041  [pdf, other]

    cs.LG

    Efficient and Flexible Method for Reducing Moderate-size Deep Neural Networks with Condensation

    Authors: Tianyi Chen, Zhi-Qin John Xu

    Abstract: Neural networks have been extensively applied to a variety of tasks, achieving astounding results. Applying neural networks in the scientific field is an important research direction that is gaining increasing attention. In scientific applications, neural networks are generally moderate in size, mainly to ensure the speed of inference during application. Additionally, comparing neural net…

    Submitted 1 July, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  6. arXiv:2405.00987  [pdf, other]

    cs.LG

    S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic

    Authors: Safa Messaoud, Billel Mokeddem, Zhenghai Xue, Linsey Pang, Bo An, Haipeng Chen, Sanjay Chawla

    Abstract: Learning expressive stochastic policies instead of deterministic ones has been proposed to achieve better stability, sample complexity, and robustness. Notably, in Maximum Entropy Reinforcement Learning (MaxEnt RL), the policy is modeled as an expressive Energy-Based Model (EBM) over the Q-values. However, this formulation requires the estimation of the entropy of such EBMs, which is an open probl…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted for publication at ICLR 2024

  7. arXiv:2405.00317  [pdf, other]

    math.DS physics.comp-ph

    Input gradient annealing neural network for solving low-temperature Fokker-Planck equations

    Authors: Liangkai Hang, Dan Hu, Zhi-Qin John Xu

    Abstract: We present a novel yet simple deep learning approach, called input gradient annealing neural network (IGANN), for solving stationary Fokker-Planck equations. Traditional methods, such as finite difference and finite elements, suffer from the curse of dimensionality. Neural-network-based algorithms are meshless methods, which can avoid the curse of dimensionality. However, at low temperature, when…

    Submitted 1 May, 2024; originally announced May 2024.

  8. arXiv:2405.00098  [pdf, other]

    hep-ex

    Amplitude analysis and branching fraction measurement of $B^{+}\to D^{*-}D^{+}_{s}π^{+}$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis, et al. (1057 additional authors not shown)

    Abstract: The decays of the $B^{+}$ meson to the final state $D^{*-}D^{+}_{s}π^{+}$ are studied in proton-proton collision data collected with the LHCb detector at centre-of-mass energies of 7, 8, and 13 TeV, corresponding to a total integrated luminosity of 9 fb$^{-1}$. The ratio of branching fractions of the $B^{+}\to D^{*-}D^{+}_{s}π^{+}$ and $B^{0}\to D^{*-}D^{+}_{s}$ decays is measured to be…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-001.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-001, CERN-EP-2024-110

  9. arXiv:2404.19702  [pdf, other]

    cs.CV

    GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting

    Authors: Kai Zhang, Sai Bi, Hao Tan, Yuanbo Xiangli, Nanxuan Zhao, Kalyan Sunkavalli, Zexiang Xu

    Abstract: We propose GS-LRM, a scalable large reconstruction model that can predict high-quality 3D Gaussian primitives from 2-4 posed sparse images in 0.23 seconds on a single A100 GPU. Our model features a very simple transformer-based architecture; we patchify input posed images, pass the concatenated multi-view image tokens through a sequence of transformer blocks, and decode final per-pixel Gaussian para…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Project webpage: https://sai-bi.github.io/project/gs-lrm/

  10. arXiv:2404.19534  [pdf, other]

    cs.CV

    MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results

    Authors: Yuekun Dai, Dafeng Zhang, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Peiqing Yang, Zhezhu **, Guanqun Liu, Chen Change Loy, Lize Zhang, Shuai Liu, Chaoyu Feng, Luyang Wang, Shuan Chen, Guangqi Shao, Xiaotao Wang, Lei Lei, Qirui Yang, Qihua Cheng, Zhiqiang Xu, Yihao Liu, Huanjing Yue, Jingyu Yang, et al. (38 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…

    Submitted 27 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Nighttime Flare Removal Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  11. arXiv:2404.19510  [pdf, other]

    hep-ex

    First observation of $Λ_{b}^{0} \rightarrow Σ_c^{(*)++} D^{(*)-} K^{-}$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis, et al. (1067 additional authors not shown)

    Abstract: The four decays, $Λ_{b}^{0} \rightarrow Σ_c^{(*)++} D^{(*)-} K^{-}$, are observed for the first time using proton-proton collision data collected with the LHCb detector at a centre-of-mass energy of $13\,\rm{TeV}$, corresponding to an integrated luminosity of $6\,\rm{fb}^{-1}$. By considering the $Λ_b^0 \rightarrow Λ_c^{+} \overline{D}^0 K^{-}$ decay as reference channel, the following branching f…

    Submitted 11 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-044.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-044, CERN-EP-2024-098

  12. arXiv:2404.19403  [pdf, other]

    cs.RO cs.AI

    Transformer-Enhanced Motion Planner: Attention-Guided Sampling for State-Specific Decision Making

    Authors: Lei Zhuang, Jingdong Zhao, Yuntao Li, Zichun Xu, Liangliang Zhao, Hong Liu

    Abstract: Sampling-based motion planning (SBMP) algorithms are renowned for their robust global search capabilities. However, the inherent randomness in their sampling mechanisms often results in inconsistent path quality and limited search efficiency. In response to these challenges, this work proposes a novel deep learning-based motion planning framework, named Transformer-Enhanced Motion Planner (TEMP), w…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  13. arXiv:2404.19097  [pdf, other]

    cs.HC

    Exploring the Capability of LLMs in Performing Low-Level Visual Analytic Tasks on SVG Data Visualizations

    Authors: Zhongzheng Xu, Emily Wall

    Abstract: Data visualizations help extract insights from datasets, but reaching these insights requires decomposing high-level goals into low-level analytic tasks that can be complex due to varying degrees of data literacy and visualization experience. Recent advancements in large language models (LLMs) have shown promise for lowering barriers for users to achieve tasks such as writing code and may likewise…

    Submitted 30 April, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  14. arXiv:2404.18886  [pdf, other]

    cs.LG cs.AI

    A Survey on Diffusion Models for Time Series and Spatio-Temporal Data

    Authors: Yiyuan Yang, Ming Jin, Haomin Wen, Chaoli Zhang, Yuxuan Liang, Lintao Ma, Yi Wang, Chenghao Liu, Bin Yang, Zenglin Xu, Jiang Bian, Shirui Pan, Qingsong Wen

    Abstract: The study of time series is crucial for understanding trends and anomalies over time, enabling predictive insights across various sectors. Spatio-temporal data, on the other hand, is vital for analyzing phenomena in both space and time, providing a dynamic perspective on complex system interactions. Recently, diffusion models have seen widespread application in time series and spatio-temporal data…

    Submitted 11 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Ongoing work & Under review; 27 pages, 8 figures, 2 tables; Github Repo: https://github.com/yyysjz1997/Awesome-TimeSeries-SpatioTemporal-Diffusion-Model

  15. arXiv:2404.18814  [pdf, ps, other]

    cs.CR

    Belt and Brace: When Federated Learning Meets Differential Privacy

    Authors: Xuebin Ren, Shusen Yang, Cong Zhao, Julie McCann, Zongben Xu

    Abstract: Federated learning (FL) has great potential for large-scale machine learning (ML) without exposing raw data. Differential privacy (DP) is the de facto standard of privacy protection with provable guarantees. Advances in ML suggest that DP would be a perfect fit for FL with comprehensive privacy preservation. Hence, extensive efforts have been devoted to achieving practically usable FL with DP, which…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures, accepted by and to appear in Communications of the ACM (CACM)

  16. arXiv:2404.18563  [pdf, other]

    astro-ph.SR

    Cool matter distribution in inner solar corona from 2023 total solar eclipse observation

    Authors: Z. Q. Qu, H. Su, Y. Liang, Z. Xu, R. Y. Zhou

    Abstract: The solar corona is widely accepted to consist of free electrons and highly ionized ions at extremely high temperature. Our eclipse observations change this view. Distributions of cool matter represented by neutral iron atoms in the hot inner solar corona are presented via derived global maps of solar Fraunhofer (F-) and Emission (E-) coronae, compared with those of cont…

    Submitted 29 April, 2024; originally announced April 2024.

  17. arXiv:2404.18533  [pdf, other]

    cs.AI cs.HC

    Evaluating Concept-based Explanations of Language Models: A Study on Faithfulness and Readability

    Authors: Meng Li, Haoran Jin, Ruixuan Huang, Zhihao Xu, Defu Lian, Zijia Lin, Di Zhang, Xiting Wang

    Abstract: Despite the surprisingly high intelligence exhibited by Large Language Models (LLMs), we remain hesitant to fully deploy them in real-life applications given their black-box nature. Concept-based explanations arise as a promising avenue for explaining what the LLMs have learned, making them more transparent to humans. However, current evaluations for concepts tend to be heuristic a…

    Submitted 29 April, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  18. arXiv:2404.18338  [pdf, other]

    math.NA

    An extension of the box method discrete fracture model (Box-DFM) to include low-permeable barriers with minimal additional degrees of freedom

    Authors: Ziyao Xu, Dennis Gläser

    Abstract: The box method discrete fracture model (Box-DFM) is an important finite volume-based discrete fracture model (DFM) to simulate flows in fractured porous media. In this paper, we investigate a simple but effective extension of the box method discrete fracture model to include low-permeable barriers. The method remains identical to the traditional Box-DFM [41, 48] in the absence of barriers. The inc…

    Submitted 28 April, 2024; originally announced April 2024.

  19. arXiv:2404.18337  [pdf, ps, other]

    cs.DS

    Additive Spanner Lower Bounds with Optimal Inner Graph Structure

    Authors: Greg Bodwin, Gary Hoppenworth, Virginia Vassilevska Williams, Nicole Wein, Zixuan Xu

    Abstract: We construct $n$-node graphs on which any $O(n)$-size spanner has additive error at least $+Ω(n^{3/17})$, improving on the previous best lower bound of $Ω(n^{1/7})$ [Bodwin-Hoppenworth FOCS '22]. Our construction completes the first two steps of a particular three-step research program, introduced in prior work and overviewed here, aimed at producing tight bounds for the problem by aligning aspect…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: ICALP 2024

  20. arXiv:2404.18200  [pdf, other]

    q-fin.MF

    Mean Field Game of High-Frequency Anticipatory Trading

    Authors: Xue Cheng, Meng Wang, Ziyi Xu

    Abstract: The interactions between a large population of high-frequency traders (HFTs) and a large trader (LT) who executes a certain amount of assets at discrete time points are studied. HFTs are faster in the sense that they trade continuously and predict the transactions of LT. A jump process is applied to model the transition of HFTs' attitudes towards inventories and the equilibrium is solved through t…

    Submitted 28 April, 2024; originally announced April 2024.

  21. arXiv:2404.17809  [pdf, other]

    cs.CL cs.AI

    Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction

    Authors: Guozheng Li, Peng Wang, Wenjun Ke, Yikai Guo, Ke Ji, Ziyu Shang, Jiajun Liu, Zijie Xu

    Abstract: Relation extraction (RE) aims to identify relations between entities mentioned in texts. Although large language models (LLMs) have demonstrated impressive in-context learning (ICL) abilities in various tasks, they still suffer from poor performance compared to most supervised fine-tuned RE methods. Utilizing ICL for RE with LLMs encounters two challenges: (1) retrieving good demonstrations from…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  22. arXiv:2404.17807  [pdf, other]

    cs.CL cs.AI

    Meta In-Context Learning Makes Large Language Models Better Zero and Few-Shot Relation Extractors

    Authors: Guozheng Li, Peng Wang, Jiajun Liu, Yikai Guo, Ke Ji, Ziyu Shang, Zijie Xu

    Abstract: Relation extraction (RE) is an important task that aims to identify the relationships between entities in texts. While large language models (LLMs) have revealed remarkable in-context learning (ICL) capability for general zero and few-shot learning, recent studies indicate that current LLMs still struggle with zero and few-shot RE. Previous studies are mainly dedicated to designing prompt formats and…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  23. arXiv:2404.17802  [pdf, other]

    cs.CL cs.AI

    Empirical Analysis of Dialogue Relation Extraction with Large Language Models

    Authors: Guozheng Li, Zijie Xu, Ziyu Shang, Jiajun Liu, Ke Ji, Yikai Guo

    Abstract: Dialogue relation extraction (DRE) aims to extract relations between two arguments within a dialogue, which is more challenging than standard RE due to the higher person pronoun frequency and lower information density in dialogues. However, existing DRE methods still suffer from two serious issues: (1) difficulty capturing long and sparse multi-turn information, and (2) difficulty extracting golden relat…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: IJCAI 2024

  24. arXiv:2404.17780  [pdf, other]

    cs.MA cs.AI

    Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning

    Authors: Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, Qingwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, Guoliang Fan

    Abstract: In recent years, multi-agent reinforcement learning algorithms have made significant advancements in diverse gaming environments, leading to increased interest in the broader application of such techniques. To address the prevalent challenge of partial observability, communication-based algorithms have improved cooperative performance through the sharing of numerical embedding between agents. Howe…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  25. arXiv:2404.17723  [pdf, other]

    cs.IR cs.AI cs.CL cs.LG

    Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering

    Authors: Zhentao Xu, Mark Jerome Cruz, Matthew Guevara, Tie Wang, Manasi Deshpande, Xiaofeng Wang, Zheng Li

    Abstract: In customer service technical support, swiftly and accurately retrieving relevant past issues is critical for efficiently resolving customer inquiries. The conventional retrieval methods in retrieval-augmented generation (RAG) for large language models (LLMs) treat a large corpus of past issue tracking tickets as plain text, ignoring the crucial intra-issue structure and inter-issue relations, whi…

    Submitted 6 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    ACM Class: I.2

  26. arXiv:2404.17571  [pdf, other]

    cs.CV

    Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

    Authors: Zhengze Xu, Mengting Chen, Zhao Wang, Linyu Xing, Zhonghua Zhai, Nong Sang, Jinsong Lan, Shuai Xiao, Changxin Gao

    Abstract: Video try-on is a challenging task and has not been well tackled in previous works. The main obstacle lies in preserving the details of the clothing and modeling the coherent motions simultaneously. Faced with those difficulties, we address video try-on by proposing a diffusion-based framework named "Tunnel Try-on." The core idea is excavating a "focus tunnel" in the input video that gives close-u…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Project Page: https://mengtingchen.github.io/tunnel-try-on-page/

  27. arXiv:2404.17567  [pdf, other]

    astro-ph.CO

    Sensitivity-Improved Polarization Maps at 40 GHz with CLASS and WMAP data

    Authors: Rui Shi, John W. Appel, Charles L. Bennett, Ricardo Bustos, David T. Chuss, Sumit Dahal, Jullianna Denes Couto, Joseph R. Eimer, Thomas Essinger-Hileman, Kathleen Harrington, Jeffrey Iuliano, Yunyang Li, Tobias A. Marriage, Matthew A. Petroff, Karwan Rostem, Zeya Song, Deniz A. N. Valle, Duncan J. Watts, Janet L. Weiland, Edward J. Wollack, Zhilei Xu

    Abstract: Improved polarization measurements at frequencies below 70 GHz with degree-level angular resolution are crucial for advancing our understanding of the Galactic synchrotron radiation and the potential polarized anomalous microwave emission and ultimately benefiting the detection of primordial $B$ modes. In this study, we present sensitivity-improved 40 GHz polarization maps obtained by combining th…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 19 pages, 16 figures, 1 table

  28. arXiv:2404.17275  [pdf, other]

    cs.CV cs.LG

    Adversarial Reweighting with $α$-Power Maximization for Domain Adaptation

    Authors: Xiang Gu, Xi Yu, Yan Yang, Jian Sun, Zongben Xu

    Abstract: Practical Domain Adaptation (DA) tasks, e.g., Partial DA (PDA), open-set DA, universal DA, and test-time adaptation, have gained increasing attention in the machine learning community. In this paper, we propose a novel approach, dubbed Adversarial Reweighting with $α$-Power Maximization (ARPM), for PDA where the source domain contains private classes absent in the target domain. In ARPM, we propos…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: To appear in IJCV

  29. arXiv:2404.16824  [pdf, other]

    cs.CV

    V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

    Authors: Xuanyu Zhang, Youmin Xu, Runyi Li, Jiwen Yu, Weiqi Li, Zhipei Xu, Jian Zhang

    Abstract: AI-generated video has revolutionized short video production, filmmaking, and personalized media, making video local editing an essential tool. However, this progress also blurs the line between reality and fiction, posing challenges in multimedia forensics. To solve this urgent issue, V2A-Mark is proposed to address the limitations of current video tampering forensics, such as poor generalizabili…

    Submitted 15 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  30. arXiv:2404.16789  [pdf, other]

    cs.LG cs.AI cs.CL

    Continual Learning of Large Language Models: A Comprehensive Survey

    Authors: Haizhou Shi, Zihao Xu, Hengyi Wang, Weiyi Qin, Wenyuan Wang, Yibin Wang, Zifeng Wang, Sayna Ebrahimi, Hao Wang

    Abstract: The recent success of large language models (LLMs) trained on static, pre-collected, general datasets has sparked numerous research directions and applications. One such direction addresses the non-trivial challenge of integrating pre-trained LLMs into dynamic data distributions, task structures, and user preferences. Pre-trained LLMs, when tailored for specific needs, often experience significant…

    Submitted 29 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 47 pages, 2 figures, 4 tables. Work in progress

  31. arXiv:2404.16770  [pdf, other]

    cond-mat.str-el

    Pseudogap phase as fluctuating pair density wave

    Authors: Zheng-Yuan Yue, Zheng-Tao Xu, Shuo Yang, Zheng-Cheng Gu

    Abstract: The physical nature of the pseudogap phase is one of the most important and intriguing problems towards understanding the key mechanism of high temperature superconductivity in cuprates. Theoretically, the square-lattice $t$-$J$ model is widely believed to be the simplest toy model that captures the essential physics of cuprate superconductors. We employ the Grassmann tensor product state approach to…

    Submitted 15 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 10 pages, 13 figures, references added

  32. arXiv:2404.16469  [pdf, ps, other]

    cond-mat.supr-con

    From weak to strong-coupling superconductivity tuned by substrate in TiN films

    Authors: Yixin Liu, Zulei Xu, Aobo Yu, Xiaoni Wang, Wei Peng, Yu Wu, Gang Mu, Zhi-Rong Lin

    Abstract: The interplay between substrates and superconducting thin films has attracted increasing attention. Here, we report an in-depth investigation of the superconducting properties of epitaxial TiN thin films grown on two different substrates by dc reactive magnetron sputtering. The TiN films grown on (0001) sapphire exhibit (111) crystal orientation, while those grown on (100) Si substrates exhibit (10…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 figures

  33. arXiv:2404.16425  [pdf, other]

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja, et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  34. arXiv:2404.16349  [pdf, ps, other]

    cs.DS cs.CC

    More Asymmetry Yields Faster Matrix Multiplication

    Authors: Josh Alman, Ran Duan, Virginia Vassilevska Williams, Yinzhan Xu, Zixuan Xu, Renfei Zhou

    Abstract: We present a new improvement on the laser method for designing fast matrix multiplication algorithms. The new method further develops the recent advances by [Duan, Wu, Zhou FOCS 2023] and [Vassilevska Williams, Xu, Xu, Zhou SODA 2024]. Surprisingly, the new improvement is achieved by incorporating more asymmetry in the analysis, circumventing a fundamental tool of prior work that requires two of th…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 44 pages. arXiv admin note: text overlap with arXiv:2307.07970

  35. arXiv:2404.16343  [pdf, other]

    astro-ph.GA astro-ph.HE

    Magnetically Driven Relativistic Jet in the High-Redshift Blazar OH 471

    Authors: S. Guo, T. An, Y. Liu, Y. Sotnikova, A. Volvach, T. Mufakharov, L. Chen, L. Cui, A. Wang, Z. Xu, Y. Zhang, W. Xu, Y. A. Kovalev, Y. Y. Kovalev, M. Kharinov, A. Erkenov, T. Semenova, L. Volvach

    Abstract: Context: Understanding the mechanisms that launch and shape powerful relativistic jets from supermassive black holes (SMBHs) in high-redshift active galactic nuclei (AGN) is crucial for probing the co-evolution of SMBHs and galaxies over cosmic time. Aims: We study the high-redshift ($z=3.396$) blazar OH 471 to explore the jet launching mechanism in the early Universe. Methods: Using multi-f…

    Submitted 20 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 16 pages, 7 figures, 3 tables

    Journal ref: A&A 685, L11 (2024)

  36. Single-Atom Verification of the Optimal Trade-Off between Speed and Cost in Shortcuts to Adiabaticity

    Authors: J. -W. Zhang, J. -T. Bu, J. C. Li, Weiquan Meng, W. -Q. Ding, B. Wang, W. -F. Yuan, H. -J. Du, G. -Y. Ding, W. -J. Chen, L. Chen, F. Zhou, Zhenyu Xu, M. Feng

    Abstract: The approach of shortcuts to adiabaticity enables the effective execution of adiabatic dynamics in quantum information processing with enhanced speed. Owing to the inherent trade-off between dynamical speed and the cost associated with the transitionless driving field, executing arbitrarily fast operations becomes impractical. To understand the accurate interplay between speed and energetic cost i…

    Submitted 6 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 6+5 pages, 3+3 figures

    Journal ref: Phys. Rev. Lett. 132, 213602 (2024)

  37. arXiv:2404.14963  [pdf, other]

    cs.CL cs.AI

    Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems

    Authors: Qihuang Zhong, Kang Wang, Ziyang Xu, Juhua Liu, Liang Ding, Bo Du, Dacheng Tao

    Abstract: Chain-of-Thought (CoT) prompting has enhanced the performance of Large Language Models (LLMs) across various reasoning tasks. However, CoT still falls short in dealing with complex math word problems, as it usually suffers from three pitfalls: semantic misunderstanding errors, calculation errors and step-missing errors. Prior studies involve addressing the calculation errors and step-missing error…

    Submitted 29 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Work in progress

  38. arXiv:2404.13840  [pdf, other]

    hep-ex

    Study of $e^+e^-\toωX(3872)$ and $γX(3872)$ from 4.66 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: Using data samples with an integrated luminosity of $4.5~\text{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies ranging from 4.66 to 4.95 GeV, we study the processes of $e^+e^-\toωX(3872)$ and $e^+e^-\toγX(3872)$. With the $e^+e^-\toωX(3872)$ process, the branching fraction ratio $R\equiv\frac{\mathcal{B}(X(3872)\toγJ/ψ)}{\mathcal{B}(X(3872)\toπ^+π^- J/ψ)}$ is measured to be…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 19 pages, 10 figures

  39. arXiv:2404.13599  [pdf, other]

    cs.CL

    "A good pun is its own reword": Can Large Language Models Understand Puns?

    Authors: Zhijun Xu, Siyu Yuan, Lingjie Chen, Deqing Yang

    Abstract: Puns play a vital role in academic research due to their distinct structure and clear definition, which aid in the comprehensive analysis of linguistic humor. However, the understanding of puns in large language models (LLMs) has not been thoroughly examined, limiting their use in creative writing and humor creation. In this paper, we leverage three popular tasks, i.e., pun recognition, explanatio…

    Submitted 16 June, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  40. arXiv:2404.13445  [pdf, other]

    cs.CV cs.GR

    DMesh: A Differentiable Mesh Representation

    Authors: Sanghyun Son, Matheus Gadelha, Yang Zhou, Zexiang Xu, Ming C. Lin, Yi Zhou

    Abstract: We present a differentiable representation, DMesh, for general 3D triangular meshes. DMesh considers both the geometry and connectivity information of a mesh. In our design, we first get a set of convex tetrahedra that compactly tessellates the domain based on Weighted Delaunay Triangulation (WDT), and select triangular faces on the tetrahedra to define the final mesh. We formulate probability of…

    Submitted 1 June, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: 35 pages, 22 figures. Updated with more analysis and experimental results

  41. arXiv:2404.12861  [pdf, other]

    cs.CV

    Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation

    Authors: Yilong Chen, Zongyi Xu, Xiaoshui Huang, Ruicheng Zhang, Xinqi Jiang, Xinbo Gao

    Abstract: Current point cloud semantic segmentation has achieved great advances when given sufficient labels. However, the dense annotation of LiDAR point clouds remains prohibitively expensive and time-consuming, unable to keep up with the continuously growing volume of data. In this paper, we propose annotating images with scattered points, followed by utilizing SAM (a Foundation model) to generate semant…

    Submitted 19 April, 2024; originally announced April 2024.

  42. arXiv:2404.12524  [pdf, other]

    cs.CV cs.LG cs.RO

    DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects

    Authors: Dominik Bauer, Zhenjia Xu, Shuran Song

    Abstract: Manipulation of elastoplastic objects like dough often involves topological changes such as splitting and merging. The ability to accurately predict these topological changes that a specific action might incur is critical for planning interactions with elastoplastic objects. We present DoughNet, a Transformer-based architecture for handling these challenges, consisting of two components. First, a…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: Under review. 17 pages, 14 figures

  43. arXiv:2404.12385  [pdf, other]

    cs.CV cs.GR

    MeshLRM: Large Reconstruction Model for High-Quality Mesh

    Authors: Xinyue Wei, Kai Zhang, Sai Bi, Hao Tan, Fujun Luan, Valentin Deschaintre, Kalyan Sunkavalli, Hao Su, Zexiang Xu

    Abstract: We propose MeshLRM, a novel LRM-based approach that can reconstruct a high-quality mesh from merely four input images in less than one second. Different from previous large reconstruction models (LRMs) that focus on NeRF-based reconstruction, MeshLRM incorporates differentiable mesh extraction and rendering within the LRM framework. This allows for end-to-end mesh reconstruction by fine-tuning a p…

    Submitted 18 April, 2024; originally announced April 2024.

  44. arXiv:2404.12242  [pdf, other]

    cs.CL

    CMNEE: A Large-Scale Document-Level Event Extraction Dataset based on Open-Source Chinese Military News

    Authors: Mengna Zhu, Zijie Xu, Kaisheng Zeng, Kaiming Xiao, Mao Wang, Wenjun Ke, Hongbin Huang

    Abstract: Extracting structured event knowledge, including event triggers and corresponding arguments, from military texts is fundamental to many applications, such as intelligence analysis and decision assistance. However, event extraction in the military field faces the data scarcity problem, which impedes the research of event extraction models in this domain. To alleviate this problem, we propose CMNEE,…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures, accepted to LREC-COLING 2024

  45. arXiv:2404.12038  [pdf, other]

    cs.CL

    Uncovering Safety Risks of Large Language Models through Concept Activation Vector

    Authors: Zhihao Xu, Ruixuan Huang, Changyu Chen, Shuai Wang, Xiting Wang

    Abstract: Despite careful safety alignment, current large language models (LLMs) remain vulnerable to various attacks. To further unveil the safety risks of LLMs, we introduce a Safety Concept Activation Vector (SCAV) framework, which effectively guides the attacks by accurately interpreting LLMs' safety mechanisms. We then develop an SCAV-guided attack method that can generate both attack prompts and embed…

    Submitted 2 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  46. arXiv:2404.10838  [pdf, other]

    cs.CV cs.CL cs.MM

    Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning

    Authors: Zhengyang Liang, Meiyu Liang, Wei Huang, Yawen Li, Zhe Xue

    Abstract: In recent years, pre-trained multimodal large models have attracted widespread attention due to their outstanding performance in various multimodal applications. Nonetheless, the extensive computational resources and vast datasets required for their training present significant hurdles for deployment in environments with limited computational resources. To address this challenge, we propose a nove…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 10 pages

  47. arXiv:2404.10760  [pdf, other]

    cs.CV

    Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark

    Authors: Jiangning Zhang, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Zhucun Xue, Yong Liu, Guansong Pang, Dacheng Tao

    Abstract: Anomaly detection (AD) is often focused on detecting anomaly areas for industrial quality inspection and medical lesion examination. However, due to the specific scenario targets, the data scale for AD is relatively small, and evaluation metrics are still deficient compared to classic vision tasks, such as object detection and semantic segmentation. To fill these gaps, this work first constructs a…

    Submitted 16 April, 2024; originally announced April 2024.

  48. arXiv:2404.10379  [pdf, ps, other]

    math.CO

    Sublinear hitting sets for some geometric graphs

    Authors: Xinbu Cheng, Zixiang Xu

    Abstract: For an $n$-vertex graph $G$, let $h(G)$ denote the smallest size of a subset of $V(G)$ such that it intersects every maximum independent set of $G$. A conjecture posed by Bollobás, Erdős and Tuza in the early 90s remains widely open, asserting that for any $n$-vertex graph $G$, if the independence number $\alpha(G) = \Omega(n)$, then $h(G) = o(n)$. In this paper, we establish the validity of this conjecture for…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 16 pages

    MSC Class: 05C69

  49. arXiv:2404.09872  [pdf, other]

    cs.CV

    Conditional Prototype Rectification Prompt Learning

    Authors: Haoxing Chen, Yaohui Li, Zizheng Huang, Yan Hong, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang

    Abstract: Pre-trained large-scale vision-language models (VLMs) have acquired profound understanding of general visual concepts. Recent advancements in efficient transfer learning (ETL) have shown remarkable success in fine-tuning VLMs within the scenario of limited data, introducing only a few parameters to harness task-specific insights from VLMs. Despite significant progress, current leading ETL methods…

    Submitted 15 April, 2024; originally announced April 2024.

  50. arXiv:2404.09494  [pdf, ps, other]

    cs.LG

    On the Necessity of Collaboration in Online Model Selection with Decentralized Data

    Authors: Junfan Li, Zenglin Xu, Zheshun Wu, Irwin King

    Abstract: We consider online model selection with decentralized data over $M$ clients, and study the necessity of collaboration among clients. Previous work proposed various federated algorithms without demonstrating their necessity, while we answer the question from a novel perspective of computational constraints. We prove lower bounds on the regret, and propose a federated algorithm and analyze the upper…

    Submitted 21 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.