Showing 251–300 of 2,824 results for author: Xue, L

  1. arXiv:2402.04933  [pdf, other]

    cs.LG stat.AP

    A Bayesian Approach to Online Learning for Contextual Restless Bandits with Applications to Public Health

    Authors: Biyonka Liang, Lily Xu, Aparna Taneja, Milind Tambe, Lucas Janson

    Abstract: Public health programs often provide interventions to encourage beneficiary adherence, and effectively allocating interventions is vital for producing the greatest overall health outcomes. Such resource allocation problems are often modeled as restless multi-armed bandits (RMABs) with unknown underlying transition dynamics, hence requiring online reinforcement learning (RL). We present Bayesian Lea…

    Submitted 27 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 26 pages, 18 figures

  2. arXiv:2402.04871  [pdf, ps, other]

    math.AP math-ph

    Nonlinear Stability of Planar Shock Waves for the 3-D Boltzmann Equation

    Authors: Dingqun Deng, Lingda Xu

    Abstract: This paper studies the stability and large-time behavior of the three-dimensional (3-D) Boltzmann equation near shock profiles. We prove the nonlinear stability of the composite wave consisting of two shock profiles under general perturbations without the assumption of integral zero of macroscopic quantities. To address the challenge caused by the compressibility of shock profiles, we apply the me…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 46 pages, all comments are welcome

    MSC Class: Primary: 35Q20; Secondary 76L05; 76P05; 35L67

  3. arXiv:2402.03944  [pdf, other]

    cs.CV

    IMUSE: IMU-based Facial Expression Capture

    Authors: Youjia Wang, Yiwen Wu, Hengan Zhou, Hongyang Lin, Xingyue Peng, Yingwenqi Jiang, Yingsheng Zhu, Guanpeng Long, Yatu Zhang, Jingya Wang, Lan Xu, Jingyi Yu

    Abstract: For facial motion capture and analysis, the dominant solutions are generally based on visual cues, which cannot protect privacy and are vulnerable to occlusions. Inertial measurement units (IMUs) serve as potential rescues yet are mainly adopted for full-body motion capture. In this paper, we propose IMUSE to fill the gap, a novel path for facial expression capture using purely IMU signals, signi…

    Submitted 12 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: Go to IMUSE project page https://sites.google.com/view/projectpage-imuse and watch our video https://youtu.be/Rki9syHsvpc

  4. arXiv:2402.03829  [pdf, ps, other]

    hep-ex

    Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ and Observation of One Structure between $\sqrt{s} = 3.80-4.95$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (604 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures, 1 table, 1 Supplemental_Material

  5. arXiv:2402.02245  [pdf, other]

    cs.CV cs.LG eess.IV

    Revisiting Generative Adversarial Networks for Binary Semantic Segmentation on Imbalanced Datasets

    Authors: Lei Xu, Moncef Gabbouj

    Abstract: Anomalous crack region detection is a typical binary semantic segmentation task, which aims to detect pixels representing cracks on pavement surface images automatically by algorithms. Although existing deep learning-based methods have achieved outcoming results on specific public pavement datasets, the performance would deteriorate dramatically on imbalanced datasets. The input datasets used in s…

    Submitted 7 March, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  6. arXiv:2402.02146  [pdf, other]

    cs.AI cs.LG cs.NI eess.SP

    Emergency Computing: An Adaptive Collaborative Inference Method Based on Hierarchical Reinforcement Learning

    Authors: Weiqi Fu, Lianming Xu, Xin Wu, Li Wang, Aiguo Fei

    Abstract: In achieving effective emergency response, the timely acquisition of environmental information, seamless command data transmission, and prompt decision-making are crucial. This necessitates the establishment of a resilient emergency communication dedicated network, capable of providing communication and sensing services even in the absence of basic infrastructure. In this paper, we propose an Emer…

    Submitted 3 February, 2024; originally announced February 2024.

  7. arXiv:2402.01993  [pdf, other]

    hep-ex

    Measurement of the Electromagnetic Transition Form-factors in the decays $η'\rightarrowπ^+π^-l^+l^-$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (618 additional authors not shown)

    Abstract: With a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η'\rightarrowπ^+π^-l^+l^-(l=e,$ $μ)$ via the process $J/ψ\rightarrowγη'$. The branching fractions are measured to be $\mathcal{B}(η'\rightarrowπ^+π^-e^+e^-)=(2.45\pm0.02(\rm{stat.})\pm0.08(\rm{syst.})) \times10^{-3}$ and…

    Submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2402.01336  [pdf, other]

    hep-ex

    Measurements of the branching fraction ratio $\cal{B}(φ\to μ^+μ^-)/\cal{B}(φ\to e^+e^-)$ with charm meson decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1080 additional authors not shown)

    Abstract: Measurements of the branching fraction ratio ${\cal{B}(φ\to μ^+ μ^-)/\cal{B}(φ\to e^+e^-)}$ with ${D_{s}^{+} \to π^{+} φ}$ and ${D^{+} \to π^{+} φ}$ decays, denoted $R^{s}_{φπ}$ and $R^{d}_{φπ}$, are presented. The analysis is performed using a dataset corresponding to an integrated luminosity of 5.4$\,\rm{fb}^{-1}$ of $pp$ collision data collected with the LHCb experiment. The branching fractions…

    Submitted 1 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-038.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-038, CERN-EP-2024-001

  9. arXiv:2402.01271  [pdf, other]

    eess.AS cs.SD

    An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec

    Authors: Linping Xu, Jiawei Jiang, Dejun Zhang, Xianjun Xia, Li Chen, Yijian Xiao, Piao Ding, Shenyi Song, Sixing Yin, Ferdous Sohel

    Abstract: Recently, neural networks have proven to be effective in performing speech coding tasks at low bitrates. However, under-utilization of intra-frame correlations and the error of quantizer specifically degrade the reconstructed audio quality. To improve the coding quality, we present an end-to-end neural speech codec, namely CBRC (Convolutional and Bidirectional Recurrent neural Codec). An interleave…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: INTERSPEECH 2023

  10. arXiv:2402.00616  [pdf]

    eess.SP

    Dual-Tap Optical-Digital Feedforward Equalization Enabling High-Speed Optical Transmission in IM/DD Systems

    Authors: Yu Guo, Yangbo Wu, Zhao Yang, Lei Xue, Ning Liang, Yang Ren, Zhengrui Tu, Jia Feng, Qunbi Zhuge

    Abstract: Intensity-modulation and direct-detection (IM/DD) transmission is widely adopted for high-speed optical transmission scenarios due to its cost-effectiveness and simplicity. However, as the data rate increases, the fiber chromatic dispersion (CD) would induce a serious power fading effect, and direct detection could generate inter-symbol interference (ISI). Moreover, the ISI becomes more severe wit…

    Submitted 1 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages, 7 figures, journal

  11. Study of $CP$ violation in $B^0_{(s)} \to D K^{*}(892)^0$ decays with $D \to K π( ππ)$, $ ππ( ππ)$, and $KK$ final states

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1072 additional authors not shown)

    Abstract: A measurement of $CP$-violating observables associated with the interference of $B^0\to D^0 K^{*}(892)^0$ and $B^0\to \bar{D}^0 K^*(892)^0$ decay amplitudes is performed in the $D^0 \to K^{\mp}π^{\pm}(π^+π^-),$ $D^0 \to π^+π^-(π^+π^-)$, and $D^0\to K^+K^-$ final states using data collected by the LHCb experiment corresponding to an integrated luminosity of $9$ $\text{fb}^{-1}$. $CP$-violating obse…

    Submitted 13 May, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-040.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-040, CERN-EP-2024-007

    Journal ref: JHEP 05(2024) 025

  12. arXiv:2401.17896  [pdf, ps, other]

    physics.app-ph quant-ph

    Photosynthetic properties assisted by the quantum entanglement in two adjacent pigment molecules

    Authors: Lu-Xin Xu, Shun-Cai Zhao, Ling-Fang Li

    Abstract: The quantum dynamics of entanglement is widely revealed in photosynthetic light-harvesting complexes. Different from the previous work, we explore the properties of exciton transport and photosynthesis assisted by the quantum entanglement in two adjacent pigment molecules, which are measured by the population dynamics behaviors, the $j$-$V$ characteristics and by the output power via a photosynthe…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 11 pages, 4 figures

    Journal ref: Eur. Phys. J. Plus 137, 683 (2022)

  13. Differentiation of correlated fluctuations in site energy on excitation energy transfer in photosynthetic light-harvesting complexes

    Authors: Lu-Xin Xu, Shun-Cai Zhao, Sheng-Nan Zhu, Lin-Jie Chen

    Abstract: One of the promising approaches to revealing the photosynthetic efficiency of close to one unit is to investigate the quantum regime of excitation energy transfer (EET). The majority of studies, however, have concluded that different pigment molecules contribute equally to EET, rather than differently. We investigate the roles of different site-energies in EET by evaluating the correlated fluctuat…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 14 pages, 7 figures

    Journal ref: Results in Physics, 38, 105597, 2022

  14. Measurements of Normalized Differential Cross Sections of Inclusive $η$ Production in $e^{+}e^{-}$ Annihilation at Energy from 2.0000 to 3.6710 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, D. Anderle, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko , et al. (641 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector operating at the BEPCII storage ring, the cross section of the inclusive process $e^{+}e^{-} \to η+ X$, normalized by the total cross section of $e^{+}e^{-} \to \text{hadrons}$, is measured at eight center-of-mass energy points from 2.0000 GeV to 3.6710 GeV. These are the first measurements with momentum dependence in this energy region. Our me…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures

  15. Delayed response to the photovoltaic performance in a double quantum dot photocell with spatially correlated fluctuation

    Authors: Sheng-Nan Zhu, Shun-Cai Zhao, Lu-Xin Xu, Lin-Jie Chen

    Abstract: A viable strategy for enhancing photovoltaic performance in a double quantum dot (DQD) photocell is to comprehend the underlying quantum physical regime of charge transfer. This work explores the photovoltaic performance dependent spatially correlated fluctuation in a DQD photocell. A suggested DQD photocell model was used to examine the effects of spatially correlated variation on charge transfer…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 16 pages, 5 figures

    Journal ref: Chin. Phys. B 32, 057302 (2023)

  16. arXiv:2401.17540  [pdf, other]

    math.OC

    Good and Fast Row-Sparse ah-Symmetric Reflexive Generalized Inverses

    Authors: Gabriel Ponte, Marcia Fampa, Jon Lee, Luze Xu

    Abstract: We present several algorithms aimed at constructing sparse and structured sparse (row-sparse) generalized inverses, with application to the efficient computation of least-squares solutions, for inconsistent systems of linear equations, in the setting of multiple right-hand sides and a rank-deficient constraint matrix. Leveraging our earlier formulations to minimize the 1- and 2,1- norms of general…

    Submitted 25 June, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  17. arXiv:2401.17196  [pdf, other]

    cs.CL

    Single Word Change is All You Need: Designing Attacks and Defenses for Text Classifiers

    Authors: Lei Xu, Sarah Alnegheimish, Laure Berti-Equille, Alfredo Cuesta-Infante, Kalyan Veeramachaneni

    Abstract: In text classification, creating an adversarial example means subtly perturbing a few words in a sentence without changing its meaning, causing it to be misclassified by a classifier. A concerning observation is that a significant portion of adversarial examples generated by existing methods change only one word. This single-word perturbation vulnerability represents a significant weakness in clas…

    Submitted 30 January, 2024; originally announced January 2024.

  18. arXiv:2401.16687  [pdf, other]

    cs.CR cs.LG

    Revisiting Gradient Pruning: A Dual Realization for Defending against Gradient Attacks

    Authors: Lulu Xue, Shengshan Hu, Ruizhi Zhao, Leo Yu Zhang, Shengqing Hu, Lichao Sun, Dezhong Yao

    Abstract: Collaborative learning (CL) is a distributed learning framework that aims to protect user privacy by allowing users to jointly train a model by sharing their gradient updates only. However, gradient inversion attacks (GIAs), which recover users' training data from shared gradients, impose severe privacy threats to CL. Existing defense methods adopt different techniques, e.g., differential privacy,…

    Submitted 29 January, 2024; originally announced January 2024.

  19. arXiv:2401.16564  [pdf]

    eess.SP

    Data and Physics driven Deep Learning Models for Fast MRI Reconstruction: Fundamentals and Methodologies

    Authors: Jiahao Huang, Yinzhe Wu, Fanwen Wang, Yingying Fang, Yang Nan, Cagan Alkan, Lei Xu, Zhifan Gao, Weiwen Wu, Lei Zhu, Zhaolin Chen, Peter Lally, Neal Bangerter, Kawin Setsompop, Yike Guo, Daniel Rueckert, Ge Wang, Guang Yang

    Abstract: Magnetic Resonance Imaging (MRI) is a pivotal clinical diagnostic tool, yet its extended scanning times often compromise patient comfort and image quality, especially in volumetric, temporal and quantitative scans. This review elucidates recent advances in MRI acceleration via data and physics-driven models, leveraging techniques from algorithm unrolling models, enhancement-based models, and plug-…

    Submitted 29 January, 2024; originally announced January 2024.

  20. arXiv:2401.16522  [pdf, other]

    cs.CV

    Dropout Concrete Autoencoder for Band Selection on HSI Scenes

    Authors: Lei Xu, Mete Ahishali, Moncef Gabbouj

    Abstract: Deep learning-based informative band selection methods on hyperspectral images (HSI) recently have gained intense attention to eliminate spectral correlation and redundancies. However, the existing deep learning-based methods either need additional post-processing strategies to select the descriptive bands or optimize the model indirectly, due to the parameterization inability of discrete variable…

    Submitted 29 January, 2024; originally announced January 2024.

  21. arXiv:2401.16465  [pdf, other]

    cs.CV cs.GR

    DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

    Authors: Kai He, Kaixin Yao, Qixuan Zhang, Jingyi Yu, Lingjie Liu, Lan Xu

    Abstract: Apparel's significant role in human appearance underscores the importance of garment digitalization for digital human creation. Recent advances in 3D content creation are pivotal for digital human creation. Nonetheless, garment generation from text guidance is still nascent. We introduce a text-driven 3D garment generation framework, DressCode, which aims to democratize design for novices and offe…

    Submitted 14 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Project page: https://IHe-KaiI.github.io/DressCode/

  22. arXiv:2401.15687  [pdf, other]

    cs.CV cs.GR

    Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance

    Authors: Qingcheng Zhao, Pengyu Long, Qixuan Zhang, Dafei Qin, Han Liang, Longwen Zhang, Yingliang Zhang, Jingyi Yu, Lan Xu

    Abstract: The synthesis of 3D facial animations from speech has garnered considerable attention. Due to the scarcity of high-quality 4D facial data and well-annotated abundant multi-modality labels, previous methods often suffer from limited realism and a lack of flexible conditioning. We address this challenge through a trilogy. We first introduce Generalized Neural Parametric Facial Asset (GNPFA), an effic…

    Submitted 30 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Project Page: https://sites.google.com/view/media2face

  23. Observation of topological frequency combs

    Authors: Christopher J. Flower, Mahmoud Jalali Mehrabad, Lida Xu, Gregory Moille, Daniel G. Suarez-Forero, Ogulcan Orsel, Gaurav Bahl, Yanne Chembo, Kartik Srinivasan, Sunil Mittal, Mohammad Hafezi

    Abstract: On-chip generation of optical frequency combs using nonlinear ring resonators has opened the route to numerous novel applications of combs that were otherwise limited to mode-locked laser systems. Nevertheless, even after more than a decade of development, on-chip nonlinear combs still predominantly rely on the use of single-ring resonators. Recent theoretical investigations have shown that genera…

    Submitted 8 April, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: 9 pages, 5 figures (SI: 7 pages, 9 figures)

  24. arXiv:2401.15042  [pdf, other]

    cs.CL cs.AI

    PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models

    Authors: Haochen Tan, Zhijiang Guo, Zhan Shi, Lu Xu, Zhili Liu, Yunlong Feng, Xiaoguang Li, Yasheng Wang, Lifeng Shang, Qun Liu, Linqi Song

    Abstract: Large Language Models (LLMs) have succeeded remarkably in understanding long-form contents. However, exploring their capability for generating long-form contents, such as reports and articles, has been relatively unexplored and inadequately assessed by existing benchmarks. The prevalent evaluation methods, which predominantly rely on crowdsourcing, are recognized for their labor-intensive nature a…

    Submitted 4 June, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 main conference

  25. arXiv:2401.14720  [pdf, ps, other]

    hep-ex

    Observation of structures in the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

    Abstract: We present measurements of the Born cross sections for the processes $e^+e^-\rightarrowωχ_{c1}$ and $ωχ_{c2}$ at center-of-mass energies $\sqrt{s}$ from 4.308 to 4.951 GeV. The measurements are performed with data samples corresponding to an integrated luminosity of 11.0 $\rm{fb}^{-1}$ collected with the BESIII detector operating at the BEPCII storage ring. Assuming the $e^+e^-\rightarrowωχ_{c2}$…

    Submitted 24 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 11 pages, 8 figures, with Supplemental Material

  26. arXiv:2401.14711  [pdf, other]

    hep-ex

    Study of $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ at $\sqrt{s}$ from 2.00 to 3.08 GeV at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (608 additional authors not shown)

    Abstract: With the data samples taken at center-of-mass energies from 2.00 to 3.08 GeV with the BESIII detector at the BEPCII collider, a partial wave analysis on the $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ process is performed. The Born cross sections for $e^{+}e^{-}\rightarrowπ^{+}π^{-}π^{0}$ and its intermediate processes $e^{+}e^{-}\rightarrowρπ$ and $ρ(1450)π$ are measured as functions of $\sqrt{s}$. Th…

    Submitted 26 January, 2024; originally announced January 2024.

  27. arXiv:2401.14361  [pdf, other]

    cs.LG cs.PF

    MoE-Infinity: Activation-Aware Expert Offloading for Efficient MoE Serving

    Authors: Leyang Xue, Yao Fu, Zhan Lu, Luo Mai, Mahesh Marina

    Abstract: This paper presents MoE-Infinity, a cost-efficient mixture-of-expert (MoE) serving system that realizes activation-aware expert offloading. MoE-Infinity features sequence-level expert activation tracing, a new approach adept at identifying sparse activations and capturing the temporal locality of MoE inference. By analyzing these traces, MoE-Infinity performs novel activation-aware expert prefetch…

    Submitted 25 January, 2024; originally announced January 2024.

  28. arXiv:2401.14351  [pdf, other]

    cs.LG cs.DC

    ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models

    Authors: Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai

    Abstract: This paper presents ServerlessLLM, a locality-enhanced serverless inference system for Large Language Models (LLMs). ServerlessLLM exploits the substantial capacity and bandwidth of storage and memory devices available on GPU servers, thereby reducing costly remote checkpoint downloads and achieving efficient checkpoint loading. ServerlessLLM achieves this through three main contributions: (i) fas…

    Submitted 25 January, 2024; originally announced January 2024.

  29. arXiv:2401.14194  [pdf, other]

    cs.CL

    Parameter-Efficient Conversational Recommender System as a Language Processing Task

    Authors: Mathieu Ravaut, Hao Zhang, Lu Xu, Aixin Sun, Yong Liu

    Abstract: Conversational recommender systems (CRS) aim to recommend relevant items to users by eliciting user preference through natural language conversation. Prior work often utilizes external knowledge graphs for items' semantic information, a language model for dialogue generation, and a recommendation module for ranking relevant items. This combination of multiple components suffers from a cumbersome t…

    Submitted 24 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 9 pages, 4 figures, 8 tables, EACL 2024 conference, fixed typo

  30. arXiv:2401.14183  [pdf, other]

    cs.AI cs.MA eess.SY math.OC

    Towards Autonomous Supply Chains: Definition, Characteristics, Conceptual Framework, and Autonomy Levels

    Authors: Liming Xu, Stephen Mak, Yaniv Proselkov, Alexandra Brintrup

    Abstract: Recent global disruptions, such as the pandemic and geopolitical conflicts, have profoundly exposed vulnerabilities in traditional supply chains, requiring exploration of more resilient alternatives. Autonomous supply chains (ASCs) have emerged as a potential solution, offering increased visibility, flexibility, and resilience in turbulent trade environments. Despite discussions in industry and ac…

    Submitted 13 October, 2023; originally announced January 2024.

    Comments: This paper includes 20 pages and 8 figures

  31. arXiv:2401.13225  [pdf, ps, other]

    hep-ex

    A New Look at the Scalar Meson $f_0(500)$ via $D^+\to π^+π^-\ell^+ν_\ell$ Decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, X. Cai , et al. (615 additional authors not shown)

    Abstract: Using $2.93~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV, we investigate the semileptonic decays $D^+\to π^+π^- \ell^+ν_\ell$ ($\ell=e$ and $μ$). The $D^+\to f_0(500)μ^+ν_μ$ decay is observed for the first time. By analyzing simultaneously the differential decay rates of $D^+\to f_0(500) μ^+ν_μ$ and…

    Submitted 4 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Supplemental Materials added in this version

    Report number: BAM-00660

  32. arXiv:2401.13062  [pdf]

    cs.RO eess.SY physics.bio-ph

    Force sensing to reconstruct potential energy landscapes for cluttered large obstacle traversal

    Authors: Yaqing Wang, Ling Xu, Chen Li

    Abstract: Visual sensing of environmental geometry allows robots to use artificial potential fields to avoid sparse obstacles. Yet robots must further traverse cluttered large obstacles for applications like search and rescue through rubble and planetary exploration across Martian rocks. Recent studies discovered that to traverse cluttered large obstacles, multi-legged insects and insect-inspired robots mak…

    Submitted 23 January, 2024; originally announced January 2024.

  33. arXiv:2401.12672  [pdf, other]

    cs.AI

    ChatGraph: Chat with Your Graphs

    Authors: Yun Peng, Sen Lin, Qian Chen, Lyu Xu, Xiaojun Ren, Yafei Li, Jianliang Xu

    Abstract: Graph analysis is fundamental in real-world applications. Traditional approaches rely on SPARQL-like languages or clicking-and-dragging interfaces to interact with graph data. However, these methods either require users to possess high programming skills or support only a limited range of graph analysis functionalities. To address the limitations, we propose a large language model (LLM)-based fram…

    Submitted 23 January, 2024; originally announced January 2024.

  34. arXiv:2401.12246  [pdf, other]

    cs.CL cs.LG

    Orion-14B: Open-source Multilingual Large Language Models

    Authors: Du Chen, Yi Huang, Xiaopu Li, Yongqiang Li, Yongqiang Liu, Haihui Pan, Leichao Xu, Dacheng Zhang, Zhipeng Zhang, Kun Han

    Abstract: In this study, we introduce Orion-14B, a collection of multilingual large language models with 14 billion parameters. We utilize a data scheduling approach to train a foundational model on a diverse corpus of 2.5 trillion tokens, sourced from texts in English, Chinese, Japanese, Korean, and other languages. Additionally, we fine-tuned a series of models tailored for conversational applications and…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Authors are alphabetically listed by last names, except the corresponding author who is listed last

  35. arXiv:2401.11819  [pdf, other]

    cs.CL cs.AI

    SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese

    Authors: Liang Xu, Hang Xue, Lei Zhu, Kangkang Zhao

    Abstract: We introduce SuperCLUE-Math6 (SC-Math6), a new benchmark dataset to evaluate the mathematical reasoning abilities of Chinese language models. SC-Math6 is designed as an upgraded Chinese version of the GSM8K dataset with enhanced difficulty, diversity, and application scope. It consists of over 2000 mathematical word problems requiring multi-step reasoning and providing natural language solutions. W…

    Submitted 1 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Dataset revised and finalized, results updated with new model; 8 pages, 7 figures, 4 tables

  36. Prompt and nonprompt $ψ(2S)$ production in $p$Pb collisions at $\sqrt{s_{NN}}=8.16$ TeV

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, B. Adeva, M. Adinolfi, P. Adlarson, H. Afsharnia, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, A. Alfonso Albero, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey , et al. (1079 additional authors not shown)

    Abstract: The production of $ψ(2S)$ mesons in proton-lead collisions at a centre-of-mass energy per nucleon pair of $\sqrt{s_{NN}}=8.16$ TeV is studied with the LHCb detector using data corresponding to an integrated luminosity of 34 nb$^{-1}$. The prompt and nonprompt $ψ(2S)$ production cross-sections and the ratio of the $ψ(2S)$ to $J/ψ$ cross-section are measured as a function of the meson transverse mom…

    Submitted 22 April, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2023-024.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-024, CERN-EP-2023-293

    Journal ref: JHEP 04 (2024) 111

  37. arXiv:2401.11224  [pdf, other]

    eess.IV cs.CV

    Susceptibility of Adversarial Attack on Medical Image Segmentation Models

    Authors: Zhongxuan Wang, Leo Xu

    Abstract: The nature of deep neural networks has given rise to a variety of attacks, but little work has been done to address the effect of adversarial attacks on segmentation models trained on MRI datasets. In light of the grave consequences that such attacks could cause, we explore four models from the U-Net family and examine their responses to the Fast Gradient Sign Method (FGSM) attack. We conduct FGSM…

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 6 pages, 8 figures, presented at 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI) conference

  38. arXiv:2401.11181  [pdf, other]

    cs.DC

    Inference without Interference: Disaggregate LLM Inference for Mixed Downstream Workloads

    Authors: Cunchen Hu, Heyang Huang, Liangliang Xu, Xusheng Chen, Jiang Xu, Shuang Chen, Hao Feng, Chenxi Wang, Sa Wang, Yungang Bao, Ninghui Sun, Yizhou Shan

    Abstract: Transformer-based large language model (LLM) inference serving is now the backbone of many cloud services. LLM inference consists of a prefill phase and a decode phase. However, existing LLM deployment practices often overlook the distinct characteristics of these phases, leading to significant interference. To mitigate interference, our insight is to carefully schedule and group inference request…

    Submitted 20 January, 2024; originally announced January 2024.

  39. arXiv:2401.10934  [pdf, other]

    cs.IR cs.AI

    A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model

    Authors: Hao Yang, Jianxin Yuan, Shuai Yang, Linhe Xu, Shuo Yuan, Yifan Zeng

    Abstract: In online advertising scenarios, sellers often create multiple creatives to provide comprehensive demonstrations, making it essential to present the most appealing design to maximize the Click-Through Rate (CTR). However, sellers generally struggle to consider users' preferences for creative design, leading to relatively lower aesthetics and quantities compared to Artificial Intelligence (AI)-ba…

    Submitted 16 January, 2024; originally announced January 2024.

  40. arXiv:2401.10418  [pdf, other]

    eess.SY

    Hazard resistance-based spatiotemporal risk analysis for distribution network outages during hurricanes

    Authors: Luo Xu, Ning Lin, Dazhi Xi, Kairui Feng, H. Vincent Poor

    Abstract: Blackouts in recent decades show an increasing prevalence of power outages due to extreme weather events such as hurricanes. Precisely assessing the spatiotemporal outages in distribution networks, the most vulnerable part of power systems, is critical to enhance power system resilience. The Sequential Monte Carlo (SMC) simulation method is widely used for spatiotemporal risk analysis of power sys…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 10 pages, 10 figures

  41. arXiv:2401.10070  [pdf, other]

    cs.CL cs.SD eess.AS

    Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks

    Authors: Yichao Du, Zhirui Zhang, Linan Yue, Xu Huang, Yuqing Zhang, Tong Xu, Linli Xu, Enhong Chen

    Abstract: To protect privacy and meet legal regulations, federated learning (FL) has gained significant attention for training speech-to-text (S2T) systems, including automatic speech recognition (ASR) and speech translation (ST). However, the commonly used FL approach (i.e., \textsc{FedAvg}) in S2T tasks typically suffers from extensive communication overhead due to multi-round interactions based on the wh…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: ICASSP 2024

  42. arXiv:2401.10019  [pdf, other]

    cs.CL cs.AI

    R-Judge: Benchmarking Safety Risk Awareness for LLM Agents

    Authors: Tongxin Yuan, Zhiwei He, Lingzhong Dong, Yiming Wang, Ruijie Zhao, Tian Xia, Lizhen Xu, Binglin Zhou, Fangqi Li, Zhuosheng Zhang, Rui Wang, Gongshen Liu

    Abstract: Large language models (LLMs) have exhibited great potential in autonomously completing tasks across real-world applications. Despite this, these LLM agents introduce unexpected safety risks when operating in interactive environments. Instead of centering on LLM-generated content safety in most prior studies, this work addresses the imperative need for benchmarking the behavioral safety of LLM agen…

    Submitted 17 February, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  43. arXiv:2401.09507  [pdf, other]

    cs.LG

    Deep Ensemble Shape Calibration: Multi-Field Post-hoc Calibration in Online Advertising

    Authors: Shuai Yang, Hao Yang, Zhuang Zou, Linhe Xu, Shuo Yuan, Yifan Zeng

    Abstract: In the e-commerce advertising scenario, estimating the true probabilities (known as a calibrated estimate) on Click-Through Rate (CTR) and Conversion Rate (CVR) is critical. Previous research has introduced numerous solutions for addressing the calibration problem. These methods typically involve the training of calibrators using a validation set and subsequently applying these calibrators to corr…

    Submitted 20 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  44. Measurement of Born cross section of $e^{+}e^{-}\rightarrowΣ^{+}\barΣ^{-}$ at center-of-mass energies between 3.510 and 4.951 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (632 additional authors not shown)

    Abstract: Using 24.1 fb$^{-1}$ of $e^{+}e^{-}$ collision data collected with the BESIII detector at the BEPCII collider, the Born cross sections and effective form factors of the $e^{+}e^{-}\rightarrowΣ^{+}\barΣ^{-}$ reaction are measured. The measurements are performed at center-of-mass energies ranging from 3.510 to 4.951 GeV. No significant evidence for the decay of the charmonium(-like) states,…

    Submitted 6 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 22 pages, 3 figures, 3 tables, consistent with the publication in JHEP05(2024)022

    Journal ref: JHEP05(2024)022

  45. arXiv:2401.09225  [pdf, other]

    hep-ex

    First measurements of the absolute branching fraction of $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^+π^-$ and upper limit on $Λ_{c}(2595)^{+}\to Λ^{+}_{c}π^+π^-$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (603 additional authors not shown)

    Abstract: The absolute branching fraction of the decay $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^+π^-$ is measured for the first time to be $(50.7 \pm 5.0_{\rm{stat.}} \pm 4.9_{\rm{syst.}} )\%$ with 368.48 pb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector at the center-of-mass energies of $\sqrt{s} = 4.918$ and $4.950$ GeV. This result is lower than the naive prediction of 67\%, obtained from isosp…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 8 pages, 6 figures

  46. Improved measurements of the Dalitz decays $η/η'\rightarrowγe^{+}e^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (618 additional authors not shown)

    Abstract: Based on a data sample of 10 billion $J/ψ$ events collected with the BESIII detector, improved measurements of the Dalitz decays $η/η'\rightarrowγe^+e^-$ are performed, where the $η$ and $η'$ are produced through the radiative decays $J/ψ\rightarrowγη/η'$. The branching fractions of $η\rightarrowγe^+e^-$ and $η'\rightarrowγe^+e^-$ are measured to be $(7.07 \pm 0.05 \pm 0.23)\times10^{-3}$ and…

    Submitted 5 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Journal ref: Phys.Rev.D 109 (2024) 7, 072001

  47. arXiv:2401.09013  [pdf, other]

    cs.NI eess.SP

    An Improved Virtual Force Approach for UAV Deployment and Resource Allocation in Emergency Communications

    Authors: Hongying Guo, Li Wang, Ruoguang Li, Luyang Hou, Lianming Xu, Aiguo Fei

    Abstract: In this paper, we consider an unmanned aerial vehicle (UAV)-enabled emergency communication system, which establishes temporary communication links with user equipment (UEs) in a typical disaster environment with mountainous forest and obstacles. Towards this end, a joint deployment, power allocation, and user association optimization problem is formulated to maximize the total transmission rate,…

    Submitted 17 January, 2024; originally announced January 2024.

  48. arXiv:2401.09012  [pdf, other]

    hep-ex nucl-ex

    First study of antihyperon-nucleon scattering $\barΛp\rightarrow\barΛp$ and measurement of $Λp\rightarrowΛp$ cross section

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: Using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the processes $Λp\rightarrowΛp$ and $\barΛp\rightarrow\barΛp$ are studied, where the $Λ/\barΛ$ baryons are produced in the process $J/ψ\rightarrowΛ\barΛ$ and the protons are the hydrogen nuclei in the cooling oil of the beam pipe. Clear signals are observed for the two reactions. The cr…

    Submitted 18 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: 9 pages, 5 figures

  49. arXiv:2401.08252  [pdf, other]

    hep-ex

    Observation of $ψ(3686) \to Ω^- K^+ \barΞ^0 $+c.c

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (630 additional authors not shown)

    Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected with the BESIII detector at BEPCII, the decay of $ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.$ is observed for the first time. The branching fraction of this decay is measured to be $\mathcal{B}_{ψ(3686) \to Ω^- K^+ \barΞ^0 +c.c.}=(2.78 \pm 0.40 \pm 0.18 ) \times 10^{-6}$, where the first uncertainty is statistical and the second is systemati…

    Submitted 15 April, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  50. arXiv:2401.07256  [pdf, other]

    cs.MA

    Emergency Localization for Mobile Ground Users: An Adaptive UAV Trajectory Planning Method

    Authors: Zhihao Zhu, Jiafan He, Luyang Hou, Lianming Xu, Wendi Zhu, Li Wang

    Abstract: In emergency search and rescue scenarios, the quick location of trapped people is essential. However, disasters can render the Global Positioning System (GPS) unusable. Unmanned aerial vehicles (UAVs) with localization devices can serve as mobile anchors due to their agility and high line-of-sight (LoS) probability. Nonetheless, the number of available UAVs during the initial stages of disaster re…

    Submitted 14 January, 2024; originally announced January 2024.