Showing 101–150 of 8,503 results for author: Zhang, S

  1. arXiv:2406.09803  [pdf, ps, other]

    nucl-th

    Triaxial shape of the one-proton emitter $^{149}$Lu

    Authors: Qi Lu, Kai-Yuan Zhang, Shi-Sheng Zhang

    Abstract: We revisit the proton emitter $^{149}$Lu utilizing the recently developed triaxial relativistic Hartree-Bogoliubov theory in continuum (TRHBc). By incorporating the microscopic nuclear structure properties from the TRHBc theory into the WKB approximation, we successfully reproduce the measured proton-emission half-life of $^{149}$Lu within experimental uncertainties. A triaxial ground state charac…

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.09775  [pdf, ps, other]

    math.NA

    A semi-implicit stochastic multiscale method for radiative heat transfer problem

    Authors: Shan Zhang, Yajun Wang, Xiaofei Guan

    Abstract: In this paper, we propose and analyze a new semi-implicit stochastic multiscale method for the radiative heat transfer problem with additive noise fluctuation in composite materials. In the proposed method, the strong nonlinearity term induced by heat radiation is first approximated, by a semi-implicit predictor-corrected numerical scheme, for each fixed time step, resulting in a spatially random…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 30 pages

    MSC Class: 65N12; 65N15; 80M10

  3. arXiv:2406.09766  [pdf, other]

    nucl-th

    Exploration on $1n$ halo nucleus $^{19}$C from D-RHFB structure to reaction observables

    Authors: Jia-Lin An, Qi Lu, Wen Hui Long, Shi-Sheng Zhang

    Abstract: We utilize the axially deformed relativistic Hartree-Fock-Bogoliubov (D-RHFB) model to describe the structure of neutron-rich carbon isotopes, taking into account the continuum, pairing correlations, tensor force and their interplay. In this scheme, one- and two-neutron separation energies of neutron-rich carbon isotopes agree well with measured data, as well as the spin and parity $J^π=1/2^+$ for…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 7 pages, 4 figures

  4. arXiv:2406.09683  [pdf, other]

    astro-ph.GA

    Interstellar Nitrogen Isotope Ratios: Measurements on tracers of C$^{14}$N and C$^{15}$N

    Authors: J. L. Chen, J. S. Zhang, C. Henkel, Y. T. Yan, H. Z. Yu, Y. X. Wang, Y. P. Zou, J. Y. Zhao, X. Y. Wang

    Abstract: The nitrogen isotope ratio 14N/15N is a powerful tool to trace Galactic stellar nucleosynthesis and constraining Galactic chemical evolution. Previous observations have found lower 14N/15N ratios in the Galactic center and higher values in the Galactic disk. This is consistent with the inside-out formation scenario of our Milky Way. However, previous studies mostly utilized double isotope ratios a…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 34 pages, 9 figures, 6 tables

    Journal ref: The Astrophysical Journal (2024)

  5. arXiv:2406.09612  [pdf, other]

    cs.AI cs.LG physics.chem-ph

    Automated Molecular Concept Generation and Labeling with Large Language Models

    Authors: Shichang Zhang, Botao Xia, Zimin Zhang, Qianli Wu, Fang Sun, Ziniu Hu, Yizhou Sun

    Abstract: Artificial intelligence (AI) is significantly transforming scientific research. Explainable AI methods, such as concept-based models (CMs), are promising for driving new scientific discoveries because they make predictions based on meaningful concepts and offer insights into the prediction process. In molecular science, however, explainable CMs are not as common compared to black-box models like G…

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2406.09589  [pdf, other]

    eess.AS

    Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment

    Authors: Yiwen Shao, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Daniel Povey, Sanjeev Khudanpur

    Abstract: In the field of multi-channel, multi-speaker Automatic Speech Recognition (ASR), the task of discerning and accurately transcribing a target speaker's speech within background noise remains a formidable challenge. Traditional approaches often rely on microphone array configurations and the information of the target speaker's location or voiceprint. This study introduces the Solo Spatial Feature (S…

    Submitted 17 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted for presentation at Interspeech 2024

  7. arXiv:2406.09475  [pdf, other]

    hep-ex

    Search for $X(1870)$ via the decay $J/ψ\to ωK^+ K^-η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (644 additional authors not shown)

    Abstract: Using a sample of $(10087\pm 44)\times10^{6}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider, we search for the decay $X(1870)\to K^+ K^-η$ via the $J/ψ\to ωK^+ K^- η$ process for the first time. No significant $X(1870)$ signal is observed. The upper limit on the branching fraction of the decay $ J/ψ\to ωX(1870) \toωK^+ K^- η$ is determined to be $9.55\times 10^{-7}$ at the…

    Submitted 13 June, 2024; originally announced June 2024.

  8. arXiv:2406.09444  [pdf, other]

    eess.AS cs.CL cs.SD

    GenDistiller: Distilling Pre-trained Language Models based on an Autoregressive Generative Model

    Authors: Yingying Gao, Shilei Zhang, Chao Deng, Junlan Feng

    Abstract: Pre-trained speech language models such as HuBERT and WavLM leverage unlabeled speech data for self-supervised learning and offer powerful representations for numerous downstream tasks. Despite the success of these models, their high requirements for memory and computing resource hinder their application on resource restricted devices. Therefore, this paper introduces GenDistiller, a novel knowled…

    Submitted 21 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.13418

  9. arXiv:2406.09411  [pdf, other]

    cs.CV cs.AI cs.CL

    MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

    Authors: Fei Wang, Xingyu Fu, James Y. Huang, Zekun Li, Qin Liu, Xiaogeng Liu, Mingyu Derek Ma, Nan Xu, Wenxuan Zhou, Kai Zhang, Tianyi Lorena Yan, Wenjie Jacky Mo, Hsiang-Hui Liu, Pan Lu, Chunyuan Li, Chaowei Xiao, Kai-Wei Chang, Dan Roth, Sheng Zhang, Hoifung Poon, Muhao Chen

    Abstract: We introduce MuirBench, a comprehensive benchmark that focuses on robust multi-image understanding capabilities of multimodal LLMs. MuirBench consists of 12 diverse multi-image tasks (e.g., scene understanding, ordering) that involve 10 categories of multi-image relations (e.g., multiview, temporal relations). Comprising 11,264 images and 2,600 multiple-choice questions, MuirBench is created in a…

    Submitted 13 June, 2024; originally announced June 2024.

  10. arXiv:2406.09071  [pdf]

    cs.LG

    FlamePINN-1D: Physics-informed neural networks to solve forward and inverse problems of 1D laminar flames

    Authors: Jiahao Wu, Su Zhang, Yuxin Wu, Guihua Zhang, Xin Li, Hai Zhang

    Abstract: Given the existence of various forward and inverse problems in combustion studies and applications that necessitate distinct methods for resolution, a framework to solve them in a unified way is critically needed. A promising approach is the integration of machine learning methods with governing equations of combustion systems, which exhibits superior generality and few-shot learning ability compa…

    Submitted 7 June, 2024; originally announced June 2024.

  11. Blind Super-Resolution via Meta-learning and Markov Chain Monte Carlo Simulation

    Authors: Jingyuan Xia, Zhixiong Yang, Shengxi Li, Shuanghui Zhang, Yaowen Fu, Deniz Gündüz, Xiang Li

    Abstract: Learning-based approaches have witnessed great successes in blind single image super-resolution (SISR) tasks, however, handcrafted kernel priors and learning based kernel priors are typically required. In this paper, we propose a Meta-learning and Markov Chain Monte Carlo (MCMC) based SISR approach to learn kernel priors from organized randomness. In concrete, a lightweight network is adopted as k…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for publication in IEEE Transactions on Pattern Analysis and Machine Intelligence (2024)

  12. arXiv:2406.08855  [pdf, other]

    cs.RO

    Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Graph Neural Network and Numerical Optimization

    Authors: Sumin Zhang, Kuo Li, Rui He, Zhiwei Meng, Yupeng Chang, Xiaosong Jin, Ri Bai

    Abstract: In unstructured environments, obstacles are diverse and lack lane markings, making trajectory planning for intelligent vehicles a challenging task. Traditional trajectory planning methods typically involve multiple stages, including path planning, speed planning, and trajectory optimization. These methods require the manual design of numerous parameters for each stage, resulting in significant wor…

    Submitted 13 June, 2024; originally announced June 2024.

  13. arXiv:2406.08804  [pdf, other]

    cs.DC cs.AI cs.IR

    DIET: Customized Slimming for Incompatible Networks in Sequential Recommendation

    Authors: Kairui Fu, Shengyu Zhang, Zheqi Lv, Jingyuan Chen, Jiwei Li

    Abstract: Due to the continuously improving capabilities of mobile edges, recommender systems start to deploy models on edges to alleviate network congestion caused by frequent mobile requests. Several studies have leveraged the proximity of edge-side to real-time data, fine-tuning them to create edge-specific models. Despite their significant progress, these methods require substantial on-edge computationa…

    Submitted 15 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024

  14. arXiv:2406.08778  [pdf, ps, other]

    math.CV math.AP math.DG

    Regularizing property of the twisted conical Kähler-Ricci flow

    Authors: Jiawei Liu, Shiyu Zhang, Xi Zhang

    Abstract: In this paper, we show the regularity and uniqueness of the twisted conical Kähler-Ricci flow running from a positive closed current with zero Lelong number, which extends the regularizing property of the smooth twisted Kähler-Ricci flow, known as Guedj-Zeriahi's existence theorem and Di Nezza-Lu's uniqueness theorem, to the conical singularity case.

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: All comments are welcome! arXiv admin note: substantial text overlap with arXiv:2406.04590

  15. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  16. arXiv:2406.08225  [pdf, ps, other]

    hep-ex

    Observation of $η_{c}$(1S, 2S) and $χ_{cJ}$ decays to 2$(π^{+}π^{-})η$ via $ψ$(3686) radiative transitions

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (636 additional authors not shown)

    Abstract: Based on $2.7 \times 10^9~ψ(3686)$ decays collected with the BESIII detector, the radiative decay $ψ(3686)\to\gamma2(π^{+}π^{-})η$ is investigated to measure properties of S- and P-wave charmonium states. The branching fraction of the decay $η_{c}(1S) \to 2(π^{+}π^{-})η$, which is found to have a strong dependence on the interference pattern between $η_c(1S)$ and non-$η_c(1S)$ processes, is measur…

    Submitted 12 June, 2024; originally announced June 2024.

  17. arXiv:2406.08116  [pdf, other]

    cs.CL cs.AI

    Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling

    Authors: Zile Qiao, Wei Ye, Yong Jiang, Tong Mo, Pengjun Xie, Weiping Li, Fei Huang, Shikun Zhang

    Abstract: Retrieval-augmented language models (RALMs) have recently shown great potential in mitigating the limitations of implicit knowledge in LLMs, such as untimely updating of the latest expertise and unreliable retention of long-tail knowledge. However, since the external knowledge base, as well as the retriever, can not guarantee reliability, potentially leading to the knowledge retrieved not being he…

    Submitted 12 June, 2024; originally announced June 2024.

  18. arXiv:2406.08010  [pdf, other]

    cs.IR cs.LG

    A Self-boosted Framework for Calibrated Ranking

    Authors: Shunyu Zhang, Hu Liu, Wentian Bao, Enyun Yu, Yang Song

    Abstract: Scale-calibrated ranking systems are ubiquitous in real-world applications nowadays, which pursue accurate ranking quality and calibrated probabilistic predictions simultaneously. For instance, in the advertising ranking system, the predicted click-through rate (CTR) is utilized for ranking and required to be calibrated for the downstream cost-per-click ads bidding. Recently, multi-objective based…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: KDD 2024

  19. arXiv:2406.07956  [pdf, other]

    nucl-th

    Ab initio calculations with a new local chiral N3LO nucleon-nucleon force

    Authors: P. Y. Wang, J. G. Li, S. Zhang, Q. Yuan, M. R. Xie, W. Zuo

    Abstract: Ab initio calculations have achieved remarkable success in nuclear structure studies. Numerous works highlight the pivotal role of three-body forces in nuclear ab initio calculations. Concurrently, efforts have been made to replicate these calculations using only realistic nucleon-nucleon (NN) interactions. A novel local chiral next-to-next-to-next-to-leading order (N3LO) NN interaction, distinct…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures

  20. arXiv:2406.07801  [pdf, other]

    cs.CL cs.SD eess.AS

    PolySpeech: Exploring Unified Multitask Speech Models for Competitiveness with Single-task Models

    Authors: Runyan Yang, Huibao Yang, Xiqing Zhang, Tiantian Ye, Ying Liu, Yingying Gao, Shilei Zhang, Chao Deng, Junlan Feng

    Abstract: Recently, there have been attempts to integrate various speech processing tasks into a unified model. However, few previous works directly demonstrated that joint optimization of diverse tasks in multitask speech models has positive influence on the performance of individual tasks. In this paper we present a multitask speech model -- PolySpeech, which supports speech recognition, speech synthesis,…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures

  21. arXiv:2406.07625  [pdf, other]

    cond-mat.str-el cond-mat.quant-gas quant-ph

    Emergent Universal Quench Dynamics in Randomly Interacting Spin Models

    Authors: Yuchen Li, Tian-Gang Zhou, Ze Wu, Pai Peng, Shengyu Zhang, Riqiang Fu, Ren Zhang, Wei Zheng, Pengfei Zhang, Hui Zhai, Xinhua Peng, Jiangfeng Du

    Abstract: Universality often emerges in low-energy equilibrium physics of quantum many-body systems, despite their microscopic complexity and variety. Recently, there has been a growing interest in studying far-from-equilibrium dynamics of quantum many-body systems. Such dynamics usually involves highly excited states beyond the traditional low-energy theory description. Whether universal behaviors can also…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures; Supplementary Information 26 pages, 11 figures, 2 tables

  22. arXiv:2406.07590  [pdf, other]

    cs.LG cs.AI

    StreamPrompt: Learnable Prompt-guided Data Selection for Efficient Stream Learning

    Authors: Tongjun Shi, Shuhao Zhang

    Abstract: Stream Learning (SL) requires models to rapidly adapt to continuous data streams, setting it apart from traditional Continual Learning (CL). Recent SL methods emphasize efficiency by selecting data subsets for training, but they often struggle due to their reliance on static, rule-based selection algorithms that cannot effectively adapt to the changing importance of data. In this work, we introduc…

    Submitted 11 June, 2024; originally announced June 2024.

  23. arXiv:2406.07547  [pdf, other]

    cs.CV

    Zero-shot Image Editing with Reference Imitation

    Authors: Xi Chen, Yutong Feng, Mengting Chen, Yiyang Wang, Shilong Zhang, Yu Liu, Yujun Shen, Hengshuang Zhao

    Abstract: Image editing serves as a practical yet challenging task considering the diverse demands from users, where one of the hardest parts is to precisely describe how the edited image should look like. In this work, we present a new form of editing, termed imitative editing, to help users exercise their creativity more conveniently. Concretely, to edit an image region of interest, users are free to dire…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: https://xavierchen34.github.io/MimicBrush-Page

  24. arXiv:2406.07337  [pdf, other]

    cs.LG

    Transferring Knowledge from Large Foundation Models to Small Downstream Models

    Authors: Shikai Qiu, Boran Han, Danielle C. Maddix, Shuai Zhang, Yuyang Wang, Andrew Gordon Wilson

    Abstract: How do we transfer the relevant knowledge from ever larger foundation models into small, task-specific downstream models that can run at much lower costs? Standard transfer learning using pre-trained weights as the initialization transfers limited information and commits us to often massive pre-trained architectures. This procedure also precludes combining multiple pre-trained models that learn co…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ICML 2024. Code available at https://github.com/amazon-science/adaptive-feature-transfer

  25. arXiv:2406.07289  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?

    Authors: Qingkai Fang, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng

    Abstract: Recently proposed two-pass direct speech-to-speech translation (S2ST) models decompose the task into speech-to-text translation (S2TT) and text-to-speech (TTS) within an end-to-end model, yielding promising results. However, the training of these models still relies on parallel speech data, which is extremely challenging to collect. In contrast, S2TT and TTS have accumulated a large amount of data…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL 2024 main conference. Project Page: https://ictnlp.github.io/ComSpeech-Site/

    ACM Class: I.2.7

  26. arXiv:2406.07230  [pdf, other]

    cs.CV cs.AI

    Needle In A Multimodal Haystack

    Authors: Weiyun Wang, Shuibo Zhang, Yiming Ren, Yuchen Duan, Tiantong Li, Shuo Liu, Mengkang Hu, Zhe Chen, Kaipeng Zhang, Lewei Lu, Xizhou Zhu, Ping Luo, Yu Qiao, Jifeng Dai, Wenqi Shao, Wenhai Wang

    Abstract: With the rapid advancement of multimodal large language models (MLLMs), their evaluation has become increasingly comprehensive. However, understanding long multimodal content, as a foundational ability for real-world applications, remains underexplored. In this work, we present Needle In A Multimodal Haystack (MM-NIAH), the first benchmark specifically designed to systematically evaluate the capab…

    Submitted 11 June, 2024; originally announced June 2024.

  27. arXiv:2406.07225  [pdf, other]

    quant-ph

    A generic and robust quantum agent inspired by deep meta-reinforcement learning

    Authors: Zibo Miao, Shihui Zhang, Yu Pan, Sibo Tao, Yu Chen

    Abstract: Deep reinforcement learning (deep RL) has enabled human- or superhuman- performances in various applications. Recently, deep RL has also been adopted to improve the performance of quantum control. However, a large volume of data is typically required to train the neural network in deep RL, making it inefficient compared with the traditional optimal quantum control method. Here, we thus develop a n…

    Submitted 11 June, 2024; originally announced June 2024.

  28. arXiv:2406.07085  [pdf, other]

    cs.CV

    CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation

    Authors: Zhongzhen Huang, Yankai Jiang, Rongzhao Zhang, Shaoting Zhang, Xiaofan Zhang

    Abstract: Existing promptable segmentation methods in the medical imaging field primarily consider either textual or visual prompts to segment relevant objects, yet they often fall short when addressing anomalies in medical images, like tumors, which may vary greatly in shape, size, and appearance. Recognizing the complexity of medical scenarios and the limitations of textual or visual prompts, we propose a…

    Submitted 11 June, 2024; originally announced June 2024.

  29. arXiv:2406.07067  [pdf, other]

    cs.IR cs.AI

    TIM: Temporal Interaction Model in Notification System

    Authors: Huxiao Ji, Haitao Yang, Linchuan Li, Shunyu Zhang, Cunyi Zhang, Xuanping Li, Wenwu Ou

    Abstract: Modern mobile applications heavily rely on the notification system to acquire daily active users and enhance user engagement. Being able to proactively reach users, the system has to decide when to send notifications to users. Although many researchers have studied optimizing the timing of sending notifications, they only utilized users' contextual features, without modeling users' behavior patter…

    Submitted 11 June, 2024; originally announced June 2024.

  30. arXiv:2406.06937  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation

    Authors: Zhengrui Ma, Qingkai Fang, Shaolei Zhang, Shoutao Guo, Yang Feng, Min Zhang

    Abstract: Simultaneous translation models play a crucial role in facilitating communication. However, existing research primarily focuses on text-to-text or speech-to-text models, necessitating additional cascade components to achieve speech-to-speech translation. These pipeline methods suffer from error propagation and accumulate delays in each cascade component, resulting in reduced synchronization betwee…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL 2024; Codes and demos are at https://github.com/ictnlp/NAST-S2x

  31. arXiv:2406.06910  [pdf, other]

    cs.CL

    Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models

    Authors: Shoutao Guo, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng

    Abstract: Simultaneous Machine Translation (SiMT) generates target translations while reading the source sentence. It relies on a policy to determine the optimal timing for reading sentences and generating translations. Existing SiMT methods generally adopt the traditional Transformer architecture, which concurrently determines the policy and generates translations. While they excel at determining policies,…

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 8 figures, 7 tables. v2 of arXiv:2402.13036

  32. arXiv:2406.06794  [pdf, other]

    math-ph math.SP

    Landscape estimates of the integrated density of states for Jacobi operators on graphs

    Authors: Laura Shou, Wei Wang, Shiwen Zhang

    Abstract: We show the integrated density of states for a variety of Jacobi operators on graphs, such as the Anderson model and random hopping models on graphs with Gaussian heat kernel bounds, can be estimated from above and below in terms of the localization landscape counting function. Specific examples of these graphs include stacked and decorated lattices, graphs corresponding to band matrices, and aper…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 51 pages, 9 figures

  33. arXiv:2406.06619  [pdf, other]

    eess.AS cs.AI cs.CL

    LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR

    Authors: Zheshu Song, Jianheng Zhuo, Yifan Yang, Ziyang Ma, Shixiong Zhang, Xie Chen

    Abstract: Recent years have witnessed significant progress in multilingual automatic speech recognition (ASR), driven by the emergence of end-to-end (E2E) models and the scaling of multilingual datasets. Despite that, two main challenges persist in multilingual ASR: language interference and the incorporation of new languages without degrading the performance of the existing ones. This paper proposes LoRA-W…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, conference

  34. arXiv:2406.06573  [pdf, other]

    cs.CL cs.LG

    MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering

    Authors: Robert Osazuwa Ness, Katie Matton, Hayden Helm, Sheng Zhang, Junaid Bajwa, Carey E. Priebe, Eric Horvitz

    Abstract: Large language models (LLM) have achieved impressive performance on medical question-answering benchmarks. However, high benchmark accuracy does not imply that the performance generalizes to real-world clinical settings. Medical question-answering benchmarks rely on assumptions consistent with quantifying LLM performance but that may not hold in the open world of the clinic. Yet LLMs learn broad k…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures, 2 algorithms, appendix

    ACM Class: I.2.7

  35. arXiv:2406.06525  [pdf, other]

    cs.CV

    Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

    Authors: Peize Sun, Yi Jiang, Shoufa Chen, Shilong Zhang, Bingyue Peng, Ping Luo, Zehuan Yuan

    Abstract: We introduce LlamaGen, a new family of image generation models that apply original "next-token prediction" paradigm of large language models to visual generation domain. It is an affirmative answer to whether vanilla autoregressive models, e.g., Llama, without inductive biases on visual signals can achieve state-of-the-art image generation performance if scaling properly. We reexamine design spa…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Codes and models: https://github.com/FoundationVision/LlamaGen

  36. arXiv:2406.06433  [pdf, other]

    cs.LG cs.AI

    DISCO: An End-to-End Bandit Framework for Personalised Discount Allocation

    Authors: Jason Shuo Zhang, Benjamin Howson, Panayiota Savva, Eleanor Loh

    Abstract: Personalised discount codes provide a powerful mechanism for managing customer relationships and operational spend in e-commerce. Bandits are well suited for this product area, given the partial information nature of the problem, as well as the need for adaptation to the changing business environment. Here, we introduce DISCO, an end-to-end contextual bandit framework for personalised discount cod…

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted at ECML/PKDD 2024

  37. arXiv:2406.06118  [pdf, other]

    hep-ex

    Strong and weak $CP$ tests in sequential decays of polarized $Σ^0$ hyperons

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (644 additional authors not shown)

    Abstract: The $J/ψ, ψ(3686) \to Σ^0 \barΣ^{0}$ processes and subsequent decays are studied using the world's largest $J/ψ$ and $ψ(3686)$ data samples collected with the BESIII detector. The strong-$CP$ symmetry is tested in the decays of the $Σ^0$ hyperons for the first time by measuring the decay parameters, $α_{Σ^0} = -0.0017 \pm 0.0021 \pm 0.0018$ and $\barα_{Σ^0} = 0.0021 \pm 0.0020 \pm 0.0022$. The wea…

    Submitted 10 June, 2024; originally announced June 2024.

  38. arXiv:2406.06063  [pdf, other]

    physics.comp-ph quant-ph

    Enabling Large-Scale and High-Precision Fluid Simulations on Near-Term Quantum Computers

    Authors: Zhao-Yun Chen, Teng-Yang Ma, Chuang-Chao Ye, Liang Xu, Ming-Yang Tan, Xi-Ning Zhuang, Xiao-Fan Xu, Yun-Jie Wang, Tai-Ping Sun, Yong Chen, Lei Du, Liang-Liang Guo, Hai-Feng Zhang, Hao-Ran Tao, Tian-Le Wang, Xiao-Yan Yang, Ze-An Zhao, Peng Wang, Sheng Zhang, Chi Zhang, Ren-Ze Zhao, Zhi-Long Jia, Wei-Cheng Kong, Meng-Han Dou, Jun-Chao Wang, et al. (7 additional authors not shown)

    Abstract: Quantum computational fluid dynamics (QCFD) offers a promising alternative to classical computational fluid dynamics (CFD) by leveraging quantum algorithms for higher efficiency. This paper introduces a comprehensive QCFD method, including an iterative method "Iterative-QLS" that suppresses error in quantum linear solver, and a subspace method to scale the solution to a larger size. We implement o…

    Submitted 19 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 31 pages, 10 figures

  39. arXiv:2406.05839  [pdf, other]

    eess.AS cs.AI

    MaLa-ASR: Multimedia-Assisted LLM-Based ASR

    Authors: Guanrou Yang, Ziyang Ma, Fan Yu, Zhifu Gao, Shiliang Zhang, Xie Chen

    Abstract: As more and more information-rich data like video become available, utilizing multi-modal auxiliary information to enhance audio tasks has sparked widespread research interest. The recent surge in research on LLM-based audio models provides fresh perspectives for tackling audio tasks. Given that LLM can flexibly ingest multiple inputs, we propose MaLa-ASR, an LLM-based ASR model that can integrate…

    Submitted 13 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  40. arXiv:2406.05827  [pdf, ps, other]

    hep-ex

    Measurement of the integrated luminosity of the data collected at 3.773 GeV by BESIII from 2021 to 2024

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: We present a measurement of the integrated luminosity of $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at a center-of-mass energy of $E_{\rm cm} = 3.773$~GeV. The integrated luminosities of the data sets taken from December 2021 to June 2022, from November 2022 to June 2023, and from October 2023 to February 2024 are determined to be $4.995 \pm 0.019$~fb$^{-1}$,…

    Submitted 9 June, 2024; originally announced June 2024.

  41. arXiv:2406.05783  [pdf, other]

    astro-ph.IM

    Response of the first POLAR-2 Prototype to Polarized Beams

    Authors: Merlin Kole, Nicolas de Angelis, Ana Bacelj, Franck Cadoux, Agnieszka Elwertowska, Johannes Hulsman, Hancheng Li, Grzegorz Łubian, Tomasz Kowalski, Gilles Koziol, Agnieszka Pollo, Nicolas Produit, Dominik Rybka, Adrien Stil, Jianchao Sun, Xin Wu, Kacper Zezuliński, Shuang-Nan Zhang

    Abstract: POLAR-2 is a dedicated gamma-ray polarimeter currently foreseen to be launched towards the China Space Station around 2027. The design of the detector is based on the legacy of its predecessor mission POLAR which was launched in 2016. POLAR-2 aims to measure the polarization of the Gamma-ray Burst prompt emission within the 30-800 keV energy range. Thanks to its high sensitivity to gamma-ray polar…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in JINST

  42. arXiv:2406.05658  [pdf, other]

    cs.CV cs.AI

    Visual Prompt Tuning in Null Space for Continual Learning

    Authors: Yue Lu, Shizhou Zhang, De Cheng, Yinghui Xing, Nannan Wang, Peng Wang, Yanning Zhang

    Abstract: Existing prompt-tuning methods have demonstrated impressive performance in continual learning (CL) by selecting and updating relevant prompts in vision-transformer models. In contrast, this paper aims to learn each task by tuning the prompts in the direction orthogonal to the subspace spanned by previous tasks' features, so as to ensure no interference on tasks that have been learned to…

    Submitted 10 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 20 pages, 10 figures

  43. arXiv:2406.05412  [pdf]

    cs.CV

    Select-Mosaic: Data Augmentation Method for Dense Small Object Scenes

    Authors: Hao Zhang, Shuaijie Zhang, Renbin Zou

    Abstract: Data augmentation refers to the process of applying a series of transformations or expansions to original data to generate new samples, thereby increasing the diversity and quantity of the data and improving the performance and robustness of models. As a common data augmentation method, the Mosaic data augmentation technique stitches multiple images together to increase the diversity and comp…

    Submitted 8 June, 2024; originally announced June 2024.

  44. arXiv:2406.05106  [pdf, ps, other]

    physics.atom-ph

    Vapor cell Rydberg atom electrometry with time-separated fields

    Authors: Michael V. Romalis, Joe Wiedemann, Shaobo Zhang, Nezih Dural

    Abstract: Rydberg atoms have large transition electric dipole moments and high sensitivity to electric fields. We describe a new method for microwave field sensing in a vapor cell consisting of separate excitation, quantum evolution between two Rydberg levels in the dark and state-dependent detection with probe laser transmission. Using microwave pulse techniques we study homogeneous and inhomogeneous relax…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures

  45. arXiv:2406.04785  [pdf, other]

    cs.DC

    Enabling Efficient Batch Serving for LMaaS via Generation Length Prediction

    Authors: Ke Cheng, Wen Hu, Zhi Wang, Peng Du, Jianguo Li, Sheng Zhang

    Abstract: Nowadays, large language models (LLMs) are published as a service and can be accessed by various applications via APIs, also known as language-model-as-a-service (LMaaS). Without knowing the generation length of requests, existing serving systems serve requests in a first-come, first-served (FCFS) manner with a fixed batch size, which leads to two problems that affect batch serving efficiency. Fir…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 12 pages, 14 figures

  46. arXiv:2406.04675  [pdf, other]

    cs.CV

    OVMR: Open-Vocabulary Recognition with Multi-Modal References

    Authors: Zehong Ma, Shiliang Zhang, Longhui Wei, Qi Tian

    Abstract: The challenge of open-vocabulary recognition is that the model has no clue about the new categories it is applied to. Existing works have proposed different methods to embed category cues into the model, e.g., through few-shot fine-tuning, providing category names or textual descriptions to Vision-Language Models. Fine-tuning is time-consuming and degrades the generalization capability. Textual descriptio…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: CVPR2024

  47. arXiv:2406.04659  [pdf, other]

    cs.CV

    LocLLM: Exploiting Generalizable Human Keypoint Localization via Large Language Model

    Authors: Dongkai Wang, Shiyu Xuan, Shiliang Zhang

    Abstract: The capacity of existing human keypoint localization models is limited by keypoint priors provided by the training data. To alleviate this restriction and pursue a more general model, this work studies keypoint localization from a different perspective by reasoning locations based on keypoint clues in text descriptions. We propose LocLLM, the first Large-Language Model (LLM) based keypoint localizat…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: CVPR2024

  48. arXiv:2406.04339  [pdf, other]

    cs.CV

    RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation

    Authors: Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Lily Lee, Kaichen Zhou, Pengju An, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang

    Abstract: A fundamental objective in robot manipulation is to enable models to comprehend visual scenes and execute actions. Although existing robot Multimodal Large Language Models (MLLMs) can handle a range of basic tasks, they still face challenges in two areas: 1) inadequate reasoning ability to tackle complex tasks, and 2) high computational costs for MLLM fine-tuning and inference. The recently propos…

    Submitted 6 June, 2024; originally announced June 2024.

  49. arXiv:2406.04338  [pdf, other]

    cs.CV cs.AI cs.GR

    Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

    Authors: Fangfu Liu, Hanyang Wang, Shunyu Yao, Shengjun Zhang, Jie Zhou, Yueqi Duan

    Abstract: In recent years, there has been rapid development in 3D generation models, opening up new possibilities for applications such as simulating the dynamic movements of 3D objects and customizing their behaviors. However, current 3D generative models tend to focus only on surface features such as color and shape, neglecting the inherent physical properties that govern the behavior of objects in the re…

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Project page: https://liuff19.github.io/Physics3D

  50. arXiv:2406.04133  [pdf]

    econ.EM

    GLOBUS: Global building renovation potential by 2070

    Authors: Shufan Zhang, Minda Ma, Nan Zhou, Jinyue Yan

    Abstract: Surpassing the two large emission sectors of transportation and industry, the building sector accounted for 34% and 37% of global energy consumption and carbon emissions in 2021, respectively. The building sector, the final piece to be addressed in the transition to net-zero carbon emissions, requires a comprehensive, multisectoral strategy for reducing emissions. Until now, the absence of data on…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 26 pages, 6 figures