Skip to main content

Showing 201–250 of 3,237 results for author: Chen, B

.
  1. arXiv:2403.16811  [pdf, ps, other

    hep-ex

    Cross section measurement of $e^+e^-\to ηψ(2S)$ and search for $e^+e^-\toη\tilde{X}(3872)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The energy-dependent cross section for $e^+e^-\to ηψ(2S)$ is measured at eighteen center of mass energies from 4.288 GeV to 4.951 GeV using the BESIII detector. Using the same data samples, we also perform the first search for the reaction $e^+e^-\toη\tilde{X}(3872)$, but no evidence is found for the $\tilde{X}(3872)$ in the $π^+π^- J/ψ$ mass distribution. At each of the eighteen center of mass en… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  2. arXiv:2403.16378  [pdf, other

    cs.IR

    Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models

    Authors: Yunjia Xi, Weiwen Liu, Jianghao Lin, Chuhan Wu, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu

    Abstract: The rise of large language models (LLMs) has opened new opportunities in Recommender Systems (RSs) by enhancing user behavior modeling and content understanding. However, current approaches that integrate LLMs into RSs solely utilize either LLM or conventional recommender model (CRM) to generate final recommendations, without considering which data segments LLM or CRM excel in. To fill in this gap… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  3. arXiv:2403.16131  [pdf, other

    cs.CV

    Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement

    Authors: Xiuquan Hou, Meiqin Liu, Senlin Zhang, ** Wei, Badong Chen

    Abstract: DETR-like methods have significantly increased detection performance in an end-to-end manner. The mainstream two-stage frameworks of them perform dense self-attention and select a fraction of queries for sparse cross-attention, which is proven effective for improving performance but also introduces a heavy computational burden and high dependence on stable query selection. This paper demonstrates… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  4. arXiv:2403.16015  [pdf, other

    cs.RO

    MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment

    Authors: Ziyan Xiong, Bo Chen, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Yang Gao

    Abstract: The advent of deep reinforcement learning (DRL) has significantly advanced the field of robotics, particularly in the control and coordination of quadruped robots. However, the complexity of real-world tasks often necessitates the deployment of multi-robot systems capable of sophisticated interaction and collaboration. To address this need, we introduce the Multi-agent Quadruped Environment (MQE),… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Open-source code is available at https://github.com/ziyanx02/multiagent-quadruped-environment

  5. arXiv:2403.15069  [pdf, other

    cs.AR

    Allspark: Workload Orchestration for Visual Transformers on Processing In-Memory Systems

    Authors: Mengke Ge, Junpeng Wang, Binhan Chen, Yingjian Zhong, Haitao Du, Song Chen, Yi Kang

    Abstract: The advent of Transformers has revolutionized computer vision, offering a powerful alternative to convolutional neural networks (CNNs), especially with the local attention mechanism that excels at capturing local structures within the input and achieve state-of-the-art performance. Processing in-memory (PIM) architecture offers extensive parallelism, low data movement costs, and scalable memory ba… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: The article is currently under review by IEEE Transactions on Computers, and has been submitted to HPCA'2024 and ISCA'2024

  6. arXiv:2403.14998  [pdf, other

    hep-ex

    Precise measurement of the $e^+e^-\to D_s^+D_s^-$ cross sections at center-of-mass energies from threshold to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using the $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, at center-of-mass energies from the threshold to $4.95$~GeV, we present precise measurements of the cross sections for the process $e^+e^-\to D_s^+D_s^-$ using a single tag method. The resulting cross section lineshape exhibits several new structures, thereby offering an input for coupled channel… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, published to PRL

  7. arXiv:2403.14268  [pdf

    eess.AS cs.SD

    Speech-Aware Neural Diarization with Encoder-Decoder Attractor Guided by Attention Constraints

    Authors: PeiYing Lee, HauYun Guo, Berlin Chen

    Abstract: End-to-End Neural Diarization with Encoder-Decoder based Attractor (EEND-EDA) is an end-to-end neural model for automatic speaker segmentation and labeling. It achieves the capability to handle flexible number of speakers by estimating the number of attractors. EEND-EDA, however, struggles to accurately capture local speaker dynamics. This work proposes an auxiliary loss that aims to guide the Tra… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to The 28th International Conference on Technologies and Applications of Artificial Intelligence (TAAI), in Chinese language

    Report number: TAAI2023-Domestic-131

  8. arXiv:2403.14057  [pdf

    cond-mat.supr-con cond-mat.str-el

    Exploring Fermi Surface Nesting and the Nature of Heavy Quasiparticles in the Spin-Triplet Superconductor Candidate CeRh$_2$As$_2$

    Authors: Bo Chen, Hao Liu, Qi-Yi Wu, Chen Zhang, Xue-Qing Ye, Yin-Zou Zhao, Jiao-Jiao Song, Xin-Yi Tian, Ba-Lei Tan, Zheng-Tai Liu, Mao Ye, Zhen-Hua Chen, Yao-Bo Huang, Da-Wei Shen, Ya-Hua Yuan, Jun He, Yu-Xia Duan, Jian-Qiao Meng

    Abstract: In this study, we investigate the electronic structure of a spin-triplet superconductor candidate CeRh$_2$As$_2$ using high-resolution angle-resolved photoemission spectroscopy and density functional theory calculations. Notably, Fermi surface nesting hints at connections to magnetic excitation or quadrupole density wave phenomena, elucidating the superconducting mechanisms. Measured band structur… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures

  9. arXiv:2403.13437  [pdf, other

    hep-ex

    Search for $ΔS=2$ nonleptonic hyperon decays $Ω^-\toΣ^{0}π^{-}$ and $Ω^-\to nK^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the center-of-mass energy of $\sqrt{s} = 3.686$ GeV, we search for the first time for two nonleptonic hyperon decays that change strangeness by two units, $Ω^-\toΣ^{0}π^-$ and $Ω^-\to nK^{-}$. No significant signal is observed. The upper limits on their decay branching fractions are determined to be… ▽ More

    Submitted 14 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  10. arXiv:2403.13352  [pdf, other

    cs.CV

    AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation

    Authors: **gkun An, Yinghao Zhu, Zongjian Li, Haoran Feng, Bohua Chen, Yemin Shi, Chengwei Pan

    Abstract: Text-to-Image (T2I) diffusion models have achieved remarkable success in image generation. Despite their progress, challenges remain in both prompt-following ability, image quality and lack of high-quality datasets, which are essential for refining these models. As acquiring labeled data is costly, we introduce AGFSync, a framework that enhances T2I diffusion models through Direct Preference Optim… ▽ More

    Submitted 3 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  11. arXiv:2403.13338  [pdf, other

    cs.CV

    Adaptive Critical Subgraph Mining for Cognitive Impairment Conversion Prediction with T1-MRI-based Brain Network

    Authors: Yilin Leng, Wenju Cui, Bai Chen, Xi Jiang, Shuangqing Chen, Jian Zheng

    Abstract: Prediction the conversion to early-stage dementia is critical for mitigating its progression but remains challenging due to subtle cognitive impairments and structural brain changes. Traditional T1-weighted magnetic resonance imaging (T1-MRI) research focus on identifying brain atrophy regions but often fails to address the intricate connectivity between them. This limitation underscores the neces… ▽ More

    Submitted 26 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 20 pages

  12. arXiv:2403.13208  [pdf, other

    cs.RO

    CaDRE: Controllable and Diverse Generation of Safety-Critical Driving Scenarios using Real-World Trajectories

    Authors: Peide Huang, Wenhao Ding, Jonathan Francis, Bingqing Chen, Ding Zhao

    Abstract: Simulation is an indispensable tool in the development and testing of autonomous vehicles (AVs), offering an efficient and safe alternative to road testing by allowing the exploration of a wide range of scenarios. Despite its advantages, a significant challenge within simulation-based testing is the generation of safety-critical scenarios, which are essential to ensure that AVs can handle rare but… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  13. arXiv:2403.12660  [pdf, other

    cs.IR cs.AI

    ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems

    Authors: Pengyue Jia, Ye**g Wang, Zhaocheng Du, Xiangyu Zhao, Yichao Wang, Bo Chen, Wanyu Wang, Huifeng Guo, Ruiming Tang

    Abstract: Deep Recommender Systems (DRS) are increasingly dependent on a large number of feature fields for more precise recommendations. Effective feature selection methods are consequently becoming critical for further enhancing the accuracy and optimizing storage efficiencies to align with the deployment demands. This research area, particularly in the context of DRS, is nascent and faces three core chal… ▽ More

    Submitted 19 June, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted to KDD 2024

  14. arXiv:2403.12211  [pdf, other

    cs.CV cs.AI

    A Unified Model for Longitudinal Multi-Modal Multi-View Prediction with Missingness

    Authors: Boqi Chen, Junier Oliva, Marc Niethammer

    Abstract: Medical records often consist of different modalities, such as images, text, and tabular information. Integrating all modalities offers a holistic view of a patient's condition, while analyzing them longitudinally provides a better understanding of disease progression. However, real-world longitudinal medical records present challenges: 1) patients may lack some or all of the data for a specific t… ▽ More

    Submitted 21 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  15. arXiv:2403.11427  [pdf, other

    cs.CV

    BAGS: Building Animatable Gaussian Splatting from a Monocular Video with Diffusion Priors

    Authors: Tingyang Zhang, Qingzhe Gao, Weiyu Li, Libin Liu, Baoquan Chen

    Abstract: Animatable 3D reconstruction has significant applications across various fields, primarily relying on artists' handcraft creation. Recently, some studies have successfully constructed animatable 3D models from monocular videos. However, these approaches require sufficient view coverage of the object within the input video and typically necessitate significant time and computational costs for train… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: https://talegqz.github.io/BAGS/

  16. arXiv:2403.11057  [pdf, other

    cs.CV cs.RO

    Large Language Models Powered Context-aware Motion Prediction

    Authors: Xiaoji Zheng, Lixiu Wu, Zhijie Yan, Yuanrong Tang, Hao Zhao, Chen Zhong, Bokui Chen, Jiangtao Gong

    Abstract: Motion prediction is among the most fundamental tasks in autonomous driving. Traditional methods of motion forecasting primarily encode vector information of maps and historical trajectory data of traffic participants, lacking a comprehensive understanding of overall traffic semantics, which in turn affects the performance of prediction tasks. In this paper, we utilized Large Language Models (LLMs… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 6 pages,4 figures

    MSC Class: 68T45

  17. arXiv:2403.10962  [pdf, other

    cs.CV eess.IV

    Exploiting Topological Priors for Boosting Point Cloud Generation

    Authors: Baiyuan Chen

    Abstract: This paper presents an innovative enhancement to the Sphere as Prior Generative Adversarial Network (SP-GAN) model, a state-of-the-art GAN designed for point cloud generation. A novel method is introduced for point cloud generation that elevates the structural integrity and overall quality of the generated point clouds by incorporating topological priors into the training process of the generator.… ▽ More

    Submitted 26 April, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures

  18. arXiv:2403.10877  [pdf, ps, other

    hep-ex hep-ph

    Test of lepton universality and measurement of the form factors of $D^0\to K^{*}(892)^-μ^+ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: We report a first study of the semileptonic decay $D^0\rightarrow K^-π^0μ^{+}ν_μ$ by analyzing an $e^+e^-$ annihilation data sample of $7.9~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The absolute branching fraction of $D^0\to K^-π^0μ^{+}ν_μ$ is measured for the first time to be $(0.729 \pm 0.014_{\rm stat} \pm 0.011_{\rm syst})\%$. Based on an a… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  19. arXiv:2403.10749  [pdf, other

    nucl-ex nucl-th

    Evolution of chirality from transverse wobbling in $^{135}$Pr

    Authors: N. Sensharma, U. Garg, Q. B. Chen, S. Frauendorf, S. Zhu, J. Arroyo, A. D. Ayangeakaa, D. P. Burdette, M. P. Carpenter, P. Copp, J. L. Cozzi, S. S. Ghugre, D. J. Hartley, K. B. Howard, R. V. F. Janssens, F. G. Kondev, T. Lauritsen, J. Li, R. Palit, A. Saracino, D. Seweryniak, S. Weyhmiller, J. Wu

    Abstract: Chirality is a distinct signature that characterizes triaxial shapes in nuclei. We report the first observation of chirality in the nucleus $^{135}$Pr using a high-statistics Gammasphere experiment with the $^{123}$Sb($^{16}$O,4n)$^{135}$Pr reaction. Two chiral-partner bands with the configuration $π(1h_{11/2})^1\otimes ν(1h_{11/2})^{-2}$ have been identified in this nucleus. Angular distribution… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 11 pages, 11 figures

  20. arXiv:2403.09895  [pdf, other

    cs.CE math.NA

    Overcoming the cohesive zone limit in the modelling of composites delamination with TUBA cohesive elements

    Authors: Giorgio Tosti Balducci, Boyang Chen

    Abstract: The wide adoption of composite structures in the aerospace industry requires reliable numerical methods to account for the effects of various damage mechanisms, including delamination. Cohesive elements are a versatile and physically representative way of modelling delamination. However, using their standard form which conforms to solid substrate elements, multiple elements are required in the nar… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  21. arXiv:2403.08654  [pdf, other

    eess.AS cs.SD

    An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning

    Authors: Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk

    Abstract: Self-supervised speech representation learning enables the extraction of meaningful features from raw waveforms. These features can then be efficiently used across multiple downstream tasks. However, two significant issues arise when considering the deployment of such methods ``in-the-wild": (i) Their large size, which can be prohibitive for edge applications; and (ii) their robustness to detrimen… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Under review on IEEE Transactions on Audio, Speech, and Language Processing (2024)

  22. arXiv:2403.08600  [pdf

    cs.NI

    Evaluation of Control/User-Plane Denial-of-Service (DoS) Attack on O-RAN Fronthaul Interface

    Authors: Ferlinda Feliana, Ting-Wei Hung, Binbin Chen, Ray-Guang Cheng

    Abstract: The open fronthaul interface defined by O-RAN ALLIANCE aims to support the interoperability between multi-vendor open radio access network (O-RAN) radio units (O-RU) and O-RAN distributed units (O-DU). This paper introduces a new tool that could be used to evaluate Denial-of-Service (DoS) attacks against the open fronthaul interface. We launched an array of control/user planes (C/U-Planes) attacks… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE INFOCOM Workshop: Next-generation Open and Programmable Radio Access Networks (NG-OPERA)

  23. arXiv:2403.07837  [pdf, other

    physics.optics

    Topological Protection of Optical Skyrmions through Complex Media

    Authors: An Aloysius Wang, Zimo Zhao, Yifei Ma, Yuxi Cai, Tade Marozsak, Binguo Chen, Honghui He, Lin Luo, Martin J Booth, Steve J Elston, Stephen M Morris, Chao He

    Abstract: Recent experimental realizations of optical Skyrmions through the techniques of structured light have opened the doors to a completely new way of representing data in electromagnetic fields, namely its topology. Apart from potentially enhancing the bandwidth of optical systems, the intrinsically discrete nature of the topological number allows Skyrmions to naturally interface with the digital worl… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  24. arXiv:2403.06886  [pdf, other

    gr-qc astro-ph.HE hep-th

    QED Effects on Kerr-Newman Black Hole Shadows

    Authors: Shaobing Yuan, Changkai Luo, Zezhou Hu, Zhenyu Zhang, Bin Chen

    Abstract: Incorporating first-order QED effects, we explore the shadows of Kerr-Newman black holes with a magnetic charge through the numerical backward ray-tracing method. Our investigation accounts for both the direct influence of the electromagnetic field on light rays and the distortion of the background spacetime metric due to QED corrections. We notice that the area of the shadow increases with the QE… ▽ More

    Submitted 26 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures; v2: references added and minor revisions

  25. arXiv:2403.06766  [pdf, other

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be… ▽ More

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  26. arXiv:2403.06394  [pdf, other

    cs.CV

    FSViewFusion: Few-Shots View Generation of Novel Objects

    Authors: Rukhshanda Hussain, Hui Xian Grace Lim, Borchun Chen, Mubarak Shah, Ser Nam Lim

    Abstract: Novel view synthesis has observed tremendous developments since the arrival of NeRFs. However, Nerf models overfit on a single scene, lacking generalization to out of distribution objects. Recently, diffusion models have exhibited remarkable performance on introducing generalization in view synthesis. Inspired by these advancements, we explore the capabilities of a pretrained stable diffusion mode… ▽ More

    Submitted 12 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  27. arXiv:2403.05406  [pdf, other

    cs.LG cs.AI

    Considering Nonstationary within Multivariate Time Series with Variational Hierarchical Transformer for Forecasting

    Authors: Muyao Wang, Wenchao Chen, Bo Chen

    Abstract: The forecasting of Multivariate Time Series (MTS) has long been an important but challenging task. Due to the non-stationary problem across long-distance time steps, previous studies primarily adopt stationarization method to attenuate the non-stationary problem of the original series for better predictability. However, existing methods always adopt the stationarized series, which ignores the inhe… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: accepted by AAAI2024

  28. arXiv:2403.05283  [pdf

    cond-mat.soft physics.app-ph

    Closely piling up of multiple adhesive fronts in adhesive friction due to re-attachment

    Authors: Puyu Cao, Meicheng Yao, Bin Chen

    Abstract: To understand why the adhesive frictional force was in linear proportion to the real contact area in experiments, we investigate the adhesive friction generated by sliding elastic solids adhered to a rigid surface via multiple adhesive springs. Our results indicate that the shear-off force of the interface increases with the energetically guided re-attachment rate of adhesive springs, reaching sat… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  29. arXiv:2403.05067  [pdf, other

    physics.flu-dyn

    Prediction of turbulent energy based on low-rank resolvent modes and machine learning

    Authors: Yitong Fan, Bo Chen, Weipeng Li

    Abstract: A modelling framework based on the resolvent analysis and machine learning is proposed to predict the turbulent energy in incompressible channel flows. In the framework, the optimal resolvent response modes are selected as the basis functions modelling the low-rank behaviour of high-dimensional nonlinear turbulent flow-fields, and the corresponding weight functions are determined by data-driven ne… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 13 pages, 8 figures

  30. arXiv:2403.04797  [pdf, other

    cs.CL cs.LG

    Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding

    Authors: Zhenyu Zhang, Run** Chen, Shiwei Liu, Zhewei Yao, Olatunji Ruwase, Beidi Chen, Xiaoxia Wu, Zhangyang Wang

    Abstract: This paper aims to overcome the "lost-in-the-middle" challenge of large language models (LLMs). While recent advancements have successfully enabled LLMs to perform stable language modeling with up to 4 million tokens, the persistent difficulty faced by most LLMs in identifying relevant information situated in the middle of the context has not been adequately tackled. To address this problem, this… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  31. arXiv:2403.04652  [pdf, other

    cs.CL cs.AI

    Yi: Open Foundation Models by 01.AI

    Authors: 01. AI, :, Alex Young, Bei Chen, Chao Li, Chengen Huang, Ge Zhang, Guanwei Zhang, Heng Li, Jiangcheng Zhu, Jianqun Chen, **g Chang, Kaidong Yu, Peng Liu, Qiang Liu, Shawn Yue, Senbin Yang, Shiming Yang, Tao Yu, Wen Xie, Wenhao Huang, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Pengcheng Nie , et al. (7 additional authors not shown)

    Abstract: We introduce the Yi model family, a series of language and multimodal models that demonstrate strong multi-dimensional capabilities. The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models, 200K long context models, depth-upscaled models, and vision-language models. Our base models achieve strong performance on a wide range of benchmarks like MMLU,… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  32. arXiv:2403.03715  [pdf, other

    cs.CV

    MeaCap: Memory-Augmented Zero-shot Image Captioning

    Authors: Zequn Zeng, Yan Xie, Hao Zhang, Chiyu Chen, Zhengjue Wang, Bo Chen

    Abstract: Zero-shot image captioning (IC) without well-paired image-text data can be divided into two categories, training-free and text-only-training. Generally, these two types of methods realize zero-shot IC by integrating pretrained vision-language models like CLIP for image-text similarity evaluation and a pre-trained language model (LM) for caption generation. The main difference between them is wheth… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  33. arXiv:2403.03689  [pdf, other

    cs.CL cs.AI

    General2Specialized LLMs Translation for E-commerce

    Authors: Kaidi Chen, Ben Chen, Dehong Gao, Huangyu Dai, Wen Jiang, Wei Ning, Shanqing Yu, Libin Yang, Xiaoyan Cai

    Abstract: Existing Neural Machine Translation (NMT) models mainly handle translation in the general domain, while overlooking domains with special writing formulas, such as e-commerce and legal documents. Taking e-commerce as an example, the texts usually include amounts of domain-related words and have more grammar problems, which leads to inferior performances of current NMT methods. To address these prob… ▽ More

    Submitted 6 April, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 4 pages, 1 figure, WWW2024 accepted

  34. arXiv:2403.03536  [pdf, other

    cs.IR cs.AI

    Towards Efficient and Effective Unlearning of Large Language Models for Recommendation

    Authors: Hangyu Wang, Jianghao Lin, Bo Chen, Yang Yang, Ruiming Tang, Weinan Zhang, Yong Yu

    Abstract: The significant advancements in large language models (LLMs) give rise to a promising research direction, i.e., leveraging LLMs as recommenders (LLMRec). The efficacy of LLMRec arises from the open-world knowledge and reasoning capabilities inherent in LLMs. LLMRec acquires the recommendation capabilities through instruction tuning based on user interaction data. However, in order to protect user… ▽ More

    Submitted 30 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by Frontier of Computer Science

  35. arXiv:2403.03507  [pdf, other

    cs.LG

    GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

    Authors: Jiawei Zhao, Zhenyu Zhang, Beidi Chen, Zhangyang Wang, Anima Anandkumar, Yuandong Tian

    Abstract: Training Large Language Models (LLMs) presents significant memory challenges, predominantly due to the growing size of weights and optimizer states. Common memory-reduction approaches, such as low-rank adaptation (LoRA), add a trainable low-rank matrix to the frozen pre-trained weight in each layer, reducing trainable parameters and optimizer states. However, such approaches typically underperform… ▽ More

    Submitted 2 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: ICML 2024 (Oral)

  36. arXiv:2403.03500  [pdf, other

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  37. arXiv:2403.03258  [pdf, other

    cond-mat.str-el

    Interaction-driven Roton Condensation in C = 2/3 Fractional Quantum Anomalous Hall State

    Authors: Hongyu Lu, Han-Qing Wu, Bin-Bin Chen, Kai Sun, Zi Yang Meng

    Abstract: The interplay of topological order and charge order exhibits rich physics. Recent experiments that succesfully realized the frational quantum anomalous Hall (FQAH) effect in twisted MoTe$_2$ bilayers and rhombohedral multilayer graphene without external magnetic field further call for deeper understanding of the relation between topological order and charge order in quantum moiré materials. In the… ▽ More

    Submitted 10 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  38. arXiv:2403.02950  [pdf, other

    cs.AI cs.CR

    A general approach to enhance the survivability of backdoor attacks by decision path coupling

    Authors: Yufei Zhao, Dingji Wang, Bihuan Chen, Ziqian Chen, Xin Peng

    Abstract: Backdoor attacks have been one of the emerging security threats to deep neural networks (DNNs), leading to serious consequences. One of the mainstream backdoor defenses is model reconstruction-based. Such defenses adopt model unlearning or pruning to eliminate backdoors. However, little attention has been paid to survive from such defenses. To bridge the gap, we propose Venom, the first generic ba… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  39. arXiv:2403.02828  [pdf, other

    physics.optics physics.app-ph

    A chip-integrated comb-based microwave oscillator

    Authors: Wei Sun, Zhiyang Chen, Linze Li, Chen Shen, **bao Long, Huamin Zheng, Luyu Yang, Qiushi Chen, Zhouze Zhang, Baoqi Shi, Shichang Li, Lan Gao, Yi-Han Luo, Baile Chen, Junqiu Liu

    Abstract: Low-noise microwave oscillators are cornerstones for wireless communication, radar and clocks. Optical frequency combs have enabled photonic microwaves with unrivalled noise performance and bandwidth. Emerging interest is to generate microwaves using chip-based frequency combs, namely microcombs. Here, we demonstrate the first, fully integrated, microcomb-based, microwave oscillator chip. The chip… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  40. arXiv:2403.01792  [pdf, other

    cs.SD eess.AS

    ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning

    Authors: Kuan-Hsun Ho, Jeih-weih Hung, Berlin Chen

    Abstract: Speech separation has recently made significant progress thanks to the fine-grained vision used in time-domain methods. However, several studies have shown that adopting Short-Time Fourier Transform (STFT) for feature extraction could be beneficial when encountering harsher conditions, such as noise or reverberation. Therefore, we propose a magnitude-conditioned time-domain framework, ConSep, to i… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  41. arXiv:2403.01785  [pdf, other

    cs.SD eess.AS

    What do neural networks listen to? Exploring the crucial bands in Speech Enhancement using Sinc-convolution

    Authors: Kuan-Hsun Ho, Jeih-weih Hung, Berlin Chen

    Abstract: This study introduces a reformed Sinc-convolution (Sincconv) framework tailored for the encoder component of deep networks for speech enhancement (SE). The reformed Sincconv, based on parametrized sinc functions as band-pass filters, offers notable advantages in terms of training efficiency, filter diversity, and interpretability. The reformed Sinc-conv is evaluated in conjunction with various SE… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  42. arXiv:2403.01761  [pdf, other

    hep-ex

    Observation of $ψ(3686)\to 3φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (645 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times 10^9$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of $ψ(3686)\to 3φ$ decay with a significance larger than 10$σ$. The branching fraction of this decay is determined to be $(1.46\pm0.05\pm0.17)\times10^{-5}$, where the first uncertainty is statistical and the second is systematic. No significant str… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  43. arXiv:2403.01710  [pdf, other

    cs.RO

    Sensor-based Multi-Robot Coverage Control with Spatial Separation in Unstructured Environments

    Authors: Xinyi Wang, Jiwen Xu, Chuanxiang Gao, Yizhou Chen, Jihan Zhang, Chenggang Wang, Ben M. Chen

    Abstract: Multi-robot systems have increasingly become instrumental in tackling search and coverage problems. However, the challenge of optimizing task efficiency without compromising task success still persists, particularly in expansive, unstructured environments with dense obstacles. This paper presents an innovative, decentralized Voronoi-based approach for search and coverage to reactively navigate t… ▽ More

    Submitted 10 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  44. arXiv:2403.01154  [pdf, ps, other

    math.AG

    Boundedness of klt complements on Fano fibrations over surfaces

    Authors: Bingyi Chen

    Abstract: Let $(X,B)$ be an $ε$-lc pair of dimension $d$ with a closed point $x\in X$. Birkar conjectured that there is an effective Cartier divisor $H$ passing through $x$ such that $(X,B+tH)$ is lc near $x$, where $t$ is a positive real number depending only on $d,ε$. We prove that Birkar's conjecture is equivalent to Shokurov's conjecture on boundedness of klt complements on Fano fibrations and we confir… ▽ More

    Submitted 26 April, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: Version 2, showed that Shokurov's conjecture implies Birkar's conjecture, so they are equivalent (see Theorem 1.6). Version 3, added Example 1.10 to indicate that the order O(ε^2) in Theorem 1.7 is optimal. arXiv admin note: text overlap with arXiv:1811.10709 by other authors

  45. arXiv:2403.00985  [pdf, other

    astro-ph.SR

    Episodic energy release during the main- and post-impulsive phase of a solar flare

    Authors: Yuqian Wei, Bin Chen, Sijie Yu, Haimin Wang, Yixian Zhang, Lindsay Glesener

    Abstract: When and where the magnetic field energy is released and converted in eruptive solar flares remains an outstanding topic in solar physics. To shed light on this question, here we report multi-wavelength observations of a C9.4-class eruptive limb flare that occurred on 2017 August 20. The flare, accompanied by a magnetic flux rope eruption and a white light coronal mass ejection, features three pos… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 21 pages, 14 figures

  46. arXiv:2402.19248  [pdf, other

    cs.CL

    Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark

    Authors: Zhikun Xu, Yinghui Li, Ruixue Ding, Xinyu Wang, Boli Chen, Yong Jiang, Hai-Tao Zheng, Wenlian Lu, Pengjun Xie, Fei Huang

    Abstract: How to better evaluate the capabilities of Large Language Models (LLMs) is the focal point and hot topic in current LLMs research. Previous work has noted that due to the extremely high cost of iterative updates of LLMs, they are often unable to answer the latest dynamic questions well. To promote the improvement of Chinese LLMs' ability to answer dynamic questions, in this paper, we introduce CDQ… ▽ More

    Submitted 1 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Work in progress!

  47. arXiv:2402.18951  [pdf, other

    cs.CV

    Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition

    Authors: Boyu Chen, Siran Chen, Kunchang Li, Qinglin Xu, Yu Qiao, Yali Wang

    Abstract: Open-world video recognition is challenging since traditional networks are not generalized well on complex environment variations. Alternatively, foundation models with rich knowledge have recently shown their generalization power. However, how to apply such knowledge has not been fully explored for open-world video recognition. To this end, we propose a generic knowledge transfer pipeline, which… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 35 pages, 6 figures, 8 tables

  48. arXiv:2402.17913  [pdf, other

    physics.flu-dyn cs.AI cs.LG

    Using AI libraries for Incompressible Computational Fluid Dynamics

    Authors: Boyang Chen, Claire E. Heaney, Christopher C. Pain

    Abstract: Recently, there has been a huge effort focused on develo** highly efficient open source libraries to perform Artificial Intelligence (AI) related computations on different computer architectures (for example, CPUs, GPUs and new AI processors). This has not only made the algorithms based on these libraries highly efficient and portable between different architectures, but also has substantially s… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 24 pages, 6 figures

  49. arXiv:2402.17189  [pdf

    cs.CL cs.AI cs.SD eess.AS

    An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement

    Authors: Tzu-Ting Yang, Hsin-Wei Wang, Yi-Cheng Wang, Chi-Han Lin, Berlin Chen

    Abstract: With the massive developments of end-to-end (E2E) neural networks, recent years have witnessed unprecedented breakthroughs in automatic speech recognition (ASR). However, the codeswitching phenomenon remains a major obstacle that hinders ASR from perfection, as the lack of labeled data and the variations between languages often lead to degradation of ASR performance. In this paper, we focus exclus… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: ICASSP 2024

  50. arXiv:2402.16855  [pdf, other

    cs.CV

    MB-RACS: Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network

    Authors: Yujun Huang, Bin Chen, Naiqi Li, Baoyi An, Shu-Tao Xia, Yaowei Wang

    Abstract: Conventional compressed sensing (CS) algorithms typically apply a uniform sampling rate to different image blocks. A more strategic approach could be to allocate the number of measurements adaptively, based on each image block's complexity. In this paper, we propose a Measurement-Bounds-based Rate-Adaptive Image Compressed Sensing Network (MB-RACS) framework, which aims to adaptively determine the… ▽ More

    Submitted 18 January, 2024; originally announced February 2024.