Showing 1–50 of 3,135 results for author: Yu, H

  1. arXiv:2407.03205  [pdf, other]

    cs.CV

    Category-Aware Dynamic Label Assignment with High-Quality Oriented Proposal

    Authors: Mingkui Feng, Hancheng Yu, Xiaoyu Dang, Ming Zhou

    Abstract: Objects in aerial images are typically embedded in complex backgrounds and exhibit arbitrary orientations. When employing oriented bounding boxes (OBB) to represent arbitrary oriented objects, the periodicity of angles could lead to discontinuities in label regression values at the boundaries, inducing abrupt fluctuations in the loss function. To address this problem, an OBB representation based o… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2407.02785  [pdf]

    cond-mat.mtrl-sci physics.comp-ph

    Identifying Direct Bandgap Silicon Structures with High-throughput Search and Machine Learning Methods

    Authors: Rui Wang, Hongyu Yu, Yang Zhong, Hongjun Xiang

    Abstract: Utilizations of silicon-based luminescent devices are restricted by the indirect-gap nature of diamond silicon. In this study, the high-throughput method is employed to expedite discoveries of direct-gap silicon crystals. The machine learning (ML) potential is utilized to construct a dataset comprising 2637 silicon allotropes, which is subsequently screened using an ML Hamiltonian model and densit… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2407.02052  [pdf, other]

    eess.AS cs.SD

    The USTC-NERCSLIP Systems for The ICMC-ASR Challenge

    Authors: Minghui Wu, Luzhen Xu, Jie Zhang, Haitao Tang, Yanyan Yue, Ruizhi Liao, Jintao Zhao, Zhengzhe Zhang, Yichi Wang, Haoyin Yan, Hongliang Yu, Tongle Ma, Jiachen Liu, Chongliang Wu, Yongchao Li, Yanyong Zhang, Xin Fang, Yue Zhang

    Abstract: This report describes the submitted system to the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) challenge, which considers the ASR task with multi-speaker overlapping and Mandarin accent dynamics in the ICMC case. We implement the front-end speaker diarization using the self-supervised learning representation based multi-speaker embedding and beamforming using the speaker position,… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ICASSP 2024

  4. arXiv:2407.01976  [pdf, other]

    cs.CL cs.AI cs.MM

    A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding

    Authors: Jinghui Lu, Haiyang Yu, Yanjie Wang, Yongjie Ye, Jingqun Tang, Ziwei Yang, Binghong Wu, Qi Liu, Hao Feng, Han Wang, Hao Liu, Can Huang

    Abstract: Recently, many studies have demonstrated that exclusively incorporating OCR-derived text and spatial layouts with large language models (LLMs) can be highly effective for document understanding tasks. However, existing methods that integrate spatial layouts with text have limitations, such as producing overly long text sequences or failing to fully leverage the autoregressive traits of LLMs. In th… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2407.01914  [pdf]

    cond-mat.mtrl-sci physics.comp-ph

    Switchable Ferroelectricity in Subnano Silicon Thin Films

    Authors: Hongyu Yu, Shihan Deng, Muting Xie, Yuwen Zhang, Xizhi Shi, Jianxin Zhong, Chaoyu He, Hongjun Xiang

    Abstract: Recent advancements underscore the critical need to develop ferroelectric materials compatible with silicon. We systematically explore possible ferroelectric silicon quantum films and discover a low-energy variant (hex-OR-2*2-P) with energy just 1 meV/atom above the ground state (hex-OR-2*2). Both hex-OR-2*2 and hex-OR-2*2-P are confirmed to be dynamically and mechanically stable semiconductors wi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 18 pages, 3 figures

  6. arXiv:2407.01336  [pdf, other]

    cs.IT eess.SP

    Compressed Sensing Inspired User Acquisition for Downlink Integrated Sensing and Communication Transmissions

    Authors: Yi Song, Fernando Pedraza, Shuangyang Li, Siyao Li, Han Yu, Giuseppe Caire

    Abstract: This paper investigates radar-assisted user acquisition for downlink multi-user multiple-input multiple-output (MIMO) transmission using Orthogonal Frequency Division Multiplexing (OFDM) signals. Specifically, we formulate a concise mathematical model for the user acquisition problem, where each user is characterized by its delay and beamspace response. Therefore, we propose a two-stage method for… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  7. Personalized Federated Continual Learning via Multi-granularity Prompt

    Authors: Hao Yu, Xin Yang, Xin Gao, Yan Kang, Hao Wang, Junbo Zhang, Tianrui Li

    Abstract: Personalized Federated Continual Learning (PFCL) is a new practical scenario that poses greater challenges in sharing and personalizing knowledge. PFCL not only relies on knowledge fusion for server aggregation at the global spatial-temporal perspective but also needs model improvement for each client according to the local requirements. Existing methods, whether in Personalized Federated Learning… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: Accepted by KDD 2024 Research Track

  8. arXiv:2406.19741  [pdf, other]

    cs.RO cs.AI

    ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning

    Authors: Christopher E. Mower, Yuhui Wan, Hongzhan Yu, Antoine Grosnit, Jonas Gonzalez-Billandon, Matthieu Zimmer, Jinlong Wang, Xinyu Zhang, Yao Zhao, Anbang Zhai, Puze Liu, Daniel Palenicek, Davide Tateo, Cesar Cadena, Marco Hutter, Jan Peters, Guangjian Tian, Yuzheng Zhuang, Kun Shao, Xingyue Quan, Jianye Hao, Jun Wang, Haitham Bou-Ammar

    Abstract: We present a framework for intuitive robot programming by non-experts, leveraging natural language prompts and contextual information from the Robot Operating System (ROS). Our system integrates large language models (LLMs), enabling non-experts to articulate task requirements to the system through a chat interface. Key features of the framework include: integration of ROS with an AI agent connect… ▽ More

    Submitted 2 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: This document contains 26 pages and 13 figures

  9. arXiv:2406.19411  [pdf, ps, other]

    math.GR

    On exact products of two dihedral groups

    Authors: Kan Hu, Hao Yu

    Abstract: An exact product of two finite groups $H$ and $K$ is a finite group $X$ which contains $H$ and $K$ as subgroups, satisfying $X=HK$ and $H\cap K=\{1_X\}$. In this paper, we provide a classification of the exact products of two dihedral groups of orders $2m$ and $2n$ for all odd numbers $m,n\geq 3$.

    Submitted 15 June, 2024; originally announced June 2024.

  10. arXiv:2406.18988  [pdf]

    physics.optics astro-ph.IM physics.app-ph

    Hyper-sampling imaging

    Authors: Ze Zhang, Hemeng Xue, Mingtao Shang, Hongfei Yu, Jinchao Liang, Meiling Guan, Chengming Sun, Huahua Wang, Shufeng Wang, Zhengyu Ye, Feng Gao, Lu Gao

    Abstract: In our research, we have developed a novel mechanism that allows for a significant reduction in the smallest sampling unit of digital image sensors (DIS) to as small as 1/16th of a pixel, through measuring the intra-pixel quantum efficiency for the first time and recomputing the image. Employing our method, the physical sampling resolution of DIS can be enhanced by 16 times. The method has undergo… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  11. arXiv:2406.17419  [pdf, other]

    cs.CL cs.AI

    Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA

    Authors: Minzheng Wang, Longze Chen, Cheng Fu, Shengyi Liao, Xinghua Zhang, Bingli Wu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li

    Abstract: Long-context modeling capabilities have garnered widespread attention, leading to the emergence of Large Language Models (LLMs) with ultra-context windows. Meanwhile, benchmarks for evaluating long-context LLMs are gradually catching up. However, existing benchmarks employ irrelevant noise texts to artificially extend the length of test cases, diverging from the real-world scenarios of long-contex… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: We release our code and data publicly at https://github.com/MozerWang/Loong

  12. arXiv:2406.17402  [pdf, other]

    gr-qc hep-th quant-ph

    Quantum gravitomagnetic interaction

    Authors: Di Hao, Jiawei Hu, Hongwei Yu

    Abstract: In the framework of linearized quantum gravity, we study the quantum gravitational interaction between two nonpointlike objects induced by fluctuating gravitomagnetic fields in vacuum. We find that, in addition to the quantum gravitational interaction induced by fluctuating gravitoelectric fields previously studied, there exists a quantum gravitomagnetic interaction. This interaction originates fr… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 18 pages, 1 figure

    Journal ref: Phys. Rev. D 109, 126016 (2024)

  13. arXiv:2406.17262  [pdf, other]

    cs.CL

    D2LLM: Decomposed and Distilled Large Language Models for Semantic Search

    Authors: Zihan Liao, Hang Yu, Jianguo Li, Jun Wang, Wei Zhang

    Abstract: The key challenge in semantic search is to create models that are both accurate and efficient in pinpointing relevant sentences for queries. While BERT-style bi-encoders excel in efficiency with pre-computed embeddings, they often miss subtle nuances in search tasks. Conversely, GPT-style LLMs with cross-encoder designs capture these nuances but are computationally intensive, hindering real-time a… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  14. arXiv:2406.16287  [pdf, other]

    math.NA

    Energetic Spectral-Element Time Marching Methods for Phase-Field Nonlinear Gradient Systems

    Authors: Shiqin Liu, Haijun Yu

    Abstract: We propose two efficient energetic spectral-element methods in time for marching nonlinear gradient systems with the phase-field Allen--Cahn equation as an example: one fully implicit nonlinear method and one semi-implicit linear method. Different from other spectral methods in time using spectral Petrov-Galerkin or weighted Galerkin approximations, the presented implicit method employs an energet… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 28 pages, 10 figures

  15. arXiv:2406.16242  [pdf, other]

    math.DG

    Foliation of area minimizing hypersurfaces in asymptotically flat manifolds and Schoen's conjecture

    Authors: Shihang He, Yuguang Shi, Haobin Yu

    Abstract: In this paper, we demonstrate that any asymptotically flat manifold $(M^n, g)$ with $4\leq n\leq 7$ can be foliated by a family of area-minimizing hypersurfaces, each of which is asymptotic to Cartesian coordinate hyperplanes defined at an end of $(M^n, g)$. As an application of this foliation, we show that for any asymptotically flat manifold $(M^n, g)$ with $4\leq n\leq 7$, nonnegative scalar cu… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 39 pages, 8 figures. Comments are welcome!

  16. arXiv:2406.15363  [pdf]

    cs.CL

    Exploring LLM Multi-Agents for ICD Coding

    Authors: Rumeng Li, Xun Wang, Hong Yu

    Abstract: Large Language Models (LLMs) have demonstrated impressive and diverse abilities that can benefit various domains, such as zero and few-shot information extraction from clinical text without domain-specific training. However, for the ICD coding task, they often hallucinate key details and produce high recall but low precision results due to the high-dimensional and skewed distribution of the ICD co… ▽ More

    Submitted 1 April, 2024; originally announced June 2024.

  17. arXiv:2406.13972  [pdf, other]

    cs.SE

    CREF: An LLM-based Conversational Software Repair Framework for Programming Tutors

    Authors: Boyang Yang, Haoye Tian, Weiguo Pian, Haoran Yu, Haitao Wang, Jacques Klein, Tegawendé F. Bissyandé, Shunfu Jin

    Abstract: Program repair techniques offer cost-saving benefits for debugging within software development and programming education scenarios. With the proven effectiveness of Large Language Models (LLMs) in code-related tasks, researchers have explored their potential for program repair. However, it is crucial to recognize that existing repair benchmarks may have influenced LLM training data, potentially ca… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  18. arXiv:2406.13578  [pdf, other]

    cs.CL

    Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration

    Authors: Han-Cheng Yu, Yu-An Shih, Kin-Man Law, Kai-Yu Hsieh, Yu-Chen Cheng, Hsin-Chih Ho, Zih-An Lin, Wen-Chuan Hsu, Yao-Chung Fan

    Abstract: In this paper, we tackle the task of distractor generation (DG) for multiple-choice questions. Our study introduces two key designs. First, we propose \textit{retrieval augmented pretraining}, which involves refining the language model pretraining to align it more closely with the downstream task of DG. Second, we explore the integration of knowledge graphs to enhance the performance of DG. Throug… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Findings at ACL 2024

  19. arXiv:2406.12793  [pdf, other]

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM, :, Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang , et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  20. arXiv:2406.12195  [pdf, other]

    quant-ph cs.LG

    Quantum Compiling with Reinforcement Learning on a Superconducting Processor

    Authors: Z. T. Wang, Qiuhao Chen, Yuxuan Du, Z. H. Yang, Xiaoxia Cai, Kaixuan Huang, Jingning Zhang, Kai Xu, Jun Du, Yinan Li, Yuling Jiao, Xingyao Wu, Wu Liu, Xiliang Lu, Huikai Xu, Yirong Jin, Ruixia Wang, Haifeng Yu, S. P. Zhao

    Abstract: To effectively implement quantum algorithms on noisy intermediate-scale quantum (NISQ) processors is a central task in modern quantum technology. NISQ processors feature tens to a few hundreds of noisy qubits with limited coherence times and gate operations with errors, so NISQ algorithms naturally require employing circuits of short lengths via quantum compilation. Here, we develop a reinforcemen… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  21. arXiv:2406.11508  [pdf, other]

    eess.SY

    Leveraging Cooperative Connected Automated Vehicles for Mixed Traffic Safety

    Authors: Chenguang Zhao, Tamas G. Molnar, Huan Yu

    Abstract: The introduction of connected and automated vehicles (CAV) is believed to reduce congestion, enhance safety, and improve traffic efficiency. Numerous research studies have focused on controlling pure CAV platoons in fully connected automated traffic, as well as single or multiple CAVs in mixed traffic with human-driven vehicles (HVs). CAV cruising control designs have been proposed to stabilize th… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  22. arXiv:2406.11274  [pdf, other]

    cs.CL

    Skip-Layer Attention: Bridging Abstract and Detailed Dependencies in Transformers

    Authors: Qian Chen, Wen Wang, Qinglin Zhang, Siqi Zheng, Shiliang Zhang, Chong Deng, Hai Yu, Jiaqing Liu, Yukun Ma, Chong Zhang

    Abstract: The Transformer architecture has significantly advanced deep learning, particularly in natural language processing, by effectively managing long-range dependencies. However, as the demand for understanding complex relationships grows, refining the Transformer's architecture becomes critical. This paper introduces Skip-Layer Attention (SLA) to enhance Transformer models by enabling direct attention… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 7 pages, 1 figure

  23. arXiv:2406.10753  [pdf, other]

    astro-ph.CO

    Testing the parametric model for self-interacting dark matter using matched halos in cosmological simulations

    Authors: Daneng Yang, Ethan O. Nadler, Hai-Bo Yu

    Abstract: We systemically evaluate the performance of the self-interacting dark matter (SIDM) halo model proposed in arXiv:2305.16176 with matched halos from high-resolution cosmological CDM and SIDM simulations. The model incorporates SIDM effects along mass evolution histories of CDM halos and it is applicable to both isolated halos and subhalos. We focus on the accuracy of the model in predicting halo d… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 20 pages, 19 figures

  24. arXiv:2406.10593  [pdf, other]

    cs.AI cs.DB cs.IR

    QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL

    Authors: Yinggang Sun, Ziming Guo, Haining Yu, Chuanyi Liu, Xiang Li, Bingxuan Wang, Xiangzhan Yu, Tiancheng Zhao

    Abstract: Fine-tuning large language models (LLMs) for specific domain tasks has achieved great success in Text-to-SQL tasks. However, these fine-tuned models often face challenges with multi-turn Text-to-SQL tasks caused by ambiguous or unanswerable questions. It is desired to enhance LLMs to handle multiple types of questions in multi-turn Text-to-SQL tasks. To address this, we propose a novel data augmen… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 13 pages, 7 figures

  25. arXiv:2406.10583  [pdf, other]

    hep-ex

    Demonstration of neutron identification in neutrino interactions in the MicroBooNE liquid argon time projection chamber

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book , et al. (165 additional authors not shown)

    Abstract: A significant challenge in measurements of neutrino oscillations is reconstructing the incoming neutrino energies. While modern fully-active tracking calorimeters such as liquid argon time projection chambers in principle allow the measurement of all final state particles above some detection threshold, undetected neutrons remain a considerable source of missing energy with little to no data const… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Report number: FERMILAB-PUB-24-0301

  26. arXiv:2406.10123  [pdf, other]

    hep-ex physics.ins-det

    Improving neutrino energy estimation of charged-current interaction events with recurrent neural networks in MicroBooNE

    Authors: MicroBooNE collaboration, P. Abratenko, O. Alterkait, D. Andrade Aldana, L. Arellano, J. Asaadi, A. Ashkenazi, S. Balasubramanian, B. Baller, A. Barnard, G. Barr, D. Barrow, J. Barrow, V. Basque, J. Bateman, O. Benevides Rodrigues, S. Berkman, A. Bhanderi, A. Bhat, M. Bhattacharya, M. Bishai, A. Blake, B. Bogart, T. Bolton, J. Y. Book , et al. (164 additional authors not shown)

    Abstract: We present a deep learning-based method for estimating the neutrino energy of charged-current neutrino-argon interactions. We employ a recurrent neural network (RNN) architecture for neutrino energy estimation in the MicroBooNE experiment, utilizing liquid argon time projection chamber (LArTPC) detector technology. Traditional energy estimation approaches in LArTPCs, which largely rely on reconstr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Report number: FERMILAB-PUB-24-0287

  27. arXiv:2406.09683  [pdf, other]

    astro-ph.GA

    Interstellar Nitrogen Isotope Ratios: Measurements on tracers of C$^{14}$N and C$^{15}$N

    Authors: J. L. Chen, J. S. Zhang, C. Henkel, Y. T. Yan, H. Z. Yu, Y. X. Wang, Y. P. Zou, J. Y. Zhao, X. Y. Wang

    Abstract: The nitrogen isotope ratio 14N/15N is a powerful tool to trace Galactic stellar nucleosynthesis and constraining Galactic chemical evolution. Previous observations have found lower 14N/15N ratios in the Galactic center and higher values in the Galactic disk. This is consistent with the inside-out formation scenario of our Milky Way. However, previous studies mostly utilized double isotope ratios a… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 34 pages, 9 figures, 6 tables

    Journal ref: The Astrophysical Journal (2004)

  28. arXiv:2406.09394  [pdf, other]

    cs.CV cs.GR

    WonderWorld: Interactive 3D Scene Generation from a Single Image

    Authors: Hong-Xing Yu, Haoyi Duan, Charles Herrmann, William T. Freeman, Jiajun Wu

    Abstract: We present WonderWorld, a novel framework for interactive 3D scene extrapolation that enables users to explore and shape virtual environments based on a single input image and user-specified text. While significant improvements have been made to the visual quality of scene generation, existing methods are run offline, taking tens of minutes to hours to generate a scene. By leveraging Fast Gaussian… ▽ More

    Submitted 14 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Project website: https://WonderWorld-2024.github.io/

  29. arXiv:2406.09205  [pdf, other]

    cs.CL cs.AI

    ReadCtrl: Personalizing text generation with readability-controlled instruction learning

    Authors: Hieu Tran, Zonghai Yao, Lingxi Li, Hong Yu

    Abstract: Content generation conditioning on users' readability is an important application for personalization. In an era of large language models (LLMs), readability-controlled text generation based on LLMs has become increasingly important. This paper introduces a novel methodology called "Readability-Controlled Instruction Learning (ReadCtrl)," which aims to instruction-tune LLMs to tailor users' reada… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 9 pages

  30. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  31. arXiv:2406.08301  [pdf, other]

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri , et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  32. arXiv:2406.08275  [pdf]

    cond-mat.mtrl-sci

    An accurate and transferable machine learning interatomic potential for equimolar and non-equimolar high-entropy diborides

    Authors: Hong Meng, Yiwen Liu, Hulei Yu, Lei Zhuang, Yanhui Chu

    Abstract: Machine learning interatomic potentials have become a powerful tool to achieve molecular dynamics (MD) simulations with the accuracy of ab initio methods while beyond their length and timescale limitations. Here, we develop an efficient moment tensor potential (MTP) for high-entropy diborides (HEBs) based on unary and binary diborides with Ti-V-Cr-Zr-Nb-Mo-Hf-Ta-W principal elements. Notably, the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 3 figures

  33. arXiv:2406.08243  [pdf]

    cond-mat.mtrl-sci

    An efficient strategy to construct general machine learning potentials for high-entropy ceramics

    Authors: Yiwen Liu, Hong Meng, Zijie Zhu, Hulei Yu, Lei Zhuang, Yanhui Chu

    Abstract: Molecular dynamics (MD) simulations are considered an efficient and low-cost means to develop remarkable properties of high-entropy ceramics with vast composition space, yet the lack of general potentials severely limits their applications. Herein, taking high-entropy carbides (HECs) as the model, we propose a strategy to efficiently construct a general neuroevolution potential (NEP) with broad co… ▽ More

    Submitted 18 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 28 pages, 6 figures

  34. arXiv:2406.07637  [pdf, other]

    astro-ph.GA

    The destiny of NGC 6530: past and future

    Authors: Delong Jia, Heng Yu, Zhengyi Shao, Lu Li

    Abstract: Studying the structures of open clusters is crucial for understanding stellar evolution and galactic dynamics. Based on Gaia DR3 data, we apply the hierarchical clustering algorithm to a young open cluster NGC 6530 and group its members into 5 substructures. By linear tracing with the kinematic information of their members, we find that: Sub 1 is the core of the cluster. It is expanding slowly. Su… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 17 pages, 11 figures, accepted for publication in ApJ

  35. arXiv:2406.07472  [pdf, other]

    cs.CV

    4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models

    Authors: Heng Yu, Chaoyang Wang, Peiye Zhuang, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Laszlo A Jeni, Sergey Tulyakov, Hsin-Ying Lee

    Abstract: Existing dynamic scene generation methods mostly rely on distilling knowledge from pre-trained 3D generative models, which are typically fine-tuned on synthetic object datasets. As a result, the generated scenes are often object-centric and lack photorealism. To address these limitations, we introduce a novel pipeline designed for photorealistic text-to-4D scene generation, discarding the dependen… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  36. arXiv:2406.07103  [pdf, other]

    eess.AS cs.AI

    MR-RawNet: Speaker verification system with multiple temporal resolutions for variable duration utterances using raw waveforms

    Authors: Seung-bin Kim, Chan-yeong Lim, Jungwoo Heo, Ju-ho Kim, Hyun-seo Shin, Kyo-Won Koo, Ha-Jin Yu

    Abstract: In speaker verification systems, the utilization of short utterances presents a persistent challenge, leading to performance degradation primarily due to insufficient phonetic information to characterize the speakers. To overcome this obstacle, we propose a novel structure, MR-RawNet, designed to enhance the robustness of speaker verification systems against variable duration utterances using raw… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, accepted by Interspeech 2024

  37. arXiv:2406.07056  [pdf, other]

    cs.CL

    Effectively Compress KV Heads for LLM

    Authors: Hao Yu, Zelan Yang, Shen Li, Yong Li, Jianxin Wu

    Abstract: The advent of pre-trained large language models (LLMs) has revolutionized various natural language processing tasks. These models predominantly employ an auto-regressive decoding mechanism that utilizes Key-Value (KV) caches to eliminate redundant calculations for previous tokens. Nevertheless, as context lengths and batch sizes increase, the linear expansion in memory footprint of KV caches becom… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  38. arXiv:2406.06080  [pdf, other]

    astro-ph.CO

    Probing vector chirality in the early Universe

    Authors: Junsup Shim, Ue-Li Pen, Hao-Ran Yu, Teppei Okumura

    Abstract: We explore the potential of detecting parity violation in primordial vector fossils using late-time galaxy spins. Utilizing $N$-body simulations, we use halo spins as a reliable proxy for galaxy spins to investigate how effectively such primordial vectorial parity asymmetry remains in galaxy spins at low redshifts. We develop a novel approach to generate initial conditions with substantial parity… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures. Submitted to PRL

  39. arXiv:2406.06056  [pdf, other]

    cs.CL

    Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of Health for Clinical Text

    Authors: Avijit Mitra, Emily Druhl, Raelene Goodwin, Hong Yu

    Abstract: Social and behavioral determinants of health (SBDH) play a crucial role in health outcomes and are frequently documented in clinical text. Automatically extracting SBDH information from clinical text relies on publicly available good-quality datasets. However, existing SBDH datasets exhibit substantial limitations in their availability and coverage. In this study, we introduce Synth-SBDH, a novel… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Github: https://github.com/avipartho/Synth-SBDH

  40. arXiv:2406.06045  [pdf, other]

    cs.CV cs.AI

    Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training

    Authors: Ke Niu, Haiyang Yu, Xuelin Qian, Teng Fu, Bin Li, Xiangyang Xue

    Abstract: Existing person re-identification (Re-ID) methods principally deploy the ImageNet-1K dataset for model initialization, which inevitably results in sub-optimal situations due to the large domain gap. One of the key challenges is that building large-scale person Re-ID datasets is time-consuming. Some previous efforts address this problem by collecting person images from the internet e.g., LUPerson,… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  41. arXiv:2406.06028  [pdf, other]

    cs.CV

    ReCon1M:A Large-scale Benchmark Dataset for Relation Comprehension in Remote Sensing Imagery

    Authors: Xian Sun, Qiwei Yan, Chubo Deng, Chenglong Liu, Yi Jiang, Zhongyan Hou, Wanxuan Lu, Fanglong Yao, Xiaoyu Liu, Lingxiang Hao, Hongfeng Yu

    Abstract: Scene Graph Generation (SGG) is a high-level visual understanding and reasoning task aimed at extracting entities (such as objects) and their interrelationships from images. Significant progress has been made in the study of SGG in natural images in recent years, but its exploration in the domain of remote sensing images remains very limited. The complex characteristics of remote sensing images ne…

    Submitted 10 June, 2024; originally announced June 2024.

  42. arXiv:2406.05644  [pdf, other]

    cs.CL cs.AI cs.CR cs.CY

    How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States

    Authors: Zhenhong Zhou, Haiyang Yu, Xinghua Zhang, Rongwu Xu, Fei Huang, Yongbin Li

    Abstract: Large language models (LLMs) rely on safety alignment to avoid responding to malicious user inputs. Unfortunately, jailbreak can circumvent safety guardrails, resulting in LLMs generating harmful content and raising concerns about LLM safety. Due to language models with intensive parameters often regarded as black boxes, the mechanisms of alignment and jailbreak are challenging to elucidate. In th…

    Submitted 13 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 27 pages

  43. arXiv:2406.05391  [pdf, other]

    cs.LG

    DUPLEX: Dual GAT for Complex Embedding of Directed Graphs

    Authors: Zhaoru Ke, Hang Yu, Jianguo Li, Haipeng Zhang

    Abstract: Current directed graph embedding methods build upon undirected techniques but often inadequately capture directed edge information, leading to challenges such as: (1) Suboptimal representations for nodes with low in/out-degrees, due to the insufficient neighbor interactions; (2) Limited inductive ability for representing new nodes post-training; (3) Narrow generalizability, as training is overly c…

    Submitted 8 June, 2024; originally announced June 2024.

  44. arXiv:2406.03848  [pdf, other]

    physics.ao-ph cs.AI cs.LG

    OceanCastNet: A Deep Learning Ocean Wave Model with Energy Conservation

    Authors: Ziliang Zhang, Huaming Yu, Danqin Ren

    Abstract: Traditional wave forecasting models, although based on energy conservation equations, are computationally expensive. On the other hand, existing deep learning geophysical fluid models, while computationally efficient, often suffer from issues such as energy dissipation in long-term forecasts. This paper proposes a novel energy-balanced deep learning wave forecasting model called OceanCastNet (OCN)…

    Submitted 9 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  45. arXiv:2406.02956  [pdf, other]

    astro-ph.CO

    Alleviating Hubble constant tension and growth tension via a transition of absolute magnitude favored by Pantheon+ sample

    Authors: Yang Liu, Hongwei Yu, Puxun Wu

    Abstract: We establish a cosmological-model-independent method to extract the apparent magnitude and its derivative at different redshifts from the Pantheon+ type Ia supernova sample, and find that the obtained values deviate clearly from the prediction of the $Λ$CDM model at the lowest redshift. This deviation can be explained as a result of a transition of the absolute magnitude $M$ in the low redshift re…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 20 pages, 5 figures, 4 tables. Accepted for publication in the Physical Review D (Letter)

  46. arXiv:2406.02948  [pdf, other]

    stat.ME stat.AP

    Copula-based semiparametric nonnormal transformed linear model for survival data with dependent censoring

    Authors: Huazhen Yu, Lixin Zhang

    Abstract: Although the independent censoring assumption is commonly used in survival analysis, it can be violated when the censoring time is related to the survival time, which often happens in many practical applications. To address this issue, we propose a flexible semiparametric method for dependent censored data. Our approach involves fitting the survival time and the censoring time with a joint transfo…

    Submitted 5 June, 2024; originally announced June 2024.

  47. arXiv:2406.01602  [pdf, other]

    physics.data-an hep-ex nucl-ex

    Effectiveness of denoising diffusion probabilistic models for fast and high-fidelity whole-event simulation in high-energy heavy-ion experiments

    Authors: Yeonju Go, Dmitrii Torbunov, Timothy Rinn, Yi Huang, Haiwang Yu, Brett Viren, Meifeng Lin, Yihui Ren, Jin Huang

    Abstract: Artificial intelligence (AI) generative models, such as generative adversarial networks (GANs), variational auto-encoders, and normalizing flows, have been widely used and studied as efficient alternatives for traditional scientific simulations. However, they have several drawbacks, including training instability and inability to cover the entire data distribution, especially for regions where dat…

    Submitted 23 May, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures

  48. arXiv:2406.01304  [pdf, other]

    cs.CL cs.AI cs.SE

    CodeR: Issue Resolving with Multi-Agent and Task Graphs

    Authors: Dong Chen, Shaoxin Lin, Muhan Zeng, Daoguang Zan, Jian-Gang Wang, Anton Cheshkov, Jun Sun, Hao Yu, Guoliang Dong, Artem Aliev, Jie Wang, Xiao Cheng, Guangtai Liang, Yuchi Ma, Pan Bian, Tao Xie, Qianxiang Wang

    Abstract: GitHub issue resolving recently has attracted significant attention from academia and industry. SWE-bench is proposed to measure the performance in resolving issues. In this paper, we propose CodeR, which adopts a multi-agent framework and pre-defined task graphs to Repair & Resolve reported bugs and add new features within code Repository. On SWE-bench lite, CodeR is able to solve 28.33% of issue…

    Submitted 10 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: https://github.com/NL2Code/CodeR

  49. arXiv:2406.01235  [pdf, other]

    eess.IV

    Boosting Spatial-Spectral Masked Auto-Encoder Through Mining Redundant Spectra for HSI-SAR/LiDAR Classification

    Authors: Junyan Lin, Xuepeng Jin, Feng Gao, Junyu Dong, Hui Yu

    Abstract: Although recent masked image modeling (MIM)-based HSI-LiDAR/SAR classification methods have gradually recognized the importance of the spectral information, they have not adequately addressed the redundancy among different spectra, resulting in information leakage during the pretraining stage. This issue directly impairs the representation ability of the model. To tackle the problem, we propose a…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by IGARSS 2024

  50. arXiv:2406.01007  [pdf, other]

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive…

    Submitted 3 June, 2024; originally announced June 2024.