Skip to main content

Showing 1–50 of 936 results for author: Lü, S

.
  1. arXiv:2407.02973  [pdf, other

    astro-ph.GA

    NOEMA formIng Cluster survEy (NICE): Characterizing eight massive galaxy groups at $1.5 < z < 4$ in the COSMOS field

    Authors: Nikolaj B. Sillassen, Shuowen **, Georgios E. Magdis, Emanuele Daddi, Tao Wang, Shiying Lu, Hanwen Sun, Vinod Arumugam, Daizhong Liu, Malte Brinch, Chiara D'Eugenio, Raphael Gobat, Carlos Gómez-Guijarro, Michael Rich, Eva Schinnerer, Veronica Strazzullo, Qinghua Tan, Francesco Valentino, Yijun Wang, Mengyuan Xiao, Luwenjia Zhou, David Blánquez-Sesé, Zheng Cai, Yanmei Chen, Laure Ciesla , et al. (19 additional authors not shown)

    Abstract: The NOEMA formIng Cluster survEy (NICE) is a large program targeting 69 massive galaxy group candidates at $z>2$ in six deep fields. We report spectroscopic confirmation of eight groups at $1.65\leq z\leq3.61$ in COSMOS. Homogeneously selected as significant overdensities of red IRAC sources with red Herschel colors, four groups are confirmed by CO and [CI] with NOEMA 3mm observations, three are c… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 44 pages (27pp appendix), 32 figures, 18 tables, accepted for publication in A&A

  2. arXiv:2407.00588  [pdf, other

    math.AP math.NA

    Forward and backward problems for coupled subdiffusion systems

    Authors: Dian Feng, Yikan Liu, Shuai Lu

    Abstract: In this article, we investigate both forward and backward problems for coupled systems of time-fractional diffusion equations, encompassing scenarios of strong coupling. For the forward problem, we establish the well-posedness of the system, leveraging the eigensystem of the corresponding elliptic system as the foundation. When considering the backward problem, specifically the determination of in… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 26 pages, 7 figures

    MSC Class: 35R11; 35K58; 35B44

  3. arXiv:2407.00178  [pdf, other

    physics.ins-det

    Shower Separation in Five Dimensions for Highly Granular Calorimeters using Machine Learning

    Authors: S. Lai, J. Utehs, A. Wilhahn, M. C. Fouz, O. Bach, E. Brianne, A. Ebrahimi, K. Gadow, P. Göttlicher, O. Hartbrich, D. Heuchel, A. Irles, K. Krüger, J. Kvasnicka, S. Lu, C. Neubüser, A. Provenza, M. Reinecke, F. Sefkow, S. Schuwalow, M. De Silva, Y. Sudo, H. L. Tran, L. Liu, R. Masuda , et al. (26 additional authors not shown)

    Abstract: To achieve state-of-the-art jet energy resolution for Particle Flow, sophisticated energy clustering algorithms must be developed that can fully exploit available information to separate energy deposits from charged and neutral particles. Three published neural network-based shower separation models were applied to simulation and experimental data to measure the performance of the highly granular… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2406.16005  [pdf, other

    cs.DC

    A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications

    Authors: Lei Chen, Shi Liu, Chenxi Wang, Haoran Ma, Yifan Qiao, Zhe Wang, Chenggang Wu, Youyou Lu, Xiaobing Feng, Huimin Cui, Shan Lu, Harry Xu

    Abstract: With rapid advances in network hardware, far memory has gained a great deal of traction due to its ability to break the memory capacity wall. Existing far memory systems fall into one of two data paths: one that uses the kernel's paging system to transparently access far memory at the page granularity, and a second that bypasses the kernel, fetching data at the object granularity. While it is gene… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  5. arXiv:2406.15484  [pdf, other

    cs.CL cs.AI cs.CY

    JobFair: A Framework for Benchmarking Gender Hiring Bias in Large Language Models

    Authors: Ze Wang, Zekun Wu, Xin Guan, Michael Thaler, Adriano Koshiyama, Skylar Lu, Sachin Beepath, Ediz Ertekin Jr., Maria Perez-Ortiz

    Abstract: This paper presents a novel framework for benchmarking hierarchical gender hiring bias in Large Language Models (LLMs) for resume scoring, revealing significant issues of reverse bias and overdebiasing. Our contributions are fourfold: First, we introduce a framework using a real, anonymized resume dataset from the Healthcare, Finance, and Construction industries, meticulously used to avoid confoun… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Submitted to EMNLP 2024

  6. arXiv:2406.14523  [pdf, other

    cond-mat.str-el cond-mat.supr-con

    Optical and Raman selection rules for odd-parity clean superconductors

    Authors: Shuangyuan Lu, Xu Yang, Yuan-Ming Lu

    Abstract: We derive selection rules in optical absorption and Raman scattering spectra, that can determine the parity of pairing order parameters under inversion symmetry in two classes of \emph{clean} superconductors: (i) chiral superconductors with strong spin-orbit couplings, (ii) singlet superconductors with negligible spin-orbit couplings. Experimentally, the inversion parity of pair wave functions can… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 16 pages, 12 figures

    Journal ref: Phys. Rev. B 109, 245119 (2024)

  7. arXiv:2406.12718  [pdf, other

    cs.CV cs.AI cs.CL

    AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

    Authors: Wenbin An, Feng Tian, Sicong Leng, Jiahao Nie, Haonan Lin, QianYing Wang, Guang Dai, ** Chen, Shijian Lu

    Abstract: Despite their great success across various multimodal tasks, Large Vision-Language Models (LVLMs) are facing a prevalent problem with object hallucinations, where the generated textual responses are inconsistent with ground-truth objects in the given image. This paper investigates various LVLMs and pinpoints attention deficiency toward discriminative local image features as one root cause of objec… ▽ More

    Submitted 21 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  8. arXiv:2406.12386  [pdf, other

    cs.CL

    IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language Models

    Authors: Qiyao Wang, Jianguo Huang, Shule Lu, Yuan Lin, Kan Xu, Liang Yang, Hongfei Lin

    Abstract: The rapid development of Large Language Models (LLMs) in vertical domains, including intellectual property (IP), lacks a specific evaluation benchmark for assessing their understanding, application, and reasoning abilities. To fill this gap, we introduce IPEval, the first evaluation benchmark tailored for IP agency and consulting tasks. IPEval comprises 2657 multiple-choice questions across four m… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  9. arXiv:2406.11937  [pdf, other

    physics.ins-det hep-ex physics.data-an

    Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter

    Authors: M. Aamir, B. Acar, G. Adamov, T. Adams, C. Adloff, S. Afanasiev, C. Agrawal, C. Agrawal, A. Ahmad, H. A. Ahmed, S. Akbar, N. Akchurin, B. Akgul, B. Akgun, R. O. Akpinar, E. Aktas, A. AlKadhim, V. Alexakhin, J. Alimena, J. Alison, A. Alpana, W. Alshehri, P. Alvarez Dominguez, M. Alyari, C. Amendola , et al. (550 additional authors not shown)

    Abstract: A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr… ▽ More

    Submitted 30 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Prepared for submission to JINST

  10. arXiv:2406.11571  [pdf, other

    astro-ph.GA

    PRIMER: JWST/MIRI reveals the evolution of star-forming structures in galaxies at z<2.5

    Authors: Yipeng Lyu, Benjamin Magnelli, David Elbaz, Pablo G. Pérez-González, Camila Correa, Emanuele Daddi, Carlos Gómez-Guijarro, James S. Dunlop, Norman A. Grogin, Anton M. Koekemoer, Derek J. McLeod, Shiying Lu

    Abstract: The stellar structures of star-forming galaxies (SFGs) undergo significant size growth during their mass assembly and must pass through a compaction phase as they evolve into quiescent galaxies (QGs). To shed light on the mechanisms behind this structural evolution, we study the morphology of the star-forming components of 665 SFGs at 0<z<2.5 measured using JWST/MIRI observation and compare them w… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 24 pages, 17 figures, submitted to A&A, comments are welcome

  11. arXiv:2406.10724  [pdf, other

    eess.IV cs.CV cs.LG

    Beyond the Visible: Jointly Attending to Spectral and Spatial Dimensions with HSI-Diffusion for the FINCH Spacecraft

    Authors: Ian Vyse, Rishit Dagli, Dav Vrat Chadha, John P. Ma, Hector Chen, Isha Ruparelia, Prithvi Seran, Matthew Xie, Eesa Aamer, Aidan Armstrong, Naveen Black, Ben Borstein, Kevin Caldwell, Orrin Dahanaggamaarachchi, Joe Dai, Abeer Fatima, Stephanie Lu, Maxime Michet, Anoushka Paul, Carrie Ann Po, Shivesh Prakash, Noa Prosser, Riddhiman Roy, Mirai Shinjo, Iliya Shofman , et al. (4 additional authors not shown)

    Abstract: Satellite remote sensing missions have gained popularity over the past fifteen years due to their ability to cover large swaths of land at regular intervals, making them ideal for monitoring environmental trends. The FINCH mission, a 3U+ CubeSat equipped with a hyperspectral camera, aims to monitor crop residue cover in agricultural fields. Although hyperspectral imaging captures both spectral and… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: To appear in 38th Annual Small Satellite Conference

  12. arXiv:2406.10511  [pdf, other

    cs.DC cs.AR cs.PF math.NA

    Efficient Hardware Accelerator Based on Medium Granularity Dataflow for SpTRSV

    Authors: Qian Chen, Xiaofeng Yang, Shengli Lu

    Abstract: Sparse triangular solve (SpTRSV) is widely used in various domains. Numerous studies have been conducted using CPUs, GPUs, and specific hardware accelerators, where dataflow can be categorized into coarse and fine granularity. Coarse dataflow offers good spatial locality but suffers from low parallelism, while fine dataflow provides high parallelism but disrupts the spatial structure, leading to i… ▽ More

    Submitted 27 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  13. arXiv:2406.10416  [pdf, other

    cs.CR cs.DC cs.LG

    Byzantine-Robust Decentralized Federated Learning

    Authors: Minghong Fang, Zifan Zhang, Hairi, Prashant Khanduri, Jia Liu, Songtao Lu, Yuchen Liu, Neil Gong

    Abstract: Federated learning (FL) enables multiple clients to collaboratively train machine learning models without revealing their private training data. In conventional FL, the system follows the server-assisted architecture (server-assisted FL), where the training process is coordinated by a central server. However, the server-assisted FL framework suffers from poor scalability due to a communication bot… ▽ More

    Submitted 20 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: To appear in ACM Conference on Computer and Communications Security 2024 (CCS '24)

  14. arXiv:2406.09121  [pdf, other

    cs.CV

    MMRel: A Relation Understanding Dataset and Benchmark in the MLLM Era

    Authors: Jiahao Nie, Gongjie Zhang, Wenbin An, Yap-Peng Tan, Alex C. Kot, Shijian Lu

    Abstract: Despite the recent advancements in Multi-modal Large Language Models (MLLMs), understanding inter-object relations, i.e., interactions or associations between distinct objects, remains a major challenge for such models. This issue significantly hinders their advanced reasoning capabilities and is primarily due to the lack of large-scale, high-quality, and diverse multi-modal data essential for tra… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  15. arXiv:2406.04252  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Sub-nanometer depth resolution and single dopant visualization achieved by tilt-coupled multislice electron ptychography

    Authors: Zehao Dong, Yang Zhang, Chun-Chien Chiu, Sicheng Lu, Jianbing Zhang, Yu-Chen Liu, Suya Liu, Jan-Chi Yang, Pu Yu, Yayu Wang, Zhen Chen

    Abstract: Real-space imaging of three-dimensional atomic structures is a critical yet challenging task in materials science. Although scanning transmission electron microscopy has achieved sub-angstrom lateral resolution through techniques like electron ptychography1,2, depth resolution remains limited to only 2 to 3 nanometers with a single projection setup3,4. Attaining better depth resolution typically n… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 27 pages, 5 figures, 10 supplementary figures

  16. arXiv:2406.03496  [pdf, other

    cs.CL cs.AI cs.LG

    Wings: Learning Multimodal LLMs without Text-only Forgetting

    Authors: Yi-Kai Zhang, Shiyin Lu, Yang Li, Yanqing Ma, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, De-Chuan Zhan, Han-Jia Ye

    Abstract: Multimodal large language models (MLLMs), initiated with a trained LLM, first align images with text and then fine-tune on multimodal mixed inputs. However, the MLLM catastrophically forgets the text-only instructions, which do not include images and can be addressed within the initial LLM. In this paper, we present Wings, a novel MLLM that excels in both text-only dialogues and multimodal compreh… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  17. arXiv:2406.02672  [pdf, other

    astro-ph.GA astro-ph.CO

    A comparison of pre-existing $Λ$CDM predictions with the abundance of JWST galaxies at high redshift

    Authors: Shengdong Lu, Carlos S. Frenk, Sownak Bose, Cedric G. Lacey, Shaun Cole, Carlton M. Baugh, John C. Helly

    Abstract: Observations with the James Webb Space Telescope have revealed a high abundance of bright galaxies at redshift, $z\gtrsim 12$, which has been widely interpreted as conflicting with the $Λ$CDM model. In Cowley et al. (2018) predictions were made - prior to the JWST observations - for the expected abundance of these galaxies using the Durham semi-analytic galaxy formation model, GALFORM, which is kn… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages, 8 figures, submitted to MNRAS on 4 June, 2024

  18. arXiv:2406.02539  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Parrot: Multilingual Visual Instruction Tuning

    Authors: Hai-Long Sun, Da-Wei Zhou, Yang Li, Shiyin Lu, Chao Yi, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, De-Chuan Zhan, Han-Jia Ye

    Abstract: The rapid development of Multimodal Large Language Models (MLLMs) like GPT-4V has marked a significant step towards artificial general intelligence. Existing methods mainly focus on aligning vision encoders with LLMs through supervised fine-tuning (SFT) to endow LLMs with multimodal abilities, making MLLMs' inherent ability to react to multiple languages progressively deteriorate as the training p… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  19. arXiv:2406.02260  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Near-Room-Temperature Field-Controllable Exchange Bias in 2D van der Waals Ferromagnet Fe3GaTe2

    Authors: Jifeng Shao, Xiaolong Yin, Chunhao Bao, Sirong Lu, Xiaoming Ma, Shu Guo, Le Wang, Xi Zhang, Zhiyue Li, Longxiang Li, Yue Zhao, Tingyong Chen

    Abstract: Exchange bias (EB) is a cornerstone of modern magnetic memory and sensing technologies. Its extension to the realm of two-dimensional (2D) van der Waals (vdW) magnets holds promise for revolutionary advancements in miniaturized and efficient atomic spintronic devices. However, the blocking temperature of EB in 2D vdW magnets is currently well below room temperature ~130 K. This study reports a rob… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages, 5 figures

  20. arXiv:2406.00734  [pdf, other

    cs.LG

    GLADformer: A Mixed Perspective for Graph-level Anomaly Detection

    Authors: Fan Xu, Nan Wang, Hao Wu, Xuezhi Wen, Dalin Zhang, Siyang Lu, Binyong Li, Wei Gong, Hai Wan, Xibin Zhao

    Abstract: Graph-Level Anomaly Detection (GLAD) aims to distinguish anomalous graphs within a graph dataset. However, current methods are constrained by their receptive fields, struggling to learn global features within the graphs. Moreover, most contemporary methods are based on spatial domain and lack exploration of spectral characteristics. In this paper, we propose a multi-perspective hybrid graph-level… ▽ More

    Submitted 3 July, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  21. arXiv:2405.20797  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Ovis: Structural Embedding Alignment for Multimodal Large Language Model

    Authors: Shiyin Lu, Yang Li, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, Han-Jia Ye

    Abstract: Current Multimodal Large Language Models (MLLMs) typically integrate a pre-trained LLM with another pre-trained vision transformer through a connector, such as an MLP, endowing the LLM with visual capabilities. However, the misalignment between two embedding strategies in MLLMs -- the structural textual embeddings based on an embedding look-up table and the continuous embeddings generated directly… ▽ More

    Submitted 17 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  22. arXiv:2405.20598  [pdf

    cond-mat.str-el cond-mat.mes-hall

    Mott insulating phase and coherent-incoherent crossover across magnetic phase transition in 2D antiferromagnetic CrSBr

    Authors: Fan Wu, Xuefeng Zhang, Yi Chen, Ding Pei, Mengwen Zhan, Zicheng Tao, Cheng Chen, Shipeng Lu, **gzhi Chen, Shujie Tang, Xia Wang, Yanfeng Guo, Lexian Yang, Yan Zhang, Yulin Chen, Qixi Mi, Gang Li, Zhongkai Liu

    Abstract: In two-dimensional van der Waals magnetic materials, the interplay between magnetism and electron correlation can give rise to new ground states and lead to novel transport and optical properties. A fundamental question in these materials is how the electron correlation manifests and interacts with the magnetic orders. In this study, we demonstrate that the recently discovered 2D antiferromagnetic… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  23. arXiv:2405.20340  [pdf, other

    cs.CV

    MotionLLM: Understanding Human Behaviors from Human Motions and Videos

    Authors: Ling-Hao Chen, Shunlin Lu, Ailing Zeng, Hao Zhang, Benyou Wang, Ruimao Zhang, Lei Zhang

    Abstract: This study delves into the realm of multi-modality (i.e., video and motion modalities) human behavior understanding by leveraging the powerful capabilities of Large Language Models (LLMs). Diverging from recent LLMs designed for video-only or motion-only understanding, we argue that understanding human behavior necessitates joint modeling from both videos and motion sequences (e.g., SMPL sequences… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: MotionLLM version 1.0, project page see https://lhchen.top/MotionLLM

  24. arXiv:2405.19767  [pdf

    physics.geo-ph

    MAE-GAN: A Novel Strategy for Simultaneous Super-resolution Reconstruction and Denoising of Post-stack Seismic Profile

    Authors: Wenshuo Yu, Shiqi Dong, Shao** Lu, Xintong Dong

    Abstract: Post-stack seismic profiles are images reflecting containing geological structures which provides a critical foundation for understanding the distribution of oil and gas resources. However, due to the limitations of seismic acquisition equipment and data collecting geometry, the post-stack profiles suffer from low resolution and strong noise issues, which severely affects subsequent seismic interp… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  25. arXiv:2405.19487  [pdf, other

    cs.CL

    A Full-duplex Speech Dialogue Scheme Based On Large Language Models

    Authors: Peng Wang, Songshuo Lu, Yaohua Tang, Sijie Yan, Yuanjun Xiong, Wei Xia

    Abstract: We present a generative dialogue system capable of operating in a full-duplex manner, allowing for seamless interaction. It is based on a large language model (LLM) carefully aligned to be aware of a perception module, a motor function module, and the concept of a simple finite state machine (called neural FSM) with two states. The perception and motor function modules operate simultaneously, allo… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  26. arXiv:2405.18891  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Inverse Design of Promising Alloys for Electrocatalytic CO$_2$ Reduction via Generative Graph Neural Networks Combined with Bird Swarm Algorithm

    Authors: Zhilong Song, Linfeng Fan, Shuaihua Lu, Qionghua Zhou, Chongyi Ling, **lan Wang

    Abstract: Directly generating material structures with optimal properties is a long-standing goal in material design. One of the fundamental challenges lies in how to overcome the limitation of traditional generative models to efficiently explore the global chemical space rather than a small localized space. Herein, we develop a framework named MAGECS to address this dilemma, by integrating the bird swarm a… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  27. arXiv:2405.18858  [pdf, other

    math.OC

    Distributed Bilevel Optimization with Communication Compression

    Authors: Yutong He, Jie Hu, Xinmeng Huang, Songtao Lu, Bin Wang, Kun Yuan

    Abstract: Stochastic bilevel optimization tackles challenges involving nested optimization structures. Its fast-growing scale nowadays necessitates efficient distributed algorithms. In conventional distributed bilevel methods, each worker must transmit full-dimensional stochastic gradients to the server every iteration, leading to significant communication overhead and thus hindering efficiency and scalabil… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  28. arXiv:2405.17792  [pdf, other

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  29. arXiv:2405.16444  [pdf, other

    cs.LG

    CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion

    Authors: Jiayi Yao, Hanchen Li, Yuhan Liu, Siddhant Ray, Yihua Cheng, Qizheng Zhang, Kuntai Du, Shan Lu, Junchen Jiang

    Abstract: Large language models (LLMs) often incorporate multiple text chunks in their inputs to provide the necessary contexts. To speed up the prefill of the long LLM inputs, one can pre-compute the KV cache of a text and re-use the KV cache when the context is reused as the prefix of another LLM input. However, the reused text chunks are not always the input prefix, and when they are not, their precomput… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  30. arXiv:2405.15920  [pdf, other

    cs.LG stat.ML

    SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

    Authors: Shuai Zhang, Heshan Devaka Fernando, Miao Liu, Keerthiram Murugesan, Songtao Lu, Pin-Yu Chen, Tianyi Chen, Meng Wang

    Abstract: This paper studies the transfer reinforcement learning (RL) problem where multiple RL problems have different reward functions but share the same underlying transition dynamics. In this setting, the Q-function of each RL problem (task) can be decomposed into a successor feature (SF) and a reward map**: the former characterizes the transition dynamics, and the latter characterizes the task-specif… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.16173

  31. arXiv:2405.14325  [pdf, other

    cs.CV

    Dinomaly: The Less Is More Philosophy in Multi-Class Unsupervised Anomaly Detection

    Authors: Jia Guo, Shuai Lu, Weihang Zhang, Huiqi Li

    Abstract: Recent studies highlighted a practical setting of unsupervised anomaly detection (UAD) that builds a unified model for multi-class images, serving as an alternative to the conventional one-class-one-model setup. Despite various advancements addressing this challenging task, the detection performance under the multi-class setting still lags far behind state-of-the-art class-separated models. Our re… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  32. arXiv:2405.11205  [pdf, other

    cs.CV

    Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for Referring Image Segmentation

    Authors: Yichen Yan, Xingjian He, Sihan Chen, Shichen Lu, **g Liu

    Abstract: Referring Image Segmentation (RIS) aims to segment an object described in natural language from an image, with the main challenge being a text-to-pixel correlation. Previous methods typically rely on single-modality features, such as vision or language features, to guide the multi-modal fusion process. However, this approach limits the interaction between vision and language, leading to a lack of… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures ICIC2024

  33. arXiv:2405.08847  [pdf

    physics.optics

    Double symmetry and phase-controlled continuous transformation between skyrmion and meron topology

    Authors: Sen Lu, Xiong Xiong, Xuefei Zi, Zhe Shen

    Abstract: Topological quasiparticles, including skyrmions and merons, are topological textures with sophisticated vectorial structures that can be used for optical information storage, precision metrology, position sensing, etc. Here, we build a simple model to generate the isolated Néel-type field-skyrmion and derive the analytical solution of it. By employing a series of well-designed double-symmetry aper… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  34. arXiv:2405.07696  [pdf, other

    cs.CV

    MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

    Authors: Xueying Jiang, Sheng **, Xiaoqin Zhang, Ling Shao, Shijian Lu

    Abstract: Monocular 3D object detection aims for precise 3D localization and identification of objects from a single-view image. Despite its recent progress, it often struggles while handling pervasive object occlusions that tend to complicate and degrade the prediction of object dimensions, depths, and orientations. We design MonoMAE, a monocular 3D detector inspired by Masked Autoencoders that addresses t… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  35. arXiv:2405.07468  [pdf

    cs.CL cs.AI

    Evaluating large language models in medical applications: a survey

    Authors: Xiaolan Chen, Jiayang Xiang, Shanfu Lu, Yexin Liu, Mingguang He, Danli Shi

    Abstract: Large language models (LLMs) have emerged as powerful tools with transformative potential across numerous domains, including healthcare and medicine. In the medical domain, LLMs hold promise for tasks ranging from clinical decision support to patient education. However, evaluating the performance of LLMs in medical contexts presents unique challenges due to the complex and critical nature of medic… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 4 figures, 1 table

  36. arXiv:2405.06938  [pdf, ps, other

    math.PR math.DS

    Stochastic functional partial differential equations with monotone coefficients: Poisson stability measures, exponential mixing and limit theorems

    Authors: Shuaishuai Lu, Xue Yang, Yong Li

    Abstract: This paper examines Poisson stable (including stationary, periodic, almost periodic, Levitan almost periodic, Bohr almost automorphic, pseudo-periodic, Birkhoff recurrent, pseudo-recurrent, etc.) measures and limit theorems for stochastic functional partial differential equations(SFPDEs) with monotone coefficients. We first show the existence and uniqueness of entrance measure $μ_{t}$ for SFPDEs b… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  37. arXiv:2405.06563  [pdf, other

    cs.CL

    What Can Natural Language Processing Do for Peer Review?

    Authors: Ilia Kuznetsov, Osama Mohammed Afzal, Koen Dercksen, Nils Dycke, Alexander Goldberg, Tom Hope, Dirk Hovy, Jonathan K. Kummerfeld, Anne Lauscher, Kevin Leyton-Brown, Sheng Lu, Mausam, Margot Mieskes, Aurélie Névéol, Danish Pruthi, Lizhen Qu, Roy Schwartz, Noah A. Smith, Thamar Solorio, **gyan Wang, Xiaodan Zhu, Anna Rogers, Nihar B. Shah, Iryna Gurevych

    Abstract: The number of scientific articles produced every year is growing rapidly. Providing quality control over them is crucial for scientists and, ultimately, for the public good. In modern science, this process is largely delegated to peer review -- a distributed procedure in which each submission is evaluated by several independent experts in the field. Peer review is widely used, yet it is hard, time… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  38. arXiv:2405.06223  [pdf, ps, other

    math.PR

    McKean-Vlasov SPDEs with Hölder continuous coefficients: existence, uniqueness, ergodicity, exponential mixing and limit theorems

    Authors: Shuaishuai Lu, Xue Yang, Yong Li

    Abstract: This paper investigates the existence and uniqueness of solutions, as well as the ergodicity and exponential mixing to invariant measures, and limit theorems for a class of McKean-Vlasov SPDEs characterized by Hlder continuity. We rigorously establish the existence and uniqueness of strong solutions for a specific class of finite-dimensional systems with Hölder continuous coefficients. Extending t… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  39. arXiv:2405.05367  [pdf, ps, other

    hep-th gr-qc

    A Space/Time Interchange Symmetry of Rotating AdS Black Holes in General Dimensions

    Authors: Si-Yue Lu, Peng Zhao, H. Lu

    Abstract: We revisit the previously known local inversion symmetry of the five-dimensional Kerr-AdS metric that relates the over-rotating black hole to the under-rotating one and reinterpret it as an interchanging symmetry between time and the longitudinal angular coordinates. We generalize this to all $D$ dimensions, including $D=4$, thereby enlarging the trivial linear $\mathbb Z_N$ symmetry of the… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: LaTex, 9 pages

  40. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  41. arXiv:2405.01762  [pdf, ps, other

    cs.LG

    EiG-Search: Generating Edge-Induced Subgraphs for GNN Explanation in Linear Time

    Authors: Shengyao Lu, Bang Liu, Keith G. Mills, Jiao He, Di Niu

    Abstract: Understanding and explaining the predictions of Graph Neural Networks (GNNs), is crucial for enhancing their safety and trustworthiness. Subgraph-level explanations are gaining attention for their intuitive appeal. However, most existing subgraph-level explainers face efficiency challenges in explaining GNNs due to complex search processes. The key challenge is to find a balance between intuitiven… ▽ More

    Submitted 16 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 19 pages

    Journal ref: ICML 2024

  42. arXiv:2405.01460  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders

    Authors: Yi Yu, Yufei Wang, Song Xia, Wenhan Yang, Shijian Lu, Yap-Peng Tan, Alex C. Kot

    Abstract: Unlearnable examples (UEs) seek to maximize testing error by making subtle modifications to training examples that are correctly labeled. Defenses against these poisoning attacks can be categorized based on whether specific interventions are adopted during training. The first approach is training-time defense, such as adversarial training, which can mitigate poisoning effects but is computationall… ▽ More

    Submitted 6 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  43. arXiv:2404.17719  [pdf, other

    cs.NE

    Stochastic Spiking Neural Networks with First-to-Spike Coding

    Authors: Yi Jiang, Sen Lu, Abhronil Sengupta

    Abstract: Spiking Neural Networks (SNNs), recognized as the third generation of neural networks, are known for their bio-plausibility and energy efficiency, especially when implemented on neuromorphic hardware. However, the majority of existing studies on SNNs have concentrated on deterministic neurons with rate coding, a method that incurs substantial computational overhead due to lengthy information integ… ▽ More

    Submitted 1 July, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  44. arXiv:2404.17437  [pdf

    physics.geo-ph

    Transformer For Low-frequency Extrapolating of Seismic Data

    Authors: Zheng Cong, Xintong Dong, Shao** Lu, Shiqi Dong, Xunqian Tong

    Abstract: Full waveform inversion (FWI) is used to reconstruct the physical properties of subsurface media which plays an important role in seismic exploration. However, the precision of FWI is seriously affected by the absence or inaccuracy of low-frequency information. Therefore, reconstructing the low-frequency signals accurately is highly significant in seismic data processing. Low-frequency extrapolati… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  45. arXiv:2404.15081  [pdf, other

    cs.CV cs.CR cs.LG

    Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models

    Authors: **gyao Xu, Yuetong Lu, Yandong Li, Siyang Lu, Dongdong Wang, Xiang Wei

    Abstract: Diffusion models (DMs) embark a new era of generative modeling and offer more opportunities for efficient generating high-quality and realistic data samples. However, their widespread use has also brought forth new challenges in model security, which motivates the creation of more effective adversarial attackers on DMs to understand its vulnerability. We propose CAAT, a simple but generic and effi… ▽ More

    Submitted 14 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Published at CVPR 2024, code:https://github.com/CO2-cityao/CAAT

  46. arXiv:2404.13513  [pdf

    cond-mat.mtrl-sci

    Ferroelectricity in an Antiferromagnetic Vanadium Trichloride Monolayer

    Authors: **ghao Deng, De** Guo, Yao Wen, Shuangzan Lu, Zhengbo Cheng, Zemin Pan, Tao Jian, Yusong Bai, Hui Zhang, Wei Ji, Jun He, Chendong Zhang

    Abstract: Multiferroicity allows magnetism to be controlled using electric fields or vice versa, which has gained tremendous interest in both fundamental research and device applications. A reduced dimensionality of multiferroic materials is highly desired for device miniaturization, but the coexistence of ferroelectricity and magnetism at the two-dimensional limit is still debated. Here, we used a NbSe2 su… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  47. arXiv:2404.13312  [pdf

    physics.geo-ph

    Seismic Interpolation Transformer for Consecutively Missing Data: A Case Study in DAS-VSP Data

    Authors: Ming Cheng, Jun Lin, Xintong Dong, Shao** Lu, Tie Zhong

    Abstract: Distributed optical fiber acoustic sensing (DAS) is a rapidly-developed seismic acquisition technology with advantages of low cost, high resolution, high sensitivity, and small interval, etc. Nonetheless, consecutively missing cases often appear in real seismic data acquired by DAS system due to some factors, including optical fiber damage and inferior coupling between cable and well. Recently, so… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  48. arXiv:2404.12768  [pdf, other

    cs.CV cs.AI cs.GR

    MixLight: Borrowing the Best of both Spherical Harmonics and Gaussian Models

    Authors: Xinlong Ji, Fangneng Zhan, Shijian Lu, Shi-Sheng Huang, Hua Huang

    Abstract: Accurately estimating scene lighting is critical for applications such as mixed reality. Existing works estimate illumination by generating illumination maps or regressing illumination parameters. However, the method of generating illumination maps has poor generalization performance and parametric models such as Spherical Harmonic (SH) and Spherical Gaussian (SG) fall short in capturing high-freq… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  49. arXiv:2404.12381  [pdf, other

    physics.app-ph physics.optics

    Wavelength-accurate and wafer-scale process for nonlinear frequency mixers in thin-film lithium niobate

    Authors: C. J. Xin, Shengyuan Lu, Jiayu Yang, Amirhassan Shams-Ansari, Boris Desiatov, Letícia S. Magalhães, Soumya S. Ghosh, Erin McGee, Dylan Renaud, Nicholas Achuthan, Arseniy Zvyagintsev, David Barton III, Neil Sinclair, Marko Lončar

    Abstract: Recent advancements in thin-film lithium niobate (TFLN) photonics have led to a new generation of high-performance electro-optic devices, including modulators, frequency combs, and microwave-to-optical transducers. However, the broader adoption of TFLN-based devices that rely on all-optical nonlinearities have been limited by the sensitivity of quasi-phase matching (QPM), realized via ferroelectri… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  50. arXiv:2404.10443  [pdf, ps, other

    cs.LG cs.AI

    AGHINT: Attribute-Guided Representation Learning on Heterogeneous Information Networks with Transformer

    Authors: **hui Yuan, Shan Lu, Peibo Duan, Jieyue He

    Abstract: Recently, heterogeneous graph neural networks (HGNNs) have achieved impressive success in representation learning by capturing long-range dependencies and heterogeneity at the node level. However, few existing studies have delved into the utilization of node attributes in heterogeneous information networks (HINs). In this paper, we investigate the impact of inter-node attribute disparities on HGNN… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures