Skip to main content

Showing 51–100 of 1,258 results for author: Lu, W

.
  1. arXiv:2404.00069  [pdf, other

    cs.LG

    A Two-Phase Recall-and-Select Framework for Fast Model Selection

    Authors: Jianwei Cui, Wenhang Shi, Honglin Tao, Wei Lu, Xiaoyong Du

    Abstract: As the ubiquity of deep learning in various machine learning applications has amplified, a proliferation of neural network models has been trained and shared on public model repositories. In the context of a targeted machine learning assignment, utilizing an apt source model as a starting point typically outperforms the strategy of training from scratch, particularly with limited training data. De… ▽ More

    Submitted 28 March, 2024; originally announced April 2024.

  2. arXiv:2403.18238  [pdf, other

    cs.CV

    TAFormer: A Unified Target-Aware Transformer for Video and Motion Joint Prediction in Aerial Scenes

    Authors: Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Yongqiang Mao, Hanbo Bi, Chenglong Liu, Xian Sun, Kun Fu

    Abstract: As drone technology advances, using unmanned aerial vehicles for aerial surveys has become the dominant trend in modern low-altitude remote sensing. The surge in aerial video data necessitates accurate prediction for future scenarios and motion states of the interested target, particularly in applications like traffic management and disaster response. Existing video prediction methods focus solely… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 17 pages, 9 figures

  3. arXiv:2403.17671  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci physics.app-ph

    Revealing the Microscopic Mechanism of Elementary Vortex Pinning in Superconductors

    Authors: C. Chen, Y. Liu, Y. Chen, Y. N. Hu, T. Z. Zhang, D. Li, X. Wang, C. X. Wang, Z. Y. W. Lu, Y. H. Zhang, Q. L. Zhang, X. L. Dong, R. Wang, D. L. Feng, T. Zhang

    Abstract: Vortex pinning is a crucial factor that determines the critical current of practical superconductors. However, the understanding of its underlying mechanism has long been phenomenological without a clear microscopic description. Here using high-resolution scanning tunneling microscopy, we studied single vortex pinning induced by point defect in layered FeSe-based superconductors. We found the defe… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 28 pages, 12 figures, Supplementary Materials included. Comments are welcome

  4. arXiv:2403.15192  [pdf, other

    cs.CV cs.AI

    SFOD: Spiking Fusion Object Detector

    Authors: Yimeng Fan, Wei Zhang, Changsong Liu, Mingyang Li, Wenrui Lu

    Abstract: Event cameras, characterized by high temporal resolution, high dynamic range, low power consumption, and high pixel bandwidth, offer unique capabilities for object detection in specialized contexts. Despite these advantages, the inherent sparsity and asynchrony of event data pose challenges to existing object detection algorithms. Spiking Neural Networks (SNNs), inspired by the way the human brain… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  5. arXiv:2403.13228  [pdf, ps, other

    math.RA math.CA

    Hilbert's Irreducibility Theorem for Linear Differential Operators

    Authors: Ruyong Feng, Zewang Guo, Wei Lu

    Abstract: We prove a differential analogue of Hilbert's irreducibility theorem. Let $\mathcal{L}$ be a linear differential operator with coefficients in $C(\mathbb{X})(x)$ that is irreducible over $\overline{C(\mathbb{X})}(x)$, where $\mathbb{X}$ is an irreducible affine algebraic variety over an algebraically closed field $C$ of characteristic zero. We show that the set of $c\in \mathbb{X}(C)$ such that th… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    MSC Class: 16S32; 68W30

  6. Giant electrode effect on tunneling magnetoresistance and electroresistance in van der Waals intrinsic multiferroic tunnel junctions using VS2

    Authors: Zhi Yan, Ruixia Yang, Cheng Fang, Wentian Lu, Xiaohong Xu

    Abstract: Van der Waals multiferroic tunnel junctions (vdW-MFTJs) with multiple nonvolatile resistive states are highly suitable for new physics and next-generation storage electronics. However, currently reported vdW-MFTJs are based on two types of materials, i.e., vdW ferromagnetic and ferroelectric materials, forming a multiferroic system. This undoubtedly introduces additional interfaces, increasing the… ▽ More

    Submitted 7 May, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 13 pages, 9 figures

  7. arXiv:2403.12152  [pdf, other

    cs.CV

    Development of Automated Neural Network Prediction for Echocardiographic Left ventricular Ejection Fraction

    Authors: Yuting Zhang, Boyang Liu, Karina V. Bunting, David Brind, Alexander Thorley, Andreas Karwath, Wenqi Lu, Diwei Zhou, Xiaoxia Wang, Alastair R. Mobley, Otilia Tica, Georgios Gkoutos, Dipak Kotecha, **ming Duan

    Abstract: The echocardiographic measurement of left ventricular ejection fraction (LVEF) is fundamental to the diagnosis and classification of patients with heart failure (HF). In order to quantify LVEF automatically and accurately, this paper proposes a new pipeline method based on deep neural networks and ensemble learning. Within the pipeline, an Atrous Convolutional Neural Network (ACNN) was first train… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to Frontiers in Medicine

  8. arXiv:2403.11221  [pdf, other

    cs.DC cs.DB

    Lion: Minimizing Distributed Transactions through Adaptive Replica Provision (Extended Version)

    Authors: Qiushi Zheng, Zhanhao Zhao, Wei Lu, Chang Yao, Yuxing Chen, Anqun Pan, Xiaoyong Du

    Abstract: Distributed transaction processing often involves multiple rounds of cross-node communications, and therefore tends to be slow. To improve performance, existing approaches convert distributed transactions into single-node transactions by either migrating co-accessed partitions onto the same nodes or establishing a super node housing replicas of the entire database. However, migration-based methods… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  9. arXiv:2403.09747  [pdf, other

    cs.CL cs.AI

    Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors

    Authors: Guanghua Li, Wensheng Lu, Wei Zhang, Defu Lian, Kezhong Lu, Rui Mao, Kai Shu, Hao Liao

    Abstract: The proliferation of fake news has had far-reaching implications on politics, the economy, and society at large. While Fake news detection methods have been employed to mitigate this issue, they primarily depend on two essential elements: the quality and relevance of the evidence, and the effectiveness of the verdict prediction mechanism. Traditional methods, which often source information from st… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  10. arXiv:2403.09244  [pdf, other

    physics.acc-ph physics.ins-det

    High precision proton beam monitor system concept design on CSNS based on SiC

    Authors: Ye He, Xingchen Li, Zijun Xu, Ming Qi, Congcong Wang, Chenwei Wang, Hai Lu, Xiaojun Nie, Ruirui Fan, Hantao **g, Weiming Song, Keqi Wang, Kai Liu, Peilian Liu, Hui Li, Zaiyi Li, Chenxi Fu, Xiyuan Zhang, Xiaoshen Kang, Zhan Li, Weiguo Lu, Suyu Xiao, Xin Shi

    Abstract: A high precision beam monitor system based on silicon carbide PIN sensor is designed for China Spallation Neutron Source 1.6 GeV proton beam to monitor the proton beam fluence.The concept design of the beam monitor system is finished together with front-end electronics with silicon carbide PIN sensors, readout system and mechanical system.Several tests are performed to study the performance of eac… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  11. arXiv:2403.09131  [pdf, other

    cs.CL cs.AI

    ProSwitch: Knowledge-Guided Instruction Tuning to Generate Professional and Non-Professional Styled Text

    Authors: Chang Zong, Yuyan Chen, Weiming Lu, Jian Shao, Yueting Zhuang

    Abstract: Large Language Models (LLMs) have demonstrated efficacy in various linguistic applications, including text summarization and controlled text generation. However, studies into their capacity of switching between styles via fine-tuning remain underexplored. This study concentrates on textual professionalism and introduces a novel methodology, named ProSwitch, which equips a language model with the a… ▽ More

    Submitted 15 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 8 pages

    MSC Class: 68T50 ACM Class: I.2.7

  12. arXiv:2403.08757  [pdf, other

    stat.ML cs.LG math.CO physics.app-ph

    Efficient Combinatorial Optimization via Heat Diffusion

    Authors: Hengyuan Ma, Wenlian Lu, Jianfeng Feng

    Abstract: Combinatorial optimization problems are widespread but inherently challenging due to their discrete nature.The primary limitation of existing methods is that they can only access a small fraction of the solution space at each iteration, resulting in limited efficiency for searching the global optimal. To overcome this challenge, diverging from conventional efforts of expanding the solver's search… ▽ More

    Submitted 14 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Code is available in https://github.com/AwakerMhy/HeO

  13. arXiv:2403.05063  [pdf, other

    cs.IR cs.AI

    Aligning Large Language Models for Controllable Recommendations

    Authors: Wensheng Lu, Jianxun Lian, Wei Zhang, Guanghua Li, Mingyang Zhou, Hao Liao, Xing Xie

    Abstract: Inspired by the exceptional general intelligence of Large Language Models (LLMs), researchers have begun to explore their application in pioneering the next generation of recommender systems - systems that are conversational, explainable, and controllable. However, existing literature primarily concentrates on integrating domain-specific knowledge into LLMs to enhance accuracy, often neglecting th… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 13 pages

    MSC Class: 68T50

  14. arXiv:2403.00807  [pdf

    cs.IR cs.CL cs.DC cs.DL

    Enhancing Cloud-Based Large Language Model Processing with Elasticsearch and Transformer Models

    Authors: Chunhe Ni, Jiang Wu, Hongbo Wang, Wenran Lu, Chenwei Zhang

    Abstract: Large Language Models (LLMs) are a class of generative AI models built using the Transformer network, capable of leveraging vast datasets to identify, summarize, translate, predict, and generate language. LLMs promise to revolutionize society, yet training these foundational models poses immense challenges. Semantic vector search within large language models is a potent technique that can signific… ▽ More

    Submitted 24 February, 2024; originally announced March 2024.

  15. arXiv:2403.00806  [pdf

    cs.IR cs.CE cs.CL cs.CV

    Enhanced User Interaction in Operating Systems through Machine Learning Language Models

    Authors: Chenwei Zhang, Wenran Lu, Chunhe Ni, Hongbo Wang, Jiang Wu

    Abstract: With the large language model showing human-like logical reasoning and understanding ability, whether agents based on the large language model can simulate the interaction behavior of real users, so as to build a reliable virtual recommendation A/B test scene to help the application of recommendation research is an urgent, important and economic value problem. The combination of interaction design… ▽ More

    Submitted 24 February, 2024; originally announced March 2024.

  16. arXiv:2402.19248  [pdf, other

    cs.CL

    Let LLMs Take on the Latest Challenges! A Chinese Dynamic Question Answering Benchmark

    Authors: Zhikun Xu, Yinghui Li, Ruixue Ding, Xinyu Wang, Boli Chen, Yong Jiang, Hai-Tao Zheng, Wenlian Lu, Pengjun Xie, Fei Huang

    Abstract: How to better evaluate the capabilities of Large Language Models (LLMs) is the focal point and hot topic in current LLMs research. Previous work has noted that due to the extremely high cost of iterative updates of LLMs, they are often unable to answer the latest dynamic questions well. To promote the improvement of Chinese LLMs' ability to answer dynamic questions, in this paper, we introduce CDQ… ▽ More

    Submitted 1 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Work in progress!

  17. SFTformer: A Spatial-Frequency-Temporal Correlation-Decoupling Transformer for Radar Echo Extrapolation

    Authors: Liangyu Xu, Wanxuan Lu, Hongfeng Yu, Fanglong Yao, Xian Sun, Kun Fu

    Abstract: Extrapolating future weather radar echoes from past observations is a complex task vital for precipitation nowcasting. The spatial morphology and temporal evolution of radar echoes exhibit a certain degree of correlation, yet they also possess independent characteristics. {Existing methods learn unified spatial and temporal representations in a highly coupled feature space, emphasizing the correla… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 16 pages, 11 figures, TGRS

  18. arXiv:2402.17574  [pdf, other

    cs.AI cs.CL

    Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

    Authors: Wenqi Zhang, Ke Tang, Hai Wu, Mengna Wang, Yongliang Shen, Guiyang Hou, Zeqi Tan, Peng Li, Yueting Zhuang, Weiming Lu

    Abstract: Large Language Models (LLMs) exhibit robust problem-solving capabilities for diverse tasks. However, most LLM-based agents are designed as specific task solvers with sophisticated prompt engineering, rather than agents capable of learning and evolving through interactions. These task solvers necessitate manually crafted prompts to inform task rules and regulate LLM behaviors, inherently incapacita… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL-2024 Main, camera-ready version

  19. arXiv:2402.16889  [pdf, other

    cs.LG cs.AI cs.CR

    Generative Models are Self-Watermarked: Declaring Model Authentication through Re-Generation

    Authors: Aditya Desu, Xuanli He, Qiongkai Xu, Wei Lu

    Abstract: As machine- and AI-generated content proliferates, protecting the intellectual property of generative models has become imperative, yet verifying data ownership poses formidable challenges, particularly in cases of unauthorized reuse of generated data. The challenge of verifying data ownership is further amplified by using Machine Learning as a Service (MLaaS), which often functions as a black-box… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  20. arXiv:2402.16371  [pdf, other

    eess.IV

    Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction

    Authors: Wen-Yang Lu, Eduardo Pavez, Antonio Ortega, Xin Zhao, Shan Liu

    Abstract: Current video coding standards, including H.264/AVC, HEVC, and VVC, employ discrete cosine transform (DCT), discrete sine transform (DST), and secondary to Karhunen-Loeve transforms (KLTs) decorrelate the intra-prediction residuals. However, the efficiency of these transforms in decorrelation can be limited when the signal has a non-smooth and non-periodic structure, such as those occurring in tex… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures

  21. arXiv:2402.14320  [pdf, other

    cs.CL cs.AI

    Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering

    Authors: Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Eliot Huang, Heng Chang, Yueting Zhuang

    Abstract: Recent progress with LLM-based agents has shown promising results across various tasks. However, their use in answering questions from knowledge bases remains largely unexplored. Implementing a KBQA system using traditional methods is challenging due to the shortage of task-specific training data and the complexity of creating task-focused model structures. In this paper, we present Triad, a unifi… ▽ More

    Submitted 15 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 8 pages

    MSC Class: 68T50 ACM Class: I.2.7

  22. arXiv:2402.13064  [pdf, other

    cs.CL

    Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

    Authors: Haoran Li, Qingxiu Dong, Zhengyang Tang, Chaojun Wang, Xingxing Zhang, Haoyang Huang, Shaohan Huang, Xiaolong Huang, Zeqiang Huang, Dongdong Zhang, Yuxian Gu, Xin Cheng, Xun Wang, Si-Qing Chen, Li Dong, Wei Lu, Zhifang Sui, Benyou Wang, Wai Lam, Furu Wei

    Abstract: We introduce Generalized Instruction Tuning (called GLAN), a general and scalable method for instruction tuning of Large Language Models (LLMs). Unlike prior work that relies on seed examples or existing datasets to construct instruction tuning data, GLAN exclusively utilizes a pre-curated taxonomy of human knowledge and capabilities as input and generates large-scale synthetic instruction data ac… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Work in progress

  23. arXiv:2402.12916  [pdf

    cs.LG cs.AI

    Data Pipeline Training: Integrating AutoML to Optimize the Data Flow of Machine Learning Models

    Authors: Jiang Wu, Hongbo Wang, Chunhe Ni, Chenwei Zhang, Wenran Lu

    Abstract: Data Pipeline plays an indispensable role in tasks such as modeling machine learning and develo** data products. With the increasing diversification and complexity of Data sources, as well as the rapid growth of data volumes, building an efficient Data Pipeline has become crucial for improving work efficiency and solving complex problems. This paper focuses on exploring how to optimize data flow… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  24. arXiv:2402.10738  [pdf, other

    cs.CL

    Let's Learn Step by Step: Enhancing In-Context Learning Ability with Curriculum Learning

    Authors: Yinpeng Liu, Jiawei Liu, Xiang Shi, Qikai Cheng, Yong Huang, Wei Lu

    Abstract: Demonstration ordering, which is an important strategy for in-context learning (ICL), can significantly affects the performance of large language models (LLMs). However, most of the current approaches of ordering require high computational costs to introduce the priori knowledge. In this paper, inspired by the human learning process, we propose a simple but effective demonstration ordering method… ▽ More

    Submitted 16 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  25. arXiv:2402.10456  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Generative Modeling for Tabular Data via Penalized Optimal Transport Network

    Authors: Wenhui Sophia Lu, Chenyang Zhong, Wing Hung Wong

    Abstract: The task of precisely learning the probability distribution of rows within tabular data and producing authentic synthetic samples is both crucial and non-trivial. Wasserstein generative adversarial network (WGAN) marks a notable improvement in generative modeling, addressing the challenges faced by its predecessor, generative adversarial network. However, due to the mixed data types and multimodal… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 37 pages, 23 figures

  26. arXiv:2402.09463  [pdf

    eess.IV

    Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results

    Authors: Kelly Payette, Céline Steger, Roxane Licandro, Priscille de Dumast, Hongwei Bran Li, Matthew Barkovich, Liu Li, Maik Dannecker, Chen Chen, Cheng Ouyang, Niccolò McConnell, Alina Miron, Yongmin Li, Alena Uus, Irina Grigorescu, Paula Ramirez Gilliland, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Haoyu Wang, Ziyan Huang, ** Ye, Mireia Alenyà, Valentin Comte, Oscar Camara , et al. (42 additional authors not shown)

    Abstract: Segmentation is a critical step in analyzing the develo** human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across dif… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Results from FeTA Challenge 2022, held at MICCAI; Manuscript submitted. Supplementary Info (including submission methods descriptions) available here: https://zenodo.org/records/10628648

  27. arXiv:2402.05647  [pdf, other

    astro-ph.HE astro-ph.SR

    Upper limits on the radio pulses from magnetars and a central compact object with FAST

    Authors: Wan-** Lu, ** Zhou, Pei Wang, Yi-Xuan Shao, Xiang-Dong Li, Jacco Vink, Di Li, Yang Chen

    Abstract: Magnetars and central compact objects (CCOs) are subgroups of neutron stars that show a number of properties distinguished from canonical radio pulsars. We performed radio observations of three magnetars SGR 0418+5729, 1E 2259+586, 4U 0142+61, and a CCO PSR J1852+0040 with the Fivehundred-meter Aperture Spherical radio Telescope (FAST) at 1.25 GHz, aiming to search for radio pulsations in their qu… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 7 pages, 3 figures, 2 tables; accepted for publication in ApJ

  28. arXiv:2402.05121  [pdf, other

    cs.AI cs.CL

    Large Language Model for Table Processing: A Survey

    Authors: Weizheng Lu, Jiaming Zhang, **g Zhang, Yueguo Chen

    Abstract: Tables, typically two-dimensional and structured to store large amounts of data, are essential in daily activities like database queries, spreadsheet calculations, and generating reports from web tables. Automating these table-centric tasks with Large Language Models (LLMs) offers significant public benefits, garnering interest from academia and industry. This survey provides an extensive overview… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  29. arXiv:2402.04995  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Observation of Giant Spin Splitting and d-wave Spin Texture in Room Temperature Altermagnet RuO2

    Authors: Zihan Lin, Dong Chen, Wenlong Lu, Xin Liang, Shiyu Feng, Kohei Yamagami, Jacek Osiecki, Mats Leandersson, Balasubramanian Thiagarajan, Junwei Liu, Claudia Felser, Junzhang Ma

    Abstract: Recently, a novel magnetic phase called altermagnetism has been proposed, ushering in a third distinct magnetic phase beyond ferromagnetism and antiferromagnetism. It is expected that this groundbreaking phase exhibits unique physical properties such as C-paired spin-valley locking, anomalous Hall effect, nontrivial Berry phase, and giant magnetoresistance, etc. Among all the predicted candidates,… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 32 pages, 12 figures

  30. arXiv:2402.03585  [pdf, other

    cs.CV eess.IV

    Decoder-Only Image Registration

    Authors: Xi Jia, Wenqi Lu, Xinxing Cheng, **ming Duan

    Abstract: In unsupervised medical image registration, the predominant approaches involve the utilization of a encoder-decoder network architecture, allowing for precise prediction of dense, full-resolution displacement fields from given paired images. Despite its widespread use in the literature, we argue for the necessity of making both the encoder and decoder learnable in such an architecture. For this, w… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  31. arXiv:2402.02374  [pdf, other

    cs.CV

    PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal

    Authors: Tao Wang, Wanglong Lu, Kaihao Zhang, Wenhan Luo, Tae-Kyun Kim, Tong Lu, Hongdong Li, Ming-Hsuan Yang

    Abstract: Existing single image reflection removal (SIRR) methods using deep learning tend to miss key low-frequency (LF) and high-frequency (HF) differences in images, affecting their effectiveness in removing reflections. To address this problem, this paper proposes a novel prompt-guided reflection removal (PromptRR) framework that uses frequency information as new visual prompts for better reflection per… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 10 pages, 10 figures

  32. arXiv:2402.00374  [pdf, other

    quant-ph

    Quantum Information Geometry with Non-Hermitian Systems

    Authors: Wangjun Lu, Zhao-Hui Peng, HongTao

    Abstract: Information geometry is the application of differential geometry in statistics, where the Fisher-Rao metric serves as the Riemannian metric on the statistical manifold, providing an intrinsic property for parameter sensitivity. In this paper, we explore the Fisher-Rao metric with the non-Hermitian systems. By approximating the Lindblad master equation in the non-Hermitian Hamiltonian, we calculate… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  33. arXiv:2401.15862  [pdf, other

    math.NA

    PML-based boundary integral equation method for electromagnetic scattering problems in a layered-medium

    Authors: Gang Bao, Wangtao Lu, Tao Yin, Lu Zhang

    Abstract: This paper proposes a new boundary integral equation (BIE) methodology based on the perfectly matched layer (PML) truncation technique for solving the electromagnetic scattering problems in a multi-layered medium. Instead of using the original PML stretched fields, artificial fields which are also equivalent to the solutions in the physical region are introduced. This significantly simplifies the… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 26 pages, 13 figures

  34. arXiv:2401.15543  [pdf

    cs.LG physics.acc-ph

    Anomaly Detection of Particle Orbit in Accelerator using LSTM Deep Learning Technology

    Authors: Zhiyuan Chen, Wei Lu, Radhika Bhong, Yimin Hu, Brian Freeman, Adam Carpenter

    Abstract: A stable, reliable, and controllable orbit lock system is crucial to an electron (or ion) accelerator because the beam orbit and beam energy instability strongly affect the quality of the beam delivered to experimental halls. Currently, when the orbit lock system fails operators must manually intervene. This paper develops a Machine Learning based fault detection methodology to identify orbit lock… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 6 pages

    ACM Class: I.5.4

  35. arXiv:2401.15252  [pdf, ps, other

    math.DS

    On Time-Varying Delayed Stochastic Differential Systems with Non-Markovian Switching Parameters

    Authors: Xinyu Wu, Zidong Wang, Wenlian Lu

    Abstract: This paper focuses on time-varying delayed stochastic differential systems with stochastically switching parameters formulated by a unified switching behavior combining a discrete adapted process and a Cox process. Unlike prior studies limited to stationary and ergodic switching scenarios, our research emphasizes non-Markovian, non-stationary, and non-ergodic cases. It arrives at more general resu… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  36. arXiv:2401.11261  [pdf, other

    cs.LG cs.CV

    Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient

    Authors: Weiguo Lu, Xuan Wu, Deng Ding, **qiao Duan, Jirong Zhuang, Gangnan Yuan

    Abstract: Diffusion models (DMs) are a type of generative model that has a huge impact on image synthesis and beyond. They achieve state-of-the-art generation results in various generative tasks. A great diversity of conditioning inputs, such as text or bounding boxes, are accessible to control the generation. In this work, we propose a conditioning mechanism utilizing Gaussian mixture models (GMMs) as feat… ▽ More

    Submitted 1 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  37. arXiv:2401.10923  [pdf, other

    math.OC stat.ML

    Online estimation of the inverse of the Hessian for stochastic optimization with application to universal stochastic Newton algorithms

    Authors: Antoine Godichon-Baggioni, Wei Lu, Bruno Portier

    Abstract: This paper addresses second-order stochastic optimization for estimating the minimizer of a convex function written as an expectation. A direct recursive estimation technique for the inverse Hessian matrix using a Robbins-Monro procedure is introduced. This approach enables to drastically reduces computational complexity. Above all, it allows to develop universal stochastic Newton methods and inve… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  38. arXiv:2401.07574  [pdf, other

    quant-ph

    Generating Bell states and Werner states of two qubits via optical field

    Authors: Dengkui Jiang, Cuilu Zhai, Yaju Song, Zhaohui Peng, Jibing Yuan, Shiqing Tang, Wangjun Lu

    Abstract: In this paper, we investigate how the evolution of the states of two qubits initially in a direct product state can be controlled by the optical field in a Tavis-Cummings (TC) model. For the two qubits initially in the direct product state, we find that their matrix elements at any moment can be modulated by the coefficients of the optical field initial states in the number state space. We propose… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  39. Clifford algebra Cl(0,6) approach to beyond the standard model and naturalness problems

    Authors: Wei Lu

    Abstract: Is there more to Dirac's gamma matrices than meets the eye? It turns out that gamma zero can be factorized into a product of three operators. This revelation facilitates the expansion of Dirac's space-time algebra to Clifford algebra Cl(0,6). The resultant rich geometric structure can be leveraged to establish a combined framework of the standard model and gravity, wherein a gravi-weak interaction… ▽ More

    Submitted 4 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 73 pages

    Journal ref: International Journal of Geometric Methods in Modern Physics, Vol. 21, No. 05, 2450089 (2024)

  40. arXiv:2401.02385  [pdf, other

    cs.CL cs.AI

    TinyLlama: An Open-Source Small Language Model

    Authors: Peiyuan Zhang, Guangtao Zeng, Tianduo Wang, Wei Lu

    Abstract: We present TinyLlama, a compact 1.1B language model pretrained on around 1 trillion tokens for approximately 3 epochs. Building on the architecture and tokenizer of Llama 2, TinyLlama leverages various advances contributed by the open-source community (e.g., FlashAttention and Lit-GPT), achieving better computational efficiency. Despite its relatively small size, TinyLlama demonstrates remarkable… ▽ More

    Submitted 3 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Technical Report

  41. arXiv:2401.02009  [pdf, other

    cs.CL cs.AI

    Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives

    Authors: Wenqi Zhang, Yongliang Shen, Linjuan Wu, Qiuying Peng, Jun Wang, Yueting Zhuang, Weiming Lu

    Abstract: The reflection capacity of Large Language Model (LLM) has garnered extensive attention. A post-hoc prompting strategy, e.g., reflexion and self-refine, refines LLM's response based on self-evaluated or external feedback. However, recent research indicates without external feedback, LLM's intrinsic reflection is unstable. Our investigation unveils that the key bottleneck is the quality of the self-… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: Accepted by ACL 2024 Main

  42. arXiv:2401.01557  [pdf, other

    physics.med-ph physics.app-ph

    Angular scanning VHEE (very high energy electron) pencil beam delivery for radiotherapy

    Authors: Bing Zhou, Zhiyuan Guo, Yang Wan, Shuang Liu, Jianfei Hua, Wei Lu

    Abstract: The use of very high energy electrons (VHEE) for radiotherapy has been actively studied for over two decades due to its advantageous dose distribution, deep penetration depth and great potential of ultra-high dose-rate irradiation. However, the high entrance dose of VHEE beams can damage the surface skin of patients and hinder its widespread application. To address this challenge, a novel method u… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 figures

  43. arXiv:2401.01300  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Engineering the strain and interlayer excitons of 2D materials via lithographically engraved hexagonal boron nitride

    Authors: Yu-Chiang Hsieh, Zhen-You Lin, Shin-Ji Fung, Wen-Shin Lu, Sheng-Chin Ho, Siang-** Hong, Sheng-Zhu Ho, Chiu-Hua Huang, Kenji Watanabe, Takashi Taniguchi, Yang-Hao Chan, Yi-Chun Chen, Chung-Lin Wu, Tse-Ming Chen

    Abstract: Strain engineering has quickly emerged as a viable option to modify the electronic, optical and magnetic properties of 2D materials. However, it remains challenging to arbitrarily control the strain. Here we show that by creating atomically-flat surface nanostructures in hexagonal boron nitride, we achieve an arbitrary on-chip control of both the strain distribution and magnitude on high-quality m… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 8 pages, 5 figures

    Journal ref: Nano Lett. 23, 7244-7251 (2023)

  44. arXiv:2401.01270  [pdf, other

    cs.LG

    Optimal Rates of Kernel Ridge Regression under Source Condition in Large Dimensions

    Authors: Haobo Zhang, Yicheng Li, Weihao Lu, Qian Lin

    Abstract: Motivated by the studies of neural networks (e.g.,the neural tangent kernel theory), we perform a study on the large-dimensional behavior of kernel ridge regression (KRR) where the sample size $n \asymp d^γ$ for some $γ> 0$. Given an RKHS $\mathcal{H}$ associated with an inner product kernel defined on the sphere $\mathbb{S}^{d}$, we suppose that the true function $f_ρ^{*} \in [\mathcal{H}]^{s}$,… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 61 pages, 11 figures

  45. arXiv:2401.00865  [pdf, other

    cs.DC

    Xorbits: Automating Operator Tiling for Distributed Data Science

    Authors: Weizheng Lu, Kaisheng He, Xuye Qin, Chengjie Li, Zhong Wang, Tao Yuan, Xia Liao, Feng Zhang, Yueguo Chen, Xiaoyong Du

    Abstract: Data science pipelines commonly utilize dataframe and array operations for tasks such as data preprocessing, analysis, and machine learning. The most popular tools for these tasks are pandas and NumPy. However, these tools are limited to executing on a single node, making them unsuitable for processing large-scale data. Several systems have attempted to distribute data science applications to clus… ▽ More

    Submitted 19 March, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: ICDE 2024 Industrial and Application Track

  46. arXiv:2401.00746  [pdf, other

    q-bio.NC cs.NE physics.bio-ph

    Learn to integrate parts for whole through correlated neural variability

    Authors: Zhichao Zhu, Yang Qi, Wenlian Lu, Jianfeng Feng

    Abstract: Sensory perception originates from the responses of sensory neurons, which react to a collection of sensory signals linked to various physical attributes of a singular perceptual object. Unraveling how the brain extracts perceptual information from these neuronal responses is a pivotal challenge in both computational neuroscience and machine learning. Here we introduce a statistical mechanical the… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 18 pages, 5 figures

  47. arXiv:2401.00657  [pdf, other

    math.OC cs.CV math.SP

    Optimizing ADMM and Over-Relaxed ADMM Parameters for Linear Quadratic Problems

    Authors: **tao Song, Wenqi Lu, Yunwen Lei, Yuchao Tang, Zhenkuan Pan, **ming Duan

    Abstract: The Alternating Direction Method of Multipliers (ADMM) has gained significant attention across a broad spectrum of machine learning applications. Incorporating the over-relaxation technique shows potential for enhancing the convergence rate of ADMM. However, determining optimal algorithmic parameters, including both the associated penalty and relaxation parameters, often relies on empirical approa… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Accepted to AAAI 2024

  48. arXiv:2401.00104  [pdf, other

    cs.LG cs.AI stat.ME

    Causal State Distillation for Explainable Reinforcement Learning

    Authors: Wenhao Lu, Xufeng Zhao, Thilo Fryen, Jae Hee Lee, Mengdi Li, Sven Magg, Stefan Wermter

    Abstract: Reinforcement learning (RL) is a powerful technique for training intelligent agents, but understanding why these agents make specific decisions can be quite challenging. This lack of transparency in RL models has been a long-standing problem, making it difficult for users to grasp the reasons behind an agent's behaviour. Various approaches have been explored to address this problem, with one promi… ▽ More

    Submitted 1 April, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: https://lukaswill.github.io/; Accepted as oral by CLeaR 2024

  49. arXiv:2312.16134  [pdf, other

    math.NA

    Does PML exponentially absorb outgoing waves scattering from a periodic surface?

    Authors: Wangtao Lu, Kuanrong Shen, Ruming Zhang

    Abstract: The PML method is well-known for its exponential convergence rate and easy implementation for scattering problems with unbounded domains. For rough-surface scattering problems, authors in [5] proved that the PML method converges at most algebraically in the physical domain. However, the authors also asked a question whether exponential convergence still holds for compact subsets. In [25], one of o… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  50. arXiv:2312.13032  [pdf, other

    cs.LG cs.AI

    NodeMixup: Tackling Under-Reaching for Graph Neural Networks

    Authors: Weigang Lu, Ziyu Guan, Wei Zhao, Yaming Yang, Long **

    Abstract: Graph Neural Networks (GNNs) have become mainstream methods for solving the semi-supervised node classification problem. However, due to the uneven location distribution of labeled nodes in the graph, labeled nodes are only accessible to a small portion of unlabeled nodes, leading to the \emph{under-reaching} issue. In this study, we firstly reveal under-reaching by conducting an empirical investi… ▽ More

    Submitted 20 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI-24