Showing 1–50 of 156 results for author: Shu, L

  1. arXiv:2406.15740  [pdf, other]

    astro-ph.IM physics.ins-det

    The FRB-searching pipeline of the Tianlai Cylinder Pathfinder Array

    Authors: Zijie Yu, Furen Deng, Shijie Sun, Chenhui Niu, Jixia Li, Fengquan Wu, Wei-Yang Wang, Yougang Wang, Shifan Zuo, Lin Shu, Jie Hao, Xiaohui Liu, Reza Ansari, Ue-Li Pen, Albert Stebbins, Peter Timbie, Xuelei Chen

    Abstract: This paper presents the design, calibration, and survey strategy of the Fast Radio Burst (FRB) digital backend and its real-time data processing pipeline employed in the Tianlai Cylinder Pathfinder array. The array, consisting of three parallel cylindrical reflectors and equipped with 96 dual-polarization feeds, is a radio interferometer array designed for conducting drift scans of the northern ce… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 27 pages, 21 figures, 7 tables, RAA accepted

  2. arXiv:2406.06592  [pdf, other]

    cs.CL cs.LG

    Improve Mathematical Reasoning in Language Models by Automated Process Supervision

    Authors: Liangchen Luo, Yinxiao Liu, Rosanne Liu, Samrat Phatale, Harsh Lara, Yunxuan Li, Lei Shu, Yun Zhu, Lei Meng, Jiao Sun, Abhinav Rastogi

    Abstract: Complex multi-step reasoning tasks, such as solving mathematical problems or generating code, remain a significant hurdle for even the most advanced large language models (LLMs). Verifying LLM outputs with an Outcome Reward Model (ORM) is a standard inference-time technique aimed at enhancing the reasoning performance of LLMs. However, this still proves insufficient for reasoning tasks with a leng… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 18 pages, 5 figures, 1 table

  3. arXiv:2405.16178  [pdf, other]

    cs.CL

    Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection

    Authors: Yun Zhu, Jia-Chen Gu, Caitlin Sikora, Ho Ko, Yinxiao Liu, Chu-Cheng Lin, Lei Shu, Liangchen Luo, Lei Meng, Bang Liu, Jindong Chen

    Abstract: Large language models (LLMs) augmented with retrieval exhibit robust performance and extensive versatility by incorporating external contexts. However, the input length grows linearly in the number of retrieved documents, causing a dramatic increase in latency. In this paper, we propose a novel paradigm named Sparse RAG, which seeks to cut computation costs through sparsity. Specifically, Sparse R… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  4. arXiv:2405.11502  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    CTGNN: Crystal Transformer Graph Neural Network for Crystal Material Property Prediction

    Authors: Zijian Du, Luozhijie Jin, Le Shu, Yan Cen, Yuanfeng Xu, Yongfeng Mei, Hao Zhang

    Abstract: The combination of deep learning algorithms and materials science has made significant progress in predicting novel materials and understanding various behaviours of materials. Here, we introduced a new model called the Crystal Transformer Graph Neural Network (CTGNN), which combines the advantages of Transformer model and graph neural networks to address the complexity of structure-properties r… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 17 pages

  5. arXiv:2405.07429  [pdf, other]

    cs.RO

    JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation

    Authors: Xubo Luo, Xue Wan, Yixing Gao, Yaolin Tian, Wei Zhang, Leizheng Shu

    Abstract: Visual localization of unmanned aerial vehicles (UAVs) in planetary scenes aims to estimate the absolute pose of the UAV in the world coordinate system through satellite maps and images captured by on-board cameras. However, since planetary scenes often lack significant landmarks and there are modal differences between satellite maps and UAV images, the accuracy and real-time performance of UAV positioning… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 8 pages

  6. arXiv:2403.09030  [pdf]

    cs.SD cs.LG eess.AS

    An AI-Driven Approach to Wind Turbine Bearing Fault Diagnosis from Acoustic Signals

    Authors: Zhao Wang, Xiaomeng Li, Na Li, Longlong Shu

    Abstract: This study aimed to develop a deep learning model for the classification of bearing faults in wind turbine generators from acoustic signals. A convolutional LSTM model was successfully constructed and trained by using audio data from five predefined fault types for both training and validation. To create the dataset, raw audio signal data was collected and processed in frames to capture time and f… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  7. Two-Dimensional Phase-Fluctuating Superconductivity in Bulk-Crystalline NdO$_{0.5}$F$_{0.5}$BiS$_2$

    Authors: C. S. Chen, J. Küspert, I. Biało, J. Mueller, K. W. Chen, M. Y. Zou, D. G. Mazzone, D. Bucher, K. Tanaka, O. Ivashko, M. v. Zimmermann, Qisi Wang, Lei Shu, J. Chang

    Abstract: We present a combined growth and transport study of superconducting single-crystalline NdO$_{0.5}$F$_{0.5}$BiS$_2$. Evidence of two-dimensional superconductivity with significant phase fluctuations of preformed Cooper pairs preceding the superconducting transition is reported. This result is based on three key observations. (1) The resistive superconducting transition temperature $T_c$ (defined by… ▽ More

    Submitted 24 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  8. arXiv:2401.09755  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    Crystal Transformer Based Universal Atomic Embedding for Accurate and Transferable Prediction of Materials Properties

    Authors: Luozhijie Jin, Zijian Du, Le Shu, Yongfeng Mei, Hao Zhang

    Abstract: In this work, we propose a novel approach to generate universal atomic embeddings, significantly enhancing the representational and accuracy aspects of atomic embeddings, which ultimately improves the accuracy of property prediction. Moreover, we demonstrate the excellent transferability of universal atomic embeddings across different databases and various property tasks. Our approach centers on d… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 24 pages, 5 figures

  9. arXiv:2401.07382  [pdf, other]

    cs.CL cs.AI

    Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation

    Authors: Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng

    Abstract: Reinforcement learning (RL) can align language models with non-differentiable reward signals, such as human preferences. However, a major challenge arises from the sparsity of these reward signals - typically, there is only a single reward for an entire output. This sparsity of rewards can lead to inefficient and unstable learning. To address this challenge, our paper introduces a novel framework… ▽ More

    Submitted 19 February, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

  10. arXiv:2401.04546  [pdf, ps, other]

    cond-mat.supr-con

    Multi-condensate lengths with degenerate excitation gaps in BaNi$_2$As$_2$ revealed by muon spin relaxation study

    Authors: Kaiwen Chen, Zihao Zhu, Yaofeng Xie, Adrian D. Hillier, James S. Lord, Pengcheng Dai, Lei Shu

    Abstract: The recently discovered (Ba,Sr)Ni$_2$As$_2$ family provides an ideal platform for investigating the interaction between electronic nematicity and superconductivity. Here we report the muon spin relaxation ($μ$SR) measurements on BaNi$_2$As$_2$. Transverse-field $μ$SR experiments indicate that the temperature dependence of superfluid density is best fitted with a single-band $s$-wave model. On the… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: Accepted by Phys. Rev. B

  11. arXiv:2311.16344  [pdf, other]

    cs.CV cs.GR

    Spatially Adaptive Cloth Regression with Implicit Neural Representations

    Authors: Lei Shu, Vinicius Azevedo, Barbara Solenthaler, Markus Gross

    Abstract: The accurate representation of fine-detailed cloth wrinkles poses significant challenges in computer graphics. The inherently non-uniform structure of cloth wrinkles mandates the employment of intricate discretization strategies, which are frequently characterized by high computational demands and complex methodologies. Addressing this, the research introduced in this paper elucidates a novel anis… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 16 pages, 13 figures

    MSC Class: 68T07; ACM Class: I.3.0

  12. arXiv:2311.15717  [pdf, other]

    cond-mat.str-el cond-mat.supr-con

    Evidence of spin density waves in La$_3$Ni$_2$O$_{7-δ}$

    Authors: Kaiwen Chen, Xiangqi Liu, Jiachen Jiao, Muyuan Zou, Yixuan Luo, Qiong Wu, Ningyuan Zhang, Yanfeng Guo, Lei Shu

    Abstract: The recently discovered superconductivity with critical temperature $T_c$ up to 80 K in the double-layer Nickelate La$_3$Ni$_2$O$_{7-δ}$ under pressure has drawn great attention. Here we report the positive muon spin relaxation ($μ^+$SR) study of polycrystalline La$_3$Ni$_2$O$_{6.92}$ under ambient pressure. Zero-field $μ^+$SR experiments reveal the existence of magnetic order in La$_3$Ni$_2$O… ▽ More

    Submitted 13 May, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  13. arXiv:2311.09204  [pdf, other]

    cs.CL cs.AI

    Fusion-Eval: Integrating Assistant Evaluators with LLMs

    Authors: Lei Shu, Nevan Wichers, Liangchen Luo, Yun Zhu, Yinxiao Liu, Jindong Chen, Lei Meng

    Abstract: Evaluating natural language systems poses significant challenges, particularly in the realms of natural language understanding and high-level reasoning. In this paper, we introduce 'Fusion-Eval', an innovative approach that leverages Large Language Models (LLMs) to integrate insights from various assistant evaluators. The LLM is given the example to evaluate along with scores from the assistant ev… ▽ More

    Submitted 6 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  14. arXiv:2311.09179  [pdf, other]

    cs.CL

    SiRA: Sparse Mixture of Low Rank Adaptation

    Authors: Yun Zhu, Nevan Wichers, Chu-Cheng Lin, Xinyi Wang, Tianlong Chen, Lei Shu, Han Lu, Canoee Liu, Liangchen Luo, Jindong Chen, Lei Meng

    Abstract: Parameter Efficient Tuning has been a prominent approach to adapt the Large Language Model to downstream tasks. Most previous works consider adding dense trainable parameters, where all parameters are used to adapt to a certain task. Using LoRA as an example, we found this less effective empirically: introducing more trainable parameters does not help. Motivated by this we investigate the imp… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  15. arXiv:2310.04815  [pdf, other]

    cs.LG

    Critique Ability of Large Language Models

    Authors: Liangchen Luo, Zi Lin, Yinxiao Liu, Lei Shu, Yun Zhu, Jingbo Shang, Lei Meng

    Abstract: Critical thinking is essential for rational decision-making and problem-solving. This skill hinges on the ability to provide precise and reasoned critiques and is a hallmark of human intelligence. In the era of large language models (LLMs), this study explores the ability of LLMs to deliver accurate critiques across various tasks. We are interested in this topic as a capable critic model could not… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  16. arXiv:2309.16982  [pdf, other]

    cond-mat.supr-con cond-mat.str-el

    Superconducting Properties of La$_2$(Cu$_{1-x}$Ni$_x$)$_5$As$_3$O$_2$: A $\rm μ$SR Study

    Authors: Qiong Wu, Kaiwen Chen, Zihao Zhu, Cheng Tan, Yanxing Yang, Xin Li, Toni Shiroka, Xu Chen, Jiangang Guo, Xiaolong Chen, Lei Shu

    Abstract: We report the results of muon spin rotation and relaxation ($\rm μ$SR) measurements on the recently discovered layered Cu-based superconducting material La$_{2}($Cu$_{1-x}$Ni$_{x}$)$_{5}$As$_{3}$O$_{2}$ ($x =$ 0.40, 0.45). Transverse-field $\rm μ$SR experiments on both samples show that the temperature dependence of superfluid density is best described by a two-band model. The absolute values of z… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Journal ref: Phys. Rev. B 107, 214502 (2023)

  17. arXiv:2309.16947  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Muon Spin Relaxation Study of frustrated Tm$_3$Sb$_3$Mg$_2$O$_{14}$ with kagomé lattice

    Authors: Yanxing Yang, Kaiwen Chen, Zhaofeng Ding, Adrian D. Hillier, Lei Shu

    Abstract: The structure and magnetic properties of rare-earth ions Tm$^{3+}$ kagomé lattice Tm$_3$Sb$_3$Mg$_2$O$_{14}$ are studied by X-ray diffraction, magnetic susceptibility and muon spin relaxation ($μ$SR) experiments. The existence of a small amount of Tm/Mg site-mixing disorder is revealed. DC magnetic susceptibility measurement shows that Tm$^{3+}$ magnetic moments are antiferromagnetically correlate… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Journal ref: Chin. Phys. Lett. 39 (2022) 107502

  18. arXiv:2309.08026  [pdf, other]

    physics.soc-ph math.DS

    Determinants of successful mitigation in coupled social-climate dynamics

    Authors: Longmei Shu, Feng Fu

    Abstract: Understanding the impact of human behavior is crucial for successful mitigation of climate change across the globe. To shed light onto this issue, here we couple the forest dieback model with human behaviors. Using evolutionary game theory, we build a time-delay system where forest growth is impacted by both temperature and human mitigation choices, the latter being informed by temperature forecas… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  19. arXiv:2308.11807  [pdf, other]

    cs.CL

    Towards an On-device Agent for Text Rewriting

    Authors: Yun Zhu, Yinxiao Liu, Felix Stahlberg, Shankar Kumar, Yu-hui Chen, Liangchen Luo, Lei Shu, Renjie Liu, Jindong Chen, Lei Meng

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities for text rewriting. Nonetheless, the large sizes of these models make them impractical for on-device inference, which would otherwise allow for enhanced privacy and economical inference. Creating a smaller yet potent language model for text rewriting presents a formidable challenge because it requires balancing the need for a s… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  20. arXiv:2308.00063  [pdf, ps, other]

    math.DS

    Isospectral Reductions of Non-negative Matrices

    Authors: Alexandre Baraviera, Pedro Duarte, Longmei Shu, Maria Joana Torres

    Abstract: Isospectral reduction is an important tool for network/matrix analysis as it reduces the dimension of a matrix/network while preserving all its eigenvalues and eigenvectors. The main contribution of this manuscript is a proposed algorithmic scheme to approximate the stationary measure of a stochastic matrix based on isospectral reduction. This scheme can be advantageous when there is more than one… ▽ More

    Submitted 27 September, 2023; v1 submitted 31 July, 2023; originally announced August 2023.

  21. arXiv:2305.15685  [pdf, other]

    cs.CL cs.AI

    RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting

    Authors: Lei Shu, Liangchen Luo, Jayakumar Hoskere, Yun Zhu, Yinxiao Liu, Simon Tong, Jindong Chen, Lei Meng

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities in creative tasks such as storytelling and E-mail generation. However, as LLMs are primarily trained on final text results rather than intermediate revisions, it might be challenging for them to perform text rewriting tasks. Most studies in the rewriting tasks focus on a particular transformation type within the boundaries of s… ▽ More

    Submitted 19 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Journal ref: AAAI 2024

  22. arXiv:2304.11658  [pdf, other]

    cs.LG

    Capturing Fine-grained Semantics in Contrastive Graph Representation Learning

    Authors: Lin Shu, Chuan Chen, Zibin Zheng

    Abstract: Graph contrastive learning defines a contrastive task to pull similar instances close and push dissimilar instances away. It learns discriminative node embeddings without supervised labels, which has aroused increasing attention in the past few years. Nevertheless, existing methods of graph contrastive learning ignore the differences between diverse semantics existing in graphs, which learn coarse-… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  23. arXiv:2301.08986  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Adapting a Language Model While Preserving its General Knowledge

    Authors: Zixuan Ke, Yijia Shao, Haowei Lin, Hu Xu, Lei Shu, Bing Liu

    Abstract: Domain-adaptive pre-training (or DA-training for short), also known as post-training, aims to train a pre-trained general-purpose language model (LM) using an unlabeled corpus of a particular domain to adapt the LM so that end-tasks in the domain can give improved performances. However, existing DA-training methods are in some sense blind as they do not explicitly identify what knowledge in the LM… ▽ More

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: EMNLP 2022

  24. arXiv:2210.05549  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Continual Training of Language Models for Few-Shot Learning

    Authors: Zixuan Ke, Haowei Lin, Yijia Shao, Hu Xu, Lei Shu, Bing Liu

    Abstract: Recent work on applying large language models (LMs) achieves impressive performance in many NLP applications. Adapting or post-training an LM using an unlabeled domain corpus can produce even better performance for end-tasks in the domain. This paper proposes the problem of continually extending an LM by incrementally post-training the LM with a sequence of unlabeled domain corpora to expand its knowl… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Journal ref: EMNLP 2022

  25. A Fast Transient Backend to Detect FRBs with the Tianlai Dish Pathfinder Array

    Authors: Zijie Yu, Furen Deng, Shijie Sun, Chenhui Niu, Jixia Li, Fengquan Wu, Wei-Yang Wang, Yougang Wang, Hui Feng, Lin Shu, Jie Hao, Reza Ansari, Albert Stebbins, Xuelei Chen

    Abstract: The Tianlai Dish Pathfinder array is a radio interferometer array consisting of 16 six-meter dish antennas. The original digital backend integration time is at the seconds level, designed for the HI intensity mapping experiment. A new digital backend with millisecond response is added to enable it to search for fast radio bursts (FRBs) during its observations. The design and calibration of this backend,… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: 16 pages, 14 figures, RAA accepted

    Journal ref: Research in Astronomy and Astrophysics, 22, 125007 (2022)

  26. arXiv:2209.04277  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Flexo-photovoltaic effect and above-bandgap photovoltage in halide perovskites

    Authors: Zhiguo Wang, Shengwen Shu, Xiaoyong Wei, Renhong Liang, Shanming Ke, Longlong Shu, Gustau Catalan

    Abstract: Halide perovskites have outstanding photovoltaic properties which have been optimized through interfacial engineering. However, as these materials approach the limits imposed by the physics of semiconductor junctions, it is urgent to explore alternatives, such as the bulk photovoltaic effect, whose physical origin is different and not bound by the same limits. In this context, we focus on the flex… ▽ More

    Submitted 4 January, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: 20 pages, 11 figures

  27. arXiv:2208.13685  [pdf, other]

    cs.LG cs.CR

    FedEgo: Privacy-preserving Personalized Federated Graph Learning with Ego-graphs

    Authors: Taolin Zhang, Chuan Chen, Yaomin Chang, Lin Shu, Zibin Zheng

    Abstract: As special information carriers containing both structure and feature information, graphs are widely used in graph mining, e.g., Graph Neural Networks (GNNs). However, in some practical scenarios, graph data are stored separately in multiple distributed parties, which may not be directly shared due to conflicts of interest. Hence, federated graph neural networks are proposed to address such data s… ▽ More

    Submitted 9 September, 2022; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: 25 pages, submitted to ACM Transactions on Knowledge Discovery from Data (TKDD)

  28. Eco-Evolutionary Dynamics of Bimatrix Games

    Authors: Longmei Shu, Feng Fu

    Abstract: Feedbacks between strategies and the environment are common in social-ecological, evolutionary-ecological, and even psychological-economic systems. Utilizing common resources is always a dilemma for community members, like tragedy of the commons. Here we consider replicator dynamics with feedback-evolving games, where the payoffs switch between two different matrices. Although each payoff matrix o… ▽ More

    Submitted 28 August, 2022; originally announced August 2022.

  29. Spin excitations in the quantum dipolar magnet Yb(BaBO$_3$)$_3$

    Authors: C. Y. Jiang, Y. X. Yang, Y. X. Gao, Z. T. Wan, Z. H. Zhu, T. Shiroka, C. S. Chen, Q. Wu, X. Li, J. C. Jiao, K. W. Chen, Y. Bao, Z. M. Tian, L. Shu

    Abstract: We report results of magnetization, specific-heat and muon-spin relaxation measurements on single crystals of disorder-free Yb$^{3+}$ triangular lattice Yb(BaBO$_3$)$_3$. The magnetization experiments show anisotropic magnetic properties with Curie-Weiss temperatures $θ_{\perp}=-1.40$ K ($H \perp c$) and $θ_{\parallel}=-1.16$ K ($H \parallel c$) determined from low temperature data. The absence of… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: accepted by Phys. Rev. B

  30. arXiv:2203.13238  [pdf, other]

    cs.CV cs.AI

    Open-set Recognition via Augmentation-based Similarity Learning

    Authors: Sepideh Esmaeilpour, Lei Shu, Bing Liu

    Abstract: The primary assumption of conventional supervised learning or classification is that the test samples are drawn from the same distribution as the training samples, which is called closed set learning or classification. In many practical scenarios, this is not the case because there are unknowns or unseen class samples in the test data, which is called the open set scenario, and the unknowns need t… ▽ More

    Submitted 21 August, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

  31. arXiv:2203.12839  [pdf]

    cond-mat.str-el

    Probing FeSi, a d-electron topological Kondo insulator candidate, with magnetic field, pressure, and microwaves

    Authors: Alexander Breindel, Yuhang Deng, Camilla M. Moir, Yuankan Fang, Sheng Ran, Hongbo Lou, Shubin Li, Qiaoshi Zeng, Lei Shu, Christian T. Wolowiec, Ivan K. Schuller, Priscila F. S. Rosa, Zachary Fisk, John Singleton, M. Brian Maple

    Abstract: Recently, evidence for a conducting surface state below 19 K was reported for the correlated d-electron small gap semiconductor FeSi. In the work reported herein, the conducting surface state and the bulk phase of FeSi were probed via electrical resistivity measurements as a function of temperature T, magnetic field B to 60 T and pressure P to 7.6 GPa, and by means of a magnetic field modulated mi… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  32. arXiv:2202.02976  [pdf, other]

    cs.CL cs.AI cs.LG

    Measuring and Reducing Model Update Regression in Structured Prediction for NLP

    Authors: Deng Cai, Elman Mansimov, Yi-An Lai, Yixuan Su, Lei Shu, Yi Zhang

    Abstract: Recent advances in deep learning have led to the rapid adoption of machine learning-based NLP models in a wide range of applications. Despite the continuous gain in accuracy, backward compatibility is also an important aspect for industrial applications, yet it has received little research attention. Backward compatibility requires that the new model does not regress on cases that were correctly handled… ▽ More

    Submitted 8 October, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: NeurIPS2022

  33. arXiv:2202.01924  [pdf, other]

    cs.CL cs.AI

    Zero-Shot Aspect-Based Sentiment Analysis

    Authors: Lei Shu, Hu Xu, Bing Liu, Jiahua Chen

    Abstract: Aspect-based sentiment analysis (ABSA) typically requires in-domain annotated data for supervised training/fine-tuning. It is a big challenge to scale ABSA to a large number of new domains. This paper aims to train a unified model that can perform zero-shot ABSA without using any annotated data for a new domain. We propose a method called contrastive post-training on review Natural Language Infere… ▽ More

    Submitted 14 February, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

  34. arXiv:2201.12978  [pdf, other]

    cond-mat.str-el

    Muon Spin Relaxation Study of Spin Dynamics in Quantum Spin Liquid Candidate H$_3$LiIr$_2$O$_6$

    Authors: Yan-Xing Yang, Liang-Long Huang, Zi-Hao Zhu, Chang-Sheng Chen, Qiong Wu, Zhao-Feng Ding, Cheng Tan, Pabi K. Biswas, Adrian D. Hillier, You-Guo Shi, Da-Peng Yu, Cai Liu, Le Wang, Fei Ye, Jia-Wei Mei, Lei Shu

    Abstract: We present detailed thermodynamic and muon spin relaxation ($μ$SR) studies of quantum spin liquid (QSL) candidate H$_3$LiIr$_2$O$_6$. In agreement with the low temperature thermodynamic evidence (e.g., bulk magnetization and heat capacity) for the absence of magnetic transition, zero-field (ZF)-$μ$SR measurements indicate the absence of static magnetic ordering or spin freezing down to our l… ▽ More

    Submitted 30 January, 2022; originally announced January 2022.

  35. Three-dimensional Sandglass Magnet with Non-Kramers ions

    Authors: Yan-Xing Yang, Yao Wang, Zhao-Feng Ding, Adrian D. Hillier, Lei Shu

    Abstract: Magnetic susceptibility, specific heat, and muon spin relaxation ($μ$SR) measurements have been performed on a newly synthesized three-dimensional sandglass-type lattice Tm$_3$SbO$_7$, where two inequivalent sets of non-Kramers Tm$^{3+}$ ions (Tm$^{3+}_1$ and Tm$^{3+}_2)$ show crystal electrical field effect at different temperature ranges. The existence of an ordered or a glassy state down to 0.1… ▽ More

    Submitted 3 May, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

  36. arXiv:2112.11646  [pdf, other]

    quant-ph cond-mat.str-el

    Contextuality in infinite one-dimensional translation-invariant local Hamiltonians: strengths and limits

    Authors: Kaiyan Yang, Xiao Zeng, Yujing Luo, Guowu Yang, Lan Shu, Miguel Navascués, Zizhu Wang

    Abstract: In recent years there has been a growing interest in treating many-body systems as Bell scenarios, where lattice sites play the role of distant parties and only near-neighbor statistics are accessible. We investigate contextuality arising from three Bell scenarios in infinite, translation-invariant 1D models: nearest-neighbor with two dichotomic observables per site; nearest- and next-to-nearest n… ▽ More

    Submitted 21 July, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

    Comments: 16 pages, updated version with new results

    Journal ref: npj Quantum Information 8, 89 (2022)

  37. arXiv:2112.10021  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Continual Learning with Knowledge Transfer for Sentiment Classification

    Authors: Zixuan Ke, Bing Liu, Hao Wang, Lei Shu

    Abstract: This paper studies continual learning (CL) for sentiment classification (SC). In this setting, the CL system learns a sequence of SC tasks incrementally in a neural network, where each task builds a classifier to classify the sentiment of reviews of a particular product category or domain. Two natural questions are: Can the system transfer the knowledge learned in the past from the previous tasks… ▽ More

    Submitted 18 December, 2021; originally announced December 2021.

    Journal ref: ECML-PKDD 2020

  38. arXiv:2112.06523  [pdf]

    cond-mat.str-el cond-mat.mtrl-sci cond-mat.supr-con

    Fluctuating magnetic droplets immersed in a sea of quantum spin liquid

    Authors: Z. H. Zhu, B. L. Pan, L. P. Nie, J. M. Ni, Y. X. Yang, C. S. Chen, Y. Y. Huang, E. J. Cheng, Y. J. Yu, A. D. Hillier, X. H. Chen, T. Wu, Y. Zhou, S. Y. Li, L. Shu

    Abstract: The search for quantum spin liquid (QSL), an exotic magnetic state with strongly-fluctuating and highly-entangled spins down to zero temperature, is a main theme in current condensed matter physics. However, there is no smoking-gun evidence for deconfined spinons in any QSL candidate so far. The disorders and competing exchange interactions may prevent the formation of an ideal QSL state on frustra… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Journal ref: The Innovation 4, 100459 (2023)

  39. arXiv:2112.06252  [pdf, ps, other]

    math.FA math.CA

    The factorizations of $H^ρ(\mathbb{R}^n)$ via multilinear Calderón-Zygmund operators on weighted Lebesgue spaces

    Authors: Dinghuai Wang, Rongxiang Zhu, Lisheng Shu

    Abstract: We extend the recently much-studied Hardy factorization theorems to the weighted case. The key point of this paper is to establish the factorization theorems without individual conditions on the weight functions. As a direct application, we obtain the characterizations of $BMO(\mathbb{R}^n)$ space and Lipschitz spaces via the weighted boundedness of commutators of multilinear Calderón-Zygmund operato… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

    Comments: 22 pages

  40. arXiv:2112.05970  [pdf, other]

    cond-mat.supr-con

    Muon spin rotation and relaxation study on topological noncentrosymmetric superconductor PbTaSe$_2$

    Authors: Z. H. Zhu, C. Tan, J. Zhang, P. K. Biswas, A. D. Hillier, M. X. Wang, Y. X. Yang, C. S. Chen, Z. F. Ding, S. Y. Li, L. Shu

    Abstract: Topological superconductivity is an exotic phenomenon due to the symmetry-protected topological surface state, in which a quantum system has an energy gap in the bulk but supports gapless excitations confined to its boundary. Symmetries including central and time-reversal (TRS), along with their relations with topology, are crucial for topological superconductivity. We report muon spin relaxation/… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

  41. arXiv:2112.02714  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    CLASSIC: Continual and Contrastive Learning of Aspect Sentiment Classification Tasks

    Authors: Zixuan Ke, Bing Liu, Hu Xu, Lei Shu

    Abstract: This paper studies continual learning (CL) of a sequence of aspect sentiment classification (ASC) tasks in a particular CL setting called domain incremental learning (DIL). Each task is from a different domain or product. The DIL setting is particularly suited to ASC because in testing the system need not know the task/domain to which the test data belongs. To our knowledge, this setting has not b… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Journal ref: EMNLP 2021

  42. arXiv:2112.02706  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning

    Authors: Zixuan Ke, Bing Liu, Nianzu Ma, Hu Xu, Lei Shu

    Abstract: Continual learning (CL) learns a sequence of tasks incrementally with the goal of achieving two main objectives: overcoming catastrophic forgetting (CF) and encouraging knowledge transfer (KT) across tasks. However, most existing techniques focus only on overcoming CF and have no mechanism to encourage KT, and thus do not do well in KT. Although several papers have tried to deal with both CF and K… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Journal ref: NeurIPS 2021

  43. arXiv:2111.04198  [pdf, other]

    cs.CL

    TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

    Authors: Yixuan Su, Fangyu Liu, Zaiqiao Meng, Tian Lan, Lei Shu, Ehsan Shareghi, Nigel Collier

    Abstract: Masked language models (MLMs) such as BERT and RoBERTa have revolutionized the field of Natural Language Understanding in the past few years. However, existing pre-trained MLMs often output an anisotropic distribution of token representations that occupies a narrow subset of the entire representation space. Such token representations are not ideal, especially for tasks that demand discriminative s…

    Submitted 28 April, 2022; v1 submitted 7 November, 2021; originally announced November 2021.

    Comments: Camera-ready for NAACL 2022

  44. arXiv:2111.02724  [pdf]

    cs.CV cs.AI

    Tea Chrysanthemum Detection under Unstructured Environments Using the TC-YOLO Model

    Authors: Chao Qi, Junfeng Gao, Simon Pearson, Helen Harman, Kunjie Chen, Lei Shu

    Abstract: Tea chrysanthemum detection at its flowering stage is one of the key components for selective chrysanthemum harvesting robot development. However, it is a challenge to detect flowering chrysanthemums under unstructured field environments given the variations in illumination, occlusion and object scale. In this context, we propose a highly fused and lightweight deep learning architecture based on Y…

    Submitted 4 November, 2021; originally announced November 2021.

  45. arXiv:2110.00964  [pdf, ps, other]

    math.FA math.CA

    New function classes of Morrey-Campanato type and their applications

    Authors: Dinghuai Wang, Lisheng Shu

    Abstract: The aim of this paper is to introduce and investigate some new function classes of Morrey-Campanato type. Let $0<p<\infty$ and $0\leq λ<n+p$. We say that $f\in \mathcal{\bar{L}}^{p,λ}(Ω)$ if $$\sup_{x_{0}\in Ω,ρ>0}ρ^{-λ}\int_{Ω(x_{0},ρ)}\big|f(x)-|f|_{Ω(x_{0},ρ)}\big|^pdx<\infty,$$ where $Ω(x_{0},ρ)=Q(x_{0},ρ)\cap Ω$ and $Q(x,ρ)$ denotes the cube of $\mathbb{R}^n$. Some basic properties and ch…

    Submitted 3 October, 2021; originally announced October 2021.

    Comments: 29 pages

  46. arXiv:2109.14739  [pdf, other]

    cs.CL

    Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

    Authors: Yixuan Su, Lei Shu, Elman Mansimov, Arshit Gupta, Deng Cai, Yi-An Lai, Yi Zhang

    Abstract: Pre-trained language models have been recently shown to benefit task-oriented dialogue (TOD) systems. Despite their success, existing methods often formulate this task as a cascaded generation problem which can lead to error accumulation across different sub-tasks and greater data annotation overhead. In this study, we present PPTOD, a unified plug-and-play model for task-oriented dialogue. In add…

    Submitted 1 March, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: Camera-ready for ACL2022 main conference

  47. Zero-Shot Out-of-Distribution Detection Based on the Pre-trained Model CLIP

    Authors: Sepideh Esmaeilpour, Bing Liu, Eric Robertson, Lei Shu

    Abstract: In an out-of-distribution (OOD) detection problem, samples of known classes (also called in-distribution classes) are used to train a special classifier. In testing, the classifier can (1) classify the test samples of known classes to their respective classes and also (2) detect samples that do not belong to any of the known classes (i.e., they belong to some unknown or OOD classes). This paper stu…

    Submitted 22 March, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  48. arXiv:2102.09271  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Intrinsic new properties of a quantum spin liquid

    Authors: Yanxing Yang, Xin Li, Cheng Tan, Zihao Zhu, Jian Zhang, Zhaofeng Ding, Qiong Wu, Changshen Chen, Toni Shiroka, Yuanhua Xia, Douglas E. MacLaughlin, Chandra M. Varma, Lei Shu

    Abstract: Quantum fluctuations are expected to lead to highly entangled spin-liquid states in certain two-dimensional spin-1/2 compounds. We have synthesized and measured thermodynamic properties and muon spin relaxation rates in the copper-based two-dimensional triangular-lattice spin liquids Lu$_3$Cu$_2$Sb$_3$O$_{14}$ and Lu$_3$CuZnSb$_3$O$_{14}$. The former is the least disordered of this kind discovered…

    Submitted 21 July, 2022; v1 submitted 18 February, 2021; originally announced February 2021.

  49. arXiv:2011.00169  [pdf, other]

    cs.CL

    Understanding Pre-trained BERT for Aspect-based Sentiment Analysis

    Authors: Hu Xu, Lei Shu, Philip S. Yu, Bing Liu

    Abstract: This paper analyzes the pre-trained hidden representations learned from reviews on BERT for tasks in aspect-based sentiment analysis (ABSA). Our work is motivated by the recent progress in BERT-based language models for ABSA. However, it is not clear how the general proxy task of (masked) language model trained on unlabeled corpus without annotations of aspects or opinions can provide important fe…

    Submitted 30 October, 2020; originally announced November 2020.

    Comments: COLING 2020

  50. arXiv:2009.12046  [pdf, other]

    cs.CL

    Controllable Text Generation with Focused Variation

    Authors: Lei Shu, Alexandros Papangelis, Yi-Chia Wang, Gokhan Tur, Hu Xu, Zhaleh Feizollahi, Bing Liu, Piero Molino

    Abstract: This work introduces Focused-Variation Network (FVN), a novel model to control language generation. The main problems in previous controlled language generation models range from the difficulty of generating text according to the given attributes, to the lack of diversity of the generated texts. FVN addresses these issues by learning disjoint discrete latent spaces for each attribute inside codebo…

    Submitted 25 September, 2020; originally announced September 2020.