Showing 251–300 of 3,162 results for author: Zhao, X

  1. arXiv:2402.18191  [pdf, other]

    cs.CL

    Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation

    Authors: Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Hongxia Ma, Li Zhang, Hao Yang, Tong Xiao

    Abstract: With contributions from the open-source community, a vast amount of instruction tuning (IT) data has emerged. Given the significant resource allocation required by training and evaluating models, it is advantageous to have an efficient method for selecting high-quality IT data. However, existing methods for instruction data selection have limitations such as relying on fragile external APIs, being… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  2. arXiv:2402.18166  [pdf, other]

    cs.IR

    Sequence-level Semantic Representation Fusion for Recommender Systems

    Authors: Lanling Xu, Zhen Tian, Bingqian Li, Junjie Zhang, Jinpeng Wang, Mingchen Cai, Wayne Xin Zhao

    Abstract: With the rapid development of recommender systems, there is increasing side information that can be employed to improve the recommendation performance. Specifically, we focus on the utilization of the associated textual data of items (e.g., product title) and study how text features can be effectively fused with ID features in sequential recommendation. However, there exists distinct data charact… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 8 pages, 5 figures

  3. arXiv:2402.18144  [pdf]

    cs.AI cs.CY

    Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information

    Authors: Seungjong Sun, Eungu Lee, Dongyan Nan, Xiangying Zhao, Wonbyung Lee, Bernard J. Jansen, Jang Hyun Kim

    Abstract: Large language models exhibit societal biases associated with demographic information, including race, gender, and others. Endowing such language models with personalities based on demographic data can enable generating opinions that align with those of humans. Building on this idea, we propose "random silicon sampling," a method to emulate the opinions of the human population sub-group. Our study… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 25 pages, 4 figures, 19 Tables

    ACM Class: I.2.7

  4. arXiv:2402.18099  [pdf, other]

    cs.CL cs.AI

    Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models

    Authors: Derong Xu, Ziheng Zhang, Zhihong Zhu, Zhenxi Lin, Qidong Liu, Xian Wu, Tong Xu, Wanyu Wang, Yuyang Ye, Xiangyu Zhao, Yefeng Zheng, Enhong Chen

    Abstract: Model editing aims to precisely alter the behaviors of large language models (LLMs) in relation to specific knowledge, while leaving unrelated knowledge intact. This approach has proven effective in addressing issues of hallucination and outdated information in LLMs. However, the potential of using model editing to modify knowledge in the medical field remains largely unexplored, even though resol… ▽ More

    Submitted 4 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  5. arXiv:2402.17564  [pdf, other]

    cs.CL

    Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with Gradient-based Model Optimizers

    Authors: Xinyu Tang, Xiaolei Wang, Wayne Xin Zhao, Siyuan Lu, Yaliang Li, Ji-Rong Wen

    Abstract: Automatic prompt optimization is an important approach to improving the performance of large language models (LLMs). Recent research demonstrates the potential of using LLMs as prompt optimizers, which can generate improved task prompts via iterative refinement. In this paper, we propose a novel perspective to investigate the design of LLM-based prompt optimizers, by drawing an analogy with gradie… ▽ More

    Submitted 16 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  6. arXiv:2402.17505  [pdf, other]

    cs.IR cs.CL

    BASES: Large-scale Web Search User Simulation with Large Language Model based Agents

    Authors: Ruiyang Ren, Peng Qiu, Yingqi Qu, Jing Liu, Wayne Xin Zhao, Hua Wu, Ji-Rong Wen, Haifeng Wang

    Abstract: Due to the excellent capacities of large language models (LLMs), it becomes feasible to develop LLM-based agents for reliable user simulation. Considering the scarcity and limitations (e.g., privacy issues) of real user data, in this paper, we conduct large-scale user simulation for web search, to improve the analysis and modeling of user search behavior. Specifically, we propose BASES, a novel user simula… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  7. arXiv:2402.17497  [pdf, other]

    cs.CL cs.IR

    REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering

    Authors: Yuhao Wang, Ruiyang Ren, Junyi Li, Wayne Xin Zhao, Jing Liu, Ji-Rong Wen

    Abstract: Considering the limited internal parametric knowledge, retrieval-augmented generation (RAG) has been widely used to extend the knowledge scope of large language models (LLMs). Despite the extensive efforts on RAG research, in existing methods, LLMs cannot precisely assess the relevance of retrieved documents, thus likely leading to misleading or even incorrect utilization of external knowledge (i.… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  8. arXiv:2402.17334  [pdf, other]

    cs.IR cs.AI

    BiVRec: Bidirectional View-based Multimodal Sequential Recommendation

    Authors: Jiaxi Hu, Jingtong Gao, Xiangyu Zhao, Yuehong Hu, Yuxuan Liang, Yiqi Wang, Ming He, Zitao Liu, Hongzhi Yin

    Abstract: The integration of multimodal information into sequential recommender systems has attracted significant attention in recent research. In the initial stages of multimodal sequential recommendation models, the mainstream paradigm was ID-dominant recommendations, wherein multimodal information was fused as side information. However, due to their limitations in terms of transferability and information… ▽ More

    Submitted 4 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  9. arXiv:2402.17124  [pdf, other]

    cs.CL

    Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models

    Authors: Xinran Zhao, Hongming Zhang, Xiaoman Pan, Wenlin Yao, Dong Yu, Tongshuang Wu, Jianshu Chen

    Abstract: For an LLM to be trustworthy, its confidence level should be well-calibrated with its actual performance. While it is now common sense that LLM performances are greatly impacted by prompts, the confidence calibration in prompting LLMs has yet to be thoroughly explored. In this paper, we explore how different prompting strategies influence LLM confidence calibration and how it could be improved. We… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 17 pages, 10 figures

  10. arXiv:2402.16500  [pdf, ps, other]

    cond-mat.mes-hall

    The Map between Symmetries and Orbital Rules to Realize Tunable Band Gap in Quantum Anomalous Hall Effect Material

    Authors: Jiaohong Shu, Xinxin Zhao, Weiqin Fan, Lili Wang, Guanglong Chen, Jianbao Wu, Yiming Mi

    Abstract: We establish the map between symmetries and orbital rules to realize tunable band gap in quantum anomalous Hall effect material. This band gap is determined by the SOC between local orbitals associated with band crossing, which is constrained by at least one of lattice symmetries. The band gap could be turned on/off by breaking or keeping corresponding lattice symmetry through rotation of magnetiz… ▽ More

    Submitted 24 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  11. arXiv:2402.16438  [pdf, other]

    cs.CL

    Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

    Authors: Tianyi Tang, Wenyang Luo, Haoyang Huang, Dongdong Zhang, Xiaolei Wang, Xin Zhao, Furu Wei, Ji-Rong Wen

    Abstract: Large language models (LLMs) demonstrate remarkable multilingual capabilities without being pre-trained on specially curated multilingual parallel corpora. It remains a challenging problem to explain the underlying mechanisms by which LLMs process multilingual texts. In this paper, we delve into the composition of Transformer architectures in LLMs to pinpoint language-specific regions. Specially,… ▽ More

    Submitted 6 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024

  12. arXiv:2402.16402  [pdf, other]

    cs.LG cs.AI

    Graph Learning with Distributional Edge Layouts

    Authors: Xinjian Zhao, Chaolong Ying, Tianshu Yu

    Abstract: Graph Neural Networks (GNNs) learn from graph-structured data by passing local messages between neighboring nodes along edges on certain topological layouts. Typically, these topological layouts in modern GNNs are deterministically computed (e.g., attention-based GNNs) or locally sampled (e.g., GraphSage) under heuristic assumptions. In this paper, we for the first time pose that these layouts can… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 20 pages, 10 figures

  13. arXiv:2402.16371  [pdf, other]

    eess.IV

    Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction

    Authors: Wen-Yang Lu, Eduardo Pavez, Antonio Ortega, Xin Zhao, Shan Liu

    Abstract: Current video coding standards, including H.264/AVC, HEVC, and VVC, employ discrete cosine transform (DCT), discrete sine transform (DST), and secondary Karhunen-Loeve transforms (KLTs) to decorrelate the intra-prediction residuals. However, the efficiency of these transforms in decorrelation can be limited when the signal has a non-smooth and non-periodic structure, such as those occurring in tex… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures

  14. arXiv:2402.16358  [pdf, other]

    cs.LG cs.CL cs.IR

    An Integrated Data Processing Framework for Pretraining Foundation Models

    Authors: Yiding Sun, Feng Wang, Yutao Zhu, Wayne Xin Zhao, Jiaxin Mao

    Abstract: The ability of the foundation models heavily relies on large-scale, diverse, and high-quality pretraining data. In order to improve data quality, researchers and practitioners often have to manually curate datasets from different sources and develop a dedicated data cleansing pipeline for each data repository. Lacking a unified data processing framework, this process is repetitive and cumbersome. T… ▽ More

    Submitted 23 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures; accepted by SIGIR'24 demo track

  15. arXiv:2402.16346  [pdf, other]

    cs.LG math.AT

    Boosting Graph Pooling with Persistent Homology

    Authors: Chaolong Ying, Xinjian Zhao, Tianshu Yu

    Abstract: Recently, there has been an emerging trend to integrate persistent homology (PH) into graph neural networks (GNNs) to enrich expressive power. However, naively plugging PH features into GNN layers always results in marginal improvement with low interpretability. In this paper, we investigate a novel mechanism for injecting global topological invariance into pooling layers using PH, motivated by th… ▽ More

    Submitted 1 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  16. arXiv:2402.15784  [pdf, other]

    cs.CV

    IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer

    Authors: Dongqi Fan, Xin Zhao, Liang Chang

    Abstract: Recently, the contrastive learning paradigm has achieved remarkable success in high-level tasks such as classification, detection, and segmentation. However, contrastive learning applied in low-level tasks, like image restoration, is limited, and its effectiveness is uncertain. This raises a question: Why does the contrastive learning paradigm not yield satisfactory results in image restoration? I… ▽ More

    Submitted 7 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  17. arXiv:2402.15429  [pdf, other]

    cs.CV cs.AI cs.LG

    ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

    Authors: Yi Zhang, Yun Tang, Wenjie Ruan, Xiaowei Huang, Siddartha Khastgir, Paul Jennings, Xingyu Zhao

    Abstract: Text-to-Image (T2I) Diffusion Models (DMs) have shown impressive abilities in generating high-quality images based on simple text descriptions. However, as is common with many Deep Learning (DL) models, DMs are subject to a lack of robustness. While there are attempts to evaluate the robustness of T2I DMs as a binary or worst-case problem, they cannot answer how robust in general the model is when… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  18. arXiv:2402.15370  [pdf, other]

    cs.CL cs.AI cs.LG

    Dual Encoder: Exploiting the Potential of Syntactic and Semantic for Aspect Sentiment Triplet Extraction

    Authors: Xiaowei Zhao, Yong Zhou, Xiujuan Xu

    Abstract: Aspect Sentiment Triple Extraction (ASTE) is an emerging task in fine-grained sentiment analysis. Recent studies have employed Graph Neural Networks (GNN) to model the syntax-semantic relationships inherent in triplet elements. However, they have yet to fully tap into the vast potential of syntactic and semantic information within the ASTE task. In this work, we propose a \emph{Dual Encoder: Explo… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted by COLING 2024

  19. arXiv:2402.13667  [pdf, other]

    cs.CL

    GCOF: Self-iterative Text Generation for Copywriting Using Large Language Model

    Authors: Jianghui Zhou, Ya Gao, Jie Liu, Xuemin Zhao, Zhaohua Yang, Yue Wu, Lirong Shi

    Abstract: Large language models (LLMs) such as ChatGPT have substantially simplified the generation of marketing copy, yet producing content satisfying domain-specific requirements, such as effectively engaging customers, remains a significant challenge. In this work, we introduce the Genetic Copy Optimization Framework (GCOF) designed to enhance both efficiency and engagement of marketing copy creation. We… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 8 pages, 5 figures, 1 table

  20. arXiv:2402.13590  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el physics.chem-ph quant-ph

    Tunable topological phases in nanographene-based spin-1/2 alternating-exchange Heisenberg chains

    Authors: Chenxiao Zhao, Gonçalo Catarina, Jin-Jiang Zhang, João C. G. Henriques, Lin Yang, Ji Ma, Xinliang Feng, Oliver Gröning, Pascal Ruffieux, Joaquín Fernández-Rossier, Roman Fasel

    Abstract: Unlocking the potential of topological order within many-body spin systems has long been a central pursuit in the realm of quantum materials. Despite extensive efforts, the quest for a versatile platform enabling site-selective spin manipulation, essential for tuning and probing diverse topological phases, has persisted. Here, we utilize on-surface synthesis to construct spin-1/2 alternating-excha… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  21. arXiv:2402.13577  [pdf, other]

    cs.CL

    BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

    Authors: Xueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong

    Abstract: Multimodal reasoning stands as a pivotal capability for large vision-language models (LVLMs). The integration with Domain-Specific Languages (DSL), offering precise visual representations, equips these models with the opportunity to execute more accurate reasoning in complex and professional domains. However, the vanilla Chain-of-Thought (CoT) prompting method faces challenges in effectively lever… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint

  22. arXiv:2402.13508  [pdf, other]

    astro-ph.HE astro-ph.GA

    PEARLS: NuSTAR and XMM-Newton Extragalactic Survey of the JWST North Ecliptic Pole Time-Domain Field II

    Authors: Xiurui Zhao, Francesca Civano, Christopher N. A. Willmer, Silvia Bonoli, Chien-Ting Chen, Samantha Creech, Renato Dupke, Francesca M. Fornasini, Rolf A. Jansen, Satoshi Kikuta, Anton M. Koekemoer, Sibasish Laha, Stefano Marchesi, Rosalia O'Brien, Ross Silver, S. P. Willner, Rogier A. Windhorst, Haojing Yan, Jailson Alcaniz, Narciso Benitez, Saulo Carneiro, Javier Cenarro, David Cristóbal-Hornillos, Alessandro Ederoclite, Antonio Hernán-Caballero, et al. (8 additional authors not shown)

    Abstract: We present the second NuSTAR and XMM-Newton extragalactic survey of the JWST North Ecliptic Pole (NEP) Time-Domain Field (TDF). The first NuSTAR NEP-TDF survey (Zhao et al. 2021) had 681 ks total exposure time executed in NuSTAR cycle 5, in 2019 and 2020. This second survey, acquired from 2020 to 2022 in cycle 6, adds 880 ks of NuSTAR exposure time. The overall NuSTAR NEP-TDF survey is the most se… ▽ More

    Submitted 21 April, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 37 pages, 27 figures, Accepted by ApJ

    Journal ref: ApJ 965, 188 (2024)

  23. arXiv:2402.13455  [pdf]

    astro-ph.SR

    Light Bridges and Solar Active Region Evolution Processes

    Authors: Fuyu Li, Changhui Rao, Xinhua Zhao, Yang Guo, Xiaoying Gong, Yuhao Chen, Nanbin Xiang, Huaning Wang

    Abstract: The formation mechanism of light bridges (LBs) is strongly related to the dynamic evolution of solar active regions (ARs). To study the relationship between LB formation and AR evolution phases, we employ 109 LB samples from 69 ARs in 2014 using observational data from the Helioseismic and Magnetic Imager on board the Solar Dynamics Observatory (HMI/SDO). LBs are well matched with the weak field l… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  24. arXiv:2402.13033  [pdf, other]

    cs.LG cs.IR cs.SI

    Enhancing Real-World Complex Network Representations with Hyperedge Augmentation

    Authors: Xiangyu Zhao, Zehui Li, Mingzhu Shen, Guy-Bart Stan, Pietro Liò, Yiren Zhao

    Abstract: Graph augmentation methods play a crucial role in improving the performance and enhancing generalisation capabilities in Graph Neural Networks (GNNs). Existing graph augmentation methods mainly perturb the graph structures and are usually limited to pairwise node relations. These methods cannot fully address the complexities of real-world large-scale networks that often involve higher-order node r… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Preprint. Under review. 17 pages, 4 figures, 14 tables. arXiv admin note: text overlap with arXiv:2306.05108

  25. arXiv:2402.12948  [pdf, other]

    cs.CL

    GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick

    Authors: Jiayi Fu, Xuandong Zhao, Ruihan Yang, Yuansen Zhang, Jiangjie Chen, Yanghua Xiao

    Abstract: Large language models (LLMs) excellently generate human-like text, but also raise concerns about misuse in fake news and academic dishonesty. Decoding-based watermark, particularly the GumbelMax-trick-based watermark (GM watermark), is a standout solution for safeguarding machine-generated texts due to its notable detectability. However, GM watermark encounters a major challenge with generation div… ▽ More

    Submitted 28 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  26. arXiv:2402.12573  [pdf, ps, other]

    math.AG

    Fully faithful functors, skyscraper sheaves, and birational equivalence

    Authors: Chunyi Li, Xun Lin, Xiaolei Zhao

    Abstract: Let $X$ and $Y$ be two smooth projective varieties such that there is a fully faithful exact functor from $D^b(\mathrm{Coh}(X))$ to $D^b(\mathrm{Coh}(Y))$. We show that $X$ and $Y$ are birationally equivalent if the functor maps one skyscraper sheaf to a skyscraper sheaf. Further assuming that $X$ and $Y$ are of the same dimension, we show that if $X$ has ample canonical bundle and… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 20 pages. Comments are very welcome!

  27. arXiv:2402.11436  [pdf, other]

    cs.CL cs.AI

    Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement

    Authors: Wenda Xu, Guanglei Zhu, Xuandong Zhao, Liangming Pan, Lei Li, William Yang Wang

    Abstract: Recent studies show that large language models (LLMs) improve their performance through self-feedback on certain tasks while degrading on others. We discovered that this contrast is due to LLM's bias in evaluating their own output. In this paper, we formally define LLM's self-bias - the tendency to favor its own generation - using two statistics. We analyze six LLMs (GPT-4, GPT-3.5, Gemini, LLaMA2… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  28. arXiv:2402.11207  [pdf, ps, other]

    hep-ex

    Search for the production of deuterons and antideuterons in e^+e^- annihilation at center-of-mass energies between 4.13 and 4.70 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (593 additional authors not shown)

    Abstract: Using a data sample of $e^+e^-$ collision data corresponding to an integrated luminosity of 19 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, we search for the production of deuterons and antideuterons via $e^+e^-\to pp\pi^-\bar{d}+c.c.$ for the first time at center-of-mass energies between 4.13 and 4.70 GeV. No significant signal is observed and the upper limit of the… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  29. arXiv:2402.11163  [pdf, other]

    cs.CL

    KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph

    Authors: Jinhao Jiang, Kun Zhou, Wayne Xin Zhao, Yang Song, Chen Zhu, Hengshu Zhu, Ji-Rong Wen

    Abstract: In this paper, we aim to improve the reasoning ability of large language models (LLMs) over knowledge graphs (KGs) to answer complex questions. Inspired by existing methods that design the interaction strategy between LLMs and KG, we propose an autonomous LLM-based agent framework, called KG-Agent, which enables a small LLM to actively make decisions until finishing the reasoning process over KGs.… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: work in progress; efficient 7B LLM-based agent

  30. arXiv:2402.10468  [pdf, other]

    cs.LG cs.AI

    Adversarial Curriculum Graph Contrastive Learning with Pair-wise Augmentation

    Authors: Xinjian Zhao, Liang Zhang, Yang Liu, Ruocheng Guo, Xiangyu Zhao

    Abstract: Graph contrastive learning (GCL) has emerged as a pivotal technique in the domain of graph representation learning. A crucial aspect of effective GCL is the caliber of generated positive and negative samples, which is intrinsically dictated by their resemblance to the original data. Nevertheless, precise control over similarity during sample generation presents a formidable challenge, often impedi… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  31. arXiv:2402.10405  [pdf, other]

    cond-mat.soft physics.bio-ph

    Theory of Wetting Dynamics with Surface Binding

    Authors: Xueping Zhao, Susanne Liese, Alf Honigmann, Frank Jülicher, Christoph A. Weber

    Abstract: Biomolecules, such as proteins and RNAs, can phase separate in the cytoplasm of cells to form biological condensates. Such condensates are liquid-like droplets that can wet biological surfaces such as membranes. Many molecules that can participate in phase separation can also reversibly bind to membrane surfaces. When a droplet wets such a surface, these molecules can diffuse both inside the dropl… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  32. arXiv:2402.10244  [pdf, other]

    quant-ph physics.optics

    Entanglement generation in capacitively coupled Transmon-cavity system

    Authors: Jian-Zhuang Wu, Lian-E Lu, Xin-Yu Zhao, Yong-Hong Ma

    Abstract: In this paper, the higher energy levels of the transmon qubit are taken into consideration to investigate the continuous variable entanglement generation between the transmon qubit and the single-mode cavity. Based on the framework of cavity quantum electrodynamics, we show the entanglement generation depends on the driving field intensity, coupling strength, cavity field frequency, and qubit… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  33. arXiv:2402.10189  [pdf, other]

    cs.CL cs.LG

    Uncertainty Quantification for In-Context Learning of Large Language Models

    Authors: Chen Ling, Xujiang Zhao, Xuchao Zhang, Wei Cheng, Yanchi Liu, Yiyou Sun, Mika Oishi, Takao Osaki, Katsushi Matsuda, Jie Ji, Guangji Bai, Liang Zhao, Haifeng Chen

    Abstract: In-context learning has emerged as a groundbreaking ability of Large Language Models (LLMs) and revolutionized various fields by providing a few task-relevant demonstrations in the prompt. However, trustworthiness issues with LLMs' responses, such as hallucination, have also been actively discussed. Existing works have been devoted to quantifying the uncertainty in LLMs' responses, but they often overlo… ▽ More

    Submitted 28 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted to the main conference of NAACL 2024

  34. arXiv:2402.09910  [pdf, other]

    cs.CL cs.LG

    DE-COP: Detecting Copyrighted Content in Language Models Training Data

    Authors: André V. Duarte, Xuandong Zhao, Arlindo L. Oliveira, Lei Li

    Abstract: How can we detect if copyrighted content was used in the training process of a language model, considering that the training data is typically undisclosed? We are motivated by the premise that a language model is likely to identify verbatim excerpts from its training text. We propose DE-COP, a method to determine whether a piece of copyrighted content was included in training. DE-COP's core approa… ▽ More

    Submitted 25 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    ACM Class: I.2

  35. arXiv:2402.09651  [pdf, other]

    cs.SE cs.LG

    Practitioners' Challenges and Perceptions of CI Build Failure Predictions at Atlassian

    Authors: Yang Hong, Chakkrit Tantithamthavorn, Jirat Pasuksmit, Patanamon Thongtanunam, Arik Friedman, Xing Zhao, Anton Krasikov

    Abstract: Continuous Integration (CI) build failures could significantly impact the software development process and teams, such as delaying the release of new features and reducing developers' productivity. In this work, we report on an empirical study that investigates CI build failures throughout product development at Atlassian. Our quantitative analysis found that the repository dimension is the key fa… ▽ More

    Submitted 14 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  36. arXiv:2402.09543  [pdf, other]

    cs.IR

    Rethinking Large Language Model Architectures for Sequential Recommendations

    Authors: Hanbing Wang, Xiaorui Liu, Wenqi Fan, Xiangyu Zhao, Venkataramana Kini, Devendra Yadav, Fei Wang, Zhen Wen, Jiliang Tang, Hui Liu

    Abstract: Recently, sequential recommendation has been adapted to the LLM paradigm to enjoy the power of LLMs. LLM-based methods usually formulate recommendation information into natural language and the model is trained to predict the next item in an auto-regressive manner. Despite their notable success, the substantial computational overhead of inference poses a significant obstacle to their real-world ap… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 8 pages, 5 figures, conference

  37. arXiv:2402.07787  [pdf, other]

    cs.AI cs.CL

    Extensible Multi-Granularity Fusion Network for Aspect-based Sentiment Analysis

    Authors: Xiaowei Zhao, Yong Zhou, Xiujuan Xu, Yu Liu

    Abstract: Aspect-based Sentiment Analysis (ABSA) evaluates sentiment expressions within a text to comprehend sentiment information. Previous studies integrated external knowledge, such as knowledge graphs, to enhance the semantic features in ABSA models. Recent research has examined the use of Graph Neural Networks (GNNs) on dependency and constituent trees for syntactic analysis. With the ongoing developme… ▽ More

    Submitted 4 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 8 pages, 4 figures

  38. arXiv:2402.07031  [pdf, other]

    cs.SE cs.AI cs.LG

    Instance-Level Safety-Aware Fidelity of Synthetic Data and Its Calibration

    Authors: Chih-Hong Cheng, Paul Stöckel, Xingyu Zhao

    Abstract: Modeling and calibrating the fidelity of synthetic data is paramount in shaping the future of safe and reliable self-driving technology by offering a cost-effective and scalable alternative to real-world data collection. We focus on its role in safety-critical applications, introducing four types of instance-level fidelity that go beyond mere visual input characteristics. The aim is to ensure that… ▽ More

    Submitted 2 May, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  39. arXiv:2402.06639  [pdf, other]

    physics.soc-ph

    Social Vulnerabilities and Wildfire Evacuations: A Case Study of the 2019 Kincade Fire

    Authors: Yuran Sun, Ana Forrister, Erica D. Kuligowski, Ruggiero Lovreglio, Thomas J. Cova, Xilei Zhao

    Abstract: Vulnerable populations are disproportionately impacted by natural hazards like wildfires. It is crucial to develop equitable and effective evacuation strategies to meet their unique needs. While existing studies offer valuable insights, we need to improve our understanding of how vulnerabilities affect wildfire evacuation decision-making, as well as how this varies spatially. The goal of this stud… ▽ More

    Submitted 23 January, 2024; originally announced February 2024.

  40. arXiv:2402.06579  [pdf, ps, other]

    math.AG

    Some remarks about deformation theory and formality conjecture

    Authors: Huachen Chen, Laura Pertusi, Xiaolei Zhao

    Abstract: Using the algebraic criterion proved by Bandiera, Manetti and Meazzini, we show the formality conjecture for universally gluable objects with linearly reductive automorphism groups in the bounded derived category of a K3 surface. As an application, we prove the formality conjecture for polystable objects in the Kuznetsov components of Gushel--Mukai threefolds and quartic double solids.

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 15 pages, to appear in Annali dell'Università di Ferrara, special volume Edge Days: 2018-2022

  41. arXiv:2402.05864  [pdf, other]

    cs.CL cs.CR cs.LG

    Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs

    Authors: Xuandong Zhao, Lei Li, Yu-Xiang Wang

    Abstract: In this paper, we propose a new decoding method called Permute-and-Flip (PF) decoder. It enjoys robustness properties similar to the standard sampling decoder, but is provably up to 2x better in its quality-robustness tradeoff than sampling and never worse than any other decoder. We also design a cryptographic watermarking scheme analogous to Aaronson's Gumbel watermark, but naturally tailored for… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  42. arXiv:2402.05395  [pdf, other]

    stat.ME

    Efficient Estimation for Functional Accelerated Failure Time Model

    Authors: Changyu Liu, Wen Su, Kin-Yat Liu, Guosheng Yin, Xingqiu Zhao

    Abstract: We propose a functional accelerated failure time model to characterize effects of both functional and scalar covariates on the time to event of interest, and provide regularity conditions to guarantee model identifiability. For efficient estimation of model parameters, we develop a sieve maximum likelihood approach where parametric and nonparametric coefficients are bundled with an unknown baselin…

    Submitted 7 February, 2024; originally announced February 2024.

  43. arXiv:2402.04533  [pdf, ps, other]

    cs.CE

    Minimizing Block Incentive Volatility Through Verkle Tree-Based Dynamic Transaction Storage

    Authors: Xiongfei Zhao, Gerui Zhang, Hou-Wan Long, Yain-Whar Si

    Abstract: Transaction fees are a crucial revenue source for miners in public and consortium blockchains. However, while public blockchains have additional revenue streams, transaction fees serve as the primary income for miners in consortium blockchains formed by various financial institutions. These miners allocate different levels of computing resources to process transactions and earn corresponding fees.…

    Submitted 6 February, 2024; originally announced February 2024.

  44. arXiv:2402.04527  [pdf, other]

    cs.IR cs.AI

    RA-Rec: An Efficient ID Representation Alignment Framework for LLM-based Recommendation

    Authors: Xiaohan Yu, Li Zhang, Xin Zhao, Yue Wang, Zhongrui Ma

    Abstract: Large language models (LLM) have recently emerged as a powerful tool for a variety of natural language processing tasks, bringing a new surge of combining LLM with recommendation systems, termed as LLM-based RS. Current approaches generally fall into two main paradigms, the ID direct usage paradigm and the ID translation paradigm, noting their core weakness stems from lacking recommendation knowle…

    Submitted 19 March, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 10 pages

  45. BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision

    Authors: Xin Zhao, Shiyu Hu, Yipei Wang, Jing Zhang, Yimin Hu, Rongshuai Liu, Haibin Ling, Yin Li, Renshu Li, Kun Liu, Jiadong Li

    Abstract: Single object tracking (SOT) is a fundamental problem in computer vision, with a wide range of applications, including autonomous driving, augmented reality, and robot navigation. The robustness of SOT faces two main challenges: tiny target and fast motion. These challenges are especially manifested in videos captured by unmanned aerial vehicles (UAV), where the target is usually far away from the…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: This paper is published in IJCV (refer to DOI). Please cite the published IJCV

    Journal ref: Int J Comput Vis (2023)

  46. arXiv:2402.03829  [pdf, ps, other]

    hep-ex

    Precise Measurement of Born Cross Sections for $e^+e^-\to D\bar{D}$ and Observation of One Structure between $\sqrt{s} = 3.80-4.95$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (604 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 3.80 to 4.95 GeV, corresponding to an integrated luminosity of 20 fb$^{-1}$, a measurement of Born cross sections for the $e^+e^-\to D^{0}\bar{D}^{0}$ and $D^{+}D^{-}$ processes is presented with unprecedented precision. By performing a simultaneous fit to the dressed cross sections…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures, 1 tables, 1 Supplemental_Material

  47. arXiv:2402.03708  [pdf, other]

    cs.CV

    SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in Panchromatic Satellite Images

    Authors: Pengming Feng, Mingjie Xie, Hongning Liu, Xuanjia Zhao, Guangjun He, Xueliang Zhang, Jian Guan

    Abstract: Fine-grained ship instance segmentation in satellite images holds considerable significance for monitoring maritime activities at sea. However, existing datasets often suffer from the scarcity of fine-grained information or pixel-wise localization annotations, as well as the insufficient image diversity and variations, thus limiting the research of this task. To this end, we propose a benchmark da…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 14 pages, 9 figures

  48. arXiv:2402.03697  [pdf, other]

    cs.CV

    SHMC-Net: A Mask-guided Feature Fusion Network for Sperm Head Morphology Classification

    Authors: Nishchal Sapkota, Yejia Zhang, Sirui Li, Peixian Liang, Zhuo Zhao, Jingjing Zhang, Xiaomin Zha, Yiru Zhou, Yunxia Cao, Danny Z Chen

    Abstract: Male infertility accounts for about one-third of global infertility cases. Manual assessment of sperm abnormalities through head morphology analysis encounters issues of observer variability and diagnostic discrepancies among experts. Its alternative, Computer-Assisted Semen Analysis (CASA), suffers from low-quality sperm images, small datasets, and noisy class labels. We propose a new approach fo…

    Submitted 5 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Published on ISBI 2024

  49. arXiv:2402.02803  [pdf, other]

    cs.IR cs.AI cs.LG

    Large Language Model Distilling Medication Recommendation Model

    Authors: Qidong Liu, Xian Wu, Xiangyu Zhao, Yuanshao Zhu, Zijian Zhang, Feng Tian, Yefeng Zheng

    Abstract: The recommendation of medication is a vital aspect of intelligent healthcare systems, as it involves prescribing the most suitable drugs based on a patient's specific health needs. Unfortunately, many sophisticated models currently in use tend to overlook the nuanced semantics of medical data, while only relying heavily on identities. Furthermore, these models face significant challenges in handli…

    Submitted 5 February, 2024; originally announced February 2024.

  50. arXiv:2402.02643  [pdf, other]

    cs.DB cs.AI cs.CL cs.LG

    LLM-Enhanced Data Management

    Authors: Xuanhe Zhou, Xinyang Zhao, Guoliang Li

    Abstract: Machine learning (ML) techniques for optimizing data management problems have been extensively studied and widely deployed in recent five years. However traditional ML methods have limitations on generalizability (adapting to different scenarios) and inference ability (understanding the context). Fortunately, large language models (LLMs) have shown high generalizability and human-competitive abili…

    Submitted 4 February, 2024; originally announced February 2024.