Showing 1–50 of 143 results for author: tao, s

  1. arXiv:2407.05389  [pdf, other]

    cs.CV cs.AI

    Image-Conditional Diffusion Transformer for Underwater Image Enhancement

    Authors: Xingyang Nie, Su Pan, Xiaoyu Zhai, Shifei Tao, Fengzhong Qu, Biao Wang, Huilin Ge, Guojie Xiao

    Abstract: Underwater image enhancement (UIE) has attracted much attention owing to its importance for underwater operation and marine engineering. Motivated by the recent advance in generative models, we propose a novel UIE method based on image-conditional diffusion transformer (ICDT). Our method takes the degraded underwater image as the conditional input and converts it into latent space where ICDT is ap…

    Submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2407.03128  [pdf]

    cond-mat.mtrl-sci physics.optics

    Thorium doped strontium fluoride crystal: a unique candidate for solid nuclear optical clock material

    Authors: Qiaorui Gong, Shanming Li, Shulong Zhang, Siliang Tao, Guoliang Deng, Peixiong Zhang, Chengchun Zhao, Yin Hang, Shining Zhu, Longsheng Ma

    Abstract: We report a candidate with unique advantages in the cultivation of solid-state nuclear clock material, Th:SrF2 crystal. It not only has a segregation coefficient close to 1, which can achieve highly efficient and uniform doping of Th, but also ensures a high transmittance (~69% at 150 nm) while achieving extremely high doping concentration (232Th>6*10^20 cm^(-3)). In addition, SrF2 crystal will not…

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2407.01896  [pdf, other]

    cs.CL cs.IR

    LogEval: A Comprehensive Benchmark Suite for Large Language Models In Log Analysis

    Authors: Tianyu Cui, Shiyu Ma, Ziang Chen, Tong Xiao, Shimin Tao, Yilun Liu, Shenglin Zhang, Duoming Lin, Changchang Liu, Yuzhe Cai, Weibin Meng, Yongqian Sun, Dan Pei

    Abstract: Log analysis is crucial for ensuring the orderly and stable operation of information systems, particularly in the field of Artificial Intelligence for IT Operations (AIOps). Large Language Models (LLMs) have demonstrated significant potential in natural language processing tasks. In the AIOps domain, they excel in tasks such as anomaly detection, root cause analysis of faults, operations and maint…

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2406.07225  [pdf, other]

    quant-ph

    A generic and robust quantum agent inspired by deep meta-reinforcement learning

    Authors: Zibo Miao, Shihui Zhang, Yu Pan, Sibo Tao, Yu Chen

    Abstract: Deep reinforcement learning (deep RL) has enabled human- or superhuman- performances in various applications. Recently, deep RL has also been adopted to improve the performance of quantum control. However, a large volume of data is typically required to train the neural network in deep RL, making it inefficient compared with the traditional optimal quantum control method. Here, we thus develop a n…

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2406.00276  [pdf]

    cs.LG cs.AI cs.CE physics.data-an

    Non-destructive Degradation Pattern Decoupling for Ultra-early Battery Prototype Verification Using Physics-informed Machine Learning

    Authors: Shengyu Tao, Mengtian Zhang, Zixi Zhao, Haoyang Li, Ruifei Ma, Yunhong Che, Xin Sun, Lin Su, Xiangyu Chen, Zihao Zhou, Heng Chang, Tingwei Cao, Xiao Xiao, Yaojun Liu, Wenjun Yu, Zhongling Xu, Yang Li, Han Hao, Xuan Zhang, Xiaosong Hu, Guangmin ZHou

    Abstract: Manufacturing complexities and uncertainties have impeded the transition from material prototypes to commercial batteries, making prototype verification critical to quality assessment. A fundamental challenge involves deciphering intertwined chemical processes to characterize degradation patterns and their quantitative relationship with battery performance. Here we show that a physics-informed mac…

    Submitted 31 May, 2024; originally announced June 2024.

    ACM Class: J.2; G.3

  6. arXiv:2405.18643  [pdf, other]

    cond-mat.mtrl-sci

    Temperature-Dependent Chirality in Halide Perovskites

    Authors: Mike Pols, Geert Brocks, Sofía Calero, Shuxia Tao

    Abstract: Using chiral organic cations in two-dimensional metal halide perovskites, chirality can be induced in the metal halide layers, which results in semiconductors with intriguing chiral optical and spin-selective transport properties. The chiral properties strongly depend on temperature, despite the basic crystal symmetry not changing fundamentally. We identify a set of descriptors that characterize t…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 16 pages, 4 figures

  7. arXiv:2405.10681  [pdf, other]

    cs.IR

    Know in AdVance: Linear-Complexity Forecasting of Ad Campaign Performance with Evolving User Interest

    Authors: XiaoYu Wang, YongHui Guo, Hui Sheng, Peili Lv, Chi Zhou, Wei Huang, ShiQin Ta, Dongbo Huang, Xiu** Yang, Lan Xu, Hao Zhou, Yusheng Ji

    Abstract: Real-time Bidding (RTB) advertisers wish to know in advance the expected cost and yield of ad campaigns to avoid trial-and-error expenses. However, Campaign Performance Forecasting (CPF), a sequence modeling task involving tens of thousands of ad auctions, poses challenges of evolving user interest, auction representation, and long context, making coarse-grained and static-modeling method…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures, accepted at ACM SIGKDD 2024

  8. arXiv:2405.03379  [pdf, other]

    cs.LG cs.AI cs.RO

    Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning

    Authors: Stone Tao, Arth Shukla, Tse-kai Chan, Hao Su

    Abstract: Reinforcement learning (RL) presents a promising framework to learn policies through environment interaction, but often requires an infeasible amount of interaction data to solve complex tasks from sparse rewards. One direction includes augmenting RL with offline data demonstrating desired tasks, but past work often require a lot of high-quality demonstration data that is difficult to obtain, espe…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted at The Twelfth International Conference on Learning Representations (ICLR 2024). Website: https://reverseforward-cl.github.io/

  9. arXiv:2404.17287  [pdf, other]

    cs.CL

    When to Trust LLMs: Aligning Confidence with Response Quality

    Authors: Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, Jinyang Gao, Huawei Shen, Bolin Ding

    Abstract: Despite the success of large language models (LLMs) in natural language generation, much evidence shows that LLMs may produce incorrect or nonsensical text. This limitation highlights the importance of discerning when to trust LLMs, especially in safety-critical domains. Existing methods often express reliability by confidence level, however, their effectiveness is limited by the lack of objective…

    Submitted 9 June, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by ACL 2024

  10. arXiv:2404.16280  [pdf, ps, other]

    cs.NE cs.AI cs.LG

    An Efficient Reconstructed Differential Evolution Variant by Some of the Current State-of-the-art Strategies for Solving Single Objective Bound Constrained Problems

    Authors: Sichen Tao, Ruihan Zhao, Kaiyu Wang, Shangce Gao

    Abstract: Complex single-objective bounded problems are often difficult to solve. In evolutionary computation methods, since the proposal of differential evolution algorithm in 1997, it has been widely studied and developed due to its simplicity and efficiency. These developments include various adaptive strategies, operator improvements, and the introduction of other search methods. After 2014, research ba…

    Submitted 24 April, 2024; originally announced April 2024.

  11. arXiv:2403.14118  [pdf, other]

    cs.CL

    From Handcrafted Features to LLMs: A Brief Survey for Machine Translation Quality Estimation

    Authors: Haofei Zhao, Yilun Liu, Shimin Tao, Weibin Meng, Yimeng Chen, Xiang Geng, Chang Su, Min Zhang, Hao Yang

    Abstract: Machine Translation Quality Estimation (MTQE) is the task of estimating the quality of machine-translated text in real time without the need for reference translations, which is of great importance for the development of MT. After two decades of evolution, QE has yielded a wealth of results. This article provides a comprehensive overview of QE datasets, annotation methods, shared tasks, methodolog…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by IJCNN 2024

  12. arXiv:2403.09135  [pdf, other]

    cs.HC

    Towards Proactive Interactions for In-Vehicle Conversational Assistants Utilizing Large Language Models

    Authors: Huifang Du, Xuejing Feng, Jun Ma, Meng Wang, Shiyu Tao, Yijie Zhong, Yuan-Fang Li, Haofen Wang

    Abstract: Research demonstrates that the proactivity of in-vehicle conversational assistants (IVCAs) can help to reduce distractions and enhance driving safety, better meeting users' cognitive needs. However, existing IVCAs struggle with user intent recognition and context awareness, which leads to suboptimal proactive interactions. Large language models (LLMs) have shown potential for generalizing to vario…

    Submitted 14 March, 2024; originally announced March 2024.

  13. arXiv:2403.05545  [pdf]

    cs.CY

    Unveiling the influence of behavioural, built environment and socio-economic features on the spatial and temporal variability of bus use using explainable machine learning

    Authors: Sui Tao, Francisco Rowe, Hongyu Shan

    Abstract: Understanding the variability of people's travel patterns is key to transport planning and policy-making. However, to what extent daily transit use displays geographic and temporal variabilities, and what are the contributing factors have not been fully addressed. Drawing on smart card data in Beijing, China, this study seeks to address these deficits by adopting new indices to capture the spatial…

    Submitted 6 February, 2024; originally announced March 2024.

    Comments: 58 pages including supplementary material

  14. arXiv:2403.04980  [pdf, other]

    quant-ph

    Photonic simulation of Majorana-based Jones polynomials

    Authors: Jia-Kun Li, Kai Sun, Ze-Yan Hao, Jia-He Liang, Si-Jing Tao, Jiannis K. Pachos, Jin-Shi Xu, Yong-Jian Han, Chuan-Feng Li, Guang-Can Guo

    Abstract: Jones polynomials were introduced as a tool to distinguish between topologically different links. Recently, they emerged as the central building block of topological quantum computation: by braiding non-Abelian anyons it is possible to realise quantum algorithms through the computation of Jones polynomials. So far, it has been a formidable task to evaluate Jones polynomials through the control and…

    Submitted 31 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  15. arXiv:2402.18191  [pdf, other]

    cs.CL

    Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation

    Authors: Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Hongxia Ma, Li Zhang, Hao Yang, Tong Xiao

    Abstract: With contributions from the open-source community, a vast amount of instruction tuning (IT) data has emerged. Given the significant resource allocation required by training and evaluating models, it is advantageous to have an efficient method for selecting high-quality IT data. However, existing methods for instruction data selection have limitations such as relying on fragile external APIs, being…

    Submitted 28 February, 2024; originally announced February 2024.

  16. arXiv:2402.15200  [pdf, other]

    cs.CL

    DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators

    Authors: Xinglin Lyu, Junhui Li, Yanqing Zhao, Min Zhang, Daimeng Wei, Shimin Tao, Hao Yang, Min Zhang

    Abstract: Generally, the decoder-only large language models (LLMs) are adapted to context-aware neural machine translation (NMT) in a concatenating way, where LLMs take the concatenation of the source sentence (i.e., intra-sentence context) and the inter-sentence context as the input, and then to generate the target tokens sequentially. This adaptation strategy, i.e., concatenation mode, considers intra-sen…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: under reviewing

  17. arXiv:2402.03075  [pdf, ps, other]

    math.FA

    Some sharp bounds for Hardy type operators on mixed radial-angular type function spaces

    Authors: Ronghui Liu, Yanqi Yang, Shuangping Tao

    Abstract: In this paper, we are devoted to studying some sharp bounds for Hardy type operators on mixed radial-angular type function spaces. In addition, we will establish the sharp weak-type estimates for the fractional Hardy operator and its conjugate operator, respectively.

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 30 pages

    MSC Class: 42B35; 26D10; 46E30; 26D15

  18. arXiv:2401.13953  [pdf, other]

    cond-mat.mtrl-sci

    Accelerating Structural Optimization through Fingerprinting Space Integration on the Potential Energy Surface

    Authors: Shuo Tao, Xuecheng Shao, Li Zhu

    Abstract: Structural optimization has been a crucial component in computational materials research, and structure predictions have relied heavily on this technique in particular. In this study, we introduce a novel method that enhances the efficiency of local optimization by integrating an extra fingerprint space into the optimization process. Our approach utilizes a mixed energy concept in the hyper potent…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 16 pages, 4 figures

  19. arXiv:2401.11382  [pdf, other]

    cs.CL cs.AI

    Using Large Language Model for End-to-End Chinese ASR and NER

    Authors: Yuang Li, Jiawei Yu, Min Zhang, Mengxin Ren, Yanqing Zhao, Xiaofeng Zhao, Shimin Tao, Jinsong Su, Hao Yang

    Abstract: Mapping speech tokens to the same feature space as text tokens has become the paradigm for the integration of speech modality into decoder-only large language models (LLMs). An alternative approach is to use an encoder-decoder architecture that incorporates speech features through cross-attention. This approach, however, has received less attention in the literature. In this work, we connect the W…

    Submitted 6 June, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: 5 pages, 2 figures, Accepted to InterSpeech 2024

  20. UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction

    Authors: Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang

    Abstract: Error correction techniques have been used to refine the output sentences from automatic speech recognition (ASR) models and achieve a lower word error rate (WER). Previous works usually adopt end-to-end models and has strong dependency on Pseudo Paired Data and Original Paired Data. But when only pre-training on Pseudo Paired Data, previous models have negative effect on correction. While fine-tu…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted in ICASSP 2023

  21. arXiv:2312.04378  [pdf, other]

    cond-mat.mtrl-sci cond-mat.dis-nn

    Operando pair distribution function analysis of nanocrystalline functional materials: the case of $\mathrm{TiO_{2}}$-bronze nanocrystals in Li-ion battery electrodes

    Authors: Martin Aaskov Karlsen, Jonas Billet, Songsheng Tao, Isabel Van Driessche, Simon J. L. Billinge, Dorthe B. Ravnsbæk

    Abstract: Structural modelling of $operando$ pair distribution function (PDF) data of functional materials can be highly complex. To aid the understanding of complex operando PDF data, we here demonstrate a toolbox for PDF analysis. The tools include the structureMining, similarityMapping, nmfMapping apps available through the online service 'PDF in the cloud' (PDFitc, www.pdfitc.org), as well as noise-filt…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Preprint, 82 pages in total (front page: 1 page, abstract: 1 page, paper: 34 pages, supporting information: 40 pages, references: 5 pages, synopsis: 1 page), 35 figures in total (frontpage: 1 figure, paper: 8 figures, supporting information: 26 figures)

  22. arXiv:2311.13246  [pdf, other]

    cs.CL

    CoachLM: Automatic Instruction Revisions Improve the Data Quality in LLM Instruction Tuning

    Authors: Yilun Liu, Shimin Tao, Xiaofeng Zhao, Ming Zhu, Wenbing Ma, Junhao Zhu, Chang Su, Yutai Hou, Miao Zhang, Min Zhang, Hongxia Ma, Li Zhang, Hao Yang, Yanfei Jiang

    Abstract: Instruction tuning is crucial for enabling Language Learning Models (LLMs) in responding to human instructions. The quality of instruction pairs used for tuning greatly affects the performance of LLMs. However, the manual creation of high-quality instruction datasets is costly, leading to the adoption of automatic generation of instruction pairs by LLMs as a popular alternative. To ensure the high…

    Submitted 20 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Accepted by ICDE 2024

  23. arXiv:2310.17885  [pdf, other]

    math.FA

    Sobolev regularity for a class of local fractional new maximal operators

    Authors: Rui Li, Shuangping Tao

    Abstract: This paper is devoted to studying the regularity properties for the new maximal operator $M_{\varphi}$ and the fractional new maximal operator $M_{\varphi,β}$ in the local case. Some new pointwise gradient estimates of $M_{\varphi,Ω}$ and $M_{\varphi,β,Ω}$ are given. Moreover, the boundedness of $M_{\varphi,Ω}$ and $M_{\varphi,β,Ω}$ on Sobolev space is established. As applications, we also obtain…

    Submitted 27 October, 2023; originally announced October 2023.

  24. arXiv:2309.14002  [pdf, other]

    cond-mat.mtrl-sci

    Calculating the Circular Dichroism of Chiral Halide Perovskites: A Tight-Binding Approach

    Authors: Sofia Apergi, Geert Brocks, Shuxia Tao

    Abstract: Chiral metal halide perovskites have emerged as promising optoelectronic materials for emission and detection of circular polarized visible light. Despite chirality being realized by adding chiral organic cations or ligands, the chiroptical activity originates from the metal halide framework. The mechanism is not well understood, as an overarching modeling framework is lacking. Capturing chirality…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 19 pages, 4 figures

  25. arXiv:2309.13230  [pdf, other]

    cs.CL

    Unify word-level and span-level tasks: NJUNLP's Participation for the WMT2023 Quality Estimation Shared Task

    Authors: Xiang Geng, Zhejian Lai, Yu Zhang, Shimin Tao, Hao Yang, Jiajun Chen, Shujian Huang

    Abstract: We introduce the submissions of the NJUNLP team to the WMT 2023 Quality Estimation (QE) shared task. Our team submitted predictions for the English-German language pair on all two sub-tasks: (i) sentence- and word-level quality prediction; and (ii) fine-grained error span detection. This year, we further explore pseudo data methods for QE based on NJUQE framework (https://github.com/NJUNLP/njuqe).…

    Submitted 11 December, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: WMT2023 System Paper

    Journal ref: https://aclanthology.org/2023.wmt-1.71

  26. arXiv:2309.12003  [pdf, ps, other]

    cs.IT cs.CR

    A quaternary analogue of Tang-Ding codes

    Authors: Minjia Shi, Sihui Tao, Jon-Lark Kim, Patrick Sole

    Abstract: In a recent paper, Tang and Ding introduced a class of binary cyclic codes of rate close to one half with a designed lower bound on their minimum distance. The definition involves the base $2$ expansion of the integers in their defining set. In this paper we propose an analogue for quaternary codes. In addition, the performances of the subfield subcode and of the trace code (two binary cyclic code…

    Submitted 21 September, 2023; originally announced September 2023.

  27. arXiv:2309.09588  [pdf, other]

    cond-mat.mtrl-sci

    Mixing I and Br in Inorganic Perovskites: Atomistic Insights from Reactive Molecular Dynamics Simulations

    Authors: Mike Pols, Adri C. T. van Duin, Sofía Calero, Shuxia Tao

    Abstract: All-inorganic halide perovskites have received a lot of attention as attractive alternatives to overcome the stability issues of hybrid halide perovskites that are commonly associated with organic cations. To find a compromise between the optoelectronic properties of CsPbI$_{3}$ and CsPbBr$_{3}$, perovskites with CsPb(Br$_{\rm{x}}$I$_{\rm{1-x}}$)$_{3}$ mixed compositions are commonly used. An addi…

    Submitted 29 May, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 23 pages, 6 figures

    Journal ref: J. Phys. Chem. C 128 (2024), 4111-4118

  28. arXiv:2309.09552  [pdf, other]

    cs.AI cs.CL

    A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting

    Authors: Yuang Li, Min Zhang, Chang Su, Yinglu Li, Xiaosong Qiao, Mengxin Ren, Miaomiao Ma, Daimeng Wei, Shimin Tao, Hao Yang

    Abstract: The recognition of rare named entities, such as personal names and terminologies, is challenging for automatic speech recognition (ASR) systems, especially when they are not frequently observed in the training data. In this paper, we introduce keyword spotting enhanced Whisper (KWS-Whisper), a novel ASR system that leverages the Whisper model and performs open-vocabulary keyword spotting (OV-KWS)…

    Submitted 6 June, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 5 pages, 2 figures, Accepted to InterSpeech 2024

  29. arXiv:2309.02057  [pdf, other]

    cs.IR

    Robust Recommender System: A Survey and Future Directions

    Authors: Kaike Zhang, Qi Cao, Fei Sun, Yunfan Wu, Shuchang Tao, Huawei Shen, Xueqi Cheng

    Abstract: With the rapid growth of information, recommender systems have become integral for providing personalized suggestions and overcoming information overload. However, their practical deployment often encounters "dirty" data, where noise or malicious information can lead to abnormal recommendations. Research on improving recommender systems' robustness against such dirty data has thus gained significa…

    Submitted 5 September, 2023; originally announced September 2023.

  30. arXiv:2308.13961  [pdf, other]

    cs.CL

    Translate Meanings, Not Just Words: IdiomKB's Role in Optimizing Idiomatic Translation with Language Models

    Authors: Shuang Li, Jiangjie Chen, Siyu Yuan, Xinyi Wu, Hao Yang, Shimin Tao, Yanghua Xiao

    Abstract: To translate well, machine translation (MT) systems and general-purposed language models (LMs) need a deep understanding of both source and target languages and cultures. Therefore, idioms, with their non-compositional nature, pose particular challenges for Transformer-based systems, as literal translations often miss the intended meaning. Traditional methods, which replace idioms using existing k…

    Submitted 24 December, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted to AAAI 2024

  31. arXiv:2308.07610  [pdf, other]

    cs.SE cs.CL

    Interpretable Online Log Analysis Using Large Language Models with Prompt Strategies

    Authors: Yilun Liu, Shimin Tao, Weibin Meng, Jingyu Wang, Wenbing Ma, Yanqing Zhao, Yuhang Chen, Hao Yang, Yanfei Jiang, Xun Chen

    Abstract: Automated log analysis is crucial in modern software-intensive systems for facilitating program comprehension throughout software maintenance and engineering life cycles. Existing methods perform tasks such as log parsing and log anomaly detection by providing a single prediction value without interpretation. However, given the increasing volume of system events, the limited interpretability of an…

    Submitted 25 January, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted by ICPC 2024

  32. arXiv:2308.04114  [pdf, other]

    cs.CL

    Collective Human Opinions in Semantic Textual Similarity

    Authors: Yuxia Wang, Shimin Tao, Ning Xie, Hao Yang, Timothy Baldwin, Karin Verspoor

    Abstract: Despite the subjective nature of semantic textual similarity (STS) and pervasive disagreements in STS annotation, existing benchmarks have used averaged human ratings as the gold standard. Averaging masks the true distribution of human opinions on examples of low agreement, and prevents models from capturing the semantic vagueness that the individual ratings represent. In this work, we introduce U…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 16 pages, 7 figures

    Journal ref: TACL Submission batch: 7/2022; Revision batch: 1/2023; Published 2023

  33. arXiv:2308.01857  [pdf, other]

    cs.AR

    iEDA: An Open-Source Intelligent Physical Implementation Toolkit and Library

    Authors: Xingquan Li, Simin Tao, Zengrong Huang, Shijian Chen, Zhisheng Zeng, Liwei Ni, Zhipeng Huang, Chunan Zhuang, Hongxi Wu, Weiguo Li, Xueyan Zhao, He Liu, Shuaiying Long, Wei He, Bojun Liu, Sifeng Gan, Zihao Yu, Tong Liu, Yuchi Miao, Zhiyuan Yan, Hao Wang, Jie Zhao, Yifan Li, Ruizhi Liu, Xiaoze Lin, et al. (31 additional authors not shown)

    Abstract: Open-source EDA shows promising potential in unleashing EDA innovation and lowering the cost of chip design. This paper presents an open-source EDA project, iEDA, aiming for building a basic infrastructure for EDA technology evolution and closing the industrial-academic gap in the EDA area. iEDA now covers the whole flow of physical design (including Floorplan, Placement, CTS, Routing, Timing Opti…

    Submitted 3 August, 2023; originally announced August 2023.

  34. arXiv:2307.04965  [pdf]

    physics.optics

    Acoustic diagnostics of femtosecond laser filamentation

    Authors: Binpeng Shang, Nan Zhang, Pengfei Qi, Shishi Tao, Lie Lin, Weiwei Liu

    Abstract: The promising application of femtosecond laser filamentation in atmospheric remote sensing brings imperative demand for diagnosing the spatiotemporal dynamics of filamentation. Acoustic emission (AE) during filamentation opens a door to give the insight into the dynamic evolution of filaments in air. In particular, the frequency features of the acoustic emission provide relevant information on the…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 8 pages, 5 figures

    MSC Class: 78A60

    ACM Class: J.2.9

  35. arXiv:2306.15266  [pdf, other]

    cs.AI

    Internal Contrastive Learning for Generalized Out-of-distribution Fault Diagnosis (GOOFD) Framework

    Authors: Xingyue Wang, Hanrong Zhang, Ke Ma, Shuting Tao, Peng Peng, Hongwei Wang

    Abstract: Fault diagnosis is essential in industrial processes for monitoring the conditions of important machines. With the ever-increasing complexity of working conditions and demand for safety during production and operation, different diagnosis methods are required, and more importantly, an integrated fault diagnosis system that can cope with multiple tasks is highly desired. However, the diagnosis subt…

    Submitted 27 June, 2023; originally announced June 2023.

  36. arXiv:2306.15150  [pdf]

    physics.optics

    Femtosecond Laser Filamentation in Atmospheric Turbulence

    Authors: Jiewei Guo, Lu Sun, Yuezheng Wang, Jiayun Xue, Zhi Zhang, Haiyi Liu, Shishi Tao, Pengfei Qi, Lie Lin, Weiwei Liu

    Abstract: The effects of turbulence intensity and turbulence region on the distribution of femtosecond laser filaments are experimentally elaborated. Through the ultrasonic signals emitted by the filaments, and it is observed that increasing turbulence intensity and expanding turbulence active region cause an increase in the start position of the filament, and a decrease in filament length, which can be wel…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 9 pages, 4 figures

  37. arXiv:2306.12904  [pdf]

    physics.optics

    Coupled air lasing gain and Mie scattering loss: aerosol effect in filament-induced plasma spectroscopy

    Authors: Jiayun Xue, Zhi Zhang, Yuezheng Wang, Binpeng Shang, Jiewei Guo, Shishi Tao, Nan Zhang, Lanjunguo, Pengfei Qi, Lie Lin, Weiwei Liu

    Abstract: Femtosecond laser filament-induced plasma spectroscopy (FIPS) demonstrates great potentials in the remote sensing for identifying atmospheric pollutant molecules. Due to the widespread aerosols in atmosphere, the remote detection based on FIPS would be affected from both the excitation and the propagation of fingerprint fluorescence, which still remain elusive. Here the physical model of filament-…

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 7 pages, 4 figures

  38. arXiv:2306.07486  [pdf, other]

    cs.CL

    Knowledge-Prompted Estimator: A Novel Approach to Explainable Machine Translation Assessment

    Authors: Hao Yang, Min Zhang, Shimin Tao, Minghan Wang, Daimeng Wei, Yanfei Jiang

    Abstract: Cross-lingual Machine Translation (MT) quality estimation plays a crucial role in evaluating translation performance. GEMBA, the first MT quality assessment metric based on Large Language Models (LLMs), employs one-step prompting to achieve state-of-the-art (SOTA) in system-level MT quality estimation; however, it lacks segment-level analysis. In contrast, Chain-of-Thought (CoT) prompting outperfo…

    Submitted 12 June, 2023; originally announced June 2023.

  39. arXiv:2306.07281  [pdf]

    physics.plasm-ph physics.optics

    Filament based Ionizing Radiation Sensing Technology

    Authors: Weiwei Liu, Jiewei Guo, Nan Zhang, Lu Sun, Haiyi Liu, Shihi Tao, Yuezheng Wang, Binpeng Shang, Pengfei Qi, Lie Lin

    Abstract: Accidental exposure to overdose ionizing radiation will inevitably lead to severe biological damage, thus detecting and localizing radiation is essential. Traditional measurement techniques are generally restricted to the limited detection range of few centimeters, posing a great risk to operators. The potential in remote sensing makes femtosecond laser filament technology great candidates for con…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 13 pages, 6 figures

  40. arXiv:2305.15792  [pdf, other]

    cs.LG cs.CR

    IDEA: Invariant Defense for Graph Adversarial Robustness

    Authors: Shuchang Tao, Qi Cao, Huawei Shen, Yunfan Wu, Bingbing Xu, Xueqi Cheng

    Abstract: Despite the success of graph neural networks (GNNs), their vulnerability to adversarial attacks poses tremendous challenges for practical applications. Existing defense methods suffer from severe performance decline under unseen attacks, due to either limited observed adversarial examples or pre-defined heuristics. To address these limitations, we analyze the causalities in graph adversarial attac…

    Submitted 25 April, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Submitted to Information Sciences

  41. Popularity Debiasing from Exposure to Interaction in Collaborative Filtering

    Authors: Yuanhao Liu, Qi Cao, Huawei Shen, Yunfan Wu, Shuchang Tao, Xueqi Cheng

    Abstract: Recommender systems often suffer from popularity bias, where popular items are overly recommended while sacrificing unpopular items. Existing researches generally focus on ensuring the number of recommendations exposure of each item is equal or proportional, using inverse propensity weighting, causal intervention, or adversarial training. However, increasing the exposure of unpopular items may not…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Published as a SIGIR'23 short paper

  42. The construction and characterization of MgO transmission dynodes

    Authors: H. W. Chan, V. Prodanović, A. M. M. G. Theulings, S. Tao, J. Smedley, C. W. Hagen, P. M. Sarro, H. v. d. Graaf

    Abstract: In this work we demonstrate that ultra-thin (5 and 15 nm) MgO transmission dynodes (tynodes) with sufficient high transmission electron yield (TEY) can be constructed. These tynodes act as electron amplification stages in a novel vacuum electron multiplier: the Timed Photon Counter (TiPC). The ultra-thin membranes with a diameter of 30 μm are arranged in a square 64-by-64-array. The TEY was determ…

    Submitted 19 April, 2023; originally announced April 2023.

  43. arXiv:2303.10938  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Complete Suppression of Phase Segregation in Mixed-Halide Perovskite Nanocrystals under Periodic Heating

    Authors: Shengnan Feng, Rentong Duan, Yu Ju, Shuyi Li, Chunfeng Zhang, Shuxia Tao, Min Xiao, Xiaoyong Wang

    Abstract: Under continuous light illumination, it is known that localized domains with segregated halide compositions form in semiconducting mixed-halide perovskites, thus severely limiting their optoelectronic applications due to the negative changes in bandgap energies and charge-carrier characteristics. Here we deposit mixed-halide perovskite CsPbBr1.2I1.8 nanocrystals onto an indium tin oxide substrate,…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 25 pages, 4 figures

  44. arXiv:2302.12048  [pdf, ps, other

    eess.AS cs.SD

    Frequency bin-wise single channel speech presence probability estimation using multiple DNNs

    Authors: Shuai Tao, Himavanth Reddy, Jesper Rindom Jensen, Mads Græsbøll Christensen

    Abstract: In this work, we propose a frequency bin-wise method to estimate the single-channel speech presence probability (SPP) with multiple deep neural networks (DNNs) in the short-time Fourier transform domain. Since all frequency bins are typically considered simultaneously as input features for conventional DNN-based SPP estimators, high model complexity is inevitable. To reduce the model complexity an…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted for ICASSP 2023

  45. arXiv:2302.08051  [pdf, other

    cs.LG cs.CR cs.SI

    Graph Adversarial Immunization for Certifiable Robustness

    Authors: Shuchang Tao, Huawei Shen, Qi Cao, Yunfan Wu, Liang Hou, Xueqi Cheng

    Abstract: Despite achieving great success, graph neural networks (GNNs) are vulnerable to adversarial attacks. Existing defenses focus on developing adversarial training or model modification. In this paper, we propose and formulate graph adversarial immunization, i.e., vaccinating part of graph structure to improve certifiable robustness of graph against any admissible adversarial attack. We first propose…

    Submitted 23 September, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Published in TKDE. Code: https://github.com/TaoShuchang/AdvImmune_node

  46. arXiv:2302.04659  [pdf, other

    cs.RO cs.AI

    ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills

    Authors: Jiayuan Gu, Fanbo Xiang, Xuanlin Li, Zhan Ling, Xiqiang Liu, Tongzhou Mu, Yihe Tang, Stone Tao, Xinyue Wei, Yunchao Yao, Xiaodi Yuan, Pengwei Xie, Zhiao Huang, Rui Chen, Hao Su

    Abstract: Generalizable manipulation skills, which can be composed to tackle long-horizon and complex daily chores, are one of the cornerstones of Embodied AI. However, existing benchmarks, mostly composed of a suite of simulatable environments, are insufficient to push cutting-edge research works because they lack object-level topological and geometric variations, are not based on fully dynamic simulation,…

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Published as a conference paper at ICLR 2023. Project website: https://maniskill2.github.io/

  47. arXiv:2301.11485  [pdf

    physics.optics

    Sub-ppb aerosol detection at a distance of 30 meters by millijoule femtosecond laser pulse filamentation in air

    Authors: Jiewei Guo, Zhi Zhang, Nan Zhang, Binpeng Shang, Jiayun Xue, Yuezheng Wang, Shishi Tao, Bofu Xie, Lanjun Guo, Lie Lin, Weiwei Liu

    Abstract: In this work, sub-ppb aerosol detection is achieved by femtosecond laser filament with a single pulse energy of 4 mJ at a distance of 30 m. A concave mirror with an open aperture of 41.4 cm is employed in an off-axis optical system to focus the femtosecond laser beam and collect the fluorescence of NaCl aerosol. The simulation and experimental results show that the astigmatism can be greatly reduc…

    Submitted 26 January, 2023; originally announced January 2023.

  48. arXiv:2301.01609  [pdf, other

    cs.AI cs.MA

    Emergent collective intelligence from massive-agent cooperation and competition

    Authors: Hanmo Chen, Stone Tao, Jiaxin Chen, Weihan Shen, Xihui Li, Chenghui Yu, Sikai Cheng, Xiaolong Zhu, Xiu Li

    Abstract: Inspired by organisms evolving through cooperation and competition between different populations on Earth, we study the emergence of artificial collective intelligence through massive-agent reinforcement learning. To this end, we propose a new massive-agent reinforcement learning environment, Lux, where dynamic and massive agents in two teams scramble for limited resources and fight off the darkne…

    Submitted 5 January, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: Published at NeurIPS 2022 Deep RL workshop. Code available at https://github.com/hanmochen/lux-open

  49. arXiv:2212.05830  [pdf, other

    cs.CL

    P-Transformer: Towards Better Document-to-Document Neural Machine Translation

    Authors: Yachao Li, Junhui Li, Jing Jiang, Shimin Tao, Hao Yang, Min Zhang

    Abstract: Directly training a document-to-document (Doc2Doc) neural machine translation (NMT) via Transformer from scratch, especially on small datasets usually fails to converge. Our dedicated probing tasks show that 1) both the absolute position and relative position information gets gradually weakened or even vanished once it reaches the upper encoder layers, and 2) the vanishing of absolute position inf…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Submitted to TASLP

  50. arXiv:2211.00981  [pdf, other

    cs.IR

    Relevance Assessments for Web Search Evaluation: Should We Randomise or Prioritise the Pooled Documents? (CORRECTED VERSION)

    Authors: Tetsuya Sakai, Sijie Tao, Zhaohao Zeng

    Abstract: In the context of depth-$k$ pooling for constructing web search test collections, we compare two approaches to ordering pooled documents for relevance assessors: the prioritisation strategy (PRI) used widely at NTCIR, and the simple randomisation strategy (RND). In order to address research questions regarding PRI and RND, we have constructed and released the WWW3E8 data set, which contains eight…

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 30 pages. This is a corrected version of an open-access TOIS paper ( https://dl.acm.org/doi/pdf/10.1145/3494833 )