Skip to main content

Showing 51–100 of 291 results for author: Cai, M

.
  1. arXiv:2310.05035  [pdf, other

    cs.CL cs.AI

    Self-Convinced Prompting: Few-Shot Question Answering with Repeated Introspection

    Authors: Haodi Zhang, Min Cai, Xinhe Zhang, Chen Jason Zhang, Rui Mao, Kaishun Wu

    Abstract: While large language models (LLMs) such as ChatGPT and PaLM have demonstrated remarkable performance in various language understanding and generation tasks, their capabilities in complex reasoning and intricate knowledge utilization still fall short of human-level proficiency. Recent studies have established the effectiveness of prompts in steering LLMs towards generating desired outputs. Building… ▽ More

    Submitted 10 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  2. arXiv:2310.04610  [pdf, other

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri , et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique… ▽ More

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  3. arXiv:2309.12530  [pdf, other

    cs.CV

    A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance

    Authors: Zeyi Huang, Andy Zhou, Zijian Lin, Mu Cai, Haohan Wang, Yong Jae Lee

    Abstract: Domain generalization studies the problem of training a model with samples from several domains (or distributions) and then testing the model with samples from a new, unseen domain. In this paper, we propose a novel approach for domain generalization that leverages recent advances in large vision-language models, specifically a CLIP teacher model, to train a smaller model that generalizes to unsee… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: to appear at ICCV2023

  4. arXiv:2309.10313  [pdf, other

    cs.CL cs.AI cs.LG

    Investigating the Catastrophic Forgetting in Multimodal Large Language Models

    Authors: Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma

    Abstract: Following the success of GPT4, there has been a surge in interest in multimodal large language model (MLLM) research. This line of research focuses on develo** general-purpose LLMs through fine-tuning pre-trained LLMs and vision models. However, catastrophic forgetting, a notorious phenomenon where the fine-tuned model fails to retain similar performance compared to the pre-trained model, still… ▽ More

    Submitted 5 December, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

  5. arXiv:2309.08813  [pdf, other

    eess.SY

    Control Barrier Function for Linearizable Systems with High Relative Degrees from Signal Temporal Logics: A Reference Governor Approach

    Authors: Kaier Liang, Mingyu Cai, Cristian-Ioan Vasile

    Abstract: This paper considers the safety-critical navigation problem with Signal Temporal Logic (STL) tasks. We developed an explicit reference governor-guided control barrier function (ERG-guided CBF) method that enables the application of first-order CBFs to high-order linearizable systems. This method significantly reduces the conservativeness of the existing CBF approaches for high-order systems. Furth… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  6. arXiv:2309.04198  [pdf, other

    cs.CL

    Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain

    Authors: Yanrui Du, Sendong Zhao, Muzhen Cai, Ming Ma, Danyang Zhao, Jiawei Cao, Bing Qin

    Abstract: Extensive studies have been devoted to privatizing general-domain Large Language Models (LLMs) as Domain-Specific LLMs via feeding specific-domain data. However, these privatization efforts often ignored a critical aspect: Dual Logic Ability, which is a core reasoning ability for LLMs. The dual logic ability of LLMs ensures that they can maintain a consistent stance when confronted with both posit… ▽ More

    Submitted 23 February, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

  7. arXiv:2309.04175  [pdf, other

    cs.CL cs.AI

    Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese

    Authors: Haochun Wang, Sendong Zhao, Zewen Qiang, Zijian Li, Nuwa Xi, Yanrui Du, MuZhen Cai, Haoqiang Guo, Yuhan Chen, Haoming Xu, Bing Qin, Ting Liu

    Abstract: Large Language Models (LLMs) have demonstrated remarkable success in diverse natural language processing (NLP) tasks in general domains. However, LLMs sometimes generate responses with the hallucination about medical facts due to limited domain knowledge. Such shortcomings pose potential risks in the utilization of LLMs within medical contexts. To address this challenge, we propose knowledge-tunin… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 11 pages, 5 figures

  8. arXiv:2309.04174  [pdf, other

    cs.CL cs.AI

    Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification

    Authors: Haochun Wang, Sendong Zhao, Chi Liu, Nuwa Xi, Muzhen Cai, Bing Qin, Ting Liu

    Abstract: Prompt-based classification adapts tasks to a cloze question format utilizing the [MASK] token and the filled tokens are then mapped to labels through pre-defined verbalizers. Recent studies have explored the use of verbalizer embeddings to reduce labor in this process. However, all existing studies require a tuning process for either the pre-trained models or additional trainable embeddings. Mean… ▽ More

    Submitted 29 January, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted by AAAI 2024, 11 pages, 3 figures

  9. arXiv:2308.12033  [pdf, other

    cs.CL cs.AI

    PREFER: Prompt Ensemble Learning via Feedback-Reflect-Refine

    Authors: Chenrui Zhang, Lin Liu, **peng Wang, Chuyuan Wang, Xiao Sun, Hongyu Wang, Mingchen Cai

    Abstract: As an effective tool for eliciting the power of Large Language Models (LLMs), prompting has recently demonstrated unprecedented abilities across a variety of complex tasks. To further improve the performance, prompt ensemble has attracted substantial interest for tackling the hallucination and instability of LLMs. However, existing methods usually adopt a two-stage paradigm, which requires a pre-p… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 8 pages, 4 figures

  10. arXiv:2308.09370  [pdf, other

    cs.CL cs.SD eess.AS

    TrOMR:Transformer-Based Polyphonic Optical Music Recognition

    Authors: Yixuan Li, Hua** Liu, Qiang **, Miaomiao Cai, Peng Li

    Abstract: Optical Music Recognition (OMR) is an important technology in music and has been researched for a long time. Previous approaches for OMR are usually based on CNN for image understanding and RNN for music symbol classification. In this paper, we propose a transformer-based approach with excellent global perceptual capability for end-to-end polyphonic OMR, called TrOMR. We also introduce a novel con… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  11. arXiv:2308.01396  [pdf, other

    physics.optics

    Cross-phase modulation in the two dimensional spectroscopy

    Authors: Mao-Rui Cai, Xue Zhang, Zi-Qian Cheng, Teng-Fei Yan, Hui Dong

    Abstract: Develo** from the transient absorption (TA) spectroscopy, the two dimensional (2D) spectroscopy with pump-probe geometry has emerged as a versatile approach for alleviating the difficulty on implementing the 2D spectroscopy with other geometries. However, the presence of cross-phase modulation (XPM) in TA spectroscopy introduces significant spectral distortions, particularly when the pump and pr… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 11 pages, 7 figures

  12. Building a digital twin of EDFA: a grey-box modeling approach

    Authors: Yichen Liu, Xiaomin Liu, Yihao Zhang, Meng Cai, Mengfan Fu, Xueying Zhong, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: To enable intelligent and self-driving optical networks, high-accuracy physical layer models are required. The dynamic wavelength-dependent gain effects of non-constant-pump erbium-doped fiber amplifiers (EDFAs) remain a crucial problem in terms of modeling, as it determines optical-to-signal noise ratio as well as the magnitude of fiber nonlinearities. Black-box data-driven models have been widel… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  13. arXiv:2307.02011  [pdf, other

    eess.SP

    Precise WiFi Indoor Positioning using Deep Learning Algorithms

    Authors: Minxue Cai, Zihuai Lin

    Abstract: This study demonstrates a WiFi indoor positioning system using Deep Learning algorithms. A new method using fitting function in MATLAB will be utilized to compute the path loss coefficient and log-normal fading variance. To reduce the error, a new hybrid localization approach utilizing Received Signal Strength Indicator (RSSI) and Angle of Arrival (AoA) has been created. Three Deep Learning algori… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

  14. arXiv:2307.01975  [pdf, ps, other

    math.NA math.PR

    Strong convergence rates for a full discretization of stochastic wave equation with nonlinear dam**

    Authors: Meng Cai, David Cohen, Xiaojie Wang

    Abstract: The paper establishes the strong convergence rates of a spatio-temporal full discretization of the stochastic wave equation with nonlinear dam** in dimension one and two. We discretize the SPDE by applying a spectral Galerkin method in space and a modified implicit exponential Euler scheme in time. The presence of the super-linearly growing dam** in the underlying model brings challenges into… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 30 pages, 2 figures

  15. arXiv:2306.06094  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

    Authors: Mu Cai, Zeyi Huang, Yuheng Li, Haohan Wang, Yong Jae Lee

    Abstract: Recently, large language models (LLMs) have made significant advancements in natural language understanding and generation. However, their potential in computer vision remains largely unexplored. In this paper, we introduce a new, exploratory approach that enables LLMs to process images using the Scalable Vector Graphics (SVG) format. By leveraging the XML-based textual descriptions of SVG represe… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  16. Three-way Imbalanced Learning based on Fuzzy Twin SVM

    Authors: Wanting Cai, Mingjie Cai, Qingguo Li, Qiong Liu

    Abstract: Three-way decision (3WD) is a powerful tool for granular computing to deal with uncertain data, commonly used in information systems, decision-making, and medical care. Three-way decision gets much research in traditional rough set models. However, three-way decision is rarely combined with the currently popular field of machine learning to expand its research. In this paper, three-way decision is… ▽ More

    Submitted 19 May, 2023; originally announced June 2023.

  17. Enhancing Language Representation with Constructional Information for Natural Language Understanding

    Authors: Lvxiaowei Xu, Jianwang Wu, Jiawei Peng, Zhilin Gong, Ming Cai, Tianxiang Wang

    Abstract: Natural language understanding (NLU) is an essential branch of natural language processing, which relies on representations generated by pre-trained language models (PLMs). However, PLMs primarily focus on acquiring lexico-semantic information, while they may be unable to adequately handle the meaning of constructions. To address this issue, we introduce construction grammar (CxG), which highlight… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Long paper, accepted at the ACL 2023

  18. arXiv:2305.14895  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite

    Authors: Z. X. Ling, X. J. Sun, C. Zhang, S. L. Sun, G. **, S. N. Zhang, X. F. Zhang, J. B. Chang, F. S. Chen, Y. F. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, Z. D. Li, P. R. Liu, Y. H. Lv, X. H. Ma, Y. J. Tang, C. B. Wang, R. J. Xie, Y. L. Xue, A. L. Yan , et al. (101 additional authors not shown)

    Abstract: The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (Fo… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by RAA

  19. arXiv:2305.13688  [pdf

    physics.optics

    Experimental observation of Kerr-Raman solitons in a normal-dispersion FP resonator

    Authors: Tieying Li, Kan Wu, Xujia Zhang, Minglu Cai, Jian** Chen

    Abstract: Different from the Kerr effect,stimulated Raman scattering (SRS) is a delayed response to molecular vibrations in materials. In microcavities, when driven in an anomalous group velocity dispersion (GVD) regime, SRS typically leads to self-frequency shift of solitons and generation of breather solitons which have been verified both theoretically and experimentally. However, when driven in a normal… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  20. arXiv:2305.12114  [pdf, other

    cs.LG cs.AI cs.DC cs.IT

    GFDC: A Granule Fusion Density-Based Clustering with Evidential Reasoning

    Authors: Mingjie Cai, Zhishan Wu, Qingguo Li, Feng Xu, Jie Zhou

    Abstract: Currently, density-based clustering algorithms are widely applied because they can detect clusters with arbitrary shapes. However, they perform poorly in measuring global density, determining reasonable cluster centers or structures, assigning samples accurately and handling data with large density differences among clusters. To overcome their drawbacks, this paper proposes a granule fusion densit… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

  21. arXiv:2305.05367  [pdf

    stat.AP

    Exploring assessment method of technological advancement based on literature cross-citation

    Authors: Shengxuan Tang, Liming Zhang, Shuo Jiang, Ming Cai, Yao Xiao

    Abstract: Assessing advancements of technology is essential for creating science and technology policies and making informed investments in the technology market. However, current methods primarily focus on the characteristics of the technologies themselves, making it difficult to accurately assess technologies across various fields and generations. To address this challenge, we propose a novel approach tha… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 15 pages, 6 figures

  22. arXiv:2305.02916  [pdf, other

    physics.chem-ph quant-ph

    Enantiodetection via the 2D spectroscopy: extending the methodology to general experimental conditions

    Authors: Mao-Rui Cai, Chong Ye, Yong Li, Hui Dong

    Abstract: Develo** effective methods to measure the enantiomeric excess of the chiral mixture is one of the major topics in chiral molecular researches, yet remains challenging. Enantiodetection method via two-dimensional (2D) spectroscopy based on a four level model, containing a cyclic three-level system (CTLS), of chiral molecules was recently proposed and demonstrated, yet with a strict condition of t… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures

  23. arXiv:2305.00561  [pdf, other

    cs.AI cs.FL cs.MA cs.RO eess.SY

    Model-free Motion Planning of Autonomous Agents for Complex Tasks in Partially Observable Environments

    Authors: Junchao Li, Mingyu Cai, Zhen Kan, Abstract: Motion planning of autonomous agents in partially known environments with incomplete information is a challenging problem, particularly for complex tasks. This paper proposes a model-free reinforcement learning approach to address this problem. We formulate motion planning as a probabilistic-labeled partially observable Markov decision process (PL-POMDP) problem and use linear temporal logic (LTL)… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 32 pages, 22 figures, submitted to Autonomous Agents and Multi-Agent Systems

  24. arXiv:2304.13966  [pdf, ps, other

    math.NA

    Two kinds of numerical algorithms for ultra-slow diffusion equations

    Authors: Min Cai, Changpin Li, Yu Wang

    Abstract: In this article, two kinds of numerical algorithms are derived for the ultra-slow (or superslow) diffusion equation in one and two space dimensions, where the ultra-slow diffusion is characterized by the Caputo-Hadamard fractional derivative of order $α\in (0,1)$. To describe the spatial interaction, the Riesz fractional derivative and the fractional Laplacian are used in one and two space dimensi… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

    MSC Class: 35R11; 65M06

  25. arXiv:2304.09176  [pdf, other

    cs.LG cs.AI

    Enhancing Personalized Ranking With Differentiable Group AUC Optimization

    Authors: Xiao Sun, Bo Zhang, Chenrui Zhang, Han Ren, Mingchen Cai

    Abstract: AUC is a common metric for evaluating the performance of a classifier. However, most classifiers are trained with cross entropy, and it does not optimize the AUC metric directly, which leaves a gap between the training and evaluation stage. In this paper, we propose the PDAOM loss, a Personalized and Differentiable AUC Optimization method with Maximum violation, which can be directly applied when… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: NeuRec@ICDM 2022

  26. arXiv:2304.04609  [pdf

    cond-mat.soft physics.app-ph

    Inverse design of artificial skins

    Authors: Zhiguang Liu, Minkun Cai, Shenda Hong, Junli Shi, Sai Xie, Chang Liu, Huifeng Du, James D. Morin, Gang Li, Wang Liu, Hong Wang, Ke Tang, Nicholas X. Fang, Chuan Fei Guo

    Abstract: Mimicking the perceptual functions of human cutaneous mechanoreceptors, artificial skins or flexible pressure sensors can transduce tactile stimuli to quantitative electrical signals. Conventional methods to design such devices follow a forward structure-to-property routine based on trial-and-error experiments/simulations, which take months or longer to determine one solution valid for one specifi… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  27. arXiv:2304.00790  [pdf, other

    cs.RO eess.SY

    LQR-CBF-RRT*: Safe and Optimal Motion Planning

    Authors: Guang Yang, Mingyu Cai, Ahmad Ahmad, Amanda Prorok, Roberto Tron, Calin Belta

    Abstract: We present LQR-CBF-RRT*, an incremental sampling-based algorithm for offline motion planning. Our framework leverages the strength of Control Barrier Functions (CBFs) and Linear Quadratic Regulators (LQR) to generate safety-critical and optimal trajectories for a robot with dynamics described by an affine control system. CBFs are used for safety guarantees, while LQRs are employed for optimal cont… ▽ More

    Submitted 27 September, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  28. arXiv:2303.15790  [pdf, other

    hep-ex hep-ph physics.ins-det

    STCF Conceptual Design Report: Volume 1 -- Physics & Detector

    Authors: M. Achasov, X. C. Ai, R. Aliberti, L. P. An, Q. An, X. Z. Bai, Y. Bai, O. Bakina, A. Barnyakov, V. Blinov, V. Bobrovnikov, D. Bodrov, A. Bogomyagkov, A. Bondar, I. Boyko, Z. H. Bu, F. M. Cai, H. Cai, J. J. Cao, Q. H. Cao, Z. Cao, Q. Chang, K. T. Chao, D. Y. Chen, H. Chen , et al. (413 additional authors not shown)

    Abstract: The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,… ▽ More

    Submitted 5 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Journal ref: Front. Phys. 19(1), 14701 (2024)

  29. arXiv:2303.04525  [pdf, other

    cs.CV cs.RO

    Continuity-Aware Latent Interframe Information Mining for Reliable UAV Tracking

    Authors: Changhong Fu, Mutian Cai, Sihang Li, Kunhan Lu, Haobo Zuo, Chongjun Liu

    Abstract: Unmanned aerial vehicle (UAV) tracking is crucial for autonomous navigation and has broad applications in robotic automation fields. However, reliable UAV tracking remains a challenging task due to various difficulties like frequent occlusion and aspect ratio change. Additionally, most of the existing work mainly focuses on explicit information to improve tracking performance, ignoring potential i… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  30. arXiv:2303.02566  [pdf, other

    stat.ML cs.LG stat.CO

    MFAI: A Scalable Bayesian Matrix Factorization Approach to Leveraging Auxiliary Information

    Authors: Zhiwei Wang, Fa Zhang, Cong Zheng, Xianghong Hu, Mingxuan Cai, Can Yang

    Abstract: In various practical situations, matrix factorization methods suffer from poor data quality, such as high data sparsity and low signal-to-noise ratio (SNR). Here, we consider a matrix factorization problem by utilizing auxiliary information, which is massively available in real-world applications, to overcome the challenges caused by poor data quality. Unlike existing methods that mainly rely on s… ▽ More

    Submitted 12 February, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

  31. arXiv:2302.10491  [pdf, ps, other

    math.CO

    The Laplacian spectral ratio of connected graphs

    Authors: Zhen Lin, Jiajia Wang, Min Cai

    Abstract: Let $G$ be a simple connected undirected graph. The Laplacian spectral ratio of $G$, denoted by $R_L(G)$, is defined as the quotient between the largest and second smallest Laplacian eigenvalues of $G$, which is closely related to the structural parameters of a graph (or network), such as diameter, $t$-tough, perfect matching, average density of cuts, and synchronizability, etc. In this paper, we… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 17 pages,3 figures

    MSC Class: 05C05; 05C50

  32. arXiv:2301.06017  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Miniature Magnetic Nano islands in a Morphotropic Cobaltite Matrix

    Authors: Shengru Chen, Dongke Rong, Yue Xu, Miming Cai, Xinyan Li, Qinghua Zhang, Shuai Xu, Yan-Xing Shang, Haitao Hong, Ting Cui, Qiao **, Jia-Ou Wang, Haizhong Guo, Lin Gu, Qiang Zheng, Can Wang, **xing Zhang, Gang-Qin Liu, Kui-juan **, Er-Jia Guo

    Abstract: High-density magnetic memories are key components in spintronics, quantum computing, and energy-efficient electronics. Reduced dimensionality and magnetic domain stability at the nanoscale are essential for the miniaturization of magnetic storage units. Yet, inducing magnetic order, and selectively tuning spin-orbital coupling at specific locations have remained challenging. Here we demonstrate th… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 20 pages,4 figures

  33. arXiv:2212.09588  [pdf, other

    cs.CL

    Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling

    Authors: Mingzhu Cai, Siqi Bao, Xin Tian, Huang He, Fan Wang, Hua Wu

    Abstract: In this paper, we propose an unsupervised query enhanced approach for knowledge-intensive conversations, namely QKConv. There are three modules in QKConv: a query generator, an off-the-shelf knowledge selector, and a response generator. QKConv is optimized through joint training, which produces the response by exploring multiple candidate queries and leveraging corresponding selected knowledge. Th… ▽ More

    Submitted 26 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted for publication at ACL2023

  34. arXiv:2212.02007  [pdf, other

    cs.RO eess.SY

    Mixed Cloud Control Testbed: Validating Vehicle-Road-Cloud Integration via Mixed Digital Twin

    Authors: Jianghong Dong, Qing Xu, Jiawei Wang, Chunying Yang, Mengchi Cai, Chaoyi Chen, Jianqiang Wang, Keqiang Li

    Abstract: Reliable and efficient validation technologies are critical for the recent development of multi-vehicle cooperation and vehicle-road-cloud integration. In this paper, we introduce our miniature experimental platform, Mixed Cloud Control Testbed (MCCT), developed based on a new notion of Mixed Digital Twin (mixedDT). Combining Mixed Reality with Digital Twin, mixedDT integrates the virtual and phys… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: 13 pages, 13 figures

  35. arXiv:2211.11157  [pdf, other

    math.LO

    On the Nonexistence of a Strong Minimal Pair

    Authors: Mingzhong Cai, Yiqun Liu, Yong Liu, Cheng Peng, Yue Yang

    Abstract: Two nonzero recursively enumerable (r.e.) degrees $\mathbf{a}$ and $\mathbf{b}$ form a strong minimal pair if $\mathbf{a} \wedge \mathbf{b}=\mathbf{0}$ and $\mathbf{b}\vee \mathbf{x}\geq \mathbf{a}$ for any nonzero r.e. degree $\mathbf{x}\leq \mathbf{a}$. We prove that there is no strong minimal pair in the r.e. degrees. Our construction goes beyond the usual $\mathbf{0}'''$-priority arguments and… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    MSC Class: 03D25

  36. arXiv:2211.10758  [pdf, ps, other

    math.NA

    A priori error estimates of two fully discrete coupled schemes for Biot's consolidation model

    Authors: Huipeng Gu, Mingchao Cai, **gzhi Li, Guoliang Ju

    Abstract: This paper concentrates on a priori error estimates of two fully discrete coupled schemes for Biot's consolidation model based on the three-field formulation introduced by Oyarzua et al. (SIAM Journal on Numerical Analysis, 2016). The spatial discretizations are based on the Taylor-Hood finite elements combined with Lagrange elements for the three primary variables. For time discretization, we con… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

  37. arXiv:2211.10007  [pdf, other

    astro-ph.HE astro-ph.IM

    First wide field-of-view X-ray observations by a lobster eye focusing telescope in orbit

    Authors: C. Zhang, Z. X. Ling, X. J. Sun, S. L. Sun, Y. Liu, Z. D. Li, Y. L. Xue, Y. F. Chen, Y. F. Dai, Z. Q. Jia, H. Y. Liu, X. F. Zhang, Y. H. Zhang, S. N. Zhang, F. S. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, P. R. Liu, X. H. Ma, Y. J. Tang, C. B. Wang , et al. (53 additional authors not shown)

    Abstract: As a novel X-ray focusing technology, lobster eye micro-pore optics (MPO) feature both a wide observing field of view and true imaging capability, promising sky monitoring with significantly improved sensitivity and spatial resolution in soft X-rays. Since first proposed by Angel (1979), the optics have been extensively studied, developed and trialed over the past decades. In this Letter, we repor… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 11 pages, 4 figures. Accepted for publication in Astrophysical Journal Letter

  38. arXiv:2211.09381  [pdf, other

    cs.SD eess.AS

    Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire

    Authors: Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu

    Abstract: In multi-talker scenarios such as meetings and conversations, speech processing systems are usually required to segment the audio and then transcribe each segmentation. These two stages are addressed separately by speaker change detection (SCD) and automatic speech recognition (ASR). Most previous SCD systems rely solely on speaker information and ignore the importance of speech content. In this p… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  39. arXiv:2211.03885  [pdf, other

    cs.CV eess.IV

    Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li , et al. (13 additional authors not shown)

    Abstract: The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. Th… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  40. FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction

    Authors: Lvxiaowei Xu, Jianwang Wu, Jiawei Peng, Jiayu Fu, Ming Cai

    Abstract: Grammatical Error Correction (GEC) has been broadly applied in automatic correction and proofreading system recently. However, it is still immature in Chinese GEC due to limited high-quality data from native speakers in terms of category and scale. In this paper, we present FCGEC, a fine-grained corpus to detect, identify and correct the grammatical errors. FCGEC is a human-annotated corpus with m… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Long paper, accepted at the Findings of EMNLP 2022

  41. arXiv:2210.10560  [pdf, other

    hep-ph hep-ex nucl-ex nucl-th

    Dilepton production in the photodisintegration of the deuteron

    Authors: Mengchu Cai, Tianbo Liu, Bo-Qiang Ma

    Abstract: We study the lepton pair production in the photodisintegration of the deuteron process. The complete seven-fold differential cross section is calculated via the Bethe-Heitler mechanism with final state interactions taken into account. The deuteron bound state is described by a relativistic covariant deuteron-nucleon vertex. With numerical results, we find that the differential cross section has st… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 30 pages, 16 figures

  42. arXiv:2210.04204  [pdf, ps, other

    math.NA

    Lasso trigonometric polynomial approximation for periodic function recovery in equidistant points

    Authors: Congpei An, Mou Cai

    Abstract: In this paper, we propose a fully discrete soft thresholding trigonometric polynomial approximation on $[-π,π],$ named Lasso trigonometric interpolation. This approximation is an $\ell_1$-regularized discrete least squares approximation under the same conditions of classical trigonometric interpolation on an equidistant grid. Lasso trigonometric interpolation is sparse and meanwhile it is an effic… ▽ More

    Submitted 21 September, 2023; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: 18 pages, 5 figures

  43. arXiv:2210.01910  [pdf, other

    cs.FL cs.LG

    Learning Signal Temporal Logic through Neural Network for Interpretable Classification

    Authors: Danyang Li, Mingyu Cai, Cristian-Ioan Vasile, Roberto Tron

    Abstract: Machine learning techniques using neural networks have achieved promising success for time-series data classification. However, the models that they produce are challenging to verify and interpret. In this paper, we propose an explainable neural-symbolic framework for the classification of time-series behaviors. In particular, we use an expressive formal language, namely Signal Temporal Logic (STL… ▽ More

    Submitted 30 June, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

  44. arXiv:2210.01162  [pdf, other

    cs.RO cs.AI cs.FL cs.LG math.OC

    Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications

    Authors: Mingyu Cai, Makai Mann, Zachary Serlin, Kevin Leahy, Cristian-Ioan Vasile

    Abstract: This paper explores continuous-time control synthesis for target-driven navigation to satisfy complex high-level tasks expressed as linear temporal logic (LTL). We propose a model-free framework using deep reinforcement learning (DRL) where the underlying dynamic system is unknown (an opaque box). Unlike prior work, this paper considers scenarios where the given LTL specification might be infeasib… ▽ More

    Submitted 16 March, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  45. arXiv:2209.07459  [pdf, other

    cs.RO cs.CV cs.LG

    A Robotic Visual Gras** Design: Rethinking Convolution Neural Network with High-Resolutions

    Authors: Zhangli Zhou, Shaochen Wang, Ziyang Chen, Mingyu Cai, Zhen Kan

    Abstract: High-resolution representations are important for vision-based robotic gras** problems. Existing works generally encode the input images into low-resolution representations via sub-networks and then recover high-resolution representations. This will lose spatial information, and errors introduced by the decoder will be more serious when multiple types of objects are considered or objects are far… ▽ More

    Submitted 15 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

  46. arXiv:2209.04260  [pdf, other

    astro-ph.HE hep-ex hep-ph physics.space-ph

    Search for relativistic fractionally charged particles in space

    Authors: DAMPE Collaboration, F. Alemanno, C. Altomare, Q. An, P. Azzarello, F. C. T. Barbato, P. Bernardini, X. J. Bi, M. S. Cai, E. Casilli, E. Catanzani, J. Chang, D. Y. Chen, J. L. Chen, Z. F. Chen, M. Y. Cui, T. S. Cui, Y. X. Cui, H. T. Dai, A. De-Benedittis, I. De Mitri, F. de Palma, M. Deliyergiyev, A. Di Giovanni, M. Di Santo , et al. (126 additional authors not shown)

    Abstract: More than a century after the performance of the oil drop experiment, the possible existence of fractionally charged particles FCP still remains unsettled. The search for FCPs is crucial for some extensions of the Standard Model in particle physics. Most of the previously conducted searches for FCPs in cosmic rays were based on experiments underground or at high altitudes. However, there have been… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 19 pages, 6 figures, accepted by PRD

    Report number: 106, 063026

    Journal ref: Physical Review D 106.6 (2022): 063026

  47. arXiv:2209.01481  [pdf, ps, other

    math.AG math.RT

    Decomposition of Frobenius pushforwards of line bundles on wonderful compactifications

    Authors: Merrick Cai, Vasily Krylov

    Abstract: De Concini-Procesi introduced varieties known as wonderful compactifications, which are smooth projective compactifications of semisimple adjoint groups $G$. We study the Frobenius pushforwards of invertible sheaves on the wonderful compactifications, and in particular its decomposition into locally free subsheaves. We give necessary and sufficient conditions for a specific line bundle to be a dir… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 54 pages

  48. arXiv:2208.12931  [pdf, other

    stat.CO

    How to relate potential outcomes: Estimating individual treatment effects under a given specified partial correlation

    Authors: Mingyang Cai, Stef van Buuren, Gerko Vink

    Abstract: In most medical research, the average treatment effect is used to evaluate a treatment's performance. However, precision medicine requires knowledge of individual treatment effects: What is the difference between a unit's measurement under treatment and control conditions? In most treatment effect studies, such answers are not possible because the outcomes under both experimental conditions are no… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

  49. arXiv:2208.12930  [pdf, ps, other

    stat.CO math.ST

    Joint distribution properties of Fully Conditional Specification under the normal linear model with normal inverse-gamma priors

    Authors: Mingyang Cai, Stef van Buuren, Gerko Vink

    Abstract: Fully conditional specification (FCS) is a convenient and flexible multiple imputation approach. It specifies a sequence of simple regression models instead of a potential complex joint density for missing variables. However, FCS may not converge to a stationary distribution. Many authors have studied the convergence properties of FCS when priors of conditional models are non-informative. We exten… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

  50. arXiv:2208.12929  [pdf, other

    stat.CO

    Graphical and numerical diagnostic tools to assess multiple imputation models by posterior predictive checking

    Authors: Mingyang Cai, Stef van Buuren, Gerko Vink

    Abstract: Missing data are often dealt with multiple imputation. A crucial part of the multiple imputation process is selecting sensible models to generate plausible values for incomplete data. A method based on posterior predictive checking is proposed to diagnose imputation models based on posterior predictive checking. To assess the congeniality of imputation models, the proposed diagnostic method compar… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.