Showing 51–100 of 741 results for author: Jiang, F

  1. arXiv:2401.12242  [pdf, other]

    cs.CR cs.LG

    BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

    Authors: Zhen Xiang, Fengqing Jiang, Zidi Xiong, Bhaskar Ramasubramanian, Radha Poovendran, Bo Li

    Abstract: Large language models (LLMs) are shown to benefit from chain-of-thought (COT) prompting, particularly when tackling tasks that require systematic reasoning processes. On the other hand, COT prompting also poses new vulnerabilities in the form of backdoor attacks, wherein the model will output unintended malicious content under specific backdoor-triggered conditions during inference. Traditional me… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR 2024

  2. arXiv:2401.10731  [pdf, other]

    cs.CV

    Removal and Selection: Improving RGB-Infrared Object Detection via Coarse-to-Fine Fusion

    Authors: Tianyi Zhao, Maoxun Yuan, Feng Jiang, Nan Wang, Xingxing Wei

    Abstract: Object detection in visible (RGB) and infrared (IR) images has been widely applied in recent years. Leveraging the complementary characteristics of RGB and IR images, the object detector provides reliable and robust object localization from day to night. Most existing fusion strategies directly input RGB and IR images into deep neural networks, leading to inferior detection performance. However, t… ▽ More

    Submitted 7 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: 11 pages, 11 figures

  3. arXiv:2401.09150  [pdf, other]

    cs.CL

    Bridging Research and Readers: A Multi-Modal Automated Academic Papers Interpretation System

    Authors: Feng Jiang, Kuang Wang, Haizhou Li

    Abstract: In the contemporary information era, significantly accelerated by the advent of Large-scale Language Models, the proliferation of scientific literature is reaching unprecedented levels. Researchers urgently require efficient tools for reading and summarizing academic papers, uncovering significant scientific literature, and employing diverse interpretative methodologies. To address this burgeoning… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  4. arXiv:2401.06426  [pdf, other]

    cs.CV cs.AI

    UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer

    Authors: Ji Liu, Dehua Tang, Yuanxian Huang, Li Zhang, Xiaocheng Zeng, Dong Li, Mingjie Lu, Jinzhang Peng, Yu Wang, Fan Jiang, Lu Tian, Ashish Sirasao

    Abstract: Traditional channel-wise pruning methods by reducing network channels struggle to effectively prune efficient CNN models with depth-wise convolutional layers and certain efficient modules, such as popular inverted residual blocks. Prior depth pruning methods by reducing network depths are not suitable for pruning some efficient models due to the existence of some normalization layers. Moreover, fi… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  5. arXiv:2401.05562  [pdf, ps, other]

    cs.LG cs.CR cs.DC

    Brave: Byzantine-Resilient and Privacy-Preserving Peer-to-Peer Federated Learning

    Authors: Zhangchen Xu, Fengqing Jiang, Luyao Niu, Jinyuan Jia, Radha Poovendran

    Abstract: Federated learning (FL) enables multiple participants to train a global machine learning model without sharing their private training data. Peer-to-peer (P2P) FL advances existing centralized FL paradigms by eliminating the server that aggregates local models from participants and then updates the global model. However, P2P FL is vulnerable to (i) honest-but-curious participants whose objective is… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  6. arXiv:2401.02673  [pdf, other]

    eess.AS cs.AI cs.SD

    A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model

    Authors: Dongdi Zhao, Jianbo Ma, Lu Lu, Jinke Li, Xuan Ji, Lei Zhu, Fuming Fang, Ming Liu, Feijun Jiang

    Abstract: Far-field speech recognition is a challenging task that conventionally uses signal processing beamforming to attack noise and interference problem. But the performance has been found usually limited due to heavy reliance on environmental assumption. In this paper, we propose a unified multichannel far-field speech recognition system that combines the neural beamforming and transformer-based Listen… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  7. arXiv:2401.02203  [pdf, other]

    stat.ML cs.LG

    Robust bilinear factor analysis based on the matrix-variate $t$ distribution

    Authors: Xuan Ma, Jianhua Zhao, Changchun Shang, Fen Jiang, Philip L. H. Yu

    Abstract: Factor Analysis based on multivariate $t$ distribution ($t$fa) is a useful robust tool for extracting common factors on heavy-tailed or contaminated data. However, $t$fa is only applicable to vector data. When $t$fa is applied to matrix data, it is common to first vectorize the matrix observations. This introduces two challenges for $t$fa: (i) the inherent matrix structure of the data is broken, a… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  8. arXiv:2312.07850  [pdf, other]

    cs.AI

    Large Language Model Enhanced Multi-Agent Systems for 6G Communications

    Authors: Feibo Jiang, Li Dong, Yubo Peng, Kezhi Wang, Kun Yang, Cunhua Pan, Dusit Niyato, Octavia A. Dobre

    Abstract: The rapid development of the Large Language Model (LLM) presents huge opportunities for 6G communications, e.g., network optimization and management by allowing users to input task requirements to LLMs by natural language. However, directly applying native LLMs in 6G encounters various challenges, such as a lack of private communication data and knowledge, limited logical reasoning, evaluation, and… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Submitted for possible journal publication

  9. arXiv:2311.16153  [pdf, other]

    cs.CR cs.AI

    Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications

    Authors: Fengqing Jiang, Zhangchen Xu, Luyao Niu, Boxin Wang, Jinyuan Jia, Bo Li, Radha Poovendran

    Abstract: Large language models (LLMs) are increasingly deployed as the service backend for LLM-integrated applications such as code completion and AI-powered search. LLM-integrated applications serve as middleware to refine users' queries with domain-specific knowledge to better inform LLMs and enhance the responses. Despite numerous opportunities and benefits, LLM-integrated applications also introduce ne… ▽ More

    Submitted 28 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

  10. arXiv:2311.15564  [pdf, other]

    cs.CL cs.IR

    Boot and Switch: Alternating Distillation for Zero-Shot Dense Retrieval

    Authors: Fan Jiang, Qiongkai Xu, Tom Drummond, Trevor Cohn

    Abstract: Neural 'dense' retrieval models are state of the art for many datasets, however these models often exhibit limited domain transfer ability. Existing approaches to adaptation are unwieldy, such as requiring explicit supervision, complex model architectures, or massive external models. We present $\texttt{ABEL}$, a simple but effective unsupervised method to enhance passage retrieval in zero-shot se… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted by EMNLP 2023 Findings

  11. arXiv:2311.15563  [pdf, other]

    cs.CL cs.IR

    Noisy Self-Training with Synthetic Queries for Dense Retrieval

    Authors: Fan Jiang, Tom Drummond, Trevor Cohn

    Abstract: Although existing neural retrieval models reveal promising results when training data is abundant and the performance keeps improving as training data increases, collecting high-quality annotated data is prohibitively costly. To this end, we introduce a novel noisy self-training framework combined with synthetic queries, showing that neural retrievers can be improved in a self-evolution manner wit… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted by EMNLP 2023 Findings

  12. arXiv:2311.12273  [pdf, other]

    cs.NI eess.SY

    How AI-driven Digital Twins Can Empower Mobile Networks

    Authors: Tong Li, Fenyu Jiang, Qiaohong Yu, Wenzhen Huang, Tao Jiang, Depeng Jin

    Abstract: The growing complexity of next-generation networks exacerbates the modeling and algorithmic flaws of conventional network optimization methodology. In this paper, we propose a mobile network digital twin (MNDT) architecture for 6G networks. To address the modeling and algorithmic shortcomings, the MNDT uses a simulation-optimization structure. The feedback from the network simulation engine, which… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  13. arXiv:2311.09774  [pdf, other]

    cs.CL cs.AI cs.LG

    HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

    Authors: Junying Chen, Xidong Wang, Anningzhe Gao, Feng Jiang, Shunian Chen, Hongbo Zhang, Dingjie Song, Wenya Xie, Chuyi Kong, Jianquan Li, Xiang Wan, Haizhou Li, Benyou Wang

    Abstract: Adapting a language model into a specific domain, a.k.a `domain adaption', is a common practice when specialized knowledge, e.g. medicine, is not encapsulated in a general language model like Llama2. The challenge lies in the heterogeneity of data across the two training stages, as it varies in languages, genres, or formats. To tackle this and simplify the learning protocol, we propose to transfor… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  14. arXiv:2311.05676  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.HE hep-ph

    Probabilistic Inference of the Structure and Orbit of Milky Way Satellites with Semi-Analytic Modeling

    Authors: Dylan Folsom, Oren Slone, Mariangela Lisanti, Fangzhou Jiang, Manoj Kaplinghat

    Abstract: Semi-analytic modeling furnishes an efficient avenue for characterizing the properties of dark matter halos associated with satellites of Milky Way-like systems, as it easily accounts for uncertainties arising from halo-to-halo variance, the orbital disruption of satellites, baryonic feedback, and the stellar-to-halo mass (SMHM) relation. We use the SatGen semi-analytic satellite generator -- whic… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 14 pages, 7 figures; 9 pages of appendices with 6 supplementary figures. Comments welcome! The code and data used in this work are available at https://github.com/folsomde/Semianalytic_Inference/

  15. arXiv:2310.11722  [pdf, other]

    cs.CL cs.AI

    Quantifying Self-diagnostic Atomic Knowledge in Chinese Medical Foundation Model: A Computational Analysis

    Authors: Yaxin Fan, Feng Jiang, Benyou Wang, Peifeng Li, Haizhou Li

    Abstract: Foundation Models (FMs) have the potential to revolutionize the way users self-diagnose through search engines by offering direct and efficient suggestions. Recent studies primarily focused on the quality of FMs evaluated by GPT-4 or their ability to pass medical exams, no studies have quantified the extent of self-diagnostic atomic knowledge stored in FMs' memory, which is the basis of foundation… ▽ More

    Submitted 1 April, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  16. arXiv:2310.09910  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.HE hep-ph

    Till the core collapses: the evolution and properties of self-interacting dark matter subhalos

    Authors: Zhichao Carton Zeng, Annika H. G. Peter, Xiaolong Du, Shengqi Yang, Andrew Benson, Francis-Yan Cyr-Racine, Fangzhou Jiang, Charlie Mace, R. Benton Metcalf

    Abstract: One of the hottest questions in the cosmology of self-interacting dark matter (SIDM) is whether scatterings can induce detectable core-collapse in halos by the present day. Because gravitational tides can accelerate core-collapse, the most promising targets to observe core-collapse are satellite galaxies and subhalo systems. However, simulating small subhalos is computationally intensive, especial… ▽ More

    Submitted 4 November, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: 35 pages, 19 figures, comments are welcome; added subsection Results F for v2 submission; submitted to PRD

  17. arXiv:2310.08826  [pdf, other]

    cs.CV

    Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving

    Authors: Feng Jiang, Chaoping Tu, Gang Zhang, Jun Li, Hanqing Huang, Junyu Lin, Di Feng, Jian Pu

    Abstract: LiDAR and camera are two critical sensors for multi-modal 3D semantic segmentation and are supposed to be fused efficiently and robustly to promise safety in various real-world scenarios. However, existing multi-modal methods face two key challenges: 1) difficulty with efficient deployment and real-time execution; and 2) drastic performance degradation under weak calibration between LiDAR and came… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 7 pages, 3 figures

  18. arXiv:2309.13370  [pdf, ps, other]

    math.AP

    Rayleigh-Taylor Instability in Stratified Compressible Fluids with/without the Interfacial Surface Tension

    Authors: Fei Jiang, Han Jiang, Song Jiang

    Abstract: Guo--Tice formally established in 2011 that the Rayleigh--Taylor instability inevitably occurs within stratified compressible viscous fluids in a slab domain $\mathbb{R}^2\times (h_-,h_+)$, irrespective of the presence of interfacial surface tension, where the instability solutions are non-periodic with respect to both horizontal spatial variables $x_1$ and $x_2$, by applying a so-called ''normal m… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:0911.4703 by other authors

  19. arXiv:2309.09984  [pdf]

    q-bio.NC cs.NE

    BDEC:Brain Deep Embedded Clustering model

    Authors: Xiaoxiao Ma, Chunzhi Yi, Zhicai Zhong, Hui Zhou, Baichun Wei, Haiqi Zhu, Feng Jiang

    Abstract: An essential premise for neuroscience brain network analysis is the successful segmentation of the cerebral cortex into functionally homogeneous regions. Resting-state functional magnetic resonance imaging (rs-fMRI), capturing the spontaneous activities of the brain, provides the potential for cortical parcellation. Previous parcellation methods can be roughly categorized into three groups, mainly… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  20. arXiv:2309.08803  [pdf, other]

    cs.RO eess.SP

    Robust Indoor Localization with Ranging-IMU Fusion

    Authors: Fan Jiang, David Caruso, Ashutosh Dhekne, Qi Qu, Jakob Julian Engel, Jing Dong

    Abstract: Indoor wireless ranging localization is a promising approach for low-power and high-accuracy localization of wearable devices. A primary challenge in this domain stems from non-line of sight propagation of radio waves. This study tackles a fundamental issue in wireless ranging: the unpredictability of real-time multipath determination, especially in challenging conditions such as when there is no… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  21. arXiv:2309.05899  [pdf]

    physics.app-ph physics.chem-ph

    Kinetic Suppression of Photoinduced Halide Migration in Wide Bandgap Perovskites via Surface Passivation

    Authors: Farhad Akrami, Fangyuan Jiang, Rajiv Giridharagopal, David S. Ginger

    Abstract: In this work, we study the kinetics of photoinduced halide migration in FA$_{0.8}$Cs$_{0.2}$Pb(I$_{0.8}$Br$_{0.2}$)$_3$ wide (~1.69 eV) bandgap perovskites and show halide migration slows down following surface passivation with (3-aminopropyl) trimethoxysilane (APTMS). We use scanning Kelvin probe microscopy (SKPM) to probe the contact potential difference (CPD) shift under illumination, and the k… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  22. arXiv:2309.05849  [pdf, ps, other]

    eess.SP

    A Novel Catastrophic Condition for Periodically Time-varying Convolutional Encoders Based on Time-varying Equivalent Convolutional Encoders

    Authors: Fan Jiang

    Abstract: A convolutional encoder is said to be catastrophic if it maps an information sequence of infinite weight into a code sequence of finite weight. As a consequence of this mapping, a finite number of channel errors may cause an infinite number of information bit errors when decoding. This situation should be avoided. A catastrophic condition to determine if a time-invariant convolutional encoder is c… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  23. arXiv:2309.01249  [pdf, other]

    cs.AI cs.CL cs.LG

    Large AI Model Empowered Multimodal Semantic Communications

    Authors: Feibo Jiang, Yubo Peng, Li Dong, Kezhi Wang, Kun Yang, Cunhua Pan, Xiaohu You

    Abstract: Multimodal signals, including text, audio, image and video, can be integrated into Semantic Communication (SC) for providing an immersive experience with low latency and high quality at the semantic level. However, the multimodal SC has several challenges, including data heterogeneity, semantic ambiguity, and signal fading. Recent advancements in large AI models, particularly in Multimodal Languag… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: To be submitted for journal publication

  24. arXiv:2308.15673  [pdf, other]

    cs.CR cs.LG

    MDTD: A Multi Domain Trojan Detector for Deep Neural Networks

    Authors: Arezoo Rajabi, Surudhi Asokraj, Fengqing Jiang, Luyao Niu, Bhaskar Ramasubramanian, Jim Ritcey, Radha Poovendran

    Abstract: Machine learning models that use deep neural networks (DNNs) are vulnerable to backdoor attacks. An adversary carrying out a backdoor attack embeds a predefined perturbation called a trigger into a small subset of input samples and trains the DNN such that the presence of the trigger in the input results in an adversary-desired output class. Such adversarial retraining however needs to ensure that… ▽ More

    Submitted 2 September, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM Conference on Computer and Communications Security (ACM CCS) 2023

  25. arXiv:2308.15078  [pdf, other]

    cs.AI cs.NI

    LAMBO: Large Language Model Empowered Edge Intelligence

    Authors: Li Dong, Feibo Jiang, Yubo Peng, Kezhi Wang, Kun Yang, Cunhua Pan, Robert Schober

    Abstract: Next-generation edge intelligence is anticipated to bring huge benefits to various applications, e.g., offloading systems. However, traditional deep offloading architectures face several issues, including heterogeneous constraints, partial perception, uncertain generalization, and lack of tractability. In this context, the integration of offloading with large language models (LLMs) presents numero… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: To be submitted for possible journal publication

  26. arXiv:2308.11534  [pdf, other]

    cs.CL cs.AI

    PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator

    Authors: Chuyi Kong, Yaxin Fan, Xiang Wan, Feng Jiang, Benyou Wang

    Abstract: The unparalleled performance of closed-sourced ChatGPT has sparked efforts towards its democratization, with notable strides made by leveraging real user and ChatGPT dialogues, as evidenced by Vicuna. However, due to challenges in gathering dialogues involving human participation, current endeavors like Baize and UltraChat rely on ChatGPT conducting roleplay to simulate humans based on instruction… ▽ More

    Submitted 27 May, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted to ACL 2024 (main conference)

  27. arXiv:2308.10634  [pdf, other]

    eess.SY

    Data-Driven Reachability Analysis of Pedestrians Using Behavior Modes

    Authors: August Söderlund, Frank J. Jiang, Vandana Narri, Amr Alanwar, Karl H. Johansson

    Abstract: In this paper, we present a data-driven approach for safely predicting the future state sets of pedestrians. Previous approaches to predicting the future state sets of pedestrians either do not provide safety guarantees or are overly conservative. Moreover, an additional challenge is the selection or identification of a model that sufficiently captures the motion of pedestrians. To address these i… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  28. arXiv:2308.08833  [pdf, other]

    cs.CL cs.AI

    CMB: A Comprehensive Medical Benchmark in Chinese

    Authors: Xidong Wang, Guiming Hardy Chen, Dingjie Song, Zhiyi Zhang, Zhihong Chen, Qingying Xiao, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li

    Abstract: Large Language Models (LLMs) provide a possibility to make a great breakthrough in medicine. The establishment of a standardized medical benchmark becomes a fundamental cornerstone to measure progression. However, medical environments in different regions have their local characteristics, e.g., the ubiquity and significance of traditional Chinese medicine within China. Therefore, merely translatin… ▽ More

    Submitted 4 April, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted to NAACL 2024 Main Conference

  29. arXiv:2308.08488  [pdf, other]

    cs.CL cs.AI cs.SD eess.AS

    Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder

    Authors: Yusheng Dai, Hang Chen, Jun Du, Xiaofei Ding, Ning Ding, Feijun Jiang, Chin-Hui Lee

    Abstract: In recent research, slight performance improvement is observed from automatic speech recognition systems to audio-visual speech recognition systems in the end-to-end framework with low-quality videos. Unmatching convergence rates and specialized input representations between audio and visual modalities are considered to cause the problem. In this paper, we propose two novel techniques to improve a… ▽ More

    Submitted 8 March, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: 6 pages, 2 figures, published in ICME 2023

  30. arXiv:2308.08084  [pdf]

    physics.chem-ph

    Architecture Optimization Dramatically Improves Reverse Bias Stability in Perovskite Solar Cells: A Role of Polymer Hole Transport Layers

    Authors: Fangyuan Jiang, Yangwei Shi, Tanka R. Rana, Daniel Morales, Isaac Gould, Declan P. McCarthy, Joel Smith, Grey Christoforo, Hannah Contreras, Stephen Barlow, Aditya D. Mohite, Henry Snaith, Seth R. Marder, J. Devin MacKenzie, Michael D. McGehee, David S. Ginger

    Abstract: We report that device architecture engineering has a substantial impact on the reverse bias instability that has been reported as a critical issue in commercializing perovskite solar cells. We demonstrate breakdown voltages exceeding -15 V in typical pin structured perovskite solar cells via two steps: i) using polymer hole transporting materials; ii) using a more electrochemically stable gold ele… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  31. arXiv:2308.06943  [pdf, ps, other]

    cs.CY

    Current Status and Trends of Engineering Entrepreneurship Education in Australian Universities

    Authors: Jianhua Li, Sophie Mckenzie, Richard Dazeley, Frank Jiang, Keshav Sood

    Abstract: This research sheds light on the present and future landscape of Engineering Entrepreneurship Education (EEE) by exploring varied approaches and models adopted in Australian universities, evaluating program effectiveness, and offering recommendations for curriculum enhancement. While EEE programs have been in existence for over two decades, their efficacy remains underexplored. Using a multi-metho… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  32. arXiv:2308.03169  [pdf, ps, other]

    math.GM

    Effects From Extra Die Rolls and Choosing the Highest or Lowest

    Authors: Fan Jiang, Elvin Jiang

    Abstract: This paper looks into the gain or loss from rolling a fair die multiple times and choosing the highest or lowest number as the outcome over rolling the die just once. Specifically, this paper gives a general formula for the expected value of choosing the highest or lowest value of any number of die rolls and sides. It also shows how, for a fixed number of rolls, the ratio between this expected val… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  33. arXiv:2307.13923  [pdf, other]

    cs.CL

    GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning

    Authors: Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li

    Abstract: Grammatical error correction aims to correct ungrammatical sentences automatically. Recently, some work has demonstrated the excellent capabilities of closed-source Large Language Models (LLMs, e.g., ChatGPT) in grammatical error correction. However, the potential of open-source LLMs remains unexplored. In this paper, we introduced GrammarGPT, an open-source LLM, to preliminarily explore its potenti… ▽ More

    Submitted 17 August, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  34. arXiv:2307.12682  [pdf]

    q-bio.BM

    Pro-PRIME: A general Temperature-Guided Language model to engineer enhanced Stability and Activity in Proteins

    Authors: Pan Tan, Mingchen Li, Yuanxi Yu, Fan Jiang, Lirong Zheng, Banghao Wu, Xinyu Sun, Liqi Kang, Jie Song, Liang Zhang, Yi Xiong, Wanli Ouyang, Zhiqiang Hu, Guisheng Fan, Yufeng Pei, Liang Hong

    Abstract: Designing protein mutants of both high stability and activity is a critical yet challenging task in protein engineering. Here, we introduce Pro-PRIME, a deep learning zero-shot model, which can suggest protein mutants of improved stability and activity without any prior experimental mutagenesis data. By leveraging temperature-guided language modelling, Pro-PRIME demonstrated superior predictive po… ▽ More

    Submitted 13 May, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2304.03780

  35. arXiv:2307.11380  [pdf, other]

    cs.CL

    Is ChatGPT Involved in Texts? Measure the Polish Ratio to Detect ChatGPT-Generated Text

    Authors: Lingyi Yang, Feng Jiang, Haizhou Li

    Abstract: The remarkable capabilities of large-scale language models, such as ChatGPT, in text generation have impressed readers and spurred researchers to devise detectors to mitigate potential risks, including misinformation, phishing, and academic dishonesty. Despite this, most previous studies have been predominantly geared towards creating detectors that differentiate between purely ChatGPT-generated t… ▽ More

    Submitted 30 December, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  36. arXiv:2307.04318  [pdf, other]

    stat.ME

    Two-Sample and Change-Point Inference for Non-Euclidean Valued Time Series

    Authors: Feiyu Jiang, Changbo Zhu, Xiaofeng Shao

    Abstract: Data objects taking value in a general metric space have become increasingly common in modern data analysis. In this paper, we study two important statistical inference problems, namely, two-sample testing and change-point detection, for such non-Euclidean data under temporal dependence. Typical examples of non-Euclidean valued time series include yearly mortality distributions, time-varying netwo… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

  37. arXiv:2307.03492  [pdf, other]

    cs.AI cs.NI

    Large AI Model-Based Semantic Communications

    Authors: Feibo Jiang, Yubo Peng, Li Dong, Kezhi Wang, Kun Yang, Cunhua Pan, Xiaohu You

    Abstract: Semantic communication (SC) is an emerging intelligent paradigm, offering solutions for various future applications like metaverse, mixed-reality, and the Internet of everything. However, in current SC systems, the construction of the knowledge base (KB) faces several issues, including limited knowledge representation, frequent knowledge updates, and insecure knowledge sharing. Fortunately, the de… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Plan to submit it to journal for possible publication

  38. arXiv:2306.16183  [pdf, ps, other]

    math.FA math.CA math.OC

    Roots, trace, and extendability of flat nonnegative smooth functions

    Authors: Fushuai Jiang

    Abstract: Building on the univariate techniques developed by Ray and Schmidt-Hieber, we study the class $\mathcal{F}^s(\mathbb{R}^n)$ of multivariate nonnegative smooth functions that are sufficiently flat near their zeroes, which guarantees that $\varphi^r$ has Hölder differentiability $rs$ whenever $\varphi \in \mathcal{F}^s$. We then construct a continuous Whitney extension map that recovers an… ▽ More

    Submitted 9 January, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 25 pages. Lemma 3.1 statement corrected

    MSC Class: 26B35; 26B05; 41A29

  39. arXiv:2306.12508  [pdf, other]

    cs.LO cs.CC cs.DS eess.SY

    Polynomial Logical Zonotopes: A Set Representation for Reachability Analysis of Logical Systems

    Authors: Amr Alanwar, Frank J. Jiang, Karl H. Johansson

    Abstract: In this paper, we introduce a set representation called polynomial logical zonotopes for performing exact and computationally efficient reachability analysis on logical systems. Polynomial logical zonotopes are a generalization of logical zonotopes, which are able to represent up to 2^n binary vectors using only n generators. Due to their construction, logical zonotopes are only able to support ex… ▽ More

    Submitted 1 March, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

  40. arXiv:2306.10577  [pdf, other]

    cs.LG stat.ML

    OpenDataVal: a Unified Benchmark for Data Valuation

    Authors: Kevin Fu Jiang, Weixin Liang, James Zou, Yongchan Kwon

    Abstract: Assessing the quality and impact of individual data points is critical for improving model performance and mitigating undesirable biases within the training dataset. Several data valuation algorithms have been proposed to quantify data quality, however, there lacks a systemic and standardized benchmarking system for data valuation. In this paper, we introduce OpenDataVal, an easy-to-use and unifie… ▽ More

    Submitted 13 October, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 25 pages, NeurIPS 2023 Track on Datasets and Benchmarks

  41. arXiv:2306.10275  [pdf, other]

    eess.SY

    Multi-Scale Simulation of Complex Systems: A Perspective of Integrating Knowledge and Data

    Authors: Huandong Wang, Huan Yan, Can Rong, Yuan Yuan, Fenyu Jiang, Zhenyu Han, Hongjie Sui, Depeng Jin, Yong Li

    Abstract: Complex system simulation has been playing an irreplaceable role in understanding, predicting, and controlling diverse complex systems. In the past few decades, the multi-scale simulation technique has drawn increasing attention for its remarkable ability to overcome the challenges of complex system simulation with unknown mechanisms and expensive computational costs. In this survey, we will syste… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

  42. arXiv:2306.05169  [pdf, other]

    stat.ME econ.EM

    Matrix GARCH Model: Inference and Application

    Authors: Cheng Yu, Dong Li, Feiyu Jiang, Ke Zhu

    Abstract: Matrix-variate time series data are largely available in applications. However, no attempt has been made to study their conditional heteroskedasticity that is often observed in economic and financial data. To address this gap, we propose a novel matrix generalized autoregressive conditional heteroskedasticity (GARCH) model to capture the dynamics of conditional row and column covariance matrices o… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  43. arXiv:2305.15075  [pdf, other]

    cs.CL cs.AI

    HuatuoGPT, towards Taming Language Model to Be a Doctor

    Authors: Hongbo Zhang, Junying Chen, Feng Jiang, Fei Yu, Zhihong Chen, Jianquan Li, Guiming Chen, Xiangbo Wu, Zhiyi Zhang, Qingying Xiao, Xiang Wan, Benyou Wang, Haizhou Li

    Abstract: In this paper, we present HuatuoGPT, a large language model (LLM) for medical consultation. The core recipe of HuatuoGPT is to leverage both \textit{distilled data from ChatGPT} and \textit{real-world data from doctors} in the supervised fine-tuned stage. The responses of ChatGPT are usually detailed, well-presented and informative while it cannot perform like a doctor in many aspects, e.g. for in…

    Submitted 24 May, 2023; originally announced May 2023.

  44. arXiv:2305.14790  [pdf, other]

    cs.CL cs.AI

    Advancing Topic Segmentation and Outline Generation in Chinese Texts: The Paragraph-level Topic Representation, Corpus, and Benchmark

    Authors: Feng Jiang, Weihao Liu, Xiaomin Chu, Peifeng Li, Qiaoming Zhu, Haizhou Li

    Abstract: Topic segmentation and outline generation strive to divide a document into coherent topic sections and generate corresponding subheadings, unveiling the discourse topic structure of a document. Compared with sentence-level topic structure, the paragraph-level topic structure can quickly grasp and understand the overall context of the document from a higher level, benefitting many downstream tasks…

    Submitted 26 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by LREC-COLING 2024

  45. arXiv:2305.13755  [pdf, other]

    cs.CL cs.AI

    Topic-driven Distant Supervision Framework for Macro-level Discourse Parsing

    Authors: Feng Jiang, Longwang He, Peifeng Li, Qiaoming Zhu, Haizhou Li

    Abstract: Discourse parsing, the task of analyzing the internal rhetorical structure of texts, is a challenging problem in natural language processing. Despite the recent advances in neural models, the lack of large-scale, high-quality corpora for training remains a major obstacle. Recent studies have attempted to overcome this limitation by using distant supervision, which utilizes results from other NLP t…

    Submitted 23 May, 2023; originally announced May 2023.

  46. arXiv:2305.08391  [pdf, other]

    cs.CL

    Uncovering the Potential of ChatGPT for Discourse Analysis in Dialogue: An Empirical Study

    Authors: Yaxin Fan, Feng Jiang, Peifeng Li, Haizhou Li

    Abstract: Large language models, like ChatGPT, have shown remarkable capability in many downstream tasks, yet their ability to understand discourse structures of dialogues remains less explored, where it requires higher level capabilities of understanding and reasoning. In this paper, we aim to systematically inspect ChatGPT's performance in two discourse analysis tasks: topic segmentation and discourse par…

    Submitted 5 March, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted by LREC-COLING'2024

  47. arXiv:2305.05848  [pdf, other]

    cs.IR cs.AI

    Dual Intent Enhanced Graph Neural Network for Session-based New Item Recommendation

    Authors: Di Jin, Luzhi Wang, Yizhen Zheng, Guojie Song, Fei Jiang, Xiang Li, Wei Lin, Shirui Pan

    Abstract: Recommender systems are essential to various fields, e.g., e-commerce, e-learning, and streaming media. At present, graph neural networks (GNNs) for session-based recommendations normally can only recommend items existing in users' historical sessions. As a result, these GNNs have difficulty recommending items that users have never interacted with (new items), which leads to a phenomenon of inform…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 10 Pages, 6 figures, WWW'2023

  48. arXiv:2305.05067  [pdf, other]

    astro-ph.CO astro-ph.GA

    A quantitative comparison between velocity dependent SIDM cross sections constrained by the gravothermal and isothermal models

    Authors: Shengqi Yang, Fangzhou Jiang, Andrew Benson, Yi-Ming Zhong, Charlie Mace, Xiaolong Du, Zhichao Carton Zeng, Annika H. G. Peter, Moritz S. Fischer

    Abstract: One necessary step for probing the nature of self-interacting dark matter (SIDM) particles with astrophysical observations is to pin down any possible velocity dependence in the SIDM cross section. Major challenges for achieving this goal include eliminating, or mitigating, the impact of the baryonic components and tidal effects within the dark matter halos of interest -- the effects of these proc…

    Submitted 26 April, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: 15 pages, 11 figures

  49. arXiv:2305.01195  [pdf, other]

    cs.CL cs.LG

    Topic Shift Detection in Chinese Dialogues: Corpus and Benchmark

    Authors: Jiangyi Lin, Yaxin Fan, Feng Jiang, Xiaomin Chu, Peifeng Li

    Abstract: Dialogue topic shift detection is to detect whether an ongoing topic has shifted or should shift in a dialogue, which can be divided into two categories, i.e., response-known task and response-unknown task. Currently, only a few investigated the latter, because it is still a challenge to predict the topic shift without the response information. In this paper, we first annotate a Chinese Natural To…

    Submitted 2 May, 2023; originally announced May 2023.

  50. arXiv:2305.00489  [pdf]

    cs.CV cs.MM

    Learned Focused Plenoptic Image Compression with Microimage Preprocessing and Global Attention

    Authors: Kedeng Tong, Xin Jin, Yuqing Yang, Chen Wang, Jinshi Kang, Fan Jiang

    Abstract: Focused plenoptic cameras can record spatial and angular information of the light field (LF) simultaneously with higher spatial resolution relative to traditional plenoptic cameras, which facilitate various applications in computer vision. However, the existing plenoptic image compression methods present ineffectiveness to the captured images due to the complex micro-textures generated by the micr…

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 14 pages, 15 figures, accepted by IEEE Transactions on Multimedia