Showing 1–50 of 106 results for author: Zhong, L

Searching in archive cs.
  1. arXiv:2406.13209  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Diffusion Model-based FOD Restoration from High Distortion in dMRI

    Authors: Shuo Huang, Lujia Zhong, Yonggang Shi

    Abstract: Fiber orientation distributions (FODs) are a popular model to represent the diffusion MRI (dMRI) data. However, imaging artifacts such as susceptibility-induced distortion in dMRI can cause signal loss and lead to the corrupted reconstruction of FODs, which prohibits successful fiber tracking and connectivity analysis in affected brain regions such as the brain stem. Generative models, such as the…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures

  2. arXiv:2406.12793  [pdf, other]

    cs.CL

    ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

    Authors: Team GLM: Aohan Zeng, Bin Xu, Bowen Wang, Chenhui Zhang, Da Yin, Diego Rojas, Guanyu Feng, Hanlin Zhao, Hanyu Lai, Hao Yu, Hongning Wang, Jiadai Sun, Jiajie Zhang, Jiale Cheng, Jiayi Gui, Jie Tang, Jing Zhang, Juanzi Li, Lei Zhao, Lindong Wu, Lucen Zhong, Mingdao Liu, Minlie Huang, et al. (32 additional authors not shown)

    Abstract: We introduce ChatGLM, an evolving family of large language models that we have been developing over time. This report primarily focuses on the GLM-4 language series, which includes GLM-4, GLM-4-Air, and GLM-4-9B. They represent our most capable models that are trained with all the insights and lessons gained from the preceding three generations of ChatGLM. To date, the GLM-4 models are pre-trained…

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.10710  [pdf, other]

    cs.AI cs.CL

    SyntheT2C: Generating Synthetic Data for Fine-Tuning Large Language Models on the Text2Cypher Task

    Authors: Ziije Zhong, Linqing Zhong, Zhaoze Sun, Qingyun Jin, Zengchang Qin, Xiaofan Zhang

    Abstract: Integrating Large Language Models (LLMs) with existing Knowledge Graph (KG) databases presents a promising avenue for enhancing LLMs' efficacy and mitigating their "hallucinations". Given that most KGs reside in graph databases accessible solely through specialized query languages (e.g., Cypher), there exists a critical need to bridge the divide between LLMs and KG databases by automating the tran…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 19 pages, 15 figures, 8 tables

  4. arXiv:2406.10594  [pdf, other]

    cs.CL

    BlockPruner: Fine-grained Pruning for Large Language Models

    Authors: Longguang Zhong, Fanqi Wan, Ruijun Chen, Xiaojun Quan, Liangzhi Li

    Abstract: With the rapid growth in the size and complexity of large language models (LLMs), the costs associated with their training and inference have escalated significantly. Research indicates that certain layers in LLMs harbor substantial redundancy, and pruning these layers has minimal impact on the overall performance. While various layer pruning methods have been developed based on this insight, they…

    Submitted 20 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  5. arXiv:2406.08491  [pdf, other]

    quant-ph cs.DC

    FPGA-based Distributed Union-Find Decoder for Surface Codes

    Authors: Namitha Liyanage, Yue Wu, Siona Tagare, Lin Zhong

    Abstract: A fault-tolerant quantum computer must decode and correct errors faster than they appear to prevent exponential slowdown due to error correction. The Union-Find (UF) decoder is promising with an average time complexity slightly higher than $O(d^3)$. We report a distributed version of the UF decoder that exploits parallel computing resources for further speedup. Using an FPGA-based implementation,…

    Submitted 20 March, 2024; originally announced June 2024.

    Comments: The article extends the work in arXiv:2301.08419, which also appeared in https://ieeexplore.ieee.org/document/10313800

  6. arXiv:2406.06889  [pdf, other]

    physics.soc-ph cs.SI

    Universal spatial inflation of human mobility

    Authors: Lu Zhong, Lei Dong, Qi Wang, Chaoming Song, Jianxi Gao

    Abstract: Understanding the interplay between egocentric preference and urban structure in shaping human mobility has profound implications for improving epidemic intervention, social equity, and urban resilience. However, numerous existing studies either solely identify the egocentric preferences -- the anchoring effects from home -- or the impact of hierarchical urban structures. Here, we propose a networ…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures

  7. arXiv:2406.03746  [pdf, other]

    cs.CL cs.AI

    Efficient Knowledge Infusion via KG-LLM Alignment

    Authors: Zhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, Shuhan Luo, Zhiqiang Zhang

    Abstract: To tackle the problem of domain-specific knowledge scarcity within large language models (LLMs), the knowledge graph retrieval-augmented method has been proven to be an effective and efficient technique for knowledge infusion. However, existing approaches face two primary challenges: knowledge mismatch between publicly available knowledge graphs and the specific domain of the task at hand, and poor infor…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ACL2024 Findings

  8. arXiv:2405.16501  [pdf, other]

    cs.CV

    User-Friendly Customized Generation with Multi-Modal Prompts

    Authors: Linhao Zhong, Yan Hong, Wentao Chen, Binglin Zhou, Yiyi Zhang, Jianfu Zhang, Liqing Zhang

    Abstract: Text-to-image generation models have seen considerable advancement, catering to the increasing interest in personalized image creation. Current customization techniques often necessitate users to provide multiple images (typically 3-5) for each customized object, along with the classification of these objects and descriptive textual prompts for scenes. This paper questions whether the process can…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures

  9. arXiv:2405.13199  [pdf, ps, other]

    eess.IV cs.CV

    TauAD: MRI-free Tau Anomaly Detection in PET Imaging via Conditioned Diffusion Models

    Authors: Lujia Zhong, Shuo Huang, Jiaxin Yue, Jianwei Zhang, Zhiwei Deng, Wenhao Chi, Yonggang Shi

    Abstract: The emergence of tau PET imaging over the last decade has enabled Alzheimer's disease (AD) researchers to examine tau pathology in vivo and more effectively characterize the disease trajectories of AD. Current tau PET analysis methods, however, typically perform inferences on large cortical ROIs and are limited in the detection of localized tau pathology that varies across subjects. Furthermore, a…

    Submitted 21 May, 2024; originally announced May 2024.

  10. arXiv:2405.03155  [pdf, other]

    cs.RO

    CushSense: Soft, Stretchable, and Comfortable Tactile-Sensing Skin for Physical Human-Robot Interaction

    Authors: Boxin Xu, Luoyan Zhong, Grace Zhang, Xiaoyu Liang, Diego Virtue, Rishabh Madan, Tapomayukh Bhattacharjee

    Abstract: Whole-arm tactile feedback is crucial for robots to ensure safe physical interaction with their surroundings. This paper introduces CushSense, a fabric-based soft and stretchable tactile-sensing skin designed for physical human-robot interaction (pHRI) tasks such as robotic caregiving. Using stretchable fabric and hyper-elastic polymer, CushSense identifies contacts by monitoring capacitive change…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 8 pages, 8 figures, ICRA2024

  11. arXiv:2403.15836  [pdf, other]

    cs.CV

    VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification

    Authors: Lanfeng Zhong, Xin Liao, Shaoting Zhang, Xiaofan Zhang, Guotai Wang

    Abstract: Although deep learning methods have achieved remarkable performance in pathology image classification, they heavily rely on labeled data, demanding extensive human annotation efforts. In this study, we present a novel human annotation-free method for pathology image classification by leveraging pre-trained Vision-Language Models (VLMs). Without human annotation, pseudo labels of the training s…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Under review

  12. arXiv:2403.09434  [pdf, other]

    cs.CV

    Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians

    Authors: Licheng Zhong, Hong-Xing Yu, Jiajun Wu, Yunzhu Li

    Abstract: Reconstructing and simulating elastic objects from visual observations is crucial for applications in computer vision and robotics. Existing methods, such as 3D Gaussians, model 3D appearance and geometry, but lack the ability to estimate physical properties for objects and simulate them. The core challenge lies in integrating an expressive yet efficient physical dynamics model. We propose Spring-…

    Submitted 7 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  13. arXiv:2403.02630  [pdf, other]

    cs.LG cs.IR cs.SI

    FedHCDR: Federated Cross-Domain Recommendation with Hypergraph Signal Decoupling

    Authors: Hongyu Zhang, Dongyi Zheng, Lin Zhong, Xu Yang, Jiyuan Feng, Yunqing Feng, Qing Liao

    Abstract: In recent years, Cross-Domain Recommendation (CDR) has drawn significant attention, which utilizes user data from multiple domains to enhance the recommendation performance. However, current CDR methods require sharing user data across domains, thereby violating the General Data Protection Regulation (GDPR). Consequently, numerous approaches have been proposed for Federated Cross-Domain Recommenda…

    Submitted 10 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 16 pages, 5 figures

  14. arXiv:2402.16906  [pdf, other]

    cs.SE cs.AI cs.CL

    Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

    Authors: Li Zhong, Zilong Wang, Jingbo Shang

    Abstract: Large language models (LLMs) are leading significant progress in code generation. Beyond one-pass code generation, recent works further integrate unit tests and program verifiers into LLMs to iteratively refine the generated programs. However, these works consider the generated programs as an indivisible entity, which falls short for LLMs in debugging the programs, especially when the programs con…

    Submitted 6 June, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: Preprint

  15. arXiv:2402.16107  [pdf, other]

    cs.CL

    Knowledge Fusion of Chat LLMs: A Preliminary Technical Report

    Authors: Fanqi Wan, Ziyi Yang, Longguang Zhong, Xiaojun Quan, Xinting Huang, Wei Bi

    Abstract: Recently, FuseLLM introduced the concept of knowledge fusion to transfer the collective knowledge of multiple structurally varied LLMs into a target LLM through lightweight continual training. In this report, we extend the scalability and flexibility of the FuseLLM framework to realize the fusion of chat LLMs, resulting in FusionChat. FusionChat comprises two main stages. Firstly, we undertake kno…

    Submitted 28 May, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Technical Report, work in progress

  16. arXiv:2402.13007  [pdf, other]

    cs.LG cs.CV

    Improve Cross-Architecture Generalization on Dataset Distillation

    Authors: Binglin Zhou, Linhao Zhong, Wentao Chen

    Abstract: Dataset distillation, a pragmatic approach in machine learning, aims to create a smaller synthetic dataset from a larger existing dataset. However, existing distillation methods primarily adopt a model-based paradigm, where the synthetic dataset inherits model-specific biases, limiting its generalizability to alternative models. In response to this constraint, we propose a novel methodology termed…

    Submitted 20 February, 2024; originally announced February 2024.

  17. arXiv:2402.08202  [pdf, other]

    cs.LG

    Confronting Discrimination in Classification: Smote Based on Marginalized Minorities in the Kernel Space for Imbalanced Data

    Authors: Lingyun Zhong

    Abstract: Financial fraud detection poses a typical challenge characterized by class imbalance, where instances of fraud are extremely rare but can lead to unpredictable economic losses if misidentified. Precisely classifying these critical minority samples represents a challenging task within the classification. The primary difficulty arises from mainstream classifiers, which often exhibit "implicit discri…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 10 pages, 2 figures

  18. arXiv:2401.15159  [pdf, other]

    cs.RO

    RABBIT: A Robot-Assisted Bed Bathing System with Multimodal Perception and Integrated Compliance

    Authors: Rishabh Madan, Skyler Valdez, David Kim, Sujie Fang, Luoyan Zhong, Diego Virtue, Tapomayukh Bhattacharjee

    Abstract: This paper introduces RABBIT, a novel robot-assisted bed bathing system designed to address the growing need for assistive technologies in personal hygiene tasks. It combines multimodal perception and dual (software and hardware) compliance to perform safe and comfortable physical human-robot interaction. Using RGB and thermal imaging to segment dry, soapy, and wet skin regions accurately, RABBIT…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 10 pages, 8 figures, 19th Annual ACM/IEEE International Conference on Human Robot Interaction (HRI)

  19. arXiv:2401.02708  [pdf, other]

    cs.LG cs.AI stat.ML

    TripleSurv: Triplet Time-adaptive Coordinate Loss for Survival Analysis

    Authors: Liwen Zhang, Lianzhen Zhong, Fan Yang, Di Dong, Hui Hui, Jie Tian

    Abstract: A core challenge in survival analysis is to model the distribution of censored time-to-event data, where the event of interest may be a death, failure, or occurrence of a specific event. Previous studies have shown that ranking and maximum likelihood estimation (MLE) loss functions are widely used for survival analysis. However, ranking loss only focuses on the ranking of survival time and does not…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 9 pages, 6 figures

  20. arXiv:2312.14950  [pdf, other]

    cs.RO cs.AI cs.HC

    TypeFly: Flying Drones with Large Language Model

    Authors: Guojun Chen, Xiaojing Yu, Lin Zhong

    Abstract: Commanding a drone with a natural language is not only user-friendly but also opens the door for emerging language agents to control the drone. Emerging large language models (LLMs) provide a previously impossible opportunity to automatically translate a task description in a natural language to a program that can be executed by the drone. However, powerful LLMs and their vision counterparts are l…

    Submitted 8 December, 2023; originally announced December 2023.

  21. arXiv:2312.10115  [pdf, other]

    cs.CV

    SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery

    Authors: Xin Guo, Jiangwei Lao, Bo Dang, Yingying Zhang, Lei Yu, Lixiang Ru, Liheng Zhong, Ziyuan Huang, Kang Wu, Dingxiang Hu, Huimei He, Jian Wang, Jingdong Chen, Ming Yang, Yongjun Zhang, Yansheng Li

    Abstract: Prior studies on Remote Sensing Foundation Model (RSFM) reveal immense potential towards a generic model for Earth Observation. Nevertheless, these works primarily focus on a single modality without temporal and geo-context modeling, hampering their capabilities for diverse tasks. In this study, we present SkySense, a generic billion-scale model, pre-trained on a curated multi-modal Remote Sensing…

    Submitted 22 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR2024

  22. arXiv:2311.04934  [pdf, other]

    cs.CL cs.AI

    Prompt Cache: Modular Attention Reuse for Low-Latency Inference

    Authors: In Gim, Guojun Chen, Seung-seob Lee, Nikhil Sarda, Anurag Khandelwal, Lin Zhong

    Abstract: We present Prompt Cache, an approach for accelerating inference for large language models (LLM) by reusing attention states across different LLM prompts. Many input prompts have overlapping text segments, such as system messages, prompt templates, and documents provided for context. Our key insight is that by precomputing and storing the attention states of these frequently occurring text segments…

    Submitted 25 April, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: To appear at MLSys 2024

  23. arXiv:2310.16305  [pdf, other]

    cs.CV cs.LG

    Dolfin: Diffusion Layout Transformers without Autoencoder

    Authors: Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhizhou Sha, Zhuowen Tu

    Abstract: In this paper, we introduce a novel generative model, Diffusion Layout Transformers without Autoencoder (Dolfin), which significantly improves the modeling capability with reduced complexity compared to existing methods. Dolfin employs a Transformer-based diffusion process to model layout generation. In addition to an efficient bi-directional (non-causal joint) sequence representation, we further…

    Submitted 24 October, 2023; originally announced October 2023.

  24. arXiv:2310.08580  [pdf, other]

    cs.CV cs.GR

    OmniControl: Control Any Joint at Any Time for Human Motion Generation

    Authors: Yiming Xie, Varun Jampani, Lei Zhong, Deqing Sun, Huaizu Jiang

    Abstract: We present a novel approach named OmniControl for incorporating flexible spatial control signals into a text-conditioned human motion generation model based on the diffusion process. Unlike previous methods that can only control the pelvis trajectory, OmniControl can incorporate flexible spatial control signals over different joints at different times with only one model. Specifically, we propose…

    Submitted 14 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: ICLR 2024. Project page: https://neu-vi.github.io/omnicontrol/

  25. arXiv:2310.00597  [pdf, other]

    cs.CL

    A Task-oriented Dialog Model with Task-progressive and Policy-aware Pre-training

    Authors: Lucen Zhong, Hengtong Lu, Caixia Yuan, Xiaojie Wang, Jiashen Sun, Ke Zeng, Guanglu Wan

    Abstract: Pre-trained conversation models (PCMs) have achieved promising progress in recent years. However, existing PCMs for Task-oriented dialog (TOD) are insufficient for capturing the sequential nature of the TOD-related tasks, as well as for learning dialog policy information. To alleviate these problems, this paper proposes a task-progressive PCM with two policy-aware pre-training tasks. The model is…

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: Accepted at NLPCC 2023

  26. arXiv:2309.01645  [pdf]

    cs.CL

    Exploring the effectiveness of ChatGPT-based feedback compared with teacher feedback and self-feedback: Evidence from Chinese to English translation

    Authors: Siyi Cao, Linping Zhong

    Abstract: ChatGPT, a cutting-edge AI-powered chatbot, can quickly generate responses on given commands. While it was reported that ChatGPT had the capacity to deliver useful feedback, it is still unclear about its effectiveness compared with conventional feedback approaches, such as teacher feedback (TF) and self-feedback (SF). To address this issue, this study compared the revised Chinese to English translati…

    Submitted 4 September, 2023; originally announced September 2023.

  27. arXiv:2308.10574  [pdf, other]

    cs.CV

    CHORD: Category-level Hand-held Object Reconstruction via Shape Deformation

    Authors: Kailin Li, Lixin Yang, Haoyu Zhen, Zenan Lin, Xinyu Zhan, Licheng Zhong, Jian Xu, Kejian Wu, Cewu Lu

    Abstract: In daily life, humans utilize hands to manipulate objects. Modeling the shape of objects that are manipulated by the hand is essential for AI to comprehend daily tasks and to learn manipulation skills. However, previous approaches have encountered difficulties in reconstructing the precise shapes of hand-held objects, primarily owing to a deficiency in prior shape knowledge and inadequate data for…

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: To be presented at ICCV 2023, Paris

  28. arXiv:2308.10526  [pdf, other]

    cs.HC

    UbiPhysio: Support Daily Functioning, Fitness, and Rehabilitation with Action Understanding and Feedback in Natural Language

    Authors: Chongyang Wang, Yuan Feng, Lingxiao Zhong, Siyi Zhu, Chi Zhang, Siqi Zheng, Chen Liang, Yuntao Wang, Chengqi He, Chun Yu, Yuanchun Shi

    Abstract: We introduce UbiPhysio, a milestone framework that delivers fine-grained action description and feedback in natural language to support people's daily functioning, fitness, and rehabilitation activities. This expert-like capability assists users in properly executing actions and maintaining engagement in remote fitness and rehabilitation programs. Specifically, the proposed UbiPhysio framework com…

    Submitted 17 January, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted by IMWUT/Ubicomp'24

  29. arXiv:2308.10335  [pdf, other]

    cs.CL cs.AI cs.SE

    Can ChatGPT replace StackOverflow? A Study on Robustness and Reliability of Large Language Model Code Generation

    Authors: Li Zhong, Zilong Wang

    Abstract: Recently, the large language models (LLMs) have shown extraordinary ability in understanding natural language and generating programming code. It has been a common practice of software engineers to consult LLMs when encountering coding questions. Although efforts have been made to avoid syntax errors and align the code with the intended semantics, the reliability and robustness of the code generat…

    Submitted 27 January, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

  30. arXiv:2308.06962  [pdf, other]

    cs.CV

    Color-NeuS: Reconstructing Neural Implicit Surfaces with Color

    Authors: Licheng Zhong, Lixin Yang, Kailin Li, Haoyu Zhen, Mei Han, Cewu Lu

    Abstract: The reconstruction of object surfaces from multi-view images or monocular video is a fundamental issue in computer vision. However, much of the recent research concentrates on reconstructing geometry through implicit or explicit methods. In this paper, we shift our focus towards reconstructing mesh in conjunction with color. We remove the view-dependent color from neural volume rendering while ret…

    Submitted 19 December, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  31. arXiv:2306.16324  [pdf, other]

    eess.IV cs.CV

    DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy

    Authors: Yiwen Zhang, Chuanpu Li, Liming Zhong, Zeli Chen, Wei Yang, Xuetao Wang

    Abstract: Treatment planning, which is a critical component of the radiotherapy workflow, is typically carried out by a medical physicist in a time-consuming trial-and-error manner. Previous studies have proposed knowledge-based or deep-learning-based methods for predicting dose distribution maps to assist medical physicists in improving the efficiency of treatment planning. However, these dose prediction m…

    Submitted 28 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  32. arXiv:2306.08850  [pdf, other]

    cs.SD eess.AS

    Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music

    Authors: Lifan Zhong, Erica Cooper, Junichi Yamagishi, Nobuaki Minematsu

    Abstract: With the growing amount of musical data available, automatic instrument recognition, one of the essential problems in Music Information Retrieval (MIR), is drawing more and more attention. While automatic recognition of single instruments has been well-studied, it remains challenging for polyphonic, multi-instrument musical recordings. This work presents our efforts toward building a robust end-to…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Submitted to APSIPA 2023

  33. arXiv:2305.18830  [pdf, other]

    cs.CV

    Semi-supervised Pathological Image Segmentation via Cross Distillation of Multiple Attentions

    Authors: Lanfeng Zhong, Xin Liao, Shaoting Zhang, Guotai Wang

    Abstract: Segmentation of pathological images is a crucial step for accurate cancer diagnosis. However, acquiring dense annotations of such images for training is labor-intensive and time-consuming. To address this issue, Semi-Supervised Learning (SSL) has the potential for reducing the annotation cost, but it is challenged by a large number of unlabeled training images. In this paper, we propose a novel SS…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Provisionally accepted by MICCAI 2023

  34. arXiv:2305.08307  [pdf, other]

    quant-ph cs.DC cs.DS

    Fusion Blossom: Fast MWPM Decoders for QEC

    Authors: Yue Wu, Lin Zhong

    Abstract: The Minimum-Weight Perfect Matching (MWPM) decoder is widely used in Quantum Error Correction (QEC) decoding. Despite its high accuracy, existing implementations of the MWPM decoder cannot catch up with quantum hardware, e.g., 1 million measurements per second for superconducting qubits. They suffer from a backlog of measurements that grows exponentially and as a result, cannot realize the power o…

    Submitted 14 May, 2023; originally announced May 2023.

  35. arXiv:2304.12853  [pdf, other]

    cs.NI cs.CR cs.LG

    Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach

    Authors: Jamila Alsayed Kassem, Li Zhong, Arie Taal, Paola Grosso

    Abstract: Digital Twin (DT) is a prominent technology to utilise and deploy within the healthcare sector. Yet, the main challenges facing such applications are: Strict health data-sharing policies, high-performance network requirements, and possible infrastructure resource limitations. In this paper, we address all the challenges by provisioning adaptive Virtual Network Functions (VNFs) to enforce security…

    Submitted 25 April, 2023; originally announced April 2023.

  36. arXiv:2304.10600  [pdf, other]

    cs.CR cs.SE

    A Survey of Prevent and Detect Access Control Vulnerabilities

    Authors: Li Zhong

    Abstract: Broken access control is one of the most common security vulnerabilities in web applications. These vulnerabilities are the major cause of many data breach incidents, which result in privacy concern and revenue loss. However, preventing and detecting access control vulnerabilities proactively in web applications could be difficult. Currently, these vulnerabilities are actively detected by bug boun…

    Submitted 20 April, 2023; originally announced April 2023.

  37. arXiv:2304.06336  [pdf, other]

    cs.LG cs.AI

    Attributed Multi-order Graph Convolutional Network for Heterogeneous Graphs

    Authors: Zhaoliang Chen, Zhihao Wu, Luying Zhong, Claudia Plant, Shiping Wang, Wenzhong Guo

    Abstract: Heterogeneous graph neural networks aim to discover discriminative node embeddings and relations from multi-relational networks. One challenge of heterogeneous graph learning is the design of learnable meta-paths, which significantly influences the quality of learned embeddings. Thus, in this paper, we propose an Attributed Multi-Order Graph Convolutional Network (AMOGCN), which automatically studie…

    Submitted 18 April, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

  38. arXiv:2304.04038  [pdf, other]

    cs.CV

    POEM: Reconstructing Hand in a Point Embedded Multi-view Stereo

    Authors: Lixin Yang, Jian Xu, Licheng Zhong, Xinyu Zhan, Zhicheng Wang, Kejian Wu, Cewu Lu

    Abstract: Enabling neural networks to capture 3D geometrical-aware features is essential in multi-view based vision tasks. Previous methods usually encode the 3D information of multi-view stereo into the 2D features. In contrast, we present a novel method, named POEM, that directly operates on the 3D POints Embedded in the Multi-view stereo for reconstructing hand mesh in it. Point is a natural form of 3D inf…

    Submitted 24 May, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023. (v2 fix typos)

  39. arXiv:2303.06018  [pdf, other]

    cs.SE cs.AI cs.LG cs.PL

    Hierarchical Neural Program Synthesis

    Authors: Linghan Zhong, Ryan Lindeborg, Jesse Zhang, Joseph J. Lim, Shao-Hua Sun

    Abstract: Program synthesis aims to automatically construct human-readable programs that satisfy given task specifications, such as input/output pairs or demonstrations. Recent works have demonstrated encouraging results in a variety of domains, such as string transformation, tensor manipulation, and describing behaviors of embodied agents. Most existing program synthesis methods are designed to synthesize…

    Submitted 9 March, 2023; originally announced March 2023.

  40. arXiv:2302.08682  [pdf, other]

    cs.CV

    Random Padding Data Augmentation

    Authors: Nan Yang, Laicheng Zhong, Fan Huang, Dong Yuan, Wei Bao

    Abstract: The convolutional neural network (CNN) learns the same object in different positions in images, which can improve the recognition accuracy of the model. An implication of this is that CNN may know where the object is. The usefulness of the features' spatial information in CNNs has not been well investigated. In this paper, we found that the model's learning of features' position information hinder…

    Submitted 16 February, 2023; originally announced February 2023.

  41. arXiv:2302.05019  [pdf, other]

    cs.IR

    A Comprehensive Survey on Automatic Knowledge Graph Construction

    Authors: Lingfeng Zhong, Jia Wu, Qian Li, Hao Peng, Xindong Wu

    Abstract: Automatic knowledge graph construction aims to manufacture structured human knowledge. To this end, much effort has historically been spent extracting informative fact patterns from different data sources. However, more recently, research interest has shifted to acquiring conceptualized structured knowledge beyond informative data. In addition, researchers have also been exploring new ways of hand…

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: This paper contains 50 pages and 22 figures. This paper is submitted to ACM Computing Surveys

  42. arXiv:2301.08419  [pdf, other]

    quant-ph cs.AR

    Scalable Quantum Error Correction for Surface Codes using FPGA

    Authors: Namitha Liyanage, Yue Wu, Alexander Deters, Lin Zhong

    Abstract: A fault-tolerant quantum computer must decode and correct errors faster than they appear. The faster errors can be corrected, the more time the computer can do useful work. The Union-Find (UF) decoder is promising with an average time complexity slightly higher than $O(d^3)$. We report a distributed version of the UF decoder that exploits parallel computing resources for further speedup. Using an…

    Submitted 15 May, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

  43. arXiv:2301.02576  [pdf, other]

    cs.DC cs.NI

    GCS: Generalized Cache Coherence For Efficient Synchronization

    Authors: Yanpeng Yu, Seung-seob Lee, Anurag Khandelwal, Lin Zhong

    Abstract: We explore the design of scalable synchronization primitives for disaggregated shared memory. Porting existing synchronization primitives to disaggregated shared memory results in poor scalability with the number of application threads because they layer synchronization primitives atop cache-coherence substrates, which engenders redundant inter-core communications. Substantially higher cache-coher…

    Submitted 3 May, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 14 pages, 11 figures

  44. arXiv:2212.12671  [pdf, other]

    cs.OS cs.CR

    MProtect: Operating System Memory Management without Access

    Authors: Caihua Li, Seung-seob Lee, Min Hong Yun, Lin Zhong

    Abstract: Modern operating systems (OSes) have unfettered access to application data, assuming that applications trust them. This assumption, however, is problematic under many scenarios where either the OS provider is not trustworthy or the OS can be compromised due to its large attack surface. Our investigation began with the hypothesis that unfettered access to memory is not fundamentally necessary for t…

    Submitted 24 December, 2022; originally announced December 2022.

  45. Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims

    Authors: Qiang Sheng, Juan Cao, Xueyao Zhang, Xirong Li, Lei Zhong

    Abstract: False claims that have been previously fact-checked can still spread on social media. To mitigate their continual spread, detecting previously fact-checked claims is indispensable. Given a claim, existing works focus on providing evidence for detection by reranking candidate fact-checking articles (FC-articles) retrieved by BM25. However, these performances may be limited because they ignore the f…

    Submitted 19 December, 2021; originally announced December 2021.

    Comments: ACL-IJCNLP 2021 Main Conference Long Paper

  46. arXiv:2110.11592  [pdf, other]

    cs.CV cs.IR

    Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering

    Authors: Zhongwei Xie, Ling Liu, Yanzhao Wu, Luo Zhong, Lin Li

    Abstract: This paper introduces a two-phase deep feature engineering framework for efficient learning of semantics enhanced joint embedding, which clearly separates the deep feature engineering in data preprocessing from training the text-image joint embedding model. We use the Recipe1M dataset for the technical description and empirical validation. In preprocessing, we perform deep feature engineering by c…

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: accepted by ACM Transactions on Information Systems (TOIS). arXiv admin note: text overlap with arXiv:2108.00705, arXiv:2108.03788

  47. arXiv:2110.10275  [pdf]

    cs.CV cs.LG

    Early- and in-season crop type mapping without current-year ground truth: generating labels from historical information via a topology-based approach

    Authors: Chenxi Lin, Liheng Zhong, Xiao-Peng Song, Jinwei Dong, David B. Lobell, Zhenong Jin

    Abstract: Land cover classification in remote sensing is often faced with the challenge of limited ground truth. Incorporating historical information has the potential to significantly lower the expensive cost associated with collecting ground truth and, more importantly, enable early- and in-season mapping that is helpful to many pre-harvest decisions. In this study, we propose a new approach that can effe…

    Submitted 19 October, 2021; originally announced October 2021.

  48. arXiv:2110.08578  [pdf, other]

    cs.CV cs.AI

    Visual-aware Attention Dual-stream Decoder for Video Captioning

    Authors: Zhixin Sun, Xian Zhong, Shuqin Chen, Lin Li, Luo Zhong

    Abstract: Video captioning is a challenging task that captures different visual parts and describes them in sentences, for it requires visual and linguistic coherence. The attention mechanism in the current video captioning method learns to assign weight to each frame, promoting the decoder dynamically. This may not explicitly model the correlation and the temporal coherence of the visual features extracted…

    Submitted 16 October, 2021; originally announced October 2021.

  49. Integrating Pattern- and Fact-based Fake News Detection via Model Preference Learning

    Authors: Qiang Sheng, Xueyao Zhang, Juan Cao, Lei Zhong

    Abstract: To defend against fake news, researchers have developed various methods based on texts. These methods can be grouped as 1) pattern-based methods, which focus on shared patterns among fake news posts rather than the claim itself; and 2) fact-based methods, which retrieve from external sources to verify the claim's veracity without considering patterns. The two groups of methods, which have differen…

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: ACM CIKM 2021 Full Paper

  50. Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images

    Authors: Zhongwei Xie, Ling Liu, Lin Li, Luo Zhong

    Abstract: This paper presents a three-tier modality alignment approach to learning text-image joint embedding, coined as JEMA, for cross-modal retrieval of cooking recipes and food images. The first tier improves recipe text embedding by optimizing the LSTM networks with term extraction and ranking enhanced sequence patterns, and optimizes the image embedding by combining the ResNeXt-101 image encoder with…

    Submitted 18 August, 2021; v1 submitted 8 August, 2021; originally announced August 2021.

    Comments: accepted by CIKM 2021. arXiv admin note: substantial text overlap with arXiv:2108.00705