Skip to main content

Showing 51–100 of 379 results for author: Cui, H

.
  1. arXiv:2311.05122  [pdf, ps, other

    cs.CV

    ScribblePolyp: Scribble-Supervised Polyp Segmentation through Dual Consistency Alignment

    Authors: Zixun Zhang, Yuncheng Jiang, Jun Wei, Hannah Cui, Zhen Li

    Abstract: Automatic polyp segmentation models play a pivotal role in the clinical diagnosis of gastrointestinal diseases. In previous studies, most methods relied on fully supervised approaches, necessitating pixel-level annotations for model training. However, the creation of pixel-level annotations is both expensive and time-consuming, impeding the development of model generalization. In response to this… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted by BIBM 2023

  2. arXiv:2311.02646  [pdf

    eess.IV physics.optics

    Flexible uniform-sampling foveated Fourier single-pixel imaging

    Authors: Huan Cui, Jie Cao, Qun Hao, Haoyu Zhang, Chang Zhou

    Abstract: Fourier single-pixel imaging (FSI) is a data-efficient single-pixel imaging (SPI). However, there is still a serious challenge to obtain higher imaging quality using fewer measurements, which limits the development of real-time SPI. In this work, a uniform-sampling foveated FSI (UFFSI) is proposed with three features, uniform sampling, effective sampling and flexible fovea, to achieve under-sampli… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 7 pages,5 figures

  3. arXiv:2311.00287  [pdf, other

    cs.CL cs.AI cs.LG q-bio.QM

    Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models

    Authors: Ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, Wei **, Joyce Ho, Carl Yang

    Abstract: Clinical natural language processing requires methods that can address domain-specific challenges, such as complex medical terminology and clinical contexts. Recently, large language models (LLMs) have shown promise in this domain. Yet, their direct deployment can lead to privacy issues and are constrained by resources. To address this challenge, we delve into synthetic clinical text generation us… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  4. CHAIN: Exploring Global-Local Spatio-Temporal Information for Improved Self-Supervised Video Hashing

    Authors: Rukai Wei, Yu Liu, **gkuan Song, Heng Cui, Yanzhao Xie, Ke Zhou

    Abstract: Compressing videos into binary codes can improve retrieval speed and reduce storage overhead. However, learning accurate hash codes for video retrieval can be challenging due to high local redundancy and complex global dependencies between video frames, especially in the absence of labels. Existing self-supervised video hashing methods have been effective in designing expressive temporal encoders,… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: 12 pages, 8 figures, accepted by ACM MM 2023

  5. arXiv:2310.18804  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting

    Authors: Hejie Cui, Xinyu Fang, Zihan Zhang, Ran Xu, Xuan Kan, Xin Liu, Yue Yu, Manling Li, Yangqiu Song, Carl Yang

    Abstract: Images contain rich relational knowledge that can help machines understand the world. Existing methods on visual knowledge extraction often rely on the pre-defined format (e.g., sub-verb-obj tuples) or vocabulary (e.g., relation types), restricting the expressiveness of the extracted knowledge. In this work, we take a first exploration to a new paradigm of open visual knowledge extraction. To achi… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  6. arXiv:2310.17082  [pdf, ps, other

    astro-ph.HE

    Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE;… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 3 figures, Accepted by the APJL

  7. arXiv:2310.14626  [pdf, other

    cs.CL cs.IR

    Conversational Recommender System and Large Language Model Are Made for Each Other in E-commerce Pre-sales Dialogue

    Authors: Yuanxing Liu, Wei-Nan Zhang, Yifan Chen, Yuchi Zhang, Haopeng Bai, Fan Feng, Hengbin Cui, Yongbin Li, Wanxiang Che

    Abstract: E-commerce pre-sales dialogue aims to understand and elicit user needs and preferences for the items they are seeking so as to provide appropriate recommendations. Conversational recommender systems (CRSs) learn user representation and provide accurate recommendations based on dialogue context, but rely on external knowledge. Large language models (LLMs) generate responses that mimic pre-sales dia… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  8. Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

    Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the t… ▽ More

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 49pages, 11figures

    Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

  9. arXiv:2310.07801  [pdf, other

    cs.CV cs.AI stat.ME

    Trajectory-aware Principal Manifold Framework for Data Augmentation and Image Generation

    Authors: Elvis Han Cui, Bingbin Li, Yanan Li, Weng Kee Wong, Donghui Wang

    Abstract: Data augmentation for deep learning benefits model training, image transformation, medical imaging analysis and many other fields. Many existing methods generate new samples from a parametric distribution, like the Gaussian, with little attention to generate samples along the data manifold in either the input or feature space. In this paper, we verify that there are theoretical and practical advan… ▽ More

    Submitted 30 July, 2023; originally announced October 2023.

    Comments: 20 figures

  10. arXiv:2310.07268  [pdf, other

    cs.LG

    RaftFed: A Lightweight Federated Learning Framework for Vehicular Crowd Intelligence

    Authors: Changan Yang, Yaxing Chen, Yao Zhang, Helei Cui, Zhiwen Yu, Bin Guo, Zheng Yan, Zijiang Yang

    Abstract: Vehicular crowd intelligence (VCI) is an emerging research field. Facilitated by state-of-the-art vehicular ad-hoc networks and artificial intelligence, various VCI applications come to place, e.g., collaborative sensing, positioning, and map**. The collaborative property of VCI applications generally requires data to be shared among participants, thus forming network-wide intelligence. How to f… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 8 pages,8 figures

  11. arXiv:2310.03575  [pdf, other

    stat.ML cs.LG

    Analysis of learning a flow-based generative model from limited sample complexity

    Authors: Hugo Cui, Florent Krzakala, Eric Vanden-Eijnden, Lenka Zdeborová

    Abstract: We study the problem of training a flow-based generative model, parametrized by a two-layer autoencoder, to sample from a high-dimensional Gaussian mixture. We provide a sharp end-to-end analysis of the problem. First, we provide a tight closed-form characterization of the learnt velocity field, when parametrized by a shallow denoising auto-encoder trained on a finite number $n$ of samples from th… ▽ More

    Submitted 25 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  12. arXiv:2309.16924  [pdf, other

    cs.CV

    Incremental Rotation Averaging Revisited and More: A New Rotation Averaging Benchmark

    Authors: Xiang Gao, Hainan Cui, Shuhan Shen

    Abstract: In order to further advance the accuracy and robustness of the incremental parameter estimation-based rotation averaging methods, in this paper, a new member of the Incremental Rotation Averaging (IRA) family is introduced, which is termed as IRAv4. As the most significant feature of the IRAv4, a task-specific connected dominating set is extracted to serve as a more reliable and accurate reference… ▽ More

    Submitted 4 January, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE Transactions

  13. arXiv:2309.14345  [pdf, other

    cs.SE cs.AI

    Bias Testing and Mitigation in LLM-based Code Generation

    Authors: Dong Huang, Qingwen Bu, Jie Zhang, Xiaofei Xie, Junjie Chen, Heming Cui

    Abstract: Utilizing state-of-the-art Large Language Models (LLMs), automatic code generation models play a pivotal role in enhancing the productivity of software development procedures. As the adoption of LLMs becomes more widespread in software coding ecosystems, a pressing issue has emerged: does the generated code contain social bias and unfairness, such as those related to age, gender, and race? This is… ▽ More

    Submitted 24 May, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: Title changed

  14. arXiv:2309.13425  [pdf, other

    cs.LG

    MiliPoint: A Point Cloud Dataset for mmWave Radar

    Authors: Han Cui, Shu Zhong, Jiacheng Wu, Zichao Shen, Naim Dahnoun, Yiren Zhao

    Abstract: Millimetre-wave (mmWave) radar has emerged as an attractive and cost-effective alternative for human activity sensing compared to traditional camera-based systems. mmWave radars are also non-intrusive, providing better protection for user privacy. However, as a Radio Frequency (RF) based technology, mmWave radars rely on capturing reflected signals from objects, making them more prone to noise com… ▽ More

    Submitted 2 November, 2023; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted at NeurIPS 2023 Datasets & Benchmarks

  15. Fingerprints for anisotropic Kondo lattice behavior in the quasiparticle dynamics of the kagome metal Ni$_3$In

    Authors: Dong-Hyeon Gim, Dirk Wulferding, Chulwan Lee, Hengbo Cui, Kiwan Nam, Myung Joon Han, Kee Hoon Kim

    Abstract: We present a temperature- and polarization-resolved phononic and electronic Raman scattering study in combination with the first-principles calculations on the kagome metal Ni$_3$In with anisotropic transport properties and non-Fermi liquid behavior. At temperatures below 50 K and down to 2 K, several Raman phonon modes, including particularly an interlayer shear mode, exhibit appreciable frequenc… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 18 pages, 6 figures; published in Phys. Rev. B

    Journal ref: Phys. Rev. B 108, 115143 (2023)

  16. arXiv:2309.08083  [pdf

    physics.app-ph

    On the Acoustoelasticity of Backward Lamb Wave in Prestressed Plate

    Authors: Zhongtao Hu, Guo-Yang Li, Hanyin Cui

    Abstract: Backward Lamb waves, which exhibit a group velocity that propagates in the opposite direction to their phase velocity, have recently garnered considerable attention for their potential applications in nondestructive testing. Herein we present a theoretical study on backward Lamb waves in the elastic plate subject to prestresses. We demonstrate that the group velocity of the first antisymmetric bac… ▽ More

    Submitted 25 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

  17. arXiv:2309.06799  [pdf, other

    cs.AI physics.geo-ph

    When Geoscience Meets Foundation Models: Towards General Geoscience Artificial Intelligence System

    Authors: Hao Zhang, **-Jian Xu, Hong-Wei Cui, Lin Li, Yaowen Yang, Chao-Sheng Tang, Niklas Boers

    Abstract: Geoscience foundation models (GFMs) represent a revolutionary approach within Earth sciences to integrate massive cross-disciplinary data for improved simulation and understanding of Earth system dynamics. As a data-centric artificial intelligence paradigm, GFMs extract valuable insights from petabytes of both structured and unstructured data. Their versatility in task specification, diverse input… ▽ More

    Submitted 14 March, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: the manuscript is under re-writing

  18. arXiv:2309.03750  [pdf, other

    cs.CV

    PBP: Path-based Trajectory Prediction for Autonomous Driving

    Authors: Sepideh Afshar, Nachiket Deo, Akshay Bhagat, Titas Chakraborty, Yunming Shao, Balarama Raju Buddharaju, Adwait Deshpande, Henggang Cui

    Abstract: Trajectory prediction plays a crucial role in the autonomous driving stack by enabling autonomous vehicles to anticipate the motion of surrounding agents. Goal-based prediction models have gained traction in recent years for addressing the multimodal nature of future trajectories. Goal-based prediction models simplify multimodal prediction by first predicting 2D goal locations of agents and then p… ▽ More

    Submitted 2 March, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Published at ICRA 2024; Sepideh Afshar and Nachiket Deo contributed equally

  19. MLN-net: A multi-source medical image segmentation method for clustered microcalcifications using multiple layer normalization

    Authors: Ke Wang, Zanting Ye, Xiang Xie, Haidong Cui, Tao Chen, Banteng Liu

    Abstract: Accurate segmentation of clustered microcalcifications in mammography is crucial for the diagnosis and treatment of breast cancer. Despite exhibiting expert-level accuracy, recent deep learning advancements in medical image segmentation provide insufficient contribution to practical applications, due to the domain shift resulting from differences in patient postures, individual gland density, and… ▽ More

    Submitted 3 January, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 17 pages, 9 figures, 3 tables

    Journal ref: Knowledge-Based Systems, 2024, 283: 111127

  20. arXiv:2309.01941  [pdf, other

    q-bio.NC cs.AI cs.LG

    Dynamic Brain Transformer with Multi-level Attention for Functional Brain Network Analysis

    Authors: Xuan Kan, Antonio Aodong Chen Gu, Hejie Cui, Ying Guo, Carl Yang

    Abstract: Recent neuroimaging studies have highlighted the importance of network-centric brain analysis, particularly with functional magnetic resonance imaging. The emergence of Deep Neural Networks has fostered a substantial interest in predicting clinical outcomes and categorizing individuals based on brain networks. However, the conventional approach involving static brain network analysis offers limite… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE BHI 2023

    MSC Class: 68T07; 68T05 ACM Class: I.2.6; J.3

  21. arXiv:2308.10875   

    cs.NE cs.AI cs.LG

    Metaheuristic Algorithms in Artificial Intelligence with Applications to Bioinformatics, Biostatistics, Ecology and, the Manufacturing Industries

    Authors: Elvis Han Cui, Zizhao Zhang, Culsome Junwen Chen, Weng Kee Wong

    Abstract: Nature-inspired metaheuristic algorithms are important components of artificial intelligence, and are increasingly used across disciplines to tackle various types of challenging optimization problems. We apply a newly proposed nature-inspired metaheuristic algorithm called competitive swarm optimizer with mutated agents (CSO-MA) and demonstrate its flexibility and out-performance relative to its c… ▽ More

    Submitted 16 October, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Revision, unpublished manuscript

  22. arXiv:2308.08784  [pdf, other

    cs.SE cs.AI

    CodeCoT: Tackling Code Syntax Errors in CoT Reasoning for Code Generation

    Authors: Dong Huang, Qingwen Bu, Yuhao Qing, Heming Cui

    Abstract: Chain-of-thought (CoT) has emerged as a groundbreaking tool in NLP, notably for its efficacy in complex reasoning tasks, such as mathematical proofs. However, its application in code generation faces a distinct challenge, i.e., although the code generated with CoT reasoning is logically correct, it faces the problem of syntax error (e.g., invalid syntax error report) during code execution, which c… ▽ More

    Submitted 22 February, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Title changed

  23. arXiv:2308.07304  [pdf, other

    cs.HC cs.CR

    BehaVR: User Identification Based on VR Sensor Data

    Authors: Ismat Jarin, Yu Duan, Rahmadi Trimananda, Hao Cui, Salma Elmalaki, Athina Markopoulou

    Abstract: Virtual reality (VR) platforms enable a wide range of applications, however pose unique privacy risks. In particular, VR devices are equipped with a rich set of sensors that collect personal and sensitive information (e.g., body motion, eye gaze, hand joints, and facial expression), which can be used to uniquely identify a user, even without explicit identifiers. In this paper, we are interested i… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  24. Effective Hamiltonian approach to the quantum phase transitions in the extended Jaynes-Cummings model

    Authors: H. T. Cui, Y. A. Yan, M. Qin, X. X. Yi

    Abstract: The study of phase transitions in dissipative quantum systems based on the Liouvillian is often hindered by the difficulty of constructing a time-local master equation when the system-environment coupling is strong. To address this issue, the complex discretization approximation for the environment is proposed to study the quantum phase transition in the extended Jaynes-Cumming model with an infin… ▽ More

    Submitted 6 April, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: 12pages, published version

    Journal ref: Phys. Rev. A 109.042202(2024)

  25. arXiv:2307.12121  [pdf, other

    cs.DC

    Online Container Scheduling for Low-Latency IoT Services in Edge Cluster Upgrade: A Reinforcement Learning Approach

    Authors: Hanshuai Cui, Zhiqing Tang, Jiong Lou, Weijia Jia

    Abstract: In Mobile Edge Computing (MEC), Internet of Things (IoT) devices offload computationally-intensive tasks to edge nodes, where they are executed within containers, reducing the reliance on centralized cloud infrastructure. Frequent upgrades are essential to maintain the efficient and secure operation of edge clusters. However, traditional cloud cluster upgrade strategies are ill-suited for edge clu… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  26. arXiv:2307.11563  [pdf, other

    cs.SE cs.AI

    Feature Map Testing for Deep Neural Networks

    Authors: Dong Huang, Qingwen Bu, Yahao Qing, Yichao Fu, Heming Cui

    Abstract: Due to the widespread application of deep neural networks~(DNNs) in safety-critical tasks, deep learning testing has drawn increasing attention. During the testing process, test cases that have been fuzzed or selected using test metrics are fed into the model to find fault-inducing test units (e.g., neurons and feature maps, activating which will almost certainly result in a model error) and repor… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 12 pages, 5 figures. arXiv admin note: text overlap with arXiv:2307.11011

  27. arXiv:2307.11011  [pdf, other

    cs.LG cs.SE

    Neuron Sensitivity Guided Test Case Selection for Deep Learning Testing

    Authors: Dong Huang, Qingwen Bu, Yichao Fu, Yuhao Qing, Bocheng Xiao, Heming Cui

    Abstract: Deep Neural Networks~(DNNs) have been widely deployed in software to address various tasks~(e.g., autonomous driving, medical diagnosis). However, they could also produce incorrect behaviors that result in financial losses and even threaten human safety. To reveal the incorrect behaviors in DNN and repair them, DNN developers often collect rich unlabeled datasets from the natural world and label t… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  28. arXiv:2307.09763  [pdf, other

    cs.CV cs.AI

    Towards Building More Robust Models with Frequency Bias

    Authors: Qingwen Bu, Dong Huang, Heming Cui

    Abstract: The vulnerability of deep neural networks to adversarial samples has been a major impediment to their broad applications, despite their success in various fields. Recently, some works suggested that adversarially-trained models emphasize the importance of low-frequency information to achieve higher robustness. While several attempts have been made to leverage this frequency characteristic, they ha… ▽ More

    Submitted 27 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV23

  29. arXiv:2306.16002  [pdf, other

    physics.med-ph

    Super resolution dual-layer CBCT imaging with model-guided deep learning

    Authors: Jiongtao Zhu, Ting Su, Xin Zhang, Han Cui, Yuhang Tan, Hairong Zheng, Dong Liang, **chuan Guo, Yongshuai Ge

    Abstract: Objective: This study aims at investigating a novel super resolution CBCT imaging technique with the dual-layer flat panel detector (DL-FPD). Approach: In DL-FPD based CBCT imaging, the low-energy and high-energy projections acquired from the top and bottom detector layers contain intrinsically mismatched spatial information, from which super resolution CBCT images can be generated. To explain, a… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  30. arXiv:2306.14089  [pdf, other

    hep-ex

    Jet charge identification in ee-Z-qq process at Z pole operation

    Authors: Hanhua Cui, Mingrui Zhao, Yuexin Wang, Hao Liang, Manqi Ruan

    Abstract: Accurate jet charge identification is essential for precise electroweak and flavor measurements at the high-energy frontier. We propose a novel method called the Leading Particle Jet Charge method (LPJC) to determine the jet charge based on information about the leading charged particle. Tested on Z - bb and Z - cc samples at a center-of-mass energy of 91.2GeV, the LPJC achieves an effective taggi… ▽ More

    Submitted 18 March, 2024; v1 submitted 24 June, 2023; originally announced June 2023.

  31. arXiv:2306.10474  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    A Universal Semantic-Geometric Representation for Robotic Manipulation

    Authors: Tong Zhang, Yingdong Hu, Hanchen Cui, Hang Zhao, Yang Gao

    Abstract: Robots rely heavily on sensors, especially RGB and depth cameras, to perceive and interact with the world. RGB cameras record 2D images with rich semantic information while missing precise spatial information. On the other side, depth cameras offer critical 3D geometry data but capture limited semantics. Therefore, integrating both modalities is crucial for learning representations for robotic per… ▽ More

    Submitted 13 October, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: CoRL 2023. Project website: https://semantic-geometric-representation.github.io

  32. arXiv:2306.08807  [pdf, other

    cs.RO

    Sim-on-Wheels: Physical World in the Loop Simulation for Self-Driving

    Authors: Yuan Shen, Bhargav Chandaka, Zhi-hao Lin, Albert Zhai, Hang Cui, David Forsyth, Shenlong Wang

    Abstract: We present Sim-on-Wheels, a safe, realistic, and vehicle-in-loop framework to test autonomous vehicles' performance in the real world under safety-critical scenarios. Sim-on-wheels runs on a self-driving vehicle operating in the physical world. It creates virtual traffic participants with risky behaviors and seamlessly inserts the virtual events into images perceived from the physical world in rea… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  33. arXiv:2306.04802  [pdf, other

    cs.AI cs.CL cs.LG cs.SI

    A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises

    Authors: Hejie Cui, Jiaying Lu, Shiyu Wang, Ran Xu, Wen**g Ma, Shaojun Yu, Yue Yu, Xuan Kan, Chen Ling, Tianfan Fu, Liang Zhao, Joyce Ho, Fei Wang, Carl Yang

    Abstract: Healthcare knowledge graphs (HKGs) are valuable tools for organizing biomedical concepts and their relationships with interpretable structures. The recent advent of large language models (LLMs) has paved the way for building more comprehensive and accurate HKGs. This, in turn, can improve the reliability of generated content and enable better evaluation of LLMs. However, the challenges of HKGs suc… ▽ More

    Submitted 19 February, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  34. arXiv:2306.02532  [pdf, other

    cs.LG cs.AI q-bio.QM

    R-Mixup: Riemannian Mixup for Biological Networks

    Authors: Xuan Kan, Zimu Li, Hejie Cui, Yue Yu, Ran Xu, Shaojun Yu, Zilong Zhang, Ying Guo, Carl Yang

    Abstract: Biological networks are commonly used in biomedical and healthcare domains to effectively model the structure of complex biological systems with interactions linking biological entities. However, due to their characteristics of high dimensionality and low sample size, directly applying deep learning models on biological networks usually faces severe overfitting. In this work, we propose R-MIXUP, a… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted to KDD 2023

    MSC Class: 68T07; 68T05 ACM Class: I.2.6; J.3

  35. arXiv:2306.01016  [pdf, other

    cs.CL cs.AI cs.CV cs.LG cs.MM

    PV2TEA: Patching Visual Modality to Textual-Established Information Extraction

    Authors: Hejie Cui, Rongmei Lin, Nasser Zalmout, Chenwei Zhang, **gbo Shang, Carl Yang, Xian Li

    Abstract: Information extraction, e.g., attribute value extraction, has been extensively studied and formulated based only on text. However, many attributes can benefit from image-based extraction, like color, shape, pattern, among others. The visual modality has long been underutilized, mainly due to multimodal annotation difficulty. In this paper, we aim to patch the visual modality to the textual-establi… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Findings

  36. arXiv:2306.00652  [pdf, other

    cs.CL cs.AI

    Explanation Graph Generation via Generative Pre-training over Synthetic Graphs

    Authors: Han Cui, Shangzhan Li, Yu Zhang, Qi Shi

    Abstract: The generation of explanation graphs is a significant task that aims to produce explanation graphs in response to user input, revealing the internal reasoning process. This task is challenging due to the significant discrepancy between unstructured user queries and structured explanation graphs. Current research commonly fine-tunes a text-based pre-trained language model on a small downstream data… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL23-Findings

  37. arXiv:2305.19949  [pdf, other

    cs.CV

    Treasure in Distribution: A Domain Randomization based Multi-Source Domain Generalization for 2D Medical Image Segmentation

    Authors: Ziyang Chen, Yongsheng Pan, Yiwen Ye, Hengfei Cui, Yong Xia

    Abstract: Although recent years have witnessed the great success of convolutional neural networks (CNNs) in medical image segmentation, the domain shift issue caused by the highly variable image quality of medical images hinders the deployment of CNNs in real-world clinical applications. Domain generalization (DG) methods aim to address this issue by training a robust model on the source domain, which has a… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 12 pages, 4 figures, 8 tables, early accepted by MICCAI 2023

  38. arXiv:2305.18703  [pdf, other

    cs.CL cs.AI

    Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

    Authors: Chen Ling, Xujiang Zhao, Jiaying Lu, Chengyuan Deng, Can Zheng, Junxiang Wang, Tanmoy Chowdhury, Yun Li, Hejie Cui, Xuchao Zhang, Tianjiao Zhao, Amit Panalkar, Dhagash Mehta, Stefano Pasquali, Wei Cheng, Haoyu Wang, Yanchi Liu, Zhengzhang Chen, Haifeng Chen, Chris White, Quanquan Gu, Jian Pei, Carl Yang, Liang Zhao

    Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of dom… ▽ More

    Submitted 29 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  39. arXiv:2305.17030  [pdf, other

    astro-ph.HE hep-ph

    The First LHAASO Catalog of Gamma-Ray Sources

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022.… ▽ More

    Submitted 27 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 40 pages, 13 figures, 4 tables

    Journal ref: The Astrophysical Journal Supplement Series, 271 (2024) 25

  40. arXiv:2305.14376  [pdf, other

    q-bio.NC cs.LG

    PTGB: Pre-Train Graph Neural Networks for Brain Network Analysis

    Authors: Yi Yang, Hejie Cui, Carl Yang

    Abstract: The human brain is the central hub of the neurobiological system, controlling behavior and cognition in complex ways. Recent advances in neuroscience and neuroimaging analysis have shown a growing interest in the interactions between brain regions of interest (ROIs) and their impact on neural development and disorder diagnosis. As a powerful deep model for analyzing graph-structured data, Graph Ne… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: Accepted to CHIL 2023, 19 pages

  41. arXiv:2305.11614  [pdf, other

    eess.SP

    Two-Bit RIS-Aided Communications at 3.5GHz: Some Insights from the Measurement Results Under Multiple Practical Scenes

    Authors: Shun Zhang, Haoran Sun, Runze Yu, Hongshenyuan Cui, Jian Ren, Feifei Gao, Shi **, Hongxiang Xie, Hao Wang

    Abstract: In this paper, we propose a two-bit reconfigurable intelligent surface (RIS)-aided communication system, which mainly consists of a two-bit RIS, a transmitter and a receiver. A corresponding prototype verification system is designed to perform experimental tests in practical environments. The carrier frequency is set as 3.5GHz, and the RIS array possesses 256 units, each of which adopts two-bit ph… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  42. arXiv:2305.11041  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    High-dimensional Asymptotics of Denoising Autoencoders

    Authors: Hugo Cui, Lenka Zdeborová

    Abstract: We address the problem of denoising data from a Gaussian mixture using a two-layer non-linear autoencoder with tied weights and a skip connection. We consider the high-dimensional limit where the number of training samples and the input dimension jointly tend to infinity while the number of hidden units remains bounded. We provide closed-form expressions for the denoising mean-squared test error.… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  43. arXiv:2305.06517  [pdf, other

    math.AG math.DG

    On area-minimizing Pfaffian varieties

    Authors: Hongbin Cui, Xiaoxiang Jiao, Xiaowei Xu

    Abstract: There are two significant families of minimal real matrix varieties: determinantal varieties and skew-symmetric determinantal varieties, the later ones are also known as Pfaffian varieties. In 1999, Kerckhove and Lawlor [Duke Math.J. 96(2),401--424,1999] proved that determinantal varieties are area-minimizing except for two families. In this paper we prove that all Pfaffian varieties are area-mini… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 29 pages, 1 figure

    MSC Class: Primary 49Q15; 53A10; 53A07; Secondary 14M12; 14M99

  44. Measurement of ultra-high-energy diffuse gamma-ray emission of the Galactic plane from 10 TeV to 1 PeV with LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The diffuse Galactic $γ$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γ$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer ar… ▽ More

    Submitted 19 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages, 8 figures, 5 tables; accepted for publication in Physical Review Letters; source mask file provided as ancillary file

    Journal ref: Phys. Rev. Lett. 131, 151001 (2023)

  45. U-NEED: A Fine-grained Dataset for User Needs-Centric E-commerce Conversational Recommendation

    Authors: Yuanxing Liu, Weinan Zhang, Baohua Dong, Yan Fan, Hang Wang, Fan Feng, Yifan Chen, Ziyu Zhuang, Hengbin Cui, Yongbin Li, Wanxiang Che

    Abstract: Conversational recommender systems (CRSs) aim to understand the information needs and preferences expressed in a dialogue to recommend suitable items to the user. Most of the existing conversational recommendation datasets are synthesized or simulated with crowdsourcing, which has a large gap with real-world scenarios. To bridge the gap, previous work contributes a dataset E-ConvRec, based on pre-… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: SIGIR23 Resource Track

  46. arXiv:2305.04142  [pdf, other

    cs.LG cs.CV cs.NE q-bio.NC

    Transformer-Based Hierarchical Clustering for Brain Network Analysis

    Authors: Wei Dai, Hejie Cui, Xuan Kan, Ying Guo, Sanne van Rooij, Carl Yang

    Abstract: Brain networks, graphical models such as those constructed from MRI, have been widely used in pathological prediction and analysis of brain functions. Within the complex brain system, differences in neuronal connection strengths parcellate the brain into various functional modules (network communities), which are critical for brain analysis. However, identifying such communities within the brain h… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE-ISBI 2023

    MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

  47. arXiv:2305.03963  [pdf, other

    cs.CR cs.AI cs.CV

    Beyond the Model: Data Pre-processing Attack to Deep Learning Models in Android Apps

    Authors: Ye Sang, Yu** Huang, Shuo Huang, Helei Cui

    Abstract: The increasing popularity of deep learning (DL) models and the advantages of computing, including low latency and bandwidth savings on smartphones, have led to the emergence of intelligent mobile applications, also known as DL apps, in recent years. However, this technological development has also given rise to several security concerns, including adversarial examples, model stealing, and data poi… ▽ More

    Submitted 11 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Accepted to AsiaCCS WorkShop on Secure and Trustworthy Deep Learning Systems (SecTL 2023)

  48. arXiv:2305.03888  [pdf, other

    cs.CR cs.AI cs.AR

    Energy-Latency Attacks to On-Device Neural Networks via Sponge Poisoning

    Authors: Zijian Wang, Shuo Huang, Yu** Huang, Helei Cui

    Abstract: In recent years, on-device deep learning has gained attention as a means of develo** affordable deep learning applications for mobile devices. However, on-device models are constrained by limited energy and computation resources. In the mean time, a poisoning attack known as sponge poisoning has been developed.This attack involves feeding the model with poisoned examples to increase the energy c… ▽ More

    Submitted 11 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted to AsiaCCS Workshop on Secure and Trustworthy Deep Learning Systems (SecTL 2023)

  49. arXiv:2304.04928  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Emergence of flat bands and ferromagnetic fluctuations via orbital-selective electron correlations in Mn-based kagome metal

    Authors: Subhasis Samanta, Hwiwoo Park, Chanhyeon Lee, Sungmin Jeon, Hengbo Cui, Yong-Xin Yao, Jungseek Hwang, Kwang-Yong Choi, Heung-Sik Kim

    Abstract: Kagome lattice has been actively studied for the possible realization of frustration-induced two-dimensional flat bands and a number of correlation-induced phases. Currently, the search for kagome systems with a nearly dispersionless flat band close to the Fermi level is ongoing. Here, by combining theoretical and experimental tools, we present Sc$_3$Mn$_3$Al$_7$Si$_5$ as a novel realization of co… ▽ More

    Submitted 25 June, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: 14 pages, 7 figures

    Journal ref: Nat. Commun. 15, 5376 (2024)

  50. arXiv:2303.17740  [pdf, other

    cs.CR

    A CI-based Auditing Framework for Data Collection Practices

    Authors: Athina Markopoulou, Rahmadi Trimananda, Hao Cui

    Abstract: Apps and devices (mobile devices, web browsers, IoT, VR, voice assistants, etc.) routinely collect user data, and send them to first- and third-party servers through the network. Recently, there is a lot of interest in (1) auditing the actual data collection practices of those systems; and also in (2) checking the consistency of those practices against the statements made in the corresponding priv… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 5 pages, 5 figures. The paper was first presented at the 4th Annual Symposium on Applications of Contextual Integrity, NYC, Sept. 2022