Showing 1–50 of 56 results for author: Bai, G

  1. arXiv:2406.14550  [pdf, other]

    cs.CL cs.AI

    GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models

    Authors: Shilong Li, Yancheng He, Hangyu Guo, Xingyuan Bu, Ge Bai, Jie Liu, Jiaheng Liu, Xingwei Qu, Yangguang Li, Wanli Ouyang, Wenbo Su, Bo Zheng

    Abstract: Long-context capabilities are essential for large language models (LLMs) to tackle complex and long-input tasks. Despite numerous efforts made to optimize LLMs for long contexts, challenges persist in robustly processing long inputs. In this paper, we introduce GraphReader, a graph-based agent system designed to handle long texts by structuring them into a graph and employing an agent to explore t…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: The first four authors contributed equally, 27 pages

  2. arXiv:2406.11429  [pdf, other]

    cs.CL cs.AI

    Fusion Makes Perfection: An Efficient Multi-Grained Matching Approach for Zero-Shot Relation Extraction

    Authors: Shilong Li, Ge Bai, Zhang Zhang, Ying Liu, Chenji Lu, Daichi Guo, Ruifang Liu, Yong Sun

    Abstract: Predicting unseen relations that cannot be observed during the training phase is a challenging task in relation extraction. Previous works have made progress by matching the semantics between input instances and label descriptions. However, fine-grained matching often requires laborious manual annotation, and rich interactions between instances and label descriptions come with significant computat…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted to the main conference of NAACL2024

  3. arXiv:2405.16219  [pdf, other]

    cs.LG stat.ML

    Deep Causal Generative Models with Property Control

    Authors: Qilong Zhao, Shiyu Wang, Guangji Bai, Bo Pan, Zhaohui Qin, Liang Zhao

    Abstract: Generating data with properties of interest by external users while following the right causation among its intrinsic factors is important yet has not been well addressed jointly. This is due to the long-lasting challenge of jointly identifying key latent variables, their causal relations, and their correlation with properties of interest, as well as how to leverage their discoveries toward causal…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 13 pages, 6 figures

  4. arXiv:2405.16075  [pdf, other]

    cs.LG cs.AI

    Continuous Temporal Domain Generalization

    Authors: Zekun Cai, Guangji Bai, Renhe Jiang, Xuan Song, Liang Zhao

    Abstract: Temporal Domain Generalization (TDG) addresses the challenge of training predictive models under temporally varying data distributions. Traditional TDG approaches typically focus on domain data collected at fixed, discrete time intervals, which limits their capability to capture the inherent dynamics within continuous-evolving and irregularly-observed temporal domains. To overcome this, this work…

    Submitted 25 May, 2024; originally announced May 2024.

  5. arXiv:2405.05524  [pdf, other]

    cs.CV cs.MM

    Universal Adversarial Perturbations for Vision-Language Pre-trained Models

    Authors: Peng-Fei Zhang, Zi Huang, Guangdong Bai

    Abstract: Vision-language pre-trained (VLP) models have been the foundation of numerous vision-language tasks. Given their prevalence, it becomes imperative to assess their adversarial robustness, especially when deploying them in security-crucial real-world applications. Traditionally, adversarial perturbations generated for this assessment target specific VLP models, datasets, and/or downstream tasks. Thi…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures

    MSC Class: 68T01 ACM Class: I.2.0

  6. arXiv:2405.04191  [pdf, other]

    cs.LG cs.CV

    Effective and Robust Adversarial Training against Data and Label Corruptions

    Authors: Peng-Fei Zhang, Zi Huang, Xin-Shun Xu, Guangdong Bai

    Abstract: Corruptions due to data perturbations and label noise are prevalent in the datasets from unreliable sources, which poses significant threats to model training. Despite existing efforts in developing robust models, current learning methods commonly overlook the possible co-existence of both corruptions, limiting the effectiveness and practicability of the model. In this paper, we develop an Effecti…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 12 pages, 8 figures

    MSC Class: 68T30 ACM Class: I.4.0

  7. arXiv:2405.00074  [pdf, other]

    cs.LG cs.SE

    PAODING: A High-fidelity Data-free Pruning Toolkit for Debloating Pre-trained Neural Networks

    Authors: Mark Huasong Meng, Hao Guan, Liuhuo Wan, Sin Gee Teo, Guangdong Bai, Jin Song Dong

    Abstract: We present PAODING, a toolkit to debloat pretrained neural network models through the lens of data-free pruning. To preserve the model fidelity, PAODING adopts an iterative process, which dynamically measures the effect of deleting a neuron to identify candidates that have the least impact to the output layer. Our evaluation shows that PAODING can significantly reduce the model size, generalize on…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 3 pages

  8. arXiv:2402.17946  [pdf, other]

    cs.CL

    SparseLLM: Towards Global Pruning for Pre-trained Language Models

    Authors: Guangji Bai, Yijiang Li, Chen Ling, Kibaek Kim, Liang Zhao

    Abstract: The transformative impact of large language models (LLMs) like LLaMA and GPT on natural language processing is countered by their prohibitive computational demands. Pruning has emerged as a pivotal compression strategy, introducing sparsity to enhance both memory and computational efficiency. Yet, traditional global pruning is impractical for LLMs due to scalability issues, while local pruning, de…

    Submitted 23 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Preprint. Under review

  9. arXiv:2402.14762  [pdf, other]

    cs.CL cs.AI

    MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

    Authors: Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang

    Abstract: The advent of Large Language Models (LLMs) has drastically enhanced dialogue systems. However, comprehensively evaluating the dialogue abilities of LLMs remains a challenge. Previous benchmarks have primarily focused on single-turn dialogues or provided coarse-grained and incomplete assessments of multi-turn dialogues, overlooking the complexity and fine-grained nuances of real-life dialogues. To…

    Submitted 25 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: [ACL 2024] The first three authors contribute equally, 34 pages, repo at https://github.com/mtbench101/mt-bench-101

  10. arXiv:2402.10189  [pdf, other]

    cs.CL cs.LG

    Uncertainty Quantification for In-Context Learning of Large Language Models

    Authors: Chen Ling, Xujiang Zhao, Xuchao Zhang, Wei Cheng, Yanchi Liu, Yiyou Sun, Mika Oishi, Takao Osaki, Katsushi Matsuda, Jie Ji, Guangji Bai, Liang Zhao, Haifeng Chen

    Abstract: In-context learning has emerged as a groundbreaking ability of Large Language Models (LLMs) and revolutionized various fields by providing a few task-relevant demonstrations in the prompt. However, trustworthy issues with LLM's response, such as hallucination, have also been actively discussed. Existing works have been devoted to quantifying the uncertainty in LLM's response, but they often overlo…

    Submitted 28 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted to the main conference of NAACL 2024

  11. arXiv:2401.13672  [pdf, other]

    cs.DB cs.AI cs.IR

    Transforming Agriculture with Intelligent Data Management and Insights

    Authors: Yu Pan, Jianxin Sun, Hongfeng Yu, Geng Bai, Yufeng Ge, Joe Luck, Tala Awada

    Abstract: Modern agriculture faces grand challenges to meet increased demands for food, fuel, feed, and fiber with population growth under the constraints of climate change and dwindling natural resources. Data innovation is urgently required to secure and improve the productivity, sustainability, and resilience of our agroecosystems. As various sensors and Internet of Things (IoT) instrumentation become mo…

    Submitted 7 November, 2023; originally announced January 2024.

  12. arXiv:2401.04662  [pdf, other]

    cs.CR

    The Devil Behind the Mirror: Tracking the Campaigns of Cryptocurrency Abuses on the Dark Web

    Authors: Pengcheng Xia, Zhou Yu, Kailong Wang, Kai Ma, Shuo Chen, Xiapu Luo, Yajin Zhou, Lei Wu, Guangdong Bai

    Abstract: The dark web has emerged as the state-of-the-art solution for enhanced anonymity. Just like a double-edged sword, it also inadvertently becomes the safety net and breeding ground for illicit activities. Among them, cryptocurrencies have been prevalently abused to receive illicit income while evading regulations. Despite the continuing efforts to combat illicit activities, there is still a lack of…

    Submitted 7 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  13. arXiv:2401.02686  [pdf, other]

    cs.CR cs.LG cs.SE

    Beyond Fidelity: Explaining Vulnerability Localization of Learning-based Detectors

    Authors: Baijun Cheng, Shengming Zhao, Kailong Wang, Meizhen Wang, Guangdong Bai, Ruitao Feng, Yao Guo, Lei Ma, Haoyu Wang

    Abstract: Vulnerability detectors based on deep learning (DL) models have proven their effectiveness in recent years. However, the shroud of opacity surrounding the decision-making process of these detectors makes it difficult for security analysts to comprehend. To address this, various explanation approaches have been proposed to explain the predictions by highlighting important features, which have been…

    Submitted 21 February, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by TOSEM

  14. arXiv:2401.02659  [pdf, other]

    cs.CR

    MalModel: Hiding Malicious Payload in Mobile Deep Learning Models with Black-box Backdoor Attack

    Authors: Jiayi Hua, Kailong Wang, Meizhen Wang, Guangdong Bai, Xiapu Luo, Haoyu Wang

    Abstract: Mobile malware has become one of the most critical security threats in the era of ubiquitous mobile computing. Despite the intensive efforts from security experts to counteract it, recent years have still witnessed a rapid growth of identified malware samples. This could be partly attributed to the newly-emerged technologies that may constantly open up under-studied attack surfaces for the adversa…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract here is shorter than that in the PDF file

  15. arXiv:2401.00625  [pdf, ps, other]

    cs.LG

    Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models

    Authors: Guangji Bai, Zheng Chai, Chen Ling, Shiyu Wang, Jiaying Lu, Nan Zhang, Tingwei Shi, Ziyang Yu, Mengdan Zhu, Yifei Zhang, Carl Yang, Yue Cheng, Liang Zhao

    Abstract: The burgeoning field of Large Language Models (LLMs), exemplified by sophisticated models like OpenAI's ChatGPT, represents a significant advancement in artificial intelligence. These models, however, bring forth substantial challenges in the high consumption of computational, memory, energy, and financial resources, especially in environments with limited resource capabilities. This survey aims t…

    Submitted 3 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: Preprint. GitHub repo: https://github.com/tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers

  16. arXiv:2312.12958  [pdf, other]

    cs.CR cs.SC

    Symbolic Security Verification of Mesh Commissioning Protocol in Thread (extended version)

    Authors: Pankaj Upadhyay, Subodh Sharma, Guangdong Bai

    Abstract: The Thread protocol (or simply Thread) is a popular networking protocol for the Internet of Things (IoT). It allows seamless integration of a set of applications and protocols, hence reducing the risk of incompatibility among different applications or user protocols. Thread has been deployed in many popular smart home products by the majority of IoT manufacturers, such as Apple TV, Apple HomePod…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 18 pages

    MSC Class: 68Q60 ACM Class: I.6.5

  17. arXiv:2312.12276  [pdf, other]

    cs.LG

    POND: Multi-Source Time Series Domain Adaptation with Information-Aware Prompt Tuning

    Authors: Junxiang Wang, Guangji Bai, Wei Cheng, Zhengzhang Chen, Liang Zhao, Haifeng Chen

    Abstract: Time series domain adaptation stands as a pivotal and intricate challenge with diverse applications, including but not limited to human activity recognition, sleep stage classification, and machine fault diagnosis. Despite the numerous domain adaptation techniques proposed to tackle this complex problem, they primarily focus on domain adaptation from a single source domain. Yet, it is more crucial…

    Submitted 7 June, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: accepted by KDD 2024

  18. arXiv:2311.15570  [pdf, other]

    cs.LG cs.CV

    UFDA: Universal Federated Domain Adaptation with Practical Assumptions

    Authors: Xinhui Liu, Zhenghao Chen, Luping Zhou, Dong Xu, Wei Xi, Gairui Bai, Yihan Zhao, Jizhong Zhao

    Abstract: Conventional Federated Domain Adaptation (FDA) approaches usually demand an abundance of assumptions, which makes them significantly less feasible for real-world situations and introduces security hazards. This paper relaxes the assumptions from previous FDAs and studies a more practical scenario named Universal Federated Domain Adaptation (UFDA). It only requires the black-box model and the label…

    Submitted 19 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted by AAAI2024

    Journal ref: AAAI2024

  19. arXiv:2311.10331  [pdf, other]

    eess.IV cs.CV

    Leveraging Multimodal Fusion for Enhanced Diagnosis of Multiple Retinal Diseases in Ultra-wide OCTA

    Authors: Hao Wei, Peilun Shi, Guitao Bai, Minqing Zhang, Shuangle Li, Wu Yuan

    Abstract: Ultra-wide optical coherence tomography angiography (UW-OCTA) is an emerging imaging technique that offers significant advantages over traditional OCTA by providing an exceptionally wide scanning range of up to 24 x 20 $mm^{2}$, covering both the anterior and posterior regions of the retina. However, the currently accessible UW-OCTA datasets suffer from limited comprehensive hierarchical informati…

    Submitted 17 November, 2023; originally announced November 2023.

  20. AGRAMPLIFIER: Defending Federated Learning Against Poisoning Attacks Through Local Update Amplification

    Authors: Zirui Gong, Liyue Shen, Yanjun Zhang, Leo Yu Zhang, Jingwei Wang, Guangdong Bai, Yong Xiang

    Abstract: The collaborative nature of federated learning (FL) poses a major threat in the form of manipulation of local training data and local updates, known as the Byzantine poisoning attack. To address this issue, many Byzantine-robust aggregation rules (AGRs) have been proposed to filter out or moderate suspicious local updates uploaded by Byzantine participants. This paper introduces a novel approach…

    Submitted 23 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: Accepted by IEEE TIFS, this is the complete version

  21. arXiv:2310.08537  [pdf, other]

    cs.CV

    XAI Benchmark for Visual Explanation

    Authors: Yifei Zhang, Siyi Gu, James Song, Bo Pan, Guangji Bai, Liang Zhao

    Abstract: The rise of deep learning has ushered in significant progress in computer vision (CV) tasks, yet the "black box" nature of these models often precludes interpretability. This challenge has spurred the development of Explainable Artificial Intelligence (XAI) by generating explanations to AI's decision-making process. An explanation is aimed to not only faithfully reflect the true reasoning process…

    Submitted 21 November, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  22. arXiv:2310.08420  [pdf, other]

    cs.CV

    Visual Attention Prompted Prediction and Learning

    Authors: Yifei Zhang, Siyi Gu, Bo Pan, Guangji Bai, Meikang Qiu, Xiaofeng Yang, Liang Zhao

    Abstract: Visual explanation (attention)-guided learning uses not only labels but also explanations to guide model reasoning process. While visual attention-guided learning has shown promising results, it requires a large number of explanation annotations that are time-consuming to prepare. However, in many real-world situations, it is usually desired to prompt the model with visual attention without model…

    Submitted 23 April, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  23. arXiv:2310.04334  [pdf, other]

    cs.LG

    Saliency-Guided Hidden Associative Replay for Continual Learning

    Authors: Guangji Bai, Qilong Zhao, Xiaoyang Jiang, Yifei Zhang, Liang Zhao

    Abstract: Continual Learning is a burgeoning domain in next-generation AI, focusing on training neural networks over a sequence of tasks akin to human learning. While CL provides an edge over traditional supervised learning, its central challenge remains to counteract catastrophic forgetting and ensure the retention of prior tasks during subsequent learning. Amongst various strategies to tackle this, replay…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: Preprint. Do not distribute

  24. arXiv:2309.15493  [pdf, other]

    cs.CV

    CauDR: A Causality-inspired Domain Generalization Framework for Fundus-based Diabetic Retinopathy Grading

    Authors: Hao Wei, Peilun Shi, Juzheng Miao, Minqing Zhang, Guitao Bai, Jianing Qiu, Furui Liu, Wu Yuan

    Abstract: Diabetic retinopathy (DR) is the most common diabetic complication, which usually leads to retinal damage, vision loss, and even blindness. A computer-aided DR grading system has a significant impact on helping ophthalmologists with rapid screening and diagnosis. Recent advances in fundus photography have precipitated the development of novel retinal imaging cameras and their subsequent implementa…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 13 pages, 9 figures

  25. arXiv:2308.13466  [pdf, other]

    cs.LG

    Staleness-Alleviated Distributed GNN Training via Online Dynamic-Embedding Prediction

    Authors: Guangji Bai, Ziyang Yu, Zheng Chai, Yue Cheng, Liang Zhao

    Abstract: Despite the recent success of Graph Neural Networks (GNNs), it remains challenging to train GNNs on large-scale graphs due to neighbor explosions. As a remedy, distributed computing becomes a promising solution by leveraging abundant computing resources (e.g., GPU). However, the node dependency of graph data increases the difficulty of achieving high concurrency in distributed GNN training, which…

    Submitted 10 December, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Preprint. Do not distribute

  26. arXiv:2305.11389  [pdf, other]

    cs.LG cs.AI

    Domain Generalization Deep Graph Transformation

    Authors: Shiyu Wang, Guangji Bai, Qingyang Zhu, Zhaohui Qin, Liang Zhao

    Abstract: Graph transformation that predicts graph transition from one mode to another is an important and common problem. Despite much progress in developing advanced graph transformation techniques in recent years, the fundamental assumption typically required in machine-learning models that the testing and training data preserve the same distribution does not always hold. As a result, domain generalizati…

    Submitted 23 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  27. arXiv:2302.02093  [pdf]

    cs.AI cs.NE

    Knowledge-enhanced Neural Machine Reasoning: A Review

    Authors: Tanmoy Chowdhury, Chen Ling, Xuchao Zhang, Xujiang Zhao, Guangji Bai, Jian Pei, Haifeng Chen, Liang Zhao

    Abstract: Knowledge-enhanced neural machine reasoning has garnered significant attention as a cutting-edge yet challenging research area with numerous practical applications. Over the past few years, plenty of studies have leveraged various forms of external knowledge to augment the reasoning capabilities of deep models, tackling challenges such as effective knowledge integration, implicit knowledge mining,…

    Submitted 6 February, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 8 pages, 3 figures

  28. arXiv:2212.13242  [pdf, other]

    cs.LG

    Saliency-Augmented Memory Completion for Continual Learning

    Authors: Guangji Bai, Chen Ling, Yuyang Gao, Liang Zhao

    Abstract: Continual Learning is considered a key step toward next-generation Artificial Intelligence. Among various methods, replay-based approaches that maintain and replay a small episodic memory of previous samples are one of the most successful strategies against catastrophic forgetting. However, since forgetting is inevitable given bounded memory and unbounded tasks, how to forget is a problem continua…

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: Published at SIAM SDM 2023. 15 pages, 6 figures. Code: https://github.com/BaiTheBest/SAMC

  29. Quantum Similarity Testing with Convolutional Neural Networks

    Authors: Ya-Dong Wu, Yan Zhu, Ge Bai, Yuexuan Wang, Giulio Chiribella

    Abstract: The task of testing whether two uncharacterized quantum devices behave in the same way is crucial for benchmarking near-term quantum computers and quantum simulators, but has so far remained open for continuous-variable quantum systems. In this Letter, we develop a machine learning algorithm for comparing unknown continuous variable states using limited and noisy data. The algorithm works on non-G…

    Submitted 25 May, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  30. arXiv:2210.00729  [pdf, other]

    cs.LG cs.SI

    Deep Spatial Domain Generalization

    Authors: Dazhou Yu, Guangji Bai, Yun Li, Liang Zhao

    Abstract: Spatial autocorrelation and spatial heterogeneity widely exist in spatial data, which make the traditional machine learning model perform badly. Spatial domain generalization is a spatial extension of domain generalization, which can generalize to unseen spatial domains in continuous 2D space. Specifically, it learns a model under varying data distributions that generalizes to unseen domains. Alth…

    Submitted 27 December, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

  31. arXiv:2209.15223  [pdf, other]

    eess.SP cs.HC

    ASTF: Visual Abstractions of Time-Varying Patterns in Radio Signals

    Authors: Ying Zhao, Luhao Ge, Huixuan Xie, Genghuai Bai, Zhao Zhang, Qiang Wei, Yun Lin, Yuchao Liu, Fangfang Zhou

    Abstract: A time-frequency diagram is a commonly used visualization for observing the time-frequency distribution of radio signals and analyzing their time-varying patterns of communication states in radio monitoring and management. While it excels when performing short-term signal analyses, it becomes inadaptable for long-term signal analyses because it cannot adequately depict signal time-varying patterns…

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 11 pages, 9 figures

  32. Saliency-Regularized Deep Multi-Task Learning

    Authors: Guangji Bai, Liang Zhao

    Abstract: Multitask learning is a framework that enforces multiple learning tasks to share knowledge to improve their generalization abilities. While shallow multitask learning can learn task relations, it can only handle predefined features. Modern deep multitask learning can jointly learn latent features and task sharing, but they are obscure in task relation. Also, they predefine which layers and neurons…

    Submitted 3 July, 2022; originally announced July 2022.

    Comments: 10 pages, 7 Figures, KDD 2022

  33. RES: A Robust Framework for Guiding Visual Explanation

    Authors: Yuyang Gao, Tong Steven Sun, Guangji Bai, Siyi Gu, Sungsoo Ray Hong, Liang Zhao

    Abstract: Despite the fast progress of explanation techniques in modern Deep Neural Networks (DNNs) where the main focus is handling "how to generate the explanations", advanced research questions that examine the quality of the explanation itself (e.g., "whether the explanations are accurate") and improve the explanation quality (e.g., "how to adjust the model to generate more accurate explanations when ex…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Published in KDD 2022

    Journal ref: In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

  34. Adversarial Robustness of Deep Neural Networks: A Survey from a Formal Verification Perspective

    Authors: Mark Huasong Meng, Guangdong Bai, Sin Gee Teo, Zhe Hou, Yan Xiao, Yun Lin, Jin Song Dong

    Abstract: Neural networks have been widely applied in security applications such as spam and phishing detection, intrusion prevention, and malware detection. This black-box method, however, often has uncertainty and poor explainability in applications. Furthermore, neural networks themselves are often vulnerable to adversarial attacks. For those reasons, there is a high demand for trustworthy and rigorous m…

    Submitted 11 October, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

  35. arXiv:2206.00057  [pdf, other]

    cs.LG cs.DC

    Distributed Graph Neural Network Training with Periodic Stale Representation Synchronization

    Authors: Zheng Chai, Guangji Bai, Liang Zhao, Yue Cheng

    Abstract: Despite the recent success of Graph Neural Networks, it remains challenging to train a GNN on large graphs with millions of nodes and billions of edges, which are prevalent in many graph-based applications. Traditional sampling-based methods accelerate GNN training by dropping edges and nodes, which impairs the graph integrity and model performance. Differently, distributed GNN algorithms accelera…

    Submitted 2 October, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Preprint: 20 pages, 9 figures

  36. arXiv:2205.10664  [pdf, other]

    cs.LG

    Temporal Domain Generalization with Drift-Aware Dynamic Neural Networks

    Authors: Guangji Bai, Chen Ling, Liang Zhao

    Abstract: Temporal domain generalization is a promising yet extremely challenging area where the goal is to learn models under temporally changing data distributions and generalize to unseen data distributions following the trends of the change. The advancement of this area is challenged by: 1) characterizing data distribution drift and its impacts on models, 2) expressiveness in tracking the model dynamics…

    Submitted 9 February, 2023; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: Published in ICLR 2023 (Oral)

  37. arXiv:2204.00783  [pdf, other]

    cs.LG cs.CR

    Supervised Robustness-preserving Data-free Neural Network Pruning

    Authors: Mark Huasong Meng, Guangdong Bai, Sin Gee Teo, Jin Song Dong

    Abstract: When deploying pre-trained neural network models in real-world applications, model consumers often encounter resource-constraint platforms such as mobile and smart devices. They typically use the pruning technique to reduce the size and complexity of the model, generating a lighter one with less resource consumption. Nonetheless, most existing pruning methods are proposed with the premise that the…

    Submitted 1 November, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

  38. Flexible learning of quantum states with generative query neural networks

    Authors: Yan Zhu, Ya-Dong Wu, Ge Bai, Dong-Sheng Wang, Yuexuan Wang, Giulio Chiribella

    Abstract: Deep neural networks are a powerful tool for the characterization of quantum states. Existing networks are typically trained with experimental data gathered from the specific quantum state that needs to be characterized. But is it possible to train a neural network offline and to make predictions about quantum states other than the ones used for the training? Here we introduce a model of net…

    Submitted 19 July, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Major improvements in the presentation; new numerical experiments that illustrate the applicability of our neural network to various types of states

    Journal ref: Nat Commun 13, 6222 (2022)

  39. arXiv:2110.13029  [pdf, other]

    cs.LG cs.CY cs.SE

    Fair Enough: Searching for Sufficient Measures of Fairness

    Authors: Suvodeep Majumder, Joymallya Chakraborty, Gina R. Bai, Kathryn T. Stolee, Tim Menzies

    Abstract: Testing machine learning software for ethical bias has become a pressing current concern. In response, recent research has proposed a plethora of new fairness metrics, for example, the dozens of fairness metrics in the IBM AIF360 toolkit. This raises the question: How can any fairness tool satisfy such a diverse range of goals? While we cannot completely simplify the task of fairness testing, we c…

    Submitted 21 March, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: 8 tables and 1 figure

  40. arXiv:2104.13050  [pdf, other]

    cs.LG

    Confined Gradient Descent: Privacy-preserving Optimization for Federated Learning

    Authors: Yanjun Zhang, Guangdong Bai, Xue Li, Surya Nepal, Ryan K L Ko

    Abstract: Federated learning enables multiple participants to collaboratively train a model without aggregating the training data. Although the training data are kept within each participant and the local gradients can be securely synthesized, recent studies have shown that such privacy protection is insufficient. The global model parameters that have to be shared for optimization are susceptible to leak in…

    Submitted 27 April, 2021; originally announced April 2021.

  41. arXiv:2103.07012  [pdf, other]

    cs.CR cs.SE

    ColdPress: An Extensible Malware Analysis Platform for Threat Intelligence

    Authors: Haoxi Tan, Mahin Chandramohan, Cristina Cifuentes, Guangdong Bai, Ryan K. L. Ko

    Abstract: Malware analysis is still largely a manual task. This slow and inefficient approach does not scale to the exponential rise in the rate of new unique malware generated. Hence, automating the process as much as possible becomes desirable. In this paper, we present ColdPress - an extensible malware analysis platform that automates the end-to-end process of malware threat intelligence gathering inte…

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: The code is open source at https://github.com/uqcyber/ColdPress

  42. arXiv:2102.11191  [pdf, other]

    cs.LG

    Sign-regularized Multi-task Learning

    Authors: Johnny Torres, Guangji Bai, Junxiang Wang, Liang Zhao, Carmen Vaca, Cristina Abad

    Abstract: Multi-task learning is a framework that enforces different learning tasks to share their knowledge to improve their generalization performance. It is a hot and active domain that strives to handle several core issues; particularly, which tasks are correlated and similar, and how to share the knowledge among correlated tasks. Existing works usually do not distinguish the polarity and magnitude of f…

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: 17 pages, 4 figures, v1

  43. arXiv:2101.11866  [pdf, other]

    cs.CR

    An Analytics Framework for Heuristic Inference Attacks against Industrial Control Systems

    Authors: Taejun Choi, Guangdong Bai, Ryan K L Ko, Naipeng Dong, Wenlu Zhang, Shunyao Wang

    Abstract: Industrial control systems (ICS) of critical infrastructure are increasingly connected to the Internet for remote site management at scale. However, cyber attacks against ICS - especially at the communication channels between human-machine interfaces (HMIs) and programmable logic controllers (PLCs) - are increasing at a rate which outstrips the rate of mitigation. In this paper, we introduce a ven…

    Submitted 28 January, 2021; originally announced January 2021.

  44. arXiv:2010.12218  [pdf, other]

    cs.SE

    When the Open Source Community Meets COVID-19: Characterizing COVID-19 themed GitHub Repositories

    Authors: Liu Wang, Ruiqing Li, Jiaxin Zhu, Guangdong Bai, Haoyu Wang

    Abstract: Ever since the beginning of the outbreak of the COVID-19 pandemic, researchers from interdisciplinary domains have worked together to fight against the crisis. The open source community plays a vital role in coping with the pandemic, which is inherently a collaborative process. Plenty of COVID-19 related datasets, tools, software, deep learning models, are created and shared in research communitie…

    Submitted 23 October, 2020; originally announced October 2020.

  45. arXiv:2010.05522  [pdf, other]

    cs.CL

    Pre-trained Language Model Based Active Learning for Sentence Matching

    Authors: Guirong Bai, Shizhu He, Kang Liu, Jun Zhao, Zaiqing Nie

    Abstract: Active learning is able to significantly reduce the annotation cost for data-driven techniques. However, previous active learning approaches for natural language processing mainly depend on the entropy-based uncertainty criterion, and ignore the characteristics of natural language. In this paper, we propose a pre-trained language model based active learning approach for sentence matching. Differin…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Accepted by the conference of COLING 2020

  46. arXiv:2007.13639  [pdf, other]

    cs.CR

    Don't Fish in Troubled Waters! Characterizing Coronavirus-themed Cryptocurrency Scams

    Authors: Pengcheng Xia, Haoyu Wang, Xiapu Luo, Lei Wu, Yajin Zhou, Guangdong Bai, Guoai Xu, Gang Huang, Xuanzhe Liu

    Abstract: As COVID-19 has been spreading across the world since early 2020, a growing number of malicious campaigns are capitalizing the topic of COVID-19. COVID-19 themed cryptocurrency scams are increasingly popular during the pandemic. However, these newly emerging scams are poorly understood by our community. In this paper, we present the first measurement study of COVID-19 themed cryptocurrency scams.…

    Submitted 1 November, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

  47. arXiv:2007.06953  [pdf, other]

    cs.CR

    PrivColl: Practical Privacy-Preserving Collaborative Machine Learning

    Authors: Yanjun Zhang, Guangdong Bai, Xue Li, Caitlin Curtis, Chen Chen, Ryan K L Ko

    Abstract: Collaborative learning enables two or more participants, each with their own training dataset, to collaboratively learn a joint model. It is desirable that the collaboration should not cause the disclosure of either the raw datasets of each individual owner or the local model parameters trained on them. This privacy-preservation requirement has been approached through differential privacy mechanis…

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: 20 pages, 3 figures, to be published in 25th European Symposium on Research in Computer Security (ESORICS) 2020

  48. arXiv:2006.06419  [pdf, other]

    cs.CR

    DEPOSafe: Demystifying the Fake Deposit Vulnerability in Ethereum Smart Contracts

    Authors: Ru Ji, Ningyu He, Lei Wu, Haoyu Wang, Guangdong Bai, Yao Guo

    Abstract: Cryptocurrency has seen an explosive growth in recent years, thanks to the evolvement of blockchain technology and its economic ecosystem. Besides Bitcoin, thousands of cryptocurrencies have been distributed on blockchains, while hundreds of cryptocurrency exchanges are emerging to facilitate the trading of digital assets. At the same time, it also attracts the attentions of attackers. Fake deposi…

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: 10 pages, 6 figures

  49. arXiv:1807.11790  [pdf, other]

    cs.GT cs.LG stat.ML

    Practical Constrained Optimization of Auction Mechanisms in E-Commerce Sponsored Search Advertising

    Authors: Gang Bai, Zhihui Xie, Liang Wang

    Abstract: Sponsored search in E-commerce platforms such as Amazon, Taobao and Tmall provides sellers an effective way to reach potential buyers with most relevant purpose. In this paper, we study the auction mechanism optimization problem in sponsored search on Alibaba's mobile E-commerce platform. Besides generating revenue, we are supposed to maintain an efficient marketplace with plenty of quality users,…

    Submitted 31 July, 2018; originally announced July 2018.

    Comments: 6 pages, 1 figure

  50. arXiv:1806.04414  [pdf]

    physics.app-ph cs.IT

    Controlling spectral energies of all harmonics in programmable way using time-domain digital coding metasurface

    Authors: Jie Zhao, Xi Yang, Jun Yan Dai, Qiang Cheng, Xiang Li, Ning Hua Qi, Jun Chen Ke, Guo Dong Bai, Shuo Liu, Shi Jin, Tie Jun Cui

    Abstract: Modern wireless communication is one of the most important information technologies, but its system architecture has been unchanged for many years. Here, we propose a much simpler architecture for wireless communication systems based on metasurface. We firstly propose a time-domain digital coding metasurface to reach a simple but efficient method to manipulate spectral distributions of harmonics.…

    Submitted 12 June, 2018; originally announced June 2018.