Showing 1–50 of 137 results for author: Gao, B

Searching in archive cs.
  1. arXiv:2406.18032

    cs.CR cs.DC cs.NI

    A Communication Satellite Servises Based Decentralized Network Protocol

    Authors: Xiao Yan, Bernie Gao

    Abstract: In this paper, we present a decentralized network protocol, Space Network Protocol, based on Communication Satellite Services. The protocol outlines a method for distributing information about the status of satellite communication services across the entire blockchain network, facilitating fairness and transparency in all communication services. Our primary objective is to standardize the services…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.14024

    cs.CL

    LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

    Authors: Bofei Gao, Zefan Cai, Runxin Xu, Peiyi Wang, Ce Zheng, Runji Lin, Keming Lu, Junyang Lin, Chang Zhou, Wen Xiao, Junjie Hu, Tianyu Liu, Baobao Chang

    Abstract: Mathematical verifiers achieve success in mathematical reasoning tasks by validating the correctness of solutions. However, existing verifiers are trained with binary classification labels, which are not informative enough for the model to accurately assess the solutions. To mitigate the aforementioned insufficiency of binary labels, we introduce step-wise natural language feedbacks as rationale la…

    Submitted 30 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages

  3. arXiv:2406.08980

    q-bio.BM cs.LG

    From Theory to Therapy: Reframing SBDD Model Evaluation via Practical Metrics

    Authors: Bowen Gao, Haichuan Tan, Yanwen Huang, Minsi Ren, Xiao Huang, Wei-Ying Ma, Ya-Qin Zhang, Yanyan Lan

    Abstract: Recent advancements in structure-based drug design (SBDD) have significantly enhanced the efficiency and precision of drug discovery by generating molecules tailored to bind specific protein pockets. Despite these technological strides, their practical application in real-world drug development remains challenging due to the complexities of synthesizing and testing these molecules. The reliability…

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.08961

    q-bio.BM cs.LG

    SIU: A Million-Scale Structural Small Molecule-Protein Interaction Dataset for Unbiased Bioactivity Prediction

    Authors: Yanwen Huang, Bowen Gao, Yinjun Jia, Hongbo Ma, Wei-Ying Ma, Ya-Qin Zhang, Yanyan Lan

    Abstract: Small molecules play a pivotal role in modern medicine, and scrutinizing their interactions with protein targets is essential for the discovery and development of novel, life-saving therapeutics. The term "bioactivity" encompasses various biological effects resulting from these interactions, including both binding and functional responses. The magnitude of bioactivity dictates the therapeutic or t…

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.02069

    cs.CL cs.AI

    PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

    Authors: Zefan Cai, Yichi Zhang, Bofei Gao, Yuliang Liu, Tianyu Liu, Keming Lu, Wayne Xiong, Yue Dong, Baobao Chang, Junjie Hu, Wen Xiao

    Abstract: In this study, we investigate whether attention-based information flow inside large language models (LLMs) is aggregated through noticeable patterns for long context processing. Our observations reveal that LLMs aggregate information through Pyramidal Information Funneling where attention is scattering widely in lower layers, progressively consolidating within specific contexts, and ultimately foc…

    Submitted 16 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2405.19697

    math.OC cs.AI cs.LG stat.ML

    Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity

    Authors: Yan Yang, Bin Gao, Ya-xiang Yuan

    Abstract: Bilevel reinforcement learning (RL), which features intertwined two-level problems, has attracted growing interest recently. The inherent non-convexity of the lower-level RL problem is, however, an impediment to developing bilevel optimization methods. By employing the fixed point equation associated with the regularized RL, we characterize the hyper-gradient via fully first-order informatio…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 43 pages, 1 figure, 1 table

  7. arXiv:2405.17802

    cs.LG cs.AI q-bio.BM

    Multi-level Interaction Modeling for Protein Mutational Effect Prediction

    Authors: Yuanle Mo, Xin Hong, Bowen Gao, Yinjun Jia, Yanyan Lan

    Abstract: Protein-protein interactions are central mediators in many biological processes. Accurately predicting the effects of mutations on interactions is crucial for guiding the modulation of these interactions, thereby playing a significant role in therapeutic development and drug discovery. Mutations generally affect interactions hierarchically across three levels: mutated residues exhibit different si…

    Submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2405.13378

    cs.LG

    FedCache 2.0: Exploiting the Potential of Distilled Data in Knowledge Cache-driven Federated Learning

    Authors: Quyang Pan, Sheng Sun, Zhiyuan Wu, Yuwei Wang, Min Liu, Bo Gao

    Abstract: Federated Edge Learning (FEL) has emerged as a promising approach for enabling edge devices to collaboratively train machine learning models while preserving data privacy. Despite its advantages, practical FEL deployment faces significant challenges related to device constraints and device-server interactions, necessitating heterogeneous, user-adaptive model training with limited and uncertain com…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 20 pages, 8 figures, 10 tables

  9. arXiv:2405.11440

    cs.CR cs.DC cs.NI

    A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure

    Authors: Wei Sun, Bo Gao, Ke Xiong, Yuwei Wang

    Abstract: As a distributed machine learning paradigm, federated learning (FL) is collaboratively carried out on privately owned datasets but without direct data access. Although the original intention is to allay data privacy concerns, "available but not visible" data in FL potentially brings new security threats, particularly poisoning attacks that target such "not visible" local data. Initial attempts hav…

    Submitted 21 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: 18 pages, 16 figures

  10. arXiv:2405.02288

    cs.CV cs.AI cs.RO

    Prospective Role of Foundation Models in Advancing Autonomous Vehicles

    Authors: Jianhua Wu, Bingzhao Gao, Jincheng Gao, Jianhao Yu, Hongqing Chu, Qiankun Yu, Xun Gong, Yi Chang, H. Eric Tseng, Hong Chen, Jie Chen

    Abstract: With the development of artificial intelligence and breakthroughs in deep learning, large-scale Foundation Models (FMs), such as GPT, Sora, etc., have achieved remarkable results in many fields including natural language processing and computer vision. The application of FMs in autonomous driving holds considerable promise. For example, they can contribute to enhancing scene understanding and reas…

    Submitted 17 May, 2024; v1 submitted 8 December, 2023; originally announced May 2024.

    Comments: 45 pages, 8 figures

  11. arXiv:2405.01702

    cs.LG math.OC stat.ML

    Optimization without Retraction on the Random Generalized Stiefel Manifold

    Authors: Simon Vary, Pierre Ablin, Bin Gao, P. -A. Absil

    Abstract: Optimization over the set of matrices $X$ that satisfy $X^\top B X = I_p$, referred to as the generalized Stiefel manifold, appears in many applications involving sampled covariance matrices such as the canonical correlation analysis (CCA), independent component analysis (ICA), and the generalized eigenvalue problem (GEVP). Solving these problems is typically done by iterative methods that require…

    Submitted 5 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: This v2 is the camera-ready version for ICML 2024

    MSC Class: 90C26; 90C15

  12. arXiv:2404.15328

    eess.SP cs.LG stat.ML

    Time topological analysis of EEG using signature theory

    Authors: Stéphane Chrétien, Ben Gao, Astrid Thebault-Guiochon, Rémi Vaucher

    Abstract: Anomaly detection in multivariate signals is a task of paramount importance in many disciplines (epidemiology, finance, cognitive sciences and neurosciences, oncology, etc.). In this perspective, Topological Data Analysis (TDA) offers a battery of "shape" invariants that can be exploited for the implementation of an effective detection scheme. Our contribution consists of extending the constructio…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 14 pages, 5 figures. Under review for Journée des Statistiques 2024

  13. arXiv:2404.10255

    cs.LG cs.CR cs.DC

    Privacy-Enhanced Training-as-a-Service for On-Device Intelligence: Concept, Architectural Scheme, and Open Problems

    Authors: Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Tianliu He, Wen Wang

    Abstract: On-device intelligence (ODI) enables artificial intelligence (AI) applications to run on end devices, providing real-time and customized AI inference without relying on remote servers. However, training models for on-device deployment face significant challenges due to the decentralized and privacy-sensitive nature of users' data, along with end-side constraints related to network connectivity, co…

    Submitted 27 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 7 pages, 3 figures

  14. arXiv:2404.03331

    math.OC cs.LG stat.ML

    LancBiO: dynamic Lanczos-aided bilevel optimization via Krylov subspace

    Authors: Bin Gao, Yan Yang, Ya-xiang Yuan

    Abstract: Bilevel optimization, with broad applications in machine learning, has an intricate hierarchical structure. Gradient-based methods have emerged as a common approach to large-scale bilevel problems. However, the computation of the hyper-gradient, which involves a Hessian inverse vector product, confines the efficiency and is regarded as a bottleneck. To circumvent the inverse, we construct a sequen…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 35 pages, 11 figures, 1 table

  15. arXiv:2404.01727

    cs.RO cs.CV

    Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

    Authors: Haoxiang Ma, Modi Shi, Boyang Gao, Di Huang

    Abstract: We focus on the generalization ability of the 6-DoF grasp detection method in this paper. While learning-based grasp detection methods can predict grasp poses for unseen objects using the grasp distribution learned from the training set, they often exhibit a significant performance drop when encountering objects with diverse shapes and structures. To enhance the grasp detection methods' generaliza…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024

  16. Boosting Visual Recognition in Real-world Degradations via Unsupervised Feature Enhancement Module with Deep Channel Prior

    Authors: Zhanwen Liu, Yuhang Li, Yang Wang, Bolin Gao, Yisheng An, Xiangmo Zhao

    Abstract: The environmental perception of autonomous vehicles in normal conditions has achieved considerable success in the past decade. However, various unfavourable conditions such as fog, low-light, and motion blur will degrade image quality and pose tremendous threats to the safety of autonomous driving. That is, when applied to degraded images, state-of-the-art visual models often suffer performance d…

    Submitted 11 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 14 pages, 14 figures, published in TIV 2024

    Journal ref: IEEE Transactions on Intelligent Vehicles, April 2024

  17. arXiv:2403.19708

    cs.CL cs.LG

    Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention

    Authors: Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo

    Abstract: Interacting with humans through multi-turn conversations is a fundamental feature of large language models (LLMs). However, existing LLM serving engines executing multi-turn conversations are inefficient due to the need to repeatedly compute the key-value (KV) caches of historical tokens, incurring high serving costs. To address the problem, this paper proposes CachedAttention, a new attention mec…

    Submitted 30 June, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted to USENIX Annual Technical Conference (ATC) 2024

  18. arXiv:2403.14233

    cs.CV cs.AI cs.LG

    SoftPatch: Unsupervised Anomaly Detection with Noisy Data

    Authors: Xi Jiang, Ying Chen, Qiang Nie, Yong Liu, Jianlin Liu, Bin-Bin Gao, Jun Liu, Chengjie Wang, Feng Zheng

    Abstract: Although mainstream unsupervised anomaly detection (AD) algorithms perform well in academic datasets, their performance is limited in practical application due to the ideal experimental setting of clean training data. Training with noisy data is an inevitable problem in real-world anomaly detection but is seldom discussed. This paper considers label-level noise in image sensory anomaly detection f…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 36th Conference on Neural Information Processing Systems

    Journal ref: Advances in Neural Information Processing Systems 35, ISBN: 9781713871088, (2022)

  19. arXiv:2403.12987

    q-bio.BM cs.LG

    Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion

    Authors: Bowen Gao, Minsi Ren, Yuyan Ni, Yanwen Huang, Bo Qiang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan

    Abstract: In the field of Structure-based Drug Design (SBDD), deep learning-based generative models have achieved outstanding performance in terms of docking score. However, further study shows that the existing molecular generative methods and docking scores both have lacked consideration in terms of specificity, which means that generated molecules bind to almost every protein pocket with high affinity. T…

    Submitted 4 March, 2024; originally announced March 2024.

  20. arXiv:2403.12580

    cs.CV

    Real-IAD: A Real-World Multi-View Dataset for Benchmarking Versatile Industrial Anomaly Detection

    Authors: Chengjie Wang, Wenbing Zhu, Bin-Bin Gao, Zhenye Gan, Jianning Zhang, Zhihao Gu, Shuguang Qian, Mingang Chen, Lizhuang Ma

    Abstract: Industrial anomaly detection (IAD) has garnered significant attention and experienced rapid development. However, the recent development of IAD approach has encountered certain difficulties due to dataset limitations. On the one hand, most of the state-of-the-art methods have achieved saturation (over 99% in AUROC) on mainstream datasets such as MVTec, and the differences of methods cannot be well…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  21. arXiv:2403.11511

    cs.RO cs.CV

    Sim-to-Real Grasp Detection with Global-to-Local RGB-D Adaptation

    Authors: Haoxiang Ma, Ran Qin, Modi Shi, Boyang Gao, Di Huang

    Abstract: This paper focuses on the sim-to-real issue of RGB-D grasp detection and formulates it as a domain adaptation problem. In this case, we present a global-to-local method to address hybrid domain gaps in RGB and depth data and insufficient multi-modal feature alignment. First, a self-supervised rotation pre-training strategy is adopted to deliver robust initialization for RGB and depth networks. We…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at ICRA 2024

  22. arXiv:2402.13075

    eess.SY cs.RO

    Formal Synthesis of Controllers for Safety-Critical Autonomous Systems: Developments and Challenges

    Authors: Xiang Yin, Bingzhao Gao, Xiao Yu

    Abstract: In recent years, formal methods have been extensively used in the design of autonomous systems. By employing mathematically rigorous techniques, formal methods can provide fully automated reasoning processes with provable safety guarantees for complex dynamic systems with intricate interactions between continuous dynamics and discrete logics. This paper provides a comprehensive review of formal co…

    Submitted 20 February, 2024; originally announced February 2024.

  23. arXiv:2402.02968

    cs.CV cs.LG

    Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

    Authors: Sheng Luo, Wei Chen, Wanxin Tian, Rui Liu, Luanxuan Hou, Xiubao Zhang, Haifeng Shen, Ruiqi Wu, Shuyi Geng, Yi Zhou, Ling Shao, Yi Yang, Bojun Gao, Qun Li, Guobin Wu

    Abstract: Foundation models have indeed made a profound impact on various fields, emerging as pivotal components that significantly shape the capabilities of intelligent systems. In the context of intelligent vehicles, leveraging the power of foundation models has proven to be transformative, offering notable advancements in visual understanding. Equipped with multi-modal and multi-task learning capabilitie…

    Submitted 26 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to IEEE Transactions on Intelligent Vehicles (T-IV). 24 pages, 9 figures, 1 table

  24. arXiv:2401.15558

    cs.OS

    numaPTE: Managing Page-Tables and TLBs on NUMA Systems

    Authors: Bin Gao, Qingxuan Kang, Hao-Wei Tee, Kyle Timothy Ng Chu, Alireza Sanaee, Djordje Jevdjic

    Abstract: Memory management operations that modify page-tables, typically performed during memory allocation/deallocation, are infamous for their poor performance in highly threaded applications, largely due to process-wide TLB shootdowns that the OS must issue due to the lack of hardware support for TLB coherence. We study these operations in NUMA settings, where we observe up to 40x overhead for basic ope…

    Submitted 27 January, 2024; originally announced January 2024.

  25. arXiv:2401.08045

    cs.CV

    Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

    Authors: Xu Yan, Haiming Zhang, Yingjie Cai, Jingming Guo, Weichao Qiu, Bin Gao, Kaiqiang Zhou, Yue Zhao, Huan Jin, Jiantao Gao, Zhen Li, Lihui Jiang, Wei Zhang, Hongbo Zhang, Dengxin Dai, Bingbing Liu

    Abstract: The rise of large foundation models, trained on extensive datasets, is revolutionizing the field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by extracting intricate patterns and performing effectively across diverse tasks, thereby serving as potent building blocks for a wide range of AI applications. Autonomous driving, a vibrant front in AI applications, remains chal…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Github Repo: https://github.com/zhanghm1995/Forge_VFM4AD

  26. arXiv:2401.03685

    cs.LG

    Logits Poisoning Attack in Federated Distillation

    Authors: Yuhan Tang, Zhiyuan Wu, Bo Gao, Tian Wen, Yuwei Wang, Sheng Sun

    Abstract: Federated Distillation (FD) is a novel and promising distributed machine learning paradigm, where knowledge distillation is leveraged to facilitate a more efficient and flexible cross-device knowledge transfer in federated learning. By optimizing local models with knowledge distillation, FD circumvents the necessity of uploading large-scale model parameters to the central server, simultaneously pr…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 13 pages, 3 figures, 5 tables

  27. arXiv:2401.01010

    cs.CV cs.LG

    Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt

    Authors: Jiaqi Liu, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong Liu, Jinbao Wang, Chengjie Wang, Feng Zheng

    Abstract: Unsupervised Anomaly Detection (UAD) with incremental training is crucial in industrial manufacturing, as unpredictable defects make obtaining sufficient labeled data infeasible. However, continual learning methods primarily rely on supervised annotations, while the application in UAD is limited due to the absence of supervision. Current UAD methods train separate models for different classes sequ…

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  28. arXiv:2401.00622

    cs.LG

    Federated Class-Incremental Learning with New-Class Augmented Self-Distillation

    Authors: Zhiyuan Wu, Tianliu He, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Xuefeng Jiang

    Abstract: Federated Learning (FL) enables collaborative model training among participants while guaranteeing the privacy of raw data. Mainstream FL methodologies overlook the dynamic nature of real-world data, particularly its tendency to grow in volume and diversify in classes over time. This oversight results in FL methods suffering from catastrophic forgetting, where the trained models inadvertently disc…

    Submitted 17 April, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: 9 pages, 2 figures, 4 tables

  29. arXiv:2312.11489

    cs.DC cs.LG

    Agglomerative Federated Learning: Empowering Larger Model Training via End-Edge-Cloud Collaboration

    Authors: Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Quyang Pan, Tianliu He, Xuefeng Jiang

    Abstract: Federated Learning (FL) enables training Artificial Intelligence (AI) models over end devices without compromising their privacy. As computing tasks are increasingly performed by a combination of cloud, edge, and end devices, FL can benefit from this End-Edge-Cloud Collaboration (EECC) paradigm to achieve collaborative device-scale expansion with real-time access. Although Hierarchical Federated L…

    Submitted 29 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted by IEEE International Conference on Computer Communications (INFOCOM), 2024

  30. arXiv:2312.10983

    cs.CV

    MatchDet: A Collaborative Framework for Image Matching and Object Detection

    Authors: Jinxiang Lai, Wenlong Wu, Bin-Bin Gao, Jun Liu, Jiawei Zhan, Congchong Nie, Yi Zeng, Chengjie Wang

    Abstract: Image matching and object detection are two fundamental and challenging tasks, while many related applications consider them two individual tasks (i.e. task-individual). In this paper, a collaborative framework called MatchDet (i.e. task-collaborative) is proposed for image matching and object detection to obtain mutual improvements. To achieve the collaborative learning of the two tasks, we propo…

    Submitted 4 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Journal ref: AAAI 2024

  31. arXiv:2311.18173

    eess.IV cs.CE cs.CV

    Quantification of cardiac capillarization in single-immunostained myocardial slices using weakly supervised instance segmentation

    Authors: Zhao Zhang, Xiwen Chen, William Richardson, Bruce Z. Gao, Abolfazl Razi, Tong Ye

    Abstract: Decreased myocardial capillary density has been reported as an important histopathological feature associated with various heart disorders. Quantitative assessment of cardiac capillarization typically involves double immunostaining of cardiomyocytes (CMs) and capillaries in myocardial slices. In contrast, single immunostaining of basement membrane components is a straightforward approach to simult…

    Submitted 29 November, 2023; originally announced November 2023.

  32. arXiv:2311.12035

    q-bio.QM cs.LG

    Delta Score: Improving the Binding Assessment of Structure-Based Drug Design Methods

    Authors: Minsi Ren, Bowen Gao, Bo Qiang, Yanyan Lan

    Abstract: Structure-based drug design (SBDD) stands at the forefront of drug discovery, emphasizing the creation of molecules that target specific binding pockets. Recent advances in this area have witnessed the adoption of deep generative models and geometric deep learning techniques, modeling SBDD as a conditional generation task where the target structure serves as context. Historically, evaluation of th…

    Submitted 1 November, 2023; originally announced November 2023.

  33. arXiv:2311.10245

    cs.CV eess.IV

    Segment Anything in Defect Detection

    Authors: Bozhen Hu, Bin Gao, Cheng Tan, Tongle Wu, Stan Z. Li

    Abstract: Defect detection plays a crucial role in infrared non-destructive testing systems, offering non-contact, safe, and efficient inspection capabilities. However, challenges such as low resolution, high noise, and uneven heating in infrared thermal images hinder comprehensive and accurate defect detection. In this study, we propose DefectSAM, a novel approach for segmenting defects on highly noisy the…

    Submitted 16 November, 2023; originally announced November 2023.

  34. arXiv:2311.08202

    cs.LG

    Federated Skewed Label Learning with Logits Fusion

    Authors: Yuwei Wang, Runhan Li, Hao Tan, Xuefeng Jiang, Sheng Sun, Min Liu, Bo Gao, Zhiyuan Wu

    Abstract: Federated learning (FL) aims to collaboratively train a shared model across multiple clients without transmitting their local data. Data heterogeneity is a critical challenge in realistic FL settings, as it causes significant performance deterioration due to discrepancies in optimization among local models. In this work, we focus on label distribution skew, a common scenario in data heterogeneity,…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 9 pages, 4 figures, 4 tables

  35. arXiv:2310.13316

    cs.CL cs.AI

    Coarse-to-Fine Dual Encoders are Better Frame Identification Learners

    Authors: Kaikai An, Ce Zheng, Bofei Gao, Haozhe Zhao, Baobao Chang

    Abstract: Frame identification aims to find semantic frames associated with target words in a sentence. Recent research measures the similarity or matching score between targets and candidate frames by modeling frame definitions. However, these approaches either lack sufficient representation learning of the definitions or face challenges in efficiently selecting the most suitable frame from over 1000 candidate frames…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of EMNLP2023

  36. arXiv:2310.13266

    cs.IT

    Measurement-Based Small-Scale Channel Model for Sub-6 GHz RIS-Assisted Communications

    Authors: Jian Sang, Jifeng Lan, Mingyong Zhou, Boning Gao, Wankai Tang, Xiao Li, Michail Matthaiou, Shi Jin, Marco Di Renzo

    Abstract: Reconfigurable intelligent surfaces (RISs) have attracted increasing interest from both academia and industry, thanks to their unique features on controlling electromagnetic (EM) waves. Although theoretical models for RIS-empowered communications have covered a variety of applications, very few papers have investigated the modeling of real propagation characteristics. In this paper, we fill t…

    Submitted 4 March, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  37. arXiv:2310.08860

    cs.CL

    Guiding AMR Parsing with Reverse Graph Linearization

    Authors: Bofei Gao, Liang Chen, Peiyi Wang, Zhifang Sui, Baobao Chang

    Abstract: Abstract Meaning Representation (AMR) parsing aims to extract an abstract semantic graph from a given sentence. The sequence-to-sequence approaches, which linearize the semantic graph into a sequence of nodes and edges and generate the linearized graph directly, have achieved good performance. However, we observed that these approaches suffer from structure loss accumulation during the decoding pr…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP2023

  38. arXiv:2310.07229

    cs.LG

    ProFSA: Self-supervised Pocket Pretraining via Protein Fragment-Surroundings Alignment

    Authors: Bowen Gao, Yinjun Jia, Yuanle Mo, Yuyan Ni, Weiying Ma, Zhiming Ma, Yanyan Lan

    Abstract: Pocket representations play a vital role in various biomedical applications, such as druggability estimation, ligand affinity prediction, and de novo drug design. While existing geometric features and pretrained representations have demonstrated promising results, they usually treat pockets independent of ligands, neglecting the fundamental interactions between them. However, the limited pocket-li…

    Submitted 7 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  39. arXiv:2310.06437

    cs.CV cs.LG

    Skeleton Ground Truth Extraction: Methodology, Annotation Tool and Benchmarks

    Authors: Cong Yang, Bipin Indurkhya, John See, Bo Gao, Yan Ke, Zeyd Boukhers, Zhenyu Yang, Marcin Grzegorzek

    Abstract: Skeleton Ground Truth (GT) is critical to the success of supervised skeleton extraction methods, especially with the popularity of deep learning techniques. Furthermore, we see skeleton GTs used not only for training skeleton detectors with Convolutional Neural Networks (CNN) but also for evaluating skeleton-related pruning and matching algorithms. However, most existing shape and image datasets s…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted for publication in the International Journal of Computer Vision (IJCV)

  40. arXiv:2310.06367

    cs.LG

    DrugCLIP: Contrastive Protein-Molecule Representation Learning for Virtual Screening

    Authors: Bowen Gao, Bo Qiang, Haichuan Tan, Minsi Ren, Yinjun Jia, Minsi Lu, Jingjing Liu, Weiying Ma, Yanyan Lan

    Abstract: Virtual screening, which identifies potential drugs from vast compound databases to bind with a particular protein pocket, is a critical step in AI-assisted drug discovery. Traditional docking methods are highly time-consuming, and can only work with a restricted search library in real-life applications. Recent supervised learning approaches using scoring functions for binding-affinity prediction,…

    Submitted 10 October, 2023; originally announced October 2023.

  41. arXiv:2308.12017

    cs.CV

    DISCO: Distribution-Aware Calibration for Object Detection with Noisy Bounding Boxes

    Authors: Donghao Zhou, Jialin Li, Jinpeng Li, Jiancheng Huang, Qiang Nie, Yong Liu, Bin-Bin Gao, Qiong Wang, Pheng-Ann Heng, Guangyong Chen

    Abstract: Large-scale well-annotated datasets are of great importance for training an effective object detector. However, obtaining accurate bounding box annotations is laborious and demanding. Unfortunately, the resultant noisy bounding boxes could cause corrupt supervision signals and thus diminish detection performance. Motivated by the observation that the real ground-truth is usually situated in the ag…

    Submitted 27 January, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 12 pages, 9 figures

  42. arXiv:2308.07816  [pdf, other]

    cs.DC

    FedCache: A Knowledge Cache-driven Federated Learning Architecture for Personalized Edge Intelligence

    Authors: Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Ke Xu, Wen Wang, Xuefeng Jiang, Bo Gao, Jinda Lu

    Abstract: Edge Intelligence (EI) allows Artificial Intelligence (AI) applications to run at the edge, where data analysis and decision-making can be performed in real-time and close to data sources. To protect data privacy and unify data silos among end devices in EI, Federated Learning (FL) is proposed for collaborative training of shared AI models across devices without compromising data privacy. However,…

    Submitted 31 January, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted by IEEE Transactions on Mobile Computing (TMC)

  43. arXiv:2307.15898  [pdf, other]

    cs.SD cs.AI eess.AS

    UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models

    Authors: Sen Fang, Bowen Gao, Yangjian Wu, Teik Toe Teoh

    Abstract: Multimodal large models have been recognized for their advantages in various performance and downstream tasks. The development of these models is crucial towards achieving general artificial intelligence in the future. In this paper, we propose a novel universal language representation learning method called UniBriVL, which is based on Bridging-Vision-and-Language (BriVL). Universal BriVL embeds a…

    Submitted 9 September, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

    Comments: Voice-Text fusion input; The first work of audio driven diffusion model. arXiv admin note: text overlap with arXiv:2303.04585

  44. arXiv:2306.02335  [pdf, ps, other]

    cs.LG cs.CV

    Towards Robust Feature Learning with t-vFM Similarity for Continual Learning

    Authors: Bilan Gao, YoungBin Kim

    Abstract: Continual learning has been developed using standard supervised contrastive loss from the perspective of feature learning. Due to the data imbalance during the training, there are still challenges in learning better representations. In this work, we suggest using a different similarity metric instead of cosine similarity in supervised contrastive loss in order to learn more robust representations.…

    Submitted 4 June, 2023; originally announced June 2023.

  45. arXiv:2305.13266  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D

    Authors: Bo Qiang, Yuxuan Song, Minkai Xu, Jingjing Gong, Bowen Gao, Hao Zhou, Weiying Ma, Yanyan Lan

    Abstract: Generating desirable molecular structures in 3D is a fundamental problem for drug discovery. Despite the considerable progress we have achieved, existing methods usually generate molecules in atom resolution and ignore intrinsic local structures such as rings, which leads to poor quality in generated structures, especially when generating large molecules. Fragment-based molecule generation is a pr…

    Submitted 26 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: ICML 2023 poster

  46. arXiv:2305.07835  [pdf, other]

    cs.IT

    Multi-Scenario Broadband Channel Measurement and Modeling for Sub-6 GHz RIS-Assisted Wireless Communication Systems

    Authors: Jian Sang, Mingyong Zhou, Jifeng Lan, Boning Gao, Wankai Tang, Xiao Li, Shi Jin, Ertugrul Basar, Cen Li, Qiang Cheng, Tie Jun Cui

    Abstract: Reconfigurable intelligent surface (RIS)-empowered communication has been considered widely as one of the revolutionary technologies for next generation networks. However, due to the novel propagation characteristics of RISs, underlying RIS channel modeling and measurement research is still in its infancy and not fully investigated. In this paper, we conduct multi-scenario broadband channel measu…

    Submitted 13 May, 2023; originally announced May 2023.

  47. arXiv:2304.10093  [pdf, other]

    cs.CV

    Clustered-patch Element Connection for Few-shot Learning

    Authors: Jinxiang Lai, Siqian Yang, Junhong Zhou, Wenlong Wu, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Chengjie Wang

    Abstract: Weak feature representation problem has influenced the performance of few-shot classification task for a long time. To alleviate this problem, recent researchers build connections between support and query instances through embedding patch features to generate discriminative representations. However, we observe that there exist semantic mismatches (foreground/background) among these local patche…

    Submitted 10 May, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Journal ref: IJCAI 2023

  48. arXiv:2303.16510  [pdf, other]

    stat.ML cs.LG math.OC

    Infeasible Deterministic, Stochastic, and Variance-Reduction Algorithms for Optimization under Orthogonality Constraints

    Authors: Pierre Ablin, Simon Vary, Bin Gao, P. -A. Absil

    Abstract: Orthogonality constraints naturally appear in many machine learning problems, from Principal Components Analysis to robust neural network training. They are usually solved using Riemannian optimization algorithms, which minimize the objective function while enforcing the constraint. However, enforcing the orthogonality constraint can be the most time-consuming operation in such algorithms. Recentl…

    Submitted 29 March, 2023; originally announced March 2023.

  49. arXiv:2303.13046  [pdf, other]

    cs.IT eess.SP

    Quantized Phase Alignment by Discrete Phase Shifts for Reconfigurable Intelligent Surface-Assisted Communication Systems

    Authors: Jian Sang, Jifeng Lan, Mingyong Zhou, Boning Gao, Wankai Tang, Xiao Li, ** Yi, Shi Jin

    Abstract: Reconfigurable intelligent surface (RIS) has aroused a surge of interest in recent years. In this paper, we investigate the joint phase alignment and phase quantization on discrete phase shift designs for RIS-assisted single-input single-output (SISO) system. Firstly, the phenomena of phase distribution in far field and near field are respectively unveiled, paving the way for discretization of pha…

    Submitted 23 March, 2023; originally announced March 2023.

  50. arXiv:2303.09281  [pdf, other]

    cs.CV

    SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning

    Authors: Jinxiang Lai, Siqian Yang, Wenlong Wu, Tao Wu, Guannan Jiang, Xi Wang, Jun Liu, Bin-Bin Gao, Wei Zhang, Yuan Xie, Chengjie Wang

    Abstract: Recent Few-Shot Learning (FSL) methods put emphasis on generating discriminative embedding features to precisely measure the similarity between support and query sets. Current CNN-based cross-attention approaches generate discriminative representations via enhancing the mutually semantic similar regions of support and query pairs. However, it suffers from two problems: CNN structure produces ina…

    Submitted 15 March, 2023; originally announced March 2023.

    Journal ref: AAAI 2023