
Showing 1–50 of 140 results for author: Liang, K

  1. arXiv:2406.18521  [pdf, other]

    cs.CL cs.CV

    CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

    Authors: Zirui Wang, Mengzhou Xia, Luxi He, Howard Chen, Yitao Liu, Richard Zhu, Kaiqu Liang, Xindi Wu, Haotian Liu, Sadhika Malladi, Alexis Chevalier, Sanjeev Arora, Danqi Chen

    Abstract: Chart understanding plays a pivotal role when applying Multimodal Large Language Models (MLLMs) to real-world tasks such as analyzing scientific papers or financial reports. However, existing datasets often focus on oversimplified and homogeneous charts with template-based questions, leading to an over-optimistic measure of progress. We demonstrate that although open-source models can appear to ou… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 121 pages, 90 figures

  2. arXiv:2406.18345  [pdf, other]

    cs.LG eess.SP

    EmT: A Novel Transformer for Generalized Cross-subject EEG Emotion Recognition

    Authors: Yi Ding, Chengxuan Tong, Shuailei Zhang, Muyun Jiang, Yong Li, Kevin Lim Jun Liang, Cuntai Guan

    Abstract: Integrating prior knowledge of neurophysiology into neural network architecture enhances the performance of emotion decoding. While numerous techniques emphasize learning spatial and short-term temporal patterns, there has been limited emphasis on capturing the vital long-term contextual information associated with emotional cognitive processes. In order to address this discrepancy, we introduce a… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2406.03098  [pdf, ps, other]

    cs.IT eess.SP

    A Data and Model-Driven Deep Learning Approach to Robust Downlink Beamforming Optimization

    Authors: Kai Liang, Gan Zheng, Zan Li, Kai-Kit Wong, Chan-Byoung Chae

    Abstract: This paper investigates the optimization of the long-standing probabilistically robust transmit beamforming problem with channel uncertainties in the multiuser multiple-input single-output (MISO) downlink transmission. This problem poses significant analytical and computational challenges. Currently, the state-of-the-art optimization method relies on convex restrictions as tractable approximations… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for publication in the IEEE Journal on Selected Areas in Communications, Special Issue on Advanced Optimization Theory and Algorithms for Next Generation Wireless Communication Networks

  4. arXiv:2405.07518  [pdf, other]

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain , et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  5. arXiv:2404.09170  [pdf, other]

    cs.CL

    Post-Semantic-Thinking: A Robust Strategy to Distill Reasoning Capacity from Large Language Models

    Authors: Xiaoshu Chen, Sihang Zhou, Ke Liang, Xinwang Liu

    Abstract: Chain of thought finetuning aims to endow small student models with reasoning capacity to improve their performance towards a specific task by allowing them to imitate the reasoning procedure of large language models (LLMs) beyond simply predicting the answer to the question. However, the existing methods 1) generate rationale before the answer, making their answer correctness sensitive to the hal… ▽ More

    Submitted 16 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  6. arXiv:2404.00438  [pdf, other]

    cs.DC cs.AI cs.LG math.OC stat.ML

    Communication Efficient Distributed Training with Distributed Lion

    Authors: Bo Liu, Lemeng Wu, Lizhang Chen, Kaizhao Liang, Jiaxu Zhu, Chen Liang, Raghuraman Krishnamoorthi, Qiang Liu

    Abstract: The Lion optimizer has been a promising competitor with the AdamW for training large AI models, with advantages on memory, computation, and sample efficiency. In this paper, we introduce Distributed Lion, an innovative adaptation of Lion for distributed training environments. Leveraging the sign operator in Lion, our Distributed Lion only requires communicating binary or lower-precision vectors be… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 22 pages

  7. arXiv:2403.19425  [pdf, ps, other]

    eess.IV cs.CV

    A Robust Ensemble Algorithm for Ischemic Stroke Lesion Segmentation: Generalizability and Clinical Utility Beyond the ISLES Challenge

    Authors: Ezequiel de la Rosa, Mauricio Reyes, Sook-Lei Liew, Alexandre Hutton, Roland Wiest, Johannes Kaesmacher, Uta Hanning, Arsany Hakim, Richard Zubal, Waldo Valenzuela, David Robben, Diana M. Sima, Vincenzo Anania, Arne Brys, James A. Meakin, Anne Mickan, Gabriel Broocks, Christian Heitkamp, Shengbo Gao, Kongming Liang, Ziji Zhang, Md Mahfuzur Rahman Siddiquee, Andriy Myronenko, Pooya Ashtari, Sabine Van Huffel , et al. (33 additional authors not shown)

    Abstract: Diffusion-weighted MRI (DWI) is essential for stroke diagnosis, treatment decisions, and prognosis. However, image and disease variability hinder the development of generalizable AI algorithms with clinical value. We address this gap by presenting a novel ensemble algorithm derived from the 2022 Ischemic Stroke Lesion Segmentation (ISLES) challenge. ISLES'22 provided 400 patient scans with ischemi… ▽ More

    Submitted 3 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  8. The Power of Bamboo: On the Post-Compromise Security for Searchable Symmetric Encryption

    Authors: Tianyang Chen, Peng Xu, Stjepan Picek, Bo Luo, Willy Susilo, Hai Jin, Kaitai Liang

    Abstract: Dynamic searchable symmetric encryption (DSSE) enables users to delegate the keyword search over dynamically updated encrypted databases to an honest-but-curious server without losing keyword privacy. This paper studies a new and practical security risk to DSSE, namely, secret key compromise (e.g., a user's secret key is leaked or stolen), which threatens all the security guarantees offered by exi… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: This is a full version paper that includes the security proof. The paper with the same name has been published by NDSS 2023

    Journal ref: NDSS 2023

  9. arXiv:2403.01231  [pdf, other]

    cs.CV

    Benchmarking Segmentation Models with Mask-Preserved Attribute Editing

    Authors: Zijin Yin, Kongming Liang, Bing Li, Zhanyu Ma, Jun Guo

    Abstract: When deploying segmentation models in practice, it is critical to evaluate their behaviors in varied and complex scenes. Different from the previous evaluation paradigms only in consideration of global attribute variations (e.g. adverse weather), we investigate both local and global attribute variations for robustness evaluation. To achieve this, we construct a mask-preserved attribute editing pip… ▽ More

    Submitted 10 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  10. arXiv:2403.01182  [pdf, other]

    cs.CR

    d-DSE: Distinct Dynamic Searchable Encryption Resisting Volume Leakage in Encrypted Databases

    Authors: Dongli Liu, Wei Wang, Peng Xu, Laurence T. Yang, Bo Luo, Kaitai Liang

    Abstract: Dynamic Searchable Encryption (DSE) has emerged as a solution to efficiently handle and protect large-scale data storage in encrypted databases (EDBs). Volume leakage poses a significant threat, as it enables adversaries to reconstruct search queries and potentially compromise the security and privacy of data. Padding strategies are common countermeasures for the leakage, but they significantly in… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 23 pages, 13 figures, will be published in USENIX Security'24

  11. arXiv:2403.01155  [pdf, other]

    cs.CR

    Query Recovery from Easy to Hard: Jigsaw Attack against SSE

    Authors: Hao Nie, Wei Wang, Peng Xu, Xianglong Zhang, Laurence T. Yang, Kaitai Liang

    Abstract: Searchable symmetric encryption schemes often unintentionally disclose certain sensitive information, such as access, volume, and search patterns. Attackers can exploit such leakages and other available knowledge related to the user's database to recover queries. We find that the effectiveness of query recovery attacks depends on the volume/frequency distribution of keywords. Queries containing ke… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 21 pages, accepted in USENIX Security 2024

  12. arXiv:2402.15653  [pdf, other]

    cs.CV

    Low-Frequency Black-Box Backdoor Attack via Evolutionary Algorithm

    Authors: Yanqi Qiao, Dazhuang Liu, Rui Wang, Kaitai Liang

    Abstract: While convolutional neural networks (CNNs) have achieved success in computer vision tasks, it is vulnerable to backdoor attacks. Such attacks could mislead the victim model to make attacker-chosen prediction with a specific trigger pattern. Until now, the trigger injection of existing attacks is mainly limited to spatial domain. Recent works take advantage of perceptual properties of planting spec… ▽ More

    Submitted 6 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  13. arXiv:2402.09246  [pdf, other]

    cs.RO cs.AI eess.SY math.OC

    Who Plays First? Optimizing the Order of Play in Stackelberg Games with Many Robots

    Authors: Haimin Hu, Gabriele Dragotto, Zixu Zhang, Kaiqu Liang, Bartolomeo Stellato, Jaime F. Fisac

    Abstract: We consider the multi-agent spatial navigation problem of computing the socially optimal order of play, i.e., the sequence in which the agents commit to their decisions, and its associated equilibrium in an N-player Stackelberg trajectory game. We model this problem as a mixed-integer optimization problem over the space of all possible Stackelberg games associated with the order of play's permutat… ▽ More

    Submitted 24 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Robotics: Science and Systems (RSS) 2024

  14. arXiv:2402.06529  [pdf, other]

    cs.AI cs.CL cs.LG

    Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity

    Authors: Kaiqu Liang, Zixu Zhang, Jaime Fernández Fisac

    Abstract: Large language models (LLMs) exhibit advanced reasoning skills, enabling robots to comprehend natural language instructions and strategically plan high-level actions through proper grounding. However, LLM hallucination may result in robots confidently executing plans that are misaligned with user goals or, in extreme cases, unsafe. Additionally, inherent ambiguity in natural language instructions… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 27 pages, 11 figures. Code is available at https://github.com/kevinliang888/IntroPlan

  15. arXiv:2402.04972  [pdf, other]

    cs.FL cs.GT

    Distributed Fair Assignment and Rebalancing for Mobility-on-Demand Systems via an Auction-based Method

    Authors: Kaier Liang, Cristian-Ioan Vasile

    Abstract: In this paper, we consider fair assignment of complex requests for Mobility-On-Demand systems. We model the transportation requests as temporal logic formulas that must be satisfied by a fleet of vehicles. We require that the assignment of requests to vehicles is performed in a distributed manner based only on communication between vehicles while ensuring fair allocation. Our approach to the vehic… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  16. arXiv:2401.08937  [pdf, other]

    cs.CV

    ICON: Incremental CONfidence for Joint Pose and Radiance Field Optimization

    Authors: Weiyao Wang, Pierre Gleize, Hao Tang, Xingyu Chen, Kevin J Liang, Matt Feiszli

    Abstract: Neural Radiance Fields (NeRF) exhibit remarkable performance for Novel View Synthesis (NVS) given a set of 2D images. However, NeRF training requires accurate camera pose for each input view, typically obtained by Structure-from-Motion (SfM) pipelines. Recent works have attempted to relax this constraint, but they still often rely on decent initial poses which they can refine. Here we aim at remov… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  17. arXiv:2312.15086  [pdf, other]

    cs.LG cs.CV

    HyperMix: Out-of-Distribution Detection and Classification in Few-Shot Settings

    Authors: Nikhil Mehta, Kevin J Liang, Jing Huang, Fu-Jen Chu, Li Yin, Tal Hassner

    Abstract: Out-of-distribution (OOD) detection is an important topic for real-world machine learning systems, but settings with limited in-distribution samples have been underexplored. Such few-shot OOD settings are challenging, as models have scarce opportunities to learn the data distribution before being tasked with identifying OOD samples. Indeed, we demonstrate that recent state-of-the-art OOD methods f… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  18. arXiv:2312.07009  [pdf, other]

    cs.CV

    Vision-language Assisted Attribute Learning

    Authors: Kongming Liang, Xinran Wang, Rui Wang, Donghui Gao, Ling Jin, Weidong Liu, Xiatian Zhu, Zhanyu Ma, Jun Guo

    Abstract: Attribute labeling at large scale is typically incomplete and partial, posing significant challenges to model optimization. Existing attribute learning methods often treat the missing labels as negative or simply ignore them all during training, either of which could hamper the model performance to a great extent. To overcome these limitations, in this paper we leverage the available vision-langua… ▽ More

    Submitted 14 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted by IEEE IC-NIDC 2023

  19. arXiv:2312.02496  [pdf]

    cs.CL cs.AI

    MKA: A Scalable Medical Knowledge Assisted Mechanism for Generative Models on Medical Conversation Tasks

    Authors: Ke Liang, Sifan Wu, Jiayi Gu

    Abstract: Using natural language processing (NLP) technologies to develop medical chatbots makes the diagnosis of the patient more convenient and efficient, which is a typical application in healthcare AI. Because of its importance, lots of research have been come out. Recently, the neural generative models have shown their impressive ability as the core of chatbot, while it cannot scale well when directly… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  20. arXiv:2312.01167  [pdf, other]

    cs.CV cs.LG stat.ML

    Meta-Learned Attribute Self-Interaction Network for Continual and Generalized Zero-Shot Learning

    Authors: Vinay K Verma, Nikhil Mehta, Kevin J Liang, Aakansha Mishra, Lawrence Carin

    Abstract: Zero-shot learning (ZSL) is a promising approach to generalizing a model to categories unseen during training by leveraging class attributes, but challenges remain. Recently, methods using generative models to combat bias towards classes seen during training have pushed state of the art, but these generative models can be slow or computationally expensive to train. Also, these generative models as… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024. arXiv admin note: substantial text overlap with arXiv:2102.11856

  21. arXiv:2311.18259  [pdf, other]

    cs.CV cs.AI

    Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

    Authors: Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain , et al. (76 additional authors not shown)

    Abstract: We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). 740 participants from 13 cities worldwide performed these activities in 123 different natural scene contexts, yielding long-form captures from… ▽ More

    Submitted 29 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: updated baseline results and dataset statistics to match the released v2 data; added table to appendix comparing stats of Ego-Exo4D alongside other datasets

  22. arXiv:2311.17454  [pdf, other]

    cs.CR

    Eden: An Ultra Fast, Provably Secure, and Fully Decentralized Blockchain Interoperability Protocol

    Authors: Ke Liang

    Abstract: As the blockchain ecosystem continues to evolve and expand, the need for seamless interoperability between disparate blockchain networks has become increasingly paramount. Interoperability not only enhances the functionality and reach of individual blockchains but also fosters a collaborative environment that can unlock new possibilities for decentralized applications. In this paper, we present Ed… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  23. arXiv:2311.17104  [pdf, other]

    cs.LG cs.AI q-bio.MN

    Single-Cell Deep Clustering Method Assisted by Exogenous Gene Information: A Novel Approach to Identifying Cell Types

    Authors: Dayu Hu, Ke Liang, Hao Yu, Xinwang Liu

    Abstract: In recent years, the field of single-cell data analysis has seen a marked advancement in the development of clustering methods. Despite advancements, most of these algorithms still concentrate on analyzing the provided single-cell matrix data. However, in medical applications, single-cell data often involves a wealth of exogenous information, including gene networks. Overlooking this aspect could… ▽ More

    Submitted 15 December, 2023; v1 submitted 28 November, 2023; originally announced November 2023.

  24. arXiv:2311.17103  [pdf, other]

    q-bio.GN cs.AI cs.LG

    Single-cell Multi-view Clustering via Community Detection with Unknown Number of Clusters

    Authors: Dayu Hu, Zhibin Dong, Ke Liang, Jun Wang, Siwei Wang, Xinwang Liu

    Abstract: Single-cell multi-view clustering enables the exploration of cellular heterogeneity within the same cell from different views. Despite the development of several multi-view clustering methods, two primary challenges persist. Firstly, most existing methods treat the information from both single-cell RNA (scRNA) and single-cell Assay of Transposase Accessible Chromatin (scATAC) views as equally sign… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  25. arXiv:2311.12320  [pdf, other]

    cs.AI

    A Survey on Multimodal Large Language Models for Autonomous Driving

    Authors: Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Yang Zhou, Kaizhao Liang, Jintai Chen, Juanwu Lu, Zichong Yang, Kuei-Da Liao, Tianren Gao, Erlong Li, Kun Tang, Zhipeng Cao, Tong Zhou, Ao Liu, Xinrui Yan, Shuqi Mei, Jianguo Cao, Ziran Wang, Chao Zheng

    Abstract: With the emergence of Large Language Models (LLMs) and Vision Foundation Models (VFMs), multimodal AI systems benefiting from large models have the potential to equally perceive the real world, make decisions, and control tools as humans. In recent months, LLMs have shown widespread attention in autonomous driving and map systems. Despite its immense potential, there is still a lack of a comprehen… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  26. arXiv:2310.12846  [pdf, other]

    math.NA cs.AI

    Physical Information Neural Networks for Solving High-index Differential-algebraic Equation Systems Based on Radau Methods

    Authors: Jiasheng Chen, Juan Tang, Ming Yan, Shuai Lai, Kun Liang, Jianguang Lu, Wenqiang Yang

    Abstract: As is well known, differential algebraic equations (DAEs), which are able to describe dynamic changes and underlying constraints, have been widely applied in engineering fields such as fluid dynamics, multi-body dynamics, mechanical systems and control theory. In practical physical modeling within these domains, the systems often generate high-index DAEs. Classical implicit numerical methods typic… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  27. arXiv:2310.05898  [pdf, other]

    cs.LG cs.AI math.OC stat.AP stat.ML

    Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts

    Authors: Lizhang Chen, Bo Liu, Kaizhao Liang, Qiang Liu

    Abstract: Lion (Evolved Sign Momentum), a new optimizer discovered through program search, has shown promising results in training large AI models. It performs comparably or favorably to AdamW but with greater memory efficiency. As we can expect from the results of a random search program, Lion incorporates elements from several existing algorithms, including signed momentum, decoupled weight decay, Polak,… ▽ More

    Submitted 19 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  28. arXiv:2309.15135  [pdf, other]

    cs.LG cs.AI cs.CV

    Contrastive Continual Multi-view Clustering with Filtered Structural Fusion

    Authors: Xinhang Wan, Jiyuan Liu, Hao Yu, Ao Li, Xinwang Liu, Ke Liang, Zhibin Dong, En Zhu

    Abstract: Multi-view clustering thrives in applications where views are collected in advance by extracting consistent and complementary information among views. However, it overlooks scenarios where data views are collected sequentially, i.e., real-time data. Due to privacy issues or memory burden, previous views are not available with time in these situations. Some methods are proposed to handle it but are… ▽ More

    Submitted 4 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  29. arXiv:2309.11845  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS

    TMac: Temporal Multi-Modal Graph Learning for Acoustic Event Classification

    Authors: Meng Liu, Ke Liang, Dayu Hu, Hao Yu, Yue Liu, Lingyuan Meng, Wenxuan Tu, Sihang Zhou, Xinwang Liu

    Abstract: Audiovisual data is everywhere in this digital age, which raises higher requirements for the deep learning models developed on them. To well handle the information of the multi-modal data is the key to a better audiovisual modal. We observe that these audiovisual data naturally have temporal attributes, such as the time information for each frame in the video. More concretely, such data is inheren… ▽ More

    Submitted 26 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This work has been accepted by ACM MM 2023 for publication

  30. arXiv:2309.11745  [pdf, other]

    eess.IV cs.CV cs.LG

    PIE: Simulating Disease Progression via Progressive Image Editing

    Authors: Kaizhao Liang, Xu Cao, Kuei-Da Liao, Tianren Gao, Wenqian Ye, Zhengyu Chen, Jianguo Cao, Tejas Nama, Jimeng Sun

    Abstract: Disease progression simulation is a crucial area of research that has significant implications for clinical diagnosis, prognosis, and treatment. One major challenge in this field is the lack of continuous medical imaging monitoring of individual patients over time. To address this issue, we develop a novel framework termed Progressive Image Editing (PIE) that enables controlled manipulation of dis… ▽ More

    Submitted 5 October, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: Code and checkpoints for replicating our results can be found at https://github.com/IrohXu/PIE and https://huggingface.co/IrohXu/stable-diffusion-mimic-cxr-v0.1

  31. arXiv:2309.00127  [pdf, other]

    cs.LG cs.CR

    FTA: Stealthy and Adaptive Backdoor Attack with Flexible Triggers on Federated Learning

    Authors: Yanqi Qiao, Dazhuang Liu, Congwen Chen, Rui Wang, Kaitai Liang

    Abstract: Current backdoor attacks against federated learning (FL) strongly rely on universal triggers or semantic patterns, which can be easily detected and filtered by certain defense mechanisms such as norm clipping, comparing parameter divergences among local updates. In this work, we propose a new stealthy and robust backdoor attack with flexible triggers against FL defenses. To achieve this, we build… ▽ More

    Submitted 28 September, 2023; v1 submitted 31 August, 2023; originally announced September 2023.

  32. Efficient Multi-View Graph Clustering with Local and Global Structure Preservation

    Authors: Yi Wen, Suyuan Liu, Xinhang Wan, Siwei Wang, Ke Liang, Xinwang Liu, Xihong Yang, Pei Zhang

    Abstract: Anchor-based multi-view graph clustering (AMVGC) has received abundant attention owing to its high efficiency and the capability to capture complementary structural information across multiple views. Intuitively, a high-quality anchor graph plays an essential role in the success of AMVGC. However, the existing AMVGC methods only consider single-structure information, i.e., local or global structur… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2308.16541

  33. Scalable Incomplete Multi-View Clustering with Structure Alignment

    Authors: Yi Wen, Siwei Wang, Ke Liang, Weixuan Liang, Xinhang Wan, Xinwang Liu, Suyuan Liu, Jiyuan Liu, En Zhu

    Abstract: The success of existing multi-view clustering (MVC) relies on the assumption that all views are complete. However, samples are usually partially available due to data corruption or sensor malfunction, which raises the research of incomplete multi-view clustering (IMVC). Although several anchor-based IMVC methods have been proposed to process the large-scale incomplete data, they still suffer from… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  34. arXiv:2308.10402  [pdf, other]

    cs.CV cs.AI cs.CL cs.HC

    Simple Baselines for Interactive Video Retrieval with Questions and Answers

    Authors: Kaiqu Liang, Samuel Albanie

    Abstract: To date, the majority of video retrieval systems have been optimized for a "single-shot" scenario in which the user submits a query in isolation, ignoring previous interactions with the system. Recently, there has been renewed interest in interactive systems to enhance retrieval, but existing approaches are complex and deliver limited gains in performance. In this work, we revisit this topic and p… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: ICCV 2023, project page: https://github.com/kevinliang888/IVR-QA-baselines

  35. arXiv:2308.09000  [pdf, other]

    cs.CV cs.LG

    DealMVC: Dual Contrastive Calibration for Multi-view Clustering

    Authors: Xihong Yang, Jiaqi Jin, Siwei Wang, Ke Liang, Yue Liu, Yi Wen, Suyuan Liu, Sihang Zhou, Xinwang Liu, En Zhu

    Abstract: Benefiting from the strong view-consistent information mining capacity, multi-view contrastive clustering has attracted plenty of attention in recent years. However, we observe the following drawback, which limits the clustering performance from further improvement. The existing multi-view models mainly focus on the consistency of the same samples in different views while ignoring the circumstance… ▽ More

    Submitted 6 November, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

  36. arXiv:2308.08963  [pdf, other]

    cs.LG

    CONVERT:Contrastive Graph Clustering with Reliable Augmentation

    Authors: Xihong Yang, Cheng Tan, Yue Liu, Ke Liang, Siwei Wang, Sihang Zhou, Jun Xia, Stan Z. Li, Xinwang Liu, En Zhu

    Abstract: Contrastive graph node clustering via learnable data augmentation is a hot research spot in the field of unsupervised graph learning. The existing methods learn the sampling distribution of a pre-defined augmentation to generate data-driven augmentations automatically. Although promising clustering performance has been achieved, we observe that these strategies still rely on pre-defined augmentati… ▽ More

    Submitted 20 October, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

  37. arXiv:2308.06827  [pdf, other]

    cs.LG cs.AI

    Reinforcement Graph Clustering with Unknown Cluster Number

    Authors: Yue Liu, Ke Liang, Jun Xia, Xihong Yang, Sihang Zhou, Meng Liu, Xinwang Liu, Stan Z. Li

    Abstract: Deep graph clustering, which aims to group nodes into disjoint clusters by neural networks in an unsupervised manner, has attracted great attention in recent years. Although the performance has been largely improved, the excellent performance of the existing methods heavily relies on an accurately predefined cluster number, which is not always available in the real-world scenario. To enable the de… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

  38. arXiv:2307.12499  [pdf, other]

    cs.LG cs.CV

    AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models

    Authors: Xuelong Dai, Kaisheng Liang, Bin Xiao

    Abstract: Unrestricted adversarial attacks present a serious threat to deep learning models and adversarial defense techniques. They pose severe security problems for deep learning applications because they can effectively bypass defense mechanisms. However, previous attack methods often utilize Generative Adversarial Networks (GANs), which are not theoretically provable and thus generate unrealistic exampl… ▽ More

    Submitted 27 February, 2024; v1 submitted 23 July, 2023; originally announced July 2023.

  39. arXiv:2307.09146  [pdf, other]

    cs.CV

    PRO-Face S: Privacy-preserving Reversible Obfuscation of Face Images via Secure Flow

    Authors: Lin Yuan, Kai Liang, Xiao Pu, Yan Zhang, Jiaxu Leng, Tao Wu, Nannan Wang, Xinbo Gao

    Abstract: This paper proposes a novel paradigm for facial privacy protection that unifies multiple characteristics including anonymity, diversity, reversibility and security within a single lightweight framework. We name it PRO-Face S, short for Privacy-preserving Reversible Obfuscation of Face images via Secure flow-based model. In the framework, an Invertible Neural Network (INN) is utilized to process th… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  40. arXiv:2307.03942  [pdf, ps, other]

    eess.IV cs.CV

    Ariadne's Thread:Using Text Prompts to Improve Segmentation of Infected Areas from Chest X-ray images

    Authors: Yi Zhong, Mengqiu Xu, Kongming Liang, Kaixin Chen, Ming Wu

    Abstract: Segmentation of the infected areas of the lung is essential for quantifying the severity of lung disease like pulmonary infections. Existing medical image segmentation methods are almost uni-modal methods based on image. However, these image-only methods tend to produce inaccurate results unless trained with large amounts of annotated data. To overcome this challenge, we propose a language-driven… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Provisional Acceptance by MICCAI 2023

  41. arXiv:2307.03591  [pdf, other]

    cs.AI cs.IR

    Structure Guided Multi-modal Pre-trained Transformer for Knowledge Graph Reasoning

    Authors: Ke Liang, Sihang Zhou, Yue Liu, Lingyuan Meng, Meng Liu, Xinwang Liu

    Abstract: Multimodal knowledge graphs (MKGs), which intuitively organize information in various modalities, can benefit multiple practical downstream tasks, such as recommendation systems, and visual question answering. However, most MKGs are still far from complete, which motivates the flourishing of MKG reasoning models. Recently, with the development of general artificial architectures, the pretrained tr…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessed

  42. arXiv:2307.03476  [pdf, other]

    cs.LG cs.CV

    Unpaired Multi-View Graph Clustering with Cross-View Structure Matching

    Authors: Yi Wen, Siwei Wang, Qing Liao, Weixuan Liang, Ke Liang, Xinhang Wan, Xinwang Liu

    Abstract: Multi-view clustering (MVC), which effectively fuses information from multiple views for better performance, has received increasing attention. Most existing MVC methods assume that multi-view data are fully paired, which means that the mappings of all corresponding samples between views are pre-defined or given in advance. However, the data correspondence is often incomplete in real-world applica…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 15 pages

  43. arXiv:2306.05695  [pdf, other]

    cs.IT eess.SP

    Power Beacon Energy Consumption Minimization in Wireless Powered Backscatter Communication Networks

    Authors: Haohang Yang, Yinghui Ye, Kai Liang, Xiaoli Chu

    Abstract: Internet-of-Things (IoT) networks are expected to support the wireless connection of massive energy limited IoT nodes. The emerging wireless powered backscatter communications (WPBC) enable IoT nodes to harvest energy from the incident radio frequency signals transmitted by a power beacon (PB) to support their circuit operation, but the energy consumption of the PB (a potentially high cost borne b…

    Submitted 9 June, 2023; originally announced June 2023.

  44. arXiv:2306.04962  [pdf, other]

    cs.AI cs.LG

    arXiv4TGC: Large-Scale Datasets for Temporal Graph Clustering

    Authors: Meng Liu, Ke Liang, Yue Liu, Siwei Wang, Sihang Zhou, Xinwang Liu

    Abstract: Temporal graph clustering (TGC) is a crucial task in temporal graph learning. Its focus is on node clustering on temporal graphs, and it offers greater flexibility for large-scale graph structures due to the mechanism of temporal graph methods. However, the development of TGC is currently constrained by a significant problem: the lack of suitable and reliable large-scale temporal graph datasets to…

    Submitted 8 June, 2023; originally announced June 2023.

  45. arXiv:2305.18405  [pdf, other]

    cs.LG cs.AI

    Dink-Net: Neural Clustering on Large Graphs

    Authors: Yue Liu, Ke Liang, Jun Xia, Sihang Zhou, Xihong Yang, Xinwang Liu, Stan Z. Li

    Abstract: Deep graph clustering, which aims to group the nodes of a graph into disjoint clusters with deep neural networks, has achieved promising progress in recent years. However, the existing methods fail to scale to the large graph with million nodes. To solve this problem, a scalable deep graph clustering method (Dink-Net) is proposed with the idea of dilation and shrink. Firstly, by discriminating nod…

    Submitted 14 July, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 19 pages, 5 figures

  46. arXiv:2305.14074  [pdf, other]

    cs.AI cs.IR

    Message Intercommunication for Inductive Relation Reasoning

    Authors: Ke Liang, Lingyuan Meng, Sihang Zhou, Siwei Wang, Wenxuan Tu, Yue Liu, Meng Liu, Xinwang Liu

    Abstract: Inductive relation reasoning for knowledge graphs, aiming to infer missing links between brand-new entities, has drawn increasing attention. The models developed based on Graph Inductive Learning, called GraIL-based models, have shown promising potential for this task. However, the uni-directional message-passing mechanism hinders such models from exploiting hidden mutual relations between entitie…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  47. arXiv:2305.10738  [pdf, other]

    cs.LG cs.AI

    Deep Temporal Graph Clustering

    Authors: Meng Liu, Yue Liu, Ke Liang, Wenxuan Tu, Siwei Wang, Sihang Zhou, Xinwang Liu

    Abstract: Deep graph clustering has recently received significant attention due to its ability to enhance the representation learning capabilities of models in unsupervised scenarios. Nevertheless, deep clustering for temporal graphs, which could capture crucial dynamic interaction information, has not been fully explored. It means that in many clustering-oriented real-world scenarios, temporal graphs can o…

    Submitted 10 April, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

  48. arXiv:2304.11579  [pdf, other]

    cs.CV

    StyLess: Boosting the Transferability of Adversarial Examples

    Authors: Kaisheng Liang, Bin Xiao

    Abstract: Adversarial attacks can mislead deep neural networks (DNNs) by adding imperceptible perturbations to benign examples. The attack transferability enables adversarial examples to attack black-box DNNs with unknown architectures or parameters, which poses threats to many real-world applications. We find that existing transferable attacks do not distinguish between style and content features during op…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  49. arXiv:2304.10297  [pdf, other]

    cs.LG cs.AI cs.IR

    SARF: Aliasing Relation Assisted Self-Supervised Learning for Few-shot Relation Reasoning

    Authors: Lingyuan Meng, Ke Liang, Bin Xiao, Sihang Zhou, Yue Liu, Meng Liu, Xihong Yang, Xinwang Liu

    Abstract: Few-shot relation reasoning on knowledge graphs (FS-KGR) aims to infer long-tail data-poor relations, which has drawn increasing attention these years due to its practicalities. The pre-training of previous methods needs to manually construct the meta-relation set, leading to numerous labor costs. Self-supervised learning (SSL) is treated as a solution to tackle the issue, but still at an early st…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  50. arXiv:2304.07573  [pdf, other]

    cs.IT

    Multi-Server Secure Aggregation with Unreliable Communication Links

    Authors: Kai Liang, Songze Li, Ming Ding, Youlong Wu

    Abstract: In many distributed learning setups such as federated learning (FL), client nodes at the edge use individually collected data to compute local gradients and send them to a central master server. The master server then aggregates the received gradients and broadcasts the aggregation to all clients, with which the clients can update the global model. In this paper, we consider multi-server federated…

    Submitted 15 April, 2023; originally announced April 2023.