Showing 1–21 of 21 results for author: Duan, M

Searching in archive cs.
  1. arXiv:2404.10087  [pdf, ps, other]

    cs.DC

    cuFastTuckerPlus: A Stochastic Parallel Sparse FastTucker Decomposition Using GPU Tensor Cores

    Authors: Zixuan Li, Mingxing Duan, Huizhang Luo, Wangdong Yang, Kenli Li, Keqin Li

    Abstract: Sparse tensors are prevalent in real-world applications, often characterized by their large-scale, high-order, and high-dimensional nature. Directly handling raw tensors is impractical due to the significant memory and computational overhead involved. The current mainstream approach involves compressing or decomposing the original tensor. One popular tensor decomposition algorithm is the Tucker de…

    Submitted 23 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2402.18467  [pdf, other]

    cs.CV

    Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation

    Authors: Zhiwei Yang, Kexue Fu, Minghong Duan, Linhao Qu, Shuo Wang, Zhijian Song

    Abstract: Weakly supervised semantic segmentation (WSSS) with image-level labels aims to achieve segmentation tasks without dense annotations. However, attributed to the frequent coupling of co-occurring objects and the limited supervision from image-level labels, the challenging co-occurrence problem is widely present and leads to false activation of objects in WSSS. In this work, we devise a 'Separate and…

    Submitted 21 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR 2024

  3. arXiv:2402.07841  [pdf, other]

    cs.CL

    Do Membership Inference Attacks Work on Large Language Models?

    Authors: Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi

    Abstract: Membership inference attacks (MIAs) attempt to predict whether a particular datapoint is a member of a target model's training data. Despite extensive research on traditional machine learning models, there has been limited work studying MIA on the pre-training data of large language models (LLMs). We perform a large-scale evaluation of MIAs over a suite of language models (LMs) trained on the Pile…

    Submitted 12 February, 2024; originally announced February 2024.

  4. arXiv:2402.02382  [pdf, other]

    cs.CV cs.LG

    Revisiting the Power of Prompt for Visual Tuning

    Authors: Yuzhu Wang, Lechao Cheng, Chaowei Fang, Dingwen Zhang, Manni Duan, Meng Wang

    Abstract: Visual prompt tuning (VPT) is a promising solution incorporating learnable prompt tokens to customize pre-trained models for downstream tasks. However, VPT and its variants often encounter challenges like prompt initialization, prompt length, and subpar performance in self-supervised pretraining, hindering successful contextual adaptation. This study commences by exploring the correlation evolveme…

    Submitted 27 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML2024, See https://github.com/WangYZ1608/Self-Prompt-Tuning

  5. arXiv:2401.15613  [pdf, other]

    eess.IV cs.CV

    Towards Arbitrary-Scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework via Implicit Self-texture Enhancement

    Authors: Minghong Duan, Linhao Qu, Zhiwei Yang, Manning Wang, Chenxi Zhang, Zhijian Song

    Abstract: High-quality whole-slide scanners are expensive, complex, and time-consuming, thus limiting the acquisition and utilization of high-resolution pathology whole-slide images in daily clinical work. Deep learning-based single-image super-resolution techniques are an effective way to solve this problem by synthesizing high-resolution images from low-resolution ones. However, the existing super-resolut…

    Submitted 26 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  6. arXiv:2308.05320  [pdf, other]

    cs.CV cs.AI

    Generating Transferable and Stealthy Adversarial Patch via Attention-guided Adversarial Inpainting

    Authors: Yanjie Li, Mingxing Duan, Xuelong Dai, Bin Xiao

    Abstract: Adversarial patch attacks can fool the face recognition (FR) models via small patches. However, previous adversarial patch attacks often result in unnatural patterns that are easily noticeable. Generating transferable and stealthy adversarial patches that can efficiently deceive the black-box FR models while having good camouflage is challenging because of the huge stylistic difference between the…

    Submitted 1 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Submitted to ICLR2024

  7. arXiv:2307.02140  [pdf, other]

    cs.SE cs.AI cs.LG

    Towards Open Federated Learning Platforms: Survey and Vision from Technical and Legal Perspectives

    Authors: Moming Duan, Qinbin Li, Linshan Jiang, Bingsheng He

    Abstract: Traditional Federated Learning (FL) follows a server-dominated cooperation paradigm which narrows the application scenarios of FL and decreases the enthusiasm of data holders to participate. To fully unleash the potential of FL, we advocate rethinking the design of current FL frameworks and extending it to a more generalized concept: Open Federated Learning Platforms, positioned as a crowdsourcing…

    Submitted 29 February, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Download Appendix from https://github.com/morningD/Towards-Open-Federated-Learning-Platforms-Survey/blob/main/TKDE-Tex/APPENDIX.pdf

  8. arXiv:2305.17007  [pdf, other]

    cs.CV

    Improving Knowledge Distillation via Regularizing Feature Norm and Direction

    Authors: Yuzhu Wang, Lechao Cheng, Manni Duan, Yongheng Wang, Zunlei Feng, Shu Kong

    Abstract: Knowledge distillation (KD) exploits a large well-trained model (i.e., teacher) to train a small student model on the same dataset for the same task. Treating teacher features as knowledge, prevailing methods of knowledge distillation train student by aligning its features with the teacher's, e.g., by minimizing the KL-divergence between their logits or L2 distance between their intermediate featu…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: 16 pages, 8 figures, 6 tables

  9. arXiv:2305.13551  [pdf, other]

    cs.CL cs.AI

    How Fragile is Relation Extraction under Entity Replacements?

    Authors: Yiwei Wang, Bryan Hooi, Fei Wang, Yujun Cai, Yuxuan Liang, Wenxuan Zhou, Jing Tang, Manjuan Duan, Muhao Chen

    Abstract: Relation extraction (RE) aims to extract the relations between entity names from the textual context. In principle, textual context determines the ground-truth relation and the RE models should be able to correctly identify the relations reflected by the textual context. However, existing work has found that the RE models memorize the entity name patterns to make RE predictions while ignoring the…

    Submitted 7 May, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  10. arXiv:2304.04238  [pdf, other]

    eess.IV cs.CV

    Towards Arbitrary-scale Histopathology Image Super-resolution: An Efficient Dual-branch Framework based on Implicit Self-texture Enhancement

    Authors: Linhao Qu, Minghong Duan, Zhiwei Yang, Manning Wang, Zhijian Song

    Abstract: Existing super-resolution models for pathology images can only work in fixed integer magnifications and have limited performance. Though implicit neural network-based methods have shown promising results in arbitrary-scale super-resolution of natural images, it is not effective to directly apply them in pathology images, because pathology images have special fine-grained image textures different f…

    Submitted 9 April, 2023; originally announced April 2023.

  11. arXiv:2302.11751  [pdf, other]

    cs.LG cs.AI

    Data-Free Diversity-Based Ensemble Selection For One-Shot Federated Learning in Machine Learning Model Market

    Authors: Naibo Wang, Wenjie Feng, Fusheng Liu, Moming Duan, See-Kiong Ng

    Abstract: The emerging availability of trained machine learning models has put forward the novel concept of Machine Learning Model Market in which one can harness the collective intelligence of multiple well-trained models to improve the performance of the resultant model through one-shot federated learning and ensemble learning in a data-free manner. However, picking the models available in the market for…

    Submitted 22 February, 2023; originally announced February 2023.

  12. arXiv:2204.02574  [pdf, other]

    cs.CV

    FocalClick: Towards Practical Interactive Image Segmentation

    Authors: Xi Chen, Zhiyan Zhao, Yilei Zhang, Manni Duan, Donglian Qi, Hengshuang Zhao

    Abstract: Interactive segmentation allows users to extract target masks by making positive/negative clicks. Although explored by many previous works, there is still a gap between academic approaches and industrial needs: first, existing models are not efficient enough to work on low power devices; second, they perform poorly when used to refine preexisting masks as they could not avoid destroying the correc…

    Submitted 17 April, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: CVPR2022

  13. Flexible Clustered Federated Learning for Client-Level Data Distribution Shift

    Authors: Moming Duan, Duo Liu, Xinyuan Ji, Yu Wu, Liang Liang, Xianzhang Chen, Yujuan Tan

    Abstract: Federated Learning (FL) enables the multiple participating devices to collaboratively contribute to a global neural network model while keeping the training data locally. Unlike the centralized training setting, the non-IID, imbalanced (statistical heterogeneity) and distribution shifted training data of FL is distributed in the federated network, which will increase the divergences between the lo…

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: Manuscript under review. arXiv admin note: substantial text overlap with arXiv:2010.06870

  14. CSAFL: A Clustered Semi-Asynchronous Federated Learning Framework

    Authors: Yu Zhang, Moming Duan, Duo Liu, Li Li, Ao Ren, Xianzhang Chen, Yujuan Tan, Chengliang Wang

    Abstract: Federated learning (FL) is an emerging distributed machine learning paradigm that protects privacy and tackles the problem of isolated data islands. At present, there are two main communication strategies of FL: synchronous FL and asynchronous FL. The advantages of synchronous FL are that the model has high precision and fast convergence speed. However, this synchronous communication strategy has…

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: This paper will be presented at IJCNN 2021

  15. FedSAE: A Novel Self-Adaptive Federated Learning Framework in Heterogeneous Systems

    Authors: Li Li, Moming Duan, Duo Liu, Yu Zhang, Ao Ren, Xianzhang Chen, Yujuan Tan, Chengliang Wang

    Abstract: Federated Learning (FL) is a novel distributed machine learning which allows thousands of edge devices to train model locally without uploading data concentrically to the server. But since real federated settings are resource-constrained, FL is encountered with systems heterogeneity which causes a lot of stragglers directly and then leads to significantly accuracy reduction indirectly. To solve th…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: This paper will be presented at IJCNN 2021

  16. FedGroup: Efficient Clustered Federated Learning via Decomposed Data-Driven Measure

    Authors: Moming Duan, Duo Liu, Xinyuan Ji, Renping Liu, Liang Liang, Xianzhang Chen, Yujuan Tan

    Abstract: Federated Learning (FL) enables the multiple participating devices to collaboratively contribute to a global neural network model while keeping the training data locally. Unlike the centralized training setting, the non-IID and imbalanced (statistical heterogeneity) training data of FL is distributed in the federated network, which will increase the divergences between the local models and global…

    Submitted 27 July, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: This work will be presented at IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA) 2021. NOTE: This revision contains a crucial correction of the client cold start mechanism, please discard all previous manuscripts

  17. arXiv:2006.12590  [pdf, other]

    cs.LG stat.ML

    C-SURE: Shrinkage Estimator and Prototype Classifier for Complex-Valued Deep Learning

    Authors: Yifei Xing, Rudrasis Chakraborty, Minxuan Duan, Stella Yu

    Abstract: The James-Stein (JS) shrinkage estimator is a biased estimator that captures the mean of Gaussian random vectors. While it has a desirable statistical property of dominance over the maximum likelihood estimator (MLE) in terms of mean squared error (MSE), not much progress has been made on extending the estimator onto manifold-valued data. We propose C-SURE, a novel Stein's unbiased risk estimate…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Submitted to CVPR PBVS workshop

  18. arXiv:1912.01772  [pdf, ps, other]

    cs.CL

    A Resource for Computational Experiments on Mapudungun

    Authors: Mingjun Duan, Carlos Fasola, Sai Krishna Rallabandi, Rodolfo M. Vega, Antonios Anastasopoulos, Lori Levin, Alan W Black

    Abstract: We present a resource for computational experiments on Mapudungun, a polysynthetic indigenous language spoken in Chile with upwards of 200 thousand speakers. We provide 142 hours of culturally significant conversations in the domain of medical treatment. The conversations are fully transcribed and translated into Spanish. The transcriptions also include annotations for code-switching and non-stand…

    Submitted 4 April, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: accepted at LREC 2020

  19. arXiv:1907.01132  [pdf, ps, other]

    cs.LG cs.DC stat.ML

    Astraea: Self-balancing Federated Learning for Improving Classification Accuracy of Mobile Deep Learning Applications

    Authors: Moming Duan, Duo Liu, Xianzhang Chen, Yujuan Tan, Jinting Ren, Lei Qiao, Liang Liang

    Abstract: Federated learning (FL) is a distributed deep learning method which enables multiple participants, such as mobile phones and IoT devices, to contribute a neural network model while their private training data remains in local devices. This distributed approach is promising in the edge computing system where have a large corpus of decentralized data and require high privacy. However, unlike the com…

    Submitted 8 May, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: Published as a conference paper at IEEE 37th International Conference on Computer Design (ICCD) 2019

  20. arXiv:1804.02810  [pdf, other]

    cs.CV

    A Novel Multi-Task Tensor Correlation Neural Network for Facial Attribute Prediction

    Authors: Mingxing Duan, Kenli Li, Qi Tian

    Abstract: Face multi-attribute prediction benefits substantially from multi-task learning (MTL), which learns multiple face attributes simultaneously to achieve shared or mutually related representations of different attributes. The most widely used MTL convolutional neural network is heuristically or empirically designed by sharing all of the convolutional layers and splitting at the fully connected layers…

    Submitted 9 April, 2018; originally announced April 2018.

    Comments: Submitted to ACM Multimedia 2018

  21. arXiv:1711.02038  [pdf, other]

    quant-ph cs.LG stat.ML

    An efficient quantum algorithm for generative machine learning

    Authors: Xun Gao, Zhengyu Zhang, Luming Duan

    Abstract: A central task in the field of quantum computing is to find applications where quantum computer could provide exponential speedup over any classical computer. Machine learning represents an important field with broad applications where quantum computer may offer significant speedup. Several quantum algorithms for discriminative machine learning have been found based on efficient solving of linear…

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: 7+15 pages, 3+6 figures