Skip to main content

Showing 1–50 of 84 results for author: Yuan, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09771  [pdf, other

    cs.DS

    Block Coordinate Descent Methods for Optimization under J-Orthogonality Constraints with Applications

    Authors: Di He, Ganzhao Yuan, Xiao Wang, Pengxiang Xu

    Abstract: The J-orthogonal matrix, also referred to as the hyperbolic orthogonal matrix, is a class of special orthogonal matrix in hyperbolic space, notable for its advantageous properties. These matrices are integral to optimization under J-orthogonal constraints, which have widespread applications in statistical learning and data science. However, addressing these problems is generally challenging due to… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2405.12511  [pdf, other

    cs.DB

    Quantum Computing for Databases: Overview and Challenges

    Authors: Gongsheng Yuan, Yuxing Chen, Jiaheng Lu, Sai Wu, Zhiwei Ye, Ling Qian, Gang Chen

    Abstract: In the decades, the general field of quantum computing has experienced remarkable progress since its inception. A plethora of researchers not only proposed quantum algorithms showing the power of quantum computing but also constructed the prototype of quantum computers, making it walk into our tangible reality. Those remarkable advancements in quantum computing have opened doors for novel applicat… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. arXiv:2405.07608  [pdf, other

    cs.NI

    FNCC: Fast Notification Congestion Control in Data Center Networks

    Authors: **g Xu, Zhan Wang, Fan Yang, Ning Kang, Zhenlong Ma, Guojun Yuan, Guangming Tan, Ninghui Sun

    Abstract: Congestion control plays a pivotal role in large-scale data centers, facilitating ultra-low latency, high bandwidth, and optimal utilization. Even with the deployment of data center congestion control mechanisms such as DCQCN and HPCC, these algorithms often respond to congestion sluggishly. This sluggishness is primarily due to the slow notification of congestion. It takes almost one round-trip t… ▽ More

    Submitted 26 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  4. arXiv:2405.04371  [pdf, other

    cs.SI cs.AI cs.CY

    Community Detection for Heterogeneous Multiple Social Networks

    Authors: Ziqing Zhu, Guan Yuan, Tao Zhou, Jiuxin Cao

    Abstract: The community plays a crucial role in understanding user behavior and network characteristics in social networks. Some users can use multiple social networks at once for a variety of objectives. These users are called overlap** users who bridge different social networks. Detecting communities across multiple social networks is vital for interaction mining, information diffusion, and behavior mig… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: This paper was accepted by IEEE Transactions on Computational Social Systems(TCSS)

  5. arXiv:2405.01992  [pdf, other

    cs.CV

    SFFNet: A Wavelet-Based Spatial and Frequency Domain Fusion Network for Remote Sensing Segmentation

    Authors: Yunsong Yang, Genji Yuan, **jiang Li

    Abstract: In order to fully utilize spatial information for segmentation and address the challenge of handling areas with significant grayscale variations in remote sensing segmentation, we propose the SFFNet (Spatial and Frequency Domain Fusion Network) framework. This framework employs a two-stage network design: the first stage extracts features using spatial methods to obtain features with sufficient sp… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  6. arXiv:2405.01065  [pdf, other

    cs.CV

    MFDS-Net: Multi-Scale Feature Depth-Supervised Network for Remote Sensing Change Detection with Global Semantic and Detail Information

    Authors: Zhenyang Huang, Zhao** Fu, Song **tao, Genji Yuan, **jiang Li

    Abstract: Change detection as an interdisciplinary discipline in the field of computer vision and remote sensing at present has been receiving extensive attention and research. Due to the rapid development of society, the geographic information captured by remote sensing satellites is changing faster and more complex, which undoubtedly poses a higher challenge and highlights the value of change detection ta… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  7. arXiv:2403.10799  [pdf, other

    cs.CL cs.AI cs.LG

    Efficient Pruning of Large Language Model with Adaptive Estimation Fusion

    Authors: Jun Liu, Chao Wu, Changdi Yang, Hao Tang, Zhenglun Kong, Geng Yuan, Wei Niu, Dong Huang, Yanzhi Wang

    Abstract: Large language models (LLMs) have become crucial for many generative downstream tasks, leading to an inevitable trend and significant challenge to deploy them efficiently on resource-constrained devices. Structured pruning is a widely used method to address this challenge. However, when dealing with the complex structure of the multiple decoder layers, general methods often employ common estimatio… ▽ More

    Submitted 14 May, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  8. arXiv:2401.16720  [pdf, other

    cs.LG cs.CV

    SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing

    Authors: Sheng Li, Geng Yuan, Yue Dai, Youtao Zhang, Yanzhi Wang, Xulong Tang

    Abstract: There has been a proliferation of artificial intelligence applications, where model training is key to promising high-quality services for these applications. However, the model training process is both time-intensive and energy-intensive, inevitably affecting the user's demand for application efficiency. Layer freezing, an efficient model training technique, has been proposed to improve training… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  9. arXiv:2401.16694  [pdf, other

    cs.LG cs.CV cs.DC

    EdgeOL: Efficient in-situ Online Learning on Edge Devices

    Authors: Sheng Li, Geng Yuan, Yawen Wu, Yue Dai, Chao Wu, Alex K. Jones, **gtong Hu, Yanzhi Wang, Xulong Tang

    Abstract: Emerging applications, such as robot-assisted eldercare and object recognition, generally employ deep learning neural networks (DNNs) and naturally require: i) handling streaming-in inference requests and ii) adapting to possible deployment scenario changes. Online model fine-tuning is widely adopted to satisfy these needs. However, an inappropriate fine-tuning scheme could involve significant ene… ▽ More

    Submitted 15 March, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  10. arXiv:2401.11664  [pdf, other

    cs.LG cs.AI cs.AR

    Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM

    Authors: Bingbing Li, Geng Yuan, Zigeng Wang, Shaoyi Huang, Hongwu Peng, Payman Behnam, Wujie Wen, Hang Liu, Caiwen Ding

    Abstract: Resistive Random Access Memory (ReRAM) has emerged as a promising platform for deep neural networks (DNNs) due to its support for parallel in-situ matrix-vector multiplication. However, hardware failures, such as stuck-at-fault defects, can result in significant prediction errors during model inference. While additional crossbars can be used to address these failures, they come with storage overhe… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  11. arXiv:2401.11261  [pdf, other

    cs.LG cs.CV

    Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient

    Authors: Weiguo Lu, Xuan Wu, Deng Ding, **qiao Duan, Jirong Zhuang, Gangnan Yuan

    Abstract: Diffusion models (DMs) are a type of generative model that has a huge impact on image synthesis and beyond. They achieve state-of-the-art generation results in various generative tasks. A great diversity of conditioning inputs, such as text or bounding boxes, are accessible to control the generation. In this work, we propose a conditioning mechanism utilizing Gaussian mixture models (GMMs) as feat… ▽ More

    Submitted 1 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  12. arXiv:2401.01183  [pdf, other

    cs.CL cs.AI

    Unifying Structured Data as Graph for Data-to-Text Pre-Training

    Authors: Shujie Li, Liang Li, Ruiying Geng, Min Yang, Binhua Li, Guanghu Yuan, Wanwei He, Shao Yuan, Can Ma, Fei Huang, Yongbin Li

    Abstract: Data-to-text (D2T) generation aims to transform structured data into natural language text. Data-to-text pre-training has proved to be powerful in enhancing D2T generation and yields impressive performances. However, previous pre-training methods either oversimplified structured data into a sequence without considering input structures or designed training objectives tailored for a specific data s… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted for TACL. Pre-MIT Press publication version

  13. arXiv:2312.15469  [pdf, other

    stat.ML cs.LG stat.ME

    Efficient Estimation of the Central Mean Subspace via Smoothed Gradient Outer Products

    Authors: Gan Yuan, Mingyue Xu, Samory Kpotufe, Daniel Hsu

    Abstract: We consider the problem of sufficient dimension reduction (SDR) for multi-index models. The estimators of the central mean subspace in prior works either have slow (non-parametric) convergence rates, or rely on stringent distributional conditions (e.g., the covariate distribution $P_{\mathbf{X}}$ being elliptical symmetric). In this paper, we show that a fast parametric convergence rate of form… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    MSC Class: 62B05; 62G08

  14. arXiv:2310.15081  [pdf, other

    cs.CV

    E4S: Fine-grained Face Swap** via Editing With Regional GAN Inversion

    Authors: Maomao Li, Ge Yuan, Cairong Wang, Zhian Liu, Yong Zhang, Yongwei Nie, Jue Wang, Dong Xu

    Abstract: This paper proposes a novel approach to face swap** from the perspective of fine-grained facial editing, dubbed "editing for swap**" (E4S). The traditional face swap** methods rely on global feature extraction and fail to preserve the detailed source identity. In contrast, we propose a Regional GAN Inversion (RGI) method, which allows the explicit disentanglement of shape and texture. Specif… ▽ More

    Submitted 27 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Project Page: https://e4s2024.github.io/ ;. arXiv admin note: text overlap with arXiv:2211.14068

  15. MTS-LOF: Medical Time-Series Representation Learning via Occlusion-Invariant Features

    Authors: Huayu Li, Ana S. Carreon-Rascon, Xiwen Chen, Geng Yuan, Ao Li

    Abstract: Medical time series data are indispensable in healthcare, providing critical insights for disease diagnosis, treatment planning, and patient management. The exponential growth in data complexity, driven by advanced sensor technologies, has presented challenges related to data labeling. Self-supervised learning (SSL) has emerged as a transformative approach to address these challenges, eliminating… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  16. arXiv:2309.14363  [pdf, ps, other

    quant-ph cs.DS cs.ET

    Infeasibility of constructing a special orthogonal matrix for the deterministic remote preparation of arbitrary n-qubit state

    Authors: Wenjie Liu, Zixian Li, Gonglin Yuan

    Abstract: In this paper, we present a polynomial-complexity algorithm to construct a special orthogonal matrix for the deterministic remote state preparation (DRSP) of an arbitrary n-qubit state, and prove that if n>3, such matrices do not exist. Firstly, the construction problem is split into two sub-problems, i.e., finding a solution of a semi-orthogonal matrix and generating all semi-orthogonal matrices.… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: 31 figures

    Journal ref: Quantum Information & Computation, 2022. 22(15&16): p. 1289-1319

  17. arXiv:2309.12212  [pdf, other

    cs.ET cs.AR cs.LG

    SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices

    Authors: Zhengang Li, Geng Yuan, Tomoharu Yamauchi, Zabihi Masoud, Yanyue Xie, Peiyan Dong, Xulong Tang, Nobuyuki Yoshikawa, Devesh Tiwari, Yanzhi Wang, Olivia Chen

    Abstract: Adiabatic Quantum-Flux-Parametron (AQFP) is a superconducting logic with extremely high energy efficiency. By employing the distinct polarity of current to denote logic `0' and `1', AQFP devices serve as excellent carriers for binary neural network (BNN) computations. Although recent research has made initial strides toward develo** an AQFP-based BNN accelerator, several critical challenges rema… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted by MICRO'23 (56th IEEE/ACM International Symposium on Microarchitecture)

  18. arXiv:2309.07438  [pdf, other

    cs.AI cs.NI

    Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges

    Authors: Fei Dou, ** Ye, Geng Yuan, Qin Lu, Wei Niu, Haijian Sun, Le Guan, Guoyu Lu, Gengchen Mai, Ninghao Liu, ** Lu, Zhengliang Liu, Zihao Wu, Chenjiao Tan, Shaochen Xu, Xianqiao Wang, Guoming Li, Lilong Chai, Sheng Li, ** Sun, Hongyue Sun, Yunli Shao, Changying Li, Tianming Liu, Wenzhan Song

    Abstract: Artificial General Intelligence (AGI), possessing the capacity to comprehend, learn, and execute tasks with human cognitive abilities, engenders significant anticipation and intrigue across scientific, commercial, and societal arenas. This fascination extends particularly to the Internet of Things (IoT), a landscape characterized by the interconnection of countless devices, sensors, and systems, c… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  19. arXiv:2308.09444  [pdf, other

    cs.LG stat.ML

    An Efficient 1 Iteration Learning Algorithm for Gaussian Mixture Model And Gaussian Mixture Embedding For Neural Network

    Authors: Weiguo Lu, Xuan Wu, Deng Ding, Gangnan Yuan

    Abstract: We propose an Gaussian Mixture Model (GMM) learning algorithm, based on our previous work of GMM expansion idea. The new algorithm brings more robustness and simplicity than classic Expectation Maximization (EM) algorithm. It also improves the accuracy and only take 1 iteration for learning. We theoretically proof that this new algorithm is guarantee to converge regardless the parameters initialis… ▽ More

    Submitted 6 September, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

  20. arXiv:2307.12216  [pdf, other

    cs.ET

    A Life-Cycle Energy and Inventory Analysis of Adiabatic Quantum-Flux-Parametron Circuits

    Authors: Masoud Zabihi, Yanyue Xie, Zhengang Li, Peiyan Dong, Geng Yuan, Olivia Chen, Massoud Pedram, Yanzhi Wang

    Abstract: The production process of superconductive integrated circuits is complex and consumes significant amounts of resources and energy. Therefore, it is crucial to evaluate the environmental impact of this emerging technology. An attractive option for the next generation of superconductive technology is Adiabatic Quantum-Flux-Parametron (AQFP) devices. This study is the first to present a comprehensive… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  21. arXiv:2306.05356  [pdf, other

    cs.CV

    ReliableSwap: Boosting General Face Swap** Via Reliable Supervision

    Authors: Ge Yuan, Maomao Li, Yong Zhang, Huicheng Zheng

    Abstract: Almost all advanced face swap** approaches use reconstruction as the proxy task, i.e., supervision only exists when the target and source belong to the same person. Otherwise, lacking pixel-level supervision, these methods struggle for source identity preservation. This paper proposes to construct reliable supervision, dubbed cycle triplets, which serves as the image-level guidance when the sour… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Project page: https://reliable-swap.github.io/ ; Github repository: https://github.com/ygtxr1997/ReliableSwap ; Demo (HuggingFace): https://huggingface.co/spaces/ygtxr1997/ReliableSwap_Demo ;

  22. arXiv:2306.00926  [pdf, other

    cs.CV

    Inserting Anybody in Diffusion Models via Celeb Basis

    Authors: Ge Yuan, Xiaodong Cun, Yong Zhang, Maomao Li, Chenyang Qi, Xintao Wang, Ying Shan, Huicheng Zheng

    Abstract: Exquisite demand exists for customizing the pretrained large text-to-image model, $\textit{e.g.}$, Stable Diffusion, to generate innovative concepts, such as the users themselves. However, the newly-added concept from previous customization methods often shows weaker combination abilities than the original ones even given several images during training. We thus propose a new personalization method… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Project page: http://celeb-basis.github.io ; Github repository: https://github.com/ygtxr1997/CelebBasis

  23. arXiv:2305.14751  [pdf, other

    cs.CL cs.AI

    DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade

    Authors: Zefan Cai, Xin Zheng, Tianyu Liu, Xu Wang, Haoran Meng, Jiaqi Han, Gang Yuan, Binghuai Lin, Baobao Chang, Yunbo Cao

    Abstract: In the constant updates of the product dialogue systems, we need to retrain the natural language understanding (NLU) model as new data from the real users would be merged into the existent data accumulated in the last updates. Within the newly added data, new intents would emerge and might have semantic entanglement with the existing intents, e.g. new intents that are semantically too specific or… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: work in progress. The first three authors contribute equally

  24. arXiv:2304.03641  [pdf, ps, other

    math.OC cs.LG math.NA

    A Block Coordinate Descent Method for Nonsmooth Composite Optimization under Orthogonality Constraints

    Authors: Ganzhao Yuan

    Abstract: Nonsmooth composite optimization with orthogonality constraints has a broad spectrum of applications in statistical learning and data science. However, this problem is generally challenging to solve due to its non-convex and non-smooth nature. Existing solutions are limited by one or more of the following restrictions: (i) they are full gradient methods that require high computational costs in eac… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  25. arXiv:2211.12005  [pdf, other

    cs.LG cs.CR stat.ML

    Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors

    Authors: Sizhe Chen, Geng Yuan, Xinwen Cheng, Yifan Gong, Minghai Qin, Yanzhi Wang, Xiaolin Huang

    Abstract: As data becomes increasingly vital, a company would be very cautious about releasing data, because the competitors could use it to train high-performance models, thereby posing a tremendous threat to the company's commercial competence. To prevent training good models on the data, we could add imperceptible perturbations to it. Since such perturbations aim at hurting the entire training process, t… ▽ More

    Submitted 12 April, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: ICLR 2023

  26. arXiv:2211.10801  [pdf, other

    cs.CV cs.AI cs.LG

    Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training

    Authors: Zhenglun Kong, Haoyu Ma, Geng Yuan, Mengshu Sun, Yanyue Xie, Peiyan Dong, Xin Meng, Xuan Shen, Hao Tang, Minghai Qin, Tianlong Chen, Xiaolong Ma, Xiaohui Xie, Zhangyang Wang, Yanzhi Wang

    Abstract: Vision transformers (ViTs) have recently obtained success in many applications, but their intensive computation and heavy memory usage at both training and inference time limit their generalization. Previous compression algorithms usually start from the pre-trained dense models and only focus on efficient inference, while time-consuming training is still unavoidable. In contrast, this paper points… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: AAAI 2023

  27. arXiv:2211.01484  [pdf, other

    cs.CV cs.LG

    Data Level Lottery Ticket Hypothesis for Vision Transformers

    Authors: Xuan Shen, Zhenglun Kong, Minghai Qin, Peiyan Dong, Geng Yuan, Xin Meng, Hao Tang, Xiaolong Ma, Yanzhi Wang

    Abstract: The conventional lottery ticket hypothesis (LTH) claims that there exists a sparse subnetwork within a dense neural network and a proper random initialization method called the winning ticket, such that it can be trained from scratch to almost as good as the dense counterpart. Meanwhile, the research of LTH in vision transformers (ViTs) is scarcely evaluated. In this paper, we first show that the… ▽ More

    Submitted 29 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted by IJCAI 2023

  28. arXiv:2210.10629  [pdf, other

    cs.IR

    Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems

    Authors: Guanghu Yuan, Fajie Yuan, Yudong Li, Beibei Kong, Shujie Li, Lei Chen, Min Yang, Chenyun Yu, Bo Hu, Zang Li, Yu Xu, Xiaohu Qie

    Abstract: Existing benchmark datasets for recommender systems (RS) either are created at a small scale or involve very limited forms of user feedback. RS models evaluated on such datasets often lack practical values for large-scale real-world applications. In this paper, we describe Tenrec, a novel and publicly available data collection for RS that records various user feedback from four different recommend… ▽ More

    Submitted 4 June, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

  29. arXiv:2210.04623  [pdf, other

    cs.DC

    DeltaFS: Pursuing Zero Update Overhead via Metadata-Enabled Delta Compression for Log-structured File System on Mobile Devices

    Authors: Chao Wu, Cheng Ji, Geng Yuan, Riwei Pan, Weichao Guo, Chao Yu, Zongwei Zhu, Yanzhi Wang

    Abstract: Data compression has been widely adopted to release mobile devices from intensive write pressure. Delta compression is particularly promising for its high compression efficacy over conventional compression methods. However, this method suffers from non-trivial system overheads incurred by delta maintenance and read penalty, which prevents its applicability on mobile devices. To this end, this pape… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  30. arXiv:2209.11204  [pdf, other

    cs.LG cs.AI cs.CV

    Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training

    Authors: Geng Yuan, Yanyu Li, Sheng Li, Zhenglun Kong, Sergey Tulyakov, Xulong Tang, Yanzhi Wang, Jian Ren

    Abstract: Recently, sparse training has emerged as a promising paradigm for efficient deep learning on edge devices. The current research mainly devotes efforts to reducing training costs by further increasing model sparsity. However, increasing sparsity is not always ideal since it will inevitably introduce severe accuracy degradation at an extremely high sparsity level. This paper intends to explore other… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: Published in 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  31. arXiv:2209.09476  [pdf, other

    cs.LG cs.AI cs.CV

    SparCL: Sparse Continual Learning on the Edge

    Authors: Zifeng Wang, Zheng Zhan, Yifan Gong, Geng Yuan, Wei Niu, Tong Jian, Bin Ren, Stratis Ioannidis, Yanzhi Wang, Jennifer Dy

    Abstract: Existing work in continual learning (CL) focuses on mitigating catastrophic forgetting, i.e., model performance deterioration on past tasks when learning a new task. However, the training efficiency of a CL system is under-investigated, which limits the real-world application of CL systems under resource-limited scenarios. In this work, we propose a novel framework called Sparse Continual Learning… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: Published at NeurIPS 2022 as a conference paper

  32. arXiv:2208.05163  [pdf, other

    cs.CV cs.LG eess.IV

    Auto-ViT-Acc: An FPGA-Aware Automatic Acceleration Framework for Vision Transformer with Mixed-Scheme Quantization

    Authors: Zhengang Li, Mengshu Sun, Alec Lu, Haoyu Ma, Geng Yuan, Yanyue Xie, Hao Tang, Yanyu Li, Miriam Leeser, Zhangyang Wang, Xue Lin, Zhenman Fang

    Abstract: Vision transformers (ViTs) are emerging with significantly improved accuracy in computer vision tasks. However, their complex architecture and enormous computation/storage demand impose urgent needs for new hardware accelerator design methodology. This work proposes an FPGA-aware automatic ViT acceleration framework based on the proposed mixed-scheme quantization. To the best of our knowledge, thi… ▽ More

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: Published in FPL2022

  33. arXiv:2206.01244  [pdf, other

    cs.CV eess.IV

    Real-Time Portrait Stylization on the Edge

    Authors: Yanyu Li, Xuan Shen, Geng Yuan, Jiexiong Guan, Wei Niu, Hao Tang, Bin Ren, Yanzhi Wang

    Abstract: In this work we demonstrate real-time portrait stylization, specifically, translating self-portrait into cartoon or anime style on mobile devices. We propose a latency-driven differentiable architecture search method, maintaining realistic generative quality. With our framework, we obtain $10\times$ computation reduction on the generative model and achieve real-time video stylization on off-the-sh… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

  34. arXiv:2206.01198  [pdf, other

    cs.CV

    Pruning-as-Search: Efficient Neural Architecture Search via Channel Pruning and Structural Reparameterization

    Authors: Yanyu Li, Pu Zhao, Geng Yuan, Xue Lin, Yanzhi Wang, Xin Chen

    Abstract: Neural architecture search (NAS) and network pruning are widely studied efficient AI techniques, but not yet perfect. NAS performs exhaustive candidate architecture search, incurring tremendous search cost. Though (structured) pruning can simply shrink model dimension, it remains unclear how to decide the per-layer sparsity automatically and optimally. In this work, we revisit the problem of layer… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

  35. arXiv:2206.01191  [pdf, other

    cs.CV

    EfficientFormer: Vision Transformers at MobileNet Speed

    Authors: Yanyu Li, Geng Yuan, Yang Wen, Ju Hu, Georgios Evangelidis, Sergey Tulyakov, Yanzhi Wang, Jian Ren

    Abstract: Vision Transformers (ViT) have shown rapid progress in computer vision tasks, achieving promising results on various benchmarks. However, due to the massive number of parameters and model design, \textit{e.g.}, attention mechanism, ViT-based models are generally times slower than lightweight convolutional networks. Therefore, the deployment of ViT for real-time applications is particularly challen… ▽ More

    Submitted 10 October, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  36. arXiv:2204.13737  [pdf, other

    cs.CR

    Extricating IoT Devices from Vendor Infrastructure with Karl

    Authors: Gina Yuan, David Mazières, Matei Zaharia

    Abstract: Most consumer IoT devices are vertically integrated with cloud-side infrastructure. Such architectures present enormous risk to user data, exacerbated by vendor heterogeneity and the inability for users to audit cloud-side activity. A more promising approach would be to leverage local hardware, providing users control over how their data is processed and why it can be shared with other devices or… ▽ More

    Submitted 31 May, 2023; v1 submitted 28 April, 2022; originally announced April 2022.

  37. arXiv:2203.16214  [pdf

    cs.LG

    Adaptive Divergence-based Non-negative Latent Factor Analysis

    Authors: Ye Yuan, Guangxiao Yuan, Renfang Wang, Xin Luo

    Abstract: High-Dimensional and Incomplete (HDI) data are frequently found in various industrial applications with complex interactions among numerous nodes, which are commonly non-negative for representing the inherent non-negativity of node interactions. A Non-negative Latent Factor (NLF) model is able to extract intrinsic features from such data efficiently. However, existing NLF models all adopt a static… ▽ More

    Submitted 22 October, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  38. arXiv:2201.12691  [pdf, other

    math.OC cs.LG math.NA

    Coordinate Descent Methods for Fractional Minimization

    Authors: Ganzhao Yuan

    Abstract: We consider a class of structured fractional minimization problems, in which the numerator part of the objective is the sum of a differentiable convex function and a convex non-smooth function, while the denominator part is a convex or concave function. This problem is difficult to solve since it is non-convex. By exploiting the structure of the problem, we propose two Coordinate Descent (CD) meth… ▽ More

    Submitted 24 March, 2023; v1 submitted 29 January, 2022; originally announced January 2022.

  39. arXiv:2112.13890  [pdf, other

    cs.CV cs.AI cs.AR cs.LG

    SPViT: Enabling Faster Vision Transformers via Soft Token Pruning

    Authors: Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Mengshu Sun, Wei Niu, Xuan Shen, Geng Yuan, Bin Ren, Minghai Qin, Hao Tang, Yanzhi Wang

    Abstract: Recently, Vision Transformer (ViT) has continuously established new milestones in the computer vision field, while the high computation and memory cost makes its propagation in industrial production difficult. Pruning, a traditional model compression paradigm for hardware efficiency, has been widely applied in various DNN structures. Nevertheless, it stays ambiguous on how to perform exclusive pru… ▽ More

    Submitted 20 September, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: ECCV 2022

  40. arXiv:2111.11581  [pdf, other

    cs.LG cs.AI cs.CV cs.DC

    Automatic Map** of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration

    Authors: Yifan Gong, Geng Yuan, Zheng Zhan, Wei Niu, Zhengang Li, Pu Zhao, Yuxuan Cai, Sijia Liu, Bin Ren, Xue Lin, Xulong Tang, Yanzhi Wang

    Abstract: Weight pruning is an effective model compression technique to tackle the challenges of achieving real-time deep neural network (DNN) inference on mobile devices. However, prior pruning schemes have limited application scenarios due to accuracy degradation, difficulty in leveraging hardware acceleration, and/or restriction on certain types of DNN layers. In this paper, we propose a general, fine-gr… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  41. arXiv:2110.14032  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge

    Authors: Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing **, Siyue Wang, Minghai Qin, Bin Ren, Yanzhi Wang, Sijia Liu, Xue Lin

    Abstract: Recently, a new trend of exploring sparsity for accelerating neural network training has emerged, embracing the paradigm of training on the edge. This paper proposes a novel Memory-Economic Sparse Training (MEST) framework targeting for accurate and fast execution on edge devices. The proposed MEST framework consists of enhancements by Elastic Mutation (EM) and Soft Memory Bound (&S) that ensure s… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 Spotlight Paper

  42. arXiv:2110.08418  [pdf, ps, other

    stat.ML cs.LG

    Nuances in Margin Conditions Determine Gains in Active Learning

    Authors: Samory Kpotufe, Gan Yuan, Yunfan Zhao

    Abstract: We consider nonparametric classification with smooth regression functions, where it is well known that notions of margin in $E[Y|X]$ determine fast or slow rates in both active and passive learning. Here we elucidate a striking distinction between the two settings. Namely, we show that some seemingly benign nuances in notions of margin -- involving the uniqueness of the Bayes classifier, and which… ▽ More

    Submitted 25 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  43. arXiv:2109.04228  [pdf, other

    math.OC cs.LG

    Coordinate Descent Methods for DC Minimization: Optimality Conditions and Global Convergence

    Authors: Ganzhao Yuan

    Abstract: Difference-of-Convex (DC) minimization, referring to the problem of minimizing the difference of two convex functions, has been found rich applications in statistical learning and studied extensively for decades. However, existing methods are primarily based on multi-stage convex relaxation, only leading to weak optimality of critical points. This paper proposes a coordinate descent method for min… ▽ More

    Submitted 17 December, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  44. Storing Multi-model Data in RDBMSs based on Reinforcement Learning

    Authors: Gongsheng Yuan, Jiaheng Lu, Shuxun Zhang, Zhengtong Yan

    Abstract: How to manage various data in a unified way is a significant research topic in the field of databases. To address this problem, researchers have proposed multi-model databases to support multiple data models in a uniform platform with a single unified query language. However, since relational databases are predominant in the current market, it is expensive to replace them with others. Besides, due… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: 4 pages, 4 figures, CIKM

  45. arXiv:2109.00136  [pdf, other

    cs.DB

    MORTAL: A Tool of Automatically Designing Relational Storage Schemas for Multi-model Data through Reinforcement Learning

    Authors: Gongsheng Yuan, Jiaheng Lu

    Abstract: Considering relational databases having powerful capabilities in handling security, user authentication, query optimization, etc., several commercial and academic frameworks reuse relational databases to store and query semi-structured data (e.g., XML, JSON) or graph data (e.g., RDF, property graph). However, these works concentrate on managing one of the above data models with RDBMSs. That is, it… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: 6 pages, 4 figures, ER

  46. Quantum-Inspired Keyword Search on Multi-Model Databases

    Authors: Gongsheng Yuan, Jiaheng Lu, Peifeng Su

    Abstract: With the rising applications implemented in different domains, it is inevitable to require databases to adopt corresponding appropriate data models to store and exchange data derived from various sources. To handle these data models in a single platform, the community of databases introduces a multi-model database. And many vendors are improving their products from supporting a single data model t… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: 16 pages, 5 figures, Dasfaa

  47. arXiv:2108.08910  [pdf, other

    eess.IV cs.AI cs.CV cs.LG cs.NE

    Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search

    Authors: Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David Kaeli, Bin Ren, Xue Lin, Yanzhi Wang

    Abstract: Though recent years have witnessed remarkable progress in single image super-resolution (SISR) tasks with the prosperous development of deep neural networks (DNNs), the deep learning methods are confronted with the computation and memory consumption issues in practice, especially for resource-limited platforms such as mobile devices. To overcome the challenge and facilitate the real-time deploymen… ▽ More

    Submitted 14 February, 2023; v1 submitted 18 August, 2021; originally announced August 2021.

  48. arXiv:2107.00166  [pdf, other

    cs.LG cs.AI cs.CV

    Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot?

    Authors: Xiaolong Ma, Geng Yuan, Xuan Shen, Tianlong Chen, Xuxi Chen, Xiaohan Chen, Ning Liu, Minghai Qin, Sijia Liu, Zhangyang Wang, Yanzhi Wang

    Abstract: There have been long-standing controversies and inconsistencies over the experiment setup and criteria for identifying the "winning ticket" in literature. To reconcile such, we revisit the definition of lottery ticket hypothesis, with comprehensive and more rigorous conditions. Under our new definition, we show concrete evidence to clarify whether the winning ticket exists across the major DNN arc… ▽ More

    Submitted 26 October, 2021; v1 submitted 30 June, 2021; originally announced July 2021.

    Comments: NeurIPS 2021 camera ready

  49. arXiv:2106.15304  [pdf, other

    cs.CV

    Towards Fast and Accurate Multi-Person Pose Estimation on Mobile Devices

    Authors: Xuan Shen, Geng Yuan, Wei Niu, Xiaolong Ma, Jiexiong Guan, Zhengang Li, Bin Ren, Yanzhi Wang

    Abstract: The rapid development of autonomous driving, abnormal behavior detection, and behavior recognition makes an increasing demand for multi-person pose estimation-based applications, especially on mobile platforms. However, to achieve high accuracy, state-of-the-art methods tend to have a large model size and complex post-processing algorithm, which costs intense computation and long end-to-end latenc… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

  50. arXiv:2106.14943  [pdf, other

    cs.CV cs.AI

    Achieving Real-Time Object Detection on MobileDevices with Neural Pruning Search

    Authors: Pu Zhao, Wei Niu, Geng Yuan, Yuxuan Cai, Bin Ren, Yanzhi Wang, Xue Lin

    Abstract: Object detection plays an important role in self-driving cars for security development. However, mobile systems on self-driving cars with limited computation resources lead to difficulties for object detection. To facilitate this, we propose a compiler-aware neural pruning search framework to achieve high-speed inference on autonomous vehicles for 2D and 3D object detection. The framework automati… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: Presented on the HiPEAC 2021 workshop (cogarch 2021)