
Showing 1–50 of 50 results for author: Ye, P

Searching in archive cs.
  1. Start from Zero: Triple Set Prediction for Automatic Knowledge Graph Completion

    Authors: Wen Zhang, Yajing Xu, Peng Ye, Zhiwei Huang, Zezhong Xu, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graph (KG) completion aims to find out missing triples in a KG. Some tasks, such as link prediction and instance completion, have been proposed for KG completion. They are triple-level tasks with some elements in a missing triple given to predict the missing element of the triple. However, knowing some elements of the missing triple in advance is not always a realistic setting. In this p…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Paper accepted by TKDE in 2024

  2. arXiv:2406.10391  [pdf, other]

    q-bio.QM cs.LG

    BEACON: Benchmark for Comprehensive RNA Tasks and Language Models

    Authors: Yuchen Ren, Zhiyuan Chen, Lifeng Qiao, Hongtai Jing, Yuchen Cai, Sheng Xu, Peng Ye, Xinzhu Ma, Siqi Sun, Hongliang Yan, Dong Yuan, Wanli Ouyang, Xihui Liu

    Abstract: RNA plays a pivotal role in translating genetic instructions into functional outcomes, underscoring its importance in biological processes and disease mechanisms. Despite the emergence of numerous deep learning approaches for RNA, particularly universal RNA language models, there remains a significant lack of standardized benchmarks to assess the effectiveness of these methods. In this study, we i…

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.07050  [pdf, other]

    cs.CV

    DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification

    Authors: Jiamu Sheng, Jingyi Zhou, Jiong Wang, Peng Ye, Jiayuan Fan

    Abstract: The effectiveness and efficiency of modeling complex spectral-spatial relations are both crucial for Hyperspectral image (HSI) classification. Most existing methods based on CNNs and transformers still suffer from heavy computational burdens and have room for improvement in capturing the global-local spectral-spatial feature representation. To this end, we propose a novel lightweight parallel desi…

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.03051  [pdf, other]

    cs.CV

    Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision

    Authors: Minglei Li, Peng Ye, Yongqi Huang, Lin Zhang, Tao Chen, Tong He, Jiayuan Fan, Wanli Ouyang

    Abstract: Parameter-efficient fine-tuning (PEFT) has become increasingly important as foundation models continue to grow in both popularity and size. Adapter has been particularly well-received due to their potential for parameter reduction and adaptability across diverse tasks. However, striking a balance between high efficiency and robust generalization across tasks remains a challenge for adapter-based m…

    Submitted 5 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2406.01645  [pdf, other]

    cs.LG cs.AI

    FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation

    Authors: Kun Chen, Tao Chen, Peng Ye, Hao Chen, Kang Chen, Tao Han, Wanli Ouyang, Lei Bai

    Abstract: Data assimilation is a vital component in modern global medium-range weather forecasting systems to obtain the best estimation of the atmospheric state by combining the short-term forecast and observations. Recently, AI-based data assimilation approaches have attracted increasing attention for their significant advantages over traditional techniques in terms of computational consumption. However,…

    Submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2406.01125  [pdf, other]

    cs.CV

    $Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers

    Authors: Pengtao Chen, Mingzhu Shen, Peng Ye, Jianjian Cao, Chongjun Tu, Christos-Savvas Bouganis, Yiren Zhao, Tao Chen

    Abstract: Diffusion models are widely recognized for generating high-quality and diverse images, but their poor real-time performance has led to numerous acceleration works, primarily focusing on UNet-based structures. With the more successful results achieved by diffusion transformers (DiT), there is still a lack of exploration regarding the impact of DiT structure on generation, as well as the absence of…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, 6 tables

  7. arXiv:2405.17461  [pdf, other]

    cs.LG cs.CV

    EMR-Merging: Tuning-Free High-Performance Model Merging

    Authors: Chenyu Huang, Peng Ye, Tao Chen, Tong He, Xiangyu Yue, Wanli Ouyang

    Abstract: The success of pretrain-finetune paradigm brings about the release of numerous model weights. In this case, merging models finetuned on different tasks to enable a single model with multi-task capabilities is gaining increasing attention for its practicability. Existing model merging methods usually suffer from (1) significant performance degradation or (2) requiring tuning by additional data or t…

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.11856  [pdf, other]

    cs.RO eess.SY

    Modeling and simulation of a mechanism for suppressing the flipping problem of a jumping robot

    Authors: Qi Li, Liang Peng, Zhiyuan Wu, Pengda Ye, Weitao Zhang, Yi Xu, Qing Shi

    Abstract: In order to solve the problem of stable jumping of micro robot, we design a special mechanism: elastic passive joint (EPJ). EPJ can assist in achieving smooth jumping through the opening-closing process when the robot jumps. First, we introduce the composition and operation principle of EPJ, and perform a dynamic modeling of the robot's jumping process. Then, in order to verify the effectiveness o…

    Submitted 20 May, 2024; originally announced May 2024.

  9. arXiv:2403.15835  [pdf, other]

    cs.CV

    Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

    Authors: Hancheng Ye, Chong Yu, Peng Ye, Renqiu Xia, Yansong Tang, Jiwen Lu, Tao Chen, Bo Zhang

    Abstract: Recent Vision Transformer Compression (VTC) works mainly follow a two-stage scheme, where the importance score of each model unit is first evaluated or preset in each submodule, followed by the sparsity score evaluation according to the target sparsity constraint. Such a separate evaluation process induces the gap between importance and sparsity score distributions, thus causing high search costs…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024. Our code will be available at www.github.com/HankYe/Once-for-Both

  10. arXiv:2403.12646  [pdf, other]

    cs.LG

    Prompt-fused framework for Inductive Logical Query Answering

    Authors: Zezhong Xu, Peng Ye, Lei Liang, Huajun Chen, Wen Zhang

    Abstract: Answering logical queries on knowledge graphs (KG) poses a significant challenge for machine reasoning. The primary obstacle in this task stems from the inherent incompleteness of KGs. Existing research has predominantly focused on addressing the issue of missing edges in KGs, thereby neglecting another aspect of incompleteness: the emergence of new entities. Furthermore, most of the existing meth…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted by COLING 2024

  11. arXiv:2403.06417  [pdf, other]

    cs.CV

    Enhanced Sparsification via Stimulative Training

    Authors: Shengji Tang, Weihao Lin, Hancheng Ye, Peng Ye, Chong Yu, Baopu Li, Tao Chen

    Abstract: Sparsification-based pruning has been an important category in model compression. Existing methods commonly set sparsity-inducing penalty terms to suppress the importance of dropped weights, which is regarded as the suppressed sparsification paradigm. However, this paradigm inactivates the dropped parts of networks causing capacity damage before pruning, thereby leading to performance degradation.…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 26 pages

  12. arXiv:2403.02991  [pdf, other]

    cs.CV

    MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer

    Authors: Jianjian Cao, Peng Ye, Shengze Li, Chong Yu, Yansong Tang, Jiwen Lu, Tao Chen

    Abstract: Vision-Language Transformers (VLTs) have shown great success recently, but are meanwhile accompanied by heavy computation costs, where a major reason can be attributed to the large number of visual and language tokens. Existing token pruning research for compressing VLTs mainly follows a single-modality-based scheme yet ignores the critical role of aligning different modalities for guiding the tok…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 19 pages, 9 figures, Published in CVPR2024

    Journal ref: In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  13. arXiv:2402.16868  [pdf, other]

    cs.IT cs.AI

    Codebook-enabled Generative End-to-end Semantic Communication Powered by Transformer

    Authors: Peigen Ye, Yaping Sun, Shumin Yao, Hao Chen, Xiaodong Xu, Shuguang Cui

    Abstract: Codebook-based generative semantic communication attracts increasing attention, since only indices are required to be transmitted when the codebook is shared between transmitter and receiver. However, due to the fact that the semantic relations among code vectors are not necessarily related to the distance of the corresponding code indices, the performance of the codebook-enabled semantic communic…

    Submitted 5 March, 2024; v1 submitted 22 January, 2024; originally announced February 2024.

    Comments: IEEE INFOCOM PerAI6G 2024(accepted)

  14. arXiv:2402.04290  [pdf, other]

    cs.LG cs.AI

    CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling

    Authors: Junchao Gong, Lei Bai, Peng Ye, Wanghan Xu, Na Liu, Jianhua Dai, Xiaokang Yang, Wanli Ouyang

    Abstract: Precipitation nowcasting based on radar data plays a crucial role in extreme weather prediction and has broad implications for disaster management. Despite progresses have been made based on deep learning, two key challenges of precipitation nowcasting are not well-solved: (i) the modeling of complex precipitation system evolutions with different scales, and (ii) accurate forecasts for extreme pre…

    Submitted 6 February, 2024; originally announced February 2024.

  15. arXiv:2401.12665  [pdf, other]

    cs.CV cs.AI

    ClipSAM: CLIP and SAM Collaboration for Zero-Shot Anomaly Segmentation

    Authors: Shengze Li, Jianjian Cao, Peng Ye, Yuhan Ding, Chongjun Tu, Tao Chen

    Abstract: Recently, foundational models such as CLIP and SAM have shown promising performance for the task of Zero-Shot Anomaly Segmentation (ZSAS). However, either CLIP-based or SAM-based ZSAS methods still suffer from non-negligible key drawbacks: 1) CLIP primarily focuses on global feature alignment across different inputs, leading to imprecise segmentation of local anomalous parts; 2) SAM tends to gener…

    Submitted 29 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 17 pages, 17 figures

  16. arXiv:2401.02880  [pdf, other]

    cs.CR

    Lotto: Secure Participant Selection against Adversarial Servers in Federated Learning

    Authors: Zhifeng Jiang, Peng Ye, Shiqi He, Wei Wang, Ruichuan Chen, Bo Li

    Abstract: In Federated Learning (FL), common privacy-enhancing techniques, such as secure aggregation and distributed differential privacy, rely on the critical assumption of an honest majority among participants to withstand various attacks. In practice, however, servers are not always trusted, and an adversarial server can strategically select compromised clients to create a dishonest majority, thereby un…

    Submitted 6 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: This article has been accepted to USENIX Security '24

  17. arXiv:2312.16240  [pdf, other]

    cs.CV cs.AI

    Merging Vision Transformers from Different Tasks and Domains

    Authors: Peng Ye, Chenyu Huang, Mingzhu Shen, Tao Chen, Yongqi Huang, Yuning Zhang, Wanli Ouyang

    Abstract: This work targets to merge various Vision Transformers (ViTs) trained on different tasks (i.e., datasets with different object categories) or domains (i.e., datasets with the same categories but different environments) into one unified model, yielding still good performance on each task or domain. Previous model merging works focus on either CNNs or NLP models, leaving the ViTs merging research un…

    Submitted 25 December, 2023; originally announced December 2023.

  18. arXiv:2312.15681  [pdf, other]

    cs.CV cs.AI

    Partial Fine-Tuning: A Successor to Full Fine-Tuning for Vision Transformers

    Authors: Peng Ye, Yongqi Huang, Chongjun Tu, Minglei Li, Tao Chen, Tong He, Wanli Ouyang

    Abstract: Fine-tuning pre-trained foundation models has gained significant popularity in various research fields. Existing methods for fine-tuning can be roughly divided into two categories, namely Parameter-Efficient Fine-Tuning and High-Performance Fine-Tuning. The former aims at improving efficiency, while the latter focuses on enhancing performance. Beyond these methods, we demonstrate that Partial Fine…

    Submitted 25 December, 2023; originally announced December 2023.

  19. arXiv:2312.15617  [pdf, other]

    cs.CR cs.CV

    GanFinger: GAN-Based Fingerprint Generation for Deep Neural Network Ownership Verification

    Authors: Huali Ren, Anli Yan, Xiaojun Ren, Pei-Gen Ye, Chong-zhi Gao, Zhili Zhou, Jin Li

    Abstract: Deep neural networks (DNNs) are extensively employed in a wide range of application scenarios. Generally, training a commercially viable neural network requires significant amounts of data and computing resources, and it is easy for unauthorized users to use the networks illegally. Therefore, network ownership verification has become one of the most crucial steps in safeguarding digital assets. To…

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures

  20. arXiv:2312.14200  [pdf, other]

    cs.CV

    Efficient Architecture Search via Bi-level Data Pruning

    Authors: Chongjun Tu, Peng Ye, Weihao Lin, Hancheng Ye, Chong Yu, Tao Chen, Baopu Li, Wanli Ouyang

    Abstract: Improving the efficiency of Neural Architecture Search (NAS) is a challenging but significant task that has received much attention. Previous works mainly adopted the Differentiable Architecture Search (DARTS) and improved its search strategies or modules to enhance search efficiency. Recently, some methods have started considering data reduction for speedup, but they are not tightly coupled with…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 11 pages

    MSC Class: 68T05(Primary)

  21. arXiv:2312.13514  [pdf, other]

    cs.CV

    Rethinking of Feature Interaction for Multi-task Learning on Dense Prediction

    Authors: Jingdong Zhang, Jiayuan Fan, Peng Ye, Bo Zhang, Hancheng Ye, Baopu Li, Yancheng Cai, Tao Chen

    Abstract: Existing works generally adopt the encoder-decoder structure for Multi-task Dense Prediction, where the encoder extracts the task-generic features, and multiple decoders generate task-specific features for predictions. We observe that low-level representations with rich details and high-level representations with abundant task information are not both involved in the multi-task interaction process…

    Submitted 20 December, 2023; originally announced December 2023.

  22. arXiv:2312.12462  [pdf, other]

    physics.ao-ph cs.AI cs.LG

    Towards an end-to-end artificial intelligence driven global weather forecasting system

    Authors: Kun Chen, Lei Bai, Fenghua Ling, Peng Ye, Tao Chen, Jing-Jia Luo, Hao Chen, Yi Xiao, Kang Chen, Tao Han, Wanli Ouyang

    Abstract: The weather forecasting system is important for science and society, and significant achievements have been made in applying artificial intelligence (AI) to medium-range weather forecasting. However, existing AI-based weather forecasting models rely on analysis or reanalysis products from traditional numerical weather prediction (NWP) systems as initial conditions for making predictions. Initial s…

    Submitted 8 April, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  23. arXiv:2310.17769  [pdf, other]

    cs.CL cs.AI

    Social Contract AI: Aligning AI Assistants with Implicit Group Norms

    Authors: Jan-Philipp Fränken, Sam Kwok, Peixuan Ye, Kanishk Gandhi, Dilip Arumugam, Jared Moore, Alex Tamkin, Tobias Gerstenberg, Noah D. Goodman

    Abstract: We explore the idea of aligning an AI assistant by inverting a model of users' (unknown) preferences from observed interactions. To validate our proposal, we run proof-of-concept simulations in the economic ultimatum game, formalizing user preferences as policies that guide the actions of simulated players. We find that the AI assistant accurately aligns its behavior to match standard policies fro…

    Submitted 3 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: SoLaR NeurIPS 2023 Workshop (https://solar-neurips.github.io/)

  24. arXiv:2310.07644  [pdf, other]

    cs.AI cs.CL cs.LG

    Rethinking the BERT-like Pretraining for DNA Sequences

    Authors: Chaoqi Liang, Weiqiang Bai, Lifeng Qiao, Yuchen Ren, Jianle Sun, Peng Ye, Hongliang Yan, Xinzhu Ma, Wangmeng Zuo, Wanli Ouyang

    Abstract: With the success of large-scale pretraining in NLP, there is an increasing trend of applying it to the domain of life sciences. In particular, pretraining methods based on DNA sequences have garnered growing attention due to their potential to capture generic information about genes. However, existing pretraining methods for DNA sequences largely rely on direct adoptions of BERT pretraining from N…

    Submitted 11 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  25. arXiv:2309.11268  [pdf, other]

    cs.CV

    StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding

    Authors: Renqiu Xia, Bo Zhang, Haoyang Peng, Hancheng Ye, Xiangchao Yan, Peng Ye, Botian Shi, Yu Qiao, Junchi Yan

    Abstract: Charts are common in literature across different scientific fields, conveying rich information easily accessible to readers. Current chart-related tasks focus on either chart perception which refers to extracting information from the visual charts, or performing reasoning given the extracted data, e.g. in a tabular form. In this paper, we aim to establish a unified and label-efficient learning par…

    Submitted 18 February, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: SimChart9K is available for downloading at: https://github.com/UniModal4Reasoning/SimChart9K 26 pages, 15 figures

  26. arXiv:2308.13772  [pdf, other]

    cs.CV

    Boosting Residual Networks with Group Knowledge

    Authors: Shengji Tang, Peng Ye, Baopu Li, Weihao Lin, Tao Chen, Tong He, Chong Yu, Wanli Ouyang

    Abstract: Recent research understands the residual networks from a new perspective of the implicit ensemble model. From this view, previous methods such as stochastic depth and stimulative training have further improved the performance of the residual network by sampling and training of its subnets. However, they both use the same supervision for all subnets of different capacities and neglect the valuable…

    Submitted 14 December, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted by AAAI2024

  27. arXiv:2308.08721  [pdf, other]

    cs.CV

    RFD-ECNet: Extreme Underwater Image Compression with Reference to Feature Dictionary

    Authors: Mengyao Li, Liquan Shen, Peng Ye, Guorui Feng, Zheyin Wang

    Abstract: Thriving underwater applications demand efficient extreme compression technology to realize the transmission of underwater images (UWIs) in very narrow underwater bandwidth. However, existing image compression methods achieve inferior performance on UWIs because they do not consider the characteristics of UWIs: (1) Multifarious underwater styles of color shift and distance-dependent clarity, cause…

    Submitted 16 August, 2023; originally announced August 2023.

  28. arXiv:2308.06093  [pdf, other]

    cs.CV cs.LG

    Experts Weights Averaging: A New General Training Scheme for Vision Transformers

    Authors: Yongqi Huang, Peng Ye, Xiaoshui Huang, Sheng Li, Tao Chen, Tong He, Wanli Ouyang

    Abstract: Structural re-parameterization is a general training scheme for Convolutional Neural Networks (CNNs), which achieves performance improvement without increasing inference cost. As Vision Transformers (ViTs) are gradually surpassing CNNs in various visual tasks, one may question: if a training scheme specifically for ViTs exists that can also achieve performance improvement without increasing infere…

    Submitted 25 August, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: 12 pages, 2 figures

  29. arXiv:2306.08964  [pdf, other]

    cs.CV

    Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification

    Authors: Jingyi Zhou, Jiamu Sheng, Jiayuan Fan, Peng Ye, Tong He, Bin Wang, Tao Chen

    Abstract: The effectiveness of spectral-spatial feature learning is crucial for the hyperspectral image (HSI) classification task. Diffusion models, as a new class of groundbreaking generative models, have the ability to learn both contextual semantics and textual details from the distinct timestep dimension, enabling the modeling of complex spectral-spatial relations in HSIs. However, existing diffusion-ba…

    Submitted 3 June, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  30. arXiv:2305.02507  [pdf, other]

    cs.LG cs.AI cs.CV

    Stimulative Training++: Go Beyond The Performance Limits of Residual Networks

    Authors: Peng Ye, Tong He, Shengji Tang, Baopu Li, Tao Chen, Lei Bai, Wanli Ouyang

    Abstract: Residual networks have shown great success and become indispensable in recent deep neural network models. In this work, we aim to re-investigate the training process of residual networks from a novel social psychology perspective of loafing, and further propose a new training scheme as well as three improved strategies for boosting residual networks beyond their performance limits. Previous resear…

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.04153

  31. arXiv:2302.11868  [pdf, ps, other]

    cs.CV

    A2S-NAS: Asymmetric Spectral-Spatial Neural Architecture Search For Hyperspectral Image Classification

    Authors: Lin Zhan, Jiayuan Fan, Peng Ye, Jianjian Cao

    Abstract: Existing deep learning-based hyperspectral image (HSI) classification works still suffer from the limitation of the fixed-sized receptive field, leading to difficulties in distinctive spectral-spatial features for ground objects with various sizes and arbitrary shapes. Meanwhile, plenty of previous works ignore asymmetric spectral-spatial dimensions in HSI. To address the above issues, we propose…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by 48th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

  32. arXiv:2302.09838  [pdf, other]

    cs.CV cs.LG

    JNDMix: JND-Based Data Augmentation for No-reference Image Quality Assessment

    Authors: Jiamu Sheng, Jiayuan Fan, Peng Ye, Jianjian Cao

    Abstract: Despite substantial progress in no-reference image quality assessment (NR-IQA), previous training models often suffer from over-fitting due to the limited scale of used datasets, resulting in model performance bottlenecks. To tackle this challenge, we explore the potential of leveraging data augmentation to improve data efficiency and enhance model robustness. However, most existing data augmentat…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted by 48th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2023)

  33. arXiv:2301.06393  [pdf, other]

    cs.LG cs.AI

    $β$-DARTS++: Bi-level Regularization for Proxy-robust Differentiable Architecture Search

    Authors: Peng Ye, Tong He, Baopu Li, Tao Chen, Lei Bai, Wanli Ouyang

    Abstract: Neural Architecture Search has attracted increasing attention in recent years. Among them, differential NAS approaches such as DARTS, have gained popularity for the search efficiency. However, they still suffer from three main issues, that are, the weak stability due to the performance collapse, the poor generalization ability of the searched architectures, and the inferior robustness to different…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.01665

  34. arXiv:2210.06771  [pdf, other]

    cs.LG

    Feature Reconstruction Attacks and Countermeasures of DNN training in Vertical Federated Learning

    Authors: Peng Ye, Zhifeng Jiang, Wei Wang, Bo Li, Baochun Li

    Abstract: Federated learning (FL) has increasingly been deployed, in its vertical form, among organizations to facilitate secure collaborative training over siloed data. In vertical FL (VFL), participants hold disjoint features of the same set of sample instances. Among them, only one has labels. This participant, known as the active party, initiates the training and interacts with the other participants, k…

    Submitted 13 October, 2022; originally announced October 2022.

  35. arXiv:2210.04153  [pdf, other]

    cs.CV

    Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing

    Authors: Peng Ye, Shengji Tang, Baopu Li, Tao Chen, Wanli Ouyang

    Abstract: Residual networks have shown great success and become indispensable in today's deep models. In this work, we aim to re-investigate the training process of residual networks from a novel social psychology perspective of loafing, and further propose a new training strategy to strengthen the performance of residual networks. As residual networks can be viewed as ensembles of relatively shallow networ…

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: NIPS2022 accept

  36. arXiv:2209.08779  [pdf, other]

    cs.AI cs.DB cs.LO

    Neural-Symbolic Entangled Framework for Complex Query Answering

    Authors: Zezhong Xu, Wen Zhang, Peng Ye, Hui Chen, Huajun Chen

    Abstract: Answering complex queries over knowledge graphs (KG) is an important yet challenging task because of the KG incompleteness issue and cascading errors during reasoning. Recent query embedding (QE) approaches to embed the entities and relations in a KG and the first-order logic (FOL) queries into a low dimensional space, answering queries by dense similarity search. However, previous works mainly co…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Paper accepted by NeurIPS2022

  37. arXiv:2209.05815  [pdf, other]

    cs.LO

    Ruleformer: Context-aware Differentiable Rule Mining over Knowledge Graph

    Authors: Zezhong Xu, Peng Ye, Hui Chen, Meng Zhao, Huajun Chen, Wen Zhang

    Abstract: Rule mining is an effective approach for reasoning over knowledge graph (KG). Existing works mainly concentrate on mining rules. However, there might be several rules that could be applied for reasoning for one relation, and how to select appropriate rules for completion of different triples has not been discussed. In this paper, we propose to take the context information into consideration, which…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: COLING 2022

  38. arXiv:2208.14812  [pdf, other]

    cs.SD eess.AS

    Domain Shift-oriented Machine Anomalous Sound Detection Model Based on Self-Supervised Learning

    Authors: Jing-ke Yan, Xin Wang, Qin Wang, Qin Qin, Huang-he Li, Peng-fei Ye, Yue-ping He, Jing Zeng

    Abstract: Thanks to the development of deep learning, research on machine anomalous sound detection based on self-supervised learning has made remarkable achievements. However, there are differences in the acoustic characteristics of the test set and the training set under different operating conditions of the same machine (domain shifts). It is challenging for the existing detection methods to learn the do…

    Submitted 7 September, 2022; v1 submitted 31 August, 2022; originally announced August 2022.

  39. arXiv:2208.05271  [pdf, other]

    cs.CV cs.AI

    Efficient Joint-Dimensional Search with Solution Space Regularization for Real-Time Semantic Segmentation

    Authors: Peng Ye, Baopu Li, Tao Chen, Jiayuan Fan, Zhen Mei, Chen Lin, Chongyan Zuo, Qinghua Chi, Wanli Ouyang

    Abstract: Semantic segmentation is a popular research topic in computer vision, and many efforts have been made on it with impressive results. In this paper, we intend to search an optimal network structure that can run in real-time for this problem. Towards this goal, we jointly search the depth, channel, dilation rate and feature spatial resolution, which results in a search space consisting of about 2.78…

    Submitted 10 August, 2022; originally announced August 2022.

  40. arXiv:2204.04916  [pdf, ps, other]

    cs.CL

    A Token-level Contrastive Framework for Sign Language Translation

    Authors: Biao Fu, Peigen Ye, Liang Zhang, Pei Yu, Cong Hu, Yidong Chen, Xiaodong Shi

    Abstract: Sign Language Translation (SLT) is a promising technology to bridge the communication gap between the deaf and the hearing people. Recently, researchers have adopted Neural Machine Translation (NMT) methods, which usually require large-scale corpus for training, to achieve SLT. However, the publicly available SLT corpus is very limited, which causes the collapse of the token representations and th…

    Submitted 21 March, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted to ICASSP 2023

  41. arXiv:2203.01665  [pdf, other]

    cs.LG cs.AI

    $β$-DARTS: Beta-Decay Regularization for Differentiable Architecture Search

    Authors: Peng Ye, Baopu Li, Yikang Li, Tao Chen, Jiayuan Fan, Wanli Ouyang

    Abstract: Neural Architecture Search (NAS) has attracted increasingly more attention in recent years because of its capability to design deep neural networks automatically. Among them, differential NAS approaches such as DARTS, have gained popularity for the search efficiency. However, they suffer from two main issues, the weak robustness to the performance collapse and the poor generalization ability of th…

    Submitted 3 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: CVPR2022

  42. arXiv:2201.12725  [pdf, other]

    cs.CV

    Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search

    Authors: Bicheng Guo, Tao Chen, Shibo He, Haoyu Liu, Lilin Xu, Peng Ye, Jiming Chen

    Abstract: Neural Architecture Search (NAS) is a powerful tool for automating effective image processing DNN designing. The ranking has been advocated to design an efficient performance predictor for NAS. The previous contrastive method solves the ranking problem by comparing pairs of architectures and predicting their relative performance. However, it only focuses on the rankings between two involved archit…

    Submitted 12 July, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

  43. arXiv:2107.13467  [pdf, other]

    cs.CV cs.AI cs.LG

    Recursively Conditional Gaussian for Ordinal Unsupervised Domain Adaptation

    Authors: Xiaofeng Liu, Site Li, Yubin Ge, Pengyi Ye, Jane You, Jun Lu

    Abstract: The unsupervised domain adaptation (UDA) has been widely adopted to alleviate the data scalability issue, while the existing works usually focus on classifying independently discrete labels. However, in many tasks (e.g., medical diagnosis), the labels are discrete and successively distributed. The UDA for ordinal classification requires inducing non-trivial ordinal distribution prior to the latent…

    Submitted 17 August, 2021; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: Accepted to ICCV 2021 (Oral)

  44. arXiv:2107.05672  [pdf, other]

    cs.DS

    In-Database Regression in Input Sparsity Time

    Authors: Rajesh Jayaram, Alireza Samadian, David P. Woodruff, Peng Ye

    Abstract: Sketching is a powerful dimensionality reduction technique for accelerating algorithms for data analysis. A crucial step in sketching methods is to compute a subspace embedding (SE) for a large matrix $\mathbf{A} \in \mathbb{R}^{N \times d}$. SE's are the primary tool for obtaining extremely efficient solutions for many linear-algebraic tasks, such as least squares regression and low rank approxim…

    Submitted 12 July, 2021; originally announced July 2021.

  45. arXiv:2107.01335  [pdf, other]

    cs.CC cs.DS cs.LG

    Average-Case Communication Complexity of Statistical Problems

    Authors: Cyrus Rashtchian, David P. Woodruff, Peng Ye, Hanlin Zhu

    Abstract: We study statistical problems, such as planted clique, its variants, and sparse principal component analysis in the context of average-case communication complexity. Our motivation is to understand the statistical-computational trade-offs in streaming, sketching, and query-based models. Communication complexity is the main tool for proving lower bounds in these models, yet many prior results do no…

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: 28 pages. Conference on Learning Theory (COLT), 2021

  46. arXiv:2105.13753  [pdf]

    cs.CV

    New Encoder Learning for Captioning Heavy Rain Images via Semantic Visual Feature Matching

    Authors: Chang-Hwan Son, Pung-Hwi Ye

    Abstract: Image captioning generates text that describes scenes from input images. It has been developed for high quality images taken in clear weather. However, in bad weather conditions, such as heavy rain, snow, and dense fog, the poor visibility owing to rain streaks, rain accumulation, and snowflakes causes a serious degradation of image quality. This hinders the extraction of useful visual features an…

    Submitted 15 September, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Journal ref: Journal of Imaging Science and Technology, Sept. 2021

  47. arXiv:2002.05364  [pdf, other]

    eess.SP cs.LG

    Fast Reinforcement Learning for Anti-jamming Communications

    Authors: Pei-Gen Ye, Yuan-Gen Wang, Jin Li, Liang Xiao

    Abstract: This letter presents a fast reinforcement learning algorithm for anti-jamming communications which chooses previous action with probability $τ$ and applies $ε$-greedy with probability $(1-τ)$. A dynamic threshold based on the average value of previous several actions is designed and probability $τ$ is formulated as a Gaussian-like function to guide the wireless devices. As a concrete example, the…

    Submitted 13 February, 2020; originally announced February 2020.

  48. arXiv:2002.03322  [pdf, other]

    cs.CV cs.MM

    VIFB: A Visible and Infrared Image Fusion Benchmark

    Authors: Xingchen Zhang, Ping Ye, Gang Xiao

    Abstract: Visible and infrared image fusion is one of the most important areas in image processing due to its numerous applications. While much progress has been made in recent years with efforts on developing fusion algorithms, there is a lack of code library and benchmark which can gauge the state-of-the-art. In this paper, after briefly reviewing recent advances of visible and infrared image fusion, we p…

    Submitted 20 July, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: 11 pages, 5 figures, 5 tables. Accepted to CVPRW2020. Compared to the CVPRW2020 version, this version corrects minor mistakes in Table 4 and the first paragraph of Section 4.2

  49. arXiv:1410.8553  [pdf, other]

    cs.CL cs.LG stat.ML

    A random forest system combination approach for error detection in digital dictionaries

    Authors: Michael Bloodgood, Peng Ye, Paul Rodrigues, David Zajic, David Doermann

    Abstract: When digitizing a print bilingual dictionary, whether via optical character recognition or manual entry, it is inevitable that errors are introduced into the electronic version that is created. We investigate automating the process of detecting errors in an XML representation of a digitized print dictionary using a hybrid approach that combines rule-based, feature-based, and language model-based m…

    Submitted 30 October, 2014; originally announced October 2014.

    Comments: 9 pages, 7 figures, 10 tables; appeared in Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, April 2012

    ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

    Journal ref: In Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, pages 78-86, Avignon, France, April 2012. Association for Computational Linguistics

  50. arXiv:1410.8149  [pdf]

    cs.CL cs.LG

    Detecting Structural Irregularity in Electronic Dictionaries Using Language Modeling

    Authors: Paul Rodrigues, David Zajic, David Doermann, Michael Bloodgood, Peng Ye

    Abstract: Dictionaries are often developed using tools that save to Extensible Markup Language (XML)-based standards. These standards often allow high-level repeating elements to represent lexical entries, and utilize descendants of these repeating elements to represent the structure within each lexical entry, in the form of an XML tree. In many cases, dictionaries are published that have errors and inconsi…

    Submitted 29 October, 2014; originally announced October 2014.

    Comments: 6 pages, 2 figures, 11 tables; appeared in Proceedings of Electronic Lexicography in the 21st Century (eLex), November 2011

    ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

    Journal ref: In Proceedings of Electronic Lexicography in the 21st Century (eLex), pages 227-232, Bled, Slovenia, November 2011. Trojina Institute for Applied Slovene Studies