Skip to main content

Showing 1–50 of 137 results for author: Wei, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16564  [pdf, other

    cs.CV

    FASTC: A Fast Attentional Framework for Semantic Traversability Classification Using Point Cloud

    Authors: Yirui Chen, Peng** Wei, Zhenhuan Liu, Bingchao Wang, Jie Yang, Wei Liu

    Abstract: Producing traversability maps and understanding the surroundings are crucial prerequisites for autonomous navigation. In this paper, we address the problem of traversability assessment using point clouds. We propose a novel pillar feature extraction module that utilizes PointNet to capture features from point clouds organized in vertical volume and a 2D encoder-decoder structure to conduct travers… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted to ECAI2023 Our code is publicly available at [this](https://github.com/chenyirui/FASTC)

  2. arXiv:2406.14282  [pdf, other

    cs.CL cs.AI

    Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

    Authors: Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, **jie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen

    Abstract: Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fin… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Work in progress

  3. arXiv:2406.03712  [pdf, other

    cs.CL cs.LG

    A Survey on Medical Large Language Models: Technology, Application, Trustworthiness, and Future Directions

    Authors: Lei Liu, Xiaoyan Yang, Junchi Lei, Xiaoyang Liu, Yue Shen, Zhiqiang Zhang, Peng Wei, **jie Gu, Zhixuan Chu, Zhan Qin, Kui Ren

    Abstract: Large language models (LLMs), such as GPT series models, have received substantial attention due to their impressive capabilities for generating and understanding human-level language. More recently, LLMs have emerged as an innovative and powerful adjunct in the medical field, transforming traditional practices and heralding a new era of enhanced healthcare services. This survey provides a compreh… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2406.00632  [pdf, other

    cs.CV

    Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

    Authors: Yukai Shi, Yupei Lin, Pengxu Wei, Xiaoyu Xian, Tianshui Chen, Liang Lin

    Abstract: Recently, researchers have proposed various deep learning methods to accurately detect infrared targets with the characteristics of indistinct shape and texture. Due to the limited variety of infrared datasets, training deep learning models with good generalization poses a challenge. To augment the infrared dataset, researchers employ data augmentation techniques, which often involve generating ne… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  5. arXiv:2405.11459  [pdf, other

    eess.SP cs.CL q-bio.NC

    Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals

    Authors: Hui Zheng, Hai-Teng Wang, Wei-Bang Jiang, Zhong-Tao Chen, Li He, Pei-Yang Lin, Peng-Hu Wei, Guo-Guang Zhao, Yun-Zhe Liu

    Abstract: Invasive brain-computer interfaces have garnered significant attention due to their high performance. The current intracranial stereoElectroEncephaloGraphy (sEEG) foundation models typically build univariate representations based on a single channel. Some of them further use Transformer to model the relationship among channels. However, due to the locality and specificity of brain computation, the… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  6. arXiv:2405.04336  [pdf, other

    cs.AI

    Temporal and Heterogeneous Graph Neural Network for Remaining Useful Life Prediction

    Authors: Zhihao Wen, Yuan Fang, Pengcheng Wei, Fayao Liu, Zhenghua Chen, Min Wu

    Abstract: Predicting Remaining Useful Life (RUL) plays a crucial role in the prognostics and health management of industrial systems that involve a variety of interrelated sensors. Given a constant stream of time series sensory data from such systems, deep learning models have risen to prominence at identifying complex, nonlinear temporal dependencies in these data. In addition to the temporal dependencies… ▽ More

    Submitted 1 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: 12 pages

  7. arXiv:2405.03967  [pdf, other

    cs.LG cs.AI cs.AR

    SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems

    Authors: Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani

    Abstract: Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To overcome this, SwiftRL explores Processing-In-Memory (PIM) architectures to accelerate RL workloads. We achieve near-linear performance scaling by implementing… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  8. arXiv:2405.00542  [pdf, other

    eess.IV cs.CV

    UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement

    Authors: Ruiquan Ge, Zhaojie Fang, Pengxue Wei, Zhanghao Chen, Hongyang Jiang, Ahmed Elazab, Wangting Li, Xiang Wan, Shaochong Zhang, Changmiao Wang

    Abstract: Fundus photography, in combination with the ultra-wide-angle fundus (UWF) techniques, becomes an indispensable diagnostic tool in clinical settings by offering a more comprehensive view of the retina. Nonetheless, UWF fluorescein angiography (UWF-FA) necessitates the administration of a fluorescent dye via injection into the patient's hand or elbow unlike UWF scanning laser ophthalmoscopy (UWF-SLO… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  9. arXiv:2404.14309  [pdf, other

    cs.CV

    Towards Better Adversarial Purification via Adversarial Denoising Diffusion Training

    Authors: Yiming Liu, Kezhao Liu, Yao Xiao, Ziyi Dong, Xiaogang Xu, Pengxu Wei, Liang Lin

    Abstract: Recently, diffusion-based purification (DBP) has emerged as a promising approach for defending against adversarial attacks. However, previous studies have used questionable methods to evaluate the robustness of DBP models, their explanations of DBP robustness also lack experimental support. We re-examine DBP robustness using precise gradient, and discuss the impact of stochasticity on DBP robustne… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  10. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, **hua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  11. arXiv:2404.09263  [pdf, other

    cs.CV cs.AI

    Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection

    Authors: ** Yang, ** Wei, Huan Li, Ziyang Ren

    Abstract: Video moment retrieval and highlight detection are two highly valuable tasks in video understanding, but until recently they have been jointly studied. Although existing studies have made impressive advancement recently, they predominantly follow the data-driven bottom-up paradigm. Such paradigm overlooks task-specific and inter-task effects, resulting in poor model performance. In this paper, we… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  12. arXiv:2403.16131  [pdf, other

    cs.CV

    Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement

    Authors: Xiuquan Hou, Meiqin Liu, Senlin Zhang, ** Wei, Badong Chen

    Abstract: DETR-like methods have significantly increased detection performance in an end-to-end manner. The mainstream two-stage frameworks of them perform dense self-attention and select a fraction of queries for sparse cross-attention, which is proven effective for improving performance but also introduces a heavy computational burden and high dependence on stable query selection. This paper demonstrates… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  13. IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images

    Authors: Meilin Wang, Yexing Song, Pengxu Wei, Xiaoyu Xian, Yukai Shi, Liang Lin

    Abstract: Deep learning technologies have demonstrated their effectiveness in removing cloud cover from optical remote-sensing images. Convolutional Neural Networks (CNNs) exert dominance in the cloud removal tasks. However, constrained by the inherent limitations of convolutional operations, CNNs can address only a modest fraction of cloud occlusion. In recent years, diffusion models have achieved state-of… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE TGRS, we first present an iterative diffusion process for cloud removal, the code is available at: https://github.com/SongYxing/IDF-CR

  14. arXiv:2403.11852  [pdf, other

    cs.RO cs.AI

    Reinforcement Learning with Latent State Inference for Autonomous On-ramp Merging under Observation Delay

    Authors: Amin Tabrizian, Zhitong Huang, Peng Wei

    Abstract: This paper presents a novel approach to address the challenging problem of autonomous on-ramp merging, where a self-driving vehicle needs to seamlessly integrate into a flow of vehicles on a multi-lane highway. We introduce the Lane-kee**, Lane-changing with Latent-state Inference and Safety Controller (L3IS) agent, designed to perform the on-ramp merging task safely without comprehensive knowle… ▽ More

    Submitted 21 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  15. arXiv:2402.13769  [pdf, other

    cs.IR

    General Debiasing for Graph-based Collaborative Filtering via Adversarial Graph Dropout

    Authors: An Zhang, Wenchang Ma, Pengbo Wei, Leheng Sheng, Xiang Wang

    Abstract: Graph neural networks (GNNs) have shown impressive performance in recommender systems, particularly in collaborative filtering (CF). The key lies in aggregating neighborhood information on a user-item interaction graph to enhance user/item representations. However, we have discovered that this aggregation mechanism comes with a drawback, which amplifies biases present in the interaction graph. For… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to WWW 2024

  16. arXiv:2401.06992  [pdf, other

    cs.CV cs.AI

    Progressive Feature Fusion Network for Enhancing Image Quality Assessment

    Authors: Kaiqun Wu, Xiaoling Jiang, Rui Yu, Yonggang Luo, Tian Jiang, Xi Wu, Peng Wei

    Abstract: Image compression has been applied in the fields of image storage and video broadcasting. However, it's formidably tough to distinguish the subtle quality differences between those distorted images generated by different algorithms. In this paper, we propose a new image quality assessment framework to decide which image is better in an image group. To capture the subtle differences, a fine-grained… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Data Compression Conference

  17. arXiv:2312.10299  [pdf, other

    cs.CV cs.AI cs.LG

    Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge

    Authors: Conghan Yue, Zhengwei Peng, Junlong Ma, Shiyan Du, Pengxu Wei, Dongyu Zhang

    Abstract: Diffusion models exhibit powerful generative capabilities enabling noise map** to data via reverse stochastic differential equations. However, in image restoration, the focus is on the map** relationship from low-quality to high-quality images. Regarding this issue, we introduce the Generalized Ornstein-Uhlenbeck Bridge (GOUB) model. By leveraging the natural mean-reverting property of the gen… ▽ More

    Submitted 17 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: ICML 2024

  18. arXiv:2311.05374  [pdf, other

    cs.CL cs.AI

    TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs

    Authors: Shuyi Xie, Wenlin Yao, Yong Dai, Shaobo Wang, Donlin Zhou, Lifeng **, Xinhua Feng, Pengzhi Wei, Yujie Lin, Zhichao Hu, Dong Yu, Zhengyou Zhang, **g Nie, Yuhong Liu

    Abstract: Large language models (LLMs) have shown impressive capabilities across various natural language tasks. However, evaluating their alignment with human preferences remains a challenge. To this end, we propose a comprehensive human evaluation framework to assess LLMs' proficiency in following instructions on diverse real-world tasks. We construct a hierarchical task tree encompassing 7 major areas co… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  19. arXiv:2310.15138  [pdf, other

    cs.RO cs.CV

    Fusion-Driven Tree Reconstruction and Fruit Localization: Advancing Precision in Agriculture

    Authors: Kaiming Fu, Peng Wei, Juan Villacres, Zhaodan Kong, Stavros G. Vougioukas, Brian N. Bailey

    Abstract: Fruit distribution is pivotal in sha** the future of both agriculture and agricultural robotics, paving the way for a streamlined supply chain. This study introduces an innovative methodology that harnesses the synergy of RGB imagery, LiDAR, and IMU data, to achieve intricate tree reconstructions and the pinpoint localization of fruits. Such integration not only offers insights into the fruit di… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: This work was presented at IEEE/RSI International Conference on Intelligent Robots and Systems (IROS) Workshop

  20. arXiv:2309.04803  [pdf, other

    cs.CV cs.AI

    Towards Real-World Burst Image Super-Resolution: Benchmark and Method

    Authors: Pengxu Wei, Yu**g Sun, Xingbei Guo, Chang Liu, Jie Chen, Xiangyang Ji, Liang Lin

    Abstract: Despite substantial advances, single-image super-resolution (SISR) is always in a dilemma to reconstruct high-quality images with limited information from one input image, especially in realistic scenarios. In this paper, we establish a large-scale real-world burst super-resolution dataset, i.e., RealBSR, to explore the faithful reconstruction of image details from multiple frames. Furthermore, we… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV2023

  21. arXiv:2308.15016  [pdf, other

    cs.CV

    C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model

    Authors: Longbin Ji, Pengfei Wei, Yi Ren, **glin Liu, Chen Zhang, Xiang Yin

    Abstract: Co-speech gesture generation is crucial for automatic digital avatar animation. However, existing methods suffer from issues such as unstable training and temporal inconsistency, particularly in generating high-fidelity and comprehensive gestures. Additionally, these methods lack effective control over speaker identity and temporal editing of the generated gestures. Focusing on capturing temporal… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 12 pages, 6 figures, 7 tables

  22. arXiv:2308.02263  [pdf, other

    cs.SD cs.CL eess.AS

    Efficient Monaural Speech Enhancement using Spectrum Attention Fusion

    Authors: **yu Long, Jetic Gū, Binhao Bai, Zhibo Yang, ** Wei, Junli Li

    Abstract: Speech enhancement is a demanding task in automated speech processing pipelines, focusing on separating clean speech from noisy channels. Transformer based models have recently bested RNN and CNN models in speech enhancement, however at the same time they are much more computationally expensive and require much more high quality training data, which is always hard to come by. In this paper, we pre… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  23. arXiv:2308.01117  [pdf

    cs.RO eess.SY

    Optimization-Based Motion Planning for Autonomous Agricultural Vehicles Turning in Constrained Headlands

    Authors: Chen Peng, Peng Wei, Zhenghao Fei, Yuankai Zhu, Stavros G. Vougioukas

    Abstract: Headland maneuvering is a crucial aspect of unmanned field operations for autonomous agricultural vehicles (AAVs). While motion planning for headland turning in open fields has been extensively studied and integrated into commercial auto-guidance systems, the existing methods primarily address scenarios with ample headland space and thus may not work in more constrained headland geometries. Commer… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  24. arXiv:2307.16242  [pdf, other

    cs.CV

    SR-R$^2$KAC: Improving Single Image Defocus Deblurring

    Authors: Peng Tang, Zhiqiang Xu, Pengfei Wei, Xiaobin Hu, Peilin Zhao, Xin Cao, Chunlai Zhou, Tobias Lasser

    Abstract: We propose an efficient deep learning method for single image defocus deblurring (SIDD) by further exploring inverse kernel properties. Although the current inverse kernel method, i.e., kernel-sharing parallel atrous convolution (KPAC), can address spatially varying defocus blurs, it has difficulty in handling large blurs of this kind. To tackle this issue, we propose a Residual and Recursive Ke… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Submitted to IEEE Transactions on Cybernetics on 2023-July-24

  25. arXiv:2307.11530  [pdf, other

    eess.IV cs.CV

    UWAT-GAN: Fundus Fluorescein Angiography Synthesis via Ultra-wide-angle Transformation Multi-scale GAN

    Authors: Zhaojie Fang, Zhanghao Chen, Pengxue Wei, Wangting Li, Shaochong Zhang, Ahmed Elazab, Gangyong Jia, Ruiquan Ge, Changmiao Wang

    Abstract: Fundus photography is an essential examination for clinical and differential diagnosis of fundus diseases. Recently, Ultra-Wide-angle Fundus (UWF) techniques, UWF Fluorescein Angiography (UWF-FA) and UWF Scanning Laser Ophthalmoscopy (UWF-SLO) have been gradually put into use. However, Fluorescein Angiography (FA) and UWF-FA require injecting sodium fluorescein which may have detrimental influence… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 26th International Conference on Medical Image Computing and Computer Assisted Intervention

  26. arXiv:2307.07218  [pdf, other

    eess.AS cs.SD

    Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis

    Authors: Ziyue Jiang, **glin Liu, Yi Ren, **zheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

    Abstract: Zero-shot text-to-speech (TTS) aims to synthesize voices with unseen speech prompts, which significantly reduces the data and computation requirements for voice cloning by skip** the fine-tuning process. However, the prompting mechanisms of zero-shot TTS still face challenges in the following aspects: 1) previous works of zero-shot TTS are typically trained with single-sentence prompts, which si… ▽ More

    Submitted 10 April, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted by ICLR 2024

  27. arXiv:2306.11647  [pdf, ps, other

    cs.RO eess.SY

    Safe and Scalable Real-Time Trajectory Planning Framework for Urban Air Mobility

    Authors: Abenezer Taye, Roberto Valenti, Akshay Rajhans, Anastasia Mavrommati, Pieter J. Mosterman, Peng Wei

    Abstract: This paper presents a real-time trajectory planning framework for Urban Air Mobility (UAM) that is both safe and scalable. The proposed framework employs a decentralized, free-flight concept of operation in which each aircraft independently performs separation assurance and conflict resolution, generating safe trajectories by accounting for the future states of nearby aircraft. The framework consi… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  28. arXiv:2306.00187  [pdf, other

    cs.MA

    AccMER: Accelerating Multi-Agent Experience Replay with Cache Locality-aware Prioritization

    Authors: Kailash Gogineni, Yongsheng Mei, Peng Wei, Tian Lan, Guru Venkataramani

    Abstract: Multi-Agent Experience Replay (MER) is a key component of off-policy reinforcement learning~(RL) algorithms. By remembering and reusing experiences from the past, experience replay significantly improves the stability of RL algorithms and their learning efficiency. In many scenarios, multiple agents interact in a shared environment during online training under centralized training and decentralize… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted to ASAP'23

  29. arXiv:2305.13996  [pdf, other

    cs.RO

    One-Shot Strategically Deconflicted Route and Operational Volume Generation for Urban Air Mobility Operations

    Authors: Ellis L Thompson, Yan Xu, Peng Wei

    Abstract: In the UAM space, strategic deconfliction provides an all-essential layer to airspace automation by providing safe, pre-emptive deconfliction or assignment of airspace resources to airspace users pre-flight. Strategic deconfliction approaches provide an elegant solution to pre-flight deconfliction operations. This overall creates safer and more efficient airspace and reduces the workload on contro… ▽ More

    Submitted 15 August, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 8 pages, 7 Figures

  30. arXiv:2305.13411  [pdf, other

    cs.MA

    Towards Efficient Multi-Agent Learning Systems

    Authors: Kailash Gogineni, Peng Wei, Tian Lan, Guru Venkataramani

    Abstract: Multi-Agent Reinforcement Learning (MARL) is an increasingly important research field that can model and control multiple large-scale autonomous systems. Despite its achievements, existing multi-agent learning methods typically involve expensive computations in terms of training time and power arising from large observation-action space and a huge number of training steps. Therefore, a key challen… ▽ More

    Submitted 23 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at MLArchSys, ISCA 2023. Compared to arXiv:2302.05007, we explore a neighbor sampling strategy to improve the locality of data access within the mini-batch sampling phase. Our preliminary experiments provide performance improvement ranging from 26.66% (3 agents) to 27.39% (12 agents) in the sampling phase training run-time

  31. arXiv:2305.10556  [pdf, other

    cs.AI

    Integrated Conflict Management for UAM with Strategic Demand Capacity Balancing and Learning-based Tactical Deconfliction

    Authors: Shulu Chen, Antony Evans, Marc Brittain, Peng Wei

    Abstract: Urban air mobility (UAM) has the potential to revolutionize our daily transportation, offering rapid and efficient deliveries of passengers and cargo between dedicated locations within and around the urban environment. Before the commercialization and adoption of this emerging transportation mode, however, aviation safety must be guaranteed, i.e., all the aircraft have to be safely separated by st… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  32. arXiv:2305.08328  [pdf, other

    cs.IR cs.LG

    FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning

    Authors: Penghui Wei, Hongjian Dou, Shaoguo Liu, Rongjun Tang, Li Liu, Liang Wang, Bo Zheng

    Abstract: Conversion rate (CVR) estimation aims to predict the probability of conversion event after a user has clicked an ad. Typically, online publisher has user browsing interests and click feedbacks, while demand-side advertising platform collects users' post-click behaviors such as dwell time and conversion decisions. To estimate CVR accurately and protect data privacy better, vertical federated learni… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: SIGIR 2023, Resource Track

  33. arXiv:2305.08293  [pdf, other

    cs.CV cs.MM

    Identity-Preserving Talking Face Generation with Landmark and Appearance Priors

    Authors: Weizhi Zhong, Chaowei Fang, Yinqi Cai, Pengxu Wei, Gangming Zhao, Liang Lin, Guanbin Li

    Abstract: Generating talking face videos from audio attracts lots of research interest. A few person-specific methods can generate vivid videos but require the target speaker's videos for training or fine-tuning. Existing person-generic methods have difficulty in generating realistic and lip-synced videos while preserving identity information. To tackle this problem, we propose a two-stage framework consist… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: CVPR2023, Code: https://github.com/Weizhi-Zhong/IP_LAP

  34. arXiv:2305.05838  [pdf, other

    cs.CV cs.MM

    Generative Steganographic Flow

    Authors: ** Wei, Ge Luo, Qi Song, Xinpeng Zhang, Zhenxing Qian, Sheng Li

    Abstract: Generative steganography (GS) is a new data hiding manner, featuring direct generation of stego media from secret data. Existing GS methods are generally criticized for their poor performances. In this paper, we propose a novel flow based GS approach -- Generative Steganographic Flow (GSF), which provides direct generation of stego images without cover image. We take the stego image generation and… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: The accepted paper in ICME 2022

  35. arXiv:2305.03472  [pdf, other

    cs.MM cs.AI

    Generative Steganography Diffusion

    Authors: ** Wei, Qing Zhou, Zichi Wang, Zhenxing Qian, Xinpeng Zhang, Sheng Li

    Abstract: Generative steganography (GS) is an emerging technique that generates stego images directly from secret data. Various GS methods based on GANs or Flow have been developed recently. However, existing GAN-based GS methods cannot completely recover the hidden secret data due to the lack of network invertibility, while Flow-based methods produce poor image quality due to the stringent reversibility re… ▽ More

    Submitted 6 September, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Shall not be reproduced without permission, rights reserved!

  36. arXiv:2304.00040  [pdf

    cs.LG

    A robust deep learning-based damage identification approach for SHM considering missing data

    Authors: Fan Deng, Xiaoming Tao, Pengxiang Wei, Shiyin Wei

    Abstract: Data-driven method for Structural Health Monitoring (SHM), that mine the hidden structural performance from the correlations among monitored time series data, has received widely concerns recently. However, missing data significantly impacts the conduction of this method. Missing data is a frequently encountered issue in time series data in SHM and many other real-world applications, that harms to… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

  37. arXiv:2303.07693  [pdf, other

    cs.LG cs.AI

    Adaptive Policy Learning for Offline-to-Online Reinforcement Learning

    Authors: Han Zheng, Xufang Luo, Pengfei Wei, Xuan Song, Dongsheng Li, **g Jiang

    Abstract: Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is impractical when online interactions are costly. Offline RL provides an alternative solution by directly learning from the previously collected dataset. However, it will yield unsatisfactory performance if the quality of the offline datasets is poor. In this paper, we consider an offline-to-online setting… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: AAAI2023

  38. arXiv:2303.03052  [pdf, other

    cs.CV cs.LG

    Masked Images Are Counterfactual Samples for Robust Fine-tuning

    Authors: Yao Xiao, Ziyi Tang, Pengxu Wei, Cong Liu, Liang Lin

    Abstract: Deep learning models are challenged by the distribution shift between the training data and test data. Recently, the large models pre-trained on diverse data have demonstrated unprecedented robustness to various distribution shifts. However, fine-tuning these models can lead to a trade-off between in-distribution (ID) performance and out-of-distribution (OOD) robustness. Existing methods for tackl… ▽ More

    Submitted 2 April, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023 (v2: improve the clarity; v3: camera ready version)

  39. arXiv:2302.10418  [pdf, other

    cs.LG cs.AI cs.MA

    MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization

    Authors: Yongsheng Mei, Hanhan Zhou, Tian Lan, Guru Venkataramani, Peng Wei

    Abstract: Experience replay is crucial for off-policy reinforcement learning (RL) methods. By remembering and reusing the experiences from past different policies, experience replay significantly improves the training efficiency and stability of RL algorithms. Many decision-making problems in practice naturally involve multiple agents and require multi-agent reinforcement learning (MARL) under centralized t… ▽ More

    Submitted 27 February, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: The 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023). arXiv admin note: text overlap with arXiv:2302.05593

  40. arXiv:2302.05007  [pdf, other

    cs.MA

    Scalability Bottlenecks in Multi-Agent Reinforcement Learning Systems

    Authors: Kailash Gogineni, Peng Wei, Tian Lan, Guru Venkataramani

    Abstract: Multi-Agent Reinforcement Learning (MARL) is a promising area of research that can model and control multiple, autonomous decision-making agents. During online training, MARL algorithms involve performance-intensive computations such as exploration and exploitation phases originating from large observation-action space belonging to multiple agents. In this article, we seek to characterize the scal… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  41. arXiv:2302.02636  [pdf, other

    cs.IR cs.LG

    Hybrid Contrastive Constraints for Multi-Scenario Ad Ranking

    Authors: Shanlei Mu, Penghui Wei, Wayne Xin Zhao, Shaoguo Liu, Liang Wang, Bo Zheng

    Abstract: Multi-scenario ad ranking aims at leveraging the data from multiple domains or channels for training a unified ranking model to improve the performance at each individual scenario. Although the research on this task has made important progress, it still lacks the consideration of cross-scenario relations, thus leading to limitation in learning capability and difficulty in interrelation modeling. I… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 10 pages, 5 figures

  42. arXiv:2302.02592  [pdf, other

    cs.IR

    RLTP: Reinforcement Learning to Pace for Delayed Impression Modeling in Preloaded Ads

    Authors: Penghui Wei, Yongqiang Chen, Shaoguo Liu, Liang Wang, Bo Zheng

    Abstract: To increase brand awareness, many advertisers conclude contracts with advertising platforms to purchase traffic and then deliver advertisements to target audiences. In a whole delivery period, advertisers usually desire a certain impression count for the ads, and they also expect that the delivery performance is as good as possible (e.g., obtaining high click-through rate). Advertising platforms e… ▽ More

    Submitted 16 June, 2024; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: KDD 2023 (Applied Data Science Track). The first two authors contributed equally

  43. arXiv:2302.02293  [pdf, other

    cs.RO

    Autonomous Exploration Method for Fast Unknown Environment Map** by Using UAV Equipped with Limited FOV Sensor

    Authors: Yinghao Zhao, Li Yan, Hong Xie, Jicheng Dai, Pengcheng Wei

    Abstract: Autonomous exploration is one of the important parts to achieve the fast autonomous map** and target search. However, most of the existing methods are facing low-efficiency problems caused by low-quality trajectory or back-and-forth maneuvers. To improve the exploration efficiency in unknown environments, a fast autonomous exploration planner (FAEP) is proposed in this paper. Different from exis… ▽ More

    Submitted 4 February, 2023; originally announced February 2023.

    Comments: 10 pages,10 figures. arXiv admin note: substantial text overlap with arXiv:2202.12507

  44. A Framework for Operational Volume Generation for Urban Air Mobility Strategic Deconfliction

    Authors: Ellis Lee Thompson, Yan Xu, Peng Wei

    Abstract: Strategic pre-flight functions focus on the planning and deconfliction of routes for aircraft systems. The urban air mobility concept calls for higher levels of autonomy with onboard and en route functions but also strategic and pre-flight systems. Existing endeavours into strategic pre-flight functions focus on improving the route generation and strategic deconfliction of these routes. Introduced… ▽ More

    Submitted 27 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 Figures

    Journal ref: 2023 International Conference on Unmanned Aircraft Systems (ICUAS), Warsaw, Poland, 2023, pp. 71-78

  45. arXiv:2212.08296  [pdf, other

    cs.CV

    DQnet: Cross-Model Detail Querying for Camouflaged Object Detection

    Authors: Wei Sun, Chengao Liu, Linyan Zhang, Yu Li, Pengxu Wei, Chang Liu, Jialing Zou, Jianbin Jiao, Qixiang Ye

    Abstract: Camouflaged objects are seamlessly blended in with their surroundings, which brings a challenging detection task in computer vision. Optimizing a convolutional neural network (CNN) for camouflaged object detection (COD) tends to activate local discriminative regions while ignoring complete object extent, causing the partial activation issue which inevitably leads to missing or redundant regions of… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  46. arXiv:2211.12268  [pdf, other

    cs.CV

    Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation

    Authors: Zesen Cheng, Pengchong Qiao, Kehan Li, Siheng Li, Pengxu Wei, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen

    Abstract: Weakly supervised semantic segmentation is typically inspired by class activation maps, which serve as pseudo masks with class-discriminative regions highlighted. Although tremendous efforts have been made to recall precise and complete locations for each class, existing methods still commonly suffer from the unsolicited Out-of-Candidate (OC) error predictions that not belongs to the label candida… ▽ More

    Submitted 14 March, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted to CVPR2023

  47. arXiv:2211.11337  [pdf, other

    cs.CV cs.CL cs.MM

    DreamArtist: Towards Controllable One-Shot Text-to-Image Generation via Positive-Negative Prompt-Tuning

    Authors: Ziyi Dong, Pengxu Wei, Liang Lin

    Abstract: Large-scale text-to-image generation models have achieved remarkable progress in synthesizing high-quality, feature-rich images with high resolution guided by texts. However, these models often struggle with novel concepts, eg, new styles, object entities, etc. Although recent attempts have employed fine-tuning or prompt-tuning strategies to teach the pre-trained diffusion model novel concepts fro… ▽ More

    Submitted 5 April, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  48. arXiv:2211.11191  [pdf, other

    cs.IR

    Correlative Preference Transfer with Hierarchical Hypergraph Network for Multi-Domain Recommendation

    Authors: Zixuan Xu, Penghui Wei, Shaoguo Liu, Weimin Zhang, Liang Wang, Bo Zheng

    Abstract: Advanced recommender systems usually involve multiple domains (such as scenarios or categories) for various marketing strategies, and users interact with them to satisfy diverse demands. The goal of multi-domain recommendation (MDR) is to improve the recommendation performance of all domains simultaneously. Conventional graph neural network based methods usually deal with each domain separately, o… ▽ More

    Submitted 19 April, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted by WWW 2023 research track. The first two authors contributed equally

  49. arXiv:2211.05658  [pdf, other

    q-bio.QM cs.NE q-bio.NC

    Multi-objective optimization via evolutionary algorithm (MOVEA) for high-definition transcranial electrical stimulation of the human brain

    Authors: Mo Wang, Kexin Lou, Zeming Liu, Pengfei Wei, Quanying Liu

    Abstract: Designing a transcranial electrical stimulation (TES) strategy requires considering multiple objectives, such as intensity in the target area, focality, stimulation depth, and avoidance zone, which are often mutually exclusive. A computational framework for optimizing different strategies and comparing trade-offs between these objectives is currently lacking. In this paper, we propose a general fr… ▽ More

    Submitted 3 April, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Journal ref: NeuroImage, Volume 280, 2020

  50. arXiv:2211.02147  [pdf, other

    eess.SY cs.LG

    A Survey on Reinforcement Learning in Aviation Applications

    Authors: Pouria Razzaghi, Amin Tabrizian, Wei Guo, Shulu Chen, Abenezer Taye, Ellis Thompson, Alexis Bregeon, Ali Baheri, Peng Wei

    Abstract: Compared with model-based control and optimization methods, reinforcement learning (RL) provides a data-driven, learning-based framework to formulate and solve sequential decision-making problems. The RL framework has become promising due to largely improved data availability and computing power in the aviation industry. Many aviation-based applications can be formulated or treated as sequential d… ▽ More

    Submitted 22 November, 2022; v1 submitted 3 November, 2022; originally announced November 2022.