Showing 1–50 of 112 results for author: Lyu, J

Searching in archive cs.
  1. arXiv:2407.00496  [pdf, other]

    cs.LG cs.AI

    A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation

    Authors: Aicheng Gong, Kai Yang, Jiafei Lyu, Xiu Li

    Abstract: Task allocation is a key combinatorial optimization problem, crucial for modern applications such as multi-robot cooperation and resource scheduling. Decision makers must allocate entities to tasks reasonably across different scenarios. However, traditional methods assume static attributes and numbers of tasks and entities, often relying on dynamic programming and heuristic algorithms for solution…

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.19043  [pdf]

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Yajing Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, Jing Qin, Xiahai Zhuang, Claudia Prieto, et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  3. arXiv:2406.12646  [pdf, other]

    eess.IV cs.AI cs.CV

    An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation

    Authors: Qin Li, Yizhe Zhang, Yan Li, Jun Lyu, Meng Liu, Longyu Sun, Mengting Sun, Qirong Li, Wenyue Mao, Xinran Wu, Yajing Zhang, Yinghua Chu, Shuo Wang, Chengyan Wang

    Abstract: The segmentation foundation model, e.g., Segment Anything Model (SAM), has attracted increasing interest in the medical image community. Early pioneering studies primarily concentrated on assessing and improving SAM's performance from the perspectives of overall accuracy and efficiency, yet little attention was given to the fairness considerations. This oversight raises questions about the potenti…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to MICCAI-2024

  4. arXiv:2406.07381  [pdf, other]

    cs.AI cs.LG

    World Models with Hints of Large Language Models for Goal Achieving

    Authors: Zeyuan Liu, Ziyu Huan, Xiyao Wang, Jiafei Lyu, Jian Tao, Xiu Li, Furong Huang, Huazhe Xu

    Abstract: Reinforcement learning struggles in the face of long-horizon tasks and sparse goals due to the difficulty in manual reward specification. While existing methods address this by adding intrinsic rewards, they may fail to provide meaningful guidance in long-horizon decision-making tasks with large state and action spaces, lacking purposeful exploration. Inspired by human cognition, we propose a new…

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2405.15369  [pdf, other]

    cs.LG cs.AI

    Cross-Domain Policy Adaptation by Capturing Representation Mismatch

    Authors: Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li

    Abstract: It is vital to learn effective policies that can be transferred to different domains with dynamics discrepancies in reinforcement learning (RL). In this paper, we consider dynamics adaptation settings where there exists dynamics mismatch between the source domain and the target domain, and one can get access to sufficient source domain data, while can only have limited interactions with the target…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  6. arXiv:2405.02818  [pdf, other]

    cs.IT

    Site-Specific Deployment Optimization of Intelligent Reflecting Surface for Coverage Enhancement

    Authors: Dongsheng Fu, Xintong Chen, Jiangbin Lyu, Liqun Fu

    Abstract: Intelligent Reflecting Surface (IRS) is a promising technology for next generation wireless networks. Despite substantial research in IRS-aided communications, the assumed antenna and channel models are typically simplified without considering site-specific characteristics, which in turn critically affect the IRS deployment and performance in a given environment. In this paper, we first investigat…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 7 pages, 7 figures. To appear in VTC2024-Spring

  7. arXiv:2405.02660  [pdf, other]

    cs.IT eess.SP

    AFDM Channel Estimation in Multi-Scale Multi-Lag Channels

    Authors: Rongyou Cao, Yuheng Zhong, Jiangbin Lyu, Deqing Wang, Liqun Fu

    Abstract: Affine Frequency Division Multiplexing (AFDM) is a brand new chirp-based multi-carrier (MC) waveform for high mobility communications, with promising advantages over Orthogonal Frequency Division Multiplexing (OFDM) and other MC waveforms. Existing AFDM research focuses on wireless communication at high carrier frequency (CF), which typically considers only Doppler frequency shift (DFS) as a resul…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 6 pages, 6 figures. Investigate AFDM under underwater multi-scale multi-lag channels. Derive the new input-output formula with the impact of Doppler time scaling. Propose two new channel estimation methods to tackle different levels of Doppler factors. Perform diversity analysis based on CFR overlap probability (COP) and mutual incoherent property (MIP)

  8. arXiv:2405.02655  [pdf, other]

    cs.IT

    Fast Online Movement Optimization of Aerial Base Stations Based on Global Connectivity Map

    Authors: Yiling Wang, Jiangbin Lyu, Liqun Fu

    Abstract: Unmanned aerial vehicles (UAVs) can serve as aerial base stations (ABSs) to provide wireless connectivity for ground users (GUs) in diverse scenarios. However, it is an NP-hard problem with exponential complexity in $M$ and $N$, in order to maximize the coverage rate (CR) of $M$ GUs by jointly placing $N$ ABSs with limited coverage range. This problem becomes even more intricate when the coverage…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 6 pages, 6 figures. Investigate site-specific movement optimization of UAV-mounted aerial base stations to cover a group of moving ground users, based on site-specific Global Connectivity Map. arXiv admin note: text overlap with arXiv:2312.10490

  9. arXiv:2404.13295  [pdf, other]

    cs.SE

    Detecting Build Dependency Errors in Incremental Builds

    Authors: Jun Lyu, Shanshan Li, He Zhang, Yang Zhang, Guo** Rong, Manuel Rigger

    Abstract: Incremental and parallel builds performed by build tools such as Make are the heart of modern C/C++ software projects. Their correct and efficient execution depends on build scripts. However, build scripts are prone to errors. The most prevalent errors are missing dependencies (MDs) and redundant dependencies (RDs). The state-of-the-art methods for detecting these errors rely on clean builds (i.e.…

    Submitted 25 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  10. arXiv:2404.06107  [pdf, other]

    cs.CL

    Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets

    Authors: Zi Long, Zhenhao Tang, Xianghua Fu, Jian Chen, Shilong Hou, **ze Lyu

    Abstract: Recent research in the field of multimodal machine translation (MMT) has indicated that the visual modality is either dispensable or offers only marginal advantages. However, most of these conclusions are drawn from the analysis of experimental results based on a limited set of bilingual sentence-image pairs, such as Multi30k. In these kinds of datasets, the content of one bilingual parallel sente…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: bucc 2024 accepted

  11. arXiv:2403.18117  [pdf]

    cs.CV

    TDIP: Tunable Deep Image Processing, a Real Time Melt Pool Monitoring Solution

    Authors: Javid Akhavan, Youmna Mahmoud, Ke Xu, Jiaqi Lyu, Souran Manoochehri

    Abstract: In the era of Industry 4.0, Additive Manufacturing (AM), particularly metal AM, has emerged as a significant contributor due to its innovative and cost-effective approach to fabricate highly intricate geometries. Despite its potential, this industry still lacks real-time capable process monitoring algorithms. Recent advancements in this field suggest that Melt Pool (MP) signatures during the fabri…

    Submitted 26 March, 2024; originally announced March 2024.

    ACM Class: I.4.9

  12. arXiv:2403.10047  [pdf, other]

    cs.CV

    TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model

    Authors: Jiahao Lyu, ** Wei, Gangyan Zeng, Zeng Li, Enze Xie, Wei Wang, Yu Zhou

    Abstract: Existing scene text spotters are designed to locate and transcribe texts from images. However, it is challenging for a spotter to achieve precise detection and recognition of scene texts simultaneously. Inspired by the glimpse-focus spotting pipeline of human beings and impressive performances of Pre-trained Language Models (PLMs) on visual tasks, we ask: 1) "Can machines spot texts without precis…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 12 pages, 8 figures

  13. arXiv:2402.17780  [pdf, other]

    eess.SP cs.LG physics.med-ph

    Constraint Latent Space Matters: An Anti-anomalous Waveform Transformation Solution from Photoplethysmography to Arterial Blood Pressure

    Authors: Cheng Bian, Xiaoyu Li, Qi Bi, Guangpu Zhu, Jiegeng Lyu, Weile Zhang, Yelei Li, Zijing Zeng

    Abstract: Arterial blood pressure (ABP) holds substantial promise for proactive cardiovascular health management. Notwithstanding its potential, the invasive nature of ABP measurements confines their utility primarily to clinical environments, limiting their applicability for continuous monitoring beyond medical facilities. The conversion of photoplethysmography (PPG) signals into ABP equivalents has garner…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI-2024, main track

  14. arXiv:2402.03807  [pdf, other]

    cs.LG cs.AI

    SEABO: A Simple Search-Based Method for Offline Imitation Learning

    Authors: Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu

    Abstract: Offline reinforcement learning (RL) has attracted much attention due to its ability in learning from static offline datasets and eliminating the need of interacting with the environment. Nevertheless, the success of offline RL relies heavily on the offline transitions annotated with reward labels. In practice, we often need to hand-craft the reward function, which is sometimes difficult, labor-int…

    Submitted 21 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in ICLR2024

  15. arXiv:2402.02701  [pdf, other]

    cs.LG cs.AI stat.ML

    Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence

    Authors: Jiafei Lyu, Le Wan, Xiu Li, Zongqing Lu

    Abstract: Recently, there are many efforts attempting to learn useful policies for continuous control in visual reinforcement learning (RL). In this scenario, it is important to learn a generalizable policy, as the testing environment may differ from the training environment, e.g., there exist distractors during deployment. Many practical algorithms are proposed to handle this problem. However, to the best…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Part of this work is accepted as AAMAS 2024 extended abstract

  16. arXiv:2401.09750  [pdf, other]

    cs.LG

    Exploration and Anti-Exploration with Distributional Random Network Distillation

    Authors: Kai Yang, Jian Tao, Jiafei Lyu, Xiu Li

    Abstract: Exploration remains a critical issue in deep reinforcement learning for an agent to attain high returns in unknown environments. Although the prevailing exploration Random Network Distillation (RND) algorithm has been demonstrated to be effective in numerous environments, it often needs more discriminative power in bonus allocation. This paper highlights the "bonus inconsistency" issue within RND,…

    Submitted 19 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: ICML 2024 accepted

  17. arXiv:2401.07753  [pdf, other]

    cs.CV

    Low-light Stereo Image Enhancement and De-noising in the Low-frequency Information Enhanced Image Space

    Authors: Minghua Zhao, Xiangdong Qin, Shuangli Du, Xuefei Bai, Jiahao Lyu, Yiguang Liu

    Abstract: Unlike single image task, stereo image enhancement can use another view information, and its key stage is how to perform cross-view feature interaction to extract useful information from another view. However, complex noise in low-light image and its impact on subsequent feature encoding and interaction are ignored by the existing methods. In this paper, a method is proposed to perform enhancement…

    Submitted 15 January, 2024; originally announced January 2024.

  18. arXiv:2312.10490  [pdf, other]

    cs.IT cs.LG

    Spatial Deep Learning for Site-Specific Movement Optimization of Aerial Base Stations

    Authors: Jiangbin Lyu, Xu Chen, Jiefeng Zhang, Liqun Fu

    Abstract: Unmanned aerial vehicles (UAVs) can be utilized as aerial base stations (ABSs) to provide wireless connectivity for ground users (GUs) in various emergency scenarios. However, it is a NP-hard problem with exponential complexity in $M$ and $N$, in order to maximize the coverage rate of $M$ GUs by jointly placing $N$ ABSs with limited coverage range. The problem is further complicated when the cover…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Manuscript submitted to IEEE Trans. Wireless Communications on 15 Jan. 2023; revised 11 Sep. 2023; accepted 5 Dec. 2023

  19. arXiv:2312.10475  [pdf, ps, other]

    cs.IT eess.SP

    IRS-Aided Sectorized Base Station Design and 3D Coverage Performance Analysis

    Authors: Xintong Chen, Jiangbin Lyu, Liqun Fu

    Abstract: Intelligent reflecting surface (IRS) is regarded as a revolutionary paradigm that can reconfigure the wireless propagation environment for enhancing the desired signal and/or weakening the interference, and thus improving the quality of service (QoS) for communication systems. In this paper, we propose an IRS-aided sectorized BS design where the IRS is mounted in front of a transmitter (TX) and re…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Manuscript submitted to IEEE IWQoS 2023 on 12 Feb. 2023; accepted 13 April 2023; published 27 July 2023. An associated Chinese patent was applied on 9 Aug. 2022 and granted on 1 Sep. 2023, under No. ZL202210948626.X

  20. arXiv:2312.07226  [pdf, other]

    eess.IV cs.CV

    Super-Resolution on Rotationally Scanned Photoacoustic Microscopy Images Incorporating Scanning Prior

    Authors: Kai Pan, Linyang Li, Li Lin, Pu** Cheng, Junyan Lyu, Lei Xi, Xiaoyin Tang

    Abstract: Photoacoustic Microscopy (PAM) images integrating the advantages of optical contrast and acoustic resolution have been widely used in brain studies. However, there exists a trade-off between scanning speed and image resolution. Compared with traditional raster scanning, rotational scanning provides good opportunities for fast PAM imaging by optimizing the scanning mechanism. Recently, there is a t…

    Submitted 12 December, 2023; originally announced December 2023.

  21. arXiv:2312.03442  [pdf, other]

    cs.CV

    High-Quality Facial Geometry and Appearance Capture at Home

    Authors: Yuxuan Han, Junfeng Lyu, Feng Xu

    Abstract: Facial geometry and appearance capture have demonstrated tremendous success in 3D scanning real humans in studios. Recent works propose to democratize this technique while keeping the results high quality. However, they are still inconvenient for daily usage. In addition, they focus on an easier problem of only capturing facial skin. This paper proposes a novel method for high-quality face capture…

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page: https://yxuhan.github.io/CoRA/index.html ; Github repo: https://github.com/yxuhan/CoRA

  22. arXiv:2311.13231  [pdf, other]

    cs.LG cs.AI cs.CV

    Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model

    Authors: Kai Yang, Jian Tao, Jiafei Lyu, Chunjiang Ge, Jiaxin Chen, Qimai Li, Weihan Shen, Xiaolong Zhu, Xiu Li

    Abstract: Using reinforcement learning with human feedback (RLHF) has shown significant promise in fine-tuning diffusion models. Previous methods start by training a reward model that aligns with human preferences, then leverage RL techniques to fine-tune the underlying models. However, crafting an efficient reward model demands extensive datasets, optimal architecture, and manual hyperparameter tuning, mak…

    Submitted 23 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 accepted; huggingface daily paper

  23. arXiv:2311.11212  [pdf, other]

    cs.AI cs.LG

    Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?

    Authors: Chanhui Lee, Juhyeon Kim, Yongjun Jeong, Juhyun Lyu, Junghee Kim, Sangmin Lee, Sangjun Han, Hyeokjun Choe, Soyeon Park, Woohyung Lim, Sungbin Lim, Sanghack Lee

    Abstract: Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal relationships between variables utilizing data. Recently, there has been current research regarding a method that mimics causal discovery by aggregating the outcomes of r…

    Submitted 18 November, 2023; originally announced November 2023.

    ACM Class: I.2

  24. arXiv:2311.09806  [pdf, other]

    cs.CV

    EvaSurf: Efficient View-Aware Implicit Textured Surface Reconstruction on Mobile Devices

    Authors: Jingnan Gao, Zhuo Chen, Yichao Yan, Bowen Pan, Zhe Wang, Jiangjing Lyu, Xiaokang Yang

    Abstract: Reconstructing real-world 3D objects has numerous applications in computer vision, such as virtual reality, video games, and animations. Ideally, 3D reconstruction methods should generate high-fidelity results with 3D consistency in real-time. Traditional methods match pixels between images using photo-consistency constraints or learned features, while differentiable rendering methods like Neural…

    Submitted 18 November, 2023; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Project Page: http://g-1nonly.github.io/EvaSurf-Website/

  25. arXiv:2310.20145  [pdf, other]

    cs.LG stat.ML

    Efficient Robust Bayesian Optimization for Arbitrary Uncertain Inputs

    Authors: Lin Yang, Junlong Lyu, Wenlong Lyu, Zhitang Chen

    Abstract: Bayesian Optimization (BO) is a sample-efficient optimization algorithm widely employed across various applications. In some challenging BO tasks, input uncertainty arises due to the inevitable randomness in the optimization process, such as machining errors, execution noise, or contextual variability. This uncertainty deviates the input from the intended value before evaluation, resulting in sign…

    Submitted 3 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  26. arXiv:2310.15017  [pdf, other]

    cs.LG cs.AI

    The primacy bias in Model-based RL

    Authors: Zhongjian Qiao, Jiafei Lyu, Xiu Li

    Abstract: The primacy bias in deep reinforcement learning (DRL), which refers to the agent's tendency to overfit early data and lose the ability to learn from new data, can significantly decrease the performance of DRL algorithms. Previous studies have shown that employing simple techniques, such as resetting the agent's parameters, can substantially alleviate the primacy bias. However, we observe that rese…

    Submitted 23 October, 2023; originally announced October 2023.

  27. arXiv:2309.14872  [pdf, other]

    cs.CV

    Directional Texture Editing for 3D Models

    Authors: Shengqi Liu, Zhuo Chen, Jingnan Gao, Yichao Yan, Wenhan Zhu, Jiangjing Lyu, Xiaokang Yang

    Abstract: Texture editing is a crucial task in 3D modeling that allows users to automatically manipulate the surface materials of 3D models. However, the inherent complexity of 3D models and the ambiguous text description lead to the challenge in this task. To address this challenge, we propose ITEM3D, a Texture Editing Model designed for automatic 3D object editing accor…

    Submitted 6 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: project page: https://shengqiliu1.github.io/ITEM3D

  28. arXiv:2309.10836  [pdf, other]

    cs.CV

    CMRxRecon: An open cardiac MRI dataset for the competition of accelerated image reconstruction

    Authors: Chengyan Wang, Jun Lyu, Shuo Wang, Chen Qin, Kunyuan Guo, Xinyu Zhang, Xiaotong Yu, Yan Li, Fanwen Wang, Jianhua **, Zhang Shi, Ziqiang Xu, Yapeng Tian, Sha Hua, Zhensen Chen, Meng Liu, Mengting Sun, Xutong Kuang, Kang Wang, Haoran Wang, Hao Li, Yinghua Chu, Guang Yang, Wenjia Bai, Xiahai Zhuang, et al. (3 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (CMR) has emerged as a valuable diagnostic tool for cardiac diseases. However, a limitation of CMR is its slow imaging speed, which causes patient discomfort and introduces artifacts in the images. There has been growing interest in deep learning-based CMR imaging algorithms that can reconstruct high-quality images from highly under-sampled k-space data. However,…

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 14 pages, 8 figures

  29. arXiv:2308.11449  [pdf, ps, other]

    math.NA cs.AI math.AP math.PR

    Convergence guarantee for consistency models

    Authors: Junlong Lyu, Zhitang Chen, Shoubo Feng

    Abstract: We provide the first convergence guarantees for the Consistency Models (CMs), a newly emerging type of one-step generative models that can generate comparable samples to those generated by Diffusion Models. Our main result is that, under the basic assumptions on score-matching errors, consistency errors and smoothness of the data distribution, CMs can efficiently sample from any realistic data dis…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 20 pages, 1 figures

  30. FoodSAM: Any Food Segmentation

    Authors: Xing Lan, Jiayi Lyu, Hanyu Jiang, Kun Dong, Zehai Niu, Yi Zhang, Jian Xue

    Abstract: In this paper, we explore the zero-shot capability of the Segment Anything Model (SAM) for food image segmentation. To address the lack of class-specific information in SAM-generated masks, we propose a novel framework, called FoodSAM. This innovative approach integrates the coarse semantic mask with SAM-generated masks to enhance semantic segmentation quality. Besides, we recognize that the ingre…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Code is available at https://github.com/jamesjg/FoodSAM

  31. arXiv:2307.12577  [pdf, other]

    cs.CV

    PRIOR: Prototype Representation Joint Learning from Medical Images and Reports

    Authors: Pu** Cheng, Li Lin, Junyan Lyu, Yi** Huang, Wenhan Luo, Xiaoying Tang

    Abstract: Contrastive learning based vision-language joint pre-training has emerged as a successful representation learning strategy. In this paper, we present a prototype representation learning framework incorporating both global and local alignment between medical images and reports. In contrast to standard global multi-modality alignment methods, we employ a local alignment module for fine-grained repre…

    Submitted 11 March, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023

  32. arXiv:2307.07177  [pdf, other]

    cs.CV cs.AI

    TriFormer: A Multi-modal Transformer Framework For Mild Cognitive Impairment Conversion Prediction

    Authors: Linfeng Liu, Junyan Lyu, Siyu Liu, Xiaoying Tang, Shekhar S. Chandra, Fatima A. Nasrallah

    Abstract: The prediction of mild cognitive impairment (MCI) conversion to Alzheimer's disease (AD) is important for early treatment to prevent or slow the progression of AD. To accurately predict the MCI conversion to stable MCI or progressive MCI, we propose Triformer, a novel transformer-based framework with three specialized transformers to incorporate multi-model data. Triformer uses I) an image transfo…

    Submitted 14 July, 2023; originally announced July 2023.

  33. arXiv:2307.04296  [pdf, other]

    eess.IV cs.CV

    K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment

    Authors: Guoyang Xie, **bao Wang, Yawen Huang, Jiayi Lyu, Feng Zheng, Yefeng Zheng, Yaochu **

    Abstract: The problem of how to assess cross-modality medical image synthesis has been largely unexplored. The most used measures like PSNR and SSIM focus on analyzing the structural features but neglect the crucial lesion location and fundamental k-space speciality of medical images. To overcome this problem, we propose a new metric K-CROSS to spur progress on this challenging problem. Specifically, K-CROS…

    Submitted 9 February, 2024; v1 submitted 9 July, 2023; originally announced July 2023.

  34. arXiv:2307.02334  [pdf, ps, other]

    eess.IV cs.CV

    Dual Arbitrary Scale Super-Resolution for Multi-Contrast MRI

    Authors: Jiamiao Zhang, Yichen Chi, Jun Lyu, Wenming Yang, Yapeng Tian

    Abstract: Limited by imaging systems, the reconstruction of Magnetic Resonance Imaging (MRI) images from partial measurement is essential to medical imaging research. Benefiting from the diverse and complementary information of multi-contrast MR images in different imaging modalities, multi-contrast Super-Resolution (SR) reconstruction is promising to yield SR images with higher quality. In the medical scen…

    Submitted 10 July, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI2023

  35. arXiv:2306.08200  [pdf, other]

    cs.CV cs.LG

    POP: Prompt Of Prompts for Continual Learning

    Authors: Zhiyuan Hu, Jiancheng Lyu, Dashan Gao, Nuno Vasconcelos

    Abstract: Continual learning (CL) has attracted increasing attention in the recent past. It aims to mimic the human ability to learn new concepts without catastrophic forgetting. While existing CL methods accomplish this to some extent, they are still prone to semantic drift of the learned feature space. Foundation models, which are endowed with a robust feature representation, learned from very large datas…

    Submitted 13 June, 2023; originally announced June 2023.

  36. arXiv:2306.03615  [pdf, other]

    cs.LG

    PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation

    Authors: Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li

    Abstract: In preference-based Reinforcement Learning (RL), obtaining a large number of preference labels are both time-consuming and costly. Furthermore, the queried human preferences cannot be utilized for the new tasks. In this paper, we propose Zero-shot Cross-task Preference Alignment and Robust Reward Learning (PEARL), which learns policies from cross-task preference transfer without any human labels o…

    Submitted 5 June, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2024

  37. arXiv:2306.00656  [pdf, other]

    cs.LG cs.AI

    Normalization Enhances Generalization in Visual Reinforcement Learning

    Authors: Lu Li, Jiafei Lyu, Guozheng Ma, Zilin Wang, Zhenjie Yang, Xiu Li, Zhiheng Li

    Abstract: Recent advances in visual reinforcement learning (RL) have led to impressive success in handling complex tasks. However, these methods have demonstrated limited generalization capability to visual disturbances, which poses a significant challenge for their real-world application and adaptability. Though normalization techniques have demonstrated huge success in supervised and unsupervised learning…

    Submitted 1 June, 2023; originally announced June 2023.

  38. arXiv:2305.18443  [pdf, other]

    cs.LG

    Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse

    Authors: Jiafei Lyu, Le Wan, Zongqing Lu, Xiu Li

    Abstract: Sample efficiency is one of the most critical issues for online reinforcement learning (RL). Existing methods achieve higher sample efficiency by adopting model-based methods, Q-ensemble, or better exploration mechanisms. We, instead, propose to train an off-policy RL agent via updating on a fixed sampled batch multiple times, thus reusing these samples and better exploiting them within a single o…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: 37 pages

  39. arXiv:2305.10465  [pdf, other]

    cs.CV cs.AI

    Towards Robust Probabilistic Modeling on SO(3) via Rotation Laplace Distribution

    Authors: Yingda Yin, Jiangran Lyu, Yang Wang, He Wang, Baoquan Chen

    Abstract: Estimating the 3DoF rotation from a single RGB image is an important yet challenging problem. As a popular approach, probabilistic rotation modeling additionally carries prediction uncertainty information, compared to single-prediction rotation regression. For modeling probabilistic distribution over SO(3), it is natural to use Gaussian-like Bingham distribution and matrix Fisher, however they are…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Submitted to TPAMI. arXiv admin note: substantial text overlap with arXiv:2303.01743

  40. arXiv:2305.03049  [pdf, other]

    cs.CV

    NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds

    Authors: Jun-Kun Chen, Jipeng Lyu, Yu-Xiong Wang

    Abstract: This paper proposes NeuralEditor that enables neural radiance fields (NeRFs) natively editable for general shape editing tasks. Despite their impressive results on novel-view synthesis, it remains a fundamental challenge for NeRFs to edit the shape of the scene. Our key insight is to exploit the explicit point cloud representation as the underlying structure to construct NeRFs, inspired by the int…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: CVPR 2023

  41. arXiv:2304.04660  [pdf, other]

    cs.LG cs.AI

    Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning

    Authors: Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li

    Abstract: Equipped with the trained environmental dynamics, model-based offline reinforcement learning (RL) algorithms can often successfully learn good policies from fixed-sized datasets, even some datasets with poor quality. Unfortunately, however, it can not be guaranteed that the generated samples from the trained dynamics model are reliable (e.g., some synthetic samples may lie outside of the support r…

    Submitted 26 July, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  42. arXiv:2303.17594  [pdf, other]

    cs.CV

    MobileInst: Video Instance Segmentation on the Mobile

    Authors: Renhong Zhang, Tianheng Cheng, Shusheng Yang, Haoyi Jiang, Shuai Zhang, Jiancheng Lyu, Xin Li, Xiaowen Ying, Dashan Gao, Wenyu Liu, Xinggang Wang

    Abstract: Video instance segmentation on mobile devices is an important yet very challenging edge AI problem. It mainly suffers from (1) heavy computation and memory costs for frame-by-frame pixel-level instance perception and (2) complicated heuristics for tracking objects. To address those issues, we present MobileInst, a lightweight and mobile-friendly framework for video instance segmentation on mobile…

    Submitted 18 December, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted by AAAI 2024 Main Track; Code will be released

  43. arXiv:2303.12696  [pdf, other]

    cs.CV cs.AI

    Dense Network Expansion for Class Incremental Learning

    Authors: Zhiyuan Hu, Yunsheng Li, Jiancheng Lyu, Dashan Gao, Nuno Vasconcelos

    Abstract: The problem of class incremental learning (CIL) is considered. State-of-the-art approaches use a dynamic architecture based on network expansion (NE), in which a task expert is added per task. While effective from a computational standpoint, these methods lead to models that grow quickly with the number of tasks. A new NE method, dense network expansion (DNE), is proposed to achieve a better trade…

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023

  44. arXiv:2302.13328  [pdf, other]

    cs.RO

    Reinforcement Learning Based Pushing and Grasping Objects from Ungraspable Poses

    Authors: Hao Zhang, Hongzhuo Liang, Lin Cong, Jianzhi Lyu, Long Zeng, Pingfa Feng, Jianwei Zhang

    Abstract: Grasping an object when it is in an ungraspable pose is a challenging task, such as books or other large flat objects placed horizontally on a table. Inspired by human manipulation, we address this problem by pushing the object to the edge of the table and then grasping it from the hanging part. In this paper, we develop a model-free Deep Reinforcement Learning framework to synergize pushing and g…

    Submitted 26 February, 2023; originally announced February 2023.

    Comments: Accepted by ICRA 2023; Project page https://haozhang990127.github.io/PaG/

  45. arXiv:2301.13359  [pdf, other]

    cs.CV cs.AI

    IM-IAD: Industrial Image Anomaly Detection Benchmark in Manufacturing

    Authors: Guoyang Xie, Jinbao Wang, Jiaqi Liu, Jiayi Lyu, Yong Liu, Chengjie Wang, Feng Zheng, Yaochu Jin

    Abstract: Image anomaly detection (IAD) is an emerging and vital computer vision task in industrial manufacturing (IM). Recently, many advanced algorithms have been reported, but their performance deviates considerably with various IM settings. We realize that the lack of a uniform IM benchmark is hindering the development and usage of IAD methods in real-world applications. In addition, it is difficult for…

    Submitted 27 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

  46. arXiv:2301.12640  [pdf, other]

    cs.LG math.AP math.OC

    Reweighted Interacting Langevin Diffusions: an Accelerated Sampling Method for Optimization

    Authors: Junlong Lyu, Zhitang Chen, Wenlong Lyu, Jianye Hao

    Abstract: We proposed a new technique to accelerate sampling methods for solving difficult optimization problems. Our method investigates the intrinsic connection between posterior distribution sampling and optimization with Langevin dynamics, and then we propose an interacting particle scheme that approximates a Reweighted Interacting Langevin Diffusion system (RILD). The underlying system is designed by a…

    Submitted 29 January, 2023; originally announced January 2023.

  47. IRS-Assisted RF-powered IoT Networks: System Modeling and Performance Analysis

    Authors: Hongguang Sun, Zelun Zhao, Hu Cheng, Jiangbin Lyu, Xijun Wang, Yan Zhang, Tony Q. S. Quek

    Abstract: Emerged as a promising solution for future wireless communication systems, intelligent reflecting surface (IRS) is capable of reconfiguring the wireless propagation environment by adjusting the phase-shift of a large number of reflecting elements. To quantify the gain achieved by IRSs in the radio frequency (RF) powered Internet of Things (IoT) networks, in this work, we consider an IRS-assisted c…

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: 32 pages, 11 figures, submitted to IEEE Transactions on Communications

    Journal ref: IEEE Transactions on Communications, 2023

  48. arXiv:2210.10969  [pdf, other]

    cs.CV

    SSiT: Saliency-guided Self-supervised Image Transformer for Diabetic Retinopathy Grading

    Authors: Yijin Huang, Junyan Lyu, Pujin Cheng, Roger Tam, Xiaoying Tang

    Abstract: Self-supervised Learning (SSL) has been widely applied to learn image representations through exploiting unlabeled images. However, it has not been fully explored in the medical image analysis field. In this work, Saliency-guided Self-Supervised image Transformer (SSiT) is proposed for Diabetic Retinopathy (DR) grading from fundus images. We novelly introduce saliency maps into SSL, with a goal of…

    Submitted 12 March, 2024; v1 submitted 19 October, 2022; originally announced October 2022.

  49. arXiv:2210.05740  [pdf, other]

    cs.LG cs.AI math.OC

    Stochastic Constrained DRO with a Complexity Independent of Sample Size

    Authors: Qi Qi, Jiameng Lyu, Kung sik Chan, Er Wei Bai, Tianbao Yang

    Abstract: Distributionally Robust Optimization (DRO), as a popular method to train robust models against distribution shift between training and test sets, has received tremendous attention in recent years. In this paper, we propose and analyze stochastic algorithms that apply to both non-convex and convex losses for solving Kullback Leibler divergence constrained DRO problem. Compared with existing methods…

    Submitted 16 August, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: 37 pages, 16 figures

    Journal ref: Transactions on Machine Learning Research, 2023

  50. arXiv:2210.04251  [pdf, other]

    cs.LG cs.AI

    State Advantage Weighting for Offline RL

    Authors: Jiafei Lyu, Aicheng Gong, Le Wan, Zongqing Lu, Xiu Li

    Abstract: We present state advantage weighting for offline reinforcement learning (RL). In contrast to action advantage $A(s,a)$ that we commonly adopt in QSA learning, we leverage state advantage $A(s,s^\prime)$ and QSS learning for offline RL, hence decoupling the action from values. We expect the agent can get to the high-reward state and the action is determined by how the agent can get to that correspo…

    Submitted 8 November, 2022; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: 3rd Offline RL workshop at NeurIPS 2022. arXiv admin note: text overlap with arXiv:2206.07989