
Showing 1–50 of 60 results for author: Duan, J

Searching in archive eess.
  1. arXiv:2404.03179 [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

    Authors: Tiantian Geng, Teng Wang, Yanfu Zhang, Jinming Duan, Weili Guan, Feng Zheng

    Abstract: Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL). Existing methods over-specialize on each task, overlooking the fact that these instances often occur in the same video to form the complete video content. In this work, we present UniAV, a Unified Audio…

    Submitted 3 April, 2024; originally announced April 2024.

  2. arXiv:2402.03585 [pdf, other]

    cs.CV eess.IV

    Decoder-Only Image Registration

    Authors: Xi Jia, Wenqi Lu, Xinxing Cheng, Jinming Duan

    Abstract: In unsupervised medical image registration, the predominant approaches involve the utilization of an encoder-decoder network architecture, allowing for precise prediction of dense, full-resolution displacement fields from given paired images. Despite its widespread use in the literature, we argue for the necessity of making both the encoder and decoder learnable in such an architecture. For this, w…

    Submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2310.19022 [pdf, other]

    math.OC cs.LG eess.SY

    Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback

    Authors: Jingliang Duan, Jie Li, Xuyang Chen, Kai Zhao, Shengbo Eben Li, Lin Zhao

    Abstract: In recent times, significant advancements have been made in delving into the optimization landscape of policy gradient methods for achieving optimal control in linear time-invariant (LTI) systems. Compared with state-feedback control, output-feedback control is more prevalent since the underlying state of the system may not be fully observed in many practical settings. This paper analyzes the opti…

    Submitted 29 October, 2023; originally announced October 2023.

    Journal ref: IEEE Transactions on Cybernetics, 2023

  4. arXiv:2310.05858 [pdf, other]

    cs.LG eess.SY

    DSAC-T: Distributional Soft Actor-Critic with Three Refinements

    Authors: Jingliang Duan, Wenxuan Wang, Liming Xiao, Jiaxin Gao, Shengbo Eben Li

    Abstract: Reinforcement learning (RL) has proven to be highly effective in tackling complex decision-making and control tasks. However, prevalent model-free RL methods often face severe performance degradation due to the well-known overestimation issue. In response to this problem, we recently introduced an off-policy RL algorithm, called distributional soft actor-critic (DSAC or DSAC-v1), which can effecti…

    Submitted 28 December, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  5. arXiv:2310.04992 [pdf, other]

    eess.IV cs.CV

    VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

    Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv, et al. (17 additional authors not shown)

    Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassifi…

    Submitted 7 October, 2023; originally announced October 2023.

  6. arXiv:2307.15273 [pdf, other]

    cs.CV cs.LG eess.IV

    Recovering high-quality FODs from a reduced number of diffusion-weighted images using a model-driven deep learning architecture

    Authors: J Bartlett, C E Davey, L A Johnston, J Duan

    Abstract: Fibre orientation distribution (FOD) reconstruction using deep learning has the potential to produce accurate FODs from a reduced number of diffusion-weighted images (DWIs), decreasing total imaging time. Diffusion acquisition invariant representations of the DWI signals are typically used as input to these methods to ensure that they can be applied flexibly to data with different b-vectors and b-…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 10 pages, 7 figures, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  7. arXiv:2307.05382 [pdf, other]

    eess.SP cs.AI cs.LG

    Protecting the Future: Neonatal Seizure Detection with Spatial-Temporal Modeling

    Authors: Ziyue Li, Yuchen Fang, You Li, Kan Ren, Yansen Wang, Xufang Luo, Juanyong Duan, Congrui Huang, Dongsheng Li, Lili Qiu

    Abstract: A timely detection of seizures for newborn infants with electroencephalogram (EEG) has been a common yet life-saving practice in the Neonatal Intensive Care Unit (NICU). However, it requires great human efforts for real-time monitoring, which calls for automated solutions to neonatal seizure detection. Moreover, the current automated methods focusing on adult epilepsy monitoring often fail due to…

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2023

  8. arXiv:2307.02997 [pdf, other]

    eess.IV cs.CV

    Fourier-Net+: Leveraging Band-Limited Representation for Efficient 3D Medical Image Registration

    Authors: Xi Jia, Alexander Thorley, Alberto Gomez, Wenqi Lu, Dipak Kotecha, Jinming Duan

    Abstract: U-Net style networks are commonly utilized in unsupervised image registration to predict dense displacement fields, which for high-resolution volumetric image data is a resource-intensive and time-consuming task. To tackle this challenge, we first propose Fourier-Net, which replaces the costly U-Net style expansive path with a parameter-free model-driven decoder. Instead of directly predicting a f…

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: Under review. arXiv admin note: text overlap with arXiv:2211.16342

  9. arXiv:2305.18355 [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization

    Authors: Fei Kong, Jinhao Duan, RuiPeng Ma, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu

    Abstract: Recently, diffusion models have achieved remarkable success in generating tasks, including image and audio generation. However, like other generative models, diffusion models are prone to privacy issues. In this paper, we propose an efficient query-based membership inference attack (MIA), namely Proximal Initialization Attack (PIA), which utilizes groundtruth trajectory obtained by $ε$ initialized…

    Submitted 9 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  10. arXiv:2304.08845 [pdf, other]

    cs.LG eess.SY

    Feasible Policy Iteration

    Authors: Yujie Yang, Zhilong Zheng, Shengbo Eben Li, Jingliang Duan, Jingjing Liu, Xianyuan Zhan, Ya-Qin Zhang

    Abstract: Safe reinforcement learning (RL) aims to find the optimal policy and its feasible region in a constrained optimal control problem (OCP). Ensuring feasibility and optimality simultaneously has been a major challenge. Existing methods either attempt to solve OCPs directly with constrained optimization algorithms, leading to unstable training processes and unsatisfactory feasibility, or restrict poli…

    Submitted 28 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

  11. arXiv:2304.01041 [pdf, other]

    cs.RO eess.SY

    Integrated Behavior Planning and Motion Control for Autonomous Vehicles with Traffic Rules Compliance

    Authors: Haichao Liu, Kai Chen, Yulin Li, Zhenmin Huang, Jianghua Duan, Jun Ma

    Abstract: In this article, we propose an optimization-based integrated behavior planning and motion control scheme, which is an interpretable and adaptable urban autonomous driving solution that complies with complex traffic rules while ensuring driving safety. Inherently, to ensure compliance with traffic rules, an innovative design of potential functions (PFs) is presented to characterize various traffic…

    Submitted 30 November, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 7 pages, 5 figures, accepted for publication in The 2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)

  12. arXiv:2303.12930 [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline

    Authors: Tiantian Geng, Teng Wang, Jinming Duan, Runmin Cong, Feng Zheng

    Abstract: Existing audio-visual event localization (AVE) handles manually trimmed videos with only a single instance in each of them. However, this setting is unrealistic as natural videos often contain numerous audio-visual events with different categories. To better adapt to real-life applications, in this paper we focus on the task of dense-localizing audio-visual events, which aims to jointly localize a…

    Submitted 24 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023

  13. arXiv:2210.07553 [pdf, other]

    cs.RO cs.LG eess.SY

    Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

    Authors: Dongjie Yu, Wenjun Zou, Yujie Yang, Haitong Ma, Shengbo Eben Li, Jingliang Duan, Jianyu Chen

    Abstract: Safe reinforcement learning (RL) that solves constraint-satisfactory policies provides a promising way to the broader safety-critical applications of RL in real-world problems such as robotics. Among all safe RL approaches, model-based methods reduce training time violations further due to their high sample efficiency. However, lacking safety robustness against the model uncertainties remains an i…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 12 pages, 6 figures

  14. On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator

    Authors: Jingliang Duan, Wenhan Cao, Yang Zheng, Lin Zhao

    Abstract: The convergence of policy gradient algorithms in reinforcement learning hinges on the optimization landscape of the underlying optimal control problem. Theoretical insights into these algorithms can often be acquired from analyzing those of linear quadratic control. However, most of the existing literature only considers the optimization landscape for static full-state or output feedback policies…

    Submitted 29 October, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.09598

    Journal ref: 2022 IEEE 61st Conference on Decision and Control (CDC)

  15. arXiv:2208.04939 [pdf, other]

    eess.IV cs.CV

    U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?

    Authors: Xi Jia, Joseph Bartlett, Tianyang Zhang, Wenqi Lu, Zhaowen Qiu, Jinming Duan

    Abstract: Due to their extreme long-range modeling capability, vision transformer-based networks have become increasingly popular in deformable image registration. We believe, however, that the receptive field of a 5-layer convolutional U-Net is sufficient to capture accurate deformations without needing long-range dependencies. The purpose of this study is therefore to investigate whether U-Net-based metho…

    Submitted 13 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted to MICCAI-MLMI 2022

  16. arXiv:2206.02346 [pdf, other]

    math.OC cs.AI cs.LG eess.SY

    Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs

    Authors: Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Başar, Mihailo R. Jovanović

    Abstract: We study sequential decision making problems aimed at maximizing the expected total reward while satisfying a constraint on the expected total utility. We employ the natural policy gradient method to solve the discounted infinite-horizon optimal control problem for Constrained Markov Decision Processes (constrained MDPs). Specifically, we propose a new Natural Policy Gradient Primal-Dual (NPG-PD)…

    Submitted 17 October, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 72 pages, 4 figures, 2 tables; revised sample complexity and computational experiments, and added zero constraint violation

  17. arXiv:2205.12857 [pdf, other]

    eess.IV cs.CV

    Structure Unbiased Adversarial Model for Medical Image Segmentation

    Authors: Tianyang Zhang, Shaoming Zheng, Jun Cheng, Xi Jia, Joseph Bartlett, Xinxing Cheng, Huazhu Fu, Zhaowen Qiu, Jiang Liu, Jinming Duan

    Abstract: Generative models have been widely proposed in image recognition to generate more images where the distribution is similar to that of the real ones. It often introduces a discriminator network to differentiate the real data from the generated ones. Such models utilise a discriminator network tasked with differentiating style transferred data from data contained in the target dataset. However in do…

    Submitted 11 August, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Will revise the paper and resubmit

  18. arXiv:2204.04403 [pdf, other]

    cs.RO eess.SY

    Improve Generalization of Driving Policy at Signalized Intersections with Adversarial Learning

    Authors: Yangang Ren, Guojian Zhan, Liye Tang, Shengbo Eben Li, Jianhua Jiang, Jingliang Duan

    Abstract: Intersections are quite challenging among various driving scenes wherein the interaction of signal lights and distinct traffic actors poses great difficulty to learn a wise and robust driving policy. Current research rarely considers the diversity of intersections and stochastic behaviors of traffic participants. For practical applications, the randomness usually leads to some devastating events,…

    Submitted 9 April, 2022; originally announced April 2022.

  19. arXiv:2204.02857 [pdf, other]

    eess.SY

    Primal-dual Estimator Learning: an Offline Constrained Moving Horizon Estimation Method with Feasibility and Near-optimality Guarantees

    Authors: Wenhan Cao, Jingliang Duan, Shengbo Eben Li, Chen Chen, Chang Liu, Yu Wang

    Abstract: This paper proposes a primal-dual framework to learn a stable estimator for linear constrained estimation problems leveraging the moving horizon approach. To avoid the online computational burden in most existing methods, we learn a parameterized function offline to approximate the primal estimate. Meanwhile, a dual estimator is trained to check the suboptimality of the primal estimator during exe…

    Submitted 6 April, 2022; originally announced April 2022.

  20. On the Optimization Landscape of Dynamic Output Feedback Linear Quadratic Control

    Authors: Jingliang Duan, Wenhan Cao, Yang Zheng, Lin Zhao

    Abstract: The convergence of policy gradient algorithms hinges on the optimization landscape of the underlying optimal control problem. Theoretical insights into these algorithms can often be acquired from analyzing those of linear quadratic control. However, most of the existing literature only considers the optimization landscape for static full-state or output feedback policies (controllers). We investig…

    Submitted 29 October, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Journal ref: IEEE Transactions on Automatic Control (full paper), 2023

  21. arXiv:2112.09357 [pdf, other]

    cs.CV cs.SD eess.AS

    Interpreting Audiograms with Multi-stage Neural Networks

    Authors: Shufan Li, Congxi Lu, Linkai Li, Jirong Duan, ** Fu, Haoshuai Zhou

    Abstract: Audiograms are a particular type of line charts representing individuals' hearing level at various frequencies. They are used by audiologists to diagnose hearing loss, and further select and tune appropriate hearing aids for customers. There have been several projects such as Autoaudio that aim to accelerate this process through means of machine learning. But all existing models at their best can…

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 12 pages, 12 figures. The code for this project is available at https://github.com/jacklishufan/MAIN2021

  22. arXiv:2112.04489 [pdf, other]

    eess.IV cs.CV

    Learn2Reg: comprehensive multi-task medical image registration challenge, dataset and evaluation in the era of deep learning

    Authors: Alessa Hering, Lasse Hansen, Tony C. W. Mok, Albert C. S. Chung, Hanna Siebert, Stephanie Häger, Annkristin Lange, Sven Kuckertz, Stefan Heldmann, Wei Shao, Sulaiman Vesal, Mirabela Rusu, Geoffrey Sonn, Théo Estienne, Maria Vakalopoulou, Luyi Han, Yunzhi Huang, Pew-Thian Yap, Mikael Brudfors, Yaël Balbastre, Samuel Joutard, Marc Modat, Gal Lifshitz, Dan Raviv, Jinxin Lv, et al. (28 additional authors not shown)

    Abstract: Image registration is a fundamental medical image analysis task, and a wide variety of approaches have been proposed. However, only a few studies have comprehensively compared medical image registration approaches on a wide range of clinically relevant tasks. This limits the development of registration methods, the adoption of research advances into practice, and a fair benchmark across competing…

    Submitted 7 October, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  23. Optimization Landscape of Gradient Descent for Discrete-time Static Output Feedback

    Authors: Jingliang Duan, Jie Li, Shengbo Eben Li, Lin Zhao

    Abstract: In this paper, we analyze the optimization landscape of gradient descent methods for static output feedback (SOF) control of discrete-time linear time-invariant systems with quadratic cost. The SOF setting can be quite common, for example, when there are unmodeled hidden states in the underlying process. We first establish several important properties of the SOF cost function, including coercivity…

    Submitted 10 March, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Journal ref: 2022 American Control Conference (ACC)

  24. arXiv:2109.05540 [pdf, other]

    cs.RO eess.SY

    Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios

    Authors: Jingliang Duan, Yangang Ren, Fawang Zhang, Yang Guan, Dongjie Yu, Shengbo Eben Li, Bo Cheng, Lin Zhao

    Abstract: In this paper, we propose a new reinforcement learning (RL) algorithm, called encoding distributional soft actor-critic (E-DSAC), for decision-making in autonomous driving. Unlike existing RL-based decision-making methods, E-DSAC is suitable for situations where the number of surrounding vehicles is variable and eliminates the requirement for manually pre-designed sorting rules, resulting in highe…

    Submitted 12 September, 2021; originally announced September 2021.

  25. arXiv:2108.11623 [pdf, other]

    cs.LG cs.RO eess.SY

    Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

    Authors: Baiyu Peng, Jingliang Duan, Jianyu Chen, Shengbo Eben Li, Genjin Xie, Congsheng Zhang, Yang Guan, Yao Mu, Enxin Sun

    Abstract: Safety is essential for reinforcement learning (RL) applied in the real world. Adding chance constraints (or probabilistic constraints) is a suitable way to enhance RL safety under uncertainty. Existing chance-constrained RL methods like the penalty methods and the Lagrangian methods either exhibit periodic oscillations or learn an over-conservative or unsafe policy. In this paper, we address thes…

    Submitted 26 August, 2021; originally announced August 2021.

  26. Iterative Self-consistent Parallel Magnetic Resonance Imaging Reconstruction based on Nonlocal Low-Rank Regularization

    Authors: Ting Pan, Jizhong Duan, Junfeng Wang, Yu Liu

    Abstract: Iterative self-consistent parallel imaging reconstruction (SPIRiT) is an effective self-calibrated reconstruction model for parallel magnetic resonance imaging (PMRI). The joint L1 norm of wavelet coefficients and joint total variation (TV) regularization terms are incorporated into the SPIRiT model to improve the reconstruction performance. The simultaneous two-directional low-rankness (STDLR) in…

    Submitted 17 April, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

    Journal ref: Magnetic Resonance Imaging, vol. 88, pp. 62-75, 2022

  27. arXiv:2107.07907 [pdf, other]

    eess.IV cs.CV cs.MM

    Lightness Modulated Deep Inverse Tone Mapping

    Authors: Kanglin Liu, Gaofeng Cao, Jiang Duan, Guoping Qiu

    Abstract: Single-image HDR reconstruction or inverse tone mapping (iTM) is a challenging task. In particular, recovering information in over-exposed regions is extremely difficult because details in such regions are almost completely lost. In this paper, we present a deep learning based iTM method that takes advantage of the feature extraction and mapping power of deep convolutional neural networks (CNNs) a…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 11 pages, 10 figures

  28. arXiv:2105.12227 [pdf, other]

    cs.CV eess.IV

    Learning a Model-Driven Variational Network for Deformable Image Registration

    Authors: Xi Jia, Alexander Thorley, Wei Chen, Huaqi Qiu, Linlin Shen, Iain B Styles, Hyung Jin Chang, Ales Leonardis, Antonio de Marvao, Declan P. O'Regan, Daniel Rueckert, Jinming Duan

    Abstract: Data-driven deep learning approaches to image registration can be less accurate than conventional iterative approaches, especially when training data is limited. To address this whilst retaining the fast inference speed of deep learning, we propose VR-Net, a novel cascaded variational network for unsupervised deformable image registration. Using the variable splitting optimization scheme, we first…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  29. Fixed-Dimensional and Permutation Invariant State Representation of Autonomous Driving

    Authors: Jingliang Duan, Dongjie Yu, Shengbo Eben Li, Wenxuan Wang, Yangang Ren, Ziyu Lin, Bo Cheng

    Abstract: In this paper, we propose a new state representation method, called encoding sum and concatenation (ESC), for the state representation of decision-making in autonomous driving. Unlike existing state representation methods, ESC is applicable to a variable number of surrounding vehicles and eliminates the need for manually pre-designed sorting rules, leading to higher representation ability and gene…

    Submitted 4 March, 2022; v1 submitted 24 May, 2021; originally announced May 2021.

    Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2021

  30. arXiv:2104.05810 [pdf, other]

    cs.MA cs.GT eess.SY

    A Distributed and Resilient Bargaining Game for Weather-Predictive Microgrid Energy Cooperation

    Authors: Lu An, Jie Duan, Mo-Yuen Chow, Alexandra Duel-Hallen

    Abstract: A bargaining game is investigated for cooperative energy management in microgrids. This game incorporates a fully distributed and realistic cooperative power scheduling algorithm (CoDES) as well as a distributed Nash Bargaining Solution (NBS)-based method of allocating the overall power bill resulting from CoDES. A novel weather-based stochastic renewable generation (RG) prediction method is incor…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 9 pages, 8 figures, published in IEEE Transactions on Industrial Informatics

    Journal ref: IEEE Transactions on Industrial Informatics 15 (8), 4721-4730, 2019

  31. arXiv:2103.05505 [pdf]

    eess.SY cs.LG

    Approximate Optimal Filter for Linear Gaussian Time-invariant Systems

    Authors: Kaiming Tang, Shengbo Eben Li, Yuming Yin, Yang Guan, Jingliang Duan, Wenhan Cao, Jie Li

    Abstract: State estimation is critical to control systems, especially when the states cannot be directly measured. This paper presents an approximate optimal filter, which enables to use policy iteration technique to obtain the steady-state gain in linear Gaussian time-invariant systems. This design transforms the optimal filtering problem with minimum mean square error into an optimal control problem, call…

    Submitted 9 March, 2021; originally announced March 2021.

  32. arXiv:2102.11736 [pdf, other]

    eess.SY cs.AI

    Recurrent Model Predictive Control

    Authors: Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Qi Sun, Bo Cheng

    Abstract: This paper proposes an off-line algorithm, called Recurrent Model Predictive Control (RMPC), to solve general nonlinear finite-horizon optimal control problems. Unlike traditional Model Predictive Control (MPC) algorithms, it can make full use of the current computing resources and adaptively select the longest model prediction horizon. Our algorithm employs a recurrent function to approximate the…

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2102.10289

  33. Recurrent Model Predictive Control: Learning an Explicit Recurrent Controller for Nonlinear Systems

    Authors: Zhengyu Liu, Jingliang Duan, Wenxuan Wang, Shengbo Eben Li, Yuming Yin, Ziyu Lin, Bo Cheng

    Abstract: This paper proposes an offline control algorithm, called Recurrent Model Predictive Control (RMPC), to solve large-scale nonlinear finite-horizon optimal control problems. It can be regarded as an explicit solver of traditional Model Predictive Control (MPC) algorithms, which can adaptively select appropriate model prediction horizon according to current computing resources, so as to improve the p…

    Submitted 8 April, 2022; v1 submitted 20 February, 2021; originally announced February 2021.

    Journal ref: IEEE Transactions on Industrial Electronics, 2022

  34. arXiv:2102.08539 [pdf, other]

    cs.LG cs.AI eess.SY

    Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning

    Authors: Baiyu Peng, Yao Mu, Jingliang Duan, Yang Guan, Shengbo Eben Li, Jianyu Chen

    Abstract: Safety is essential for reinforcement learning (RL) applied in real-world tasks like autonomous driving. Chance constraints which guarantee the satisfaction of state constraints at a high probability are suitable to represent the requirements in real-world environment with uncertainty. Existing chance constrained RL methods like the penalty method and the Lagrangian method either exhibit periodic…

    Submitted 16 February, 2021; originally announced February 2021.

  35. arXiv:2012.11974 [pdf, other]

    eess.IV

    Complementary Time-Frequency Domain Networks for Dynamic Parallel MR Image Reconstruction

    Authors: Chen Qin, Jinming Duan, Kerstin Hammernik, Jo Schlemper, Thomas Küstner, René Botnar, Claudia Prieto, Anthony N. Price, Joseph V. Hajnal, Daniel Rueckert

    Abstract: Purpose: To introduce a novel deep learning based approach for fast and high-quality dynamic multi-coil MR reconstruction by learning a complementary time-frequency domain network that exploits spatio-temporal correlations simultaneously from complementary domains. Theory and Methods: Dynamic parallel MR image reconstruction is formulated as a multi-variable minimisation problem, where the data…

    Submitted 18 June, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: Accepted by Magnetic Resonance in Medicine

  36. arXiv:2012.06458 [pdf, other]

    math.OC eess.SY

    On Training Effective Reinforcement Learning Agents for Real-time Power Grid Operation and Control

    Authors: Ruisheng Diao, Di Shi, Bei Zhang, Siqi Wang, Haifeng Li, Chunlei Xu, Tu Lan, Desong Bian, Jiajun Duan

    Abstract: Deriving fast and effectively coordinated control actions remains a grand challenge affecting the secure and economic operation of today's large-scale power grid. This paper presents a novel artificial intelligence (AI) based methodology to achieve multi-objective real-time power grid control for real-world implementation. State-of-the-art off-policy reinforcement learning (RL) algorithm, soft act…

    Submitted 11 December, 2020; originally announced December 2020.

  37. arXiv:2009.04395 [pdf, other]

    cs.LG eess.SP

    Automated Model Selection for Time-Series Anomaly Detection

    Authors: Yuanxiang Ying, Juanyong Duan, Chunlei Wang, Yujing Wang, Congrui Huang, Bixiong Xu

    Abstract: Time-series anomaly detection is a popular topic in both academia and industrial fields. Many companies need to monitor thousands of temporal signals for their applications and services and require instant feedback and alerts for potential incidents in time. The task is challenging because of the complex characteristics of time-series, which are messy, stochastic, and often without proper labels.…

    Submitted 25 August, 2020; originally announced September 2020.

  38. arXiv:2007.06810 [pdf]

    eess.SY cs.GT cs.LG

    Ternary Policy Iteration Algorithm for Nonlinear Robust Control

    Authors: Jie Li, Shengbo Eben Li, Yang Guan, Jingliang Duan, Wenyu Li, Yuming Yin

    Abstract: The uncertainties in plant dynamics remain a challenge for nonlinear control problems. This paper develops a ternary policy iteration (TPI) algorithm for solving nonlinear robust control problems with bounded uncertainties. The controller and uncertainty of the system are considered as game players, and the robust control problem is formulated as a two-player zero-sum differential game. In order t…

    Submitted 14 July, 2020; originally announced July 2020.

  39. arXiv:2007.05993 [pdf, other]

    eess.IV cs.CV

    Deep Network Interpolation for Accelerated Parallel MR Image Reconstruction

    Authors: Chen Qin, Jo Schlemper, Kerstin Hammernik, Jinming Duan, Ronald M Summers, Daniel Rueckert

    Abstract: We present a deep network interpolation strategy for accelerated parallel MR image reconstruction. In particular, we examine the network interpolation in parameter space between a source model that is formulated in an unrolled scheme with L1 and SSIM losses and its counterpart that is trained with an adversarial loss. We show that by interpolating between the two different models of the same netwo…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Presented at 2020 ISMRM Conference & Exhibition (Abstract #4958)

  40. arXiv:2007.02070 [pdf, other]

    eess.SY

    Continuous-time finite-horizon ADP for automated vehicle controller design with high efficiency

    Authors: Ziyu Lin, Jingliang Duan, Shengbo Eben Li, Haitong Ma, Yuming Yin

    Abstract: The design of an automated vehicle controller can be generally formulated into an optimal control problem. This paper proposes a continuous-time finite-horizon approximate dynamic programming (ADP) method, which can synthesize off-line near-optimal control policy with analytical vehicle dynamics. Lying on the general Policy Iteration framework, it employs value and policy neural networks to approxima…

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: 7 pages, conference

  41. Hierarchical Reinforcement Learning for Self-Driving Decision-Making without Reliance on Labeled Driving Data

    Authors: Jingliang Duan, Shengbo Eben Li, Yang Guan, Qi Sun, Bo Cheng

    Abstract: Decision making for self-driving cars is usually tackled by manually encoding rules from drivers' behaviors or imitating drivers' manipulation using supervised learning techniques. Both of them rely on mass driving data to cover all possible driving scenarios. This paper presents a hierarchical reinforcement learning method for decision making of self-driving cars, which does not depend on a large…

    Submitted 27 January, 2020; originally announced January 2020.

    Journal ref: IET Intelligent Transport Systems, 2020, 14(5): 297-305

  42. arXiv:2001.02811  [pdf, other]

    cs.LG cs.AI eess.SY

    Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors

    Authors: Jingliang Duan, Yang Guan, Shengbo Eben Li, Yangang Ren, Bo Cheng

    Abstract: In reinforcement learning (RL), function approximation errors are known to easily lead to Q-value overestimations, thus greatly reducing policy performance. This paper presents a distributional soft actor-critic (DSAC) algorithm, which is an off-policy RL method for the continuous control setting, to improve policy performance by mitigating Q-value overestimations. We first discover in theory…

    Submitted 11 June, 2021; v1 submitted 8 January, 2020; originally announced January 2020.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2021

  43. arXiv:1912.09278  [pdf, other]

    eess.IV cs.CV cs.LG

    $Σ$-net: Systematic Evaluation of Iterative Deep Neural Networks for Fast Parallel MR Image Reconstruction

    Authors: Kerstin Hammernik, Jo Schlemper, Chen Qin, Jinming Duan, Ronald M. Summers, Daniel Rueckert

    Abstract: Purpose: To systematically investigate the influence of various data consistency layers, (semi-)supervised learning and ensembling strategies, defined in a $Σ$-net, for accelerated parallel MR image reconstruction using deep learning. Theory and Methods: MR image reconstruction is formulated as a learned unrolled optimization scheme with a Down-Up network as regularization and varying data consist…

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: Submitted to Magnetic Resonance in Medicine

  44. arXiv:1912.05480  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    $Σ$-net: Ensembled Iterative Deep Neural Networks for Accelerated Parallel MR Image Reconstruction

    Authors: Jo Schlemper, Chen Qin, Jinming Duan, Ronald M. Summers, Kerstin Hammernik

    Abstract: We explore an ensembled $Σ$-net for fast parallel MR imaging, including parallel coil networks, which perform implicit coil weighting, and sensitivity networks, involving explicit sensitivity maps. The networks in $Σ$-net are trained in a supervised way, including content and GAN losses, and with various ways of data consistency, i.e., proximal mappings, gradient descent and variable splitting. A…

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: fastMRI challenge submission (team: holykspace)

  45. arXiv:1911.11397  [pdf, other]

    eess.SY cs.LG math.OC

    Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints

    Authors: Jingliang Duan, Zhengyu Liu, Shengbo Eben Li, Qi Sun, Zhenzhong Jia, Bo Cheng

    Abstract: This paper presents a constrained adaptive dynamic programming (CADP) algorithm to solve general nonlinear nonaffine optimal control problems with known dynamics. Unlike previous ADP algorithms, it can directly deal with problems with state constraints. Firstly, a constrained generalized policy iteration (CGPI) framework is developed to handle state constraints by transforming the traditional poli…

    Submitted 8 April, 2022; v1 submitted 26 November, 2019; originally announced November 2019.

    Journal ref: Neurocomputing 484 (2022) 128-141

  46. arXiv:1911.04263  [pdf]

    eess.SP

    AI-Based Autonomous Line Flow Control via Topology Adjustment for Maximizing Time-Series ATCs

    Authors: Tu Lan, Jiajun Duan, Bei Zhang, Di Shi, Zhiwei Wang, Ruisheng Diao, Xiaohu Zhang

    Abstract: This paper presents a novel AI-based approach for maximizing time-series available transfer capabilities (ATCs) via autonomous topology control considering various practical constraints and uncertainties. Several AI techniques including supervised learning and deep reinforcement learning (DRL) are adopted and improved to train effective AI agents for achieving the desired performance. First, imita…

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: The paper has been submitted to IEEE PES GM 2020

  47. arXiv:1911.03723  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM

    Deep learning for cardiac image segmentation: A review

    Authors: Chen Chen, Chen Qin, Huaqi Qiu, Giacomo Tarroni, Jinming Duan, Wenjia Bai, Daniel Rueckert

    Abstract: Deep learning has become the most widely used approach for cardiac image segmentation in recent years. In this paper, we provide a review of over 100 cardiac image segmentation papers using deep learning, which covers common imaging modalities including magnetic resonance imaging (MRI), computed tomography (CT), and ultrasound (US) and major anatomical structures of interest (ventricles, atria and…

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: Under review

  48. arXiv:1909.11795  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Data consistency networks for (calibration-less) accelerated parallel MR image reconstruction

    Authors: Jo Schlemper, Jinming Duan, Cheng Ouyang, Chen Qin, Jose Caballero, Joseph V. Hajnal, Daniel Rueckert

    Abstract: We present simple reconstruction networks for multi-coil data by extending deep cascade of CNN's and exploiting the data consistency layer. In particular, we propose two variants, where one is inspired by POCSENSE and the other is calibration-less. We show that the proposed approaches are competitive relative to the state of the art both quantitatively and qualitatively.

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: Presented at ISMRM 27th Annual Meeting & Exhibition (Abstract #4663)

  49. arXiv:1909.10995  [pdf, other]

    cs.LG eess.IV stat.ML

    dAUTOMAP: decomposing AUTOMAP to achieve scalability and enhance performance

    Authors: Jo Schlemper, Ilkay Oksuz, James R. Clough, Jinming Duan, Andrew P. King, Julia A. Schnabel, Joseph V. Hajnal, Daniel Rueckert

    Abstract: AUTOMAP is a promising generalized reconstruction approach, however, it is not scalable and hence the practicality is limited. We present dAUTOMAP, a novel way for decomposing the domain transformation of AUTOMAP, making the model scale linearly. We show dAUTOMAP outperforms AUTOMAP with significantly fewer parameters.

    Submitted 25 September, 2019; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: Presented at ISMRM 27th Annual Meeting & Exhibition (Abstract #658)

  50. Relaxed Actor-Critic with Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems

    Authors: Jingliang Duan, Jie Li, Qiang Ge, Shengbo Eben Li, Monimoy Bujarbaruah, Fei Ma, Dezhao Zhang

    Abstract: This paper presents the Relaxed Continuous-Time Actor-critic (RCTAC) algorithm, a method for finding the nearly optimal policy for nonlinear continuous-time (CT) systems with known dynamics and infinite horizon, such as the path-tracking control of vehicles. RCTAC has several advantages over existing adaptive dynamic programming algorithms for CT systems. It does not require the "admissibility" o…

    Submitted 30 March, 2023; v1 submitted 11 September, 2019; originally announced September 2019.

    Journal ref: IEEE Transactions on Intelligent Vehicles, 2023 (Early Access)