Skip to main content

Showing 1–24 of 24 results for author: Ding, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.13550  [pdf, other

    cs.CV eess.IV

    Pointsoup: High-Performance and Extremely Low-Decoding-Latency Learned Geometry Codec for Large-Scale Point Cloud Scenes

    Authors: Kang You, Kai Liu, Li Yu, Pan Gao, Dandan Ding

    Abstract: Despite considerable progress being achieved in point cloud geometry compression, there still remains a challenge in effectively compressing large-scale scenes with sparse surfaces. Another key challenge lies in reducing decoding latency, a crucial requirement in real-world application. In this paper, we propose Pointsoup, an efficient learning-based geometry codec that attains high-performance an… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  2. arXiv:2403.08200  [pdf, ps, other

    eess.SY eess.SP

    Prototy** and Experimental Results for Environment-Aware Millimeter Wave Beam Alignment via Channel Knowledge Map

    Authors: Zhuoyin Dai, Di Wu, Zhenjun Dong, Kun Li, Dingyang Ding, Sihan Wang, Yong Zeng

    Abstract: Channel knowledge map (CKM), which aims to directly reflect the intrinsic channel properties of the local wireless environment, is a novel technique for achieving environmentaware communication. In this paper, to alleviate the large training overhead in millimeter wave (mmWave) beam alignment, an environment-aware and training-free beam alignment prototype is established based on a typical CKM, te… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  3. arXiv:2401.11615  [pdf, other

    eess.IV

    Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding

    Authors: Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma

    Abstract: While convolution and self-attention are extensively used in learned image compression (LIC) for transform coding, this paper proposes an alternative called Contextual Clustering based LIC (CLIC) which primarily relies on clustering operations and local attention for correlation characterization and compact representation of an image. As seen, CLIC expands the receptive field into the entire image… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  4. arXiv:2401.03115  [pdf, other

    cs.CV cs.MM eess.IV

    Transferable Learned Image Compression-Resistant Adversarial Perturbations

    Authors: Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen

    Abstract: Adversarial attacks can readily disrupt the image classification system, revealing the vulnerability of DNN-based recognition tasks. While existing adversarial perturbations are primarily applied to uncompressed images or compressed images by the traditional image compression method, i.e., JPEG, limited studies have investigated the robustness of models for image classification in the context of D… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted as poster at Data Compression Conference 2024 (DCC 2024)

  5. arXiv:2312.17194  [pdf, other

    math.OC cs.LG eess.SY

    Resilient Constrained Reinforcement Learning

    Authors: Dongsheng Ding, Zhengyan Huan, Alejandro Ribeiro

    Abstract: We study a class of constrained reinforcement learning (RL) problems in which multiple constraint specifications are not identified before training. It is challenging to identify appropriate constraint specifications due to the undefined trade-off between the reward maximization objective and the constraint satisfaction, which is ubiquitous in constrained decision-making. To tackle this issue, we… ▽ More

    Submitted 29 December, 2023; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 42 pages, 25 figures; HTML converted

  6. arXiv:2311.18103  [pdf, other

    eess.IV cs.CV

    Corner-to-Center Long-range Context Model for Efficient Learned Image Compression

    Authors: Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen

    Abstract: In the framework of learned image compression, the context model plays a pivotal role in capturing the dependencies among latent representations. To reduce the decoding time resulting from the serial autoregressive context model, the parallel context model has been proposed as an alternative that necessitates only two passes during the decoding phase, thus facilitating efficient image compression… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  7. arXiv:2306.11700  [pdf, other

    math.OC cs.LG eess.SY

    Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

    Authors: Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Alejandro Ribeiro

    Abstract: We study the problem of computing an optimal policy of an infinite-horizon discounted constrained Markov decision process (constrained MDP). Despite the popularity of Lagrangian-based policy search methods used in practice, the oscillation of policy iterates in these methods has not been fully understood, bringing out issues such as violation of constraints and sensitivity to hyper-parameters. To… ▽ More

    Submitted 16 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 65 pages, 17 figures, and 1 table; NeurIPS 2023

  8. arXiv:2306.00212  [pdf, ps, other

    cs.LG cs.AI eess.SY math.OC

    Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning

    Authors: Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo R. Jovanović

    Abstract: We examine online safe multi-agent reinforcement learning using constrained Markov games in which agents compete by maximizing their expected total rewards under a constraint on expected total utilities. Our focus is confined to an episodic two-player zero-sum constrained Markov game with independent transition functions that are unknown to agents, adversarial reward functions, and stochastic util… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 59 pages, a full version of the main paper in the 5th Annual Conference on Learning for Dynamics and Control

  9. arXiv:2305.08078  [pdf, other

    eess.IV cs.CV

    Supervised Domain Adaptation for Recognizing Retinal Diseases from Wide-Field Fundus Images

    Authors: Qijie Wei, **gyuan Yang, Bo Wang, **rui Wang, Jianchun Zhao, Xinyu Zhao, Sheng Yang, Niranchana Manivannan, Youxin Chen, Dayong Ding, **g Zhou, Xirong Li

    Abstract: This paper addresses the emerging task of recognizing multiple retinal diseases from wide-field (WF) and ultra-wide-field (UWF) fundus images. For an effective use of existing large amount of labeled color fundus photo (CFP) data and the relatively small amount of WF and UWF data, we propose a supervised domain adaptation method named Cross-domain Collaborative Learning (CdCL). Inspired by the suc… ▽ More

    Submitted 23 October, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted by BIBM2023

  10. arXiv:2303.12917  [pdf, other

    eess.IV

    Lossless Point Cloud Attribute Compression Using Cross-scale, Cross-group, and Cross-color Prediction

    Authors: Jianqiang Wang, Dandan Ding, Zhan Ma

    Abstract: This work extends the multiscale structure originally developed for point cloud geometry compression to point cloud attribute compression. To losslessly encode the attribute while maintaining a low bitrate, accurate probability prediction is critical. With this aim, we extensively exploit cross-scale, cross-group, and cross-color correlations of point cloud attribute to ensure accurate probability… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 10 pages

  11. Using Learned Indexes to Improve Time Series Indexing Performance on Embedded Sensor Devices

    Authors: David Ding, Ivan Carvalho, Ramon Lawrence

    Abstract: Efficiently querying data on embedded sensor and IoT devices is challenging given the very limited memory and CPU resources. With the increasing volumes of collected data, it is critical to process, filter, and manipulate data on the edge devices where it is collected to improve efficiency and reduce network transmissions. Existing embedded index structures do not adapt to the data distribution an… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: To appear in SENSORNETS 2023

    Journal ref: Proceedings of the 12th International Conference on Sensor Networks (SENSORNETS 2023), pages 23-31

  12. arXiv:2301.12165  [pdf, other

    cs.CV eess.IV

    Dynamic Point Cloud Geometry Compression Using Multiscale Inter Conditional Coding

    Authors: Jianqiang Wang, Dandan Ding, Hao Chen, Zhan Ma

    Abstract: This work extends the Multiscale Sparse Representation (MSR) framework developed for static Point Cloud Geometry Compression (PCGC) to support the dynamic PCGC through the use of multiscale inter conditional coding. To this end, the reconstruction of the preceding Point Cloud Geometry (PCG) frame is progressively downscaled to generate multiscale temporal priors which are then scale-wise transferr… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

    Comments: 5 pages

  13. arXiv:2209.11456  [pdf, other

    eess.IV cs.CV

    Segmentation-based Information Extraction and Amalgamation in Fundus Images for Glaucoma Detection

    Authors: Yanni Wang, Gang Yang, Dayong Ding, Jianchun Zao

    Abstract: Glaucoma is a severe blinding disease, for which automatic detection methods are urgently needed to alleviate the scarcity of ophthalmologists. Many works have proposed to employ deep learning methods that involve the segmentation of optic disc and cup for glaucoma detection, in which the segmentation process is often considered merely as an upstream sub-task. The relationship between fundus image… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

  14. arXiv:2209.08276  [pdf, other

    cs.CV eess.IV

    CARNet:Compression Artifact Reduction for Point Cloud Attribute

    Authors: Dandan Ding, Junzhe Zhang, Jianqiang Wang, Zhan Ma

    Abstract: A learning-based adaptive loop filter is developed for the Geometry-based Point Cloud Compression (G-PCC) standard to reduce attribute compression artifacts. The proposed method first generates multiple Most-Probable Sample Offsets (MPSOs) as potential compression distortion approximations, and then linearly weights them for artifact mitigation. As such, we drive the filtered reconstruction as clo… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 13pages, 8figures

  15. arXiv:2206.09339  [pdf, other

    cs.IT eess.SP

    Channel Estimation for Delay Alignment Modulation

    Authors: Dingyang Ding, Yong Zeng

    Abstract: Delay alignment modulation (DAM) is a promising technology for inter-symbol interference (ISI)-free communication without relying on sophisticated channel equalization or multi-carrier transmissions. The key ideas of DAM are delay precompensation and path-based beamforming, so that the multi-path signal components will arrive at the receiver simultaneously and constructively, rather than causing t… ▽ More

    Submitted 31 March, 2023; v1 submitted 19 June, 2022; originally announced June 2022.

  16. arXiv:2206.02346  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs

    Authors: Dongsheng Ding, Kaiqing Zhang, Jiali Duan, Tamer Başar, Mihailo R. Jovanović

    Abstract: We study sequential decision making problems aimed at maximizing the expected total reward while satisfying a constraint on the expected total utility. We employ the natural policy gradient method to solve the discounted infinite-horizon optimal control problem for Constrained Markov Decision Processes (constrained MDPs). Specifically, we propose a new Natural Policy Gradient Primal-Dual (NPG-PD)… ▽ More

    Submitted 17 October, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 72 pages, 4 figures, 2 tables; revised sample complexity and computational experiments, and added zero constraint violation

  17. arXiv:2111.11289  [pdf, other

    cs.IT eess.SP

    Environment-Aware Beam Selection for IRS-Aided Communication with Channel Knowledge Map

    Authors: Dingyang Ding, Di Wu, Yong Zeng, Shi **, Rui Zhang

    Abstract: Intelligent reflecting surface (IRS)-aided communication is a promising technology for beyond 5G (B5G) systems, to reconfigure the radio environment proactively. However, IRS-aided communication in practice requires efficient channel estimation or passive beam training, whose overhead and complexity increase drastically with the number of reflecting elements/beam directions. To tackle this challen… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

  18. arXiv:2111.10633  [pdf, other

    cs.CV eess.IV

    Sparse Tensor-based Multiscale Representation for Point Cloud Geometry Compression

    Authors: Jianqiang Wang, Dandan Ding, Zhu Li, Xiaoxing Feng, Chuntong Cao, Zhan Ma

    Abstract: This study develops a unified Point Cloud Geometry (PCG) compression method through the processing of multiscale sparse tensor-based voxelized PCG. We call this compression method SparsePCGC. The proposed SparsePCGC is a low complexity solution because it only performs the convolutions on sparsely-distributed Most-Probable Positively-Occupied Voxels (MP-POV). The multiscale representation also all… ▽ More

    Submitted 21 October, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: 17 pages, 15 figures

  19. arXiv:2107.14795  [pdf, other

    cs.LG cs.CL cs.CV cs.SD eess.AS

    Perceiver IO: A General Architecture for Structured Inputs & Outputs

    Authors: Andrew Jaegle, Sebastian Borgeaud, Jean-Baptiste Alayrac, Carl Doersch, Catalin Ionescu, David Ding, Skanda Koppula, Daniel Zoran, Andrew Brock, Evan Shelhamer, Olivier Hénaff, Matthew M. Botvinick, Andrew Zisserman, Oriol Vinyals, Joāo Carreira

    Abstract: A central goal of machine learning is the development of systems that can solve many problems in as many data domains as possible. Current architectures, however, cannot be applied beyond a small set of stereotyped settings, as they bake in domain & task assumptions or scale poorly to large inputs or outputs. In this work, we propose Perceiver IO, a general-purpose architecture that handles data f… ▽ More

    Submitted 15 March, 2022; v1 submitted 30 July, 2021; originally announced July 2021.

    Comments: ICLR 2022 camera ready. Code: https://dpmd.ai/perceiver-code

  20. arXiv:2101.06341  [pdf, other

    eess.IV

    Advances In Video Compression System Using Deep Neural Network: A Review And Case Studies

    Authors: Dandan Ding, Zhan Ma, Di Chen, Qingshuang Chen, Zoe Liu, Fengqing Zhu

    Abstract: Significant advances in video compression system have been made in the past several decades to satisfy the nearly exponential growth of Internet-scale video traffic. From the application perspective, we have identified three major functional blocks including pre-processing, coding, and post-processing, that have been continuously investigated to maximize the end-user quality of experience (QoE) un… ▽ More

    Submitted 15 January, 2021; originally announced January 2021.

  21. arXiv:2012.00650  [pdf, other

    cs.CV eess.IV eess.SP

    Decomposition, Compression, and Synthesis (DCS)-based Video Coding: A Neural Exploration via Resolution-Adaptive Learning

    Authors: Ming Lu, Tong Chen, Dandan Ding, Fengqing Zhu, Zhan Ma

    Abstract: Inspired by the facts that retinal cells actually segregate the visual scene into different attributes (e.g., spatial details, temporal motion) for respective neuronal processing, we propose to first decompose the input video into respective spatial texture frames (STF) at its native spatial resolution that preserve the rich spatial details, and the other temporal motion frames (TMF) at a lower sp… ▽ More

    Submitted 15 January, 2024; v1 submitted 1 December, 2020; originally announced December 2020.

  22. arXiv:2011.03799  [pdf, other

    eess.IV cs.CV

    Multiscale Point Cloud Geometry Compression

    Authors: Jianqiang Wang, Dandan Ding, Zhu Li, Zhan Ma

    Abstract: Recent years have witnessed the growth of point cloud based applications because of its realistic and fine-grained representation of 3D objects and scenes. However, it is a challenging problem to compress sparse, unstructured, and high-precision 3D points for efficient communication. In this paper, leveraging the sparsity nature of point cloud, we propose a multiscale end-to-end learning framework… ▽ More

    Submitted 7 November, 2020; originally announced November 2020.

  23. arXiv:1910.00783  [pdf, other

    math.OC cs.LG eess.SY

    Global exponential stability of primal-dual gradient flow dynamics based on the proximal augmented Lagrangian: A Lyapunov-based approach

    Authors: Dongsheng Ding, Mihailo R. Jovanović

    Abstract: For a class of nonsmooth composite optimization problems with linear equality constraints, we utilize a Lyapunov-based approach to establish the global exponential stability of the primal-dual gradient flow dynamics based on the proximal augmented Lagrangian. The result holds when the differentiable part of the objective function is strongly convex with a Lipschitz continuous gradient; the non-dif… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: 6 pages, 3 figures

  24. arXiv:1907.12023  [pdf, other

    eess.IV cs.CV

    Two-Stream CNN with Loose Pair Training for Multi-modal AMD Categorization

    Authors: Weisen Wang, Zhiyan Xu, Weihong Yu, Jianchun Zhao, **gyuan Yang, Feng He, Zhikun Yang, Di Chen, Dayong Ding, Youxin Chen, Xirong Li

    Abstract: This paper studies automated categorization of age-related macular degeneration (AMD) given a multi-modal input, which consists of a color fundus image and an optical coherence tomography (OCT) image from a specific eye. Previous work uses a traditional method, comprised of feature extraction and classifier training that cannot be optimized jointly. By contrast, we propose a two-stream convolution… ▽ More

    Submitted 28 July, 2019; originally announced July 2019.

    Comments: accepted by MICCAI 2019