Skip to main content

Showing 1–43 of 43 results for author: Le, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.00021  [pdf, other

    cs.CV cs.GR eess.IV

    Neural Graphics Texture Compression Supporting Random Acces

    Authors: Farzad Farhadzadeh, Qiqi Hou, Hoang Le, Amir Said, Randall Rauwendaal, Alex Bourd, Fatih Porikli

    Abstract: Advances in rendering have led to tremendous growth in texture assets, including resolution, complexity, and novel textures components, but this growth in data volume has not been matched by advances in its compression. Meanwhile Neural Image Compression (NIC) has advanced significantly and shown promising results, but the proposed methods cannot be directly adapted to neural texture compression.… ▽ More

    Submitted 6 May, 2024; originally announced July 2024.

    Comments: ECCV submission

  2. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  3. arXiv:2405.01979  [pdf, other

    cs.IT eess.SP

    Graph Neural Network based Active and Passive Beamforming for Distributed STAR-RIS-Assisted Multi-User MISO Systems

    Authors: Ha An Le, Trinh Van Chien, Wan Choi

    Abstract: This paper investigates a joint active and passive beamforming design for distributed simultaneous transmitting and reflecting (STAR) reconfigurable intelligent surface (RIS) assisted multi-user (MU)- mutiple input single output (MISO) systems, where the energy splitting (ES) mode is considered for the STAR-RIS. We aim to design the active beamforming vectors at the base station (BS) and the passi… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 13 pages, 7 figures

  4. arXiv:2405.00681  [pdf, other

    eess.SP cs.IT cs.NI eess.SY

    Delay and Overhead Efficient Transmission Scheduling for Federated Learning in UAV Swarms

    Authors: Duc N. M. Hoang, Vu Tuan Truong, Hung Duy Le, Long Bao Le

    Abstract: This paper studies the wireless scheduling design to coordinate the transmissions of (local) model parameters of federated learning (FL) for a swarm of unmanned aerial vehicles (UAVs). The overall goal of the proposed design is to realize the FL training and aggregation processes with a central aggregator exploiting the sensory data collected by the UAVs but it considers the multi-hop wireless net… ▽ More

    Submitted 22 February, 2024; originally announced May 2024.

    Comments: accepted to WCNC'24

  5. arXiv:2403.17879  [pdf, other

    cs.CV eess.IV

    Low-Latency Neural Stereo Streaming

    Authors: Qiqi Hou, Farzad Farhadzadeh, Amir Said, Guillaume Sautiere, Hoang Le

    Abstract: The rise of new video modalities like virtual reality or autonomous driving has increased the demand for efficient multi-view video compression methods, both in terms of rate-distortion (R-D) performance and in terms of delay and runtime. While most recent stereo video compression approaches have shown promising performance, they compress left and right views sequentially, leading to poor parallel… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  6. arXiv:2401.05915  [pdf, other

    eess.IV

    Neural Implicit Surface Reconstruction of Freehand 3D Ultrasound Volume with Geometric Constraints

    Authors: Hongbo Chen, Logiraj Kumaralingam, Shuhang Zhang, Sheng Song, Fayi Zhang, Haibin Zhang, Thanh-Tu Pham, Edmond H. M. Lou, Kumaradevan Punithakumar, Lawrence H. Le, Rui Zheng

    Abstract: Three-dimensional (3D) freehand ultrasound (US) is a widely used imaging modality that allows non-invasive imaging of medical anatomy without radiation exposure. The surface reconstruction of US volume is vital to acquire the accurate anatomical structures needed for modeling, registration, and visualization. However, traditional methods cannot produce a high-quality surface due to image noise. De… ▽ More

    Submitted 1 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: Preprint

  7. arXiv:2401.03754  [pdf, other

    cs.IT eess.SP

    Joint Power Allocation and User Scheduling in Integrated Satellite-Terrestrial Cell-Free Massive MIMO IoT Systems

    Authors: Trinh Van Chien, Ha An Le, Ta Hai Tung, Hien Quoc Ngo, Symeon Chatzinotas

    Abstract: Both space and ground communications have been proven effective solutions under different perspectives in Internet of Things (IoT) networks. This paper investigates multiple-access scenarios, where plenty of IoT users are cooperatively served by a satellite in space and access points (APs) on the ground. Available users in each coherence interval are split into scheduled and unscheduled subsets to… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 15 pages, 10 figures, 1 table. Submitted for publication

  8. arXiv:2312.00921  [pdf, ps, other

    eess.IV cs.IT

    Bitstream Organization for Parallel Entropy Coding on Neural Network-based Video Codecs

    Authors: Amir Said, Hoang Le, Farzad Farhadzadeh

    Abstract: Video compression systems must support increasing bandwidth and data throughput at low cost and power, and can be limited by entropy coding bottlenecks. Efficiency can be greatly improved by parallelizing coding, which can be done at much larger scales with new neural-based codecs, but with some compression loss related to data organization. We analyze the bit rate overhead needed to support multi… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Journal ref: Proc. IEEE International Conference on Multimedia, Dec. 2023

  9. arXiv:2311.11096  [pdf, other

    eess.IV cs.CV

    On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation

    Authors: Duy Minh Ho Nguyen, Tan Ngoc Pham, Nghiem Tuong Diep, Nghi Quoc Phan, Quang Pham, Vinh Tong, Binh T. Nguyen, Ngan Hoang Le, Nhat Ho, Pengtao Xie, Daniel Sonntag, Mathias Niepert

    Abstract: Constructing a robust model that can effectively generalize to test samples under distribution shifts remains a significant challenge in the field of medical imaging. The foundational models for vision and language, pre-trained on extensive sets of natural image and text data, have emerged as a promising approach. It showcases impressive learning abilities across different tasks with the need for… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Advances in Neural Information Processing Systems (NeurIPS) 2023, Workshop on robustness of zero/few-shot learning in foundation models

  10. arXiv:2310.01258  [pdf, other

    eess.IV cs.CV cs.LG

    MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device

    Authors: Ties van Rozendaal, Tushar Singhal, Hoang Le, Guillaume Sautiere, Amir Said, Krishna Buska, Anjuman Raha, Dimitris Kalatzis, Hitarth Mehta, Frank Mayer, Liang Zhang, Markus Nagel, Auke Wiggers

    Abstract: Neural video codecs have recently become competitive with standard codecs such as HEVC in the low-delay setting. However, most neural codecs are large floating-point networks that use pixel-dense war** operations for temporal modeling, making them too computationally expensive for deployment on mobile devices. Recent work has demonstrated that running a neural decoder in real time on mobile is f… ▽ More

    Submitted 15 November, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Matches version published at WACV 2024

  11. arXiv:2309.06956  [pdf, other

    eess.IV cs.AI cs.LG

    Implicit Neural Multiple Description for DNA-based data storage

    Authors: Trung Hieu Le, Xavier Pic, Jeremy Mateos, Marc Antonini

    Abstract: DNA exhibits remarkable potential as a data storage solution due to its impressive storage density and long-term stability, stemming from its inherent biomolecular structure. However, develo** this novel medium comes with its own set of challenges, particularly in addressing errors arising from storage and biological manipulations. These challenges are further conditioned by the structural const… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Xavier Pic and Trung Hieu Le are both equal contributors and primary authors

  12. arXiv:2309.05472  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech

    Authors: Titouan Parcollet, Ha Nguyen, Solene Evain, Marcely Zanon Boito, Adrien Pupier, Salima Mdhaffar, Hang Le, Sina Alisamir, Natalia Tomashenko, Marco Dinarelli, Shucong Zhang, Alexandre Allauzen, Maximin Coavoux, Yannick Esteve, Mickael Rouvier, Jerome Goulian, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

    Abstract: Self-supervised learning (SSL) is at the origin of unprecedented improvements in many different domains including computer vision and natural language processing. Speech processing drastically benefitted from SSL as most of the current domain-related tasks are now being approached with pre-trained models. This work introduces LeBenchmark 2.0 an open-source framework for assessing and building SSL-… ▽ More

    Submitted 18 March, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Published in Computer Science and Language. Preprint allowed

  13. Double RIS-Assisted MIMO Systems Over Spatially Correlated Rician Fading Channels and Finite Scatterers

    Authors: Ha An Le, Trinh Van Chien, Van Duc Nguyen, Wan Choi

    Abstract: This paper investigates double RIS-assisted MIMO communication systems over Rician fading channels with finite scatterers, spatial correlation, and the existence of a double-scattering link between the transceiver. First, the statistical information is driven in closed form for the aggregated channels, unveiling various influences of the system and environment on the average channel power gains. N… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 15 pages, 9 figures, accepted by IEEE Transactions on Communications

  14. arXiv:2307.04216  [pdf, other

    cs.LG cs.AI eess.IV

    Hierarchical Autoencoder-based Lossy Compression for Large-scale High-resolution Scientific Data

    Authors: Hieu Le, Jian Tao

    Abstract: Lossy compression has become an important technique to reduce data size in many domains. This type of compression is especially valuable for large-scale scientific data, whose size ranges up to several petabytes. Although Autoencoder-based models have been successfully leveraged to compress images and videos, such neural networks have not widely gained attention in the scientific data domain. Our… ▽ More

    Submitted 6 May, 2024; v1 submitted 9 July, 2023; originally announced July 2023.

    Comments: 14 pages

  15. arXiv:2306.13919  [pdf, other

    eess.IV

    INR-MDSQC: Implicit Neural Representation Multiple Description Scalar Quantization for robust image Coding

    Authors: Trung Hieu Le, Xavier Pic, Marc Antonini

    Abstract: Multiple Description Coding (MDC) is an error-resilient source coding method designed for transmission over noisy channels. We present a novel MDC scheme employing a neural network based on implicit neural representation. This involves overfitting the neural representation for images. Each description is transmitted along with model parameters and its respective latent spaces. Our method has advan… ▽ More

    Submitted 7 August, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: Accepted at IEEE MMSP 2023

  16. Multiple description video coding for real-time applications using HEVC

    Authors: Trung Hieu Le, Marc Antonini, Marc Lambert, Karima Alioua

    Abstract: Remote control vehicles require the transmission of large amounts of data, and video is one of the most important sources for the driver. To ensure reliable video transmission, the encoded video stream is transmitted simultaneously over multiple channels. However, this solution incurs a high transmission cost due to the wireless channel's unreliable and random bit loss characteristics. To address… ▽ More

    Submitted 7 August, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted at IEEE ICIP 2023

  17. arXiv:2301.08752  [pdf, ps, other

    eess.IV cs.IT cs.LG cs.MM

    Optimized learned entropy coding parameters for practical neural-based image and video compression

    Authors: Amir Said, Reza Pourreza, Hoang Le

    Abstract: Neural-based image and video codecs are significantly more power-efficient when weights and activations are quantized to low-precision integers. While there are general-purpose techniques for reducing quantization effects, large losses can occur when specific entropy coding properties are not considered. This work analyzes how entropy coding is affected by parameter quantizations, and provides a m… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: 2022 IEEE International Conference on Image Processing (ICIP)

    Journal ref: IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 2022, pp. 661-665

  18. Detecting COVID-19 from digitized ECG printouts using 1D convolutional neural networks

    Authors: Thao Nguyen, Hieu H. Pham, Huy Khiem Le, Anh Tu Nguyen, Ngoc Tien Thanh, Cuong Do

    Abstract: The COVID-19 pandemic has exposed the vulnerability of healthcare services worldwide, raising the need to develop novel tools to provide rapid and cost-effective screening and diagnosis. Clinical reports indicated that COVID-19 infection may cause cardiac injury, and electrocardiograms (ECG) may serve as a diagnostic biomarker for COVID-19. This study aims to utilize ECG signals to detect COVID-19… ▽ More

    Submitted 5 October, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Accepted with minor revision by Plos One

  19. arXiv:2208.04303  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Boosting neural video codecs by exploiting hierarchical redundancy

    Authors: Reza Pourreza, Hoang Le, Amir Said, Guillaume Sautiere, Auke Wiggers

    Abstract: In video compression, coding efficiency is improved by reusing pixels from previously decoded frames via motion and residual compensation. We define two levels of hierarchical redundancy in video frames: 1) first-order: redundancy in pixel space, i.e., similarities in pixel values across neighboring frames, which is effectively captured using motion and residual compensation, 2) second-order: redu… ▽ More

    Submitted 16 September, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: WACV 2023

  20. arXiv:2207.14459  [pdf, other

    cs.IT eess.SP

    Generalized BER of MCIK-OFDM with Imperfect CSI: Selection combining GD versus ML receivers

    Authors: Vu-Duc Ngo, Thien Van Luong, Nguyen Cong Luong, Minh-Tuan Le, Thi Thanh Huyen Le, Xuan-Nam Tran

    Abstract: This paper analyzes the bit error rate (BER) of multicarrier index keying - orthogonal frequency division multiplexing (MCIK-OFDM) with selection combining (SC) diversity reception. Particularly, we propose a generalized framework to derive the BER for both the low-complexity greedy detector (GD) and maximum likelihood (ML) detector. Based on this, closedform expressions for the BERs of MCIK-OFDM… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  21. arXiv:2207.14454  [pdf, other

    cs.IT eess.SP

    Enhancing Diversity of OFDM with Joint Spread Spectrum and Subcarrier Index Modulations

    Authors: Vu-Duc Ngo, Thien Van Luong, Nguyen Cong Luong, Mai Xuan Trang, Minh-Tuan Le, Thi Thanh Huyen Le, Xuan-Nam Tran

    Abstract: This paper proposes a novel spread spectrum and sub-carrier index modulation (SS-SIM) scheme, which is integrated to orthogonal frequency division multiplexing (OFDM) framework to enhance the diversity over the conventional IM schemes. Particularly, the resulting scheme, called SS-SIMOFDM, jointly employs both spread spectrum and sub-carrier index modulations to form a precoding vector which is th… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  22. arXiv:2207.08338  [pdf, other

    cs.CV cs.MM eess.IV

    MobileCodec: Neural Inter-frame Video Compression on Mobile Devices

    Authors: Hoang Le, Liang Zhang, Amir Said, Guillaume Sautiere, Yang Yang, Pranav Shrestha, Fei Yin, Reza Pourreza, Auke Wiggers

    Abstract: Realizing the potential of neural video codecs on mobile devices is a big technological challenge due to the computational complexity of deep networks and the power-constrained mobile hardware. We demonstrate practical feasibility by leveraging Qualcomm's technology and innovation, bridging the gap from neural network-based codec simulations running on wall-powered workstations, to real-time opera… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: ACM MMSys 2022

  23. arXiv:2207.08077  [pdf, other

    cs.IT eess.SP

    RIS-Assisted MIMO Communication Systems: Model-based versus Autoencoder Approaches

    Authors: Ha An Le, Trinh Van Chien, Van Duc Nguyen, Wan Choi

    Abstract: This paper considers reconfigurable intelligent surface (RIS)-assisted point-to-point multiple-input multiple-output (MIMO) communication systems, where a transmitter communicates with a receiver through an RIS. Based on the main target of reducing the bit error rate (BER) and therefore enhancing the communication reliability, we study different model-based and data-driven (autoencoder) approaches… ▽ More

    Submitted 17 July, 2022; originally announced July 2022.

    Comments: 6 pages, 3 figures, and 2 tables. Accepted to present at IEEE PIMRC 2022

  24. arXiv:2206.03778  [pdf, other

    cs.CV eess.IV

    Learning Digital Terrain Models from Point Clouds: ALS2DTM Dataset and Rasterization-based GAN

    Authors: Hoàng-Ân Lê, Florent Guiotte, Minh-Tan Pham, Sébastien Lefèvre, Thomas Corpetti

    Abstract: Despite the popularity of deep neural networks in various domains, the extraction of digital terrain models (DTMs) from airborne laser scanning (ALS) point clouds is still challenging. This might be due to the lack of dedicated large-scale annotated dataset and the data-structure discrepancy between point clouds and DTMs. To promote data-driven DTM extraction, this paper collects from open sources… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

  25. arXiv:2204.01346  [pdf, ps, other

    eess.SY

    Concurrent learning in high-order tuners for parameter identification

    Authors: Justin H. Le, Andrew R. Teel

    Abstract: High-order tuners are algorithms that show promise in achieving greater efficiency than classic gradient-based algorithms in identifying the parameters of parametric models and/or in facilitating the progress of a control or optimization algorithm whose adaptive behavior relies on such models. For high-order tuners, robust stability properties, namely uniform global asymptotic (and exponential) st… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

  26. DAM-AL: Dilated Attention Mechanism with Attention Loss for 3D Infant Brain Image Segmentation

    Authors: Dinh-Hieu Hoang, Gia-Han Diep, Minh-Triet Tran, Ngan T. H Le

    Abstract: While Magnetic Resonance Imaging (MRI) has played an essential role in infant brain analysis, segmenting MRI into a number of tissues such as gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) is crucial and complex due to the extremely low intensity contrast between tissues at around 6-9 months of age as well as amplified noise, myelination, and incomplete volume. In this paper, w… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

  27. arXiv:2107.00845  [pdf, other

    cs.NI eess.SP

    A Business Model for Resource Sharing in Cell-Free UAVs-Assisted Wireless Networks

    Authors: Yan Kyaw Tun, Yu Min Park, Tra Huong Thi Le, Zhu Han, Choong Seon Hong

    Abstract: Unmanned aerial vehicles (UAVs) are widely deployed to enhance the wireless network capacity and to provide communication services to mobile users beyond the infrastructure coverage. Recently, with the help of a promising technology called network virtualization, multiple service providers (SPs) can share the infrastructures and wireless resources owned by the mobile network operators (MNOs). Then… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: This paper has been submitted to IEEE Transactions on Vehicular Technology

  28. LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech

    Authors: Solene Evain, Ha Nguyen, Hang Le, Marcely Zanon Boito, Salima Mdhaffar, Sina Alisamir, Ziyi Tong, Natalia Tomashenko, Marco Dinarelli, Titouan Parcollet, Alexandre Allauzen, Yannick Esteve, Benjamin Lecouteux, Francois Portet, Solange Rossato, Fabien Ringeval, Didier Schwab, Laurent Besacier

    Abstract: Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored for image and natural language processing. Recent works also investigated SSL from speech. They were notably successful to improve performance on downstream tasks such as automatic speech recognition (ASR). While these works suggest it is possible to reduce dependence on labeled data for building efficient spee… ▽ More

    Submitted 10 June, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Will be presented at Interspeech 2021

    Journal ref: Proc. Interspeech 2021

  29. arXiv:2104.11414  [pdf, other

    eess.SY

    Passive soft-reset controllers for nonlinear systems

    Authors: Justin H. Le, Andrew R. Teel

    Abstract: Soft-reset controllers are introduced as a way to approximate hard-reset controllers. The focus is on implementing reset controllers that are (strictly) passive and on analyzing their interconnection with passive plants. A passive hard-reset controller that has a strongly convex energy function can be approximated as a soft-reset controller. A hard-reset controller is a hybrid system whereas a sof… ▽ More

    Submitted 22 September, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

  30. arXiv:2104.10307  [pdf, ps, other

    eess.SY

    Analyzing the Effect of Persistent Asset Switches on a Class of Hybrid-Inspired Optimization Algorithms

    Authors: Matina Baradaran, Justin H. Le, Andrew R. Teel

    Abstract: Convex optimization challenges are currently pervasive in many science and engineering domains. In many applications of convex optimization, such as those involving multi-agent systems and resource allocation, the objective function can persistently switch during the execution of an optimization algorithm. Motivated by such applications, we analyze the effect of persistently switching objectives i… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

  31. arXiv:2103.12350  [pdf, other

    eess.IV cs.CV

    Roughness Index and Roughness Distance for Benchmarking Medical Segmentation

    Authors: Vidhiwar Singh Rathour, Kashu Yamakazi, T. Hoang Ngan Le

    Abstract: Medical image segmentation is one of the most challenging tasks in medical image analysis and has been widely developed for many clinical applications. Most of the existing metrics have been first designed for natural images and then extended to medical images. While object surface plays an important role in medical segmentation and quantitative analysis i.e. analyze brain tumor surface, measure g… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Paper has been accepted at BIOIMAGING2021

  32. arXiv:2103.11893  [pdf, ps, other

    eess.SP cs.IT math.PR

    Thresholding Greedy Pursuit for Sparse Recovery Problems

    Authors: Hai Le, Alexei Novikov

    Abstract: We study here sparse recovery problems in the presence of additive noise. We analyze a thresholding version of the CoSaMP algorithm, named Thresholding Greedy Pursuit (TGP). We demonstrate that an appropriate choice of thresholding parameter, even without the knowledge of sparsity level of the signal and strength of the noise, can result in exact recovery with no false discoveries as the dimension… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

    Comments: First version

  33. arXiv:2103.11055  [pdf, other

    math.OC eess.SY

    Online Robust Control of Nonlinear Systems with Large Uncertainty

    Authors: Dimitar Ho, Hoang M. Le, John C. Doyle, Yisong Yue

    Abstract: Robust control is a core approach for controlling systems with performance guarantees that are robust to modeling error, and is widely used in real-world systems. However, current robust control approaches can only handle small system uncertainty, and thus require significant effort in system identification prior to controller design. We present an online approach that robustly controls a nonlinea… ▽ More

    Submitted 4 June, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: 58 pages, 5 figures

    Journal ref: Proceedings of The 24th International Conference on Artificial Intelligence and Statistics 2021, PMLR 130:3475-3483

  34. arXiv:2103.09042  [pdf, ps, other

    eess.IV cs.CV

    Invertible Residual Network with Regularization for Effective Medical Image Segmentation

    Authors: Kashu Yamazaki, Vidhiwar Singh Rathour, T. Hoang Ngan Le

    Abstract: Deep Convolutional Neural Networks (CNNs) i.e. Residual Networks (ResNets) have been used successfully for many computer vision tasks, but are difficult to scale to 3D volumetric medical data. Memory is increasingly often the bottleneck when training 3D Convolutional Neural Networks (CNNs). Recently, invertible neural networks have been applied to significantly reduce activation memory footprint w… ▽ More

    Submitted 16 March, 2021; originally announced March 2021.

  35. arXiv:2103.05115  [pdf, other

    eess.IV cs.CV cs.LG

    Deep reinforcement learning in medical imaging: A literature review

    Authors: S. Kevin Zhou, Hoang Ngan Le, Khoa Luu, Hien V. Nguyen, Nicholas Ayache

    Abstract: Deep reinforcement learning (DRL) augments the reinforcement learning framework, which learns a sequence of actions that maximizes the expected reward, with the representative power of deep neural networks. Recent works have demonstrated the great potential of DRL in medicine and healthcare. This paper presents a literature review of DRL in medical imaging. We start with a comprehensive tutorial o… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: 39 pages, 20 figures

  36. arXiv:2012.07627  [pdf, other

    eess.IV cs.LG

    Water Level Estimation Using Sentinel-1 Synthetic Aperture Radar Imagery And Digital Elevation Models

    Authors: Thai-Bao Duong-Nguyen, Thien-Nu Hoang, Phong Vo, Hoai-Bac Le

    Abstract: Hydropower dams and reservoirs have been identified as the main factors redefining natural hydrological cycles. Therefore, monitoring water status in reservoirs plays a crucial role in planning and managing water resources, as well as forecasting drought and flood. This task has been traditionally done by installing sensor stations on the ground nearby water bodies, which has multiple disadvantage… ▽ More

    Submitted 28 December, 2020; v1 submitted 11 December, 2020; originally announced December 2020.

  37. arXiv:2011.00747  [pdf, other

    cs.CL cs.SD eess.AS

    Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation

    Authors: Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab, Laurent Besacier

    Abstract: We introduce dual-decoder Transformer, a new model architecture that jointly performs automatic speech recognition (ASR) and multilingual speech translation (ST). Our models are based on the original Transformer architecture (Vaswani et al., 2017) but consist of two decoders, each responsible for one task (ASR or ST). Our major contribution lies in how these decoders interact with each other: one… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: Accepted at COLING 2020 (Oral)

    Journal ref: The 28th International Conference on Computational Linguistics (COLING 2020)

  38. arXiv:2009.13770  [pdf, other

    eess.SY

    Hybrid Heavy-Ball Systems: Reset Methods for Optimization with Uncertainty

    Authors: Justin H. Le, Andrew R. Teel

    Abstract: Momentum methods for convex optimization often rely on precise choices of algorithmic parameters, based on knowledge of problem parameters, in order to achieve fast convergence, as well as to prevent oscillations that could severely restrict applications of these algorithms to cyber-physical systems. To address these issues, we propose two dynamical systems, named the Hybrid Heavy-Ball System and… ▽ More

    Submitted 22 March, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

  39. arXiv:2008.06828  [pdf, other

    cs.CV cs.LG eess.IV

    A novel approach to remove foreign objects from chest X-ray images

    Authors: Hieu X. Le, Phuong D. Nguyen, Thang H. Nguyen, Khanh N. Q. Le, Thanh T. Nguyen

    Abstract: We initially proposed a deep learning approach for foreign objects inpainting in smartphone-camera captured chest radiographs utilizing the cheXphoto dataset. Foreign objects which can significantly affect the quality of a computer-aided diagnostic prediction are captured under various settings. In this paper, we used multi-method to tackle both removal and inpainting chest radiographs. Firstly, a… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: 9 pages, 7 figures, 7 tables

  40. arXiv:1905.10841  [pdf

    eess.IV cs.CV

    Utilizing Automated Breast Cancer Detection to Identify Spatial Distributions of Tumor Infiltrating Lymphocytes in Invasive Breast Cancer

    Authors: Han Le, Rajarsi Gupta, Le Hou, Shahira Abousamra, Danielle Fassler, Tahsin Kurc, Dimitris Samaras, Rebecca Batiste, Tianhao Zhao, Arvind Rao, Alison L. Van Dyke, Ashish Sharma, Erich Bremer, Jonas S. Almeida, Joel Saltz

    Abstract: Quantitative assessment of Tumor-TIL spatial relationships is increasingly important in both basic science and clinical aspects of breast cancer research. We have developed and evaluated convolutional neural network (CNN) analysis pipelines to generate combined maps of cancer regions and tumor infiltrating lymphocytes (TILs) in routine diagnostic breast cancer whole slide tissue images (WSIs). We… ▽ More

    Submitted 13 January, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

    Comments: The American Journal of Pathology

  41. arXiv:1905.02914  [pdf, other

    cs.RO cs.AI eess.SY

    Adaptive neural network based dynamic surface control for uncertain dual arm robots

    Authors: Dung Tien Pham, Thai Van Nguyen, Hai Xuan Le, Linh Nguyen, Nguyen Huu Thai, Tuan Anh Phan, Hai Tuan Pham, Anh Hoai Duong

    Abstract: The paper discusses an adaptive strategy to effectively control nonlinear manipulation motions of a dual arm robot (DAR) under system uncertainties including parameter variations, actuator nonlinearities and external disturbances. It is proposed that the control scheme is first derived from the dynamic surface control (DSC) method, which allows the robot's end-effectors to robustly track the desir… ▽ More

    Submitted 8 May, 2019; originally announced May 2019.

  42. A Control Lyapunov Perspective on Episodic Learning via Projection to State Stability

    Authors: Andrew J. Taylor, Victor D. Dorobantu, Meera Krishnamoorthy, Hoang M. Le, Yisong Yue, Aaron D. Ames

    Abstract: The goal of this paper is to understand the impact of learning on control synthesis from a Lyapunov function perspective. In particular, rather than consider uncertainties in the full system dynamics, we employ Control Lyapunov Functions (CLFs) as low-dimensional projections. To understand and characterize the uncertainty that these projected dynamics introduce in the system, we introduce a new no… ▽ More

    Submitted 17 March, 2019; originally announced March 2019.

  43. Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems

    Authors: Andrew J. Taylor, Victor D. Dorobantu, Hoang M. Le, Yisong Yue, Aaron D. Ames

    Abstract: Many modern nonlinear control methods aim to endow systems with guaranteed properties, such as stability or safety, and have been successfully applied to the domain of robotics. However, model uncertainty remains a persistent challenge, weakening theoretical guarantees and causing implementation failures on physical systems. This paper develops a machine learning framework centered around Control… ▽ More

    Submitted 4 March, 2019; originally announced March 2019.