Search | arXiv e-print repository

A Data and Model-Driven Deep Learning Approach to Robust Downlink Beamforming Optimization

Authors: Kai Liang, Gan Zheng, Zan Li, Kai-Kit Wong, Chan-Byoung Chae

Abstract: This paper investigates the optimization of the long-standing probabilistically robust transmit beamforming problem with channel uncertainties in the multiuser multiple-input single-output (MISO) downlink transmission. This problem poses significant analytical and computational challenges. Currently, the state-of-the-art optimization method relies on convex restrictions as tractable approximations to ensure robustness against Gaussian channel uncertainties. However, this method not only exhibits high computational complexity and suffers from the rank relaxation issue but also yields conservative solutions. In this paper, we propose an unsupervised deep learning-based approach that incorporates the sampling of channel uncertainties in the training process to optimize the probabilistic system performance. We introduce a model-driven learning approach that defines a new beamforming structure with trainable parameters to account for channel uncertainties. Additionally, we employ a graph neural network to efficiently infer the key beamforming parameters. We successfully apply this approach to the minimum rate quantile maximization problem subject to outage and total power constraints. Furthermore, we propose a bisection search method to address the more challenging power minimization problem with probabilistic rate constraints by leveraging the aforementioned approach. Numerical results confirm that our approach achieves non-conservative robust performance, higher data rates, greater power efficiency, and faster execution compared to state-of-the-art optimization methods.

Submitted 5 June, 2024; originally announced June 2024.

Comments: This paper has been accepted for publication in the IEEE Journal on Selected Areas in Communications, Special Issue on Advanced Optimization Theory and Algorithms for Next Generation Wireless Communication Networks

arXiv:2405.11115 [pdf]

Ptychographic non-line-of-sight imaging for depth-resolved visualization of hidden objects

Authors: Pengming Song, Qianhao Zhao, Ruihai Wang, Ninghe Liu, Yingqi Qiang, Tianbo Wang, Xincheng Zhang, Yi Zhang, Liangcai Cao, Guoan Zheng

Abstract: Non-line-of-sight (NLOS) imaging enables the visualization of objects hidden from direct view, with applications in surveillance, remote sensing, and light detection and ranging. Here, we introduce a NLOS imaging technique termed ptychographic NLOS (pNLOS), which leverages coded ptychography for depth-resolved imaging of obscured objects. Our approach involves scanning a laser spot on a wall to illuminate the hidden objects in an obscured region. The reflected wavefields from these objects then travel back to the wall, get modulated by the wall's complex-valued profile, and the resulting diffraction patterns are captured by a camera. By modulating the object wavefields, the wall surface serves the role of the coded layer as in coded ptychography. As we scan the laser spot to different positions, the reflected object wavefields on the wall translate accordingly, with the shifts varying for objects at different depths. This translational diversity enables the acquisition of a set of modulated diffraction patterns referred to as a ptychogram. By processing the ptychogram, we recover both the objects at different depths and the modulation profile of the wall surface. Experimental results demonstrate high-resolution, high-fidelity imaging of hidden objects, showcasing the potential of pNLOS for depth-aware vision beyond the direct line of sight.

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2403.17184 [pdf, ps, other]

Robust Finite-time Stabilization of Linear Systems with Limited State Quantization

Authors: Yu Zhou, Andrey Polyakov, Gang Zheng

Abstract: This paper investigates the robust asymptotic stabilization of a linear time-invariant (LTI) system by a static feedback with a static state quantization. It is shown that the controllable LTI system can be stabilized to zero in a finite time by means of a nonlinear feedback with a quantizer having a limited (finite) number of values (quantization seeds) even when all parameters of the controller and the quantizer are time-invariant. The control design is based on generalized homogeneity. A homogeneous spherical quantizer is introduced. The static homogeneous feedback is shown to be a local (or global) finite-time stabilizer for the linear system (depending on the system matrix). The tuning rules for both the quantizer and the feedback law are obtained in the form of Linear Matrix Inequalities (LMIs). The closed-loop system is proven to be robust with respect to some bounded matched and vanishing mismatched perturbations. Theoretical results are supported by numerical simulations.

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2402.00535 [pdf, ps, other]

A Low-Cost Multi-Band Waveform Security Framework in Resource-Constrained Communications

Authors: Tongyang Xu, Zhongxiang Wei, Tianhua Xu, Gan Zheng

Abstract: Traditional physical layer secure beamforming is achieved via precoding before signal transmission using channel state information (CSI). However, imperfect CSI will compromise the performance with imperfect beamforming and potential information leakage. In addition, multiple RF chains and antennas are needed to support the narrow beam generation, which complicates hardware implementation and is not suitable for resource-constrained Internet-of-Things (IoT) devices. Moreover, with the advancement of hardware and artificial intelligence (AI), low-cost and intelligent eavesdropping to wireless communications is becoming increasingly detrimental. In this paper, we propose a multi-carrier based multi-band waveform-defined security (WDS) framework, independent from CSI and RF chains, to defend against AI eavesdropping. Ideally, the continuous variations of sub-band structures lead to an infinite number of spectral features, which can potentially prevent brute-force eavesdropping. Sub-band spectral pattern information is efficiently constructed at legitimate users via a proposed chaotic sequence generator. A novel security metric, termed signal classification accuracy (SCA), is used to evaluate the security robustness under AI eavesdropping. Communication error probability and complexity are also investigated to show the reliability and practical capability of the proposed framework. Finally, compared to traditional secure beamforming techniques, the proposed multi-band WDS framework reduces power consumption by up to six times.

Submitted 1 February, 2024; originally announced February 2024.

arXiv:2401.05850 [pdf, other]

Contrastive Loss Based Frame-wise Feature disentanglement for Polyphonic Sound Event Detection

Authors: Yadong Guan, Jiqing Han, Hongwei Song, Wenjie Song, Guibin Zheng, Tieran Zheng, Yongjun He

Abstract: Overlapping sound events are ubiquitous in real-world environments, but existing end-to-end sound event detection (SED) methods still struggle to detect them effectively. A critical reason is that these methods represent overlapping events using shared and entangled frame-wise features, which degrades the feature discrimination. To solve the problem, we propose a disentangled feature learning framework to learn a category-specific representation. Specifically, we employ different projectors to learn the frame-wise features for each category. To ensure that these features do not contain information of other categories, we maximize the common information between frame-wise features within the same category and propose a frame-wise contrastive loss. In addition, considering that the labeled data used by the proposed method is limited, we propose a semi-supervised frame-wise contrastive loss that can leverage large amounts of unlabeled data to achieve feature disentanglement. The experimental results demonstrate the effectiveness of our method.

Submitted 11 January, 2024; originally announced January 2024.

Comments: Accepted by ICASSP 2024

arXiv:2310.14355 [pdf]

A global product of fine-scale urban building height based on spaceborne lidar

Authors: Xiao Ma, Guang Zheng, Chi Xu, L. Monika Moskal, Peng Gong, Qinghua Guo, Huabing Huang, Xuecao Li, Yong Pang, Cheng Wang, Huan Xie, Bailang Yu, Bo Zhao, Yuyu Zhou

Abstract: Characterizing urban environments with broad coverages and high precision is more important than ever for achieving the UN's Sustainable Development Goals (SDGs) as half of the world's populations are living in cities. Urban building height as a fundamental 3D urban structural feature has far-reaching applications. However, so far, producing readily available datasets of recent urban building heights with fine spatial resolutions and global coverages remains a challenging task. Here, we provide an up-to-date global product of urban building heights based on a fine grid size of 150 m around 2020 by combining the spaceborne lidar instrument of GEDI and multi-sourced data including remotely sensed images (i.e., Landsat-8, Sentinel-2, and Sentinel-1) and topographic data. Our results revealed that the estimated method of building height samples based on the GEDI data was effective with 0.78 of Pearson's r and 3.67 m of RMSE in comparison to the reference data. The mapping product also demonstrated good performance as indicated by its strong correlation with the reference data (i.e., Pearson's r = 0.71, RMSE = 4.60 m). Compared with the currently existing products, our global urban building height map holds the ability to provide a higher spatial resolution (i.e., 150 m) with a great level of inherent details about the spatial heterogeneity and flexibility of updating using the GEDI samples as inputs. This work will boost future urban studies across many fields including climate, environmental, ecological, and social sciences.

Submitted 22 October, 2023; originally announced October 2023.

arXiv:2310.03750 [pdf]

doi 10.1016/j.isci.2024.109416

Health diagnosis and recuperation of aged Li-ion batteries with data analytics and equivalent circuit modeling

Authors: Riko I Made, Jing Lin, Jintao Zhang, Yu Zhang, Lionel C. H. Moh, Zhaolin Liu, Ning Ding, Sing Yang Chiam, Edwin Khoo, Xuesong Yin, Guangyuan Wesley Zheng

Abstract: Battery health assessment and recuperation play a crucial role in the utilization of second-life Li-ion batteries. However, due to ambiguous aging mechanisms and lack of correlations between the recovery effects and operational states, it is challenging to accurately estimate battery health and devise a clear strategy for cell rejuvenation. This paper presents aging and reconditioning experiments of 62 commercial high-energy type lithium iron phosphate (LFP) cells, which supplement existing datasets of high-power LFP cells. The relatively large-scale data allow us to use machine learning models to predict cycle life and identify important indicators of recoverable capacity. Considering cell-to-cell inconsistencies, an average test error of $16.84\% \pm 1.87\%$ (mean absolute percentage error) for cycle life prediction is achieved by gradient boosting regressor given information from the first 80 cycles. In addition, it is found that some of the recoverable lost capacity is attributed to the lateral lithium non-uniformity within the electrodes. An equivalent circuit model is built and experimentally validated to demonstrate how such non-uniformity can be accumulated, and how it can give rise to recoverable capacity loss. SHapley Additive exPlanations (SHAP) analysis also reveals that battery operation history significantly affects the capacity recovery.

Submitted 21 September, 2023; originally announced October 2023.

Comments: 20 pages, 5 figures, 1 table

Journal ref: iScience (2024)

arXiv:2309.13611 [pdf]

Sparsity-regularized coded ptychography for robust and efficient lensless microscopy on a chip

Authors: Ninghe Liu, Qianhao Zhao, Guoan Zheng

Abstract: In ptychographic imaging, the trade-off between the number of acquisitions and the resultant imaging quality presents a complex optimization problem. Increasing the number of acquisitions typically yields reconstructions with higher spatial resolution and finer details. Conversely, a reduction in measurement frequency often compromises the quality of the reconstructed images, manifesting as increased noise and coarser details. To address this challenge, we employ sparsity priors to reformulate the ptychographic reconstruction task as a total variation regularized optimization problem. We introduce a new computational framework, termed the ptychographic proximal total-variation (PPTV) solver, designed to integrate into existing ptychography settings without necessitating hardware modifications. Through comprehensive numerical simulations, we validate that PPTV-driven coded ptychography is capable of producing highly accurate reconstructions with a minimal set of eight intensity measurements. Convergence analysis further substantiates the robustness, stability, and computational feasibility of the proposed PPTV algorithm. Experimental results obtained from optical setups unequivocally demonstrate that the PPTV algorithm facilitates high-throughput, high-resolution imaging while significantly reducing the measurement burden. These findings indicate that the PPTV algorithm has the potential to substantially mitigate the resource-intensive requirements traditionally associated with high-quality ptychographic imaging, thereby offering a pathway toward the development of more compact and efficient ptychographic microscopy systems.

Submitted 24 September, 2023; originally announced September 2023.

Comments: 15 pages, 7 figures

arXiv:2309.12783 [pdf, ps, other]

Multi-objective Optimization of Space-Air-Ground Integrated Network Slicing Relying on a Pair of Central and Distributed Learning Algorithms

Authors: Guorong Zhou, Liqiang Zhao, Gan Zheng, Shenghui Song, Jiankang Zhang, Lajos Hanzo

Abstract: As an attractive enabling technology for next-generation wireless communications, network slicing supports diverse customized services in the global space-air-ground integrated network (SAGIN) with diverse resource constraints. In this paper, we dynamically consider three typical classes of radio access network (RAN) slices, namely high-throughput slices, low-delay slices and wide-coverage slices, under the same underlying physical SAGIN. The throughput, the service delay and the coverage area of these three classes of RAN slices are jointly optimized in a non-scalar form by considering the distinct channel features and service advantages of the terrestrial, aerial and satellite components of SAGINs. A joint central and distributed multi-agent deep deterministic policy gradient (CDMADDPG) algorithm is proposed for solving the above problem to obtain the Pareto optimal solutions. The algorithm first determines the optimal virtual unmanned aerial vehicle (vUAV) positions and the inter-slice sub-channel and power sharing by relying on a centralized unit. Then it optimizes the intra-slice sub-channel and power allocation, and the virtual base station (vBS)/vUAV/virtual low earth orbit (vLEO) satellite deployment in support of three classes of slices by three separate distributed units. Simulation results verify that the proposed method approaches the Pareto-optimal exploitation of multiple RAN slices, and outperforms the benchmarkers.

Submitted 22 September, 2023; originally announced September 2023.

Comments: 19 pages, 14 figures, journal

arXiv:2309.01412 [pdf, ps, other]

Finite/fixed-time Stabilization of Linear Systems with States Quantization

Authors: Yu Zhou, Andrey Polyakov, Gang Zheng

Abstract: This paper develops a homogeneity-based approach to finite/fixed-time stabilization of linear time-invariant (LTI) system with quantized measurements. A sufficient condition for finite/fixed-time stabilization of multi-input LTI system under states quantization is derived. It is shown that a homogeneous quantized state feedback with logarithmic quantizer can guarantee finite/fixed-time stability of the closed-loop system provided that the quantization is sufficiently dense. Theoretical results are supported with numerical simulations.

Submitted 6 September, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

arXiv:2308.03658 [pdf, other]

Control-Oriented Deep Space Communications For Unmanned Space Exploration

Authors: Xinran Fang, Wei Feng, Yunfei Chen, Ning Ge, Gan Zheng

Abstract: In unmanned space exploration, the cooperation among space robots requires advanced communication techniques. In this paper, we propose a communication optimization scheme for a specific cooperation system named the "mother-daughter system". In this setup, the mother spacecraft orbits the planet, while daughter probes are distributed across the planetary surface. During each control cycle, the mother spacecraft senses the environment, computes control commands and distributes them to daughter probes for actions. They synergistically form sensing-communication-computing-control ($\mathbf{SC^3}$) loops. Given the indivisibility of the $\mathbf{SC^3}$ loop, we optimize the mother-daughter downlink for closed-loop control. The optimization objective is the linear quadratic regulator (LQR) cost, and the optimization parameters are the block length and transmit power. To solve the nonlinear mixed-integer problem, we first identify the optimal block length and then transform the power allocation problem into a tractable convex problem. We further derive the approximate closed-form solutions for the proposed scheme and two communication-oriented schemes: the max-sum rate scheme and the max-min rate scheme. On this basis, we analyze their power allocation principles. In particular, for time-insensitive control tasks, we find that the proposed scheme demonstrates equivalence to the max-min rate scheme. These findings are verified through simulations.

Submitted 27 June, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

arXiv:2306.00188 [pdf, other]

Multi-environment lifelong deep reinforcement learning for medical imaging

Authors: Guangyao Zheng, Shuhao Lai, Vladimir Braverman, Michael A. Jacobs, Vishwa S. Parekh

Abstract: Deep reinforcement learning (DRL) is increasingly being explored in medical imaging. However, the environments for medical imaging tasks are constantly evolving in terms of imaging orientations, imaging sequences, and pathologies. To that end, we developed a Lifelong DRL framework, SERIL, to continually learn new tasks in changing imaging environments without catastrophic forgetting. SERIL was developed using a selective experience replay based lifelong learning technique for the localization of five anatomical landmarks in brain MRI on a sequence of twenty-four different imaging environments. Compared to two baseline setups, MERT (multi-environment-best-case) and SERT (single-environment-worst-case), SERIL demonstrated excellent performance with an average distance of $9.90\pm7.35$ pixels from the desired landmark across all 120 tasks, compared to $10.29\pm9.07$ for MERT and $36.37\pm22.41$ for SERT ($p<0.05$), demonstrating the excellent potential for continuously learning multiple tasks across dynamically changing imaging environments.

Submitted 31 May, 2023; originally announced June 2023.

arXiv:2303.08140 [pdf, other]

doi 10.1186/s43074-023-00113-4

Digital staining in optical microscopy using deep learning -- a review

Authors: Lucas Kreiss, Shaowei Jiang, Xiang Li, Shiqi Xu, Kevin C. Zhou, Alexander Mühlberg, Kyung Chul Lee, Kanghyun Kim, Amey Chaware, Michael Ando, Laura Barisoni, Seung Ah Lee, Guoan Zheng, Kyle Lafata, Oliver Friedrich, Roarke Horstmeyer

Abstract: Until recently, conventional biochemical staining had the undisputed status as a well-established benchmark for most biomedical problems related to clinical diagnostics, fundamental research and biotechnology. Despite this role as gold standard, staining protocols face several challenges, such as a need for extensive, manual processing of samples, substantial time delays, altered tissue homeostasis, limited choice of contrast agents for a given sample, 2D imaging instead of 3D tomography and many more. Label-free optical technologies, on the other hand, do not rely on exogenous and artificial markers, by exploiting intrinsic optical contrast mechanisms, where the specificity is typically less obvious to the human observer. Over the past few years, digital staining has emerged as a promising concept to use modern deep learning for the translation from optical contrast to established biochemical contrast of actual stainings. In this review article, we provide an in-depth analysis of the current state-of-the-art in this field, suggest methods of good practice, identify pitfalls and challenges and postulate promising advances towards potential future implementations and applications.

Submitted 14 March, 2023; originally announced March 2023.

Comments: Review article, 4 main Figures, 3 Tables, 2 supplementary figures

arXiv:2303.06783 [pdf, other]

Asynchronous Decentralized Federated Lifelong Learning for Landmark Localization in Medical Imaging

Authors: Guangyao Zheng, Michael A. Jacobs, Vladimir Braverman, Vishwa S. Parekh

Abstract: Federated learning is a recent development in the machine learning area that allows a system of devices to train on one or more tasks without sharing their data to a single location or device. However, this framework still requires a centralized global model to consolidate individual models into one, and the devices train synchronously, both of which can be potential bottlenecks for using federated learning. In this paper, we propose a novel asynchronous decentralized federated lifelong learning (ADFLL) method that inherits the merits of federated learning and can train on multiple tasks simultaneously without the need for a central node or synchronous training, thus overcoming the potential drawbacks of conventional federated learning. We demonstrate excellent performance on the brain tumor segmentation (BRATS) dataset for localizing the left ventricle on multiple image sequences and image orientation. Our framework allows agents to achieve the best performance with a mean distance error of 7.81, better than the conventional all-knowing agent's mean distance error of 11.78, and significantly (p=0.01) better than a conventional lifelong learning agent with a distance error of 15.17 after eight rounds of training. In addition, all ADFLL agents have comparable or better performance than a conventional LL agent. In conclusion, we developed an ADFLL framework with excellent performance and speed-up compared to conventional RL agents.

Submitted 10 January, 2024; v1 submitted 12 March, 2023; originally announced March 2023.

arXiv:2302.10561 [pdf, ps, other]

doi 10.1109/LWC.2023.3337709

Model-free Optimization and Experimental Validation of RIS-assisted Wireless Communications under Rich Multipath Fading

Authors: Tianrui Chen, Minglei You, Yangyishi Zhang, Gan Zheng, Jean Baptiste Gros, Geoffroy Lerosey, Youssef Nasser, Fraser Burton, Gabriele Gradoni

Abstract: Reconfigurable intelligent surface (RIS) devices have emerged as an effective way to control the propagation channels for enhancing the end-users' performance. However, RIS optimization involves configuring the radio frequency response of a large number of radiating elements, which is challenging in real-world applications due to high computational complexity. In this paper, a model-free cross-entropy (CE) algorithm is proposed to optimize the binary RIS configuration for improving the signal-to-noise ratio (SNR) at the receiver. One key advantage of the proposed method is that it only requires system performance indicators, e.g., the received SNR, without the need for channel models or channel state information. Both simulations and experiments are conducted to evaluate the performance of the proposed CE algorithm. This study provides an experimental demonstration of the channel hardening effect in a multi-antenna RIS-assisted wireless system under rich multipath fading.

Submitted 15 February, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

Comments: accepted by IEEE Wireless Communications Letters

arXiv:2302.06294 [pdf, other]

doi 10.1016/j.media.2023.102888

CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

Authors: Chinedu Innocent Nwoye, Tong Yu, Saurav Sharma, Aditya Murali, Deepak Alapatt, Armine Vardazaryan, Kun Yuan, Jonas Hajek, Wolfgang Reiter, Amine Yamlahi, Finn-Henri Smidt, Xiaoyang Zou, Guoyan Zheng, Bruno Oliveira, Helena R. Torres, Satoshi Kondo, Satoshi Kasai, Felix Holm, Ege Özsoy, Shuangchun Gui, Han Li, Sista Raviteja, Rachana Sathish, Pranav Poudel, Binod Bhattarai, et al. (24 additional authors not shown)

Abstract: Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier efforts and the CholecTriplet challenge introduced in 2021 have put together techniques aimed at recognizing these triplets from surgical footage. Estimating also the spatial locations of the triplets would offer a more precise intraoperative context-aware decision support for computer-assisted intervention. This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection. It includes weakly-supervised bounding box localization of every visible surgical instrument (or tool), as the key actors, and the modeling of each tool-activity in the form of <instrument, verb, target> triplet. The paper describes a baseline method and 10 new deep learning algorithms presented at the challenge to solve the task. It also provides thorough methodological comparisons of the methods, an in-depth analysis of the obtained results across multiple metrics, visual and procedural challenges; their significance, and useful insights for future research directions and applications in surgery.

Submitted 14 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

Comments: MICCAI EndoVis CholecTriplet2022 challenge report. Published at Elsevier journal of Medical Image Analysis. 25 pages, 15 figures, 8 tables

Journal ref: Medical Image Analysis, Volume 89, 2023, 102888, ISSN 1361-8415

arXiv:2302.01035 [pdf, other]

doi 10.1109/TVT.2023.3238108

Deep Learning Based Predictive Beamforming Design

Authors: Juping Zhang, Gan Zheng, Yangyishi Zhang, Ioannis Krikidis, Kai-Kit Wong

Abstract: This paper investigates deep learning techniques to predict transmit beamforming based on only historical channel data without current channel information in the multiuser multiple-input-single-output downlink. This will significantly reduce the channel estimation overhead and improve the spectrum efficiency especially in high-mobility vehicular communications. Specifically, we propose a joint learning framework that incorporates channel prediction and power optimization, and produces prediction for transmit beamforming directly. In addition, we propose to use the attention mechanism in the Long Short-Term Memory Recurrent Neural Networks to improve the accuracy of channel prediction. Simulation results using both a simple autoregressive process model and the more realistic 3GPP spatial channel model verify that our proposed predictive beamforming scheme can significantly improve the effective spectrum efficiency compared to traditional channel estimation and the method that separately predicts channel and then optimizes beamforming.

Submitted 2 February, 2023; originally announced February 2023.

Comments: Accepted in IEEE Transactions on Vehicular Technology

arXiv:2210.01530 [pdf, ps, other]

Generalized Homogeneous Rigid-Body Attitude Control

Authors: Yu Zhou, Andrey Polyakov, Gang Zheng

Abstract: The attitude tracking problem for a fully actuated rigid body in 3D is studied using an impulsive system model based on the Lie algebra so(3). A nonlinear homogeneous controller is designed to globally track a smooth attitude trajectory in a finite or a (nearly) fixed time. A global settling time estimate is obtained, which is easily adjustable by tuning the homogeneity degree. The local input-to-state stability is proven. Simulations illustrating the performance of the proposed algorithm are presented.

Submitted 3 July, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

Comments: Global homogeneous attitude finite/fixed-time attitude controller. Strict Lyapunov and settling time estimate is provided

arXiv:2208.05772 [pdf, other]

KiPA22 Report: U-Net with Contour Regularization for Renal Structures Segmentation

Authors: Kangqing Ye, Peng Liu, Xiaoyang Zou, Qin Zhou, Guoyan Zheng

Abstract: Three-dimensional (3D) integrated renal structures (IRS) segmentation is important in clinical practice. With the advancement of deep learning techniques, many powerful frameworks focusing on medical image segmentation have been proposed. In this challenge, we utilized the nnU-Net framework, which is the state-of-the-art method for medical image segmentation. To reduce outlier predictions for the tumor label, we combine a contour regularization (CR) loss on the tumor label with Dice loss and cross-entropy loss.

Submitted 6 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

arXiv:2112.08133 [pdf]

doi 10.1016/j.bios.2021.113699

Ptychographic sensor for large-scale lensless microbial monitoring with high spatiotemporal resolution

Authors: Shaowei Jiang, Chengfei Guo, Zichao Bian, Ruihai Wang, Jiakai Zhu, Pengming Song, Patrick Hu, Derek Hu, Zibang Zhang, Kazunori Hoshino, Bin Feng, Guoan Zheng

Abstract: Traditional microbial detection methods often rely on the overall property of microbial cultures and cannot resolve individual growth events at high spatiotemporal resolution. As a result, they require bacteria to grow to confluence and then interpret the results. Here, we demonstrate the application of an integrated ptychographic sensor for lensless cytometric analysis of microbial cultures over a large scale and with high spatiotemporal resolution. The reported device can be placed within a regular incubator or used as a standalone incubating unit for long-term microbial monitoring. For longitudinal study where massive data are acquired at sequential time points, we report a new temporal-similarity constraint to increase the temporal resolution of ptychographic reconstruction by 7-fold. With this strategy, the reported device achieves a centimeter-scale field of view, a half-pitch spatial resolution of 488 nm, and a temporal resolution of 15-second intervals. For the first time, we report the direct observation of bacterial growth in a 15-second interval by tracking the phase wraps of the recovered images, with high phase sensitivity like that in interferometric measurements. We also characterize cell growth via longitudinal dry mass measurement and perform rapid bacterial detection at low concentrations. For drug-screening application, we demonstrate proof-of-concept antibiotic susceptibility testing and perform single-cell analysis of antibiotic-induced filamentation.
The combination of high phase sensitivity, high spatiotemporal resolution, and large field of view is unique among existing microscopy techniques. As a quantitative and miniaturized platform, it can improve studies with microorganisms and other biospecimens in resource-limited settings.

Submitted 15 December, 2021; originally announced December 2021.

Comments: 18 pages, 6 figures

arXiv:2111.10689 [pdf, ps, other]

Design and Analysis of SWIPT with Safety Constraints

Authors: Constantinos Psomas, Minglei You, Kai Liang, Gan Zheng, Ioannis Krikidis

Abstract: Simultaneous wireless information and power transfer (SWIPT) has long been proposed as a key solution for charging and communicating with low-cost and low-power devices. However, the employment of radio frequency (RF) signals for information/power transfer needs to comply with international health and safety regulations. In this paper, we provide a complete framework for the design and analysis of far-field SWIPT under safety constraints. In particular, we deal with two RF exposure regulations, namely, the specific absorption rate (SAR) and the maximum permissible exposure (MPE). The state of the art regarding SAR and MPE is outlined together with a description as to how these can be modeled in the context of communication networks. We propose a deep learning approach for the design of robust beamforming subject to specific information, energy harvesting and SAR constraints. Furthermore, we present a thorough analytical study for the performance of large-scale SWIPT systems, in terms of information and energy coverage under MPE constraints. This work provides insights with regard to the optimal SWIPT design as well as the potential from the proper development of SWIPT systems under health and safety restrictions.

Submitted 20 November, 2021; originally announced November 2021.

Comments: Proceedings of the IEEE

arXiv:2110.01989 [pdf]

doi 10.1364/OL.437832

High-throughput lensless whole slide imaging via continuous height-varying modulation of tilted sensor

Authors: Shaowei Jiang, Chengfei Guo, Patrick Hu, Derek Hu, Pengming Song, Tianbo Wang, Zichao Bian, Zibang Zhang, Guoan Zheng

Abstract: We report a new lensless microscopy configuration by integrating the concepts of transverse translational ptychography and defocus multi-height phase retrieval. In this approach, we place a tilted image sensor under the specimen for linearly-increasing phase modulation along one lateral direction. Similar to the operation of ptychography, we laterally translate the specimen and acquire the diffraction images for reconstruction. Since the axial distance between the specimen and the sensor varies at different lateral positions, laterally translating the specimen effectively introduces defocus multi-height measurements while eliminating axial scanning. Lateral translation further introduces sub-pixel shift for pixel super-resolution imaging and naturally expands the field of view for rapid whole slide imaging. We show that the equivalent height variation can be precisely estimated from the lateral shift of the specimen, thereby addressing the challenge of precise axial positioning in conventional multi-height phase retrieval. Using a sensor with a 1.67-micron pixel size, our low-cost and field-portable prototype can resolve 690-nm linewidth on the resolution target. We show that a whole slide image of a blood smear with a 120-mm^2 field of view can be acquired in 18 seconds. We also demonstrate accurate automatic white blood cell counting from the recovered image. The reported approach may provide a turnkey solution for addressing point-of-care- and telemedicine-related challenges.

Submitted 28 September, 2021; originally announced October 2021.

arXiv:2109.09161 [pdf, other]

Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition

Authors: Guolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang, Liang Lin

Abstract: Unifying acoustic and linguistic representation learning has become increasingly crucial to transfer the knowledge learned on the abundance of high-resource language data for low-resource speech recognition. Existing approaches simply cascade pre-trained acoustic and language models to learn the transfer from speech to text. However, how to solve the representation discrepancy of speech and text is unexplored, which hinders the utilization of acoustic and linguistic information. Moreover, previous works simply replace the embedding layer of the pre-trained language model with the acoustic features, which may cause the catastrophic forgetting problem. In this work, we introduce Wav-BERT, a cooperative acoustic and linguistic representation learning method to fuse and utilize the contextual information of speech and text. Specifically, we unify a pre-trained acoustic model (wav2vec 2.0) and a language model (BERT) into an end-to-end trainable framework. A Representation Aggregation Module is designed to aggregate acoustic and linguistic representation, and an Embedding Attention Module is introduced to incorporate acoustic information into BERT, which can effectively facilitate the cooperation of two pre-trained models and thus boost the representation learning. Extensive experiments show that our Wav-BERT significantly outperforms the existing approaches and achieves state-of-the-art performance on low-resource speech recognition.

Submitted 9 October, 2021; v1 submitted 19 September, 2021; originally announced September 2021.

arXiv:2109.09086 [pdf, other]

doi 10.1109/TWC.2021.3094162

Embedding Model Based Fast Meta Learning for Downlink Beamforming Adaptation

Authors: Juping Zhang, Yi Yuan, Gan Zheng, Ioannis Krikidis, Kai-Kit Wong

Abstract: This paper studies fast adaptive beamforming for the multiuser multiple-input single-output downlink. Existing deep learning-based approaches assume that training and testing channels follow the same distribution, which causes task mismatch when the testing environment changes. Although meta learning can deal with the task mismatch, it relies on labelled data and incurs high complexity in the pre-training and fine-tuning stages. We propose a simple yet effective adaptive framework to solve the mismatch issue, which trains an embedding model as a transferable feature extractor, followed by fitting the support vector regression. Compared to the existing meta learning algorithm, our method does not necessarily need labelled data in the pre-training and does not need fine-tuning of the pre-trained model in the adaptation. The effectiveness of the proposed method is verified through two well-known applications, i.e., the signal to interference plus noise ratio balancing problem and the sum rate maximization problem. Furthermore, we extend our proposed method to online scenarios in non-stationary environments. Simulation results demonstrate the advantages of the proposed algorithm in terms of both performance and complexity. The proposed framework can also be applied to general radio resource management problems.

Submitted 19 September, 2021; originally announced September 2021.

Comments: Accepted in IEEE Transactions on Wireless Communications

arXiv:2109.07819 [pdf, other]

Model-driven Learning for Generic MIMO Downlink Beamforming With Uplink Channel Information

Authors: Juping Zhang, Minglei You, Gan Zheng, Ioannis Krikidis, Liqiang Zhao

Abstract: Accurate downlink channel information is crucial to the beamforming design, but it is difficult to obtain in practice. This paper investigates a deep learning-based optimization approach of the downlink beamforming to maximize the system sum rate, when only the uplink channel information is available. Our main contribution is to propose a model-driven learning technique that exploits the structure of the optimal downlink beamforming to design an effective hybrid learning strategy with the aim to maximize the sum rate performance. This is achieved by jointly considering the learning performance of the downlink channel, the power and the sum rate in the training stage. The proposed approach applies to generic cases in which the uplink channel information is available, but its relation to the downlink channel is unknown and does not require an explicit downlink channel estimation. We further extend the developed technique to massive multiple-input multiple-output scenarios and achieve a distributed learning strategy for multicell systems without an inter-cell signalling overhead. Simulation results verify that our proposed method provides performance close to that of state-of-the-art numerical algorithms with perfect downlink channel information and significantly outperforms existing data-driven methods in terms of the sum rate.

Submitted 16 September, 2021; originally announced September 2021.

Comments: Accepted in IEEE Transactions on Wireless Communications

arXiv:2107.08962 [pdf, other]

Frequency-Supervised MR-to-CT Image Synthesis

Authors: Zenglin Shi, Pascal Mettes, Guoyan Zheng, Cees Snoek

Abstract: This paper strives to generate a synthetic computed tomography (CT) image from a magnetic resonance (MR) image. The synthetic CT image is valuable for radiotherapy planning when only an MR image is available. Recent approaches have made large strides in solving this challenging synthesis problem with convolutional neural networks that learn a mapping from MR inputs to CT outputs. In this paper, we find that all existing approaches share a common limitation: reconstruction breaks down in and around the high-frequency parts of CT images. To address this common limitation, we introduce frequency-supervised deep networks to explicitly enhance high-frequency MR-to-CT image reconstruction. We propose a frequency decomposition layer that learns to decompose predicted CT outputs into low- and high-frequency components, and we introduce a refinement module to improve high-frequency reconstruction through high-frequency adversarial learning. Experimental results on a new dataset with 45 pairs of 3D MR-CT brain images show the effectiveness and potential of the proposed approach. Code is available at https://github.com/shizenglin/Frequency-Supervised-MR-to-CT-Image-Synthesis.

Submitted 19 July, 2021; originally announced July 2021.

Comments: MICCAI workshop on Deep Generative Models, 2021

arXiv:2105.14746 [pdf, other]

Complex-domain super-resolution imaging with distributed optimization

Authors: Xuyang Chang, Liheng Bian, Shaowei Jiang, Guoan Zheng, Jun Zhang

Abstract: Complex-domain imaging has emerged as a valuable technique for investigating weakly scattering samples. However, because detectors favor a large pixel size for high throughput, the resulting resolution limitation impedes its further development. In this work, we report a lensless on-chip complex-domain imaging system, together with a distributed-optimization-based pixel super-resolution technique (DO-PSR). The system employs diffuser shifting to realize phase modulation and increase observation diversity. The corresponding DO-PSR technique derives an alternating projection operator and an enhancing neural network to tackle the measurement fidelity and statistical prior regularization subproblems. Extensive experiments show that the system outperforms the existing techniques by as much as 11 dB in PSNR, with one-order-of-magnitude higher cell counting precision.

Submitted 19 October, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

arXiv:2012.11896 [pdf, other]

Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition

Authors: Yubei Xiao, Ke Gong, Pan Zhou, Guolin Zheng, Xiaodan Liang, Liang Lin

Abstract: Low-resource automatic speech recognition (ASR) is challenging, as the low-resource target language data cannot well train an ASR model. To solve this issue, meta-learning formulates ASR for each source language into many small ASR tasks and meta-learns a model initialization on all tasks from different source languages to achieve fast adaptation on unseen target languages. However, for different source languages, the quantity and difficulty vary greatly because of their different data scales and diverse phonological systems, which leads to task-quantity and task-difficulty imbalance issues and thus a failure of multilingual meta-learning ASR (MML-ASR). In this work, we solve this problem by developing a novel adversarial meta sampling (AMS) approach to improve MML-ASR. When sampling tasks in MML-ASR, AMS adaptively determines the task sampling probability for each source language. Specifically, for each source language, if the query loss is large, it means that its tasks are not well sampled to train the ASR model in terms of quantity and difficulty and thus should be sampled more frequently for extra learning. Inspired by this fact, we feed the historical task query loss of all source language domains into a network to learn a task sampling policy for adversarially increasing the current query loss of MML-ASR. Thus, the learnt task sampling policy can master the learning situation of each language and predict a good task sampling probability for each language for more effective learning.
Finally, experimental results on two multilingual datasets show significant performance improvement when applying our AMS on MML-ASR, and also demonstrate the applicability of AMS to other low-resource speech tasks and transfer learning ASR approaches.

Submitted 12 April, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

Comments: accepted in AAAI2021

arXiv:2011.11879 [pdf]

Blind deblurring for microscopic pathology images using deep learning networks

Authors: Cheng Jiang, Jun Liao, Pei Dong, Zhaoxuan Ma, De Cai, Guoan Zheng, Yueping Liu, Hong Bu, Jianhua Yao

Abstract: Artificial Intelligence (AI)-powered pathology is a revolutionary step in the world of digital pathology and shows great promise to increase both diagnosis accuracy and efficiency. However, defocus and motion blur can obscure tissue or cell characteristics, hence compromising AI algorithms' accuracy and robustness in analyzing the images. In this paper, we demonstrate a deep-learning-based approach that can alleviate the defocus and motion blur of a microscopic image and output a sharper and cleaner image with retrieved fine details without prior knowledge of the blur type, blur extent and pathological stain. In this approach, a deep learning classifier is first trained to identify the image blur type. Then, two encoder-decoder networks are trained and used alone or in combination to deblur the input image. It is an end-to-end approach and introduces no corrugated artifacts as traditional blind deconvolution methods do. We test our approach on different types of pathology specimens and demonstrate great performance on image blur correction and the subsequent improvement on the diagnosis outcome of AI algorithms.

Submitted 23 November, 2020; originally announced November 2020.

arXiv:2009.09163 [pdf, other]

Improving Spiking Sparse Recovery via Non-Convex Penalties

Authors: Xiang Zhang, Lei Yu, Gang Zheng

Abstract: Compared with digital methods, sparse recovery based on spiking neural networks has great advantages like high computational efficiency and low power consumption. However, current spiking algorithms cannot guarantee more accurate estimates since they are usually designed to solve the classical optimization with convex penalties, especially the $\ell_{1}$-norm. In fact, convex penalties are observed to underestimate the true solution in practice, while non-convex ones can avoid the underestimation. Inspired by this, we propose an adaptive version of the spiking sparse recovery algorithm to solve the non-convex regularized optimization, and provide an analysis of its global asymptotic convergence. Experiments show that the accuracy is greatly improved under different adaptive schemes.

Submitted 19 September, 2020; originally announced September 2020.

arXiv:2008.06916 [pdf]

doi 10.1364/OL.400244

Virtual brightfield and fluorescence staining for Fourier ptychography via unsupervised deep learning

Authors: Ruihai Wang, Pengming Song, Shaowei Jiang, Chenggang Yan, Jiakai Zhu, Chengfei Guo, Zichao Bian, Tianbo Wang, Guoan Zheng

Abstract: Fourier ptychographic microscopy (FPM) is a computational approach geared towards creating high-resolution and large field-of-view images without mechanical scanning. To acquire color images of histology slides, it often requires sequential acquisitions with red, green, and blue illuminations. The color reconstructions often suffer from coherent artifacts that are not present in regular incoherent microscopy images. As a result, it remains a challenge to employ FPM for digital pathology applications, where resolution and color accuracy are of critical importance. Here we report a deep learning approach for performing unsupervised image-to-image translation of FPM reconstructions. A cycle-consistent adversarial network with multiscale structure similarity loss is trained to perform virtual brightfield and fluorescence staining of the recovered FPM images. In the training stage, we feed the network with two sets of unpaired images: 1) monochromatic FPM recovery, and 2) color or fluorescence images captured using a regular microscope. In the inference stage, the network takes the FPM input and outputs a virtually stained image with reduced coherent artifacts and improved image quality. We test the approach on various samples with different staining protocols. High-quality color and fluorescence reconstructions validate its effectiveness.

Submitted 16 August, 2020; originally announced August 2020.

arXiv:2006.08610 [pdf]

Autofocusing technologies for whole slide imaging and automated microscopy

Authors: Zichao Bian, Chengfei Guo, Shaowei Jiang, Jiakai Zhu, Ruihai Wang, Pengming Song, Zibang Zhang, Kazunori Hoshino, Guoan Zheng

Abstract: Whole slide imaging (WSI) has moved digital pathology closer to diagnostic practice in recent years. Due to the inherent tissue topography variability, accurate autofocusing remains a critical challenge for WSI and automated microscopy systems. The traditional focus map surveying method is limited in its ability to acquire a high degree of focus points while still maintaining high throughput. Real-time approaches decouple image acquisition from focusing, thus allowing for rapid scanning while maintaining continuous accurate focus. This work reviews the traditional focus map approach and discusses the choice of focus measure for focal plane determination. It also discusses various real-time autofocusing approaches including reflective-based triangulation, confocal pinhole detection, low-coherence interferometry, tilted sensor approach, independent dual sensor scanning, beam splitter array, phase detection, dual-LED illumination, and deep-learning approaches. The technical concepts, merits, and limitations of these methods are explained and compared to those of a traditional WSI system. This review may provide new insights for the development of high-throughput automated microscopy imaging systems that can be made broadly available and utilizable without loss of capacity.

Submitted 15 August, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

arXiv:2006.08114 [pdf]

doi 10.1364/OL.394923

Super-resolved multispectral lensless microscopy via angle-tilted, wavelength-multiplexed ptychographic modulation

Authors: Pengming Song, Ruihai Wang, Jiakai Zhu, Tianbo Wang, Zichao Bian, Zibang Zhang, Kazunori Hoshino, Michael Murphy, Shaowei Jiang, Chengfei Guo, Guoan Zheng

Abstract: We report an angle-tilted, wavelength-multiplexed ptychographic modulation approach for multispectral lensless on-chip microscopy. In this approach, we illuminate the specimen with lights at 5 wavelengths simultaneously. A prism is added at the illumination path for spectral dispersion. Lightwaves at different wavelengths, thus, hit the specimen at slightly different incident angles, breaking the ambiguities in mixed state ptychographic reconstruction. At the detection path, we place a thin diffuser in-between the specimen and the monochromatic image sensor for encoding the spectral information into 2D intensity measurements. By scanning the sample to different x-y positions, we acquire a sequence of monochromatic images for reconstructing the 5 complex object profiles at the 5 wavelengths. An up-sampling procedure is integrated into the recovery process to bypass the resolution limit imposed by the imager pixel size. We demonstrate a half-pitch resolution of 0.55 microns using an image sensor with 1.85-micron pixel size. We also demonstrate quantitative and high-quality multispectral reconstructions of stained tissue sections for digital pathology applications.

Submitted 14 June, 2020; originally announced June 2020.

arXiv:2003.14237 [pdf]

doi 10.1364/OL.417039

Single-pixel coherent diffraction imaging

Authors: Meng Li, Liheng Bian, Guoan Zheng, Andrew Maiden, Yang Liu, Yiming Li, Qionghai Dai, Jun Zhang

Abstract: Complex-field imaging is indispensable for numerous applications at wavelengths from X-ray to THz, with amplitude describing transmittance (or reflectivity) and phase revealing intrinsic structure of the target object. Coherent diffraction imaging (CDI) employs iterative phase retrieval algorithms to process diffraction measurements and is the predominant non-interferometric method to image complex fields. However, the working spectrum of CDI is quite narrow, because the diffraction measurements on which it relies require dense array detection with ultra-high dynamic range. Here we report a single-pixel CDI technique that works for a wide waveband. A single-pixel detector instead of an array sensor is employed in the far field for detection. It repeatedly records the DC-only component of the diffracted wavefront scattered from an object as it is illuminated by a sequence of binary modulation patterns. This decreases the measurements' dynamic range by several orders of magnitude. We employ an efficient single-pixel phase-retrieval algorithm to jointly recover the object's 2D amplitude and phase maps from the 1D intensity-only measurements. No a priori object information is needed in the recovery process. We validate the technique's quantitative phase imaging nature using both calibrated phase objects and biological samples, and demonstrate its wide working spectrum with both 488-nm visible light and 980-nm near-infrared light. Our approach paves the way for complex-field imaging in a wider waveband where 2D detector arrays are not available, with broad applications in life and material sciences.

Submitted 29 March, 2020; originally announced March 2020.

arXiv:2002.12589 [pdf, other]

doi 10.1109/TWC.2020.2977340

Deep Learning Enabled Optimization of Downlink Beamforming Under Per-Antenna Power Constraints: Algorithms and Experimental Demonstration

Authors: Ju** Zhang, Wenchao Xia, Minglei You, Gan Zheng, Sangarapillai Lambotharan, Kai-Kit Wong

Abstract: This paper studies fast downlink beamforming algorithms using deep learning in multiuser multiple-input-single-output systems where each transmit antenna at the base station has its own power constraint. We focus on the signal-to-interference-plus-noise ratio (SINR) balancing problem which is quasi-convex but there is no efficient solution available. We first design a fast subgradient algorithm that can achieve near-optimal solution with reduced complexity. We then propose a deep neural network structure to learn the optimal beamforming based on convolutional networks and exploitation of the duality of the original problem. Two strategies of learning various dual variables are investigated with different accuracies, and the corresponding recovery of the original solution is facilitated by the subgradient algorithm. We also develop a generalization method of the proposed algorithms so that they can adapt to the varying number of users and antennas without re-training. We carry out intensive numerical simulations and testbed experiments to evaluate the performance of the proposed algorithms. Results show that the proposed algorithms achieve close to optimal solution in simulations with perfect channel information and outperform the alleged theoretically optimal solution in experiments, illustrating a better performance-complexity tradeoff than existing schemes.

Submitted 28 February, 2020; originally announced February 2020.

Comments: This paper was accepted for publication in IEEE Transactions on Wireless Communications

arXiv:2001.05277 [pdf, other]

doi 10.1109/MWC.001.1900239

Model-Driven Beamforming Neural Networks

Authors: Wenchao Xia, Gan Zheng, Kai-Kit Wong, Hongbo Zhu

Abstract: Beamforming is evidently a core technology in recent generations of mobile communication networks. Nevertheless, an iterative process is typically required to optimize the parameters, making it ill-placed for real-time implementation due to high complexity and computational delay. Heuristic solutions such as zero-forcing (ZF) are simpler but at the expense of performance loss. Alternatively, deep learning (DL) is well understood to be a generalizing technique that can deliver promising results for a wide range of applications at much lower complexity if it is sufficiently trained. As a consequence, DL may present itself as an attractive solution to beamforming. To exploit DL, this article introduces general data- and model-driven beamforming neural networks (BNNs), presents various possible learning strategies, and also discusses complexity reduction for the DL-based BNNs. We also offer enhancement methods such as training-set augmentation and transfer learning in order to improve the generality of BNNs, accompanied by computer simulation results and testbed results showing the performance of such BNN solutions.

Submitted 15 January, 2020; originally announced January 2020.

arXiv:1912.03446 [pdf]

doi 10.1364/OL.45.000260

OpenWSI: a low-cost, high-throughput whole slide imaging system via single-frame autofocusing and open-source hardware

Authors: Chengfei Guo, Zichao Bian, Shaowei Jiang, Michael Murphy, Jiakai Zhu, Ruihai Wang, Pengming Song, Xiaopeng Shao, Yongbing Zhang, Guoan Zheng

Abstract: Recent advancements in whole slide imaging (WSI) have moved pathology closer to digital practice. Existing systems require precise mechanical control and the cost is prohibitive for most individual pathologists. Here we report a low-cost and high-throughput WSI system termed OpenWSI. The reported system is built using off-the-shelf components including a programmable LED array, a photographic lens, and a low-cost computer numerical control (CNC) router. Different from conventional WSI platforms, our system performs real-time single-frame autofocusing using color-multiplexed illumination. For axial positioning control, we perform coarse adjustment using the CNC router and precise adjustment using the ultrasonic motor ring in the photographic lens. By using a 20X objective lens, we show that the OpenWSI system has a resolution of ~0.7 microns. It can acquire whole slide images of a 225-mm^2 region in ~2 mins, with throughput comparable to existing high-end platforms. The reported system offers a turnkey solution to transform the high-end WSI platforms into one that can be made broadly available and utilizable without loss of capacity.

Submitted 7 December, 2019; originally announced December 2019.

Journal ref: Optics Letters Vol. 45, Issue 1, pp. 260-263 (2020)

arXiv:1912.01974 [pdf]

doi 10.1364/OE.392370

Image-free real-time classification of fast moving objects using 'learned' spatial light modulation and a single-pixel detector

Authors: Zibang Zhang, Xiang Li, Manhong Yao, Shujun Zheng, Guoan Zheng, **gang Zhong

Abstract: Objects classification generally relies on image acquisition and analysis. Real-time classification of high-speed moving objects is challenging, as both high temporal resolution in image acquisition and low computational complexity in objects classification algorithms are required. Here we propose and experimentally demonstrate an approach for real-time moving objects classification without image acquisition. As objects classification algorithms rely on the feature information of objects, we propose to use spatial light modulation to acquire the feature information directly rather than performing image acquisition followed by features extraction. A convolutional neural network is designed and trained to learn the spatial features of the target objects. The trained network can generate structured patterns for spatial light modulation. Using the resulting structured patterns for spatial light modulation, the feature information of target objects can be compressively encoded into a short light intensity sequence. The resulting one-dimensional signal is collected by a single-pixel detector and fed to the convolutional neural network for objects classification. As experimentally demonstrated, the proposed approach can achieve accurate and real-time classification of fast moving objects. The proposed method has potential applications in the fields where fast moving objects classification in real time and for long duration is required.

Submitted 4 December, 2019; v1 submitted 2 December, 2019; originally announced December 2019.

arXiv:1910.03031 [pdf]

doi 10.1039/C9LC01027K

Wide-field, high-resolution lensless on-chip microscopy via near-field blind ptychographic modulation

Authors: Shaowei Jiang, Jiakai Zhu, Pengming Song, Chengfei Guo, Zichao Bian, Ruihai Wang, Yikun Huang, Shiyao Wang, He Zhang, Guoan Zheng

Abstract: We report a novel lensless on-chip microscopy platform based on near-field blind ptychographic modulation. In this platform, we place a thin diffuser in between the object and the image sensor for light wave modulation. By blindly scanning the unknown diffuser to different x-y positions, we acquire a sequence of modulated intensity images for quantitative object recovery. Different from previous ptychographic implementations, we employ a unit magnification configuration with a Fresnel number of ~50,000, which is orders of magnitude higher than previous ptychographic setups. The unit magnification configuration allows us to have the entire sensor area, 6.4 mm by 4.6 mm, as the imaging field of view. The ultra-high Fresnel number enables us to directly recover the positional shift of the diffuser in the phase retrieval process, addressing the positioning accuracy issue plagued in regular ptychographic experiments. In our implementation, we use a low-cost, DIY scanning stage to perform blind diffuser modulation. Precise mechanical scanning that is critical in conventional ptychography experiments is no longer needed in our setup. We further employ an up-sampling phase retrieval scheme to bypass the resolution limit set by the imager pixel size and demonstrate a half-pitch resolution of 0.78 micron. We validate the imaging performance via in vitro cell cultures, transparent and stained tissue sections, and a thick biological sample. We show that the recovered quantitative phase map can be used to perform effective cell segmentation of the dense yeast culture. We also demonstrate 3D digital refocusing of the thick biological sample based on the recovered wavefront. The reported platform provides a cost-effective and turnkey solution for large field-of-view, high-resolution, and quantitative on-chip microscopy.

Submitted 11 February, 2020; v1 submitted 4 October, 2019; originally announced October 2019.

arXiv:1908.11056 [pdf, other]

Targeted Source Detection for Environmental Data

Authors: Guanjie Zheng, Mengqi Liu, Tao Wen, Hongjian Wang, Huaxiu Yao, Susan L. Brantley, Zhenhui Li

Abstract: In the face of growing needs for water and energy, a fundamental understanding of the environmental impacts of human activities becomes critical for managing water and energy resources, remedying water pollution, and making regulatory policy wisely. Among activities that impact the environment, oil and gas production, wastewater transport, and urbanization are included. In addition to the occurrence of anthropogenic contamination, the presence of some contaminants (e.g., methane, salt, and sulfate) of natural origin is not uncommon. Therefore, scientists sometimes find it difficult to identify the sources of contaminants in the coupled natural and human systems. In this paper, we propose a technique to simultaneously conduct source detection and prediction, which outperforms other approaches in the interdisciplinary case study of the identification of potential groundwater contamination within a region of high-density shale gas development.

Submitted 29 August, 2019; originally announced August 2019.

Comments: 8 pages, 4 figures, 1 table

arXiv:1908.05761 [pdf]

doi 10.1088/1361-6463/ab489d

Ptychographic modulation engine (PME): a low-cost DIY microscope add-on for coherent super-resolution imaging

Authors: Zichao Bian, Shaowei Jiang, Pengming Song, He Zhang, Pouria Hoveida, Kazunori Hoshino, Guoan Zheng

Abstract: Imaging of biological cells and tissues often relies on fluorescent labels, which offer high contrast with molecular specificity. The use of exogenous labeling agents, however, may alter the normal physiology of the bio-specimens. Complementary to the established fluorescence microscopy, label-free quantitative phase imaging provides an objective morphological measurement tool for bio-specimens and is free of variability introduced by contrast agents. Here we report a simple and low-cost microscope add-on, termed Ptychographic Modulation Engine (PME), for super-resolution quantitative phase imaging. In this microscope add-on module, we attach a diffuser to a 3D-printed holder that can be mechanically moved to different x-y positions. We then use two vibrational motors to introduce random positional shifts to the diffuser. The add-on module can be placed between the objective lens and the specimen in most existing microscope platforms. Thanks to the diffuser modulation process, the otherwise inaccessible high-resolution object information can now be encoded into the captured images. In the ptychographic phase retrieval process, we jointly recover the complex object wavefront, the complex diffuser profile, and the unknown positional shifts of the diffuser. We demonstrate a 4-fold resolution gain over the diffraction limit of the employed 2X objective lens. We also test our approach for in-vivo cell imaging, where we are able to adjust the focus after the data has been captured. The reported microscope add-on provides a turnkey solution for super-resolution quantitative phase imaging. It may find applications in label-free bio-imaging where both large field-of-view and high resolution are needed.

Submitted 28 September, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

arXiv:1905.03371 [pdf]

doi 10.21037/qims.2019.05.04

Rapid and robust whole slide imaging based on LED-array illumination and color-multiplexed single-shot autofocusing

Authors: Shaowei Jiang, Zichao Bian, Xizhi Huang, Pengming Song, He Zhang, Yongbing Zhang, Guoan Zheng

Abstract: Background: The use of whole slide imaging (WSI) for digital pathology has recently been cleared for primary diagnosis in the US. A conventional WSI system scans the tissue slide to different positions and acquires the digital images. In a typical implementation, a focus map is created prior to the scanning process, leading to significant overhead time and a necessity for high positional accuracy of the mechanical system. The resulting cost of WSI system is often prohibitive for frozen section procedure during surgery. Methods: We report a novel WSI scheme based on a programmable LED array for sample illumination. In between two regular brightfield image acquisitions, we acquire one additional image by turning on a red and a green LED for color multiplexed illumination. We then identify the translational shift of the red- and green-channel images by maximizing the image mutual information or cross-correlation. The resulting translational shift is used for dynamic focus correction in the scanning process. Since we track the differential focus during adjacent acquisitions, there is no positional repeatability requirement in our scheme. Results: We demonstrate a prototype WSI platform with a mean focusing error of ~0.3 microns. Different from previous implementations, this prototype platform requires no focus map surveying, no secondary camera or additional optics, and allows for continuous sample motion in the focus tracking process. Conclusions: A programmable LED array can be used for color-multiplexed single-shot autofocusing in WSI. The reported scheme may enable the development of cost-effective WSI platforms without positional repeatability requirement. It may also provide a turnkey solution for other high-content microscopy applications.

Submitted 8 May, 2019; originally announced May 2019.

Journal ref: Quantitative Imaging in Medicine and Surgery, 9(5), 823-831, (2019)

arXiv:1905.00162 [pdf]

doi 10.1063/1.5090552

Full-field Fourier ptychography (FFP): spatially varying pupil modeling and its application for rapid field-dependent aberration metrology

Authors: Pengming Song, Shaowei Jiang, He Zhang, Xizhi Huang, Yongbing Zhang, Guoan Zheng

Abstract: Digital aberration measurement and removal play a prominent role in computational imaging platforms aimed at achieving simple and compact optical arrangements. A recent important class of such platforms is Fourier ptychography, which is geared towards efficiently creating gigapixel images with high resolution and large field of view (FOV). In current FP implementations, pupil aberration is often recovered at each small segment of the entire FOV. This reconstruction strategy fails to consider the field-dependent nature of the optical pupil. Given the power series expansion of the wavefront aberration, the spatially varying pupil can be fully characterized by tens of coefficients over the entire FOV. With this observation, we report a Full-field Fourier Ptychography (FFP) scheme for rapid and robust aberration metrology. The meaning of 'full-field' in FFP is referred to the recovering of the 'full-field' coefficients that govern the field-dependent pupil over the entire FOV. The optimization degrees of freedom are at least two orders of magnitude lower than the previous implementations. We show that the image acquisition process of FFP can be completed in ~1s and the spatially varying aberration of the entire FOV can be recovered in ~35s using a CPU. The reported approach may facilitate the further development of Fourier ptychography. Since no moving part or calibration target is needed in this approach, it may find important applications in aberration metrology. The derivation of the full-field coefficients and its extension for Zernike modes also provide a general tool for analyzing spatially varying aberrations in computational imaging systems.

Submitted 30 April, 2019; originally announced May 2019.

Journal ref: APL Photonics 4, 050802 (2019)

arXiv:1904.11832 [pdf]

doi 10.1364/OL.44.003645

Super-resolution microscopy via ptychographic structured modulation of a diffuser

Authors: Pengming Song, Shaowei Jiang, He Zhang, Zichao Bian, Chengfei Guo, Kazunori Hoshino, Guoan Zheng

Abstract: We report a new coherent imaging technique, termed ptychographic structured modulation (PSM), for quantitative super-resolution microscopy. In this technique, we place a thin diffuser (i.e., a scattering lens) in between the sample and the objective lens to modulate the complex light waves from the object. The otherwise inaccessible high-resolution object information can thus be encoded into the captured images. We then employ a ptychographic phase retrieval process to jointly recover the exit wavefront of the complex object and the unknown diffuser profile. Unlike the illumination-based super-resolution approach, the recovered image of our approach depends upon how the complex wavefront exits the sample - not enters it. Therefore, the sample thickness becomes irrelevant during reconstruction. After recovery, we can propagate the super-resolution complex wavefront to any position along the optical axis. We validate our approach using a resolution target, a quantitative phase target, a two-layer sample, and a thick PDMS sample. We demonstrate a 4.5-fold resolution gain over the diffraction limit. We also show that a 4-fold resolution gain can be achieved with as few as ~30 images. The reported approach may provide a quantitative super-resolution strategy for coherent light, X-ray, and electron imaging.

Submitted 5 June, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

Journal ref: Optics Letters, 44(15), 3645-3648, (2019)

arXiv:1903.12565 [pdf]

doi 10.1364/OL.44.001976

Field-portable quantitative lensless microscopy based on translated speckle illumination and sub-sampled ptychographic phase retrieval

Authors: He Zhang, Zichao Bian, Shaowei Jiang, Jian Liu, Pengming Song, Guoan Zheng

Abstract: We report a compact, cost-effective and field-portable lensless imaging platform for quantitative microscopy. In this platform, the object is placed on top of an image sensor chip without using any lens. We use a low-cost galvo scanner to rapidly scan an unknown laser speckle pattern on the object. To address the positioning repeatability and accuracy issues, we directly recover the positional shifts of the speckle pattern based on the phase correlation of the captured images. To bypass the resolution limit set by the imager pixel size, we employ a sub-sampled ptychographic phase retrieval process to recover the complex object. We validate our approach using a resolution target, a phase target, and a biological sample. Our results show that accurate, high-quality complex images can be obtained from a lensless dataset with as few as ~10 images. We also demonstrate the reported approach to achieve a 6.4 mm by 4.6 mm field of view and a half pitch resolution of 1 micron. The reported approach may provide a quantitative lensless imaging strategy for addressing point-of-care, global-health, and telemedicine related challenges.

Submitted 28 March, 2019; originally announced March 2019.

Journal ref: Optics Letters, 44(8), 1976-1979, (2019)

arXiv:1901.03057 [pdf]

doi 10.1364/OE.27.007498

Near-field Fourier ptychography: super-resolution phase retrieval via speckle illumination

Authors: He Zhang, Shaowei Jiang, Jun Liao, Jun**g Deng, Jian Liu, Yongbing Zhang, Guoan Zheng

Abstract: Achieving high spatial resolution is the goal of many imaging systems. Designing a high-resolution lens with diffraction-limited performance over a large field of view remains a difficult task in imaging system design. On the other hand, creating a complex speckle pattern with wavelength-limited spatial features is effortless and can be implemented via a simple random diffuser. With this observation and inspired by the concept of near-field ptychography, we report a new imaging modality, termed near-field Fourier ptychography, for tackling high-resolution imaging challenges in both microscopic and macroscopic imaging settings. The meaning of 'near-field' is referred to placing the object at a short defocus distance with a large Fresnel number. In our implementations, we project a speckle pattern with fine spatial features on the object instead of directly resolving the spatial features via a high-resolution lens. We then translate the object (or speckle) to different positions and acquire the corresponding images using a low-resolution lens. A ptychographic phase retrieval process is used to recover the complex object, the unknown speckle pattern, and the coherent transfer function at the same time. In a microscopic imaging setup, we use a 0.12 numerical aperture (NA) lens to achieve a NA of 0.85 in the reconstruction process. In a macroscale photographic imaging setup, we achieve ~7-fold resolution gain using a photographic lens. The final achievable resolution is not determined by the collection optics. Instead, it is determined by the feature size of the speckle pattern. The reported imaging modality can be employed in light, coherent X-ray, and transmission electron imaging systems to increase resolution and provide quantitative absorption and phase contrast of the object.

Submitted 9 February, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

Comments: 15 pages, 14 figures

arXiv:1805.07766 [pdf, other]

Constrained Partial Group Decoding with Max-Min Fairness for Multi-color Multi-user Visible Light Communication

Authors: Guangtao Zheng, Chen Gong, Zhengyuan Xu

Abstract: A visible light communication (VLC) system can adopt multi-color light emitting diode (LED) arrays to support multiple users. In this paper, a multi-layer coding and constrained partial group decoding (CPGD) method is proposed to tackle strong color interference and increase the system throughput. After channel model formulation, user information rates are allocated and decoding order for all the received data layers is obtained by solving a max-min fairness problem using a greedy algorithm. An achievable rate is derived under the truncated Gaussian input distribution. To reduce the decoding complexity, a map on the decoding order and rate allocation is constructed for all positions of interest on the receiver plane and its size is reduced by a classification-based algorithm. Meanwhile, the symmetrical geometry of LED arrays is exploited. Finally, the transmitter-user association problem is formulated and solved by a genetic algorithm. It is observed that the system throughput increases as the receivers are slightly misaligned with corresponding LED arrays due to the reduced interference level, but decreases afterwards due to the weakened link gain.

Submitted 20 May, 2018; originally announced May 2018.

Comments: 28 pages, 12 figures, submitted to TCOM

arXiv:1604.05455 [pdf, ps, other]

A hybrid approach for cooperative output regulation with sampled compensator

Authors: Chao Yang, Zhi-Hong Guan, Ming Chi, Gui-Lin Zheng

Abstract: This work investigates the cooperative output regulation problem of linear multi-agent systems with hybrid sampled data control. Due to the limited data sensing and communication, in many practical situations, only sampled data are available for the cooperation of multi-agent systems. To overcome this problem, a distributed hybrid controller is presented for the cooperative output regulation, and cooperative output regulation is achieved by well designed state feedback law. Then it proposed a method for the designing of sampled data controller to solve the cooperative output regulation problem with continuous linear systems and discrete-time communication data. Finally, numerical simulation example for cooperative tracking and a simulation example for optimal control of micro-grids are proposed to illustrate the result of the sampled data control law.

Submitted 19 April, 2016; originally announced April 2016.

Showing 1–48 of 48 results for author: Zheng, G