Search | arXiv e-print repository

Power System Capacity Planning Considering Seasonal Hydrogen Storage by Salt Caverns

Authors: Xueqian He, Tianguang Lu, **g Li, Wanxing Sheng, Rui Li

Abstract: In China, air conditioning in summer and electric heating in winter lead to seasonal volatility in load power. Therefore, it is urgent to develop economic and efficient long-term energy storage systems to enhance peak regulation. Power-to-hydrogen technology is a perspective solution to balance seasonal power fluctuation. However, current hydrogen storage methods have shortcomings such as small st… ▽ More In China, air conditioning in summer and electric heating in winter lead to seasonal volatility in load power. Therefore, it is urgent to develop economic and efficient long-term energy storage systems to enhance peak regulation. Power-to-hydrogen technology is a perspective solution to balance seasonal power fluctuation. However, current hydrogen storage methods have shortcomings such as small storage capacity, high levelized cost and low operation safety, which the salt cavern hydrogen storage could overcome. This paper considers the use of hydrogen storage in salt caverns as a means of peak shaving. To minimize the overall operating cost, a comprehensive power system capacity planning model is proposed with the consideration of hydrogen storage in salt caverns, which is implemented by adopting an improved fast unit commitment method. Considering the seasonal characteristics of the load power in Jiangsu Province, the capacity of the power system in 2050 has been planned. According to the case study, after the optimal deployment of the salt cavern hydrogen storage system (SCHSS), the construction capacity of renewable units (especially wind power) will be significantly increased with environmental friendliness and lower costs. Compared with the current energy storage method, the overall cost of SCHSS-incorporated power system will be reduced by 22.2% with a carbon emission reduction of 24.4%, and the amount of curtailed wind and solar power will be reduced by 27.0% and 13.6%, respectively. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.07891 [pdf]

Collaborative planning and optimization for electric-thermal-hydrogen-coupled energy systems with portfolio selection of the complete hydrogen energy chain

Authors: Xinning Yi, Tianguang Lu, Yixiao Li, Qian Ai, Ran Hao

Abstract: Under the global low-carbon target, the uneven spatiotemporal distribution of renewable energy resources exacerbates the uncertainty and seasonal power imbalance. Additionally, the issue of an incomplete hydrogen energy chain is widely overlooked in planning models, which hinders the complete analysis of the role of hydrogen in energy systems. Therefore, this paper proposes a high-resolution colla… ▽ More Under the global low-carbon target, the uneven spatiotemporal distribution of renewable energy resources exacerbates the uncertainty and seasonal power imbalance. Additionally, the issue of an incomplete hydrogen energy chain is widely overlooked in planning models, which hinders the complete analysis of the role of hydrogen in energy systems. Therefore, this paper proposes a high-resolution collaborative planning model for electricity-thermal-hydrogen-coupled energy systems considering both the spatiotemporal distribution characteristics of renewable energy resources and the multi-scale bottom-to-top investment strategy for the complete hydrogen energy chain. Considering the high-resolution system operation flexibility, this paper proposes a hydrogen chain-based fast clustering optimization method that can handle high-dimensional data and multi-time scale operation characteristics. The model optimizes the geographical distribution and capacity configuration of the Northeast China energy system in 2050, with hourly operational characteristics. The planning optimization covered single-energy devices, multi-energy-coupled conversion devices, and electric-hydrogen transmission networks. Last but not least, this paper thoroughly examines the optimal portfolio selection of different hydrogen technologies based on the differences in cost, flexibility, and efficiency. In the Pareto analysis, the proposed model reduces CO2 emissions by 60% with a competitive cost. This paper provides a zero-carbon pathway for multi-energy systems with a cost 4% less than the social cost of carbon $44.6/ton, and the integration of the complete hydrogen energy chain reduces the renewable energy curtailment by 97.0%. Besides, the portfolio selection results indicate that the system favors the SOEC with the highest energy efficiency and the PEMFC with the fastest dynamic response when achieving zero-carbon emissions △ Less

Submitted 13 November, 2023; originally announced November 2023.

Comments: 32 pages, 17 figures

arXiv:2309.07861 [pdf, other]

CiwaGAN: Articulatory information exchange

Authors: Gašper Beguš, Thomas Lu, Alan Zhou, Peter Wu, Gopala K. Anumanchipalli

Abstract: Humans encode information into sounds by controlling articulators and decode information from sounds using the auditory apparatus. This paper introduces CiwaGAN, a model of human spoken language acquisition that combines unsupervised articulatory modeling with an unsupervised model of information exchange through the auditory modality. While prior research includes unsupervised articulatory modeli… ▽ More Humans encode information into sounds by controlling articulators and decode information from sounds using the auditory apparatus. This paper introduces CiwaGAN, a model of human spoken language acquisition that combines unsupervised articulatory modeling with an unsupervised model of information exchange through the auditory modality. While prior research includes unsupervised articulatory modeling and information exchange separately, our model is the first to combine the two components. The paper also proposes an improved articulatory model with more interpretable internal representations. The proposed CiwaGAN model is the most realistic approximation of human spoken language acquisition using deep learning. As such, it is useful for cognitively plausible simulations of the human speech act. △ Less

Submitted 14 September, 2023; originally announced September 2023.

arXiv:2307.12278 [pdf]

Capacity Expansion of High Renewable Penetrated Energy Systems Considering Concentrating Solar Power for Seasonal Energy Balance

Authors: **g Li, Tianguang Lu, Xinning Yi, Shaorui Wang, Xueqian He

Abstract: With the increasing proportion of variable renewable energy which owns fluctuation characteristics and the promotion of the Clean Heating policy, the seasonal energy imbalance of the system has been more and more challenging. There is a lack of effective means to mitigate this challenge under the background of gradual compression of the traditional thermal unit construction. Concentrating solar po… ▽ More With the increasing proportion of variable renewable energy which owns fluctuation characteristics and the promotion of the Clean Heating policy, the seasonal energy imbalance of the system has been more and more challenging. There is a lack of effective means to mitigate this challenge under the background of gradual compression of the traditional thermal unit construction. Concentrating solar power (CSP) is a promising technology to replace thermal units by integrating emergency boilers to cope with extreme weather, and can meet long-time energy balance as a seasonal peak regulation source. In this paper, we propose a long-term high-resolution expansion planning model of the energy system under high renewable penetration which integrates CSP technology for seasonal energy balance. With the projection to 2050, by taking the energy system in Xinjiang province which is a typical area of the Clean Heating project with rich irradiance as a case study, it shows that the optimal deployment of CSP and electric boiler (EB) can reduce the cost, peak-valley difference of net load and renewable curtailment by 8.73%, 19.72% and 58.24% respectively at 65% renewable penetration compared to the base scenario. △ Less

Submitted 23 July, 2023; originally announced July 2023.

Comments: 17 pages, 13 figures

arXiv:2307.01146 [pdf, other]

AVSegFormer: Audio-Visual Segmentation with Transformer

Authors: Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu

Abstract: The combination of audio and vision has long been a topic of interest in the multi-modal community. Recently, a new audio-visual segmentation (AVS) task has been introduced, aiming to locate and segment the sounding objects in a given video. This task demands audio-driven pixel-level scene understanding for the first time, posing significant challenges. In this paper, we propose AVSegFormer, a nov… ▽ More The combination of audio and vision has long been a topic of interest in the multi-modal community. Recently, a new audio-visual segmentation (AVS) task has been introduced, aiming to locate and segment the sounding objects in a given video. This task demands audio-driven pixel-level scene understanding for the first time, posing significant challenges. In this paper, we propose AVSegFormer, a novel framework for AVS tasks that leverages the transformer architecture. Specifically, we introduce audio queries and learnable queries into the transformer decoder, enabling the network to selectively attend to interested visual features. Besides, we present an audio-visual mixer, which can dynamically adjust visual features by amplifying relevant and suppressing irrelevant spatial channels. Additionally, we devise an intermediate mask loss to enhance the supervision of the decoder, encouraging the network to produce more accurate intermediate predictions. Extensive experiments demonstrate that AVSegFormer achieves state-of-the-art results on the AVS benchmark. The code is available at https://github.com/vvvb-github/AVSegFormer. △ Less

Submitted 18 December, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: 7 pages, 6 figures

arXiv:2306.08955 [pdf, other]

A Comparison of Self-Supervised Pretraining Approaches for Predicting Disease Risk from Chest Radiograph Images

Authors: Yanru Chen, Michael T Lu, Vineet K Raghu

Abstract: Deep learning is the state-of-the-art for medical imaging tasks, but requires large, labeled datasets. For risk prediction, large datasets are rare since they require both imaging and follow-up (e.g., diagnosis codes). However, the release of publicly available imaging data with diagnostic labels presents an opportunity for self and semi-supervised approaches to improve label efficiency for risk p… ▽ More Deep learning is the state-of-the-art for medical imaging tasks, but requires large, labeled datasets. For risk prediction, large datasets are rare since they require both imaging and follow-up (e.g., diagnosis codes). However, the release of publicly available imaging data with diagnostic labels presents an opportunity for self and semi-supervised approaches to improve label efficiency for risk prediction. Though several studies have compared self-supervised approaches in natural image classification, object detection, and medical image interpretation, there is limited data on which approaches learn robust representations for risk prediction. We present a comparison of semi- and self-supervised learning to predict mortality risk using chest x-ray images. We find that a semi-supervised autoencoder outperforms contrastive and transfer learning in internal and external validation. △ Less

Submitted 15 June, 2023; originally announced June 2023.

Comments: 33 pages, 22 figures, Accepted for publication at MIDL 2023

arXiv:2305.01626 [pdf, other]

Basic syntax from speech: Spontaneous concatenation in unsupervised deep neural networks

Authors: Gašper Beguš, Thomas Lu, Zili Wang

Abstract: Computational models of syntax are predominantly text-based. Here we propose that basic syntax can be modeled directly from raw speech in a fully unsupervised way. We focus on one of the most ubiquitous and basic properties of syntax -- concatenation. We introduce spontaneous concatenation: a phenomenon where convolutional neural networks (CNNs) trained on acoustic recordings of individual words s… ▽ More Computational models of syntax are predominantly text-based. Here we propose that basic syntax can be modeled directly from raw speech in a fully unsupervised way. We focus on one of the most ubiquitous and basic properties of syntax -- concatenation. We introduce spontaneous concatenation: a phenomenon where convolutional neural networks (CNNs) trained on acoustic recordings of individual words start generating outputs with two or even three words concatenated without ever accessing data with multiple words in the input. Additionally, networks trained on two words learn to embed words into novel unobserved word combinations. To our knowledge, this is a previously unreported property of CNNs trained on raw speech in the Generative Adversarial Network setting and has implications both for our understanding of how these architectures learn as well as for modeling syntax and its evolution from raw acoustic inputs. △ Less

Submitted 2 May, 2023; originally announced May 2023.

arXiv:2305.00043 [pdf, other]

Secret Key Generation for IRS-Assisted Multi-Antenna Systems: A Machine Learning-Based Approach

Authors: Chen Chen, Junqing Zhang, Tianyu Lu, Magnus Sandell, Liquan Chen

Abstract: Physical-layer key generation (PKG) based on wireless channels is a lightweight technique to establish secure keys between legitimate communication nodes. Recently, intelligent reflecting surfaces (IRSs) have been leveraged to enhance the performance of PKG in terms of secret key rate (SKR), as it can reconfigure the wireless propagation environment and introduce more channel randomness. In this p… ▽ More Physical-layer key generation (PKG) based on wireless channels is a lightweight technique to establish secure keys between legitimate communication nodes. Recently, intelligent reflecting surfaces (IRSs) have been leveraged to enhance the performance of PKG in terms of secret key rate (SKR), as it can reconfigure the wireless propagation environment and introduce more channel randomness. In this paper, we investigate an IRS-assisted PKG system, taking into account the channel spatial correlation at both the base station (BS) and the IRS. Based on the considered system model, the closed-form expression of SKR is derived analytically considering correlated eavesdrop** channels. Aiming to maximise the SKR, a joint design problem of the BS precoding matrix and the IRS phase shift vector is formulated. To address this high-dimensional non-convex optimisation problem, we propose a novel unsupervised deep neural network (DNN)-based algorithm with a simple structure. Different from most previous works that adopt iterative optimisation to solve the problem, the proposed DNN-based algorithm directly obtains the BS precoding and IRS phase shifts as the output of the DNN. Simulation results reveal that the proposed DNN-based algorithm outperforms the benchmark methods with regard to SKR. △ Less

Submitted 28 April, 2023; originally announced May 2023.

Comments: This paper has been submitted to IEEE Transactions for possible publications. arXiv admin note: substantial text overlap with arXiv:2301.08179

arXiv:2301.08179 [pdf, other]

Machine Learning-Based Secret Key Generation for IRS-assisted Multi-antenna Systems

Authors: Chen Chen, Junqing Zhang, Tianyu Lu, Magnus Sandell, Liquan Chen

Abstract: Physical-layer key generation (PKG) based on wireless channels is a lightweight technique to establish secure keys between legitimate communication nodes. Recently, intelligent reflecting surfaces (IRSs) have been leveraged to enhance the performance of PKG in terms of secret key rate (SKR), as it can reconfigure the wireless propagation environment and introduce more channel randomness. In this p… ▽ More Physical-layer key generation (PKG) based on wireless channels is a lightweight technique to establish secure keys between legitimate communication nodes. Recently, intelligent reflecting surfaces (IRSs) have been leveraged to enhance the performance of PKG in terms of secret key rate (SKR), as it can reconfigure the wireless propagation environment and introduce more channel randomness. In this paper, we investigate an IRS-assisted PKG system, taking into account the channel spatial correlation at both the base station (BS) and the IRS. Based on the considered system model, the closed form expression of SKR is derived analytically. Aiming to maximize the SKR, a joint design problem of the BS precoding matrix and the IRS reflecting coefficient vector is formulated. To address this high-dimensional non-convex optimization problem, we propose a novel unsupervised deep neural network (DNN) based algorithm with a simple structure. Different from most previous works that adopt the iterative optimization to solve the problem, the proposed DNN based algorithm directly obtains the BS precoding and IRS phase shifts as the output of the DNN. Simulation results reveal that the proposed DNN-based algorithm outperforms the benchmark methods with regard to SKR. △ Less

Submitted 19 January, 2023; originally announced January 2023.

Comments: Accepted by ICC 2023

arXiv:2212.04813 [pdf]

Remote estimation of geologic composition using interferometric synthetic-aperture radar in California's Central Valley

Authors: Kyongsik Yun, Kyra Adams, John Reager, Zhen Liu, Caitlyn Chavez, Michael Turmon, Thomas Lu

Abstract: California's Central Valley is the national agricultural center, producing 1/4 of the nation's food. However, land in the Central Valley is sinking at a rapid rate (as much as 20 cm per year) due to continued groundwater pum**. Land subsidence has a significant impact on infrastructure resilience and groundwater sustainability. In this study, we aim to identify specific regions with different te… ▽ More California's Central Valley is the national agricultural center, producing 1/4 of the nation's food. However, land in the Central Valley is sinking at a rapid rate (as much as 20 cm per year) due to continued groundwater pum**. Land subsidence has a significant impact on infrastructure resilience and groundwater sustainability. In this study, we aim to identify specific regions with different temporal dynamics of land displacement and find relationships with underlying geological composition. Then, we aim to remotely estimate geologic composition using interferometric synthetic aperture radar (InSAR)-based land deformation temporal changes using machine learning techniques. We identified regions with different temporal characteristics of land displacement in that some areas (e.g., Helm) with coarser grain geologic compositions exhibited potentially reversible land deformation (elastic land compaction). We found a significant correlation between InSAR-based land deformation and geologic composition using random forest and deep neural network regression models. We also achieved significant accuracy with 1/4 sparse sampling to reduce any spatial correlations among data, suggesting that the model has the potential to be generalized to other regions for indirect estimation of geologic composition. Our results indicate that geologic composition can be estimated using InSAR-based land deformation data. In-situ measurements of geologic composition can be expensive and time consuming and may be impractical in some areas. The generalizability of the model sheds light on high spatial resolution geologic composition estimation utilizing existing measurements. △ Less

Submitted 4 December, 2022; originally announced December 2022.

Comments: 10 pages, 7 figures, NeurIPS 2022

arXiv:2211.08708 [pdf, ps, other]

Exploring Detection-based Method For Speaker Diarization @ Ego4D Audio-only Diarization Challenge 2022

Authors: Jiahao Wang, Guo Chen, Yin-Dong Zheng, Tong Lu

Abstract: We provide the technical report for Ego4D audio-only diarization challenge in ECCV 2022. Speaker diarization takes the audio streams as input and outputs the homogeneous segments according to the speaker's identity. It aims to solve the problem of "Who spoke when." In this paper, we explore a Detection-based method to tackle the audio-only speaker diarization task. Our method first extracts audio… ▽ More We provide the technical report for Ego4D audio-only diarization challenge in ECCV 2022. Speaker diarization takes the audio streams as input and outputs the homogeneous segments according to the speaker's identity. It aims to solve the problem of "Who spoke when." In this paper, we explore a Detection-based method to tackle the audio-only speaker diarization task. Our method first extracts audio features by audio backbone and then feeds the feature to a detection-generate network to get the speaker proposals. Finally, after postprocessing, we can get the diarization results. The validation dataset validates this method, and our method achieves 53.85 DER on the test dataset. These results rank 3rd on the leaderboard of Ego4D audio-only diarization challenge 2022. △ Less

Submitted 16 November, 2022; originally announced November 2022.

Comments: 2 pages

arXiv:2204.07446 [pdf, other]

doi 10.1109/ACCESS.2022.3201645

Wi-Fi and Bluetooth Contact Tracing Without User Intervention

Authors: Brosnan Yuen, Yifeng Bie, Duncan Cairns, Geoffrey Harper, Jason Xu, Charles Chang, Xiaodai Dong, Tao Lu

Abstract: Previous contact tracing systems required the users to perform many manual actions, such as installing smartphone applications, joining wireless networks, or carrying custom user devices. This increases the barrier to entry and lowers the user adoption rate. As a result, the contact tracing effectiveness is reduced. Unlike the systems above, we propose a new privacy preserving Wi-Fi and Bluetooth… ▽ More Previous contact tracing systems required the users to perform many manual actions, such as installing smartphone applications, joining wireless networks, or carrying custom user devices. This increases the barrier to entry and lowers the user adoption rate. As a result, the contact tracing effectiveness is reduced. Unlike the systems above, we propose a new privacy preserving Wi-Fi and Bluetooth (BLE) contact tracing system that does not require smartphone applications, joining wireless networks, or custom user devices. Our specially built routers seamlessly track smartphones, laptops, smartwatches, BLE headphones, and tablets without any user action, but do not trace user identity. Map** between devices and users is only carried out for confirmed cases and suspected contacts. Moreover, we can track the absolute positions of user devices within 1.0 m due to using bidirectional long short-term memory neural networks that are trained with data pre-collected by an autonomous robot. This allows public health authorities to track indirect droplet and surface transmissions that other contact tracing systems often overlook. △ Less

Submitted 23 July, 2022; v1 submitted 30 March, 2022; originally announced April 2022.

Report number: 2169-3536

Journal ref: IEEE Access Volume 11 (2022) 91027-91044

arXiv:2202.09635 [pdf, other]

Deep Single Image Deraining using An Asymetric Cycle Generative and Adversarial Framework

Authors: Wei Liu, Rui Jiang, Cheng Chen, Tao Lu, Zixiang Xiong

Abstract: In reality, rain and fog are often present at the same time, which can greatly reduce the clarity and quality of the scene image. However, most unsupervised single image deraining methods mainly focus on rain streak removal by disregarding the fog, which leads to low-quality deraining performance. In addition, the samples are rather homogeneous generated by these methods and lack diversity, result… ▽ More In reality, rain and fog are often present at the same time, which can greatly reduce the clarity and quality of the scene image. However, most unsupervised single image deraining methods mainly focus on rain streak removal by disregarding the fog, which leads to low-quality deraining performance. In addition, the samples are rather homogeneous generated by these methods and lack diversity, resulting in poor results in the face of complex rain scenes. To address the above issues, we propose a novel Asymetric Cycle Generative and Adversarial framework (ACGF) for single image deraining that trains on both synthetic and real rainy images while simultaneously capturing both rain streaks and fog features. ACGF consists of a Rain-fog2Clean (R2C) transformation block and a Clean2Rain-fog (C2R) transformation block. The former consists of parallel rain removal path and rain-fog feature extraction path by the rain and derain-fog network and the attention rain-fog feature extraction network (ARFE) , while the latter only contains a synthetic rain transformation path. In rain-fog feature extraction path, to better characterize the rain-fog fusion feature, we employ an ARFE to exploit the self-similarity of global and local rain-fog information by learning the spatial feature correlations. Moreover, to improve the translational capacity of C2R and the diversity of models, we design a rain-fog feature decoupling and reorganization network (RFDR) by embedding a rainy image degradation model and a mixed discriminator to preserve richer texture details in synthetic rain conversion path. Extensive experiments on benchmark rain-fog and rain datasets show that ACGF outperforms state-of-the-art deraining methods. We also conduct defogging performance evaluation experiments to further demonstrate the effectiveness of ACGF. △ Less

Submitted 18 May, 2023; v1 submitted 19 February, 2022; originally announced February 2022.

arXiv:2111.14281 [pdf, other]

Passive Indoor Localization with WiFi Fingerprints

Authors: Minh Tu Hoang, Brosnan Yuen, Kai Ren, Ahmed Elmoogy, Xiaodai Dong, Tao Lu, Robert Westendorp, Kishore Reddy Tarimala

Abstract: This paper proposes passive WiFi indoor localization. Instead of using WiFi signals received by mobile devices as fingerprints, we use signals received by routers to locate the mobile carrier. Consequently, software installation on the mobile device is not required. To resolve the data insufficiency problem, flow control signals such as request to send (RTS) and clear to send (CTS) are utilized. I… ▽ More This paper proposes passive WiFi indoor localization. Instead of using WiFi signals received by mobile devices as fingerprints, we use signals received by routers to locate the mobile carrier. Consequently, software installation on the mobile device is not required. To resolve the data insufficiency problem, flow control signals such as request to send (RTS) and clear to send (CTS) are utilized. In our model, received signal strength indicator (RSSI) and channel state information (CSI) are used as fingerprints for several algorithms, including deterministic, probabilistic and neural networks localization algorithms. We further investigated localization algorithms performance through extensive on-site experiments with various models of phones at hundreds of testing locations. We demonstrate that our passive scheme achieves an average localization error of 0.8 m when the phone is actively transmitting data frames and 1.5 m when it is not transmitting data frames. △ Less

Submitted 28 November, 2021; originally announced November 2021.

Comments: 10 pages, 9 figures, data is availabe in IEEE portal

arXiv:2111.13312 [pdf]

doi 10.1109/BHI50953.2021.9508588

Instrumented shoulder functional assessment using inertial measurement units for frozen shoulder

Authors: Ting-Yang Lu, Kai-Chun Liu, Chia-Yeh Hsieh, Chih-Ya Chang, Yu Tsao, Chia-Tai Chan

Abstract: Frozen shoulder (FS) is a shoulder condition that leads to pain and loss of shoulder range of motion. FS patients have difficulties in independently performing daily activities. Inertial measurement units (IMUs) have been developed to objectively measure upper limb range of motion (ROM) and shoulder function. In this work, we propose an IMU-based shoulder functional task assessment with kinematic… ▽ More Frozen shoulder (FS) is a shoulder condition that leads to pain and loss of shoulder range of motion. FS patients have difficulties in independently performing daily activities. Inertial measurement units (IMUs) have been developed to objectively measure upper limb range of motion (ROM) and shoulder function. In this work, we propose an IMU-based shoulder functional task assessment with kinematic parameters (e.g., smoothness, power, speed, and duration) in FS patients and analyze the functional performance on complete shoulder tasks and subtasks. Twenty FS patients and twenty healthy subjects were recruited in this study. Five shoulder functional tasks are performed by participants, such as washing hair (WH), washing upper back (WUB), washing lower back (WLB), placing an object on a high shelf (POH), and removing an object from back pocket (ROP). The results demonstrate that the used smoothness features can reflect the differences of movement fluency between FS patients and healthy controls (p < 0.05 and effect size > 0.8). Moreover, features of subtasks provided subtle information related to clinical conditions that have not been revealed in features of a complete task, especially the defined subtask 1 and 2 of each task. △ Less

Submitted 25 November, 2021; originally announced November 2021.

Comments: 4 pages, 6 tables, 2 figures, To appear in 2021 IEEE BHI

arXiv:2108.02317 [pdf]

Efficient Fourier single-pixel imaging with Gaussian random sampling

Authors: Ziheng Qiu, Xinyi Guo, Tianao Lu, Pan Qi, Zibang Zhang, **gang Zhong

Abstract: Fourier single-pixel imaging (FSI) is a branch of single-pixel imaging techniques. It uses Fourier basis patterns as structured patterns for spatial information acquisition in the Fourier domain. However, the spatial resolution of the image reconstructed by FSI mainly depends on the number of Fourier coefficients sampled. The reconstruction of a high-resolution image typically requires a number of… ▽ More Fourier single-pixel imaging (FSI) is a branch of single-pixel imaging techniques. It uses Fourier basis patterns as structured patterns for spatial information acquisition in the Fourier domain. However, the spatial resolution of the image reconstructed by FSI mainly depends on the number of Fourier coefficients sampled. The reconstruction of a high-resolution image typically requires a number of Fourier coefficients to be sampled, and therefore takes a long data acquisition time. Here we propose a new sampling strategy for FSI. It allows FSI to reconstruct a clear and sharp image with a reduced number of measurements. The core of the proposed sampling strategy is to perform a variable density sampling in the Fourier space and, more importantly, the density with respect to the importance of Fourier coefficients is subject to a one-dimensional Gaussian function. Combined with compressive sensing, the proposed sampling strategy enables better reconstruction quality than conventional sampling strategies, especially when the sampling ratio is low. We experimentally demonstrate compressive FSI combined with the proposed sampling strategy is able to reconstruct a sharp and clear image of 256-by-256 pixels with a sampling ratio of 10%. The proposed method enables fast single-pixel imaging and provides a new approach for efficient spatial information acquisition. △ Less

Submitted 28 June, 2021; originally announced August 2021.

arXiv:2105.11576 [pdf, other]

Pan-sharpening via High-pass Modification Convolutional Neural Network

Authors: Jiaming Wang, Zhenfeng Shao, Xiao Huang, Tao Lu, Ruiqian Zhang, Jiayi Ma

Abstract: Most existing deep learning-based pan-sharpening methods have several widely recognized issues, such as spectral distortion and insufficient spatial texture enhancement, we propose a novel pan-sharpening convolutional neural network based on a high-pass modification block. Different from existing methods, the proposed block is designed to learn the high-pass information, leading to enhance spatial… ▽ More Most existing deep learning-based pan-sharpening methods have several widely recognized issues, such as spectral distortion and insufficient spatial texture enhancement, we propose a novel pan-sharpening convolutional neural network based on a high-pass modification block. Different from existing methods, the proposed block is designed to learn the high-pass information, leading to enhance spatial information in each band of the multi-spectral-resolution images. To facilitate the generation of visually appealing pan-sharpened images, we propose a perceptual loss function and further optimize the model based on high-level features in the near-infrared space. Experiments demonstrate the superior performance of the proposed method compared to the state-of-the-art pan-sharpening methods, both quantitatively and qualitatively. The proposed model is open-sourced at https://github.com/jiaming-wang/HMB. △ Less

Submitted 24 May, 2021; originally announced May 2021.

Comments: 5 pages, 5 figures, accepted by the 28th IEEE International Conference on Image Processing (ICIP 2021)

arXiv:2105.10949 [pdf, other]

doi 10.1109/LGRS.2021.3112038

SSCAN: A Spatial-spectral Cross Attention Network for Hyperspectral Image Denoising

Authors: Zhiqiang Wang, Zhenfeng Shao, Xiao Huang, Jiaming Wang, Tao Lu, Sihang Zhang

Abstract: Hyperspectral images (HSIs) have been widely used in a variety of applications thanks to the rich spectral information they are able to provide. Among all HSI processing tasks, HSI denoising is a crucial step. Recently, deep learning-based image denoising methods have made great progress and achieved great performance. However, existing methods tend to ignore the correlations between adjacent spec… ▽ More Hyperspectral images (HSIs) have been widely used in a variety of applications thanks to the rich spectral information they are able to provide. Among all HSI processing tasks, HSI denoising is a crucial step. Recently, deep learning-based image denoising methods have made great progress and achieved great performance. However, existing methods tend to ignore the correlations between adjacent spectral bands, leading to problems such as spectral distortion and blurred edges in denoised results. In this study, we propose a novel HSI denoising network, termed SSCAN, that combines group convolutions and attention modules. Specifically, we use a group convolution with a spatial attention module to facilitate feature extraction by directing models' attention to band-wise important features. We propose a spectral-spatial attention block (SSAB) to exploit the spatial and spectral information in hyperspectral images in an effective manner. In addition, we adopt residual learning operations with skip connections to ensure training stability. The experimental results indicate that the proposed SSCAN outperforms several state-of-the-art HSI denoising algorithms. △ Less

Submitted 23 May, 2021; originally announced May 2021.

Comments: 5 pages, 5 figures, submitted to IEEE Signal Processing Letters

arXiv:2105.03579 [pdf, other]

Unsupervised Remote Sensing Super-Resolution via Migration Image Prior

Authors: Jiaming Wang, Zhenfeng Shao, Tao Lu, Xiao Huang, Ruiqian Zhang, Yu Wang

Abstract: Recently, satellites with high temporal resolution have fostered wide attention in various practical applications. Due to limitations of bandwidth and hardware cost, however, the spatial resolution of such satellites is considerably low, largely limiting their potentials in scenarios that require spatially explicit information. To improve image resolution, numerous approaches based on training low… ▽ More Recently, satellites with high temporal resolution have fostered wide attention in various practical applications. Due to limitations of bandwidth and hardware cost, however, the spatial resolution of such satellites is considerably low, largely limiting their potentials in scenarios that require spatially explicit information. To improve image resolution, numerous approaches based on training low-high resolution pairs have been proposed to address the super-resolution (SR) task. Despite their success, however, low/high spatial resolution pairs are usually difficult to obtain in satellites with a high temporal resolution, making such approaches in SR impractical to use. In this paper, we proposed a new unsupervised learning framework, called "MIP", which achieves SR tasks without low/high resolution image pairs. First, random noise maps are fed into a designed generative adversarial network (GAN) for reconstruction. Then, the proposed method converts the reference image to latent space as the migration image prior. Finally, we update the input noise via an implicit method, and further transfer the texture and structured information from the reference image. Extensive experimental results on the Draper dataset show that MIP achieves significant improvements over state-of-the-art methods both quantitatively and qualitatively. The proposed MIP is open-sourced at http://github.com/jiaming-wang/MIP. △ Less

Submitted 23 May, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

Comments: 6 pages, 4 figures. IEEE International Conference on Multimedia and Expo (ICME) 2021

arXiv:2104.07116 [pdf, other]

doi 10.1109/VTC2020-Fall49728.2020.9348592

Meteorologically Introduced Impacts on Aerial Channels and UAV Communications

Authors: Mengan Song, Yiming Huo, Tao Lu, Xiaodai Dong, Zhonghua Liang

Abstract: As 5G wireless systems and networks are now being globally commercialized and deployed, more diversified application scenarios are emerging, quickly resha** our societies and paving the road to the beyond 5G (6G) era when terahertz (THz) and unmanned aerial vehicle (UAV) communications may play critical roles. In this paper, aerial channel models under multiple meteorological conditions such as… ▽ More As 5G wireless systems and networks are now being globally commercialized and deployed, more diversified application scenarios are emerging, quickly resha** our societies and paving the road to the beyond 5G (6G) era when terahertz (THz) and unmanned aerial vehicle (UAV) communications may play critical roles. In this paper, aerial channel models under multiple meteorological conditions such as rain, fog and snow, have been investigated at frequencies of interest (from 2 GHz to 900 GHz) for UAV communications. Furthermore, the link budget and the received signal-to-noise ratio (SNR) performance under the existing air-to-ground (A2G) channel models are studied with antenna(s) system considered. The relationship between the 3D coverage radius and UAV altitude under the influence of multiple weather (MW) conditions is simulated. Numerical results show that medium rain has the most effects on the UAV's coverage for UAV communications at millimeter wave (mmWave) bands, while snow has the largest impacts at near THz bands. In addition, when the frequency increases, the corresponding increase in the number of antennas can effectively compensate for the propagation loss introduced by weather factors, while its form factor and weight can be kept to maintain the UAV's payload. △ Less

Submitted 14 April, 2021; originally announced April 2021.

Comments: 5 pages, 7 figures, accepted by IEEE VTC2020-FALL

arXiv:2103.15683 [pdf, other]

Omniscient Video Super-Resolution

Authors: Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, Xin Tian, Jiayi Ma

Abstract: Most recent video super-resolution (SR) methods either adopt an iterative manner to deal with low-resolution (LR) frames from a temporally sliding window, or leverage the previously estimated SR output to help reconstruct the current frame recurrently. A few studies try to combine these two structures to form a hybrid framework but have failed to give full play to it. In this paper, we propose an… ▽ More Most recent video super-resolution (SR) methods either adopt an iterative manner to deal with low-resolution (LR) frames from a temporally sliding window, or leverage the previously estimated SR output to help reconstruct the current frame recurrently. A few studies try to combine these two structures to form a hybrid framework but have failed to give full play to it. In this paper, we propose an omniscient framework to not only utilize the preceding SR output, but also leverage the SR outputs from the present and future. The omniscient framework is more generic because the iterative, recurrent and hybrid frameworks can be regarded as its special cases. The proposed omniscient framework enables a generator to behave better than its counterparts under other frameworks. Abundant experiments on public datasets show that our method is superior to the state-of-the-art methods in objective metrics, subjective visual effects and complexity. Our code will be made public. △ Less

Submitted 29 March, 2021; originally announced March 2021.

arXiv:2103.11784 [pdf, other]

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

Authors: Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, ** Luo

Abstract: We present an extremely simple Ultra-Resolution Style Transfer framework, termed URST, to flexibly process arbitrary high-resolution images (e.g., 10000x10000 pixels) style transfer for the first time. Most of the existing state-of-the-art methods would fall short due to massive memory cost and small stroke size when processing ultra-high resolution images. URST completely avoids the memory proble… ▽ More We present an extremely simple Ultra-Resolution Style Transfer framework, termed URST, to flexibly process arbitrary high-resolution images (e.g., 10000x10000 pixels) style transfer for the first time. Most of the existing state-of-the-art methods would fall short due to massive memory cost and small stroke size when processing ultra-high resolution images. URST completely avoids the memory problem caused by ultra-high resolution images by (1) dividing the image into small patches and (2) performing patch-wise style transfer with a novel Thumbnail Instance Normalization (TIN). Specifically, TIN can extract thumbnail features' normalization statistics and apply them to small patches, ensuring the style consistency among different patches. Overall, the URST framework has three merits compared to prior arts. (1) We divide input image into small patches and adopt TIN, successfully transferring image style with arbitrary high-resolution. (2) Experiments show that our URST surpasses existing SOTA methods on ultra-high resolution images benefiting from the effectiveness of the proposed stroke perceptual loss in enlarging the stroke size. (3) Our URST can be easily plugged into most existing style transfer methods and directly improve their performance even without training. Code is available at https://git.io/URST. △ Less

Submitted 15 March, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

Comments: Accepted to AAAI 2022

arXiv:2006.14080 [pdf, other]

Accelerating MRI Reconstruction on TPUs

Authors: Tianjian Lu, Thibault Marin, Yue Zhuo, Yi-Fan Chen, Chao Ma

Abstract: The advanced magnetic resonance (MR) image reconstructions such as the compressed sensing and subspace-based imaging are considered as large-scale, iterative, optimization problems. Given the large number of reconstructions required by the practical clinical usage, the computation time of these advanced reconstruction methods is often unacceptable. In this work, we propose using Google's Tensor Pr… ▽ More The advanced magnetic resonance (MR) image reconstructions such as the compressed sensing and subspace-based imaging are considered as large-scale, iterative, optimization problems. Given the large number of reconstructions required by the practical clinical usage, the computation time of these advanced reconstruction methods is often unacceptable. In this work, we propose using Google's Tensor Processing Units (TPUs) to accelerate the MR image reconstruction. TPU is an application-specific integrated circuit (ASIC) for machine learning applications, which has recently been used to solve large-scale scientific computing problems. As proof-of-concept, we implement the alternating direction method of multipliers (ADMM) in TensorFlow to reconstruct images on TPUs. The reconstruction is based on multi-channel, sparsely sampled, and radial-trajectory $k$-space data with sparsity constraints. The forward and inverse non-uniform Fourier transform operations are formulated in terms of matrix multiplications as in the discrete Fourier transform. The sparsifying transform and its adjoint operations are formulated as convolutions. The data decomposition is applied to the measured $k$-space data such that the aforementioned tensor operations are localized within individual TPU cores. The data decomposition and the inter-core communication strategy are designed in accordance with the TPU interconnect network topology in order to minimize the communication time. The accuracy and the high parallel efficiency of the proposed TPU-based image reconstruction method are demonstrated through numerical examples. △ Less

Submitted 24 June, 2020; originally announced June 2020.

arXiv:2005.06394 [pdf, other]

A CNN-LSTM Quantifier for Single Access Point CSI Indoor Localization

Authors: Minh Tu Hoang, Brosnan Yuen, Kai Ren, Xiaodai Dong, Tao Lu, Robert Westendorp, Kishore Reddy

Abstract: This paper proposes a combined network structure between convolutional neural network (CNN) and long-short term memory (LSTM) quantifier for WiFi fingerprinting indoor localization. In contrast to conventional methods that utilize only spatial data with classification models, our CNN-LSTM network extracts both space and time features of the received channel state information (CSI) from a single ro… ▽ More This paper proposes a combined network structure between convolutional neural network (CNN) and long-short term memory (LSTM) quantifier for WiFi fingerprinting indoor localization. In contrast to conventional methods that utilize only spatial data with classification models, our CNN-LSTM network extracts both space and time features of the received channel state information (CSI) from a single router. Furthermore, the proposed network builds a quantification model rather than a limited classification model as in most of the literature work, which enables the estimation of testing points that are not identical to the reference points. We analyze the instability of CSI and demonstrate a mitigation solution using a comprehensive filter and normalization scheme. The localization accuracy is investigated through extensive on-site experiments with several mobile devices including mobile phone (Nexus 5) and laptop (Intel 5300 NIC) on hundreds of testing locations. Using only a single WiFi router, our structure achieves an average localization error of 2.5~m with $\mathrm{80\%}$ of the errors under 4~m, which outperforms the other reported algorithms by approximately $\mathrm{50\%}$ under the same test environment. △ Less

Submitted 13 May, 2020; originally announced May 2020.

Comments: Channel state information (CSI), WiFi indoor localization, convolutional neural network, long short-term memory, fingerprint-based localization

arXiv:2001.02400 [pdf, other]

doi 10.1109/JSEN.2020.2972850

Semi-Sequential Probabilistic Model For Indoor Localization Enhancement

Authors: Minh Tu Hoang, Brosnan Yuen, Xiaodai Dong, Tao Lu, Robert Westendorp, Kishore Reddy

Abstract: This paper proposes a semi-sequential probabilistic model (SSP) that applies an additional short term memory to enhance the performance of the probabilistic indoor localization. The conventional probabilistic methods normally treat the locations in the database indiscriminately. In contrast, SSP leverages the information of the previous position to determine the probable location since the user's… ▽ More This paper proposes a semi-sequential probabilistic model (SSP) that applies an additional short term memory to enhance the performance of the probabilistic indoor localization. The conventional probabilistic methods normally treat the locations in the database indiscriminately. In contrast, SSP leverages the information of the previous position to determine the probable location since the user's speed in an indoor environment is bounded and locations near the previous one have higher probability than the other locations. Although the SSP utilizes the previous location information, it does not require the exact moving speed and direction of the user. On-site experiments using the received signal strength indicator (RSSI) and channel state information (CSI) fingerprints for localization demonstrate that SSP reduces the maximum error and boosts the performance of existing probabilistic approaches by 25% - 30%. △ Less

Submitted 8 January, 2020; originally announced January 2020.

Report number: 1558-1748

Journal ref: IEEE Sensors Journal Volume 20 Issue 11 (2020) 6160 - 6169

arXiv:1911.13218 [pdf]

ModelHub.AI: Dissemination Platform for Deep Learning Models

Authors: Ahmed Hosny, Michael Schwier, Christoph Berger, Evin P Örnek, Mehmet Turan, Phi V Tran, Leon Weninger, Fabian Isensee, Klaus H Maier-Hein, Richard McKinley, Michael T Lu, Udo Hoffmann, Bjoern Menze, Spyridon Bakas, Andriy Fedorov, Hugo JWL Aerts

Abstract: Recent advances in artificial intelligence research have led to a profusion of studies that apply deep learning to problems in image analysis and natural language processing among others. Additionally, the availability of open-source computational frameworks has lowered the barriers to implementing state-of-the-art methods across multiple domains. Albeit leading to major performance breakthroughs… ▽ More Recent advances in artificial intelligence research have led to a profusion of studies that apply deep learning to problems in image analysis and natural language processing among others. Additionally, the availability of open-source computational frameworks has lowered the barriers to implementing state-of-the-art methods across multiple domains. Albeit leading to major performance breakthroughs in some tasks, effective dissemination of deep learning algorithms remains challenging, inhibiting reproducibility and benchmarking studies, impeding further validation, and ultimately hindering their effectiveness in the cumulative scientific progress. In develo** a platform for sharing research outputs, we present ModelHub.AI (www.modelhub.ai), a community-driven container-based software engine and platform for the structured dissemination of deep learning models. For contributors, the engine controls data flow throughout the inference cycle, while the contributor-facing standard template exposes model-specific functions including inference, as well as pre- and post-processing. Python and RESTful Application programming interfaces (APIs) enable users to interact with models hosted on ModelHub.AI and allows both researchers and developers to utilize models out-of-the-box. ModelHub.AI is domain-, data-, and framework-agnostic, catering to different workflows and contributors' preferences. △ Less

Submitted 26 November, 2019; originally announced November 2019.

arXiv:1911.09987 [pdf, other]

Transmission System Resilience Enhancement with Extended Steady-state Security Region in Consideration of Uncertain Topology Changes

Authors: Chong Wang, Feng Wu, ** Ju, Shunbo Lei, Tianguang Lu, Yunhe Hou

Abstract: The increasing extreme weather events poses unprecedented challenges on power system operation because of their uncertain and sequential impacts on power systems. This paper proposes the concept of an extended steady-state security region (ESSR), and resilience enhancement for transmission systems based on ESSR in consideration of uncertain varying topology changes caused by the extreme weather ev… ▽ More The increasing extreme weather events poses unprecedented challenges on power system operation because of their uncertain and sequential impacts on power systems. This paper proposes the concept of an extended steady-state security region (ESSR), and resilience enhancement for transmission systems based on ESSR in consideration of uncertain varying topology changes caused by the extreme weather events is implemented. ESSR is a ploytope describing a region, in which the operating points are within the operating constraints. In consideration of uncertain varying topology changes with ESSR, the resilience enhancement problem is built as a bilevel programming optimization model, in which the system operators deploy the optimal strategy against the most threatening scenario caused by the extreme weather events. To avoid the curse of dimensionality with regard to system topologies for a large scale system, the Monte Carlo method is used to generate uncertain system topologies, and a recursive McCormick envelope-based approach is proposed to connect generated system topologies to optimization variables. Karush Kuhn Tucker (KKT) conditions are used to transform the suboptimization model in the second level into a group of equivalent constraints in the first level. A simple test system and IEEE 118-bus system are used to validate the proposed. △ Less

Submitted 22 November, 2019; originally announced November 2019.

arXiv:1910.00650 [pdf]

doi 10.1016/j.jmr.2020.106790

pISTA-SENSE-ResNet for Parallel MRI Reconstruction

Authors: Tieyuan Lu, Xinlin Zhang, Yihui Huang, Yonggui Yang, Gang Guo, Lijun Bao, Feng Huang, Di Guo, Xiaobo Qu

Abstract: Magnetic resonance imaging has been widely applied in clinical diagnosis, however, is limited by its long data acquisition time. Although imaging can be accelerated by sparse sampling and parallel imaging, achieving promising reconstruction images with a fast reconstruction speed remains a challenge. Recently, deep learning approaches have attracted a lot of attention for its encouraging reconstru… ▽ More Magnetic resonance imaging has been widely applied in clinical diagnosis, however, is limited by its long data acquisition time. Although imaging can be accelerated by sparse sampling and parallel imaging, achieving promising reconstruction images with a fast reconstruction speed remains a challenge. Recently, deep learning approaches have attracted a lot of attention for its encouraging reconstruction results but without a proper interpretability. In this letter, to enable high-quality image reconstruction for the parallel magnetic resonance imaging, we design the network structure from the perspective of sparse iterative reconstruction and enhance it with the residual structure. The experimental results of a public knee dataset show that compared with the optimization-based method and the latest deep learning parallel imaging methods, the proposed network has less error in reconstruction and is more stable under different acceleration factors. △ Less

Submitted 24 September, 2019; originally announced October 2019.

arXiv:1908.11480 [pdf, other]

doi 10.1109/JSEN.2018.2874453

A Soft Range Limited K-Nearest Neighbours Algorithm for Indoor Localization Enhancement

Authors: Minh Tu Hoang, Yizhou Zhu, Brosnan Yuen, Tyler Reese, Xiaodai Dong, Tao Lu, Robert Westendorp, Michael Xie

Abstract: This paper proposes a soft range limited K nearest neighbours (SRL-KNN) localization fingerprinting algorithm. The conventional KNN determines the neighbours of a user by calculating and ranking the fingerprint distance measured at the unknown user location and the reference locations in the database. Different from that method, SRL-KNN scales the fingerprint distance by a range factor related to… ▽ More This paper proposes a soft range limited K nearest neighbours (SRL-KNN) localization fingerprinting algorithm. The conventional KNN determines the neighbours of a user by calculating and ranking the fingerprint distance measured at the unknown user location and the reference locations in the database. Different from that method, SRL-KNN scales the fingerprint distance by a range factor related to the physical distance between the user's previous position and the reference location in the database to reduce the spatial ambiguity in localization. Although utilizing the prior locations, SRL-KNN does not require knowledge of the exact moving speed and direction of the user. Moreover, to take into account of the temporal fluctuations of the received signal strength indicator (RSSI), RSSI histogram is incorporated into the distance calculation. Actual on-site experiments demonstrate that the new algorithm achieves an average localization error of $0.66$ m with $80\%$ of the errors under $0.89$ m, which outperforms conventional KNN algorithms by $45\%$ under the same test environment. △ Less

Submitted 29 August, 2019; originally announced August 2019.

Comments: Received signal strength indicator (RSSI), WiFi indoor localization, K-nearest neighbor (KNN), fingerprint-based localization

Report number: 1558-1748

Journal ref: IEEE Sensor Journal, vol. 18, pp.10208 - 10216, Dec. 2018

arXiv:1908.01974 [pdf, other]

Omni SCADA Intrusion Detection Using Deep Learning Algorithms

Authors: Jun Gao, Luyun Gan, Fabiola Buschendorf, Liao Zhang, Hua Liu, Peixue Li, Xiaodai Dong, Tao Lu

Abstract: We investigate deep learning based omni intrusion detection system (IDS) for supervisory control and data acquisition (SCADA) networks that are capable of detecting both temporally uncorrelated and correlated attacks. Regarding the IDSs developed in this paper, a feedforward neural network (FNN) can detect temporally uncorrelated attacks at an {F$_{1}$} of {99.967${\pm}$0.005\%} but correlated att… ▽ More We investigate deep learning based omni intrusion detection system (IDS) for supervisory control and data acquisition (SCADA) networks that are capable of detecting both temporally uncorrelated and correlated attacks. Regarding the IDSs developed in this paper, a feedforward neural network (FNN) can detect temporally uncorrelated attacks at an {F$_{1}$} of {99.967${\pm}$0.005\%} but correlated attacks as low as {58${\pm}$2\%}. In contrast, long-short term memory (LSTM) detects correlated attacks at {99.56${\pm}$0.01\%} while uncorrelated attacks at {99.3${\pm}$0.1\%}. Combining LSTM and FNN through an ensemble approach further improves the IDS performance with {F$_{1}$} of {99.68${\pm}$0.04\%} regardless the temporal correlations among the data packets. △ Less

Submitted 6 August, 2019; originally announced August 2019.

arXiv:1906.10242 [pdf, other]

Multi-label Classification with Optimal Thresholding for Multi-composition Spectroscopic Analysis

Authors: Luyun Gan, Brosnan Yuen, Tao Lu

Abstract: In this paper, we implement multi-label neural networks with optimal thresholding to identify gas species among a multi gas mixture in a cluttered environment. Using infrared absorption spectroscopy and tested on synthesized spectral datasets, our approach outperforms conventional binary relevance - partial least squares discriminant analysis when signal-to-noise ratio and training sample size are… ▽ More In this paper, we implement multi-label neural networks with optimal thresholding to identify gas species among a multi gas mixture in a cluttered environment. Using infrared absorption spectroscopy and tested on synthesized spectral datasets, our approach outperforms conventional binary relevance - partial least squares discriminant analysis when signal-to-noise ratio and training sample size are sufficient. △ Less

Submitted 24 June, 2019; originally announced June 2019.

Comments: 8 pages, 7 figures

arXiv:1904.11620 [pdf]

Improved visible to IR image transformation using synthetic data augmentation with cycle-consistent adversarial networks

Authors: Kyongsik Yun, Kevin Yu, Joseph Osborne, Sarah Eldin, Luan Nguyen, Alexander Huyen, Thomas Lu

Abstract: Infrared (IR) images are essential to improve the visibility of dark or camouflaged objects. Object recognition and segmentation based on a neural network using IR images provide more accuracy and insight than color visible images. But the bottleneck is the amount of relevant IR images for training. It is difficult to collect real-world IR images for special purposes, including space exploration,… ▽ More Infrared (IR) images are essential to improve the visibility of dark or camouflaged objects. Object recognition and segmentation based on a neural network using IR images provide more accuracy and insight than color visible images. But the bottleneck is the amount of relevant IR images for training. It is difficult to collect real-world IR images for special purposes, including space exploration, military and fire-fighting applications. To solve this problem, we created color visible and IR images using a Unity-based 3D game editor. These synthetically generated color visible and IR images were used to train cycle consistent adversarial networks (CycleGAN) to convert visible images to IR images. CycleGAN has the advantage that it does not require precisely matching visible and IR pairs for transformation training. In this study, we discovered that additional synthetic data can help improve CycleGAN performance. Neural network training using real data (N = 20) performed more accurate transformations than training using real (N = 10) and synthetic (N = 10) data combinations. The result indicates that the synthetic data cannot exceed the quality of the real data. Neural network training using real (N = 10) and synthetic (N = 100) data combinations showed almost the same performance as training using real data (N = 20). At least 10 times more synthetic data than real data is required to achieve the same performance. In summary, CycleGAN is used with synthetic data to improve the IR image conversion performance of visible images. △ Less

Submitted 25 April, 2019; originally announced April 2019.

Comments: 8 pages, 6 figures, SPIE

arXiv:1903.11703 [pdf, other]

doi 10.1109/JIOT.2019.2940368

Recurrent Neural Networks For Accurate RSSI Indoor Localization

Authors: Minh Tu Hoang, Brosnan Yuen, Xiaodai Dong, Tao Lu, Robert Westendorp, Kishore Reddy

Abstract: This paper proposes recurrent neuron networks (RNNs) for a fingerprinting indoor localization using WiFi. Instead of locating user's position one at a time as in the cases of conventional algorithms, our RNN solution aims at trajectory positioning and takes into account the relation among the received signal strength indicator (RSSI) measurements in a trajectory. Furthermore, a weighted average fi… ▽ More This paper proposes recurrent neuron networks (RNNs) for a fingerprinting indoor localization using WiFi. Instead of locating user's position one at a time as in the cases of conventional algorithms, our RNN solution aims at trajectory positioning and takes into account the relation among the received signal strength indicator (RSSI) measurements in a trajectory. Furthermore, a weighted average filter is proposed for both input RSSI data and sequential output locations to enhance the accuracy among the temporal fluctuations of RSSI. The results using different types of RNN including vanilla RNN, long short-term memory (LSTM), gated recurrent unit (GRU) and bidirectional LSTM (BiLSTM) are presented. On-site experiments demonstrate that the proposed structure achieves an average localization error of $0.75$ m with $80\%$ of the errors under $1$ m, which outperforms the conventional KNN algorithms and probabilistic algorithms by approximately $30\%$ under the same test environment. △ Less

Submitted 22 October, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

Comments: Received signal strength indicator (RSSI), WiFi indoor localization, recurrent neuron network (RNN), long shortterm memory (LSTM), fingerprint-based localization

Report number: 2327-4662

Journal ref: IEEE Internet of Things Journal Volume 6, Issue 6 (2019) 10639 - 10651

arXiv:1902.02627 [pdf, other]

Fast Transient Simulation of High-Speed Channels Using Recurrent Neural Network

Authors: Thong Nguyen, Tianjian Lu, Ken Wu, Jose Schutt-Aine

Abstract: Generating eye diagrams by using a circuit simulator can be very computationally intensive, especially in the presence of nonlinearities. It often involves multiple Newton-like iterations at every time step when a SPICE-like circuit simulator handles a nonlinear system in the transient regime. In this paper, we leverage machine learning methods, to be specific, the recurrent neural network (RNN),… ▽ More Generating eye diagrams by using a circuit simulator can be very computationally intensive, especially in the presence of nonlinearities. It often involves multiple Newton-like iterations at every time step when a SPICE-like circuit simulator handles a nonlinear system in the transient regime. In this paper, we leverage machine learning methods, to be specific, the recurrent neural network (RNN), to generate black-box macromodels and achieve significant reduction of computation time. Through the proposed approach, an RNN model is first trained and then validated on a relatively short sequence generated from a circuit simulator. Once the training completes, the RNN can be used to make predictions on the remaining sequence in order to generate an eye diagram. The training cost can also be amortized when the trained RNN starts making predictions. Besides, the proposed approach requires no complex circuit simulations nor substantial domain knowledge. We use two high-speed link examples to demonstrate that the proposed approach provides adequate accuracy while the computation time can be dramatically reduced. In the high-speed link example with a PAM4 driver, the eye diagram generated by RNN models shows good agreement with that obtained from a commercial circuit simulator. This paper also investigates the impacts of various RNN topologies, training schemes, and tunable parameters on both the accuracy and the generalization capability of an RNN model. It is found out that the long short-term memory (LSTM) network outperforms the vanilla RNN in terms of the accuracy in predicting transient waveforms. △ Less

Submitted 7 February, 2019; v1 submitted 25 January, 2019; originally announced February 2019.

arXiv:1805.01534 [pdf, other]

Distributed and Multi-layer UAV Network for the Next-generation Wireless Communication

Authors: Yiming Huo, Xiaodai Dong, Tao Lu, Wei Xu, Marvin Yuen

Abstract: Unmanned aerial vehicles (UAVs) for wireless communications has rapidly grown into a research hotspot as the mass production of high-performance, low-cost, intelligent UAVs become more practical and feasible. In the meantime, fifth generation (5G) wireless communications is being standardized and planned for deployment globally. During this process, UAVs are gradually being considered as an import… ▽ More Unmanned aerial vehicles (UAVs) for wireless communications has rapidly grown into a research hotspot as the mass production of high-performance, low-cost, intelligent UAVs become more practical and feasible. In the meantime, fifth generation (5G) wireless communications is being standardized and planned for deployment globally. During this process, UAVs are gradually being considered as an important part of 5G and expected to play a critical role in enabling more functional diversity for 5G communications. In this article, we conduct an in-depth investigation of mainstream UAV designs and state-of-the-art UAV enabled wireless communication systems.We propose a hierarchical architecture of UAVs with multi-layer and distributed features to facilitate a smooth integration of different mainstream UAVs into the next-generation wireless communication networks. Furthermore, we unveil the critical comprehensive design tradeoffs, in light of both communication and aerodynamic principles. Empirical models and satellite measurement data are used to conduct numerical analysis of the meteorological impacts of UAV enabled, 5G high bands communications. △ Less

Submitted 3 May, 2018; originally announced May 2018.

Comments: 13 pages, 5 figures, 1 table. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:1711.11165 [pdf, other]

Safe Exploration for Identifying Linear Systems via Robust Optimization

Authors: Tyler Lu, Martin Zinkevich, Craig Boutilier, Binz Roy, Dale Schuurmans

Abstract: Safely exploring an unknown dynamical system is critical to the deployment of reinforcement learning (RL) in physical systems where failures may have catastrophic consequences. In scenarios where one knows little about the dynamics, diverse transition data covering relevant regions of state-action space is needed to apply either model-based or model-free RL. Motivated by the cooling of Google's da… ▽ More Safely exploring an unknown dynamical system is critical to the deployment of reinforcement learning (RL) in physical systems where failures may have catastrophic consequences. In scenarios where one knows little about the dynamics, diverse transition data covering relevant regions of state-action space is needed to apply either model-based or model-free RL. Motivated by the cooling of Google's data centers, we study how one can safely identify the parameters of a system model with a desired accuracy and confidence level. In particular, we focus on learning an unknown linear system with Gaussian noise assuming only that, initially, a nominal safe action is known. Define safety as satisfying specific linear constraints on the state space (e.g., requirements on process variable) that must hold over the span of an entire trajectory, and given a Probably Approximately Correct (PAC) style bound on the estimation error of model parameters, we show how to compute safe regions of action space by gradually growing a ball around the nominal safe action. One can apply any exploration strategy where actions are chosen from such safe regions. Experiments on a stylized model of data center cooling dynamics show how computing proper safe regions can increase the sample efficiency of safe exploration. △ Less

Submitted 29 November, 2017; originally announced November 2017.

Showing 1–36 of 36 results for author: Lu, T