-
A Successive Gap Constraint Linearization Method for Optimal Control Problems with Equilibrium Constraints
Authors:
Kangyu Lin,
Toshiyuki Ohtsuka
Abstract:
In this study, we propose a novel gap-constraint-based reformulation for optimal control problems with equilibrium constraints (OCPECs). We show that the proposed reformulation generates a new constraint system equivalent to the original one but more concise and with favorable differentiability. Moreover, constraint regularity can be recovered by a relaxation strategy. We show that the gap constra…
▽ More
In this study, we propose a novel gap-constraint-based reformulation for optimal control problems with equilibrium constraints (OCPECs). We show that the proposed reformulation generates a new constraint system equivalent to the original one but more concise and with favorable differentiability. Moreover, constraint regularity can be recovered by a relaxation strategy. We show that the gap constraint and its gradient can be evaluated efficiently. We then propose a successive gap constraint linearization method to solve the discretized OCPEC. We also provide an intuitive geometric interpretation of the gap constraint. Numerical experiments validate the effectiveness of the proposed reformulation and solution method.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Diffusion and Multi-Domain Adaptation Methods for Eosinophil Segmentation
Authors:
Kevin Lin,
Donald Brown,
Sana Syed,
Adam Greene
Abstract:
Eosinophilic Esophagitis (EoE) represents a challenging condition for medical providers today. The cause is currently unknown, the impact on a patient's daily life is significant, and it is increasing in prevalence. Traditional approaches for medical image diagnosis such as standard deep learning algorithms are limited by the relatively small amount of data and difficulty in generalization. As a r…
▽ More
Eosinophilic Esophagitis (EoE) represents a challenging condition for medical providers today. The cause is currently unknown, the impact on a patient's daily life is significant, and it is increasing in prevalence. Traditional approaches for medical image diagnosis such as standard deep learning algorithms are limited by the relatively small amount of data and difficulty in generalization. As a response, two methods have arisen that seem to perform well: Diffusion and Multi-Domain methods with current research efforts favoring diffusion methods. For the EoE dataset, we discovered that a Multi-Domain Adversarial Network outperformed a Diffusion based method with a FID of 42.56 compared to 50.65. Future work with diffusion methods should include a comparison with Multi-Domain adaptation methods to ensure that the best performance is achieved.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Robustness of Deep Learning for Accelerated MRI: Benefits of Diverse Training Data
Authors:
Kang Lin,
Reinhard Heckel
Abstract:
Deep learning based methods for image reconstruction are state-of-the-art for a variety of imaging tasks. However, neural networks often perform worse if the training data differs significantly from the data they are applied to. For example, a network trained for accelerated magnetic resonance imaging (MRI) on one scanner performs worse on another scanner. In this work, we investigate the impact o…
▽ More
Deep learning based methods for image reconstruction are state-of-the-art for a variety of imaging tasks. However, neural networks often perform worse if the training data differs significantly from the data they are applied to. For example, a network trained for accelerated magnetic resonance imaging (MRI) on one scanner performs worse on another scanner. In this work, we investigate the impact of the training data on the model's performance and robustness for accelerated MRI. We find that models trained on the combination of various data distributions, such as those obtained from different MRI scanners and anatomies, exhibit robustness equal or superior to models trained on the best single distribution for a specific target distribution. Thus training on diverse data tends to improve robustness. Furthermore, training on diverse data does not compromise in-distribution performance, i.e., a model trained on diverse data yields in-distribution performance at least as good as models trained on the more narrow individual distributions. Our results suggest that training a model for imaging on a variety of distributions tends to yield a more effective and robust model than maintaining separate models for individual distributions.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Uncertainty Quantification for Eosinophil Segmentation
Authors:
Kevin Lin,
Donald Brown,
Sana Syed,
Adam Greene
Abstract:
Eosinophilic Esophagitis (EoE) is an allergic condition increasing in prevalence. To diagnose EoE, pathologists must find 15 or more eosinophils within a single high-power field (400X magnification). Determining whether or not a patient has EoE can be an arduous process and any medical imaging approaches used to assist diagnosis must consider both efficiency and precision. We propose an improvemen…
▽ More
Eosinophilic Esophagitis (EoE) is an allergic condition increasing in prevalence. To diagnose EoE, pathologists must find 15 or more eosinophils within a single high-power field (400X magnification). Determining whether or not a patient has EoE can be an arduous process and any medical imaging approaches used to assist diagnosis must consider both efficiency and precision. We propose an improvement of Adorno et al's approach for quantifying eosinphils using deep image segmentation. Our new approach leverages Monte Carlo Dropout, a common approach in deep learning to reduce overfitting, to provide uncertainty quantification on current deep learning models. The uncertainty can be visualized in an output image to evaluate model performance, provide insight to how deep learning algorithms function, and assist pathologists in identifying eosinophils.
△ Less
Submitted 7 November, 2023; v1 submitted 28 September, 2023;
originally announced September 2023.
-
MPAI-EEV: Standardization Efforts of Artificial Intelligence based End-to-End Video Coding
Authors:
Chuanmin Jia,
Feng Ye,
Fanke Dong,
Kai Lin,
Leonardo Chiariglione,
Siwei Ma,
Huifang Sun,
Wen Gao
Abstract:
The rapid advancement of artificial intelligence (AI) technology has led to the prioritization of standardizing the processing, coding, and transmission of video using neural networks. To address this priority area, the Moving Picture, Audio, and Data Coding by Artificial Intelligence (MPAI) group is develo** a suite of standards called MPAI-EEV for "end-to-end optimized neural video coding." Th…
▽ More
The rapid advancement of artificial intelligence (AI) technology has led to the prioritization of standardizing the processing, coding, and transmission of video using neural networks. To address this priority area, the Moving Picture, Audio, and Data Coding by Artificial Intelligence (MPAI) group is develo** a suite of standards called MPAI-EEV for "end-to-end optimized neural video coding." The aim of this AI-based video standard project is to compress the number of bits required to represent high-fidelity video data by utilizing data-trained neural coding technologies. This approach is not constrained by how data coding has traditionally been applied in the context of a hybrid framework. This paper presents an overview of recent and ongoing standardization efforts in this area and highlights the key technologies and design philosophy of EEV. It also provides a comparison and report on some primary efforts such as the coding efficiency of the reference model. Additionally, it discusses emerging activities such as learned Unmanned-Aerial-Vehicles (UAVs) video coding which are currently planned, under development, or in the exploration phase. With a focus on UAV video signals, this paper addresses the current status of these preliminary efforts. It also indicates development timelines, summarizes the main technical details, and provides pointers to further points of reference. The exploration experiment shows that the EEV model performs better than the state-of-the-art video coding standard H.266/VVC in terms of perceptual evaluation metric.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Interference-Aware Deployment for Maximizing User Satisfaction in Multi-UAV Wireless Networks
Authors:
Chuan-Chi Lai,
Ang-Hsun Tsai,
Chia-Wei Ting,
Ko-Han Lin,
**g-Chi Ling,
Chia-En Tsai
Abstract:
In this letter, we study the deployment of Unmanned Aerial Vehicle mounted Base Stations (UAV-BSs) in multi-UAV cellular networks. We model the multi-UAV deployment problem as a user satisfaction maximization problem, that is, maximizing the proportion of served ground users (GUs) that meet a given minimum data rate requirement. We propose an interference-aware deployment (IAD) algorithm for servi…
▽ More
In this letter, we study the deployment of Unmanned Aerial Vehicle mounted Base Stations (UAV-BSs) in multi-UAV cellular networks. We model the multi-UAV deployment problem as a user satisfaction maximization problem, that is, maximizing the proportion of served ground users (GUs) that meet a given minimum data rate requirement. We propose an interference-aware deployment (IAD) algorithm for serving arbitrarily distributed outdoor GUs. The proposed algorithm can alleviate the problem of overlap** coverage between adjacent UAV-BSs to minimize inter-cell interference. Therefore, reducing co-channel interference between UAV-BSs will improve user satisfaction and ensure that most GUs can achieve the minimum data rate requirement. Simulation results show that our proposed IAD outperforms comparative methods by more than 10% in user satisfaction in high-density environments.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
A Non-Interior-Point Continuation Method for the Optimal Control Problem with Equilibrium Constraints
Authors:
Kangyu Lin,
Toshiyuki Ohtsuka
Abstract:
In this study, we focus on the numerical solution method for the optimal control problem with equilibrium constraints (OCPEC).It is extremely challenging to solve OCPEC owing to the absence of constraint regularity and strictly feasible interior points. To solve OCPEC efficiently, we first relax the discretized OCPEC to recover the constraint regularity and then map its Karush--Kuhn--Tucker (KKT)…
▽ More
In this study, we focus on the numerical solution method for the optimal control problem with equilibrium constraints (OCPEC).It is extremely challenging to solve OCPEC owing to the absence of constraint regularity and strictly feasible interior points. To solve OCPEC efficiently, we first relax the discretized OCPEC to recover the constraint regularity and then map its Karush--Kuhn--Tucker (KKT) conditions into a perturbed system of equations. Subsequently, we propose a novel two-stage solution method, called the non-interior-point continuation method, to solve the perturbed system. In the first stage, a non-interior-point method, which solves the perturbed system using the Newton method and globalizes convergence using a dedicated merit function, is employed. In the second stage, a predictor-corrector continuation method is utilized to track the solution trajectory as a function of the perturbed parameter, starting at the solution obtained in the first stage. The proposed method regularizes the KKT matrix and does not enforce iterates to remain in the feasible interior, which mitigates the numerical difficulties of solving OCPEC. Convergence properties are analyzed under certain assumptions. Numerical experiments demonstrate that the proposed method can accurately track the solution trajectory while demanding significantly less computation time compared to the interior-point method.
△ Less
Submitted 27 May, 2024; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Image-to-Image MLP-mixer for Image Reconstruction
Authors:
Youssef Mansour,
Kang Lin,
Reinhard Heckel
Abstract:
Neural networks are highly effective tools for image reconstruction problems such as denoising and compressive sensing. To date, neural networks for image reconstruction are almost exclusively convolutional. The most popular architecture is the U-Net, a convolutional network with a multi-resolution architecture. In this work, we show that a simple network based on the multi-layer perceptron (MLP)-…
▽ More
Neural networks are highly effective tools for image reconstruction problems such as denoising and compressive sensing. To date, neural networks for image reconstruction are almost exclusively convolutional. The most popular architecture is the U-Net, a convolutional network with a multi-resolution architecture. In this work, we show that a simple network based on the multi-layer perceptron (MLP)-mixer enables state-of-the art image reconstruction performance without convolutions and without a multi-resolution architecture, provided that the training set and the size of the network are moderately large. Similar to the original MLP-mixer, the image-to-image MLP-mixer is based exclusively on MLPs operating on linearly-transformed image patches. Contrary to the original MLP-mixer, we incorporate structure by retaining the relative positions of the image patches. This imposes an inductive bias towards natural images which enables the image-to-image MLP-mixer to learn to denoise images based on fewer examples than the original MLP-mixer. Moreover, the image-to-image MLP-mixer requires fewer parameters to achieve the same denoising performance than the U-Net and its parameters scale linearly in the image resolution instead of quadratically as for the original MLP-mixer. If trained on a moderate amount of examples for denoising, the image-to-image MLP-mixer outperforms the U-Net by a slight margin. It also outperforms the vision transformer tailored for image reconstruction and classical un-trained methods such as BM3D, making it a very effective tool for image reconstruction problems.
△ Less
Submitted 4 February, 2022;
originally announced February 2022.
-
Complementary Fourier single-pixel imaging
Authors:
Dong Zhou,
Jie Cao,
Huan Cui,
Qun Hao,
Bing-Kun Chen,
Kai Lin
Abstract:
Single-pixel imaging, with the advantages of a wide spectrum, beyond-visual-field imaging, and robustness to light scattering, has attracted increasing attention in recent years. Fourier single-pixel imaging (FSI) can reconstruct sharp images under sub-Nyquist sampling. However, the conventional FSI has difficulty with balancing the imaging quality and efficiency. To overcome this issue, we propos…
▽ More
Single-pixel imaging, with the advantages of a wide spectrum, beyond-visual-field imaging, and robustness to light scattering, has attracted increasing attention in recent years. Fourier single-pixel imaging (FSI) can reconstruct sharp images under sub-Nyquist sampling. However, the conventional FSI has difficulty with balancing the imaging quality and efficiency. To overcome this issue, we proposed a novel approach called complementary Fourier single-pixel imaging (CFSI) to reduce measurements while retaining its robustness. The complementary nature of Fourier patterns based on a four-step phase-shift algorithm is combined with the complementary nature of a digital micromirror device. CFSI only requires two phase-shifted patterns to obtain one Fourier spectral value. Four light intensity values are obtained by load the two patterns, and the spectral value is calculated through differential measurement, which has good robustness to noise. The proposed method is verified by simulations and experiments compared with FSI based on two-, three-, and four-step phase shift algorithms. CFSI performed better than the other methods under the condition that the best imaging quality of CFSI is not reached. The reported technique provides an alternative approach to realize real-time and high-quality imaging.
△ Less
Submitted 2 August, 2021;
originally announced August 2021.
-
Analysis of bio-electro-chemical signals from passive sweat-based wearable electro-impedance spectroscopy (EIS) towards assessing blood glucose modulations
Authors:
Devangsingh Sankhala,
Madhavi Pali,
Kai-Chun Lin,
Badrinath Jagannath,
Sriram Muthukumar,
Shalini Prasad
Abstract:
There has been a recent tremendous interest in label-free detection of biomarkers which is a critical enabler of point-of-need diagnostics. A low-power, small form factor, multiplexed wearable system is proposed for continuous detection of glucose in passively expressed sweat using electrochemical impedance spectroscopy (EIS) measurement. The wearable EIS system consists of a sensing analog front…
▽ More
There has been a recent tremendous interest in label-free detection of biomarkers which is a critical enabler of point-of-need diagnostics. A low-power, small form factor, multiplexed wearable system is proposed for continuous detection of glucose in passively expressed sweat using electrochemical impedance spectroscopy (EIS) measurement. The wearable EIS system consists of a sensing analog front end integrated with low-volume (1-5 $μ$L) ultra-sensitive flexible biosensors. A passive sweat sensor was designed to integrate a glucose oxidase electrochemical system on active semiconducting material. The non-faradaic EIS response of the biosensor was used to calibrate the analog front end response using ratiometric Discrete Fourier Transform (DFT) for a shorter measurement time. In this work, a stringent assessment of a continuous glucose sensing platform is performed in a bottom-up approach, going from the biosensor to the system to the interaction with a human subject. The active semiconductor-based biosensors are dosed with glucose concentrations ranging from 5-200 mg/dL and detection is performed using the analog front end. In addition, a detailed analysis of battery life and performance of a wearable EIS system is discussed to define a figure of merit for an optimally integrated design. Moreover, a continuous glucose detection test is performed on a healthy human subject cohort to investigate the stability of the sensor-system mechanism for an 8-hour period, and a time-series-based, auto-regressive (AR) model was created for the system.
△ Less
Submitted 5 April, 2021;
originally announced April 2021.
-
PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control
Authors:
Dong Chen,
Kaian Chen. Zhaojian Li,
Tianshu Chu,
Rui Yao,
Feng Qiu,
Kaixiang Lin
Abstract:
This paper develops an efficient multi-agent deep reinforcement learning algorithm for cooperative controls in powergrids. Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generators (DGs), which is first formulated as a cooperative multi-agent reinforcement learning (MARL) problem. We then propose a novel on-policy MARL algorithm, PowerNe…
▽ More
This paper develops an efficient multi-agent deep reinforcement learning algorithm for cooperative controls in powergrids. Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generators (DGs), which is first formulated as a cooperative multi-agent reinforcement learning (MARL) problem. We then propose a novel on-policy MARL algorithm, PowerNet, in which each agent (DG) learns a control policy based on (sub-)global reward but local states from its neighboring agents. Motivated by the fact that a local control from one agent has limited impact on agents distant from it, we exploit a novel spatial discount factor to reduce the effect from remote agents, to expedite the training process and improve scalability. Furthermore, a differentiable, learning-based communication protocol is employed to foster the collaborations among neighboring agents. In addition, to mitigate the effects of system uncertainty and random noise introduced during on-policy learning, we utilize an action smoothing factor to stabilize the policy execution. To facilitate training and evaluation, we develop PGSim, an efficient, high-fidelity powergrid simulation platform. Experimental results in two microgrid setups show that the developed PowerNet outperforms a conventional model-based control, as well as several state-of-the-art MARL algorithms. The decentralized learning scheme and high sample efficiency also make it viable to large-scale power grids.
△ Less
Submitted 31 July, 2021; v1 submitted 24 November, 2020;
originally announced November 2020.
-
A Control Strategy for Capacity Allocation of Hybrid Energy Storage System Based on Hierarchical Processing of Demand Power
Authors:
Kai Lin
Abstract:
Pursuing optimal power distribution in hybrid energy storage systems has always been the goal of researchers. Here, HESS is a combination of lithium battery and supercapacitor; this combination has been proven to effectively compensate for some of the deficiencies of lithium batteries as an energy system for electric vehicles. For example, the energy storage system with only lithium batteries cann…
▽ More
Pursuing optimal power distribution in hybrid energy storage systems has always been the goal of researchers. Here, HESS is a combination of lithium battery and supercapacitor; this combination has been proven to effectively compensate for some of the deficiencies of lithium batteries as an energy system for electric vehicles. For example, the energy storage system with only lithium batteries cannot provide high power in a short time to meet the high acceleration performance of electric vehicles, and the excessive discharge current will cause the temperature of the battery pack to be too high, which will cause safety problems for the car. This paper proposes an intelligent energy management strategy combining fuzzy controller and improved Savitzky-Golay filter for real-time control. The simulation results show that compared with single ESS, the maximum current of the battery proposed by the strategy is reduced by 14.60%, and the usable cycle life of the battery is increased by 57.31% during the test driving cycle. Meanwhile, it explores various changes brought supercapacitor monomers in the same HESS, and predict the next supercapacitor will bring about 31.58% reduction of volume and mass.
△ Less
Submitted 10 November, 2020; v1 submitted 3 October, 2020;
originally announced October 2020.
-
Multi-modal Feature Fusion with Feature Attention for VATEX Captioning Challenge 2020
Authors:
Ke Lin,
Zhuoxin Gan,
Liwei Wang
Abstract:
This report describes our model for VATEX Captioning Challenge 2020. First, to gather information from multiple domains, we extract motion, appearance, semantic and audio features. Then we design a feature attention module to attend on different feature when decoding. We apply two types of decoders, top-down and X-LAN and ensemble these models to get the final result. The proposed method outperfor…
▽ More
This report describes our model for VATEX Captioning Challenge 2020. First, to gather information from multiple domains, we extract motion, appearance, semantic and audio features. Then we design a feature attention module to attend on different feature when decoding. We apply two types of decoders, top-down and X-LAN and ensemble these models to get the final result. The proposed method outperforms official baseline with a significant gap. We achieve 76.0 CIDEr and 50.0 CIDEr on English and Chinese private test set. We rank 2nd on both English and Chinese private test leaderboard.
△ Less
Submitted 5 June, 2020;
originally announced June 2020.
-
eCNN: A Block-Based and Highly-Parallel CNN Accelerator for Edge Inference
Authors:
Chao-Tsung Huang,
Yu-Chun Ding,
Huan-Ching Wang,
Chi-Wen Weng,
Kai-** Lin,
Li-Wei Wang,
Li-De Chen
Abstract:
Convolutional neural networks (CNNs) have recently demonstrated superior quality for computational imaging applications. Therefore, they have great potential to revolutionize the image pipelines on cameras and displays. However, it is difficult for conventional CNN accelerators to support ultra-high-resolution videos at the edge due to their considerable DRAM bandwidth and power consumption. There…
▽ More
Convolutional neural networks (CNNs) have recently demonstrated superior quality for computational imaging applications. Therefore, they have great potential to revolutionize the image pipelines on cameras and displays. However, it is difficult for conventional CNN accelerators to support ultra-high-resolution videos at the edge due to their considerable DRAM bandwidth and power consumption. Therefore, finding a further memory- and computation-efficient microarchitecture is crucial to speed up this coming revolution.
In this paper, we approach this goal by considering the inference flow, network model, instruction set, and processor design jointly to optimize hardware performance and image quality. We apply a block-based inference flow which can eliminate all the DRAM bandwidth for feature maps and accordingly propose a hardware-oriented network model, ERNet, to optimize image quality based on hardware constraints. Then we devise a coarse-grained instruction set architecture, FBISA, to support power-hungry convolution by massive parallelism. Finally,we implement an embedded processor---eCNN---which accommodates to ERNet and FBISA with a flexible processing architecture. Layout results show that it can support high-quality ERNets for super-resolution and denoising at up to 4K Ultra-HD 30 fps while using only DDR-400 and consuming 6.94W on average. By comparison, the state-of-the-art Diffy uses dual-channel DDR3-2133 and consumes 54.3W to support lower-quality VDSR at Full HD 30 fps. Lastly, we will also present application examples of high-performance style transfer and object recognition to demonstrate the flexibility of eCNN.
△ Less
Submitted 12 October, 2019;
originally announced October 2019.
-
Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross Entropy
Authors:
Kin Wah Edward Lin,
Balamurali B. T.,
Enyan Koh,
Simon Lui,
Dorien Herremans
Abstract:
Separating a singing voice from its music accompaniment remains an important challenge in the field of music information retrieval. We present a unique neural network approach inspired by a technique that has revolutionized the field of vision: pixel-wise image classification, which we combine with cross entropy loss and pretraining of the CNN as an autoencoder on singing voice spectrograms. The p…
▽ More
Separating a singing voice from its music accompaniment remains an important challenge in the field of music information retrieval. We present a unique neural network approach inspired by a technique that has revolutionized the field of vision: pixel-wise image classification, which we combine with cross entropy loss and pretraining of the CNN as an autoencoder on singing voice spectrograms. The pixel-wise classification technique directly estimates the sound source label for each time-frequency (T-F) bin in our spectrogram image, thus eliminating common pre- and postprocessing tasks. The proposed network is trained by using the Ideal Binary Mask (IBM) as the target output label. The IBM identifies the dominant sound source in each T-F bin of the magnitude spectrogram of a mixture signal, by considering each T-F bin as a pixel with a multi-label (for each sound source). Cross entropy is used as the training objective, so as to minimize the average probability error between the target and predicted label for each pixel. By treating the singing voice separation problem as a pixel-wise classification task, we additionally eliminate one of the commonly used, yet not easy to comprehend, postprocessing steps: the Wiener filter postprocessing.
The proposed CNN outperforms the first runner up in the Music Information Retrieval Evaluation eXchange (MIREX) 2016 and the winner of MIREX 2014 with a gain of 2.2702 ~ 5.9563 dB global normalized source to distortion ratio (GNSDR) when applied to the iKala dataset. An experiment with the DSD100 dataset on the full-tracks song evaluation task also shows that our model is able to compete with cutting-edge singing voice separation systems which use multi-channel modeling, data augmentation, and model blending.
△ Less
Submitted 4 December, 2018;
originally announced December 2018.