-
Data-Efficient Unsupervised Interpolation Without Any Intermediate Frame for 4D Medical Images
Authors:
JungEun Kim,
Hangyul Yoon,
Geondo Park,
Kyungsu Kim,
Eunho Yang
Abstract:
4D medical images, which represent 3D images with temporal information, are crucial in clinical practice for capturing dynamic changes and monitoring long-term disease progression. However, acquiring 4D medical images poses challenges due to factors such as radiation exposure and imaging duration, necessitating a balance between achieving high temporal resolution and minimizing adverse effects. Gi…
▽ More
4D medical images, which represent 3D images with temporal information, are crucial in clinical practice for capturing dynamic changes and monitoring long-term disease progression. However, acquiring 4D medical images poses challenges due to factors such as radiation exposure and imaging duration, necessitating a balance between achieving high temporal resolution and minimizing adverse effects. Given these circumstances, not only is data acquisition challenging, but increasing the frame rate for each dataset also proves difficult. To address this challenge, this paper proposes a simple yet effective Unsupervised Volumetric Interpolation framework, UVI-Net. This framework facilitates temporal interpolation without the need for any intermediate frames, distinguishing it from the majority of other existing unsupervised methods. Experiments on benchmark datasets demonstrate significant improvements across diverse evaluation metrics compared to unsupervised and supervised baselines. Remarkably, our approach achieves this superior performance even when trained with a dataset as small as one, highlighting its exceptional robustness and efficiency in scenarios with sparse supervision. This positions UVI-Net as a compelling alternative for 4D medical imaging, particularly in settings where data availability is limited. The source code is available at https://github.com/jungeun122333/UVI-Net.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Optimal Impact Angle Guidance via First-Order Optimization under Nonconvex Constraints
Authors:
Gyubin Park,
Jiwoo Choi,
Da Hoon Jeong,
Jong-Han Kim
Abstract:
Most of the optimal guidance problems can be formulated as nonconvex optimization problems, which can be solved indirectly by relaxation, convexification, or linearization. Although these methods are guaranteed to converge to the global optimum of the modified problems, the obtained solution may not guarantee global optimality or even the feasibility of the original nonconvex problems. In this pap…
▽ More
Most of the optimal guidance problems can be formulated as nonconvex optimization problems, which can be solved indirectly by relaxation, convexification, or linearization. Although these methods are guaranteed to converge to the global optimum of the modified problems, the obtained solution may not guarantee global optimality or even the feasibility of the original nonconvex problems. In this paper, we propose a computational optimal guidance approach that directly handles the nonconvex constraints encountered in formulating the guidance problems. The proposed computational guidance approach alternately solves the least squares problems and projects the solution onto nonconvex feasible sets, which rapidly converges to feasible suboptimal solutions or sometimes to the globally optimal solutions. The proposed algorithm is verified via a series of numerical simulations on impact angle guidance problems under state dependent maneuver vector constraints, and it is demonstrated that the proposed algorithm provides superior guidance performance than conventional techniques.
△ Less
Submitted 17 March, 2024; v1 submitted 30 September, 2023;
originally announced October 2023.
-
RADIO: Reference-Agnostic Dubbing Video Synthesis
Authors:
Dongyeun Lee,
Chaewon Kim,
Sangjoon Yu,
Jaejun Yoo,
Gyeong-Moon Park
Abstract:
One of the most challenging problems in audio-driven talking head generation is achieving high-fidelity detail while ensuring precise synchronization. Given only a single reference image, extracting meaningful identity attributes becomes even more challenging, often causing the network to mirror the facial and lip structures too closely. To address these issues, we introduce RADIO, a framework eng…
▽ More
One of the most challenging problems in audio-driven talking head generation is achieving high-fidelity detail while ensuring precise synchronization. Given only a single reference image, extracting meaningful identity attributes becomes even more challenging, often causing the network to mirror the facial and lip structures too closely. To address these issues, we introduce RADIO, a framework engineered to yield high-quality dubbed videos regardless of the pose or expression in reference images. The key is to modulate the decoder layers using latent space composed of audio and reference features. Additionally, we incorporate ViT blocks into the decoder to emphasize high-fidelity details, especially in the lip region. Our experimental results demonstrate that RADIO displays high synchronization without the loss of fidelity. Especially in harsh scenarios where the reference frame deviates significantly from the ground truth, our method outperforms state-of-the-art methods, highlighting its robustness.
△ Less
Submitted 6 November, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
On Correcting Errors in Existing Mathematical Approaches for UAV Trajectory Design Considering No-Fly-Zones
Authors:
Kanghyun Heo,
Gitae Park,
Kisong Lee
Abstract:
Motivated by the fact that current mathematical methods for the trajectory design of an unmanned aerial vehicle (UAV) considering no-fly-zones (NFZs) cannot perfectly avoid NFZs throughout the entire continuous trajectory, this study introduces a new constraint that ensures the complete avoidance of NFZs. Moreover, we provide mathematical proof demonstrating that a UAV operating within the propose…
▽ More
Motivated by the fact that current mathematical methods for the trajectory design of an unmanned aerial vehicle (UAV) considering no-fly-zones (NFZs) cannot perfectly avoid NFZs throughout the entire continuous trajectory, this study introduces a new constraint that ensures the complete avoidance of NFZs. Moreover, we provide mathematical proof demonstrating that a UAV operating within the proposed constraints will never violate NFZs. Under the proposed constraint on NFZs, we aim to optimize the scheduling, transmit power, length of the time slot, and the trajectory of the UAV to maximize the minimum throughput among ground nodes without violating NFZs. To find the optimal UAV strategy from the non-convex optimization problem formulated here, we use various optimization techniques, in this case quadratic transform, successive convex approximation, and the block coordinate descent algorithm. Simulation results confirm that the proposed constraint prevents NFZs from being violated over the entire trajectory in any scenario. Furthermore, the proposed scheme shows significantly higher throughput than the baseline scheme using the traditional NFZ constraint by achieving a zero outage probability due to NFZ violations.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
Input-Output Feedback Linearization Preserving Task Priority for Multivariate Nonlinear Systems Having Singular Input Gain Matrix
Authors:
Sang-ik An,
Dongheui Lee,
Gyunghoon Park
Abstract:
We propose an extension of the input-output feedback linearization for a class of multivariate systems that are not input-output linearizable in a classical manner. The key observation is that the usual input-output linearization problem can be interpreted as the problem of solving simultaneous linear equations associated with the input gain matrix: thus, even at points where the input gain matrix…
▽ More
We propose an extension of the input-output feedback linearization for a class of multivariate systems that are not input-output linearizable in a classical manner. The key observation is that the usual input-output linearization problem can be interpreted as the problem of solving simultaneous linear equations associated with the input gain matrix: thus, even at points where the input gain matrix becomes singular, it is still possible to solve a part of linear equations, by which a subset of input-output relations is made linear or close to be linear. Based on this observation, we adopt the task priority-based approach in the input-output linearization problem. First, we generalize the classical Byrnes-Isidori normal form to a prioritized normal form having a triangular structure, so that the singularity of a subblock of the input gain matrix related to lower-priority tasks does not directly propagate to higher-priority tasks. Next, we present a prioritized input-output linearization via the multi-objective optimization with the lexicographical ordering, resulting in a prioritized semilinear form that establishes input output relations whose subset with higher priority is linear or close to be linear. Finally, Lyapunov analysis on ultimate boundedness and task achievement is provided, particularly when the proposed prioritized input-output linearization is applied to the output tracking problem. This work introduces a new control framework for complex systems having critical and noncritical control issues, by assigning higher priority to the critical ones.
△ Less
Submitted 4 May, 2023; v1 submitted 3 May, 2023;
originally announced May 2023.
-
Self-supervised Image Denoising with Downsampled Invariance Loss and Conditional Blind-Spot Network
Authors:
Yeong Il Jang,
Keuntek Lee,
Gu Yong Park,
Seyun Kim,
Nam Ik Cho
Abstract:
There have been many image denoisers using deep neural networks, which outperform conventional model-based methods by large margins. Recently, self-supervised methods have attracted attention because constructing a large real noise dataset for supervised training is an enormous burden. The most representative self-supervised denoisers are based on blind-spot networks, which exclude the receptive f…
▽ More
There have been many image denoisers using deep neural networks, which outperform conventional model-based methods by large margins. Recently, self-supervised methods have attracted attention because constructing a large real noise dataset for supervised training is an enormous burden. The most representative self-supervised denoisers are based on blind-spot networks, which exclude the receptive field's center pixel. However, excluding any input pixel is abandoning some information, especially when the input pixel at the corresponding output position is excluded. In addition, a standard blind-spot network fails to reduce real camera noise due to the pixel-wise correlation of noise, though it successfully removes independently distributed synthetic noise. Hence, to realize a more practical denoiser, we propose a novel self-supervised training framework that can remove real noise. For this, we derive the theoretic upper bound of a supervised loss where the network is guided by the downsampled blinded output. Also, we design a conditional blind-spot network (C-BSN), which selectively controls the blindness of the network to use the center pixel information. Furthermore, we exploit a random subsampler to decorrelate noise spatially, making the C-BSN free of visual artifacts that were often seen in downsample-based methods. Extensive experiments show that the proposed C-BSN achieves state-of-the-art performance on real-world datasets as a self-supervised denoiser and shows qualitatively pleasing results without any post-processing or refinement.
△ Less
Submitted 28 July, 2023; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Efficient Point Mass Predictor for Continuous and Discrete Models with Linear Dynamics
Authors:
Jakub Matousek,
**drich Dunik,
Marek Brandner,
Chan Gook Park,
Yeongkwon Choe
Abstract:
This paper deals with state estimation of stochastic models with linear state dynamics, continuous or discrete in time. The emphasis is laid on a numerical solution to the state prediction by the time-update step of the grid-point-based point-mass filter (PMF), which is the most computationally demanding part of the PMF algorithm. A novel way of manipulating the grid, leading to the time-update in…
▽ More
This paper deals with state estimation of stochastic models with linear state dynamics, continuous or discrete in time. The emphasis is laid on a numerical solution to the state prediction by the time-update step of the grid-point-based point-mass filter (PMF), which is the most computationally demanding part of the PMF algorithm. A novel way of manipulating the grid, leading to the time-update in form of a convolution, is proposed. This reduces the PMF time complexity from quadratic to log-linear with respect to the number of grid points. Furthermore, the number of unique transition probability values is greatly reduced causing a significant reduction of the data storage needed. The proposed PMF prediction step is verified in a numerical study.
△ Less
Submitted 17 April, 2023; v1 submitted 24 February, 2023;
originally announced February 2023.
-
WaGI : Wavelet-based GAN Inversion for Preserving High-frequency Image Details
Authors:
Seung-Jun Moon,
Chaewon Kim,
Gyeong-Moon Park
Abstract:
Recent GAN inversion models focus on preserving image-specific details through various methods, e.g., generator tuning or feature mixing. While those are helpful for preserving details compared to a naiive low-rate latent inversion, they still fail to maintain high-frequency features precisely. In this paper, we point out that the existing GAN inversion models have inherent limitations in both str…
▽ More
Recent GAN inversion models focus on preserving image-specific details through various methods, e.g., generator tuning or feature mixing. While those are helpful for preserving details compared to a naiive low-rate latent inversion, they still fail to maintain high-frequency features precisely. In this paper, we point out that the existing GAN inversion models have inherent limitations in both structural and training aspects, which preclude the delicate reconstruction of high-frequency features. Especially, we prove that the widely-used loss term in GAN inversion, i.e., L2, is biased to reconstruct low-frequency features mainly. To overcome this problem, we propose a novel GAN inversion model, coined WaGI, which enables to handle high-frequency features explicitly, by using a novel wavelet-based loss term and a newly proposed wavelet fusion scheme. To the best of our knowledge, WaGI is the first attempt to interpret GAN inversion in the frequency domain. We demonstrate that WaGI shows outstanding results on both inversion and editing, compared to the existing state-of-the-art GAN inversion models. Especially, WaGI robustly preserves high-frequency features of images even in the editing scenario. We will release our code with the pre-trained model after the review.
△ Less
Submitted 18 October, 2022;
originally announced October 2022.
-
First Demonstration of the Korean eLoran Accuracy in a Narrow Waterway Using Improved ASF Maps
Authors:
Woohyun Kim,
Pyo-Woong Son,
Sul Gee Park,
Sang Hyun Park,
Jiwon Seo
Abstract:
The vulnerabilities of global navigation satellite systems (GNSSs) to radio frequency jamming and spoofing have attracted significant research attention. In particular, the large-scale jamming incidents that occurred in South Korea substantiate the practical importance of implementing a complementary navigation system. This letter briefly summarizes the efforts of South Korea to deploy an enhanced…
▽ More
The vulnerabilities of global navigation satellite systems (GNSSs) to radio frequency jamming and spoofing have attracted significant research attention. In particular, the large-scale jamming incidents that occurred in South Korea substantiate the practical importance of implementing a complementary navigation system. This letter briefly summarizes the efforts of South Korea to deploy an enhanced long-range navigation (eLoran) system, which is a terrestrial low-frequency radio navigation system that can complement GNSSs. After four years of research and development, the Korean eLoran testbed system has been recently deployed and is operational since June 1, 2021. Although its initial performance at sea is satisfactory, navigation through a narrow waterway is still challenging because a complete survey of the additional secondary factor (ASF), which is the largest source of error for eLoran, is practically difficult in a narrow waterway. This letter proposes an alternative way to survey the ASF in a narrow waterway and improve the ASF map generation methods. Moreover, the performance of the proposed approach was validated experimentally.
△ Less
Submitted 28 September, 2021; v1 submitted 18 September, 2021;
originally announced September 2021.
-
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
Authors:
Yoo Rhee Oh,
Kiyoung Park,
Jeon Gyu Park
Abstract:
With the recent advances in technology, automatic speech recognition (ASR) has been widely used in real-world applications. The efficiency of converting large amounts of speech into text accurately with limited resources has become more important than ever. This paper proposes a method to rapidly recognize a large speech database via a Transformer-based end-to-end model. Transformers have improved…
▽ More
With the recent advances in technology, automatic speech recognition (ASR) has been widely used in real-world applications. The efficiency of converting large amounts of speech into text accurately with limited resources has become more important than ever. This paper proposes a method to rapidly recognize a large speech database via a Transformer-based end-to-end model. Transformers have improved the state-of-the-art performance in many fields. However, they are not easy to use for long sequences. In this paper, various techniques to speed up the recognition of real-world speeches are proposed and tested, including decoding via multiple-utterance batched beam search, detecting end-of-speech based on a connectionist temporal classification (CTC), restricting the CTC prefix score, and splitting long speeches into short segments. Experiments are conducted with the Librispeech English and the real-world Korean ASR tasks to verify the proposed methods. From the experiments, the proposed system can convert 8 hours of speeches spoken at real-world meetings into text in less than 3 minutes with a 10.73% character error rate, which is 27.1% relatively lower than that of conventional systems.
△ Less
Submitted 11 September, 2021; v1 submitted 14 January, 2021;
originally announced January 2021.
-
Zero-dynamics Attack, Variations, and Countermeasures
Authors:
Hyungbo Shim,
Juhoon Back,
Yongsoon Eun,
Gyunghoon Park,
Jihan Kim
Abstract:
This chapter presents an overview on actuator attacks that exploit zero dynamics, and countermeasures against them. First, zero-dynamics attack is re-introduced based on a canonical representation called normal form. Then it is shown that the target dynamic system is at elevated risk if the associated zero dynamics is unstable. From there on, several questions are raised in series to ensure when t…
▽ More
This chapter presents an overview on actuator attacks that exploit zero dynamics, and countermeasures against them. First, zero-dynamics attack is re-introduced based on a canonical representation called normal form. Then it is shown that the target dynamic system is at elevated risk if the associated zero dynamics is unstable. From there on, several questions are raised in series to ensure when the target system is immune to the attack of this kind. The first question is: Is the target system secure from zero-dynamics attack if it does not have any unstable zeros? An answer provided for this question is: No, the target system may still be at risk due to another attack surface emerging in the process of implementation. This is followed by a series of next questions, and in the course of providing answers, variants of the classic zero-dynamics attack are presented, from which the vulnerability of the target system is explored in depth. At the end, countermeasures are proposed to render the attack ineffective. Because it is known that the zero-dynamics in continuous-time systems cannot be modified by feedback, the main idea of the countermeasure is to relocate any unstable zero to a stable region in the stage of digital implementation through modified digital samplers and holders. Adversaries can still attack actuators, but due to the re-located zeros, they are of little use in damaging the target system.
△ Less
Submitted 2 January, 2021;
originally announced January 2021.
-
Deep Metric Learning-based Image Retrieval System for Chest Radiograph and its Clinical Applications in COVID-19
Authors:
Aoxiao Zhong,
Xiang Li,
Dufan Wu,
Hui Ren,
Kyungsang Kim,
Younggon Kim,
Varun Buch,
Nir Neumark,
Bernardo Bizzo,
Won Young Tak,
Soo Young Park,
Yu Rim Lee,
Min Kyu Kang,
Jung Gil Park,
Byung Seok Kim,
Woo ** Chung,
Ning Guo,
Ittai Dayan,
Mannudeep K. Kalra,
Quanzheng Li
Abstract:
In recent years, deep learning-based image analysis methods have been widely applied in computer-aided detection, diagnosis and prognosis, and has shown its value during the public health crisis of the novel coronavirus disease 2019 (COVID-19) pandemic. Chest radiograph (CXR) has been playing a crucial role in COVID-19 patient triaging, diagnosing and monitoring, particularly in the United States.…
▽ More
In recent years, deep learning-based image analysis methods have been widely applied in computer-aided detection, diagnosis and prognosis, and has shown its value during the public health crisis of the novel coronavirus disease 2019 (COVID-19) pandemic. Chest radiograph (CXR) has been playing a crucial role in COVID-19 patient triaging, diagnosing and monitoring, particularly in the United States. Considering the mixed and unspecific signals in CXR, an image retrieval model of CXR that provides both similar images and associated clinical information can be more clinically meaningful than a direct image diagnostic model. In this work we develop a novel CXR image retrieval model based on deep metric learning. Unlike traditional diagnostic models which aims at learning the direct map** from images to labels, the proposed model aims at learning the optimized embedding space of images, where images with the same labels and similar contents are pulled together. It utilizes multi-similarity loss with hard-mining sampling strategy and attention mechanism to learn the optimized embedding space, and provides similar images to the query image. The model is trained and validated on an international multi-site COVID-19 dataset collected from 3 different sources. Experimental results of COVID-19 image retrieval and diagnosis tasks show that the proposed model can serve as a robust solution for CXR analysis and patient management for COVID-19. The model is also tested on its transferability on a different clinical decision support task, where the pre-trained model is applied to extract image features from a new dataset without any further training. These results demonstrate our deep metric learning based image retrieval model is highly efficient in the CXR retrieval, diagnosis and prognosis, and thus has great clinical value for the treatment and management of COVID-19 patients.
△ Less
Submitted 25 November, 2020;
originally announced December 2020.
-
ElderSim: A Synthetic Data Generation Platform for Human Action Recognition in Eldercare Applications
Authors:
Hochul Hwang,
Cheongjae Jang,
Geonwoo Park,
Junghyun Cho,
Ig-Jae Kim
Abstract:
To train deep learning models for vision-based action recognition of elders' daily activities, we need large-scale activity datasets acquired under various daily living environments and conditions. However, most public datasets used in human action recognition either differ from or have limited coverage of elders' activities in many aspects, making it challenging to recognize elders' daily activit…
▽ More
To train deep learning models for vision-based action recognition of elders' daily activities, we need large-scale activity datasets acquired under various daily living environments and conditions. However, most public datasets used in human action recognition either differ from or have limited coverage of elders' activities in many aspects, making it challenging to recognize elders' daily activities well by only utilizing existing datasets. Recently, such limitations of available datasets have actively been compensated by generating synthetic data from realistic simulation environments and using those data to train deep learning models. In this paper, based on these ideas we develop ElderSim, an action simulation platform that can generate synthetic data on elders' daily activities. For 55 kinds of frequent daily activities of the elders, ElderSim generates realistic motions of synthetic characters with various adjustable data-generating options, and provides different output modalities including RGB videos, two- and three-dimensional skeleton trajectories. We then generate KIST SynADL, a large-scale synthetic dataset of elders' activities of daily living, from ElderSim and use the data in addition to real datasets to train three state-of the-art human action recognition models. From the experiments following several newly proposed scenarios that assume different real and synthetic dataset configurations for training, we observe a noticeable performance improvement by augmenting our synthetic data. We also offer guidance with insights for the effective utilization of synthetic data to help recognize elders' daily activities.
△ Less
Submitted 28 October, 2020;
originally announced October 2020.
-
Deep Learning-based Four-region Lung Segmentation in Chest Radiography for COVID-19 Diagnosis
Authors:
Young-Gon Kim,
Kyungsang Kim,
Dufan Wu,
Hui Ren,
Won Young Tak,
Soo Young Park,
Yu Rim Lee,
Min Kyu Kang,
Jung Gil Park,
Byung Seok Kim,
Woo ** Chung,
Mannudeep K. Kalra,
Quanzheng Li
Abstract:
Purpose. Imaging plays an important role in assessing severity of COVID 19 pneumonia. However, semantic interpretation of chest radiography (CXR) findings does not include quantitative description of radiographic opacities. Most current AI assisted CXR image analysis framework do not quantify for regional variations of disease. To address these, we proposed a four region lung segmentation method t…
▽ More
Purpose. Imaging plays an important role in assessing severity of COVID 19 pneumonia. However, semantic interpretation of chest radiography (CXR) findings does not include quantitative description of radiographic opacities. Most current AI assisted CXR image analysis framework do not quantify for regional variations of disease. To address these, we proposed a four region lung segmentation method to assist accurate quantification of COVID 19 pneumonia. Methods. A segmentation model to separate left and right lung is firstly applied, and then a carina and left hilum detection network is used, which are the clinical landmarks to separate the upper and lower lungs. To improve the segmentation performance of COVID 19 images, ensemble strategy incorporating five models is exploited. Using each region, we evaluated the clinical relevance of the proposed method with the Radiographic Assessment of the Quality of Lung Edema (RALE). Results. The proposed ensemble strategy showed dice score of 0.900, which is significantly higher than conventional methods (0.854 0.889). Mean intensities of segmented four regions indicate positive correlation to the extent and density scores of pulmonary opacities under the RALE framework. Conclusion. A deep learning based model in CXR can accurately segment and quantify regional distribution of pulmonary opacities in patients with COVID 19 pneumonia.
△ Less
Submitted 26 September, 2020;
originally announced September 2020.
-
White Paper on Critical and Massive Machine Type Communication Towards 6G
Authors:
Nurul Huda Mahmood,
Stefan Böcker,
Andrea Munari,
Federico Clazzer,
Ingrid Moerman,
Konstantin Mikhaylov,
Onel Lopez,
Ok-Sun Park,
Eric Mercier,
Hannes Bartz,
Riku Jäntti,
Ravikumar Pragada,
Yihua Ma,
Elina Annanperä,
Christian Wietfeld,
Martin Andraud,
Gianluigi Liva,
Yan Chen,
Eduardo Garro,
Frank Burkhardt,
Hirley Alves,
Chen-Feng Liu,
Yalcin Sadi,
Jean-Baptiste Dore,
Eunah Kim
, et al. (6 additional authors not shown)
Abstract:
The society as a whole, and many vertical sectors in particular, is becoming increasingly digitalized. Machine Type Communication (MTC), encompassing its massive and critical aspects, and ubiquitous wireless connectivity are among the main enablers of such digitization at large. The recently introduced 5G New Radio is natively designed to support both aspects of MTC to promote the digital transfor…
▽ More
The society as a whole, and many vertical sectors in particular, is becoming increasingly digitalized. Machine Type Communication (MTC), encompassing its massive and critical aspects, and ubiquitous wireless connectivity are among the main enablers of such digitization at large. The recently introduced 5G New Radio is natively designed to support both aspects of MTC to promote the digital transformation of the society. However, it is evident that some of the more demanding requirements cannot be fully supported by 5G networks. Alongside, further development of the society towards 2030 will give rise to new and more stringent requirements on wireless connectivity in general, and MTC in particular. Driven by the societal trends towards 2030, the next generation (6G) will be an agile and efficient convergent network serving a set of diverse service classes and a wide range of key performance indicators (KPI). This white paper explores the main drivers and requirements of an MTC-optimized 6G network, and discusses the following six key research questions:
- Will the main KPIs of 5G continue to be the dominant KPIs in 6G; or will there emerge new key metrics?
- How to deliver different E2E service mandates with different KPI requirements considering joint-optimization at the physical up to the application layer?
- What are the key enablers towards designing ultra-low power receivers and highly efficient sleep modes?
- How to tackle a disruptive rather than incremental joint design of a massively scalable waveform and medium access policy for global MTC connectivity?
- How to support new service classes characterizing mission-critical and dependable MTC in 6G?
- What are the potential enablers of long term, lightweight and flexible privacy and security schemes considering MTC device requirements?
△ Less
Submitted 4 May, 2020; v1 submitted 29 April, 2020;
originally announced April 2020.
-
Transfer Learning from Synthetic to Real-Noise Denoising with Adaptive Instance Normalization
Authors:
Yoonsik Kim,
Jae Woong Soh,
Gu Yong Park,
Nam Ik Cho
Abstract:
Real-noise denoising is a challenging task because the statistics of real-noise do not follow the normal distribution, and they are also spatially and temporally changing. In order to cope with various and complex real-noise, we propose a well-generalized denoising architecture and a transfer learning scheme. Specifically, we adopt an adaptive instance normalization to build a denoiser, which can…
▽ More
Real-noise denoising is a challenging task because the statistics of real-noise do not follow the normal distribution, and they are also spatially and temporally changing. In order to cope with various and complex real-noise, we propose a well-generalized denoising architecture and a transfer learning scheme. Specifically, we adopt an adaptive instance normalization to build a denoiser, which can regularize the feature map and prevent the network from overfitting to the training set. We also introduce a transfer learning scheme that transfers knowledge learned from synthetic-noise data to the real-noise denoiser. From the proposed transfer learning, the synthetic-noise denoiser can learn general features from various synthetic-noise data, and the real-noise denoiser can learn the real-noise characteristics from real data. From the experiments, we find that the proposed denoising method has great generalization ability, such that our network trained with synthetic-noise achieves the best performance for Darmstadt Noise Dataset (DND) among the methods from published papers. We can also see that the proposed transfer learning scheme robustly works for real-noise images through the learning with a very small number of labeled data.
△ Less
Submitted 16 March, 2020; v1 submitted 25 February, 2020;
originally announced February 2020.
-
Natural and Realistic Single Image Super-Resolution with Explicit Natural Manifold Discrimination
Authors:
Jae Woong Soh,
Gu Yong Park,
Junho Jo,
Nam Ik Cho
Abstract:
Recently, many convolutional neural networks for single image super-resolution (SISR) have been proposed, which focus on reconstructing the high-resolution images in terms of objective distortion measures. However, the networks trained with objective loss functions generally fail to reconstruct the realistic fine textures and details that are essential for better perceptual quality. Recovering the…
▽ More
Recently, many convolutional neural networks for single image super-resolution (SISR) have been proposed, which focus on reconstructing the high-resolution images in terms of objective distortion measures. However, the networks trained with objective loss functions generally fail to reconstruct the realistic fine textures and details that are essential for better perceptual quality. Recovering the realistic details remains a challenging problem, and only a few works have been proposed which aim at increasing the perceptual quality by generating enhanced textures. However, the generated fake details often make undesirable artifacts and the overall image looks somewhat unnatural. Therefore, in this paper, we present a new approach to reconstructing realistic super-resolved images with high perceptual quality, while maintaining the naturalness of the result. In particular, we focus on the domain prior properties of SISR problem. Specifically, we define the naturalness prior in the low-level domain and constrain the output image in the natural manifold, which eventually generates more natural and realistic images. Our results show better naturalness compared to the recent super-resolution algorithms including perception-oriented ones.
△ Less
Submitted 9 November, 2019;
originally announced November 2019.
-
Model Predictive Control Framework for Improving Vehicle Cornering Performance Using Handling Characteristics
Authors:
Kyoungseok Han,
Giseo Park,
Gokul S. Sankar,
Kanghyun Nam,
Seibum B. Choi
Abstract:
This paper proposes a new control strategy to improve vehicle cornering performance in a model predictive control framework. The most distinguishing feature of the proposed method is that the natural handling characteristics of the production vehicle is exploited to reduce the complexity of the conventional control methods. For safety s sake, most production vehicles are built to exhibit an unders…
▽ More
This paper proposes a new control strategy to improve vehicle cornering performance in a model predictive control framework. The most distinguishing feature of the proposed method is that the natural handling characteristics of the production vehicle is exploited to reduce the complexity of the conventional control methods. For safety s sake, most production vehicles are built to exhibit an understeer handling characteristics to some extent. By monitoring how much the vehicle is biased into the understeer state, the controller attempts to adjust this amount in a way that improves the vehicle cornering performance. With this particular strategy, an innovative controller can be designed without road friction information, which complicates the conventional control methods. In addition, unlike the conventional controllers, the reference yaw rate that is highly dependent on road friction need not be defined due to the proposed control structure. The optimal control problem is formulated in a model predictive control framework to handle the constraints efficiently, and simulations in various test scenarios illustrate the effectiveness of the proposed approach.
△ Less
Submitted 14 November, 2019; v1 submitted 19 April, 2019;
originally announced April 2019.
-
Robust Stability of Discrete-time Disturbance Observers: Understanding Interplay of Sampling, Model Uncertainty and Discrete-time Designs
Authors:
Gyunghoon Park,
Chanhwa Lee,
Youngjun Joo,
Hyungbo Shim
Abstract:
In this paper, we address the problem of robust stability for uncertain sampled-data systems controlled by a discrete-time disturbance observer (DT-DOB). Unlike most of previous works that rely on the small-gain theorem, our approach is to investigate the location of the roots of the characteristic polynomial when the sampling is performed sufficiently fast. This approach provides a generalized fr…
▽ More
In this paper, we address the problem of robust stability for uncertain sampled-data systems controlled by a discrete-time disturbance observer (DT-DOB). Unlike most of previous works that rely on the small-gain theorem, our approach is to investigate the location of the roots of the characteristic polynomial when the sampling is performed sufficiently fast. This approach provides a generalized framework for the stability analysis in the sense that (i) many popular discretization methods are taken into account; (ii) under fast sampling, the obtained robust stability condition is necessary and sufficient except in a degenerative case; and (iii) systems of arbitrary order and of large uncertainty can be dealt with. The relation between sampling zeros---discrete-time zeros that newly appear due to the sampling---and robust stability is highlighted, and it is explicitly revealed that the sampling zeros can hamper stability of the overall system when the Q-filter and/or the nominal model are carelessly selected in discrete time. Finally, a design guideline for the Q-filter and the nominal model in the discrete-time domain is proposed for robust stabilization under the sampling against the arbitrarily large (but bounded) parametric uncertainty of the plant.
△ Less
Submitted 24 January, 2019;
originally announced January 2019.
-
2-gram-based Phonetic Feature Generation for Convolutional Neural Network in Assessment of Trademark Similarity
Authors:
Kyung Pyo Ko,
Kwang Hee Lee,
Mi So Jang,
Gun Hong Park
Abstract:
A trademark is a mark used to identify various commodities. If same or similar trademark is registered for the same or similar commodity, the purchaser of the goods may be confused. Therefore, in the process of trademark registration examination, the examiner judges whether the trademark is the same or similar to the other applied or registered trademarks. The confusion in trademarks is based on t…
▽ More
A trademark is a mark used to identify various commodities. If same or similar trademark is registered for the same or similar commodity, the purchaser of the goods may be confused. Therefore, in the process of trademark registration examination, the examiner judges whether the trademark is the same or similar to the other applied or registered trademarks. The confusion in trademarks is based on the visual, phonetic or conceptual similarity of the marks. In this paper, we focus specifically on the phonetic similarity between trademarks. We propose a method to generate 2D phonetic feature for convolutional neural network in assessment of trademark similarity. This proposed algorithm is tested with 12,553 trademark phonetic similar pairs and 34,020 trademark phonetic non-similar pairs from 2010 to 2016. As a result, we have obtained approximately 92% judgment accuracy.
△ Less
Submitted 10 February, 2018;
originally announced February 2018.
-
A Zero-stealthy Attack for Sampled-data Control Systems via Input Redundancy
Authors:
Jihan Kim,
Gyunghoon Park,
Hyungbo Shim,
Yongsoon Eun
Abstract:
In this paper, we introduce a new vulnerability of cyber-physical systems to malicious attack. It arises when the physical plant, that is modeled as a continuous-time LTI system, is controlled by a digital controller. In the sampled-data framework, most anomaly detectors monitor the plant's output only at discrete time instants, and thus, nothing abnormal can be detected as long as the sampled out…
▽ More
In this paper, we introduce a new vulnerability of cyber-physical systems to malicious attack. It arises when the physical plant, that is modeled as a continuous-time LTI system, is controlled by a digital controller. In the sampled-data framework, most anomaly detectors monitor the plant's output only at discrete time instants, and thus, nothing abnormal can be detected as long as the sampled output behaves normal. This implies that if an actuator attack drives the plant's state to pass through the kernel of the output matrix at each sensing time, then the attack compromises the system while remaining stealthy. We show that this type of attack always exists when the sampled-data system has an input redundancy, i.e., the number of inputs being larger than that of the outputs or the sampling rate of the actuators being higher than that of the sensors. Simulation results for the X-38 vehicle and for the other numerical examples illustrate this new attack strategy possibly brings disastrous consequences.
△ Less
Submitted 10 January, 2018;
originally announced January 2018.
-
Yet Another Tutorial of Disturbance Observer: Robust Stabilization and Recovery of Nominal Performance
Authors:
Hyungbo Shim,
Gyunghoon Park,
Youngjun Joo,
Juhoon Back,
Nam Hoon Jo
Abstract:
This paper presents a tutorial-style review on the recent results about the disturbance observer (DOB) in view of robust stabilization and recovery of the nominal performance. The analysis is based on the case when the bandwidth of Q-filter is large, and it is explained in a pedagogical manner that, even in the presence of plant uncertainties and disturbances, the behavior of real uncertain plant…
▽ More
This paper presents a tutorial-style review on the recent results about the disturbance observer (DOB) in view of robust stabilization and recovery of the nominal performance. The analysis is based on the case when the bandwidth of Q-filter is large, and it is explained in a pedagogical manner that, even in the presence of plant uncertainties and disturbances, the behavior of real uncertain plant can be made almost similar to that of disturbance-free nominal system both in the transient and in the steady-state. The conventional DOB is interpreted in a new perspective, and its restrictions and extensions are discussed.
△ Less
Submitted 19 June, 2016; v1 submitted 8 January, 2016;
originally announced January 2016.
-
Precision improvement of MEMS gyros for indoor mobile robots with horizontal motion inspired by methods of TRIZ
Authors:
Dongmyoung Shin,
Sung Gil Park,
Byung Soo Song,
Eung Su Kim,
Oleg Kupervasser,
Denis Pivovartchuk,
Ilya Gartseev,
Oleg Antipov,
Evgeniy Kruchenkov,
Alexey Milovanov,
Andrey Kochetov,
Igor Sazonov,
Igor Nogtev,
Sun Woo Hyun
Abstract:
In the paper, the problem of precision improvement for the MEMS gyrosensors on indoor robots with horizontal motion is solved by methods of TRIZ ("the theory of inventive problem solving").
In the paper, the problem of precision improvement for the MEMS gyrosensors on indoor robots with horizontal motion is solved by methods of TRIZ ("the theory of inventive problem solving").
△ Less
Submitted 18 March, 2014; v1 submitted 15 November, 2013;
originally announced November 2013.