-
Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks
Authors:
Minjong Cheon
Abstract:
In the realm of deep learning, the Kolmogorov-Arnold Network (KAN) has emerged as a potential alternative to multilayer projections (MLPs). However, its applicability to vision tasks has not been extensively validated. In our study, we demonstrated the effectiveness of KAN for vision tasks through multiple trials on the MNIST, CIFAR10, and CIFAR100 datasets, using a training batch size of 32. Our…
▽ More
In the realm of deep learning, the Kolmogorov-Arnold Network (KAN) has emerged as a potential alternative to multilayer projections (MLPs). However, its applicability to vision tasks has not been extensively validated. In our study, we demonstrated the effectiveness of KAN for vision tasks through multiple trials on the MNIST, CIFAR10, and CIFAR100 datasets, using a training batch size of 32. Our results showed that while KAN outperformed the original MLP-Mixer on CIFAR10 and CIFAR100, it performed slightly worse than the state-of-the-art ResNet-18. These findings suggest that KAN holds significant promise for vision tasks, and further modifications could enhance its performance in future evaluations.Our contributions are threefold: first, we showcase the efficiency of KAN-based algorithms for visual tasks; second, we provide extensive empirical assessments across various vision benchmarks, comparing KAN's performance with MLP-Mixer, CNNs, and Vision Transformers (ViT); and third, we pioneer the use of natural KAN layers in visual tasks, addressing a gap in previous research. This paper lays the foundation for future studies on KANs, highlighting their potential as a reliable alternative for image classification tasks.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing
Authors:
Minjong Cheon
Abstract:
In this research, we propose the first approach for integrating the Kolmogorov-Arnold Network (KAN) with various pre-trained Convolutional Neural Network (CNN) models for remote sensing (RS) scene classification tasks using the EuroSAT dataset. Our novel methodology, named KCN, aims to replace traditional Multi-Layer Perceptrons (MLPs) with KAN to enhance classification performance. We employed mu…
▽ More
In this research, we propose the first approach for integrating the Kolmogorov-Arnold Network (KAN) with various pre-trained Convolutional Neural Network (CNN) models for remote sensing (RS) scene classification tasks using the EuroSAT dataset. Our novel methodology, named KCN, aims to replace traditional Multi-Layer Perceptrons (MLPs) with KAN to enhance classification performance. We employed multiple CNN-based models, including VGG16, MobileNetV2, EfficientNet, ConvNeXt, ResNet101, and Vision Transformer (ViT), and evaluated their performance when paired with KAN. Our experiments demonstrated that KAN achieved high accuracy with fewer training epochs and parameters. Specifically, ConvNeXt paired with KAN showed the best performance, achieving 94% accuracy in the first epoch, which increased to 96% and remained consistent across subsequent epochs. The results indicated that KAN and MLP both achieved similar accuracy, with KAN performing slightly better in later epochs. By utilizing the EuroSAT dataset, we provided a robust testbed to investigate whether KAN is suitable for remote sensing classification tasks. Given that KAN is a novel algorithm, there is substantial capacity for further development and optimization, suggesting that KCN offers a promising alternative for efficient image analysis in the RS field.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
KARINA: An Efficient Deep Learning Model for Global Weather Forecast
Authors:
Minjong Cheon,
Yo-Hwan Choi,
Seon-Yu Kang,
Yumi Choi,
Jeong-Gil Lee,
Daehyun Kang
Abstract:
Deep learning-based, data-driven models are gaining prevalence in climate research, particularly for global weather prediction. However, training the global weather data at high resolution requires massive computational resources. Therefore, we present a new model named KARINA to overcome the substantial computational demands typical of this field. This model achieves forecasting accuracy comparab…
▽ More
Deep learning-based, data-driven models are gaining prevalence in climate research, particularly for global weather prediction. However, training the global weather data at high resolution requires massive computational resources. Therefore, we present a new model named KARINA to overcome the substantial computational demands typical of this field. This model achieves forecasting accuracy comparable to higher-resolution counterparts with significantly less computational resources, requiring only 4 NVIDIA A100 GPUs and less than 12 hours of training. KARINA combines ConvNext, SENet, and Geocyclic Padding to enhance weather forecasting at a 2.5° resolution, which could filter out high-frequency noise. Geocyclic Padding preserves pixels at the lateral boundary of the input image, thereby maintaining atmospheric flow continuity in the spherical Earth. SENet dynamically improves feature response, advancing atmospheric process modeling, particularly in the vertical column process as numerous channels. In this vein, KARINA sets new benchmarks in weather forecasting accuracy, surpassing existing models like the ECMWF S2S reforecasts at a lead time of up to 7 days. Remarkably, KARINA achieved competitive performance even when compared to the recently developed models (Pangu-Weather, GraphCast, ClimaX, and FourCastNet) trained with high-resolution data having 100 times larger pixels. Conclusively, KARINA significantly advances global weather forecasting by efficiently modeling Earth's atmosphere with improved accuracy and resource efficiency.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Advancing Data-driven Weather Forecasting: Time-Sliding Data Augmentation of ERA5
Authors:
Minjong Cheon,
Daehyun Kang,
Yo-Hwan Choi,
Seon-Yu Kang
Abstract:
Modern deep learning techniques, which mimic traditional numerical weather prediction (NWP) models and are derived from global atmospheric reanalysis data, have caused a significant revolution within a few years. In this new paradigm, our research introduces a novel strategy that deviates from the common dependence on high-resolution data, which is often constrained by computational resources, and…
▽ More
Modern deep learning techniques, which mimic traditional numerical weather prediction (NWP) models and are derived from global atmospheric reanalysis data, have caused a significant revolution within a few years. In this new paradigm, our research introduces a novel strategy that deviates from the common dependence on high-resolution data, which is often constrained by computational resources, and instead utilizes low-resolution data (2.5 degrees) for global weather prediction and climate data analysis. Our main focus is evaluating data-driven weather prediction (DDWP) frameworks, specifically addressing sample size adequacy, structural improvements to the model, and the ability of climate data to represent current climatic trends. By using the Adaptive Fourier Neural Operator (AFNO) model via FourCastNet and a proposed time-sliding method to inflate the dataset of the ECMWF Reanalysis v5 (ERA5), this paper improves on conventional approaches by adding more variables and a novel approach to data augmentation and processing. Our findings reveal that despite the lower resolution, the proposed approach demonstrates considerable accuracy in predicting atmospheric conditions, effectively rivaling higher-resolution models. Furthermore, the study confirms the model's proficiency in reflecting current climate trends and its potential in predicting future climatic events, underscoring its utility in climate change strategies. This research marks a pivotal step in the realm of meteorological forecasting, showcasing the feasibility of lower-resolution data in producing reliable predictions and opening avenues for more accessible and inclusive climate modeling. The insights gleaned from this study not only contribute to the advancement of climate science but also lay the groundwork for future innovations in the field.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Design of a novel Korean learning application for efficient pronunciation correction
Authors:
Minjong Cheon,
Minseon Kim,
Hanseon Joo
Abstract:
The Korean wave, which denotes the global popularity of South Korea's cultural economy, contributes to the increasing demand for the Korean language. However, as there does not exist any application for foreigners to learn Korean, this paper suggested a design of a novel Korean learning application. Speech recognition, speech-to-text, and speech-to-waveform are the three key systems in the propose…
▽ More
The Korean wave, which denotes the global popularity of South Korea's cultural economy, contributes to the increasing demand for the Korean language. However, as there does not exist any application for foreigners to learn Korean, this paper suggested a design of a novel Korean learning application. Speech recognition, speech-to-text, and speech-to-waveform are the three key systems in the proposed system. The Google API and the librosa library will transform the user's voice into a sentence and MFCC. The software will then display the user's phrase and answer, with mispronounced elements highlighted in red, allowing users to more easily recognize the incorrect parts of their pronunciation. Furthermore, the Siamese network might utilize those translated spectrograms to provide a similarity score, which could subsequently be used to offer feedback to the user. Despite the fact that we were unable to collect sufficient foreigner data for this research, it is notable that we presented a novel Korean pronunciation correction method for foreigners.
△ Less
Submitted 4 May, 2022;
originally announced May 2022.
-
Color of Copper/Copper oxide
Authors:
Su Jae Kim,
Seonghoon Kim,
Jegon Lee,
Youngjae Jo,
Yu-Seong Seo,
Myounghoon Lee,
Yousil Lee,
Chae Ryong Cho,
Jong-pil Kim,
Miyeon Cheon,
Jungseek Hwang,
Yong In Kim,
Young-Hoon Kim,
Young-Min Kim,
Aloysius Soon,
Myunghwan Choi,
Woo Seok Choi,
Se-Young Jeong,
Young Hee Lee
Abstract:
Stochastic inhomogeneous oxidation is an inherent characteristic of copper (Cu), often hindering color tuning and bandgap engineering of oxides. Coherent control of the interface between metal and metal oxide remains unresolved. We demonstrate coherent propagation of an oxidation front in single-crystal Cu thin film to achieve a full-color spectrum for Cu by precisely controlling its oxide-layer t…
▽ More
Stochastic inhomogeneous oxidation is an inherent characteristic of copper (Cu), often hindering color tuning and bandgap engineering of oxides. Coherent control of the interface between metal and metal oxide remains unresolved. We demonstrate coherent propagation of an oxidation front in single-crystal Cu thin film to achieve a full-color spectrum for Cu by precisely controlling its oxide-layer thickness. Grain boundary-free and atomically flat films prepared by atomic-sputtering epitaxy allow tailoring of the oxide layer with an abrupt interface via heat treatment with a suppressed temperature gradient. Color tuning of nearly full-color RGB indices is realized by precise control of oxide-layer thickness; our samples covered ~50.4% of the sRGB color space. The color of copper/copper oxide is realized by the reconstruction of the quantitative yield color from oxide pigment (complex dielectric functions of Cu2O) and light-layer interference (reflectance spectra obtained from the Fresnel equations) to produce structural color. We further demonstrate laser-oxide lithography with micron-scale linewidth and depth through local phase transformation to oxides embedded in the metal, providing spacing necessary for semiconducting transport and optoelectronics functionality.
△ Less
Submitted 15 July, 2021;
originally announced July 2021.
-
NTIRE 2021 Challenge on Perceptual Image Quality Assessment
Authors:
**** Gu,
Haoming Cai,
Chao Dong,
Jimmy S. Ren,
Yu Qiao,
Shuhang Gu,
Radu Timofte,
Manri Cheon,
Sungjun Yoon,
Byungyeon Kang,
Junwoo Lee,
Qing Zhang,
Haiyang Guo,
Yi Bin,
Yuqing Hou,
Hengliang Luo,
**gyu Guo,
Zirui Wang,
Hai Wang,
Wenming Yang,
Qingyan Bai,
Shuwei Shi,
Weihao Xia,
Mingdeng Cao,
Jiahao Wang
, et al. (25 additional authors not shown)
Abstract:
This paper reports on the NTIRE 2021 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2021. As a new type of image processing technology, perceptual image processing algorithms based on Generative Adversarial Networks (GAN) have produced images with more realistic textures. These o…
▽ More
This paper reports on the NTIRE 2021 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2021. As a new type of image processing technology, perceptual image processing algorithms based on Generative Adversarial Networks (GAN) have produced images with more realistic textures. These output images have completely different characteristics from traditional distortions, thus pose a new challenge for IQA methods to evaluate their visual quality. In comparison with previous IQA challenges, the training and testing datasets in this challenge include the outputs of perceptual image processing algorithms and the corresponding subjective scores. Thus they can be used to develop and evaluate IQA methods on GAN-based distortions. The challenge has 270 registered participants in total. In the final testing stage, 13 participating teams submitted their models and fact sheets. Almost all of them have achieved much better results than existing IQA methods, while the winning method can demonstrate state-of-the-art performance.
△ Less
Submitted 28 June, 2021; v1 submitted 7 May, 2021;
originally announced May 2021.
-
Perceptual Image Quality Assessment with Transformers
Authors:
Manri Cheon,
Sung-Jun Yoon,
Byungyeon Kang,
Junwoo Lee
Abstract:
In this paper, we propose an image quality transformer (IQT) that successfully applies a transformer architecture to a perceptual full-reference image quality assessment (IQA) task. Perceptual representation becomes more important in image quality assessment. In this context, we extract the perceptual feature representations from each of input images using a convolutional neural network (CNN) back…
▽ More
In this paper, we propose an image quality transformer (IQT) that successfully applies a transformer architecture to a perceptual full-reference image quality assessment (IQA) task. Perceptual representation becomes more important in image quality assessment. In this context, we extract the perceptual feature representations from each of input images using a convolutional neural network (CNN) backbone. The extracted feature maps are fed into the transformer encoder and decoder in order to compare a reference and distorted images. Following an approach of the transformer-based vision models, we use extra learnable quality embedding and position embedding. The output of the transformer is passed to a prediction head in order to predict a final quality score. The experimental results show that our proposed model has an outstanding performance for the standard IQA datasets. For a large-scale IQA dataset containing output images of generative model, our model also shows the promising results. The proposed IQT was ranked first among 13 participants in the NTIRE 2021 perceptual image quality assessment challenge. Our work will be an opportunity to further expand the approach for the perceptual IQA task.
△ Less
Submitted 4 May, 2021; v1 submitted 29 April, 2021;
originally announced April 2021.
-
Ambiguity of Objective Image Quality Metrics: A New Methodology for Performance Evaluation
Authors:
Manri Cheon,
Toinon Vigier,
Lukáš Krasula,
Junghyuk Lee,
Patrick Le Callet,
Jong-Seok Lee
Abstract:
Objective image quality metrics try to estimate the perceptual quality of the given image by considering the characteristics of the human visual system. However, it is possible that the metrics produce different quality scores even for two images that are perceptually indistinguishable by human viewers, which have not been considered in the existing studies related to objective quality assessment.…
▽ More
Objective image quality metrics try to estimate the perceptual quality of the given image by considering the characteristics of the human visual system. However, it is possible that the metrics produce different quality scores even for two images that are perceptually indistinguishable by human viewers, which have not been considered in the existing studies related to objective quality assessment. In this paper, we address the issue of ambiguity of objective image quality assessment. We propose an approach to obtain an ambiguity interval of an objective metric, within which the quality score difference is not perceptually significant. In particular, we use the visual difference predictor, which can consider viewing conditions that are important for visual quality perception. In order to demonstrate the usefulness of the proposed approach, we conduct experiments with 33 state-of-the-art image quality metrics in the viewpoint of their accuracy and ambiguity for three image quality databases. The results show that the ambiguity intervals can be applied as an additional figure of merit when conventional performance measurement does not determine superiority between the metrics. The effect of the viewing distance on the ambiguity interval is also shown.
△ Less
Submitted 18 January, 2021;
originally announced January 2021.
-
Texture Transform Attention for Realistic Image Inpainting
Authors:
Ye** Kim,
Manri Cheon,
Junwoo Lee
Abstract:
Over the last few years, the performance of inpainting to fill missing regions has shown significant improvements by using deep neural networks. Most of inpainting work create a visually plausible structure and texture, however, due to them often generating a blurry result, final outcomes appear unrealistic and make feel heterogeneity. In order to solve this problem, the existing methods have used…
▽ More
Over the last few years, the performance of inpainting to fill missing regions has shown significant improvements by using deep neural networks. Most of inpainting work create a visually plausible structure and texture, however, due to them often generating a blurry result, final outcomes appear unrealistic and make feel heterogeneity. In order to solve this problem, the existing methods have used a patch based solution with deep neural network, however, these methods also cannot transfer the texture properly. Motivated by these observation, we propose a patch based method. Texture Transform Attention network(TTA-Net) that better produces the missing region inpainting with fine details. The task is a single refinement network and takes the form of U-Net architecture that transfers fine texture features of encoder to coarse semantic features of decoder through skip-connection. Texture Transform Attention is used to create a new reassembled texture map using fine textures and coarse semantics that can efficiently transfer texture information as a result. To stabilize training process, we use a VGG feature layer of ground truth and patch discriminator. We evaluate our model end-to-end with the publicly available datasets CelebA-HQ and Places2 and demonstrate that images of higher quality can be obtained to the existing state-of-the-art approaches.
△ Less
Submitted 8 December, 2020;
originally announced December 2020.
-
Smoother Network Tuning and Interpolation for Continuous-level Image Processing
Authors:
Hyeongmin Lee,
Taeoh Kim,
Hanbin Son,
Sangwook Baek,
Minsu Cheon,
Sangyoun Lee
Abstract:
In Convolutional Neural Network (CNN) based image processing, most studies propose networks that are optimized to single-level (or single-objective); thus, they underperform on other levels and must be retrained for delivery of optimal performance. Using multiple models to cover multiple levels involves very high computational costs. To solve these problems, recent approaches train networks on two…
▽ More
In Convolutional Neural Network (CNN) based image processing, most studies propose networks that are optimized to single-level (or single-objective); thus, they underperform on other levels and must be retrained for delivery of optimal performance. Using multiple models to cover multiple levels involves very high computational costs. To solve these problems, recent approaches train networks on two different levels and propose their own interpolation methods to enable arbitrary intermediate levels. However, many of them fail to generalize or have certain side effects in practical usage. In this paper, we define these frameworks as network tuning and interpolation and propose a novel module for continuous-level learning, called Filter Transition Network (FTN). This module is a structurally smoother module than existing ones. Therefore, the frameworks with FTN generalize well across various tasks and networks and cause fewer undesirable side effects. For stable learning of FTN, we additionally propose a method to initialize non-linear neural network layers with identity map**s. Extensive results for various image processing tasks indicate that the performance of FTN is comparable in multiple continuous levels, and is significantly smoother and lighter than that of other frameworks.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
NTIRE 2020 Challenge on Image Demoireing: Methods and Results
Authors:
Shanxin Yuan,
Radu Timofte,
Ales Leonardis,
Gregory Slabaugh,
Xiaotong Luo,
Jiangtao Zhang,
Yanyun Qu,
Ming Hong,
Yuan Xie,
Cuihua Li,
Dejia Xu,
Yihao Chu,
Qingyan Sun,
Shuai Liu,
Ziyao Zong,
Nan Nan,
Chenghua Li,
Sangmin Kim,
Hyungjoon Nam,
Jisu Kim,
Jechang Jeong,
Manri Cheon,
Sung-Jun Yoon,
Byungyeon Kang,
Junwoo Lee
, et al. (21 additional authors not shown)
Abstract:
This paper reviews the Challenge on Image Demoireing that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2020. Demoireing is a difficult task of removing moire patterns from an image to reveal an underlying clean image. The challenge was divided into two tracks. Track 1 targeted the single image demoireing problem, which seeks to rem…
▽ More
This paper reviews the Challenge on Image Demoireing that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2020. Demoireing is a difficult task of removing moire patterns from an image to reveal an underlying clean image. The challenge was divided into two tracks. Track 1 targeted the single image demoireing problem, which seeks to remove moire patterns from a single image. Track 2 focused on the burst demoireing problem, where a set of degraded moire images of the same scene were provided as input, with the goal of producing a single demoired image as output. The methods were ranked in terms of their fidelity, measured using the peak signal-to-noise ratio (PSNR) between the ground truth clean images and the restored images produced by the participants' methods. The tracks had 142 and 99 registered participants, respectively, with a total of 14 and 6 submissions in the final testing stage. The entries span the current state-of-the-art in image and burst image demoireing problems.
△ Less
Submitted 6 May, 2020;
originally announced May 2020.
-
Regularized Adaptation for Stable and Efficient Continuous-Level Learning on Image Processing Networks
Authors:
Hyeongmin Lee,
Taeoh Kim,
Hanbin Son,
Sangwook Baek,
Minsu Cheon,
Sangyoun Lee
Abstract:
In Convolutional Neural Network (CNN) based image processing, most of the studies propose networks that are optimized for a single-level (or a single-objective); thus, they underperform on other levels and must be retrained for delivery of optimal performance. Using multiple models to cover multiple levels involves very high computational costs. To solve these problems, recent approaches train the…
▽ More
In Convolutional Neural Network (CNN) based image processing, most of the studies propose networks that are optimized for a single-level (or a single-objective); thus, they underperform on other levels and must be retrained for delivery of optimal performance. Using multiple models to cover multiple levels involves very high computational costs. To solve these problems, recent approaches train the networks on two different levels and propose their own interpolation methods to enable the arbitrary intermediate levels. However, many of them fail to adapt hard tasks or interpolate smoothly, or the others still require large memory and computational cost. In this paper, we propose a novel continuous-level learning framework using a Filter Transition Network (FTN) which is a non-linear module that easily adapt to new levels, and is regularized to prevent undesirable side-effects. Additionally, for stable learning of FTN, we newly propose a method to initialize non-linear CNNs with identity map**s. Furthermore, FTN is extremely lightweight module since it is a data-independent module, which means it is not affected by the spatial resolution of the inputs. Extensive results for various image processing tasks indicate that the performance of FTN is stable in terms of adaptation and interpolation, and comparable to that of the other heavy frameworks.
△ Less
Submitted 11 March, 2020; v1 submitted 11 March, 2020;
originally announced March 2020.
-
An Outer-approximation Guided Optimization Approach for Constrained Neural Network Inverse Problems
Authors:
Myun-Seok Cheon
Abstract:
This paper discusses an outer-approximation guided optimization method for constrained neural network inverse problems with rectified linear units. The constrained neural network inverse problems refer to an optimization problem to find the best set of input values of a given trained neural network in order to produce a predefined desired output in presence of constraints on input values. This pap…
▽ More
This paper discusses an outer-approximation guided optimization method for constrained neural network inverse problems with rectified linear units. The constrained neural network inverse problems refer to an optimization problem to find the best set of input values of a given trained neural network in order to produce a predefined desired output in presence of constraints on input values. This paper analyzes the characteristics of optimal solutions of neural network inverse problems with rectified activation units and proposes an outer-approximation algorithm by exploiting their characteristics. The proposed outer-approximation guided optimization comprises primal and dual phases. The primal phase incorporates neighbor curvatures with neighbor outer-approximations to expedite the process. The dual phase identifies and utilizes the structure of local convex regions to improve the convergence to a local optimal solution. At last, computation experiments demonstrate the superiority of the proposed algorithm compared to a projected gradient method.
△ Less
Submitted 24 February, 2020;
originally announced February 2020.
-
Lightweight and Efficient Image Super-Resolution with Block State-based Recursive Network
Authors:
Jun-Ho Choi,
Jun-Hyuk Kim,
Manri Cheon,
Jong-Seok Lee
Abstract:
Recently, several deep learning-based image super-resolution methods have been developed by stacking massive numbers of layers. However, this leads too large model sizes and high computational complexities, thus some recursive parameter-sharing methods have been also proposed. Nevertheless, their designs do not properly utilize the potential of the recursive operation. In this paper, we propose a…
▽ More
Recently, several deep learning-based image super-resolution methods have been developed by stacking massive numbers of layers. However, this leads too large model sizes and high computational complexities, thus some recursive parameter-sharing methods have been also proposed. Nevertheless, their designs do not properly utilize the potential of the recursive operation. In this paper, we propose a novel, lightweight, and efficient super-resolution method to maximize the usefulness of the recursive architecture, by introducing block state-based recursive network. By taking advantage of utilizing the block state, the recursive part of our model can easily track the status of the current image features. We show the benefits of the proposed method in terms of model size, speed, and efficiency. In addition, we show that our method outperforms the other state-of-the-art methods.
△ Less
Submitted 29 November, 2018;
originally announced November 2018.
-
MAMNet: Multi-path Adaptive Modulation Network for Image Super-Resolution
Authors:
Jun-Hyuk Kim,
Jun-Ho Choi,
Manri Cheon,
Jong-Seok Lee
Abstract:
In recent years, single image super-resolution (SR) methods based on deep convolutional neural networks (CNNs) have made significant progress. However, due to the non-adaptive nature of the convolution operation, they cannot adapt to various characteristics of images, which limits their representational capability and, consequently, results in unnecessarily large model sizes. To address this issue…
▽ More
In recent years, single image super-resolution (SR) methods based on deep convolutional neural networks (CNNs) have made significant progress. However, due to the non-adaptive nature of the convolution operation, they cannot adapt to various characteristics of images, which limits their representational capability and, consequently, results in unnecessarily large model sizes. To address this issue, we propose a novel multi-path adaptive modulation network (MAMNet). Specifically, we propose a multi-path adaptive modulation block (MAMB), which is a lightweight yet effective residual block that adaptively modulates residual feature responses by fully exploiting their information via three paths. The three paths model three types of information suitable for SR: 1) channel-specific information (CSI) using global variance pooling, 2) inter-channel dependencies (ICD) based on the CSI, 3) and channel-specific spatial dependencies (CSD) via depth-wise convolution. We demonstrate that the proposed MAMB is effective and parameter-efficient for image SR than other feature modulation methods. In addition, experimental results show that our MAMNet outperforms most of the state-of-the-art methods with a relatively small number of parameters.
△ Less
Submitted 27 March, 2020; v1 submitted 29 November, 2018;
originally announced November 2018.
-
Deep Learning-based Image Super-Resolution Considering Quantitative and Perceptual Quality
Authors:
Jun-Ho Choi,
Jun-Hyuk Kim,
Manri Cheon,
Jong-Seok Lee
Abstract:
Recently, it has been shown that in super-resolution, there exists a tradeoff relationship between the quantitative and perceptual quality of super-resolved images, which correspond to the similarity to the ground-truth images and the naturalness, respectively. In this paper, we propose a novel super-resolution method that can improve the perceptual quality of the upscaled images while preserving…
▽ More
Recently, it has been shown that in super-resolution, there exists a tradeoff relationship between the quantitative and perceptual quality of super-resolved images, which correspond to the similarity to the ground-truth images and the naturalness, respectively. In this paper, we propose a novel super-resolution method that can improve the perceptual quality of the upscaled images while preserving the conventional quantitative performance. The proposed method employs a deep network for multi-pass upscaling in company with a discriminator network and two quantitative score predictor networks. Experimental results demonstrate that the proposed method achieves a good balance of the quantitative and perceptual quality, showing more satisfactory results than existing methods.
△ Less
Submitted 19 April, 2019; v1 submitted 13 September, 2018;
originally announced September 2018.
-
Generative adversarial network-based image super-resolution using perceptual content losses
Authors:
Manri Cheon,
Jun-Hyuk Kim,
Jun-Ho Choi,
Jong-Seok Lee
Abstract:
In this paper, we propose a deep generative adversarial network for super-resolution considering the trade-off between perception and distortion. Based on good performance of a recently developed model for super-resolution, i.e., deep residual network using enhanced upscale modules (EUSR), the proposed model is trained to improve perceptual performance with only slight increase of distortion. For…
▽ More
In this paper, we propose a deep generative adversarial network for super-resolution considering the trade-off between perception and distortion. Based on good performance of a recently developed model for super-resolution, i.e., deep residual network using enhanced upscale modules (EUSR), the proposed model is trained to improve perceptual performance with only slight increase of distortion. For this purpose, together with the conventional content loss, i.e., reconstruction loss such as L1 or L2, we consider additional losses in the training phase, which are the discrete cosine transform coefficients loss and differential content loss. These consider perceptual part in the content loss, i.e., consideration of proper high frequency components is helpful for the trade-off problem in super-resolution. The experimental results show that our proposed model has good performance for both perception and distortion, and is effective in perceptual super-resolution applications.
△ Less
Submitted 21 September, 2018; v1 submitted 13 September, 2018;
originally announced September 2018.
-
Impact of Three-Dimensional Video Scalability on Multi-View Activity Recognition using Deep Learning
Authors:
Jun-Ho Choi,
Manri Cheon,
Min-Su Choi,
Jong-Seok Lee
Abstract:
Human activity recognition is one of the important research topics in computer vision and video understanding. It is often assumed that high quality video sequences are available for recognition. However, relaxing such a requirement and implementing robust recognition using videos having reduced data rates can achieve efficiency in storing and transmitting video data. Three-dimensional video scala…
▽ More
Human activity recognition is one of the important research topics in computer vision and video understanding. It is often assumed that high quality video sequences are available for recognition. However, relaxing such a requirement and implementing robust recognition using videos having reduced data rates can achieve efficiency in storing and transmitting video data. Three-dimensional video scalability, which refers to the possibility of reducing spatial, temporal, and quality resolutions of videos, is an effective way for flexible representation and management of video data. In this paper, we investigate the impact of the video scalability on multi-view activity recognition. We employ both a spatiotemporal feature extraction-based method and a deep learning-based method using convolutional and recurrent neural networks. The recognition performance of the two methods is examined, along with in-depth analysis regarding how their performance vary with respect to various scalability combinations. In particular, we demonstrate that the deep learning-based method can achieve significantly improved robustness in comparison to the feature-based method. Furthermore, we investigate optimal scalability combinations with respect to bitrate in order to provide useful guidelines for an optimal operation policy in resource-constrained activity recognition systems.
△ Less
Submitted 28 September, 2017;
originally announced September 2017.
-
Observation of spin-orbit insulator-like behavior in LaOBiS$_{2-x}$F$_x$ (0.05 $\leq$ $x$ $\leq$ 0.2)
Authors:
G. C. Kim,
M. Cheon,
Y. C. Kim,
R. -K. Ko
Abstract:
We report the effects of electron do** on the crystal structure and electrical resistivity of LaOBiS$_{2-x}$F$_x$ (0.05 $\leq$ $x$ $\leq$ 0.2). The $ab$ plane is found to be relatively insensitive to the amount of F, whereas the $c$ axis shrinks continuously with increasing $x$, suggesting that the doped F atoms substitute selectively into the apical sites in the BiS$_2$ layer. At $x$ = 0.10, as…
▽ More
We report the effects of electron do** on the crystal structure and electrical resistivity of LaOBiS$_{2-x}$F$_x$ (0.05 $\leq$ $x$ $\leq$ 0.2). The $ab$ plane is found to be relatively insensitive to the amount of F, whereas the $c$ axis shrinks continuously with increasing $x$, suggesting that the doped F atoms substitute selectively into the apical sites in the BiS$_2$ layer. At $x$ = 0.10, as the temperature is decreased from room temperature, the electrical resistivity is temperature-independent from room temperature to 285 K, increases linearly with decreasing temperature from 285 K to 205 K and then shows obvious insulating behavior below 205 K, which may be due to strong spin-orbit coupling. LaOBiS$_{1.9}$F$_{0.1}$ shows the significantly weak and temperature-independent diamagnetism without any evident anomalies caused by a phase transition.
△ Less
Submitted 16 December, 2015;
originally announced December 2015.
-
Do** dependence of phase coherence between superconducting Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ grains
Authors:
G. C. Kim,
M. Cheon,
Y. C. Kim
Abstract:
In the present work, we report the new findings on the do** level dependence of the phase coherence between superconducting Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ (Bi-2212) grains. The experimental results from the strongly underdoped and overdoped regimes deviated from the expectation based on the do** level dependence of the superfluid density at $T$ = 0 K. These findings appear to be governed by int…
▽ More
In the present work, we report the new findings on the do** level dependence of the phase coherence between superconducting Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ (Bi-2212) grains. The experimental results from the strongly underdoped and overdoped regimes deviated from the expectation based on the do** level dependence of the superfluid density at $T$ = 0 K. These findings appear to be governed by interplay between competing orders inside the superconducting dome of cuprate superconductors. Two quantum critical points are likely to exist at the underdoped and overdoped regimes beneath the superconducting dome.
△ Less
Submitted 16 November, 2014; v1 submitted 26 August, 2014;
originally announced August 2014.
-
A ferromagnetic-like phase transition in new oxychalcogenide HgOCuSe
Authors:
G. C. Kim,
M. Cheon,
I. S. Park,
D. Ahmad,
Y. C. Kim
Abstract:
We report the synthesis of a new oxychalcogenide HgOCuSe sample. The resistivity decreases as a function of $T^{1.75}$ with decreasing temperature from room temperature down to around 80 K. There exists a very sharp ferromagnetic-like phase transition at around 60 K under a field of $H$ = 100 Oe. Contrary to the usual ferromagnetic materials, the descending and ascending branches of the magnetic h…
▽ More
We report the synthesis of a new oxychalcogenide HgOCuSe sample. The resistivity decreases as a function of $T^{1.75}$ with decreasing temperature from room temperature down to around 80 K. There exists a very sharp ferromagnetic-like phase transition at around 60 K under a field of $H$ = 100 Oe. Contrary to the usual ferromagnetic materials, the descending and ascending branches of the magnetic hysteresis curve, at 30 K, are reversed in the whole irreversible field range and the reverse irreversibility decreases at 5 K.
△ Less
Submitted 30 May, 2011;
originally announced May 2011.
-
Low Temperature Magnetic Domain Patterns in MnAs Films Grown on GaAs(001)
Authors:
M. Cheon,
S. Hegde,
S. Wang,
M. M. Bishara,
G. B. Kim,
H. Luo
Abstract:
Magnetic properties of MnAs were studied as a function of temperature with a superconducting interference device (5 K to 350 K), and atomic force microscopy/magnetic force microscopy (20 K to 360 K). Structural and magnetic properties of MnAs depend on film thickness both near and far below the Curie temperature. In samples with coexisting ferromagnetic alpha-MnAs and paramagnetic beta-MnAs, the…
▽ More
Magnetic properties of MnAs were studied as a function of temperature with a superconducting interference device (5 K to 350 K), and atomic force microscopy/magnetic force microscopy (20 K to 360 K). Structural and magnetic properties of MnAs depend on film thickness both near and far below the Curie temperature. In samples with coexisting ferromagnetic alpha-MnAs and paramagnetic beta-MnAs, the domain structures are affected by the distribution of two phases. The magnetic domain structures below the temperature range of this mixed phase resemble that of a single domain structure with uniform magnetization along the easy axis, except there are regions elongated along the easy axis embedded where the magnetization is along the second easy axis, i.e., normal to the films. The shape of those regions and their temperature dependence are also related to the MnAs layer thickness.
△ Less
Submitted 8 September, 2004;
originally announced September 2004.
-
Growth and properties of ferromagnetic In(1-x)Mn(x)Sb alloys
Authors:
T. Wojtowicz,
W. L. Lim,
X. Liu,
G. Cywinski,
M. Kutrowski,
L. V. Titova,
K. Yee,
M. Dobrowolska,
J. K. Furdyna,
K. M. Yu,
W. Walukiewicz,
G. B. Kim,
M. Cheon,
X. Chen,
S. M. Wang,
H. Luo,
I. Vurgaftman,
J. R. Meyer
Abstract:
We discuss a new narrow-gap ferromagnetic (FM) semiconductor alloy, In(1-x)Mn(x)Sb, and its growth by low-temperature molecular-beam epitaxy. The magnetic properties were investigated by direct magnetization measurements, electrical transport, magnetic circular dichroism, and the magneto-optical Kerr effect. These data clearly indicate that In(1-x)Mn(x)Sb possesses all the attributes of a system…
▽ More
We discuss a new narrow-gap ferromagnetic (FM) semiconductor alloy, In(1-x)Mn(x)Sb, and its growth by low-temperature molecular-beam epitaxy. The magnetic properties were investigated by direct magnetization measurements, electrical transport, magnetic circular dichroism, and the magneto-optical Kerr effect. These data clearly indicate that In(1-x)Mn(x)Sb possesses all the attributes of a system with carrier-mediated FM interactions, including well-defined hysteresis loops, a cusp in the temperature dependence of the resistivity, strong negative magnetoresistance, and a large anomalous Hall effect. The Curie temperatures in samples investigated thus far range up to 8.5 K, which are consistent with a mean-field-theory simulation of the carrier-induced ferromagnetism based on the 8-band effective band-orbital method.
△ Less
Submitted 1 July, 2003;
originally announced July 2003.
-
In(1-x)Mn(x)Sb - a new narrow gap ferromagnetic semiconductor
Authors:
T. Wojtowicz,
G. Cywinski,
W. L. Lim,
X. Liu,
M. Dobrowolska,
J. K. Furdyna,
K. M. Yu,
W. Walukiewicz,
G. B. Kim,
M. Cheon,
X. Chen,
S. M. Wang,
H. Luo
Abstract:
A narrow-gap ferromagnetic In(1-x)Mn(x)Sb semiconductor alloy was successfully grown by low-temperature molecular beam epitaxy on CdTe/GaAs hybrid substrates. Ferromagnetic order in In(1-x)Mn(x)Sb was unambiguously established by the observation of clear hysteresis loops both in direct magnetization measurements and in the anomalous Hall effect, with Curie temperatures T_C ranging up to 8.5 K. T…
▽ More
A narrow-gap ferromagnetic In(1-x)Mn(x)Sb semiconductor alloy was successfully grown by low-temperature molecular beam epitaxy on CdTe/GaAs hybrid substrates. Ferromagnetic order in In(1-x)Mn(x)Sb was unambiguously established by the observation of clear hysteresis loops both in direct magnetization measurements and in the anomalous Hall effect, with Curie temperatures T_C ranging up to 8.5 K. The observed values of T_C agree well with the existing models of carrier-induced ferromagnetism.
△ Less
Submitted 11 March, 2003;
originally announced March 2003.
-
Above-Room-Temperature Ferromagnetism in GaSb/Mn Digital Alloys
Authors:
X. Chen,
M. Na,
M. Cheon,
S. Wang,
H. Luo,
B. D. McCombe,
X. Liu,
Y. Sasaki,
T. Wojtowicz,
J. K. Furdyna,
S. J. Potashnik,
P. Schiffer
Abstract:
Digital alloys of GaSb/Mn have been fabricated by molecular beam epitaxy. Transmission electron micrographs showed good crystal quality with individual Mn-containing layers well resolved; no evidence of 3D MnSb precipitates was seen in as-grown samples. All samples studied exhibited ferromagnetism with temperature dependent hysteresis loops in the magnetization accompanied by metallic p-type con…
▽ More
Digital alloys of GaSb/Mn have been fabricated by molecular beam epitaxy. Transmission electron micrographs showed good crystal quality with individual Mn-containing layers well resolved; no evidence of 3D MnSb precipitates was seen in as-grown samples. All samples studied exhibited ferromagnetism with temperature dependent hysteresis loops in the magnetization accompanied by metallic p-type conductivity with a strong anomalous Hall effect (AHE) up to 400 K (limited by the experimental setup). The anomalous Hall effect shows hysteresis loops at low temperatures and above room temperature very similar to those seen in the magnetization. The strong AHE with hysteresis indicates that the holes interact with the Mn spins above room temperature. All samples are metallic, which is important for spintronics applications.
* To whom correspondence should be addressed. E-mail: [email protected]
△ Less
Submitted 18 March, 2002;
originally announced March 2002.
-
Anisotropic Domain Growth of ANNNI Model at Low Temperatures
Authors:
Mookyung Cheon,
Iksoo Chang
Abstract:
We investigate the ordering kinetics for axial next nearest neighbor Ising (ANNNI) model in one and two dimensions by the multi-spin heat bath dynamical simulation. This dynamics enables us to overcome the pinning effect and to observe the dynamical scaling law for domain growth in the ANNNI model at zero temperature. The domain growth exponent is 1/2 isotropically both in the ferromagnetic and…
▽ More
We investigate the ordering kinetics for axial next nearest neighbor Ising (ANNNI) model in one and two dimensions by the multi-spin heat bath dynamical simulation. This dynamics enables us to overcome the pinning effect and to observe the dynamical scaling law for domain growth in the ANNNI model at zero temperature. The domain growth exponent is 1/2 isotropically both in the ferromagnetic and the dry-(commensurate) antiphase. In the wet-(commensurate) antiphase, however, it is approximately 1/3 in the modulated direction, whereas it remains 1/2 in the non-modulated direction. We suggest that these exponent values are dictated by 3 and 4 body diffusion-reaction processes of domain walls.
△ Less
Submitted 2 April, 2001;
originally announced April 2001.