Search | arXiv e-print repository

Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning

Authors: Inwoo Hwang, Yunhyeok Kwak, Suhyung Choi, Byoung-Tak Zhang, Sanghack Lee

Abstract: Causal dynamics learning has recently emerged as a promising approach to enhancing robustness in reinforcement learning (RL). Typically, the goal is to build a dynamics model that makes predictions based on the causal relationships among the entities. Despite the fact that causal connections often manifest only under certain contexts, existing approaches overlook such fine-grained relationships and lack a detailed understanding of the dynamics. In this work, we propose a novel dynamics model that infers fine-grained causal structures and employs them for prediction, leading to improved robustness in RL. The key idea is to jointly learn the dynamics model with a discrete latent variable that quantizes the state-action space into subgroups. This leads to recognizing meaningful context that displays sparse dependencies, where causal structures are learned for each subgroup throughout the training. Experimental results demonstrate the robustness of our method to unseen states and locally spurious correlations in downstream tasks where fine-grained causal reasoning is crucial. We further illustrate the effectiveness of our subgroup-based approach with quantization in discovering fine-grained causal relationships compared to prior methods.

Submitted 5 June, 2024; originally announced June 2024.

Comments: ICML 2024

arXiv:2406.00614 [pdf, other]

Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction

Authors: Yunhyeok Kwak, Inwoo Hwang, Dooyoung Kim, Sanghack Lee, Byoung-Tak Zhang

Abstract: Monte Carlo Tree Search (MCTS) has showcased its efficacy across a broad spectrum of decision-making problems. However, its performance often degrades under vast combinatorial action space, especially where an action is composed of multiple sub-actions. In this work, we propose an action abstraction based on the compositional structure between a state and sub-actions for improving the efficiency of MCTS under a factored action space. Our method learns a latent dynamics model with an auxiliary network that captures sub-actions relevant to the transition on the current state, which we call state-conditioned action abstraction. Notably, it infers such compositional relationships from high-dimensional observations without the known environment model. During the tree traversal, our method constructs the state-conditioned action abstraction for each node on-the-fly, reducing the search space by discarding the exploration of redundant sub-actions. Experimental results demonstrate the superior sample efficiency of our method compared to vanilla MuZero, which suffers from expansive action space.

Submitted 2 June, 2024; originally announced June 2024.

Comments: UAI 2024 (Oral). The first two authors contributed equally

arXiv:2405.07220 [pdf, other]

On Discovery of Local Independence over Continuous Variables via Neural Contextual Decomposition

Authors: Inwoo Hwang, Yunhyeok Kwak, Yeon-Ji Song, Byoung-Tak Zhang, Sanghack Lee

Abstract: Conditional independence provides a way to understand causal relationships among the variables of interest. An underlying system may exhibit more fine-grained causal relationships especially between a variable and its parents, which will be called the local independence relationships. One of the most widely studied local relationships is Context-Specific Independence (CSI), which holds in a specific assignment of conditioned variables. However, its applicability is often limited since it does not allow continuous variables: data conditioned to the specific value of a continuous variable contains few instances, if not none, making it infeasible to test independence. In this work, we define and characterize the local independence relationship that holds in a specific set of joint assignments of parental variables, which we call context-set specific independence (CSSI). We then provide a canonical representation of CSSI and prove its fundamental properties. Based on our theoretical findings, we cast the problem of discovering multiple CSSI relationships in a system as finding a partition of the joint outcome space. Finally, we propose a novel method, coined neural contextual decomposition (NCD), which learns such partition by imposing each set to induce CSSI via modeling a conditional distribution. We empirically demonstrate that the proposed method successfully discovers the ground truth local independence relationships in both synthetic dataset and complex system reflecting the real-world physical dynamics.

Submitted 12 May, 2024; originally announced May 2024.

Comments: Conference on Causal Learning and Reasoning (CLeaR), 2023

arXiv:2401.16536 [pdf, other]

Saccade-Contingent Rendering

Authors: Yuna Kwak, Eric Penner, Xuan Wang, Mohammad R. Saeedpour-Parizi, Olivier Mercier, Xiuyun Wu, T. Scott Murdison, Phillip Guan

Abstract: Battery-constrained power consumption, compute limitations, and high frame rate requirements in head-mounted displays present unique challenges in the drive to present increasingly immersive and comfortable imagery in virtual reality. However, humans are not equally sensitive to all regions of the visual field, and perceptually-optimized rendering techniques are increasingly utilized to address these bottlenecks. Many of these techniques are gaze-contingent and often render reduced detail away from a user's fixation. Such techniques are dependent on spatio-temporally-accurate gaze tracking and can result in obvious visual artifacts when eye tracking is inaccurate. In this work we present a gaze-contingent rendering technique which only requires saccade detection, bypassing the need for highly-accurate eye tracking. In our first experiment, we show that visual acuity is reduced for several hundred milliseconds after a saccade. In our second experiment, we use these results to reduce the rendered image resolution after saccades in a controlled psychophysical setup, and find that observers cannot discriminate between saccade-contingent reduced-resolution rendering and full-resolution rendering. Finally, in our third experiment, we introduce a 90 pixels per degree headset and validate our saccade-contingent rendering method under typical VR viewing conditions.

Submitted 29 January, 2024; originally announced January 2024.

Comments: main paper and supplementary materials

arXiv:2311.11470 [pdf, ps, other]

1st Place in ICCV 2023 Workshop Challenge Track 1 on Resource Efficient Deep Learning for Computer Vision: Budgeted Model Training Challenge

Authors: Youngjun Kwak, Seonghun Jeong, Yunseung Lee, Changick Kim

Abstract: The budgeted model training challenge aims to train an efficient classification model under resource limitations. To tackle this task in ImageNet-100, we describe a simple yet effective resource-aware backbone search framework composed of profile and instantiation phases. In addition, we employ multi-resolution ensembles to boost inference accuracy on limited resources. The profile phase obeys time and memory constraints to determine the models' optimal batch-size, max epochs, and automatic mixed precision (AMP). And the instantiation phase trains models with the determined parameters from the profile phase. For improving intra-domain generalizations, the multi-resolution ensembles are formed by two-resolution images with randomly applied flips. We present a comprehensive analysis with expensive experiments. Based on our approach, we win first place in International Conference on Computer Vision (ICCV) 2023 Workshop Challenge Track 1 on Resource Efficient Deep Learning for Computer Vision (RCV).

Submitted 9 August, 2023; originally announced November 2023.

Comments: ICCV 2023 Workshop Challenge Track 1 on RCV

arXiv:2308.00558 [pdf, other]

Gradient Scaling on Deep Spiking Neural Networks with Spike-Dependent Local Information

Authors: Seongsik Park, Jeonghee Jo, Jongkil Park, Yeonjoo Jeong, Jaewook Kim, Suyoun Lee, Joon Young Kwak, Inho Kim, Jong-Keuk Park, Kyeong Seok Lee, Gye Weon Hwang, Hyun Jae Jang

Abstract: Deep spiking neural networks (SNNs) are promising neural networks for their model capacity from deep neural network architecture and energy efficiency from SNNs' operations. To train deep SNNs, recently, spatio-temporal backpropagation (STBP) with surrogate gradient was proposed. Although deep SNNs have been successfully trained with STBP, they cannot fully utilize spike information. In this work, we proposed gradient scaling with local spike information, which is the relation between pre- and post-synaptic spikes. Considering the causality between spikes, we could enhance the training performance of deep SNNs. According to our experiments, we could achieve higher accuracy with fewer spikes by adopting the gradient scaling on image classification tasks, such as CIFAR10 and CIFAR100.

Submitted 1 August, 2023; originally announced August 2023.

Comments: ICML-23 Localized Learning Workshop: Decentralized Model Updates via Non-Global Objectives

arXiv:2307.12459 [pdf, other]

Robust face anti-spoofing framework with Convolutional Vision Transformer

Authors: Yunseung Lee, Youngjun Kwak, Jinho Shin

Abstract: Owing to the advances in image processing technology and large-scale datasets, companies have implemented facial authentication processes, thereby stimulating increased focus on face anti-spoofing (FAS) against realistic presentation attacks. Recently, various attempts have been made to improve face recognition performance using both global and local learning on face images; however, to the best of our knowledge, this is the first study to investigate whether the robustness of FAS against domain shifts is improved by considering global information and local cues in face images captured using self-attention and convolutional layers. This study proposes a convolutional vision transformer-based framework that achieves robust performance for various unseen domain data. Our model resulted in 7.3%$p$ and 12.9%$p$ increases in FAS performance compared to models using only a convolutional neural network or vision transformer, respectively. It also shows the highest average rank in sub-protocols of cross-dataset setting over the other nine benchmark models for domain generalization.

Submitted 23 July, 2023; originally announced July 2023.

Comments: ICIP 2023

arXiv:2307.12450 [pdf, other]

ProtoFL: Unsupervised Federated Learning via Prototypical Distillation

Authors: Hansol Kim, Youngjun Kwak, Minyoung Jung, Jinho Shin, Youngsung Kim, Changick Kim

Abstract: Federated learning (FL) is a promising approach for enhancing data privacy preservation, particularly for authentication systems. However, limited round communications, scarce representation, and scalability pose significant challenges to its deployment, hindering its full potential. In this paper, we propose 'ProtoFL', Prototypical Representation Distillation based unsupervised Federated Learning to enhance the representation power of a global model and reduce round communication costs. Additionally, we introduce a local one-class classifier based on normalizing flows to improve performance with limited data. Our study represents the first investigation of using FL to improve one-class classification performance. We conduct extensive experiments on five widely used benchmarks, namely MNIST, CIFAR-10, CIFAR-100, ImageNet-30, and Keystroke-Dynamics, to demonstrate the superior performance of our proposed framework over previous methods in the literature.

Submitted 7 August, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

Comments: Accepted by ICCV 2023. Hansol Kim and Youngjun Kwak contributed equally to this work

arXiv:2302.09461 [pdf, other]

Liveness score-based regression neural networks for face anti-spoofing

Authors: Youngjun Kwak, Minyoung Jung, Hunjae Yoo, JinHo Shin, Changick Kim

Abstract: Previous anti-spoofing methods have used either pseudo maps or user-defined labels, and the performance of each approach depends on the accuracy of the third party networks generating pseudo maps and the way in which the users define the labels. In this paper, we propose a liveness score-based regression network for overcoming the dependency on third party networks and users. First, we introduce a new labeling technique, called pseudo-discretized label encoding, for generating discretized labels indicating the amount of information related to real images. Secondly, we suggest the expected liveness score based on a regression network for training the difference between the proposed supervision and the expected liveness score. Finally, extensive experiments were conducted on four face anti-spoofing benchmarks to verify our proposed method on both intra- and cross-dataset tests. The experimental results show our approach outperforms previous methods.

Submitted 20 March, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

Comments: Submission to ICASSP 2023

arXiv:2211.02291 [pdf, other]

SelecMix: Debiased Learning by Contradicting-pair Sampling

Authors: Inwoo Hwang, Sangjun Lee, Yunhyeok Kwak, Seong Joon Oh, Damien Teney, Jin-Hwa Kim, Byoung-Tak Zhang

Abstract: Neural networks trained with ERM (empirical risk minimization) sometimes learn unintended decision rules, in particular when their training data is biased, i.e., when training labels are strongly correlated with undesirable features. To prevent a network from learning such features, recent methods augment training data such that examples displaying spurious correlations (i.e., bias-aligned examples) become a minority, whereas the other, bias-conflicting examples become prevalent. However, these approaches are sometimes difficult to train and scale to real-world data because they rely on generative models or disentangled representations. We propose an alternative based on mixup, a popular augmentation that creates convex combinations of training examples. Our method, coined SelecMix, applies mixup to contradicting pairs of examples, defined as showing either (i) the same label but dissimilar biased features, or (ii) different labels but similar biased features. Identifying such pairs requires comparing examples with respect to unknown biased features. For this, we utilize an auxiliary contrastive model with the popular heuristic that biased features are learned preferentially during training. Experiments on standard benchmarks demonstrate the effectiveness of the method, in particular when label noise complicates the identification of bias-conflicting examples.

Submitted 4 November, 2022; originally announced November 2022.

Comments: NeurIPS 2022

arXiv:2208.07542 [pdf, ps, other]

An Immersed Weak Galerkin Method for Elliptic Interface Problems on Polygonal Meshes

Authors: Hyeokjoo Park, Do Y. Kwak

Abstract: In this paper we present an immersed weak Galerkin method for solving second-order elliptic interface problems on polygonal meshes, where the meshes do not need to be aligned with the interface. The discrete space consists of constants on each edge and broken linear polynomials satisfying the interface conditions in each element. For triangular meshes, such broken linear polynomials coincide with the basis functions in immersed finite element methods [26]. We establish some approximation properties of the broken linear polynomials and the discrete weak gradient of a certain projection of the solution on polygonal meshes. We then prove an optimal error estimate of our scheme in the discrete $H^1$-seminorm under some assumptions on the exact solution. Numerical experiments are provided to confirm our theoretical analysis.

Submitted 16 August, 2022; originally announced August 2022.

MSC Class: 65N12; 65N15; 65N30; 35J15

arXiv:2203.14094 [pdf, other]

SlimFL: Federated Learning with Superposition Coding over Slimmable Neural Networks

Authors: Won Joon Yun, Yunseok Kwak, Hankyul Baek, Soyi Jung, Mingyue Ji, Mehdi Bennis, Jihong Park, Joongheon Kim

Abstract: Federated learning (FL) is a key enabler for efficient communication and computing, leveraging devices' distributed computing capabilities. However, applying FL in practice is challenging due to the local devices' heterogeneous energy, wireless channel conditions, and non-independently and identically distributed (non-IID) data distributions. To cope with these issues, this paper proposes a novel learning framework by integrating FL and width-adjustable slimmable neural networks (SNN). Integrating FL with SNNs is challenging due to time-varying channel conditions and data distributions. In addition, existing multi-width SNN training algorithms are sensitive to the data distributions across devices, which makes SNN ill-suited for FL. Motivated by this, we propose a communication and energy-efficient SNN-based FL (named SlimFL) that jointly utilizes superposition coding (SC) for global model aggregation and superposition training (ST) for updating local models. By applying SC, SlimFL exchanges the superposition of multiple-width configurations decoded as many times as possible for a given communication throughput. Leveraging ST, SlimFL aligns the forward propagation of different width configurations while avoiding inter-width interference during backpropagation. We formally prove the convergence of SlimFL. The result reveals that SlimFL is not only communication-efficient but also deals with non-IID data distributions and poor channel conditions, which is also corroborated by data-intensive simulations.

Submitted 21 December, 2022; v1 submitted 26 March, 2022; originally announced March 2022.

Comments: arXiv admin note: text overlap with arXiv:2112.02543

arXiv:2203.10443 [pdf, other]

Quantum Multi-Agent Reinforcement Learning via Variational Quantum Circuit Design

Authors: Won Joon Yun, Yunseok Kwak, Jae Pyoung Kim, Hyunhee Cho, Soyi Jung, Jihong Park, Joongheon Kim

Abstract: In recent years, quantum computing (QC) has been getting a lot of attention from industry and academia. In particular, among various QC research topics, the variational quantum circuit (VQC) enables quantum deep reinforcement learning (QRL). Many studies of QRL have shown that QRL is superior to classical reinforcement learning (RL) methods under constraints on the number of training parameters. This paper extends QRL to quantum multi-agent RL (QMARL) and demonstrates it. However, the extension of QRL to QMARL is not straightforward due to the challenges of noisy intermediate-scale quantum (NISQ) devices and the non-stationary properties of classical multi-agent RL (MARL). Therefore, this paper proposes a centralized training and decentralized execution (CTDE) QMARL framework, designing novel VQCs for the framework to cope with these issues. To corroborate the QMARL framework, this paper conducts a QMARL demonstration in a single-hop environment where edge agents offload packets to clouds. The extensive demonstration shows that the proposed QMARL framework improves total reward by 57.7% over classical frameworks.

Submitted 19 March, 2022; originally announced March 2022.

arXiv:2202.11200 [pdf, other]

Quantum Distributed Deep Learning Architectures: Models, Discussions, and Applications

Authors: Yunseok Kwak, Won Joon Yun, Jae Pyoung Kim, Hyunhee Cho, Minseok Choi, Soyi Jung, Joongheon Kim

Abstract: Although deep learning (DL) has already become a state-of-the-art technology for various data processing tasks, data security and computational overload problems often arise due to its high data and computational power dependency. To solve this problem, quantum deep learning (QDL) and distributed deep learning (DDL) have emerged to complement existing DL methods. Furthermore, a quantum distributed deep learning (QDDL) technique that combines and maximizes these advantages is getting attention. This paper compares several model structures for QDDL and discusses their possibilities and limitations to leverage QDDL for some representative application scenarios.

Submitted 7 April, 2022; v1 submitted 19 February, 2022; originally announced February 2022.

arXiv:2112.02543 [pdf, other]

Joint Superposition Coding and Training for Federated Learning over Multi-Width Neural Networks

Authors: Hankyul Baek, Won Joon Yun, Yunseok Kwak, Soyi Jung, Mingyue Ji, Mehdi Bennis, Jihong Park, Joongheon Kim

Abstract: This paper aims to integrate two synergetic technologies, federated learning (FL) and width-adjustable slimmable neural network (SNN) architectures. FL preserves data privacy by exchanging the locally trained models of mobile devices. By adopting SNNs as local models, FL can flexibly cope with the time-varying energy capacities of mobile devices. Combining FL and SNNs is however non-trivial, particularly under wireless connections with time-varying channel conditions. Furthermore, existing multi-width SNN training algorithms are sensitive to the data distributions across devices, so are ill-suited to FL. Motivated by this, we propose a communication and energy-efficient SNN-based FL (named SlimFL) that jointly utilizes superposition coding (SC) for global model aggregation and superposition training (ST) for updating local models. By applying SC, SlimFL exchanges the superposition of multiple width configurations that are decoded as many as possible for a given communication throughput. Leveraging ST, SlimFL aligns the forward propagation of different width configurations, while avoiding the inter-width interference during backpropagation. We formally prove the convergence of SlimFL. The result reveals that SlimFL is not only communication-efficient but also can counteract non-IID data distributions and poor channel conditions, which is also corroborated by simulations.

Submitted 5 December, 2021; originally announced December 2021.

Comments: 10 pages, 7 figures, Accepted to IEEE INFOCOM 2022

arXiv:2109.08388 [pdf, ps, other]

Mixed virtual volume methods for elliptic problems

Authors: Gwanghyun Jo, Do Y. Kwak

Abstract: We develop a class of mixed virtual volume methods for elliptic problems on polygonal/polyhedral grids. Unlike the mixed virtual element methods introduced in \cite{brezzi2014basic,da2016mixed}, our methods are reduced to symmetric, positive definite problems for the primary variable without using Lagrangian multipliers. We start from the usual way of changing the given equation into a mixed system using Darcy's law, $\mathbf{u}=-\mathcal{K} \nabla p$. By integrating the system of equations with some judiciously chosen test spaces on each element, we define new mixed virtual volume methods of all orders. We show that these new schemes are equivalent to the nonconforming virtual element methods for the primal variable $p$. Once the primary variable is computed by solving the symmetric, positive definite system, all the degrees of freedom for the Darcy velocity are locally computed. Also, the $L^2$-projection onto the polynomial space is easy to compute. Hence our work opens an easy way to compute Darcy velocity on polygonal/polyhedral grids. For the lowest order case, we give a formula to compute a Raviart-Thomas space like representation which satisfies the conservation law. An optimal error analysis is carried out and numerical results are presented which support the theory.

Submitted 17 September, 2021; originally announced September 2021.

MSC Class: 65N15; 65N30

arXiv:2108.09971 [pdf, ps, other]

doi 10.1016/j.cma.2021.114448

Lowest-order virtual element methods for linear elasticity problems

Authors: Do Y. Kwak, Hyeokjoo Park

Abstract: We present two kinds of lowest-order virtual element methods for planar linear elasticity problems. For the first one we use the nonconforming virtual element method with a stabilizing term. It can be interpreted as a modification of the nonconforming Crouzeix-Raviart finite element method as suggested in [22] to the virtual element method. For the second one we use the conforming virtual element for one component of the displacement vector and the nonconforming virtual element for the other. This approach can be seen as an extension of the idea of Kouhia and Stenberg suggested in [23] to the virtual element method. We show that our proposed methods satisfy the discrete Korn's inequality. We also prove that the methods are convergent uniformly for the nearly incompressible case and the convergence rates are optimal.

Submitted 23 August, 2021; originally announced August 2021.

MSC Class: 65N12; 65N15; 65N30

arXiv:2108.09967 [pdf, ps, other]

A formal construction of a divergence-free basis in the nonconforming virtual element method for the Stokes problem

Authors: Do Y. Kwak, Hyeokjoo Park

Abstract: We develop a formal construction of a pointwise divergence-free basis in the nonconforming virtual element method of arbitrary order for the Stokes problem introduced in [19]. The proposed construction can be seen as a generalization of the divergence-free basis in Crouzeix-Raviart finite element space [10, 17] to the virtual element space. Using the divergence-free basis obtained from our construction, we can eliminate the pressure variable from the mixed system and obtain a symmetric positive definite system. Several numerical tests are presented to confirm the efficiency and the accuracy of our construction.

Submitted 23 August, 2021; originally announced August 2021.

MSC Class: 65N12; 65N30; 76D07

arXiv:2108.06849 [pdf, other]

Introduction to Quantum Reinforcement Learning: Theory and PennyLane-based Implementation

Authors: Yunseok Kwak, Won Joon Yun, Soyi Jung, Jong-Kook Kim, Joongheon Kim

Abstract: The emergence of quantum computing enables researchers to apply quantum circuits to many existing studies. Utilizing quantum circuits and quantum differential programming, much research is being conducted in areas such as \textit{Quantum Machine Learning} (QML). In particular, quantum reinforcement learning is a good field to test the possibility of quantum machine learning, and a lot of research is being done. This work will introduce the concept of quantum reinforcement learning using a variational quantum circuit, and confirm its possibility through implementation and experimentation. We will first present the background knowledge and working principle of quantum reinforcement learning, and then guide the implementation method using the PennyLane library. We will also discuss the power and possibility of quantum reinforcement learning from the experimental results obtained through this work.

Submitted 15 August, 2021; originally announced August 2021.

arXiv:2108.01468 [pdf, other]

Quantum Neural Networks: Concepts, Applications, and Challenges

Authors: Yunseok Kwak, Won Joon Yun, Soyi Jung, Joongheon Kim

Abstract: Quantum deep learning is a research field for the use of quantum computing techniques for training deep neural networks. The research topics and directions of deep learning and quantum computing have been separated for a long time; however, since the discovery that quantum circuits can act like artificial neural networks, quantum deep learning research has been widely adopted. This paper explains the backgrounds and basic principles of quantum deep learning and also introduces major achievements. After that, this paper discusses the challenges of quantum deep learning research from multiple perspectives. Lastly, this paper presents various future research directions and application fields of quantum deep learning.

Submitted 2 August, 2021; originally announced August 2021.

arXiv:2108.00626 [pdf, ps, other]

Quantum Scheduling for Millimeter-Wave Observation Satellite Constellation

Authors: Joongheon Kim, Yunseok Kwak, Soyi Jung, Jae-Hyun Kim

Abstract: In beyond 5G and 6G network scenarios, the use of satellites has been actively discussed for extending target monitoring areas, even for extreme circumstances, where the monitoring functionalities can be realized due to the usage of millimeter-wave wireless links. This paper designs an efficient scheduling algorithm which minimizes overlapping monitoring areas among observation satellite constellation. In order to achieve this objective, a quantum optimization based algorithm is used because the overlapping can be mathematically modelled via a max-weight independent set (MWIS) problem, which is one of the well-known NP-hard problems.

Submitted 2 August, 2021; originally announced August 2021.

arXiv:2107.07041 [pdf, other]

Mitigating Memorization in Sample Selection for Learning with Noisy Labels

Authors: Kyeongbo Kong, Junggi Lee, Youngchul Kwak, Young-Rae Cho, Seong-Eun Kim, Woo-** Song

Abstract: Because deep learning is vulnerable to noisy labels, sample selection techniques, which train networks with only clean labeled data, have attracted great attention. However, if the labels are dominantly corrupted by a few classes (such noisy samples are called dominant-noisy-labeled samples), the network also learns dominant-noisy-labeled samples rapidly via content-aware optimization. In this study, we propose compelling criteria to penalize dominant-noisy-labeled samples intensively through class-wise penalty labels. By averaging prediction confidences for each observed label, we obtain suitable penalty labels that have high values if the labels are largely corrupted by some classes. Experiments were performed using benchmarks (CIFAR-10, CIFAR-100, Tiny-ImageNet) and real-world datasets (ANIMAL-10N, Clothing1M) to evaluate the proposed criteria in various scenarios with different noise rates. Using the proposed sample selection, the learning process of the network becomes significantly robust to noisy labels compared to existing methods in several noise types.

Submitted 8 July, 2021; originally announced July 2021.

Comments: 14 pages, 9 figures, spotlight presented at the ICML 2021 Workshop on Subset Selection in ML

arXiv:2101.00241 [pdf, ps, other]

Locally conservative immersed finite element method for elliptic interface problems

Authors: Gwanghyun Jo, Do Young Kwak, Young Ju Lee

Abstract: In this paper, we introduce the locally conservative enriched immersed finite element method (EIFEM) to tackle the elliptic problem with interface. The immersed finite element is useful for handling interface with mesh unfit with the interface. However, all the currently available methods under the IFEM framework may not be designed to consider the flux conservation. We provide an efficient and effective remedy for this issue by introducing a local piecewise constant enrichment, which provides the locally conservative flux. We have also constructed and analyzed an auxiliary space preconditioner for the resulting system based on the application of algebraic multigrid method. The new observation in this work is that by imposing strong Dirichlet boundary condition for the standard IFEM part of EIFEM, we are able to remove the zero eigen-mode of the EIFEM system while still imposing the Dirichlet boundary condition weakly assigned to the piecewise constant enrichment part of EIFEM. A couple of issues relevant to the piecewise constant enrichment given for the mesh unfit to the interface have been discussed and clarified as well. Numerical tests are provided to confirm the theoretical development.

Submitted 1 January, 2021; originally announced January 2021.

arXiv:2010.10986 [pdf]

Highly-scalable stochastic neuron based on Ovonic Threshold Switch (OTS) and its applications in Restricted Boltzmann Machine (RBM)

Authors: Seong-il Im, Hye** Lee, Jaesang Lee, Jae-Seung Jeong, Joon Young Kwak, Keunsu Kim, Jeong Ho Cho, Hyunsu Ju, Suyoun Lee

Abstract: Interest in Restricted Boltzmann Machine (RBM) is growing as a generative stochastic artificial neural network to implement a novel energy-efficient machine-learning (ML) technique. For a hardware implementation of the RBM, an essential building block is a reliable stochastic binary neuron device that generates random spikes following the Boltzmann distribution. Here, we propose a highly-scalable stochastic neuron device based on Ovonic Threshold Switch (OTS) which utilizes the random emission and capture process of traps as the source of stochasticity. The switching probability is well described by the Boltzmann distribution, which can be controlled by operating parameters. As a candidate for a true random number generator (TRNG), it passes 15 among the 16 tests of the National Institute of Standards and Technology (NIST) Statistical Test Suite (Special Publication 800-22). In addition, the recognition task of handwritten digits (MNIST) is demonstrated using a simulated RBM network consisting of the proposed device with a maximum recognition accuracy of 86.07 %. Furthermore, reconstruction of images is successfully demonstrated using images contaminated with noises, resulting in images with the noise removed. These results show the promising properties of OTS-based stochastic neuron devices for applications in RBM systems.

Submitted 21 October, 2020; originally announced October 2020.

arXiv:2009.13703 [pdf]

Frequency-tunable nano-oscillator based on Ovonic Threshold Switch (OTS)

Authors: Seon Jeong Kim, Seong Won Cho, Hye** Lee, Jaesang Lee, Tae Yeon Seong, Inho Kim, Jong-Keuk Park, Joon Young Kwak, Jaewook Kim, Jongkil Park, YeonJoo Jeong, Gyu Weon Hwang, Kyeong Seok Lee, Suyoun Lee

Abstract: Nano-oscillator devices are gaining more and more attention as a prerequisite for developing novel energy-efficient computing systems based on coupled oscillators. Here, we introduce a highly scalable, frequency-tunable nano-oscillator consisting of one Ovonic threshold switch (OTS) and a field-effect transistor (FET). It is presented that the proposed device shows an oscillating behavior with a natural frequency (f_{nat}) adjustable from 0.5 to 2 MHz depending on the gate voltage applied to the FET. In addition, under a small periodic input, it is observed that the oscillating frequency (f_{osc}) of the device is locked to the frequency (f_{in}) of the input when f_{in} ~ f_{nat}, demonstrating the so-called synchronization phenomenon. It also shows the phase lock of the combined oscillator network using circuit simulation, where the phase relation between the oscillators can be controlled by the coupling strength. These results imply that the proposed device is promising for applications in oscillator-based computing systems.

Submitted 28 September, 2020; originally announced September 2020.

Comments: 18 pages including 5 figures

arXiv:1910.07331 [pdf, other]

A Generalized and Robust Method Towards Practical Gaze Estimation on Smart Phone

Authors: Tianchu Guo, Yongchao Liu, Hui Zhang, Xiabing Liu, Youngjun Kwak, Byung In Yoo, Jae-Joon Han, Changkyu Choi

Abstract: Gaze estimation for ordinary smart phone, i.e. estimating where the user is looking on the phone screen, can be applied in various applications. However, the widely used appearance-based CNN methods still have two issues for practical adoption. First, due to the limited dataset, gaze estimation is very likely to suffer from over-fitting, leading to poor accuracy at run time. Second, the current methods are usually not robust, i.e. their prediction results have notable jitters even when the user is performing gaze fixation, which degrades user experience greatly. For the first issue, we propose a new tolerant and talented (TAT) training scheme, which is an iterative random knowledge distillation framework enhanced with cosine similarity pruning and aligned orthogonal initialization. The knowledge distillation is a tolerant teaching process providing diverse and informative supervision. The enhanced pruning and initialization is a talented learning process prompting the network to escape from the local minima and re-born from a better start. For the second issue, we define a new metric to measure the robustness of gaze estimator, and propose an adversarial training based Disturbance with Ordinal loss (DwO) method to improve it. The experimental results show that our TAT method achieves state-of-the-art performance on GazeCapture dataset, and that our DwO method improves the robustness while keeping comparable accuracy.

Submitted 16 October, 2019; originally announced October 2019.

Comments: Accepted by ICCV 2019 Workshop. Fix the error of the Figure 1 in the camera ready file

arXiv:1810.08381 [pdf]

doi 10.1103/PhysRevApplied.13.064056

A highly scalable and energy-efficient artificial neuron using an Ovonic Threshold Switch (OTS) featuring the spike-frequency adaptation and chaotic activity

Authors: Milim Lee, Youngjo Kim, Seong Won Cho, Joon Young Kwak, Hyunsu Ju, Yeon** Yi, Byung-ki Cheong, Suyoun Lee

Abstract: As an essential building block for developing a large-scale brain-inspired computing system, we present a highly scalable and energy-efficient artificial neuron device composed of an Ovonic Threshold Switch (OTS) and a few passive electrical components. It shows not only the basic integrate-and-fire (I&F) function and the rate coding ability, but also the spike-frequency adaptation (SFA) property and the chaotic activity. The latter two, being the most common features found in the mammalian cortex, are particularly essential for the realization of the energy-efficient signal processing, learning, and adaptation to environments [1-3], but have been hard to achieve up to now. Furthermore, with our OTS-based neuron device employing the reservoir computing technique combined with delayed feedback dynamics, spoken-digit recognition task has been performed with a considerable degree of recognition accuracy. From a comparison with a Mott memristor-based artificial neuron device, it is shown that the OTS-based artificial neuron is much more energy-efficient by about 100 times. These results show that our OTS-based artificial neuron device is promising for the application in the development of a large-scale brain-inspired computing system.

Submitted 19 October, 2018; originally announced October 2018.

Journal ref: Phys. Rev. Applied 13, 064056 (2020)

arXiv:1808.05779 [pdf, other]

Learning to Quantize Deep Networks by Optimizing Quantization Intervals with Task Loss

Authors: Sangil Jung, Changyong Son, Seohyung Lee, **woo Son, Youngjun Kwak, Jae-Joon Han, Sung Ju Hwang, Changkyu Choi

Abstract: Reducing bit-widths of activations and weights of deep networks makes it efficient to compute and store them in memory, which is crucial in their deployments to resource-limited devices, such as mobile phones. However, decreasing bit-widths with quantization generally yields drastically degraded accuracy. To tackle this problem, we propose to learn to quantize activations and weights via a trainable quantizer that transforms and discretizes them. Specifically, we parameterize the quantization intervals and obtain their optimal values by directly minimizing the task loss of the network. This quantization-interval-learning (QIL) allows the quantized networks to maintain the accuracy of the full-precision (32-bit) networks with bit-width as low as 4-bit and minimize the accuracy degeneration with further bit-width reduction (i.e., 3 and 2-bit). Moreover, our quantizer can be trained on a heterogeneous dataset, and thus can be used to quantize pretrained networks without access to their training data. We demonstrate the effectiveness of our trainable quantizer on ImageNet dataset with various network architectures such as ResNet-18, -34 and AlexNet, on which it outperforms existing methods to achieve the state-of-the-art accuracy.

Submitted 22 November, 2018; v1 submitted 17 August, 2018; originally announced August 2018.

arXiv:1801.08022 [pdf]

Interplay between superconductivity and magnetism in one-unit-cell LaAlO3 capped with SrTiO3

Authors: Yongsu Kwak, Woojoo Han, Thach D. N. Ngo, Dorj Odkhuu, Jihwan Kim, Young Heon Kim, Noejung Park, Sonny H. Rhim, Myung-Hwa Jung, Junho Suh, Seung-Bo Shim, Mahn-Soo Choi, Yong-Joo Doh, Joon Sung Lee, Jonghyun Song, **hee Kim

Abstract: To form a conducting layer at the interface between the oxide insulators LaAlO3 and SrTiO3, the LaAlO3 layer on the SrTiO3 substrate must be at least four unit-cells-thick. The LaAlO3 SrTiO3 heterointerface thus formed exhibits various intriguing phenomena such as ferromagnetism and superconductivity. It has been widely studied for being a low-dimensional ferromagnetic oxide superconducting system with a strong gate-tunable spin-orbit interaction. However, its lack of stability and environmental susceptiveness have been an obstacle to its further experimental investigations and applications. Here, we demonstrate that capping the bilayer with SrTiO3 relieves this thickness limit, while enhancing the stability and controllability of the interface. In addition, the SrTiO3-capped LaAlO3 exhibits unconventional superconductivity; the critical current dramatically increases under a parallel magnetic field, and shows a reversed hysteresis contrary to the conventional hysteresis of magnetoresistance. Its superconducting energy gap of $Δ\sim 1.31k_BT_c$ also deviates from conventional BCS-type superconductivity. The oxide trilayer could be a robust platform for studying the extraordinary interplay of superconductivity and ferromagnetism at the interface electron system between LaAlO3 and SrTiO3.

Submitted 24 January, 2018; originally announced January 2018.

Comments: 17 pages, 3 figures

arXiv:1710.11365 [pdf]

Coulomb drag transistor via graphene/MoS2 heterostructures

Authors: Youngjo **, Min-Kyu Joo, Byoung Hee Moon, Hyun Kim, Sanghyup Lee, Hye Yun Jeong, Hyo Yeol Kwak, Young Hee Lee

Abstract: Two-dimensional (2D) heterointerfaces often provide extraordinary carrier transport as exemplified by superconductivity or excitonic superfluidity. Recently, double-layer graphene separated by few-layered boron nitride demonstrated the Coulomb drag phenomenon: carriers in the active layer drag the carriers in the passive layer. Here, we propose a new switching device operating via Coulomb drag interaction at a graphene/MoS2 (GM) heterointerface. The ideal van der Waals distance allows strong coupling of the interlayer electron-hole pairs, whose recombination is prevented by the Schottky barrier formed due to charge transfer at the heterointerface. This device exhibits a high carrier mobility (up to ~3,700 cm^2V^-1s^-1) even at room temperature, while maintaining a high on/off current ratio (~10^8), outperforming those of individual layers. In the electron-electron drag regime, graphene-like Shubnikov-de Haas oscillations are observed at low temperatures. Our Coulomb drag transistor could provide a shortcut for the practical application of quantum-mechanical 2D heterostructures at room temperature.

Submitted 31 October, 2017; originally announced October 2017.

Comments: 14 pages, 4 figures

arXiv:1703.07140 [pdf, other]

Deep generative-contrastive networks for facial expression recognition

Authors: Youngsung Kim, ByungIn Yoo, Youngjun Kwak, Changkyu Choi, Junmo Kim

Abstract: As the expressive depth of an emotional face differs with individuals or expressions, recognizing an expression using a single facial image at a moment is difficult. A relative expression of a query face compared to a reference face might alleviate this difficulty. In this paper, we propose to utilize contrastive representation that embeds a distinctive expressive factor for a discriminative purpose. The contrastive representation is calculated at the embedding layer of deep networks by comparing a given (query) image with the reference image. We attempt to utilize a generative reference image that is estimated based on the given image. Consequently, we deploy deep neural networks that embed a combination of a generative model, a contrastive model, and a discriminative model in an end-to-end training manner. In our proposed networks, we attempt to disentangle a facial expressive factor in two steps including learning of a generator network and a contrastive encoder network. We conducted extensive experiments on publicly available face expression databases (CK+, MMI, Oulu-CASIA, and in-the-wild databases) that have been widely adopted in the recent literature. The proposed method outperforms the known state-of-the-art methods in terms of the recognition accuracy.

Submitted 8 May, 2019; v1 submitted 21 March, 2017; originally announced March 2017.

arXiv:1510.01839 [pdf, ps, other]

An IMPES scheme for a two-phase flow in heterogeneous porous media using a structured grid

Authors: Gwanghyun Jo, Do Y. Kwak

Abstract: We develop a numerical scheme for a two-phase immiscible flow in heterogeneous porous media using a structured grid finite element method, which have been successfully used for the computation of various physical applications involving elliptic equations \cite{li2003new, li2004immersed, chang2011discontinuous, chou2010optimal, kwak2010analysis}. The proposed method is based on the implicit pressure-explicit saturation procedure. To solve the pressure equation, we use an IFEM based on the Rannacher-Turek \cite{rannacher1992simple} nonconforming space, which is a modification of the work in \cite{kwak2010analysis} where `broken' $P_1$ nonconforming element of Crouzeix-Raviart \cite{crouzeix1973conforming} was developed. For the Darcy velocity, we apply the mixed finite volume method studied in \cite{chou2003mixed, kwak2010analysis} on the basis of immersed finite element method (IFEM). In this way, the Darcy velocity of the flow can be computed cheaply (locally) after we solve the pressure equation. The computed Darcy velocity is used to solve the saturation equation explicitly. Thus the whole procedure can be implemented in an efficient way using a structured grid which is independent of the underlying heterogeneous porous media. Numerical results show that our method exhibits optimal order convergence rates for the pressure and velocity variables, and suboptimal rate for saturation.

Submitted 7 October, 2015; originally announced October 2015.

arXiv:1506.08517 [pdf, ps, other]

A generalization of the divide and conquer algorithm for the symmetric tridiagonal eigenproblem

Authors: Do Young Kwak, Jaeyeon Kim

Abstract: In this paper, we present a generalized Cuppen's divide-and-conquer algorithm for the symmetric tridiagonal eigenproblem. We extend the Cuppen's work to the rank two modifications of the form $A =T +β_1\bw_1\bw_1^T + β_2\bw_2\bw_2^T$, where $T$ is a block tridiagonal matrix having three blocks. We introduce a new deflation technique and obtain a secular equation, for which the distribution of eigenvalues is nontrivial. We present a way to count the number of eigenvalues in each subinterval. It turns out that each subinterval contains either none, one or two eigenvalues. Furthermore, computing eigenvectors preserving the orthogonality are also suggested. Some numerical results, showing our algorithm can calculate the eigenvalue twice as fast as the Cuppen's divide-and-conquer algorithm, are included.

Submitted 29 June, 2015; originally announced June 2015.

Comments: submitted to SIAM J. Matrix Analysis and Application-SIMAX

MSC Class: 65F15; 15A18

arXiv:1506.01292 [pdf, ps, other]

Immersed finite element method for eigenvalue problems in elasticity

Authors: Seungwoo Lee, Do Y. Kwak, Imbo Sim

Abstract: We consider the approximation of eigenvalue problems for elasticity equations with interface. This kind of problems can be efficiently discretized by using immersed finite element method (IFEM) based on Crouzeix-Raviart P1-nonconforming element. The stability and the optimal convergence of IFEM for solving eigenvalue problems with interface are proved by adapting spectral analysis methods for the classical eigenvalue problem. Numerical experiments demonstrate our theoretical results.

Submitted 3 June, 2015; originally announced June 2015.

Comments: 17 pages, 11 figures, 1 table

arXiv:1412.3163 [pdf, ps, other]

Immersed Finite Element Method for Eigenvalue Problem

Authors: Seungwoo Lee, Do Y. Kwak, Imbo Sim

Abstract: We consider the approximation of elliptic eigenvalue problem with an immersed interface. The main aim of this paper is to prove the stability and convergence of an immersed finite element method (IFEM) for eigenvalues using Crouzeix-Raviart $P_1$-nonconforming approximation. We show that spectral analysis for the classical eigenvalue problem can be easily applied to our model problem. We analyze the IFEM for elliptic eigenvalue problem with an immersed interface and derive the optimal convergence of eigenvalues. Numerical experiments demonstrate our theoretical results.

Submitted 9 December, 2014; originally announced December 2014.

arXiv:1408.4227 [pdf, ps, other]

A stabilized $P_1$ immersed finite element method for the interface elasticity problems

Authors: Do Y. Kwak, Sangwon **, Dae H. Kyeong

Abstract: We develop a new finite element method for solving planar elasticity problems involving heterogeneous materials with a mesh not necessarily aligning with the interface of the materials. This method is based on the `broken' Crouzeix-Raviart $P_1$-nonconforming finite element method for elliptic interface problems \cite{Kwak-We-Ch}. To ensure the coercivity of the bilinear form arising from using the nonconforming finite elements, we add stabilizing terms as in the discontinuous Galerkin (DG) method \cite{Arnold-IP},\cite{Ar-B-Co-Ma},\cite{Wheeler}. The novelty of our method is that we use meshes independent of the interface, so that the interface may cut through the elements. Instead, we modify the basis functions so that they satisfy the Laplace-Young condition along the interface of each element. We prove optimal $H^1$ and divergence norm error estimates. Numerical experiments are carried out to demonstrate that our method is optimal for various Lamé parameters $μ$ and $λ$ and locking free as $λ\to\infty$.

Submitted 20 June, 2015; v1 submitted 19 August, 2014; originally announced August 2014.

Comments: Submitted to M2an on May 18 2015. Added a new author (Dae H. Kyeong)

MSC Class: 65N30; 74S05

arXiv:1408.4214 [pdf, ps, other]

A modified $P_1$ - immersed finite element method

Authors: Do Y. Kwak, Juho Lee

Abstract: In recent years, the immersed finite element methods (IFEM) introduced in \cite{Li2003}, \cite{Li2004} to solve elliptic problems having an interface in the domain due to the discontinuity of coefficients have been attracting more attention from researchers because of their simplicity and efficiency. Unlike the conventional finite element methods, the IFEM allows the interface to cut through the interior of the element, yet after the basis functions are altered so that they satisfy the flux jump conditions, it seems to show a reasonable order of convergence. In this paper, we propose an improved version of the $P_1$ based IFEM by adding the line integral of flux terms on each element. This technique resembles the discontinuous Galerkin (DG) method; however, our method has far fewer degrees of freedom than the DG methods since we use the same number of unknowns as the conventional $P_1$ finite element method. We prove $H^1$ and $L^2$ error estimates which are optimal both in order and regularity. Numerical experiments were carried out for several examples, which show the robustness of our scheme.

Submitted 3 July, 2015; v1 submitted 19 August, 2014; originally announced August 2014.

Comments: Some figures were removed from the original article due to errors that occurred during processing

MSC Class: 65N30

arXiv:1107.1841 [pdf, ps, other]

doi 10.1016/j.asr.2011.06.025

Changes in Sea-Level Pressure over South Korea Associated with High-Speed Solar Wind Events

Authors: Il-Hyun Cho, Young-Sil Kwak, Katsuhide Marubashi, Yeon-Han Kim, Young-Deuk Park, Heon-Young Chang

Abstract: We explore the possibility that the daily sea-level pressure (SLP) over South Korea responds to high-speed solar wind events. This is of interest in two aspects: First, if there is a statistical association, this can be another piece of evidence showing that various meteorological observables indeed respond to variations in the interplanetary environment. Second, this can be a crucial observational constraint, since most models proposed so far are expected to work preferentially in higher latitude regions than the low latitude region studied here. We have examined daily solar wind speed ${\rm V}$, daily SLP difference ${\rm ΔSLP}$, and daily ${\rm \log(BV^{2})}$ using the superposed epoch analysis in which the key date is set such that the daily solar wind speed exceeds 800 ${\rm km\,s^{-1}}$. We find that the daily ${\rm ΔSLP}$ averaged over 12 events reaches its peak at day +1 and gradually decreases back to its normal level. The amount of positive deviation of ${\rm ΔSLP}$ is +2.5 hPa. The duration of deviation is a few days. We also find that ${\rm ΔSLP}$ is well correlated with both the speed of solar wind and ${\rm \log(BV^{2})}$. The obtained linear correlation coefficients and chance probabilities with one-day lag for the two cases are $r \simeq 0.81$ with $P > 99.9\%$, and $r \simeq 0.84$ with $P > 99.9\%$, respectively. We conclude by briefly discussing future directions to pursue.

Submitted 10 July, 2011; originally announced July 2011.

Comments: 23 pages, 4 figures, accepted to Advances in Space Research

arXiv:1103.4255 [pdf, ps, other]

doi 10.1016/j.jastp.2011.03.007

Dependence of GCRs influx on the Solar North-South Asymmetry

Authors: Il-Hyun Cho, Young-Sil Kwak, Heon-Young Chang, Kyung-Suk Cho, Young-Deuk Park, Ho-Sung Choi

Abstract: We investigate the dependence of the amount of the observed galactic cosmic ray (GCR) influx on the solar North-South asymmetry using the neutron count rates obtained from four stations and sunspot data in archives spanning six solar cycles from 1953 to 2008. We find that the observed GCR influxes at the Moscow, Kiel, Climax and Huancayo stations are more suppressed when the solar activity in the southern hemisphere is dominant than when the solar activity in the northern hemisphere is dominant. The reduction rates at the four stations are all larger than those of the suppression due to other factors, including the solar polarity effect on the GCR influx. We perform Student's t-test to see how significant these suppressions are. It is found that suppressions due to the solar North-South asymmetry as well as the solar polarity are significant, and yet the suppressions associated with the former are larger and more significant.

Submitted 22 March, 2011; originally announced March 2011.

Comments: 17 pages, 3 figures, accepted to JASTP

arXiv:0911.4772 [pdf, ps, other]

An Analysis of broken $P_1$-Nonconforming Finite Element Method For Interface Problems

Authors: Do Y. Kwak, K. T. Wee

Abstract: We study some numerical methods for solving second-order elliptic problems with an interface. We introduce an immersed interface finite element method based on the `broken' $P_1$-nonconforming piecewise linear polynomials on interface triangular elements having edge averages as degrees of freedom. These linear polynomials are broken to match the homogeneous jump condition along the interface, which is allowed to cut through the element. We prove optimal orders of convergence in the $H^1$- and $L^2$-norms. Next we propose a mixed finite volume method in the context introduced in \cite{Kwak2003} using the Raviart-Thomas mixed finite element and this `broken' $P_1$-nonconforming element. The advantage of this mixed finite volume method is that once we solve the symmetric positive definite pressure equation (without Lagrangian multiplier), the velocity can be computed locally by a simple formula. This procedure avoids solving the saddle point problem. Furthermore, we show optimal error estimates of velocity and pressure in our mixed finite volume method. Numerical results show optimal orders of error in the $L^2$-norm and broken $H^1$-norm for the pressure, and in the $H(\mathrm{div})$-norm for the velocity.

Submitted 25 November, 2009; originally announced November 2009.

MSC Class: 65N15; 65N30

arXiv:0911.4769 [pdf, ps, other]

Extraction method for Stokes Flow with jumps in the pressure

Authors: K. S. Chang, D. Y. Kwak

Abstract: In this paper, we consider a stationary, constant viscosity, incompressible Stokes flow with singular forces along one or several interfaces. Assuming only the jumps of the pressure are present along the interface, we develop a new numerical scheme for such a problem. By constructing an approximate singular function and removing it, we can apply a standard finite element method to solve it. A main advantage of our scheme is that one can use a uniform grid. We observe optimal $O(h)$ order for the pressure and $O(h^2)$ order for the velocity.

Submitted 25 November, 2009; originally announced November 2009.

MSC Class: 65Z05; 76D07

arXiv:cond-mat/0108188 [pdf, ps, other]

doi 10.1103/PhysRevE.65.031602

Monte Carlo Simulation of Sinusoidally Modulated Superlattice Growth

Authors: H. Jeong, B. Kahng, S. Lee, C. Y. Kwak, A. -L. Barabasi, J. K. Furdyna

Abstract: The fabrication of ZnSe/ZnTe superlattices grown by rotating the substrate in the presence of an inhomogeneous flux distribution, instead of successively closing and opening the source shutters, is studied via Monte Carlo simulations. It is found that the concentration of each compound is sinusoidally modulated along the growth direction, caused by the uneven arrival of Se and Te atoms at a given point of the sample, and by the variation of the Te/Se ratio at that point due to the rotation of the substrate. In this way we obtain a ZnSe$_{1-x}$Te$_x$ alloy in which the composition $x$ varies sinusoidally along the growth direction. The period of the modulation is directly controlled by the rate of the substrate rotation. The amplitude of the compositional modulation is monotonous for small angular velocities of the substrate rotation, but is itself modulated for large angular velocities. The average amplitude of the modulation pattern decreases as the angular velocity of substrate rotation increases and the measurement position approaches the center of rotation. The simulation results are in good agreement with previously published experimental measurements on superlattices fabricated in this manner.

Submitted 10 August, 2001; originally announced August 2001.

Showing 1–42 of 42 results for author: Kwak, Y