-
Does Incomplete Syntax Influence Korean Language Model? Focusing on Word Order and Case Markers
Authors:
Jong Myoung Kim,
Young-Jun Lee,
Yong-** Han,
Sangkeun Jung,
Ho-** Choi
Abstract:
Syntactic elements, such as word order and case markers, are fundamental in natural language processing. Recent studies show that syntactic information boosts language model performance and offers clues for people to understand their learning mechanisms. Unlike languages with a fixed word order such as English, Korean allows for varied word sequences, despite its canonical structure, due to case m…
▽ More
Syntactic elements, such as word order and case markers, are fundamental in natural language processing. Recent studies show that syntactic information boosts language model performance and offers clues for people to understand their learning mechanisms. Unlike languages with a fixed word order such as English, Korean allows for varied word sequences, despite its canonical structure, due to case markers that indicate the functions of sentence components. This study explores whether Korean language models can accurately capture this flexibility. We note that incomplete word orders and omitted case markers frequently appear in ordinary Korean communication. To investigate this further, we introduce the Syntactically Incomplete Korean (SIKO) dataset. Through SIKO, we assessed Korean language models' flexibility with incomplete syntax and confirmed the dataset's training value. Results indicate these models reflect Korean's inherent flexibility, accurately handling incomplete inputs. Moreover, fine-tuning with SIKO enhances the ability to handle common incomplete Korean syntactic forms. The dataset's simple construction process, coupled with significant performance enhancements, solidifies its standing as an effective data augmentation technique.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models
Authors:
Nishad Singhi,
Jae Myung Kim,
Karsten Roth,
Zeynep Akata
Abstract:
Concept Bottleneck Models (CBMs) ground image classification on human-understandable concepts to allow for interpretable model decisions. Crucially, the CBM design inherently allows for human interventions, in which expert users are given the ability to modify potentially misaligned concept choices to influence the decision behavior of the model in an interpretable fashion. However, existing appro…
▽ More
Concept Bottleneck Models (CBMs) ground image classification on human-understandable concepts to allow for interpretable model decisions. Crucially, the CBM design inherently allows for human interventions, in which expert users are given the ability to modify potentially misaligned concept choices to influence the decision behavior of the model in an interpretable fashion. However, existing approaches often require numerous human interventions per image to achieve strong performances, posing practical challenges in scenarios where obtaining human feedback is expensive. In this paper, we find that this is noticeably driven by an independent treatment of concepts during intervention, wherein a change of one concept does not influence the use of other ones in the model's final decision. To address this issue, we introduce a trainable concept intervention realignment module, which leverages concept relations to realign concept assignments post-intervention. Across standard, real-world benchmarks, we find that concept realignment can significantly improve intervention efficacy; significantly reducing the number of interventions needed to reach a target classification performance or concept prediction accuracy. In addition, it easily integrates into existing concept-based architectures without requiring changes to the models themselves. This reduced cost of human-model collaboration is crucial to enhancing the feasibility of CBMs in resource-constrained environments.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Ultrasensitive Textile Strain Sensors Redefine Wearable Silent Speech Interfaces with High Machine Learning Efficiency
Authors:
Chenyu Tang,
Muzi Xu,
Wentian Yi,
Zibo Zhang,
Edoardo Occhipinti,
Chaoqun Dong,
Dafydd Ravenscroft,
Sung-Min Jung,
Sanghyo Lee,
Shuo Gao,
Jong Min Kim,
Luigi G. Occhipinti
Abstract:
Our research presents a wearable Silent Speech Interface (SSI) technology that excels in device comfort, time-energy efficiency, and speech decoding accuracy for real-world use. We developed a biocompatible, durable textile choker with an embedded graphene-based strain sensor, capable of accurately detecting subtle throat movements. This sensor, surpassing other strain sensors in sensitivity by 42…
▽ More
Our research presents a wearable Silent Speech Interface (SSI) technology that excels in device comfort, time-energy efficiency, and speech decoding accuracy for real-world use. We developed a biocompatible, durable textile choker with an embedded graphene-based strain sensor, capable of accurately detecting subtle throat movements. This sensor, surpassing other strain sensors in sensitivity by 420%, simplifies signal processing compared to traditional voice recognition methods. Our system uses a computationally efficient neural network, specifically a one-dimensional convolutional neural network with residual structures, to decode speech signals. This network is energy and time-efficient, reducing computational load by 90% while achieving 95.25% accuracy for a 20-word lexicon and swiftly adapting to new users and words with minimal samples. This innovation demonstrates a practical, sensitive, and precise wearable SSI suitable for daily communication applications.
△ Less
Submitted 7 December, 2023; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts
Authors:
Karsten Roth,
Jae Myung Kim,
A. Sophia Koepke,
Oriol Vinyals,
Cordelia Schmid,
Zeynep Akata
Abstract:
The visual classification performance of vision-language models such as CLIP has been shown to benefit from additional semantic knowledge from large language models (LLMs) such as GPT-3. In particular, averaging over LLM-generated class descriptors, e.g. "waffle, which has a round shape", can notably improve generalization performance. In this work, we critically study this behavior and propose Wa…
▽ More
The visual classification performance of vision-language models such as CLIP has been shown to benefit from additional semantic knowledge from large language models (LLMs) such as GPT-3. In particular, averaging over LLM-generated class descriptors, e.g. "waffle, which has a round shape", can notably improve generalization performance. In this work, we critically study this behavior and propose WaffleCLIP, a framework for zero-shot visual classification which simply replaces LLM-generated descriptors with random character and word descriptors. Without querying external models, we achieve comparable performance gains on a large number of visual classification tasks. This allows WaffleCLIP to both serve as a low-cost alternative, as well as a sanity check for any future LLM-based vision-language model extensions. We conduct an extensive experimental study on the impact and shortcomings of additional semantics introduced with LLM-generated descriptors, and showcase how - if available - semantic context is better leveraged by querying LLMs for high-level concepts, which we show can be done to jointly resolve potential class name ambiguities. Code is available here: https://github.com/ExplainableML/WaffleCLIP.
△ Less
Submitted 16 August, 2023; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval
Authors:
Jae Myung Kim,
A. Sophia Koepke,
Cordelia Schmid,
Zeynep Akata
Abstract:
Cross-modal retrieval methods are the preferred tool to search databases for the text that best matches a query image and vice versa. However, image-text retrieval models commonly learn to memorize spurious correlations in the training data, such as frequent object co-occurrence, instead of looking at the actual underlying reasons for the prediction in the image. For image-text retrieval, this man…
▽ More
Cross-modal retrieval methods are the preferred tool to search databases for the text that best matches a query image and vice versa. However, image-text retrieval models commonly learn to memorize spurious correlations in the training data, such as frequent object co-occurrence, instead of looking at the actual underlying reasons for the prediction in the image. For image-text retrieval, this manifests in retrieved sentences that mention objects that are not present in the query image. In this work, we introduce ODmAP@k, an object decorrelation metric that measures a model's robustness to spurious correlations in the training data. We use automatic image and text manipulations to control the presence of such object correlations in designated test data. Additionally, our data synthesis technique is used to tackle model biases due to spurious correlations of semantically unrelated objects in the training data. We apply our proposed pipeline, which involves the finetuning of image-text retrieval frameworks on carefully designed synthetic data, to three state-of-the-art models for image-text retrieval. This results in significant improvements for all three models, both in terms of the standard retrieval performance and in terms of our object decorrelation metric. The code is available at https://github.com/ExplainableML/Spurious_CM_Retrieval.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
Bridging the Gap between Model Explanations in Partially Annotated Multi-label Classification
Authors:
Youngwook Kim,
Jae Myung Kim,
Jieun Jeong,
Cordelia Schmid,
Zeynep Akata,
Jungwoo Lee
Abstract:
Due to the expensive costs of collecting labels in multi-label classification datasets, partially annotated multi-label classification has become an emerging field in computer vision. One baseline approach to this task is to assume unobserved labels as negative labels, but this assumption induces label noise as a form of false negative. To understand the negative impact caused by false negative la…
▽ More
Due to the expensive costs of collecting labels in multi-label classification datasets, partially annotated multi-label classification has become an emerging field in computer vision. One baseline approach to this task is to assume unobserved labels as negative labels, but this assumption induces label noise as a form of false negative. To understand the negative impact caused by false negative labels, we study how these labels affect the model's explanation. We observe that the explanation of two models, trained with full and partial labels each, highlights similar regions but with different scaling, where the latter tends to have lower attribution scores. Based on these findings, we propose to boost the attribution scores of the model trained with partial labels to make its explanation resemble that of the model trained with full labels. Even with the conceptually simple approach, the multi-label classification performance improves by a large margin in three different datasets on a single positive label setting and one on a large-scale partial label setting. Code is available at https://github.com/youngwk/BridgeGapExplanationPAMC.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Likelihood Annealing: Fast Calibrated Uncertainty for Regression
Authors:
Uddeshya Upadhyay,
Jae Myung Kim,
Cordelia Schmidt,
Bernhard Schölkopf,
Zeynep Akata
Abstract:
Recent advances in deep learning have shown that uncertainty estimation is becoming increasingly important in applications such as medical imaging, natural language processing, and autonomous systems. However, accurately quantifying uncertainty remains a challenging problem, especially in regression tasks where the output space is continuous. Deep learning approaches that allow uncertainty estimat…
▽ More
Recent advances in deep learning have shown that uncertainty estimation is becoming increasingly important in applications such as medical imaging, natural language processing, and autonomous systems. However, accurately quantifying uncertainty remains a challenging problem, especially in regression tasks where the output space is continuous. Deep learning approaches that allow uncertainty estimation for regression problems often converge slowly and yield poorly calibrated uncertainty estimates that can not be effectively used for quantification. Recently proposed post hoc calibration techniques are seldom applicable to regression problems and often add overhead to an already slow model training phase. This work presents a fast calibrated uncertainty estimation method for regression tasks called Likelihood Annealing, that consistently improves the convergence of deep regression models and yields calibrated uncertainty without any post hoc calibration phase. Unlike previous methods for calibrated uncertainty in regression that focus only on low-dimensional regression problems, our method works well on a broad spectrum of regression problems, including high-dimensional regression.Our empirical analysis shows that our approach is generalizable to various network architectures, including multilayer perceptrons, 1D/2D convolutional networks, and graph neural networks, on five vastly diverse tasks, i.e., chaotic particle trajectory denoising, physical property prediction of molecules using 3D atomistic representation, natural image super-resolution, and medical image translation using MRI.
△ Less
Submitted 2 July, 2023; v1 submitted 21 February, 2023;
originally announced February 2023.
-
Dilute neutron star matter from neural-network quantum states
Authors:
Bryce Fore,
Jane M. Kim,
Giuseppe Carleo,
Morten Hjorth-Jensen,
Alessandro Lovato
Abstract:
Low-density neutron matter is characterized by fascinating emergent quantum phenomena, such as the formation of Cooper pairs and the onset of superfluidity. We model this density regime by capitalizing on the expressivity of the hidden-nucleon neural-network quantum states combined with variational Monte Carlo and stochastic reconfiguration techniques. Our approach is competitive with the auxiliar…
▽ More
Low-density neutron matter is characterized by fascinating emergent quantum phenomena, such as the formation of Cooper pairs and the onset of superfluidity. We model this density regime by capitalizing on the expressivity of the hidden-nucleon neural-network quantum states combined with variational Monte Carlo and stochastic reconfiguration techniques. Our approach is competitive with the auxiliary-field diffusion Monte Carlo method at a fraction of the computational cost. Using a leading-order pionless effective field theory Hamiltonian, we compute the energy per particle of infinite neutron matter and compare it with those obtained from highly realistic interactions. In addition, a comparison between the spin-singlet and triplet two-body distribution functions indicates the emergence pairing in the $^1S_0$ channel.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Efficient Solutions of Fermionic Systems using Artificial Neural Networks
Authors:
Even M. Nordhagen,
Jane M. Kim,
Bryce Fore,
Alessandro Lovato,
Morten Hjorth-Jensen
Abstract:
We discuss differences and similarities between variational Monte Carlo approaches that use conventional and artificial neural network parameterizations of the ground-state wave function for systems of fermions. We focus on a relatively shallow neural-network architectures, the so called restricted Boltzmann machine, and discuss unsupervised learning algorithms that are suitable to model complicat…
▽ More
We discuss differences and similarities between variational Monte Carlo approaches that use conventional and artificial neural network parameterizations of the ground-state wave function for systems of fermions. We focus on a relatively shallow neural-network architectures, the so called restricted Boltzmann machine, and discuss unsupervised learning algorithms that are suitable to model complicated many-body correlations. We analyze the strengths and weaknesses of conventional and neural-network wave functions by solving various circular quantum-dots systems. Results for up to 90 electrons are presented and particular emphasis is placed on how to efficiently implement these methods on homogeneous and heterogeneous high-performance computing facilities.
△ Less
Submitted 1 October, 2022;
originally announced October 2022.
-
Large Loss Matters in Weakly Supervised Multi-Label Classification
Authors:
Youngwook Kim,
Jae Myung Kim,
Zeynep Akata,
Jungwoo Lee
Abstract:
Weakly supervised multi-label classification (WSML) task, which is to learn a multi-label classification using partially observed labels per image, is becoming increasingly important due to its huge annotation cost. In this work, we first regard unobserved labels as negative labels, casting the WSML task into noisy multi-label classification. From this point of view, we empirically observe that me…
▽ More
Weakly supervised multi-label classification (WSML) task, which is to learn a multi-label classification using partially observed labels per image, is becoming increasingly important due to its huge annotation cost. In this work, we first regard unobserved labels as negative labels, casting the WSML task into noisy multi-label classification. From this point of view, we empirically observe that memorization effect, which was first discovered in a noisy multi-class setting, also occurs in a multi-label setting. That is, the model first learns the representation of clean labels, and then starts memorizing noisy labels. Based on this finding, we propose novel methods for WSML which reject or correct the large loss samples to prevent model from memorizing the noisy label. Without heavy and complex components, our proposed methods outperform previous state-of-the-art WSML methods on several partial label settings including Pascal VOC 2012, MS COCO, NUSWIDE, CUB, and OpenImages V3 datasets. Various analysis also show that our methodology actually works well, validating that treating large loss properly matters in a weakly supervised multi-label classification. Our code is available at https://github.com/snucml/LargeLossMatters.
△ Less
Submitted 8 June, 2022;
originally announced June 2022.
-
Keep CALM and Improve Visual Feature Attribution
Authors:
Jae Myung Kim,
Junsuk Choe,
Zeynep Akata,
Seong Joon Oh
Abstract:
The class activation map**, or CAM, has been the cornerstone of feature attribution methods for multiple vision tasks. Its simplicity and effectiveness have led to wide applications in the explanation of visual predictions and weakly-supervised localization tasks. However, CAM has its own shortcomings. The computation of attribution maps relies on ad-hoc calibration steps that are not part of th…
▽ More
The class activation map**, or CAM, has been the cornerstone of feature attribution methods for multiple vision tasks. Its simplicity and effectiveness have led to wide applications in the explanation of visual predictions and weakly-supervised localization tasks. However, CAM has its own shortcomings. The computation of attribution maps relies on ad-hoc calibration steps that are not part of the training computational graph, making it difficult for us to understand the real meaning of the attribution values. In this paper, we improve CAM by explicitly incorporating a latent variable encoding the location of the cue for recognition in the formulation, thereby subsuming the attribution map into the training computational graph. The resulting model, class activation latent map**, or CALM, is trained with the expectation-maximization algorithm. Our experiments show that CALM identifies discriminative attributes for image classifiers more accurately than CAM and other visual attribution baselines. CALM also shows performance improvements over prior arts on the weakly-supervised object localization benchmarks. Our code is available at https://github.com/naver-ai/calm.
△ Less
Submitted 12 August, 2021; v1 submitted 14 June, 2021;
originally announced June 2021.
-
Inkjet printed circuits with two-dimensional semiconductor inks for high-performance electronics
Authors:
Tian Carey,
Adrees Arbab,
Luca Anzi,
Helen Bristow,
Fei Hui,
Sivasambu Bohm,
Gwenhivir Wyatt-Moon,
Andrew Flewitt,
Andrew Wadsworth,
Nicola Gasparini,
Jong Min Kim,
Mario Lanza,
Iain McCulloch,
Roman Sordan,
Felice Torrisi
Abstract:
Air-stable semiconducting inks suitable for complementary logic are key to create low-power printed integrated circuits (ICs). High-performance printable electronic inks with two-dimensional materials have the potential to enable the next generation of high performance, low-cost printed digital electronics. Here we demonstrate air-stable, low voltage (< 5 V) operation of inkjet-printed n-type moly…
▽ More
Air-stable semiconducting inks suitable for complementary logic are key to create low-power printed integrated circuits (ICs). High-performance printable electronic inks with two-dimensional materials have the potential to enable the next generation of high performance, low-cost printed digital electronics. Here we demonstrate air-stable, low voltage (< 5 V) operation of inkjet-printed n-type molybdenum disulfide (MoS2) and p-type indacenodithiophene-co-benzothiadiazole (IDT-BT) field-effect transistors (FETs), estimating a switching time of τ ~ 3.3 μs for the MoS2 FETs. We achieve this by engineering high-quality MoS2 and air-stable IDT-BT inks suitable for inkjet-printing complementary pairs of n-type MoS2 and p-type IDT-BT FETs. We then integrate MoS2 and IDT-BT FETs to realise inkjet-printed complementary logic inverters with a voltage gain |Av| ~ 4 when in resistive load configuration and |Av| ~ 1.36 in complementary configuration. These results represent a key enabling step towards ubiquitous long-term stable, low-cost printed digital ICs.
△ Less
Submitted 24 November, 2020;
originally announced November 2020.
-
REST: Performance Improvement of a Black Box Model via RL-based Spatial Transformation
Authors:
Jae Myung Kim,
Hyung** Kim,
Chanwoo Park,
Jungwoo Lee
Abstract:
In recent years, deep neural networks (DNN) have become a highly active area of research, and shown remarkable achievements on a variety of computer vision tasks. DNNs, however, are known to often make overconfident yet incorrect predictions on out-of-distribution samples, which can be a major obstacle to real-world deployments because the training dataset is always limited compared to diverse rea…
▽ More
In recent years, deep neural networks (DNN) have become a highly active area of research, and shown remarkable achievements on a variety of computer vision tasks. DNNs, however, are known to often make overconfident yet incorrect predictions on out-of-distribution samples, which can be a major obstacle to real-world deployments because the training dataset is always limited compared to diverse real-world samples. Thus, it is fundamental to provide guarantees of robustness to the distribution shift between training and test time when we construct DNN models in practice. Moreover, in many cases, the deep learning models are deployed as black boxes and the performance has been already optimized for a training dataset, thus changing the black box itself can lead to performance degradation. We here study the robustness to the geometric transformations in a specific condition where the black-box image classifier is given. We propose an additional learner, \emph{REinforcement Spatial Transform learner (REST)}, that transforms the warped input data into samples regarded as in-distribution by the black-box models. Our work aims to improve the robustness by adding a REST module in front of any black boxes and training only the REST module without retraining the original black box model in an end-to-end manner, i.e. we try to convert the real-world data into training distribution which the performance of the black-box model is best suited for. We use a confidence score that is obtained from the black-box model to determine whether the transformed input is drawn from in-distribution. We empirically show that our method has an advantage in generalization to geometric transformations and sample efficiency.
△ Less
Submitted 16 February, 2020;
originally announced February 2020.
-
Exploring linearity of deep neural network trained QSM: QSMnet+
Authors:
Woo** Jung,
Jaeyeon Yoon,
Joon Yul Choi,
Jae Myung Kim,
Yoonho Nam,
Eung Yeop Kim,
Jongho Lee
Abstract:
Recently, deep neural network-powered quantitative susceptibility map** (QSM), QSMnet, successfully performed ill conditioned dipole inversion in QSM and generated high-quality susceptibility maps. In this paper, the network, which was trained by healthy volunteer data, is evaluated for hemorrhagic lesions that have substantially higher susceptibility than healthy tissues in order to test linear…
▽ More
Recently, deep neural network-powered quantitative susceptibility map** (QSM), QSMnet, successfully performed ill conditioned dipole inversion in QSM and generated high-quality susceptibility maps. In this paper, the network, which was trained by healthy volunteer data, is evaluated for hemorrhagic lesions that have substantially higher susceptibility than healthy tissues in order to test linearity of QSMnet for susceptibility. The results show that QSMnet underestimates susceptibility in hemorrhagic lesions, revealing degraded linearity of the network for the untrained susceptibility range. To overcome this limitation, a data augmentation method is proposed to generalize the network for a wider range of susceptibility. The newly trained network, which is referred to as QSMnet+, is assessed in computer-simulated lesions with an extended susceptibility range (-1.4 ppm to +1.4 ppm) and also in twelve hemorrhagic patients. The simulation results demonstrate improved linearity of QSMnet+ over QSMnet (root mean square error of QSMnet+: 0.04 ppm vs. QSMnet: 0.36 ppm). When applied to patient data, QSMnet+ maps show less noticeable artifacts to those of conventional QSM maps. Moreover, the susceptibility values of QSMnet+ in hemorrhagic lesions are better matched to those of the conventional QSM method than those of QSMnet when analyzed using linear regression (QSMnet+: slope = 1.05, intercept = -0.03, R2 = 0.93; QSMnet: slope = 0.68, intercept = 0.06, R2 = 0.86), consolidating improved linearity in QSMnet+. This study demonstrates the importance of the trained data range in deep neural network-powered parametric map** and suggests the data augmentation approach for generalization of network. The new network can be applicable for a wide range of susceptibility quantification.
△ Less
Submitted 14 October, 2019; v1 submitted 17 September, 2019;
originally announced September 2019.
-
A pseudo-capacitive chalcogenide-based electrode with dense 1-dimensional nanoarrays for enhanced energy density in asymmetric supercapacitors
Authors:
Young-Woo Lee,
Byung-Sung Kima,
Jong Hong,
Juwon Lee,
Sangyeon Pak,
Hyeon-Sik Jang,
Dongmok Whang,
SeungNam Cha,
Jung Inn Sohn,
Jong Min Kim
Abstract:
To achieve the further development of supercapacitors (SCs), which have intensively received attention as a next-generation energy storage system, the rational design of active electrode materials with electrochemically more favorable structure is one of the most important factors to improve the SC performance with high specific energy and power density. We propose and successfully grow copper sul…
▽ More
To achieve the further development of supercapacitors (SCs), which have intensively received attention as a next-generation energy storage system, the rational design of active electrode materials with electrochemically more favorable structure is one of the most important factors to improve the SC performance with high specific energy and power density. We propose and successfully grow copper sulfide (CuS) nanowires (NWs) as a chalcogenide-based electrode material directly on a Cu mesh current collector using the combination of a facile liquid-solid chemical oxidation process and an anion exchange reaction. We found that the as-prepared CuS NWs have well-arrayed structures with nanosized crystal grains, a high aspect ratio and density, as well as a good mechanical and electrical contact to the Cu mesh. The obtained CuS NW based electrodes, with additional binder- and conductive material-free, exhibit a much higher areal capacitance of 378.0 mF/cm2 and excellent cyclability of an approximately 90.2 percentage retention during 2000 charge/discharge cycles due to their unique structural, electrical, and electrochemical properties. Furthermore, for practical SC applications, an asymmetric supercapacitor is fabricated using active carbon as an anode and CuS NWs as a cathode, and exhibits the good capacitance retention of 91% during 2000 charge/discharge processes and the excellent volumetric energy density of 1.11 mW h/cm3 compared to other reported pseudo-capacitive SCs.
△ Less
Submitted 15 May, 2019;
originally announced May 2019.
-
Sampling-based Bayesian Inference with gradient uncertainty
Authors:
Chanwoo Park,
Jae Myung Kim,
Seok Hyeon Ha,
Jungwoo Lee
Abstract:
Deep neural networks(NNs) have achieved impressive performance, often exceed human performance on many computer vision tasks. However, one of the most challenging issues that still remains is that NNs are overconfident in their predictions, which can be very harmful when this arises in safety critical applications. In this paper, we show that predictive uncertainty can be efficiently estimated whe…
▽ More
Deep neural networks(NNs) have achieved impressive performance, often exceed human performance on many computer vision tasks. However, one of the most challenging issues that still remains is that NNs are overconfident in their predictions, which can be very harmful when this arises in safety critical applications. In this paper, we show that predictive uncertainty can be efficiently estimated when we incorporate the concept of gradients uncertainty into posterior sampling. The proposed method is tested on two different datasets, MNIST for in-distribution confusing examples and notMNIST for out-of-distribution data. We show that our method is able to efficiently represent predictive uncertainty on both datasets.
△ Less
Submitted 27 December, 2019; v1 submitted 8 December, 2018;
originally announced December 2018.
-
Compressive Sampling using Annihilating Filter-based Low-Rank Interpolation
Authors:
Jong Chul Ye,
Jong Min Kim,
Kyong Hwan **,
Kiryung Lee
Abstract:
While the recent theory of compressed sensing provides an opportunity to overcome the Nyquist limit in recovering sparse signals, a solution approach usually takes a form of inverse problem of the unknown signal, which is crucially dependent on specific signal representation. In this paper, we propose a drastically different two-step Fourier compressive sampling framework in continuous domain that…
▽ More
While the recent theory of compressed sensing provides an opportunity to overcome the Nyquist limit in recovering sparse signals, a solution approach usually takes a form of inverse problem of the unknown signal, which is crucially dependent on specific signal representation. In this paper, we propose a drastically different two-step Fourier compressive sampling framework in continuous domain that can be implemented as a measurement domain interpolation, after which a signal reconstruction can be done using classical analytic reconstruction methods. The main idea is originated from the fundamental duality between the sparsity in the primary space and the low-rankness of a structured matrix in the spectral domain, which shows that a low-rank interpolator in the spectral domain can enjoy all the benefit of sparse recovery with performance guarantees. Most notably, the proposed low-rank interpolation approach can be regarded as a generalization of recent spectral compressed sensing to recover large class of finite rate of innovations (FRI) signals at near optimal sampling rate. Moreover, for the case of cardinal representation, we can show that the proposed low-rank interpolation will benefit from inherent regularization and the optimal incoherence parameter. Using the powerful dual certificates and golfing scheme, we show that the new framework still achieves the near-optimal sampling rate for general class of FRI signal recovery, and the sampling rate can be further reduced for the class of cardinal splines. Numerical results using various type of FRI signals confirmed that the proposed low-rank interpolation approach has significant better phase transition than the conventional CS approaches.
△ Less
Submitted 26 September, 2016; v1 submitted 29 November, 2015;
originally announced November 2015.
-
Improving M-SBL for Joint Sparse Recovery using a Subspace Penalty
Authors:
Jong Chul Ye,
Jong Min Kim,
Yoram Bresler
Abstract:
The multiple measurement vector problem (MMV) is a generalization of the compressed sensing problem that addresses the recovery of a set of jointly sparse signal vectors. One of the important contributions of this paper is to reveal that the seemingly least related state-of-art MMV joint sparse recovery algorithms - M-SBL (multiple sparse Bayesian learning) and subspace-based hybrid greedy algorit…
▽ More
The multiple measurement vector problem (MMV) is a generalization of the compressed sensing problem that addresses the recovery of a set of jointly sparse signal vectors. One of the important contributions of this paper is to reveal that the seemingly least related state-of-art MMV joint sparse recovery algorithms - M-SBL (multiple sparse Bayesian learning) and subspace-based hybrid greedy algorithms - have a very important link. More specifically, we show that replacing the $\log\det(\cdot)$ term in M-SBL by a rank proxy that exploits the spark reduction property discovered in subspace-based joint sparse recovery algorithms, provides significant improvements. In particular, if we use the Schatten-$p$ quasi-norm as the corresponding rank proxy, the global minimiser of the proposed algorithm becomes identical to the true solution as $p \rightarrow 0$. Furthermore, under the same regularity conditions, we show that the convergence to a local minimiser is guaranteed using an alternating minimization algorithm that has closed form expressions for each of the minimization steps, which are convex. Numerical simulations under a variety of scenarios in terms of SNR, and condition number of the signal amplitude matrix demonstrate that the proposed algorithm consistently outperforms M-SBL and other state-of-the art algorithms.
△ Less
Submitted 25 March, 2015; v1 submitted 23 March, 2015;
originally announced March 2015.
-
Optogenetic control of cell signaling pathway through scattering skull using wavefront sha**
Authors:
Jonghee Yoon,
Minji Lee,
KyeoReh Lee,
Nury Kim,
** Man Kim,
Jongchan Park,
Chulhee Choi,
Won Do Heo,
YongKeun Park
Abstract:
We introduce a non-invasive approach for optogenetic regulation in biological cells through highly scattering skull tissue using wavefront sha**. The wavefront of the incident light was systematically controlled using a spatial light modulator in order to overcome multiple light-scattering in a mouse skull layer and to focus light on the target cells. We demonstrate that illumination with shaped…
▽ More
We introduce a non-invasive approach for optogenetic regulation in biological cells through highly scattering skull tissue using wavefront sha**. The wavefront of the incident light was systematically controlled using a spatial light modulator in order to overcome multiple light-scattering in a mouse skull layer and to focus light on the target cells. We demonstrate that illumination with shaped waves enables spatiotemporal regulation of intracellular Ca2+ level at the individual-cell level.
△ Less
Submitted 27 October, 2015; v1 submitted 17 February, 2015;
originally announced February 2015.
-
Improving Noise Robustness in Subspace-based Joint Sparse Recovery
Authors:
Jong Min Kim,
Ok Kyun Lee,
Jong Chul Ye
Abstract:
In a multiple measurement vector problem (MMV), where multiple signals share a common sparse support and are sampled by a common sensing matrix, we can expect joint sparsity to enable a further reduction in the number of required measurements. While a diversity gain from joint sparsity had been demonstrated earlier in the case of a convex relaxation method using an $l_1/l_2$ mixed norm penalty, on…
▽ More
In a multiple measurement vector problem (MMV), where multiple signals share a common sparse support and are sampled by a common sensing matrix, we can expect joint sparsity to enable a further reduction in the number of required measurements. While a diversity gain from joint sparsity had been demonstrated earlier in the case of a convex relaxation method using an $l_1/l_2$ mixed norm penalty, only recently was it shown that similar diversity gain can be achieved by greedy algorithms if we combine greedy steps with a MUSIC-like subspace criterion. However, the main limitation of these hybrid algorithms is that they often require a large number of snapshots or a high signal-to-noise ratio (SNR) for an accurate subspace as well as partial support estimation. One of the main contributions of this work is to show that the noise robustness of these algorithms can be significantly improved by allowing sequential subspace estimation and support filtering, even when the number of snapshots is insufficient. Numerical simulations show that a novel sequential compressive MUSIC (sequential CS-MUSIC) that combines the sequential subspace estimation and support filtering steps significantly outperforms the existing greedy algorithms and is quite comparable with computationally expensive state-of-art algorithms.
△ Less
Submitted 15 May, 2012; v1 submitted 15 December, 2011;
originally announced December 2011.
-
Exact Dynamic Support Tracking with Multiple Measurement Vectors using Compressive MUSIC
Authors:
Jong Min Kim,
Ok Kyun Lee,
Jong Chul Ye
Abstract:
Dynamic tracking of sparse targets has been one of the important topics in array signal processing. Recently, compressed sensing (CS) approaches have been extensively investigated as a new tool for this problem using partial support information obtained by exploiting temporal redundancy. However, most of these approaches are formulated under single measurement vector compressed sensing (SMV-CS) fr…
▽ More
Dynamic tracking of sparse targets has been one of the important topics in array signal processing. Recently, compressed sensing (CS) approaches have been extensively investigated as a new tool for this problem using partial support information obtained by exploiting temporal redundancy. However, most of these approaches are formulated under single measurement vector compressed sensing (SMV-CS) framework, where the performance guarantees are only in a probabilistic manner. The main contribution of this paper is to allow \textit{deterministic} tracking of time varying supports with multiple measurement vectors (MMV) by exploiting multi-sensor diversity. In particular, we show that a novel compressive MUSIC (CS-MUSIC) algorithm with optimized partial support selection not only allows removal of inaccurate portion of previous support estimation but also enables addition of newly emerged part of unknown support. Numerical results confirm the theory.
△ Less
Submitted 3 October, 2011;
originally announced October 2011.
-
Compressive MUSIC with optimized partial support for joint sparse recovery
Authors:
Jong Min Kim,
Ok Kyun Lee,
Jong Chul Ye
Abstract:
Multiple measurement vector (MMV) problem addresses the identification of unknown input vectors that share common sparse support. The MMV problems had been traditionally addressed either by sensor array signal processing or compressive sensing. However, recent breakthrough in this area such as compressive MUSIC (CS-MUSIC) or subspace-augumented MUSIC (SA-MUSIC) optimally combines the compressive s…
▽ More
Multiple measurement vector (MMV) problem addresses the identification of unknown input vectors that share common sparse support. The MMV problems had been traditionally addressed either by sensor array signal processing or compressive sensing. However, recent breakthrough in this area such as compressive MUSIC (CS-MUSIC) or subspace-augumented MUSIC (SA-MUSIC) optimally combines the compressive sensing (CS) and array signal processing such that $k-r$ supports are first found by CS and the remaining $r$ supports are determined by generalized MUSIC criterion, where $k$ and $r$ denote the sparsity and the independent snapshots, respectively. Even though such hybrid approach significantly outperforms the conventional algorithms, its performance heavily depends on the correct identification of $k-r$ partial support by compressive sensing step, which often deteriorate the overall performance. The main contribution of this paper is, therefore, to show that as long as $k-r+1$ correct supports are included in any $k$-sparse CS solution, the optimal $k-r$ partial support can be found using a subspace fitting criterion, significantly improving the overall performance of CS-MUSIC. Furthermore, unlike the single measurement CS counterpart that requires infinite SNR for a perfect support recovery, we can derive an information theoretic sufficient condition for the perfect recovery using CS-MUSIC under a {\em finite} SNR scenario.
△ Less
Submitted 31 May, 2011; v1 submitted 16 February, 2011;
originally announced February 2011.
-
Direct and Indirect Detection of Neutralino Dark Matter and Collider Signatures in an $SO(10)$ Model with Two Intermediate Scales
Authors:
Manuel Drees,
Ju Min Kim,
Eun-Kyung Park
Abstract:
We investigate the detectability of neutralino Dark Matter via direct and indirect searches as well as collider signatures of an $SO(10)$ model with two intermediate scales. We compare the direct Dark Matter detection cross section and the muon flux due to neutralino annihilation in the Sun that we obtain in this model with mSUGRA predictions and with the sensitivity of current and future experime…
▽ More
We investigate the detectability of neutralino Dark Matter via direct and indirect searches as well as collider signatures of an $SO(10)$ model with two intermediate scales. We compare the direct Dark Matter detection cross section and the muon flux due to neutralino annihilation in the Sun that we obtain in this model with mSUGRA predictions and with the sensitivity of current and future experiments. In both cases, we find that the detectability improves as the model deviates more from mSUGRA. In order to study collider signatures, we choose two benchmark points that represent the main phenomenological features of the model: a lower value of $|μ|$ and reduced third generation sfermion masses due to extra Yukawa coupling contributions in the Renormalization Group Equations, and increased first and second generation slepton masses due to new gaugino loop contributions. We show that measurements at the LHC can distinguish this model from mSUGRA in both cases, by counting events containing leptonically decaying $Z^0$ bosons, heavy neutral Higgs bosons, or like--sign lepton pairs.
△ Less
Submitted 10 June, 2010;
originally announced June 2010.
-
Dirac Neutralinos and Electroweak Scalar Bosons of N=1/N=2 Hybrid Supersymmetry at Colliders
Authors:
S. Y. Choi,
D. Choudhury,
A. Freitas,
J. Kalinowski,
J. M. Kim,
P. M. Zerwas
Abstract:
In the N=1 supersymmetric extension of the Standard Model, neutralinos associated in supermultiplets with the neutral electroweak gauge and Higgs bosons are, as well as gluinos, Majorana fermions. They can be paired with the Majorana fermions of novel gaugino/scalar supermultiplets, as suggested by extended N=2 supersymmetry, to Dirac particles. Matter fields are not extended beyond the standard N…
▽ More
In the N=1 supersymmetric extension of the Standard Model, neutralinos associated in supermultiplets with the neutral electroweak gauge and Higgs bosons are, as well as gluinos, Majorana fermions. They can be paired with the Majorana fermions of novel gaugino/scalar supermultiplets, as suggested by extended N=2 supersymmetry, to Dirac particles. Matter fields are not extended beyond the standard N=1 supermultiplets in N=1/N=2 hybrid supersymmetry to preserve the chiral character of the theory. Complementing earlier analyses in the color sector, central elements of such an electroweak scenario are analyzed in the present study. The decay properties of the Dirac fermions and of the scalar bosons are worked out, and the single and pair production channels of the new particles are described for proton collisions at the LHC, and electron/positron and photon-photon collisions at linear colliders. Special attention is paid to modifications of the Higgs sector, identified with an N=2 hypermultiplet, by the mixing with the novel electroweak scalar sector.
△ Less
Submitted 12 August, 2010; v1 submitted 5 May, 2010;
originally announced May 2010.
-
Compressive MUSIC: A Missing Link Between Compressive Sensing and Array Signal Processing
Authors:
Jong Min Kim,
Ok Kyun Lee,
Jong Chul Ye
Abstract:
The multiple measurement vector (MMV) problem addresses the identification of unknown input vectors that share common sparse support. Even though MMV problems had been traditionally addressed within the context of sensor array signal processing, the recent trend is to apply compressive sensing (CS) due to its capability to estimate sparse support even with an insufficient number of snapshots, in w…
▽ More
The multiple measurement vector (MMV) problem addresses the identification of unknown input vectors that share common sparse support. Even though MMV problems had been traditionally addressed within the context of sensor array signal processing, the recent trend is to apply compressive sensing (CS) due to its capability to estimate sparse support even with an insufficient number of snapshots, in which case classical array signal processing fails. However, CS guarantees the accurate recovery in a probabilistic manner, which often shows inferior performance in the regime where the traditional array signal processing approaches succeed. The apparent dichotomy between the {\em probabilistic} CS and {\em deterministic} sensor array signal processing have not been fully understood. The main contribution of the present article is a unified approach that unveils a {missing link} between CS and array signal processing. The new algorithm, which we call {\em compressive MUSIC}, identifies the parts of support using CS, after which the remaining supports are estimated using a novel generalized MUSIC criterion. Using a large system MMV model, we show that our compressive MUSIC requires a smaller number of sensor elements for accurate support recovery than the existing CS methods and can approach the optimal $l_0$-bound with finite number of snapshots.
△ Less
Submitted 1 April, 2011; v1 submitted 25 April, 2010;
originally announced April 2010.
-
Potentially Large One-loop Corrections to WIMP Annihilation
Authors:
M. Drees,
J. M. Kim,
K. I. Nagao
Abstract:
We compute one-loop corrections to the annihilation of non--relativistic particles $χ$ due to the exchange of a (gauge or Higgs) boson $φ$ with mass $μ$ in the initial state. In the limit $m_χ\gg μ$ this leads to the "Sommerfeld enhancement" of the annihilation cross section. However, here we are interested in the case $μ\lsim m_χ$, where the one--loop corrections are well--behaved, but can still…
▽ More
We compute one-loop corrections to the annihilation of non--relativistic particles $χ$ due to the exchange of a (gauge or Higgs) boson $φ$ with mass $μ$ in the initial state. In the limit $m_χ\gg μ$ this leads to the "Sommerfeld enhancement" of the annihilation cross section. However, here we are interested in the case $μ\lsim m_χ$, where the one--loop corrections are well--behaved, but can still be sizable. We find simple and accurate expressions for annihilation from both $S-$ and $P-$wave initial states; they differ from each other if $μ\neq 0$. In order to apply our results to the calculation of the relic density of Weakly Interacting Massive Particles (WIMPs), we describe how to compute the thermal average of the corrected cross sections. We apply this formalism to scalar and Dirac fermion singlet WIMPs, and show that the corrections are always very small in the former case, but can be very large in the latter. Moreover, in the context of the Minimal Supersymmetric Standard Model, these corrections can decrease the relic density of neutralinos by more than 1%, if the lightest neutralino is a strongly mixed state.
△ Less
Submitted 28 April, 2010; v1 submitted 19 November, 2009;
originally announced November 2009.
-
Scalar gluons and Dirac gluinos at the LHC
Authors:
S. Y. Choi,
J. Kalinowski,
J. M. Kim,
E. Popenda
Abstract:
The hybrid N=1/N=2 supersymmetric model predicts scalar gluons (sgluons) as SUSY partners of the Dirac gluino. Their strikingly distinct phenomenology at the CERN Large Hadron Collider is discussed.
The hybrid N=1/N=2 supersymmetric model predicts scalar gluons (sgluons) as SUSY partners of the Dirac gluino. Their strikingly distinct phenomenology at the CERN Large Hadron Collider is discussed.
△ Less
Submitted 10 November, 2009;
originally announced November 2009.
-
Color-octet scalars at the LHC
Authors:
S. Y. Choi,
M. Drees,
J. Kalinowski,
J. M. Kim,
E. Popenda,
P. M. Zerwas
Abstract:
Elements of the phenomenology of color-octet scalars (sgluons), as predicted in the hybrid N=1/N=2 supersymmetric model, are discussed in the light of forthcoming experiments at the CERN Large Hadron Collider.
Elements of the phenomenology of color-octet scalars (sgluons), as predicted in the hybrid N=1/N=2 supersymmetric model, are discussed in the light of forthcoming experiments at the CERN Large Hadron Collider.
△ Less
Submitted 26 February, 2009;
originally announced February 2009.
-
Color-Octet Scalars of N=2 Supersymmetry at the LHC
Authors:
S. Y. Choi,
M. Drees,
J. Kalinowski,
J. M. Kim,
E. Popenda,
P. M. Zerwas
Abstract:
The color gauge hyper-multiplet in N=2 supersymmetry consists of the usual N=1 gauge vector/gaugino super-multiplet, joined with a novel gaugino/scalar super-multiplet. Large cross sections are predicted for the production of pairs of the color-octet scalars $σ$ [sgluons] at the LHC: $gg, q\bar{q} \to σσ^{\ast}$. Single $σ$ production is possible at one-loop level, but the $g g\to σ$ amplitude v…
▽ More
The color gauge hyper-multiplet in N=2 supersymmetry consists of the usual N=1 gauge vector/gaugino super-multiplet, joined with a novel gaugino/scalar super-multiplet. Large cross sections are predicted for the production of pairs of the color-octet scalars $σ$ [sgluons] at the LHC: $gg, q\bar{q} \to σσ^{\ast}$. Single $σ$ production is possible at one-loop level, but the $g g\to σ$ amplitude vanishes in the limit of degenerate $L$ and $R$ squarks. When kinematically allowed, $σ$ decays predominantly into two gluinos, whose cascade decays give rise to a burst of eight or more jets together with four LSP's as signature for $σ$ pair events at the LHC. $σ$ can also decay into a squark-antisquark pair at tree level. At one-loop level $σ$ decays into gluons or a $t \bar t$ pair are predicted, generating exciting resonance signatures in the final states. The corresponding partial widths are very roughly comparable to that for three body final states mediated by one virtual squark at tree level.
△ Less
Submitted 23 January, 2009; v1 submitted 18 December, 2008;
originally announced December 2008.
-
Neutralino Dark Matter in an SO(10) Model with Two-step Intermediate Scale Symmetry Breaking
Authors:
Manuel Drees,
Ju Min Kim
Abstract:
We consider a supersymmetric Grand Unified Theory (GUT) based on the gauge group SO(10) suggested by Aulakh et al., which features two--step intermediate symmetry breaking, $SO(10) \to SU(4)_C \times SU(2)_L \times SU(2)_R \to SU(3)_C \times U(1)_{B-L} \times SU(2)_L \times SU(2)_R \to SU(3)_C \times SU(2)_L \times U(1)_Y$. {\bf $45, 54, 126+\overline{126}$} dimensional representations of Higgs…
▽ More
We consider a supersymmetric Grand Unified Theory (GUT) based on the gauge group SO(10) suggested by Aulakh et al., which features two--step intermediate symmetry breaking, $SO(10) \to SU(4)_C \times SU(2)_L \times SU(2)_R \to SU(3)_C \times U(1)_{B-L} \times SU(2)_L \times SU(2)_R \to SU(3)_C \times SU(2)_L \times U(1)_Y$. {\bf $45, 54, 126+\overline{126}$} dimensional representations of Higgs superfields are employed to achieve this symmetry breaking chain. We also introduce a second, very heavy, pair of Higgs doublets, which modifies the Yukawa couplings of matter fields relative to minimal SO(10) predictions. We analyze the differences in the low energy phenomenology compared to that of mSUGRA, assuming universal soft breaking scalar masses, gaugino masses and trilinear couplings at the GUT scale. We find that thermal neutralino Dark Matter remains viable in this scenario, although for small and moderate values of $\tanβ$ the allowed region is even more highly constrained than in mSUGRA, and depends strongly on the the light neutrino masses.
△ Less
Submitted 10 October, 2008;
originally announced October 2008.
-
Ferromagnetism in the Mott insulator Ba2NaOsO6
Authors:
A. S. Erickson,
S. Misra,
G. J. Miller,
R. R. Gupta,
Z. Schlesinger,
W. A. Harrison,
J. M. Kim,
I. R. Fisher
Abstract:
Results are presented of single crystal structural, thermodynamic, and reflectivity measurements of the double-perovskite Ba2NaOsO6. These characterize the material as a 5d^1 ferromagnetic Mott insulator with an ordered moment of ~0.2 Bohr magnetons per formula unit and TC = 6.8(3) K. The magnetic entropy associated with this phase transition is close to Rln2, indicating that the quartet grounds…
▽ More
Results are presented of single crystal structural, thermodynamic, and reflectivity measurements of the double-perovskite Ba2NaOsO6. These characterize the material as a 5d^1 ferromagnetic Mott insulator with an ordered moment of ~0.2 Bohr magnetons per formula unit and TC = 6.8(3) K. The magnetic entropy associated with this phase transition is close to Rln2, indicating that the quartet groundstate anticipated from consideration of the crystal structure is split, consistent with a scenario in which the ferromagnetism is associated with orbital ordering.
△ Less
Submitted 7 May, 2007; v1 submitted 13 October, 2006;
originally announced October 2006.
-
Cyclic Topology in Complex Networks
Authors:
Hyun-Joo Kim,
** Min Kim
Abstract:
We propose a cyclic coefficient $R$ which represents the cyclic characteristics of complex networks. If the network forms a perfect tree-like structure then $R$ becomes zero. The larger value of $R$ represents that the network is more cyclic. We measure the cyclic coefficients and the distributions of the local cyclic coefficient for both various real networks and the representative network mode…
▽ More
We propose a cyclic coefficient $R$ which represents the cyclic characteristics of complex networks. If the network forms a perfect tree-like structure then $R$ becomes zero. The larger value of $R$ represents that the network is more cyclic. We measure the cyclic coefficients and the distributions of the local cyclic coefficient for both various real networks and the representative network models and characterize the cyclic structures of them.
△ Less
Submitted 21 March, 2005;
originally announced March 2005.
-
First order isotropic - smectic-A transition in liquid crystal-aerosil gels
Authors:
M. K. Ramazanoglu,
P. S. Clegg,
R. J. Birgeneau,
C. W. Garland,
M. E. Neubert,
J. M. Kim
Abstract:
The short-range order which remains when the isotropic to smectic-A transition is perturbed by a gel of silica nanoparticles (aerosils) has been studied using high-resolution synchrotron x-ray diffraction. The gels have been created \textit{in situ} in decylcyanobiphenyl (10CB), which has a strongly first-order isotropic to smectic-A transition. The effects are determined by detailed analysis of…
▽ More
The short-range order which remains when the isotropic to smectic-A transition is perturbed by a gel of silica nanoparticles (aerosils) has been studied using high-resolution synchrotron x-ray diffraction. The gels have been created \textit{in situ} in decylcyanobiphenyl (10CB), which has a strongly first-order isotropic to smectic-A transition. The effects are determined by detailed analysis of the temperature and gel density dependence of the smectic structure factor. In previous studies of the continuous nematic to smectic-A transition in a variety of thermotropic liquid crystals the aerosil gel appeared to pin, at random, the phase of the smectic density modulation. For the isotropic to smectic-A transition the same gel perturbation yields different results. The smectic correlation length decreases more slowly with increasing random field variance in good quantitative agreement with the effect of a random pinning field at a transition from a uniform phase directly to a phase with one-dimensional translational order. We thus compare the influence of random fields on a \textit{freezing} transition with and without an intervening orientationally ordered phase.
△ Less
Submitted 27 November, 2003;
originally announced November 2003.
-
Effect of Long-Range Interactions in the Conserved Kardar-Parisi-Zhang Equation
Authors:
Youngkyun Jung,
In-mook Kim,
** Min Kim
Abstract:
The conserved Kardar-Parisi-Zhang equation in the presence of long-range nonlinear interactions is studied by the dynamic renormalization group method. The long-range effect produces new fixed points with continuously varying exponents and gives distinct phase transitions, depending on both the long-range interaction strength and the substrate dimension $d$. The long-range interaction makes the…
▽ More
The conserved Kardar-Parisi-Zhang equation in the presence of long-range nonlinear interactions is studied by the dynamic renormalization group method. The long-range effect produces new fixed points with continuously varying exponents and gives distinct phase transitions, depending on both the long-range interaction strength and the substrate dimension $d$. The long-range interaction makes the surface width less rough than that of the short-range interaction. In particular, the surface becomes a smooth one with a negative roughness exponent at the physical dimension d=2.
△ Less
Submitted 2 June, 1998;
originally announced June 1998.
-
Instability, Intermittency and Multiscaling in Discrete Growth Models of Kinetic Roughening
Authors:
C. Dasgupta,
J. M. Kim,
M. Dutta,
S. Das Sarma
Abstract:
We show by numerical simulations that discretized versions of commonly studied continuum nonlinear growth equations (such as the Kardar-Parisi-Zhang equation and the Lai-Das Sarma equation) and related atomistic models of epitaxial growth have a generic instability in which isolated pillars (or grooves) on an otherwise flat interface grow in time when their height (or depth) exceeds a critical v…
▽ More
We show by numerical simulations that discretized versions of commonly studied continuum nonlinear growth equations (such as the Kardar-Parisi-Zhang equation and the Lai-Das Sarma equation) and related atomistic models of epitaxial growth have a generic instability in which isolated pillars (or grooves) on an otherwise flat interface grow in time when their height (or depth) exceeds a critical value. Depending on the details of the model, the instability found in the discretized version may or may not be present in the truly continuum growth equation, indicating that the behavior of discretized nonlinear growth equations may be very different from that of their continuum counterparts. This instability can be controlled either by the introduction of higher-order nonlinear terms with appropriate coefficients or by restricting the growth of pillars (or grooves) by other means. A number of such ``controlled instability'' models are studied by simulation. For appropriate choice of the parameters used for controlling the instability, these models exhibit intermittent behavior, characterized by multiexponent scaling of height fluctuations, over the time interval during which the instability is active. The behavior found in this regime is very similar to the ``turbulent'' behavior observed in recent simulations of several one- and two-dimensional atomistic models of epitaxial growth. [pacs{61.50.Cj, 68.55.Bd, 05.70.Ln, 64.60.Ht}]
△ Less
Submitted 28 July, 1996;
originally announced July 1996.
-
Defect Formation and Crossover Behavior in the Dynamic Scaling Properties of Molecular Beam Epitaxy
Authors:
S. DasSarma,
C. J. Lanczycki,
S. V. Ghaisas,
J. M. Kim
Abstract:
Stochastic simulation results, appropriate for Molecular Beam Epitaxy, involving ballistic deposition and thermally activated Arrhenius diffusion of adatoms are presented for one- and two-dimensional substrates, allowing for overhangs and bulk vacancies. The asymptotic Kardar-Parisi- Zhang universality is found to be triggered by a sudden nucleation of large-scale defect formation in the growing…
▽ More
Stochastic simulation results, appropriate for Molecular Beam Epitaxy, involving ballistic deposition and thermally activated Arrhenius diffusion of adatoms are presented for one- and two-dimensional substrates, allowing for overhangs and bulk vacancies. The asymptotic Kardar-Parisi- Zhang universality is found to be triggered by a sudden nucleation of large-scale defect formation in the growing film that shows a distinct dependence on dimensionality. The pre-nucleation transient behavior, which may be of experimental relevance due to the low defect content, is associated with standard solid-on-solid universality classes.
△ Less
Submitted 20 January, 1994;
originally announced January 1994.
-
Generalizations of the KPZ equation (minor changes to enable easy latexing)
Authors:
J. P. Doherty,
M. A. Moore,
J. M. Kim,
A. J. Bray
Abstract:
We generalize the KPZ equation to an O(3) $N=2j+1$ component model. In the limit $N \to \infty$ we show that the mode coupling equations become exact. Solving these approximately we find that the dynamic exponent $z$ increases from $3/2$ for $d=1$ to $2$ at the dimension $d\approx3.6$. For $d=1$ it can be shown analytically that $z=3/2$ for all $j$. The case $j=2$ for $d=2$ is investigated by nu…
▽ More
We generalize the KPZ equation to an O(3) $N=2j+1$ component model. In the limit $N \to \infty$ we show that the mode coupling equations become exact. Solving these approximately we find that the dynamic exponent $z$ increases from $3/2$ for $d=1$ to $2$ at the dimension $d\approx3.6$. For $d=1$ it can be shown analytically that $z=3/2$ for all $j$. The case $j=2$ for $d=2$ is investigated by numerical integration of the KPZ equation.
△ Less
Submitted 30 July, 1993; v1 submitted 26 July, 1993;
originally announced July 1993.