-
Linguistically Conditioned Semantic Textual Similarity
Authors:
**gxuan Tu,
Keer Xu,
Liulu Yue,
Bingyang Ye,
Kyeongmin Rim,
James Pustejovsky
Abstract:
Semantic textual similarity (STS) is a fundamental NLP task that measures the semantic similarity between a pair of sentences. In order to reduce the inherent ambiguity posed from the sentences, a recent work called Conditional STS (C-STS) has been proposed to measure the sentences' similarity conditioned on a certain aspect. Despite the popularity of C-STS, we find that the current C-STS dataset…
▽ More
Semantic textual similarity (STS) is a fundamental NLP task that measures the semantic similarity between a pair of sentences. In order to reduce the inherent ambiguity posed from the sentences, a recent work called Conditional STS (C-STS) has been proposed to measure the sentences' similarity conditioned on a certain aspect. Despite the popularity of C-STS, we find that the current C-STS dataset suffers from various issues that could impede proper evaluation on this task. In this paper, we reannotate the C-STS validation set and observe an annotator discrepancy on 55% of the instances resulting from the annotation errors in the original label, ill-defined conditions, and the lack of clarity in the task definition. After a thorough dataset analysis, we improve the C-STS task by leveraging the models' capability to understand the conditions under a QA task setting. With the generated answers, we present an automatic error identification pipeline that is able to identify annotation errors from the C-STS data with over 80% F1 score. We also propose a new method that largely improves the performance over baselines on the C-STS data by training the models with the answers. Finally we discuss the conditionality annotation based on the typed-feature structure (TFS) of entity types. We show in examples that the TFS is able to provide a linguistic foundation for constructing C-STS data with new conditions.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Variational non-Bayesian inference of the Probability Density Function in the Wiener Algebra
Authors:
U ** Choi,
Kyung Soo Rim
Abstract:
This paper presents a research study focused on uncovering the hidden population distribution from the viewpoint of a variational non-Bayesian approach. It asserts that if the hidden probability density function (PDF) has continuous partial derivatives of at least half the dimension's order, it can be perfectly reconstructed from a stationary ergodic process: First, we establish that if the PDF be…
▽ More
This paper presents a research study focused on uncovering the hidden population distribution from the viewpoint of a variational non-Bayesian approach. It asserts that if the hidden probability density function (PDF) has continuous partial derivatives of at least half the dimension's order, it can be perfectly reconstructed from a stationary ergodic process: First, we establish that if the PDF belongs to the Wiener algebra, its canonical ensemble form is uniquely determined through the Fréchet differentiation of the Kullback-Leibler divergence, aiming to minimize their cross-entropy. Second, we utilize the result that the differentiability of the PDF implies its membership in the Wiener algebra. Third, as the energy function of the canonical ensemble is defined as a series, the problem transforms into finding solutions to the equations of analytic series for the coefficients in the energy function. Naturally, through the use of truncated polynomial series and by demonstrating the convergence of partial sums of the energy function, we ensure the efficiency of approximation with a finite number of data points. Finally, through numerical experiments, we approximate the PDF from a random sample obtained from a bivariate normal distribution and also provide approximations for the mean and covariance from the PDF. This study substantiates the excellence of its results and their practical applicability.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
An Algorithm for Approximating Implicit Functions by Polynomials without Higher-Order Differentiability
Authors:
Kyung Soo Rim
Abstract:
We consider an equation of multiple variables in which a partial derivative does not vanish at a point. The implicit function theorem provides a local existence and uniqueness of the function for the equation. In this paper, we propose an algorithm to approximate the function by a polynomial without using higher-order differentiability, which depends essentially on integrability. Moreover, we exte…
▽ More
We consider an equation of multiple variables in which a partial derivative does not vanish at a point. The implicit function theorem provides a local existence and uniqueness of the function for the equation. In this paper, we propose an algorithm to approximate the function by a polynomial without using higher-order differentiability, which depends essentially on integrability. Moreover, we extend the method to a system of equations if the Jacobian determinant does not vanish. This is a robust method for implicit functions that are not differentiable to higher-order. Additionally, we present two numerical experiments to verify the theoretical results.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Dense Paraphrasing for Textual Enrichment
Authors:
**gxuan Tu,
Kyeongmin Rim,
Eben Holderness,
James Pustejovsky
Abstract:
Understanding inferences and answering questions from text requires more than merely recovering surface arguments, adjuncts, or strings associated with the query terms. As humans, we interpret sentences as contextualized components of a narrative or discourse, by both filling in missing information, and reasoning about event consequences. In this paper, we define the process of rewriting a textual…
▽ More
Understanding inferences and answering questions from text requires more than merely recovering surface arguments, adjuncts, or strings associated with the query terms. As humans, we interpret sentences as contextualized components of a narrative or discourse, by both filling in missing information, and reasoning about event consequences. In this paper, we define the process of rewriting a textual expression (lexeme or phrase) such that it reduces ambiguity while also making explicit the underlying semantics that is not (necessarily) expressed in the economy of sentence structure as Dense Paraphrasing (DP). We build the first complete DP dataset, provide the scope and design of the annotation task, and present results demonstrating how this DP process can enrich a source text to improve inferencing and QA task performance. The data and the source code will be publicly available.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
Analytic Implicit Functions
Authors:
Kyung Soo Rim
Abstract:
In this paper, we introduce a method of converting implicit equations to the usual forms of functions locally without differentiability. For a system of implicit equations which are equipped with continuous functions, if there are unique analytic implicit functions, that satisfies the system in some rectangle, then each analytic function is represented as a power series which is the weak-star limi…
▽ More
In this paper, we introduce a method of converting implicit equations to the usual forms of functions locally without differentiability. For a system of implicit equations which are equipped with continuous functions, if there are unique analytic implicit functions, that satisfies the system in some rectangle, then each analytic function is represented as a power series which is the weak-star limit of partial sums in the space of essentially bounded functions. We also provide numerical examples in order to demonstrate how the theoretical results in this article can be applied in practice and to show the effectiveness of the suggested approaches.
△ Less
Submitted 10 July, 2022;
originally announced July 2022.
-
Designing Multimodal Datasets for NLP Challenges
Authors:
James Pustejovsky,
Eben Holderness,
**gxuan Tu,
Parker Glenn,
Kyeongmin Rim,
Kelley Lynch,
Richard Brutti
Abstract:
In this paper, we argue that the design and development of multimodal datasets for natural language processing (NLP) challenges should be enhanced in two significant respects: to more broadly represent commonsense semantic inferences; and to better reflect the dynamics of actions and events, through a substantive alignment of textual and visual information. We identify challenges and tasks that ar…
▽ More
In this paper, we argue that the design and development of multimodal datasets for natural language processing (NLP) challenges should be enhanced in two significant respects: to more broadly represent commonsense semantic inferences; and to better reflect the dynamics of actions and events, through a substantive alignment of textual and visual information. We identify challenges and tasks that are reflective of linguistic and cognitive competencies that humans have when speaking and reasoning, rather than merely the performance of systems on isolated tasks. We introduce the distinction between challenge-based tasks and competence-based performance, and describe a diagnostic dataset, Recipe-to-Video Questions (R2VQ), designed for testing competence-based comprehension over a multimodal recipe collection (http://r2vq.org/). The corpus contains detailed annotation supporting such inferencing tasks and facilitating a rich set of question families that we use to evaluate NLP systems.
△ Less
Submitted 12 May, 2021;
originally announced May 2021.
-
Influence of M/A substitution on material properties of intermetallic compounds MSn$_2$ (M = Fe, Co; A = Li, Na): A first-principles study
Authors:
Chol-Jun Yu,
Un-Song Hwang,
Yong-Chol Pak,
Kyonga Rim,
Chol Ryu,
Chon-Ryong Mun,
Un-Gi Jong
Abstract:
Iron and cobalt distannides \ce{MSn2} (M = Fe, Co) are regarded as a promising conversion-type anode material for lithium- and sodium-ion batteries, but their properties are not well understood. In this work, we report a first-principles study of alkali metal (A = Li, Na) substitutional effect on the structural, mechanical, lattice vibrational, electronic and defect properties of these distannides…
▽ More
Iron and cobalt distannides \ce{MSn2} (M = Fe, Co) are regarded as a promising conversion-type anode material for lithium- and sodium-ion batteries, but their properties are not well understood. In this work, we report a first-principles study of alkali metal (A = Li, Na) substitutional effect on the structural, mechanical, lattice vibrational, electronic and defect properties of these distannides. Special attention is paid to systematic comparison between \ce{FeSn2} and \ce{CoSn2}. Our calculations reveal that M/A substitution induces a lattice expansion and decrease of elastic constants, which is more announced with Na substitution than Li, and moreover changes the elastic property of \ce{FeSn2} from ductile to brittle whereas preserves the ductility of \ce{CoSn2}. An imaginary phonon frequency mode appears only for \ce{FeSn2} and \ce{FeNaSn2}, and M/A substitution provokes a definite gap between high and low frequency regions. We perform a careful analysis of electronic density of states, band structures and Fermi surface, providing an insight into difference of electronic structures between \ce{FeSn2} and \ce{CoSn2}. With further calculation of defect formation energies and alkali ion diffusion barriers, we believe this work can be useful to design conversion-type anode materials for alkali-ion batteries.
△ Less
Submitted 23 July, 2020;
originally announced July 2020.
-
Probabilistic Neural Network: Frequency and Moment Learnings
Authors:
Kyung Soo Rim,
U ** Choi
Abstract:
We introduce probabilistic neural networks that describe unsupervised synchronous learning on an atomic Hardy space and space of bounded real analytic functions, respectively. For a stationary ergodic vector process, we prove that the probabilistic neural network yields a unique collection of neurons in global optimization without initialization and back-propagation. During learning, we show that…
▽ More
We introduce probabilistic neural networks that describe unsupervised synchronous learning on an atomic Hardy space and space of bounded real analytic functions, respectively. For a stationary ergodic vector process, we prove that the probabilistic neural network yields a unique collection of neurons in global optimization without initialization and back-propagation. During learning, we show that all neurons communicate with each other, in the sense of linear combinations, until the learning is finished. Also, we give convergence results for the stability of neurons, estimation methods, and topological statistics to appreciate unsupervised estimation of a probabilistic neural network. As application, we attach numerical experiments on samples drawn by a standing wave.
△ Less
Submitted 22 April, 2020;
originally announced April 2020.
-
Multimodal Interactive Learning of Primitive Actions
Authors:
Tuan Do,
Nikhil Krishnaswamy,
Kyeongmin Rim,
James Pustejovsky
Abstract:
We describe an ongoing project in learning to perform primitive actions from demonstrations using an interactive interface. In our previous work, we have used demonstrations captured from humans performing actions as training samples for a neural network-based trajectory model of actions to be performed by a computational agent in novel setups. We found that our original framework had some limitat…
▽ More
We describe an ongoing project in learning to perform primitive actions from demonstrations using an interactive interface. In our previous work, we have used demonstrations captured from humans performing actions as training samples for a neural network-based trajectory model of actions to be performed by a computational agent in novel setups. We found that our original framework had some limitations that we hope to overcome by incorporating communication between the human and the computational agent, using the interaction between them to fine-tune the model learned by the machine. We propose a framework that uses multimodal human-computer interaction to teach action concepts to machines, making use of both live demonstration and communication through natural language, as two distinct teaching modalities, while requiring few training samples.
△ Less
Submitted 1 October, 2018;
originally announced October 2018.
-
Software Cognitive Complexity Measure Based on Scope of Variables
Authors:
Kwangmyong Rim,
Yonghua Choe
Abstract:
In this paper, we define a Mathematical model of program structure. Mathematical model of program structure defined here provides unified mathematical treatment of program structure, which reveals that a program is a large and finite set of embedded binary relations between current statement and previous ones. Then, a program is considered as a composed listing and a logical combination of multipl…
▽ More
In this paper, we define a Mathematical model of program structure. Mathematical model of program structure defined here provides unified mathematical treatment of program structure, which reveals that a program is a large and finite set of embedded binary relations between current statement and previous ones. Then, a program is considered as a composed listing and a logical combination of multiple statements according to the certain composing rules. We also define the Scope Information Complexity Number (SICN) and present the cognitive complexity based on functional decomposition of software, including theoretical validation through nine Weyuker's properties.
△ Less
Submitted 17 September, 2014;
originally announced September 2014.
-
Visualizing Individual Nitrogen Dopants in Monolayer Graphene
Authors:
Liuyan Zhao,
Rui He,
Kwang Taeg Rim,
Theanne Schiros,
Keun Soo Kim,
Hui Zhou,
Christopher Gutiérrez,
S. P. Chockalingam,
Carlos J. Arguello,
Lucia Pálová,
Dennis Nordlund,
Mark S. Hybertsen,
David R. Reichman,
Tony F. Heinz,
Philip Kim,
Aron Pinczuk,
George W. Flynn,
Abhay N. Pasupathy
Abstract:
In monolayer graphene, substitutional do** during growth can be used to alter its electronic properties. We used scanning tunneling microscopy (STM), Raman spectroscopy, x-ray spectroscopy, and first principles calculations to characterize individual nitrogen dopants in monolayer graphene grown on a copper substrate. Individual nitrogen atoms were incorporated as graphitic dopants, and a fractio…
▽ More
In monolayer graphene, substitutional do** during growth can be used to alter its electronic properties. We used scanning tunneling microscopy (STM), Raman spectroscopy, x-ray spectroscopy, and first principles calculations to characterize individual nitrogen dopants in monolayer graphene grown on a copper substrate. Individual nitrogen atoms were incorporated as graphitic dopants, and a fraction of the extra electron on each nitrogen atom was delocalized into the graphene lattice. The electronic structure of nitrogen-doped graphene was strongly modified only within a few lattice spacings of the site of the nitrogen dopant. These findings show that chemical do** is a promising route to achieving high-quality graphene films with a large carrier concentration.
△ Less
Submitted 19 August, 2011;
originally announced August 2011.
-
Stability of Localized Integral Operators on Weighted $L^p$ spaces
Authors:
Kyung Soo Rim,
Chang Eon Shin,
Qiyu Sun
Abstract:
In this paper, we consider localized integral operators whose kernels have mild singularity near the diagonal and certain Holder regularity and decay off the diagonal. Our model example is the Bessel potential operator ${\mathcal J}_γ, γ>0$. We show that if such a localized integral operator has stability on a weighted function space $L^p_w$ for some $p\in [1, \infty)$ and Muckenhoupt $A_p$-weight…
▽ More
In this paper, we consider localized integral operators whose kernels have mild singularity near the diagonal and certain Holder regularity and decay off the diagonal. Our model example is the Bessel potential operator ${\mathcal J}_γ, γ>0$. We show that if such a localized integral operator has stability on a weighted function space $L^p_w$ for some $p\in [1, \infty)$ and Muckenhoupt $A_p$-weight $w$, then it has stability on weighted function spaces $L^{p'}_{w'}$ for all $1\le p'<\infty$ and Muckenhoupt $A_{p'}$-weights $w'$.
△ Less
Submitted 9 July, 2011;
originally announced July 2011.
-
The Atomic-scale Growth of Large-Area Monolayer Graphene on Single-Crystal Copper Substrates
Authors:
L. Zhao,
K. T. Rim,
H. Zhou,
R. He,
T. F. Heinz,
A. Pinczuk,
G. W. Flynn,
A. N. Pasupathy
Abstract:
We study the growth and microscopic structure of large-area graphene monolayers, grown on copper single crystals by chemical vapor deposition (CVD) in ultra-high vacuum (UHV). Using atomic-resolution scanning tunneling microscopy (STM), we find that graphene grows primarily in registry with the underlying copper lattice for both Cu(111) and Cu(100). The graphene has a hexagonal superstructure on C…
▽ More
We study the growth and microscopic structure of large-area graphene monolayers, grown on copper single crystals by chemical vapor deposition (CVD) in ultra-high vacuum (UHV). Using atomic-resolution scanning tunneling microscopy (STM), we find that graphene grows primarily in registry with the underlying copper lattice for both Cu(111) and Cu(100). The graphene has a hexagonal superstructure on Cu(111) with a significant electronic component, whereas it has a linear superstructure on Cu(100). The film quality is limited by grain boundaries, and the best growth is obtained on the Cu(111) surface.
△ Less
Submitted 20 August, 2010;
originally announced August 2010.
-
High-Resolution Scanning Tunneling Microscopy Imaging of Mesoscopic Graphene Sheets on an Insulating Surface
Authors:
Elena Stolyarova,
Kwang Taeg Rim,
Sunmin Ryu,
Janina Maultzsch,
Philip Kim,
Louis E. Brus,
Tony F. Heinz,
Mark S. Hybertsen,
George W. Flynn
Abstract:
We present scanning tunneling microscopy (STM) images of single-layer graphene crystals examined under ultrahigh vacuum conditions. The samples, with lateral dimensions on the micron scale, were prepared on a silicon dioxide surface by direct exfoliation of single crystal graphite. The single-layer films were identified using Raman spectroscopy. Topographic images of single-layer samples display…
▽ More
We present scanning tunneling microscopy (STM) images of single-layer graphene crystals examined under ultrahigh vacuum conditions. The samples, with lateral dimensions on the micron scale, were prepared on a silicon dioxide surface by direct exfoliation of single crystal graphite. The single-layer films were identified using Raman spectroscopy. Topographic images of single-layer samples display the honeycomb structure expected for the full hexagonal symmetry of an isolated graphene monolayer. The absence of observable defects in the STM images is indicative of the high quality of these films. Crystals comprised of a few layers of graphene were also examined. They exhibited dramatically different STM topography, displaying the reduced three-fold symmetry characteristic of the surface of bulk graphite.
△ Less
Submitted 6 May, 2007;
originally announced May 2007.