-
CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
Authors:
Donghee Choi,
Mogan Gim,
Donghyeon Park,
Mujeen Sung,
Hyunjae Kim,
Jaewoo Kang,
Jihun Choi
Abstract:
This paper introduces CookingSense, a descriptive collection of knowledge assertions in the culinary domain extracted from various sources, including web data, scientific papers, and recipes, from which knowledge covering a broad range of aspects is acquired. CookingSense is constructed through a series of dictionary-based filtering and language model-based semantic filtering techniques, which res…
▽ More
This paper introduces CookingSense, a descriptive collection of knowledge assertions in the culinary domain extracted from various sources, including web data, scientific papers, and recipes, from which knowledge covering a broad range of aspects is acquired. CookingSense is constructed through a series of dictionary-based filtering and language model-based semantic filtering techniques, which results in a rich knowledgebase of multidisciplinary food-related assertions. Additionally, we present FoodBench, a novel benchmark to evaluate culinary decision support systems. From evaluations with FoodBench, we empirically prove that CookingSense improves the performance of retrieval augmented language models. We also validate the quality and variety of assertions in CookingSense through qualitative analysis.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
MolPLA: A Molecular Pretraining Framework for Learning Cores, R-Groups and their Linker Joints
Authors:
Mogan Gim,
Jueon Park,
Soyon Park,
Sanghoon Lee,
Seungheun Baek,
Junhyun Lee,
Ngoc-Quang Nguyen,
Jaewoo Kang
Abstract:
Molecular core structures and R-groups are essential concepts in drug development. Integration of these concepts with conventional graph pre-training approaches can promote deeper understanding in molecules. We propose MolPLA, a novel pre-training framework that employs masked graph contrastive learning in understanding the underlying decomposable parts inmolecules that implicate their core struct…
▽ More
Molecular core structures and R-groups are essential concepts in drug development. Integration of these concepts with conventional graph pre-training approaches can promote deeper understanding in molecules. We propose MolPLA, a novel pre-training framework that employs masked graph contrastive learning in understanding the underlying decomposable parts inmolecules that implicate their core structure and peripheral R-groups. Furthermore, we formulate an additional framework that grants MolPLA the ability to help chemists find replaceable R-groups in lead optimization scenarios. Experimental results on molecular property prediction show that MolPLA exhibits predictability comparable to current state-of-the-art models. Qualitative analysis implicate that MolPLA is capable of distinguishing core and R-group sub-structures, identifying decomposable regions in molecules and contributing to lead optimization scenarios by rationally suggesting R-group replacements given various query core templates. The code implementation for MolPLA and its pre-trained model checkpoint is available at https://github.com/dmis-lab/MolPLA
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
KitchenScale: Learning to predict ingredient quantities from recipe contexts
Authors:
Donghee Choi,
Mogan Gim,
Samy Badreddine,
Hajung Kim,
Donghyeon Park,
Jaewoo Kang
Abstract:
Determining proper quantities for ingredients is an essential part of cooking practice from the perspective of enriching tastiness and promoting healthiness. We introduce KitchenScale, a fine-tuned Pre-trained Language Model (PLM) that predicts a target ingredient's quantity and measurement unit given its recipe context. To effectively train our KitchenScale model, we formulate an ingredient quant…
▽ More
Determining proper quantities for ingredients is an essential part of cooking practice from the perspective of enriching tastiness and promoting healthiness. We introduce KitchenScale, a fine-tuned Pre-trained Language Model (PLM) that predicts a target ingredient's quantity and measurement unit given its recipe context. To effectively train our KitchenScale model, we formulate an ingredient quantity prediction task that consists of three sub-tasks which are ingredient measurement type classification, unit classification, and quantity regression task. Furthermore, we utilized transfer learning of cooking knowledge from recipe texts to PLMs. We adopted the Discrete Latent Exponent (DExp) method to cope with high variance of numerical scales in recipe corpora. Experiments with our newly constructed dataset and recommendation examples demonstrate KitchenScale's understanding of various recipe contexts and generalizability in predicting ingredient quantities. We implemented a web application for KitchenScale to demonstrate its functionality in recommending ingredient quantities expressed in numerals (e.g., 2) with units (e.g., ounce).
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
RecipeMind: Guiding Ingredient Choices from Food Pairing to Recipe Completion using Cascaded Set Transformer
Authors:
Mogan Gim,
Donghee Choi,
Kana Maruyama,
Jihun Choi,
Hajung Kim,
Donghyeon Park,
Jaewoo Kang
Abstract:
We propose a computational approach for recipe ideation, a downstream task that helps users select and gather ingredients for creating dishes. To perform this task, we developed RecipeMind, a food affinity score prediction model that quantifies the suitability of adding an ingredient to set of other ingredients. We constructed a large-scale dataset containing ingredient co-occurrence based scores…
▽ More
We propose a computational approach for recipe ideation, a downstream task that helps users select and gather ingredients for creating dishes. To perform this task, we developed RecipeMind, a food affinity score prediction model that quantifies the suitability of adding an ingredient to set of other ingredients. We constructed a large-scale dataset containing ingredient co-occurrence based scores to train and evaluate RecipeMind on food affinity score prediction. Deployed in recipe ideation, RecipeMind helps the user expand an initial set of ingredients by suggesting additional ingredients. Experiments and qualitative analysis show RecipeMind's potential in fulfilling its assistive role in cuisine domain.
△ Less
Submitted 14 October, 2022;
originally announced October 2022.
-
Topological defects and geometric memory across the nematic-smectic A liquid crystal phase transition
Authors:
Ahram Suh,
Min-Jun Gim,
Daniel Beller,
Dong Ki Yoon
Abstract:
We study transformations of self-organized defect arrays at the nematic-smectic A liquid crystal phase transition, and show that these defect configurations are correlated, or "remembered", across the phase transition. A thin film of thermotropic liquid crystal is subjected to hybrid anchoring by an air interface and a water substrate, and viewed under polarized optical microscopy. Upon heating fr…
▽ More
We study transformations of self-organized defect arrays at the nematic-smectic A liquid crystal phase transition, and show that these defect configurations are correlated, or "remembered", across the phase transition. A thin film of thermotropic liquid crystal is subjected to hybrid anchoring by an air interface and a water substrate, and viewed under polarized optical microscopy. Upon heating from smectic-A to nematic, a packing of focal conic domains melts into a dense array of boojums---nematic surface defects---which then coarsens by pair-annihilation. With the aid of Landau-de Gennes numerical modeling, we elucidate the topological and geometrical rules underlying this transformation. In the transition from nematic to smectic-A, we show that focal conic domain packings are organized over large scales in patterns that retain a geometric memory of the nematic boojum configuration, which can be recovered with remarkable fidelity.
△ Less
Submitted 10 April, 2019;
originally announced April 2019.
-
Conservativeness criteria for generalized Dirichlet forms
Authors:
Minjung Gim,
Gerald Trutnau
Abstract:
We develop sufficient analytic conditions for conservativeness of non-sectorial perturbations of symmetric Dirichlet forms which can be represented through a carré du champ on a locally compact separable metric space. These form an important subclass of generalized Dirichlet forms which were introduced in \cite{St1}. In case there exists an associated strong Feller process, the analytic conditions…
▽ More
We develop sufficient analytic conditions for conservativeness of non-sectorial perturbations of symmetric Dirichlet forms which can be represented through a carré du champ on a locally compact separable metric space. These form an important subclass of generalized Dirichlet forms which were introduced in \cite{St1}. In case there exists an associated strong Feller process, the analytic conditions imply conservativeness, i.e. non-explosion of the associated process in the classical probabilistic sense. As an application of our general results on locally compact separable metric state spaces, we consider a generalized Dirichlet form given on a closed or open subset of $\mathbb{R}^d$ which is given as a divergence free first order perturbation of a symmetric energy form. Then using volume growth conditions of the carré du champ and the non-sectorial first order part, we derive an explicit criterion for conservativeness. We present several concrete examples which relate our results to previous ones obtained by different authors. In particular, we show that conservativeness can hold for a cubic variance if the drift is strong enough to compensate it. This work continues our previous work on transience and recurrence of generalized Dirichlet forms.
△ Less
Submitted 27 November, 2016; v1 submitted 16 May, 2016;
originally announced May 2016.
-
Fast Fabrication of Sub-200-nm Nanogrooves using Liquid Crystal Material
Authors:
Dae Seok Kim,
Yun Jeong Cha,
Min-Jun Gim,
Dong Ki Yoon
Abstract:
Self-assembly of soft materials attracts keen interest for patterning applications owing to its ease and spontaneous behavior. We report the fabrication of nanogrooves using sublimation and recondensation of liquid crystal (LC) materials. First, well-aligned smectic LC structures are obtained on the micron-scale topographic patterns of the microchannel; then the sublimation and recondensation proc…
▽ More
Self-assembly of soft materials attracts keen interest for patterning applications owing to its ease and spontaneous behavior. We report the fabrication of nanogrooves using sublimation and recondensation of liquid crystal (LC) materials. First, well-aligned smectic LC structures are obtained on the micron-scale topographic patterns of the microchannel; then the sublimation and recondensation process directly produces nanogrooves having sub-200-nm scale. The entire process can be completed in less than 30 min. After it is replicated using an ultraviolet-curable polymer, our platform can be used as an alignment layer to control other guest LC materials.
△ Less
Submitted 25 April, 2016;
originally announced April 2016.
-
Recurrence criteria for generalized Dirichlet forms
Authors:
Minjung Gim,
Gerald Trutnau
Abstract:
We develop sufficient analytic conditions for recurrence and transience of non-sectorial perturbations of possibly non-symmetric Dirichlet forms on a general state space. These form an important subclass of generalized Dirichlet forms which were introduced in \cite{St1}. In case there exists an associated process, we show how the analytic conditions imply recurrence and transience in the classical…
▽ More
We develop sufficient analytic conditions for recurrence and transience of non-sectorial perturbations of possibly non-symmetric Dirichlet forms on a general state space. These form an important subclass of generalized Dirichlet forms which were introduced in \cite{St1}. In case there exists an associated process, we show how the analytic conditions imply recurrence and transience in the classical probabilistic sense. As an application, we consider a generalized Dirichlet form given on a closed or open subset of $\mathbb{R}^d$ which is given as a divergence free first order perturbation of a non-symmetric energy form. Then using volume growth conditions of the sectorial and non-sectorial first order part, we derive an explicit criterion for recurrence. Moreover, we present concrete examples with applications to Muckenhoupt weights and counterexamples. The counterexamples show that the non-sectorial case differs qualitatively from the symmetric or non-symmetric sectorial case. Namely, we make the observation that one of the main criteria for recurrence in these cases fails to be true for generalized Dirichlet forms.
△ Less
Submitted 20 July, 2017; v1 submitted 10 August, 2015;
originally announced August 2015.
-
Explicit recurrence criteria for symmetric gradient type Dirichlet forms satisfying a Hamza type condition
Authors:
Minjung Gim,
Gerald Trutnau
Abstract:
In this note, we present explicit conditions for symmetric gradient type Dirichlet forms to be recurrent. This type of Dirichlet form is typically strongly local and hence associated to a diffusion. We consider the one dimensional case and the multidimensional case, as well as the case with reflecting boundary conditions. Our main achievement is that the explicit results are obtained under quite w…
▽ More
In this note, we present explicit conditions for symmetric gradient type Dirichlet forms to be recurrent. This type of Dirichlet form is typically strongly local and hence associated to a diffusion. We consider the one dimensional case and the multidimensional case, as well as the case with reflecting boundary conditions. Our main achievement is that the explicit results are obtained under quite weak assumptions on the closability, hence regularity of the underlying coefficients. Especially in dimension one, where a Hamza type condition is assumed, the construction of the sequence of functions $(u_n)_{n\in \N}$ in the Dirichlet space that determine recurrence works for quite general Dirichlet forms but is still explicit.
△ Less
Submitted 1 August, 2013;
originally announced August 2013.
-
The Open Cluster NGC 7789: II. CCD VI Photometry
Authors:
Munhwan Gim,
Don. A. VandenBerg,
Peter B. Stetson,
James E. Hesser,
David R. Zurek
Abstract:
A (V,V - I)--diagram for the intermediate-age open cluster NGC 7789 has been derived from CCD observations of more than 15,000 stars within ~ 18 arcmin of the cluster center. From the brightest giants and blue stragglers at V ~ 11 to the faintest lower main-sequence stars that were observed (at V ~ 21, M_V ~ 9), the C-M diagram is well defined. A prominent clump of core helium-burning stars is e…
▽ More
A (V,V - I)--diagram for the intermediate-age open cluster NGC 7789 has been derived from CCD observations of more than 15,000 stars within ~ 18 arcmin of the cluster center. From the brightest giants and blue stragglers at V ~ 11 to the faintest lower main-sequence stars that were observed (at V ~ 21, M_V ~ 9), the C-M diagram is well defined. A prominent clump of core helium-burning stars is evident at V = 13.0 and the upper end of the main sequence shows a fairly pronounced curvature to the red, which is indicative of significant convective core overshooting. Indeed, comparisons with up-to-date stellar models show that it is not possible to explain the observed morphology in the vicinity of the turnoff unless the overshooting is quite extensive. Interestingly, if sufficient overshooting is assumed in order to match the main-sequence data, it is not possible to reproduce the cluster's extended giant branch unless the cluster age is at least 1.6 Gyr (assuming a metallicity in the range -0.2 <= [Fe/H] <= 0.0). This, in turn, requires that the cluster have an apparent distance modulus (m-M)_V <= 12.2. Thus, sometime within the past few hundred million years, the ignition of helium burning in NGC 7789 has switched from a quiescent to an explosive (``flash'') phenomenon, and the length of the cluster's red-giant branch has been steadily increasing with the passage of time since then. From main-sequence fits to models that have been carefully normalized to the Sun, we infer a reddening 0.35 <= E(V-I) <= 0.38.
△ Less
Submitted 12 August, 1998;
originally announced August 1998.
-
The Open Cluster NGC 7789: I. Radial Velocities for Giant Stars
Authors:
Munhwan Gim,
James E. Hesser,
Robert D. McClure,
Peter B. Stetson
Abstract:
A total of 597 radial-velocity observations for 112 stars in the ~1.6 Gyr old open cluster NGC 7789 have been obtained since 1979 with the radial velocity spectrometer at the Dominion Astrophysical Observatory. The mean cluster radial velocity is -54.9 +/- 0.12 km/s and the dispersion is 0.86 km/s, from 50 constant-velocity stars selected as members from this radial-velocity study and the proper…
▽ More
A total of 597 radial-velocity observations for 112 stars in the ~1.6 Gyr old open cluster NGC 7789 have been obtained since 1979 with the radial velocity spectrometer at the Dominion Astrophysical Observatory. The mean cluster radial velocity is -54.9 +/- 0.12 km/s and the dispersion is 0.86 km/s, from 50 constant-velocity stars selected as members from this radial-velocity study and the proper motion study of McNamara and Solomon (1981). Twenty-five stars (32%) among 78 members are possible radial-velocity variable stars, but no orbits are determined because of the sparse sampling. Seventeen stars are radial-velocity non-members, while membership estimates of six stars are uncertain.
There is a hint that the observed velocity dispersion falls off at large radius. This may due to the inclusion of long-period binaries preferentially in the central area of the cluster. The known radial-velocity variables also seem to be more concentrated toward the center than members with constant velocity. Although this is significant at only the 85% level, when combined with similar result of Raboud and Mermilliod (1994) for three other clusters, the data strongly support the conclusion that mass segregation is being detected.
△ Less
Submitted 6 July, 1998;
originally announced July 1998.