-
INDUS: Effective and Efficient Language Models for Scientific Applications
Authors:
Bishwaranjan Bhattacharjee,
Aashka Trivedi,
Masayasu Muraoka,
Muthukumaran Ramasubramanian,
Takuma Udagawa,
Iksha Gurung,
Rong Zhang,
Bharath Dandala,
Rahul Ramachandran,
Manil Maskey,
Kaylin Bugbee,
Mike Little,
Elizabeth Fancher,
Lauren Sanders,
Sylvain Costes,
Sergi Blanco-Cuaresma,
Kelly Lockhart,
Thomas Allen,
Felix Grezes,
Megan Ansdell,
Alberto Accomazzi,
Yousef El-Kurdi,
Davis Wertheimer,
Birgit Pfitzmann,
Cesar Berrospi Ramis
, et al. (9 additional authors not shown)
Abstract:
Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics, planetary sciences and astrophysics domains and trained using curated scientific corpora drawn from diverse data sources. The suite of models includes: (1) an encoder model trained using domain-specific vocabulary and corpora to address natural language understanding tasks, (2) a contrastive-learning-based general text embedding model trained using a diverse set of datasets drawn from multiple sources to address information retrieval tasks and (3) smaller versions of these models created using knowledge distillation techniques to address applications which have latency or resource constraints. We also created three new scientific benchmark datasets, namely CLIMATE-CHANGE-NER (entity-recognition), NASA-QA (extractive QA) and NASA-IR (IR), to accelerate research in these multi-disciplinary fields. Finally, we show that our models outperform both general-purpose encoders (RoBERTa) and existing domain-specific encoders (SciBERT) on these new tasks as well as existing benchmark tasks in the domains of interest.
Submitted 20 May, 2024; v1 submitted 17 May, 2024;
originally announced May 2024.
-
Solutions of the tt*-equations constructed from the SU(2)$_k$-fusion ring, and Smyth potentials
Authors:
Tadashi Udagawa
Abstract:
Cecotti and Vafa introduced the tt*-equation (topological-antitopological fusion equation), whose solutions describe massive deformations of supersymmetric conformal field theories. We describe some solutions of the tt*-equation constructed from the SU(2)$_k$-fusion algebra. The idea of the construction is due to Cecotti and Vafa, but we give a precise mathematical formulation and a description of the "holomorphic data" corresponding to the solutions by using the DPW method. Furthermore, we give a relation between the solutions and the representations of SU(2). As a special case, we consider the solutions corresponding to the supersymmetric A$_k$-minimal model.
Submitted 4 November, 2023;
originally announced November 2023.
-
Globality of the DPW construction for Smyth potentials in the case of SU(1,1)
Authors:
Tadashi Udagawa
Abstract:
We construct harmonic maps into SU(1,1)/U(1) starting from Smyth potentials ξ by the DPW method. In this method, harmonic maps are obtained from the Iwasawa factorization of a solution L of L^{-1} dL = ξ. However, the Iwasawa factorization in the case of a noncompact group is not always global. We show that L can be expressed in terms of Bessel functions, and from the asymptotic expansion of Bessel functions we solve a Riemann-Hilbert problem to give a global Iwasawa factorization. In this way we give a more direct proof than in the work of Dorfmeister-Guest-Rossman (2010), while avoiding the general isomonodromy theory used by Guest-Its-Lin (2015).
Submitted 20 October, 2023;
originally announced October 2023.
-
A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models
Authors:
Takuma Udagawa,
Aashka Trivedi,
Michele Merler,
Bishwaranjan Bhattacharjee
Abstract:
Large language models have become a vital component in modern NLP, achieving state-of-the-art performance in a variety of tasks. However, they are often inefficient for real-world deployment due to their expensive inference costs. Knowledge distillation is a promising technique to improve their efficiency while retaining most of their effectiveness. In this paper, we reproduce, compare and analyze several representative methods for task-agnostic (general-purpose) distillation of Transformer language models. Our target of study includes Output Distribution (OD) transfer, Hidden State (HS) transfer with various layer mapping strategies, and Multi-Head Attention (MHA) transfer based on MiniLMv2. Through our extensive experiments, we study the effectiveness of each method for various student architectures in both monolingual (English) and multilingual settings. Overall, we show that MHA transfer based on MiniLMv2 is generally the best option for distillation and explain the potential reasons behind its success. Moreover, we show that HS transfer remains a competitive baseline, especially under a sophisticated layer mapping strategy, while OD transfer consistently lags behind other approaches. Findings from this study helped us deploy efficient yet effective student models for latency-critical applications.
Submitted 12 October, 2023;
originally announced October 2023.
-
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems
Authors:
Takuma Udagawa,
Masayuki Suzuki,
Gakuto Kurata,
Masayasu Muraoka,
George Saon
Abstract:
Transferring the knowledge of large language models (LLMs) is a promising technique to incorporate linguistic knowledge into end-to-end automatic speech recognition (ASR) systems. However, existing works only transfer a single representation of LLM (e.g. the last layer of pretrained BERT), while the representation of a text is inherently non-unique and can be obtained variously from different layers, contexts and models. In this work, we explore a wide range of techniques to obtain and transfer multiple representations of LLMs into a transducer-based ASR system. While being conceptually simple, we show that transferring multiple representations of LLMs can be an effective alternative to transferring only a single representation.
Submitted 25 December, 2023; v1 submitted 7 September, 2023;
originally announced September 2023.
-
Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models
Authors:
Aashka Trivedi,
Takuma Udagawa,
Michele Merler,
Rameswar Panda,
Yousef El-Kurdi,
Bishwaranjan Bhattacharjee
Abstract:
Large pretrained language models have achieved state-of-the-art results on a variety of downstream tasks. Knowledge Distillation (KD) into a smaller student model addresses their inefficiency, allowing for deployment in resource-constrained environments. However, KD can be ineffective when the student is manually selected from a set of existing options, since it can be a sub-optimal choice within the space of all possible student architectures. We develop multilingual KD-NAS, the use of Neural Architecture Search (NAS) guided by KD to find the optimal student architecture for task-agnostic distillation from a multilingual teacher. In each episode of the search process, a NAS controller predicts a reward based on the distillation loss and latency of inference. The top candidate architectures are then distilled from the teacher on a small proxy set. Finally, the architecture(s) with the highest reward are selected and distilled on the full training corpus. KD-NAS can automatically trade off efficiency and effectiveness, and recommends architectures suitable to various latency budgets. Using our multi-layer hidden state distillation process, our KD-NAS student model achieves a 7x speedup on CPU inference (2x on GPU) compared to an XLM-Roberta Base Teacher, while maintaining 90% performance, and has been deployed in 3 software offerings requiring large throughput, low latency and deployment on CPU.
Submitted 13 October, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Sentence Identification with BOS and EOS Label Combinations
Authors:
Takuma Udagawa,
Hiroshi Kanayama,
Issei Yoshida
Abstract:
The sentence is a fundamental unit in many NLP applications. Sentence segmentation is widely used as the first preprocessing task, where an input text is split into consecutive sentences considering the end of the sentence (EOS) as their boundaries. This task formulation relies on a strong assumption that the input text consists only of sentences, or what we call the sentential units (SUs). However, real-world texts often contain non-sentential units (NSUs) such as metadata, sentence fragments, nonlinguistic markers, etc. which are unreasonable or undesirable to be treated as a part of an SU. To tackle this issue, we formulate a novel task of sentence identification, where the goal is to identify SUs while excluding NSUs in a given text. To conduct sentence identification, we propose a simple yet effective method which combines the beginning of the sentence (BOS) and EOS labels to determine the most probable SUs and NSUs based on dynamic programming. To evaluate this task, we design an automatic, language-independent procedure to convert the Universal Dependencies corpora into sentence identification benchmarks. Finally, our experiments on the sentence identification task demonstrate that our proposed method generally outperforms sentence segmentation baselines which only utilize EOS labels.
Submitted 30 January, 2023;
originally announced January 2023.
-
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Authors:
Takuma Udagawa,
Haruka Kiyohara,
Yusuke Narita,
Yuta Saito,
Kei Tateno
Abstract:
Off-policy evaluation (OPE) aims to accurately evaluate the performance of counterfactual policies using only offline logged data. Although many estimators have been developed, there is no single estimator that dominates the others, because the estimators' accuracy can vary greatly depending on a given OPE task such as the evaluation policy, number of actions, and noise level. Thus, the data-driven estimator selection problem is becoming increasingly important and can have a significant impact on the accuracy of OPE. However, identifying the most accurate estimator using only the logged data is quite challenging because the ground-truth estimation accuracy of estimators is generally unavailable. This paper studies this challenging problem of estimator selection for OPE for the first time. In particular, we enable an estimator selection that is adaptive to a given OPE task, by appropriately subsampling available logged data and constructing pseudo policies useful for the underlying estimator selection task. Comprehensive experiments on both synthetic and real-world company data demonstrate that the proposed procedure substantially improves the estimator selection compared to a non-adaptive heuristic.
Submitted 29 January, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Authors:
Takuma Udagawa,
Masayuki Suzuki,
Gakuto Kurata,
Nobuyasu Itoh,
George Saon
Abstract:
Large-scale language models (LLMs) such as GPT-2, BERT and RoBERTa have been successfully applied to ASR N-best rescoring. However, whether or how they can benefit competitive, near state-of-the-art ASR systems remains unexplored. In this study, we incorporate LLM rescoring into one of the most competitive ASR baselines: the Conformer-Transducer model. We demonstrate that consistent improvement is achieved by the LLM's bidirectionality, pretraining, in-domain finetuning and context augmentation. Furthermore, our lexical analysis sheds light on how each of these components may be contributing to the ASR performance.
Submitted 18 August, 2022; v1 submitted 1 April, 2022;
originally announced April 2022.
-
Data-Driven Off-Policy Estimator Selection: An Application in User Marketing on An Online Content Delivery Service
Authors:
Yuta Saito,
Takuma Udagawa,
Kei Tateno
Abstract:
Off-policy evaluation (OPE) is the method that attempts to estimate the performance of decision-making policies using historical data generated by different policies without conducting costly online A/B tests. Accurate OPE is essential in domains such as healthcare, marketing or recommender systems to avoid deploying poorly performing policies, as such policies may harm human lives or destroy the user experience. Thus, many OPE methods with theoretical backgrounds have been proposed. One emerging challenge with this trend is that a suitable estimator can be different for each application setting. It is often unknown to practitioners which estimator to use for their specific applications and purposes. To find a suitable estimator among many candidates, we use a data-driven estimator selection procedure for off-policy policy performance estimators as a practical solution. As proof of concept, we use our procedure to select the best estimator to evaluate coupon treatment policies on a real-world online content delivery service. In the experiment, we first observe that a suitable estimator might change with different definitions of the outcome variable, and thus accurate estimator selection is critical in real-world applications of OPE. Then, we demonstrate that, by utilizing the estimator selection procedure, we can easily find suitable estimators for each purpose.
Submitted 17 September, 2021;
originally announced September 2021.
-
Evaluating the Robustness of Off-Policy Evaluation
Authors:
Yuta Saito,
Takuma Udagawa,
Haruka Kiyohara,
Kazuki Mogi,
Yusuke Narita,
Kei Tateno
Abstract:
Off-policy Evaluation (OPE), or offline evaluation in general, evaluates the performance of hypothetical policies leveraging only offline log data. It is particularly useful in applications where the online interaction involves high-stakes and expensive settings such as precision medicine and recommender systems. Since many OPE estimators have been proposed and some of them have hyperparameters to be tuned, there is an emerging challenge for practitioners to select and tune OPE estimators for their specific application. Unfortunately, identifying a reliable estimator from results reported in research papers is often difficult because the current experimental procedure evaluates and compares the estimators' performance on a narrow set of hyperparameters and evaluation policies. Therefore, it is difficult to know which estimator is safe and reliable to use. In this work, we develop Interpretable Evaluation for Offline Evaluation (IEOE), an experimental procedure to evaluate OPE estimators' robustness to changes in hyperparameters and/or evaluation policies in an interpretable manner. Then, using the IEOE procedure, we perform extensive evaluation of a wide variety of existing estimators on Open Bandit Dataset, a large-scale public real-world dataset for OPE. We demonstrate that our procedure can evaluate the estimators' robustness to the hyperparameter choice, helping us avoid using unsafe estimators. Finally, we apply IEOE to real-world e-commerce platform data and demonstrate how to use our protocol in practice.
Submitted 31 August, 2021;
originally announced August 2021.
-
Maintaining Common Ground in Dynamic Environments
Authors:
Takuma Udagawa,
Akiko Aizawa
Abstract:
Common grounding is the process of creating and maintaining mutual understandings, which is a critical aspect of sophisticated human communication. While various task settings have been proposed in existing literature, they mostly focus on creating common ground under static context and ignore the aspect of maintaining it over time under dynamic context. In this work, we propose a novel task setting to study the ability of both creating and maintaining common ground in dynamic environments. Based on our minimal task formulation, we collected a large-scale dataset of 5,617 dialogues to enable fine-grained evaluation and analysis of various dialogue systems. Through our dataset analyses, we highlight novel challenges introduced in our setting, such as the usage of complex spatio-temporal expressions to create and maintain common ground. Finally, we conduct extensive experiments to assess the capabilities of our baseline dialogue system and discuss future prospects of our research.
Submitted 29 May, 2021;
originally announced May 2021.
-
A Linguistic Analysis of Visually Grounded Dialogues Based on Spatial Expressions
Authors:
Takuma Udagawa,
Takato Yamazaki,
Akiko Aizawa
Abstract:
Recent models achieve promising results in visually grounded dialogues. However, existing datasets often contain undesirable biases and lack sophisticated linguistic analyses, which make it difficult to understand how well current models recognize their precise linguistic structures. To address this problem, we make two design choices: first, we focus on OneCommon Corpus \citep{udagawa2019natural,udagawa2020annotated}, a simple yet challenging common grounding dataset which contains minimal bias by design. Second, we analyze their linguistic structures based on \textit{spatial expressions} and provide comprehensive and reliable annotation for 600 dialogues. We show that our annotation captures important linguistic structures including predicate-argument structure, modification and ellipsis. In our experiments, we assess the model's understanding of these structures through reference resolution. We demonstrate that our annotation can reveal both the strengths and weaknesses of baseline models in essential levels of detail. Overall, we propose a novel framework and resource for investigating fine-grained language understanding in visually grounded dialogues.
Submitted 6 October, 2020;
originally announced October 2020.
-
An Annotated Corpus of Reference Resolution for Interpreting Common Grounding
Authors:
Takuma Udagawa,
Akiko Aizawa
Abstract:
Common grounding is the process of creating, repairing and updating mutual understandings, which is a fundamental aspect of natural language conversation. However, interpreting the process of common grounding is a challenging task, especially under continuous and partially-observable context where complex ambiguity, uncertainty, partial understandings and misunderstandings are introduced. Interpretation becomes even more challenging when we deal with dialogue systems which still have limited capability of natural language understanding and generation. To address this problem, we consider reference resolution as the central subtask of common grounding and propose a new resource to study its intermediate process. Based on a simple and general annotation schema, we collected a total of 40,172 referring expressions in 5,191 dialogues curated from an existing corpus, along with multiple judgements of referent interpretations. We show that our annotation is highly reliable, captures the complexity of common grounding through a natural degree of reasonable disagreements, and allows for more detailed and quantitative analyses of common grounding strategies. Finally, we demonstrate the advantages of our annotation for interpreting, analyzing and improving common grounding in baseline dialogue systems.
Submitted 18 November, 2019;
originally announced November 2019.
-
A Natural Language Corpus of Common Grounding under Continuous and Partially-Observable Context
Authors:
Takuma Udagawa,
Akiko Aizawa
Abstract:
Common grounding is the process of creating, repairing and updating mutual understandings, which is a critical aspect of sophisticated human communication. However, traditional dialogue systems have limited capability of establishing common ground, and we also lack task formulations which introduce natural difficulty in terms of common grounding while enabling easy evaluation and analysis of complex models. In this paper, we propose a minimal dialogue task which requires advanced skills of common grounding under continuous and partially-observable context. Based on this task formulation, we collected a large-scale dataset of 6,760 dialogues which fulfills essential requirements of natural language corpora. Our analysis of the dataset revealed important phenomena related to common grounding that need to be considered. Finally, we evaluate and analyze baseline neural models on a simple subtask that requires recognition of the created common ground. We show that simple baseline models perform decently but leave room for further improvement. Overall, we show that our proposed task will be a fundamental testbed where we can train, evaluate, and analyze a dialogue system's ability for sophisticated common grounding.
Submitted 8 July, 2019;
originally announced July 2019.
-
Finite-size Gap, Magnetization, and Entanglement of Deformed Fredkin Spin Chain
Authors:
Takuma Udagawa,
Hosho Katsura
Abstract:
We investigate ground- and excited-state properties of the deformed Fredkin spin chain proposed by Salberger, Zhang, Klich, Korepin, and the authors. This model is a one-parameter deformation of the Fredkin spin chain, whose Hamiltonian is $3$-local and translationally invariant in the bulk. The model is frustration-free and its unique ground state can be expressed as a weighted superposition of colored Dyck paths. We focus on the case where the deformation parameter $t>1$. By using a variational method, we prove that the finite-size gap decays at least exponentially with increasing system size. We prove that the magnetization in the ground state is along the $z$-direction, namely $\langle s^x \rangle =\langle s^y \rangle=0$, and show that the $z$-component $\langle s^z \rangle$ exhibits a domain-wall structure. We then study the entanglement properties of the chain. In particular, we derive upper and lower bounds for the von Neumann and Rényi entropies, and entanglement spectrum for any bipartition of the chain.
Submitted 7 September, 2017; v1 submitted 2 January, 2017;
originally announced January 2017.
-
Deformed Fredkin Spin Chain with Extensive Entanglement
Authors:
Olof Salberger,
Takuma Udagawa,
Zhao Zhang,
Hosho Katsura,
Israel Klich,
Vladimir Korepin
Abstract:
We introduce a new spin chain which is a deformation of the Fredkin spin chain and has a phase transition between bounded and extensive entanglement entropy scaling. In this chain, spins have a local interaction of three nearest neighbors. The Hamiltonian is frustration-free and its ground state can be described analytically as a weighted superposition of Dyck paths. In the purely spin $1/2$ case, the entanglement entropy obeys an area law: it is bounded from above by a constant, when the size of the block $n$ increases (and $t>1$). When a local color degree of freedom is introduced the entanglement entropy increases linearly with the size of the block (and $t>1$). The entanglement entropy of half of the chain is tightly bounded by ${ n}\log s$ where $n$ is the size of the block, and $s$ is the number of colors. Our chain fosters a new example for a significant boost to entropy and for the existence of the associated critical rainbow phase where the entanglement entropy scales with volume that has recently been discovered in Zhang et al. (arXiv:1606.07795)
Submitted 15 November, 2016;
originally announced November 2016.
-
Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Section Data for the 9Be+28Si, 144Sm, and 208Pb Systems at Near-Coulomb-Barrier Energies using Double Folding Potential
Authors:
W. Y. So,
T. Udagawa,
K. S. Kim,
S. W. Hong,
B. T. Kim
Abstract:
Based on the extended optical model with the double folding potential, in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed of elastic scattering and fusion cross section data for the $^{9}$Be+$^{28}$Si, $^{144}$Sm, and $^{208}$Pb systems at near-Coulomb-barrier energies. We find that the real part of the resultant DR part of the polarization potential is systematically repulsive for all the targets considered, which is consistent with the results deduced from the Continuum Discretized Coupled Channel (CDCC) calculations taking into account the polarization effects due to breakup. Further, it is found that both DR and fusion parts of the extracted polarization potentials satisfy the dispersion relation.
Submitted 14 March, 2010; v1 submitted 9 July, 2008;
originally announced July 2008.
-
Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Section Data for the $^{12}$C+$^{208}$Pb System at Near-Coulomb-Barrier Energies by using a Folding Potential
Authors:
W. Y. So,
T. Udagawa,
S. W. Hong,
B. T. Kim
Abstract:
Simultaneous $χ^{2}$ analyses are performed for elastic scattering and fusion cross section data for the $^{12}$C+$^{208}$Pb system at near-Coulomb-barrier energies by using the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts. Use is made of the double folding potential as a bare potential. It is found that the experimental elastic scattering and fusion data are well reproduced without introducing any normalization factor for the double folding potential and also that both DR and fusion parts of the polarization potential determined from the $χ^{2}$ analyses satisfy separately the dispersion relation. Furthermore, it is shown that the imaginary parts of both DR and fusion potentials at the strong absorption radius change very rapidly, which results in a typical threshold anomaly in the total imaginary potential as observed with tightly bound projectiles such as $α$-particle and $^{16}$O.
Submitted 14 January, 2008;
originally announced January 2008.
-
Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Section Data for the 7Li+208Pb System at Near-Coulomb-Barrier Energies using the Folding Potential
Authors:
W. Y. So,
T. Udagawa,
K. S. Kim,
S. W. Hong,
B. T. Kim
Abstract:
Simultaneous $χ^{2}$ analyses previously made for elastic scattering and fusion cross section data for the $^{6}$Li+$^{208}$Pb system are extended to the $^{7}$Li+$^{208}$Pb system at near-Coulomb-barrier energies based on the extended optical model approach, in which the polarization potential is decomposed into direct reaction (DR) and fusion parts. Use is made of the double folding potential as a bare potential. It is found that the experimental elastic scattering and fusion data are well reproduced without introducing any normalization factor for the double folding potential and that both the DR and fusion parts of the polarization potential determined from the $χ^{2}$ analyses satisfy separately the dispersion relation. Further, we find that the real part of the fusion portion of the polarization potential is attractive while that of the DR part is repulsive except at energies far below the Coulomb barrier energy. A comparison is made of the present results with those obtained from the Continuum Discretized Coupled Channel (CDCC) calculations and a previous study based on the conventional optical model with a double folding potential. We also compare the present results for the $^7$Li+$^{208}$Pb system with the analysis previously made for the $^{6}$Li+$^{208}$Pb system.
Submitted 5 June, 2007;
originally announced June 2007.
-
Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Sections for 6Li + 208Pb System at Near-Coulomb-Barrier Energies by using Folding Potential
Authors:
W. Y. So,
T. Udagawa,
K. S. Kim,
S. W. Hong,
B. T. Kim
Abstract:
Based on the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed for elastic scattering and fusion cross section data for the $^{6}$Li+$^{208}$Pb system at near-Coulomb-barrier energies. A folding potential is used as the bare potential. It is found that the real part of the resultant DR part of the polarization potential is repulsive, which is consistent with the results from the Continuum Discretized Coupled Channel (CDCC) calculations and the normalization factors needed for the folding potentials. Further, it is found that both DR and fusion parts of the polarization potential satisfy separately the dispersion relation.
Submitted 13 December, 2006;
originally announced December 2006.
-
Extended Optical Model Analyses of Elastic Scattering, Direct Reaction, and Fusion Cross Sections for the 9Be + 208Pb System at Near-Coulomb-Barrier Energies
Authors:
W. Y. So,
S. W. Hong,
B. T. Kim,
T. Udagawa
Abstract:
Based on the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed for elastic scattering, DR, and fusion cross section data for the $^{9}$Be+$^{208}$Pb system at near-Coulomb-barrier energies. Similar $χ^{2}$ analyses are also performed by only taking into account the elastic scattering and fusion data as was previously done by the present authors, and the results are compared with those of the full analysis including the DR cross section data as well. We find that the analyses using only elastic scattering and fusion data can produce very consistent and reliable predictions of cross sections particularly when the DR cross section data are not complete. Discussions are also given on the results obtained from similar analyses made earlier for the $^{9}$Be+$^{209}$Bi system.
Submitted 27 September, 2005;
originally announced September 2005.
-
A Novel Method for the Solution of the Schroedinger Eq. in the Presence of Exchange Terms
Authors:
G. H. Rawitscher,
S. Y. Kang,
I. Koltracht,
E. Zerrad,
K. Zerrad,
B. T. Kim,
T. Udagawa
Abstract:
In the Hartree-Fock approximation the Pauli exclusion principle leads to a Schroedinger Eq. of an integro-differential form. We describe a new spectral noniterative method (S-IEM), previously developed for solving the Lippmann-Schwinger integral equation with local potentials, which has now been extended so as to include the exchange nonlocality. We apply it to the restricted case of electron-Hydrogen scattering in which the bound electron remains in the ground state and the incident electron has zero angular momentum, and we compare the accuracy and economy of the new method to three other methods. One is a non-iterative solution (NIEM) of the integral equation as described by Sams and Kouri in 1969. Another is an iterative method introduced by Kim and Udagawa in 1990 for nuclear physics applications, which makes an expansion of the solution into an especially favorable basis obtained by a method of moments. The third one is based on the Singular Value Decomposition of the exchange term followed by iterations over the remainder. The S-IEM method turns out to be more accurate by many orders of magnitude than any of the other three methods described above for the same number of mesh points.
Submitted 29 January, 2002;
originally announced January 2002.
-
Simultaneous Optical Model Analyses of Elastic Scattering, Breakup, and Fusion Cross Section Data for the $^{6}$He + $^{209}$Bi System at Near-Coulomb-Barrier Energies
Authors:
B. T. Kim,
W. Y. So,
S. W. Hong,
T. Udagawa
Abstract:
Based on an approach recently proposed by us, simultaneous $χ^{2}$-analyses are performed for elastic scattering, direct reaction (DR) and fusion cross section data for the $^{6}$He+$^{209}$Bi system at near-Coulomb-barrier energies to determine the parameters of the polarization potential consisting of DR and fusion parts. We show that the data are well reproduced by the resultant potential, which also satisfies the proper dispersion relation. A discussion is given of the nature of the threshold anomaly seen in the potential.
Submitted 22 November, 2001;
originally announced November 2001.
-
Semi-classical Characters and Optical Model Description of Heavy Ion Scattering, Direct Reactions, and Fusion at Near-barrier Energies
Authors:
B. T. Kim,
W. Y. So,
S. W. Hong,
T. Udagawa
Abstract:
An approach is proposed to calculate the direct reaction (DR) and fusion probabilities for heavy ion collisions at near-Coulomb-barrier energies as functions of the distance of closest approach D within the framework of the optical model that introduces two types of imaginary potentials, DR and fusion. The probabilities are calculated by using partial DR and fusion cross sections, together with the classical relations associated with the Coulomb trajectory. Such an approach makes it possible to analyze the data for angular distributions of the inclusive DR cross section, facilitating the determination of the radius parameters of the imaginary DR potential in a less ambiguous manner. Simultaneous $χ^{2}$-analyses are performed of relevant data for the $^{16}$O+$^{208}$Pb system near the Coulomb-barrier energy.
Submitted 2 November, 2001; v1 submitted 1 November, 2001;
originally announced November 2001.
-
Can Doubly Strange Dibaryon Resonances be Discovered at RHIC?
Authors:
S. D. Paganis,
G. W. Hoffmann,
R. L. Ray,
J. -L. Tang,
T. Udagawa,
R. S. Longacre
Abstract:
The baryon-baryon continuum invariant mass spectrum generated from relativistic nucleus + nucleus collision data may reveal the existence of doubly-strange dibaryons not stable against strong decay if they lie within a few MeV of threshold. Furthermore, since the dominant component of these states is a superposition of two color-octet clusters which can be produced intermediately in a color-deconfined quark-gluon plasma (QGP), an enhanced production of dibaryon resonances could be a signal of QGP formation. A total of eight, doubly-strange dibaryon states are considered for experimental search using the STAR detector (Solenoidal Tracker at RHIC) at the new Relativistic Heavy Ion Collider (RHIC). These states may decay to Lambda-Lambda and/or proton-Cascade-minus, depending on the resonance energy. STAR's large acceptance, precision tracking and vertex reconstruction capabilities, and large data volume capacity, make it an ideal instrument to use for such a search. Detector performance and analysis sensitivity are studied as a function of resonance production rate and width for one particular dibaryon which can directly strong decay to proton-Cascade-minus but not Lambda-Lambda. Results indicate that such resonances may be discovered using STAR if the resonance production rates are comparable to coalescence model predictions for dibaryon bound states.
Submitted 12 June, 2000; v1 submitted 8 October, 1999;
originally announced October 1999.
-
Can only flavor-nonsinglet H dibaryons be stable against strong decays?
Authors:
Stathes D. Paganis,
Takeshi Udagawa,
G. W. Hoffmann,
R. L. Ray
Abstract:
Using the QCD sum rule approach, we show that the flavor-nonsinglet $H$ dibaryon states with J$^π = 1^+$, J$^π = 0^+$, I=1 (27plet) are nearly degenerate with the J$^π = 0^+$, I=0 singlet $H_0$ dibaryon, which has been predicted to be stable against strong decay, but has not been observed. Our calculation, which does not require an instanton correction, suggests that the $H_0$ is slightly heavier than these flavor-nonsinglet $H$s over a wide range of the parameter space. If the singlet $H_0$ mass lies above the $ΛΛ$ threshold (2231~MeV), then the strong interaction breakup to $ΛΛ$ would produce a very broad resonance in the $ΛΛ$ invariant mass spectrum which would be very difficult to observe. On the other hand, if these flavor-nonsinglet J=0 and 1 $H$ dibaryons are also above the $ΛΛ$ threshold, but below the $Ξ^0n$ breakup threshold (2254 MeV), then because the direct, strong interaction decay to the $ΛΛ$ channel is forbidden, these flavor-nonsinglet states might be more amenable to experimental observation. The present results allow a possible reconciliation between the reported observation of $ΛΛ$ hypernuclei, which argue against a stable $H_0$, and the possible existence of $H$ dibaryons in general.
Submitted 19 June, 1997;
originally announced June 1997.
-
Damping mechanisms of the Delta resonance in nuclei
Authors:
B. Koerfgen,
P. Oltmanns,
F. Osterfeld,
T. Udagawa
Abstract:
The damping mechanisms of the Delta(1232) resonance in nuclei are studied by analyzing the quasi-free decay reactions 12C(pi+,pi+ p)11B and 12C(3He,t pi+ p)11B and the 2p emission reactions 12C(pi+,pp)10B and 12C(3He,t pp)10B. The coincidence cross sections are calculated within the framework of the isobar-hole model. It is found that the 2p emission process induced by the decay of the Delta resonance in the nucleus can be consistently described by a pi+rho+g' model for the Delta+N -> N+N decay interaction.
Submitted 17 December, 1996;
originally announced December 1996.