Search | arXiv e-print repository

INDUS: Effective and Efficient Language Models for Scientific Applications

Authors: Bishwaranjan Bhattacharjee, Aashka Trivedi, Masayasu Muraoka, Muthukumaran Ramasubramanian, Takuma Udagawa, Iksha Gurung, Rong Zhang, Bharath Dandala, Rahul Ramachandran, Manil Maskey, Kaylin Bugbee, Mike Little, Elizabeth Fancher, Lauren Sanders, Sylvain Costes, Sergi Blanco-Cuaresma, Kelly Lockhart, Thomas Allen, Felix Grezes, Megan Ansdell, Alberto Accomazzi, Yousef El-Kurdi, Davis Wertheimer, Birgit Pfitzmann, Cesar Berrospi Ramis , et al. (9 additional authors not shown)

Abstract: Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics,… ▽ More Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics, planetary sciences and astrophysics domains and trained using curated scientific corpora drawn from diverse data sources. The suite of models include: (1) an encoder model trained using domain-specific vocabulary and corpora to address natural language understanding tasks, (2) a contrastive-learning-based general text embedding model trained using a diverse set of datasets drawn from multiple sources to address information retrieval tasks and (3) smaller versions of these models created using knowledge distillation techniques to address applications which have latency or resource constraints. We also created three new scientific benchmark datasets namely, CLIMATE-CHANGE-NER (entity-recognition), NASA-QA (extractive QA) and NASA-IR (IR) to accelerate research in these multi-disciplinary fields. Finally, we show that our models outperform both general-purpose encoders (RoBERTa) and existing domain-specific encoders (SciBERT) on these new tasks as well as existing benchmark tasks in the domains of interest. △ Less

Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

arXiv:2311.02353 [pdf, ps, other]

Solutions of the tt*-equations constructed from the SU(2)$_k$-fusion ring, and Smyth potentials

Authors: Tadashi Udagawa

Abstract: Cecotti and Vafa introduced the tt*-equation (topological-antitopological fusion equation), whose solutions describe massive deformations of supersymmetric conformal field theories. We describe some solutions of the tt*-equation constructed from the SU(2)$_k$-fusion algebra. The idea of the construction is due to Cecotti and Vafa, but we give a precise mathematical formulation and a description of… ▽ More Cecotti and Vafa introduced the tt*-equation (topological-antitopological fusion equation), whose solutions describe massive deformations of supersymmetric conformal field theories. We describe some solutions of the tt*-equation constructed from the SU(2)$_k$-fusion algebra. The idea of the construction is due to Cecotti and Vafa, but we give a precise mathematical formulation and a description of the "holomorphic data" corresponding to the solutions by using the DPW method. Furthermore, we give a relation between the solutions and the representations of SU(2). As a special case, we consider the solutions corresponding to the supersymmetric A$_k$-minimal model. △ Less

Submitted 4 November, 2023; originally announced November 2023.

Comments: 21 pages

MSC Class: 53Z05; 17B80

arXiv:2310.13331 [pdf, ps, other]

Globality of the DPW construction for Smyth potentials in the case of SU(1,1)

Authors: Tadashi Udagawa

Abstract: We construct harmonic maps into SU(1,1)/U(1) satrting from Smyth potentials ξ, by the DPW method, In this method, harmonic maps are obtained from the Iwasawa factorization of a solution L of L^{-1} dL = ξ. However, the Iwasawa factorization in the case of a noncompact group is not always global. We show that L can be expressed in terms of Bessel functions and from the asymptotic expansion of Besse… ▽ More We construct harmonic maps into SU(1,1)/U(1) satrting from Smyth potentials ξ, by the DPW method, In this method, harmonic maps are obtained from the Iwasawa factorization of a solution L of L^{-1} dL = ξ. However, the Iwasawa factorization in the case of a noncompact group is not always global. We show that L can be expressed in terms of Bessel functions and from the asymptotic expansion of Bessel functions we solve a Riemann-Hilbert problem to give a global Iwasawa factorization. In this way we give a more direct proof than in the work of Dorfmeister-Guest-Rossman (2010), while avoiding the general isomonodromy theory used by Guest-Its-Lin (2015). △ Less

Submitted 20 October, 2023; originally announced October 2023.

Comments: 24 pages

MSC Class: 53A10; 35Q15

arXiv:2310.08797 [pdf, other]

A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models

Authors: Takuma Udagawa, Aashka Trivedi, Michele Merler, Bishwaranjan Bhattacharjee

Abstract: Large language models have become a vital component in modern NLP, achieving state of the art performance in a variety of tasks. However, they are often inefficient for real-world deployment due to their expensive inference costs. Knowledge distillation is a promising technique to improve their efficiency while retaining most of their effectiveness. In this paper, we reproduce, compare and analyze… ▽ More Large language models have become a vital component in modern NLP, achieving state of the art performance in a variety of tasks. However, they are often inefficient for real-world deployment due to their expensive inference costs. Knowledge distillation is a promising technique to improve their efficiency while retaining most of their effectiveness. In this paper, we reproduce, compare and analyze several representative methods for task-agnostic (general-purpose) distillation of Transformer language models. Our target of study includes Output Distribution (OD) transfer, Hidden State (HS) transfer with various layer map** strategies, and Multi-Head Attention (MHA) transfer based on MiniLMv2. Through our extensive experiments, we study the effectiveness of each method for various student architectures in both monolingual (English) and multilingual settings. Overall, we show that MHA transfer based on MiniLMv2 is generally the best option for distillation and explain the potential reasons behind its success. Moreover, we show that HS transfer remains as a competitive baseline, especially under a sophisticated layer map** strategy, while OD transfer consistently lags behind other approaches. Findings from this study helped us deploy efficient yet effective student models for latency-critical applications. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: Accepted to EMNLP 2023 Industry Track

arXiv:2309.04031 [pdf, other]

Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems

Authors: Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon

Abstract: Transferring the knowledge of large language models (LLMs) is a promising technique to incorporate linguistic knowledge into end-to-end automatic speech recognition (ASR) systems. However, existing works only transfer a single representation of LLM (e.g. the last layer of pretrained BERT), while the representation of a text is inherently non-unique and can be obtained variously from different laye… ▽ More Transferring the knowledge of large language models (LLMs) is a promising technique to incorporate linguistic knowledge into end-to-end automatic speech recognition (ASR) systems. However, existing works only transfer a single representation of LLM (e.g. the last layer of pretrained BERT), while the representation of a text is inherently non-unique and can be obtained variously from different layers, contexts and models. In this work, we explore a wide range of techniques to obtain and transfer multiple representations of LLMs into a transducer-based ASR system. While being conceptually simple, we show that transferring multiple representations of LLMs can be an effective alternative to transferring only a single representation. △ Less

Submitted 25 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: Accepted to ICASSP 2024

arXiv:2303.09639 [pdf, other]

Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models

Authors: Aashka Trivedi, Takuma Udagawa, Michele Merler, Rameswar Panda, Yousef El-Kurdi, Bishwaranjan Bhattacharjee

Abstract: Large pretrained language models have achieved state-of-the-art results on a variety of downstream tasks. Knowledge Distillation (KD) into a smaller student model addresses their inefficiency, allowing for deployment in resource-constrained environments. However, KD can be ineffective when the student is manually selected from a set of existing options, since it can be a sub-optimal choice within… ▽ More Large pretrained language models have achieved state-of-the-art results on a variety of downstream tasks. Knowledge Distillation (KD) into a smaller student model addresses their inefficiency, allowing for deployment in resource-constrained environments. However, KD can be ineffective when the student is manually selected from a set of existing options, since it can be a sub-optimal choice within the space of all possible student architectures. We develop multilingual KD-NAS, the use of Neural Architecture Search (NAS) guided by KD to find the optimal student architecture for task agnostic distillation from a multilingual teacher. In each episode of the search process, a NAS controller predicts a reward based on the distillation loss and latency of inference. The top candidate architectures are then distilled from the teacher on a small proxy set. Finally the architecture(s) with the highest reward is selected, and distilled on the full training corpus. KD-NAS can automatically trade off efficiency and effectiveness, and recommends architectures suitable to various latency budgets. Using our multi-layer hidden state distillation process, our KD-NAS student model achieves a 7x speedup on CPU inference (2x on GPU) compared to a XLM-Roberta Base Teacher, while maintaining 90% performance, and has been deployed in 3 software offerings requiring large throughput, low latency and deployment on CPU. △ Less

Submitted 13 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

Comments: 11 pages, 5 figures

arXiv:2301.13352 [pdf, other]

Sentence Identification with BOS and EOS Label Combinations

Authors: Takuma Udagawa, Hiroshi Kanayama, Issei Yoshida

Abstract: The sentence is a fundamental unit in many NLP applications. Sentence segmentation is widely used as the first preprocessing task, where an input text is split into consecutive sentences considering the end of the sentence (EOS) as their boundaries. This task formulation relies on a strong assumption that the input text consists only of sentences, or what we call the sentential units (SUs). Howeve… ▽ More The sentence is a fundamental unit in many NLP applications. Sentence segmentation is widely used as the first preprocessing task, where an input text is split into consecutive sentences considering the end of the sentence (EOS) as their boundaries. This task formulation relies on a strong assumption that the input text consists only of sentences, or what we call the sentential units (SUs). However, real-world texts often contain non-sentential units (NSUs) such as metadata, sentence fragments, nonlinguistic markers, etc. which are unreasonable or undesirable to be treated as a part of an SU. To tackle this issue, we formulate a novel task of sentence identification, where the goal is to identify SUs while excluding NSUs in a given text. To conduct sentence identification, we propose a simple yet effective method which combines the beginning of the sentence (BOS) and EOS labels to determine the most probable SUs and NSUs based on dynamic programming. To evaluate this task, we design an automatic, language-independent procedure to convert the Universal Dependencies corpora into sentence identification benchmarks. Finally, our experiments on the sentence identification task demonstrate that our proposed method generally outperforms sentence segmentation baselines which only utilize EOS labels. △ Less

Submitted 30 January, 2023; originally announced January 2023.

Comments: Accepted to EACL 2023 (Findings)

arXiv:2211.13904 [pdf, other]

Policy-Adaptive Estimator Selection for Off-Policy Evaluation

Authors: Takuma Udagawa, Haruka Kiyohara, Yusuke Narita, Yuta Saito, Kei Tateno

Abstract: Off-policy evaluation (OPE) aims to accurately evaluate the performance of counterfactual policies using only offline logged data. Although many estimators have been developed, there is no single estimator that dominates the others, because the estimators' accuracy can vary greatly depending on a given OPE task such as the evaluation policy, number of actions, and noise level. Thus, the data-drive… ▽ More Off-policy evaluation (OPE) aims to accurately evaluate the performance of counterfactual policies using only offline logged data. Although many estimators have been developed, there is no single estimator that dominates the others, because the estimators' accuracy can vary greatly depending on a given OPE task such as the evaluation policy, number of actions, and noise level. Thus, the data-driven estimator selection problem is becoming increasingly important and can have a significant impact on the accuracy of OPE. However, identifying the most accurate estimator using only the logged data is quite challenging because the ground-truth estimation accuracy of estimators is generally unavailable. This paper studies this challenging problem of estimator selection for OPE for the first time. In particular, we enable an estimator selection that is adaptive to a given OPE task, by appropriately subsampling available logged data and constructing pseudo policies useful for the underlying estimator selection task. Comprehensive experiments on both synthetic and real-world company data demonstrate that the proposed procedure substantially improves the estimator selection compared to a non-adaptive heuristic. △ Less

Submitted 29 January, 2023; v1 submitted 25 November, 2022; originally announced November 2022.

Comments: accepted at AAAI'23

arXiv:2204.00212 [pdf, ps, other]

Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems

Authors: Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Nobuyasu Itoh, George Saon

Abstract: Large-scale language models (LLMs) such as GPT-2, BERT and RoBERTa have been successfully applied to ASR N-best rescoring. However, whether or how they can benefit competitive, near state-of-the-art ASR systems remains unexplored. In this study, we incorporate LLM rescoring into one of the most competitive ASR baselines: the Conformer-Transducer model. We demonstrate that consistent improvement is… ▽ More Large-scale language models (LLMs) such as GPT-2, BERT and RoBERTa have been successfully applied to ASR N-best rescoring. However, whether or how they can benefit competitive, near state-of-the-art ASR systems remains unexplored. In this study, we incorporate LLM rescoring into one of the most competitive ASR baselines: the Conformer-Transducer model. We demonstrate that consistent improvement is achieved by the LLM's bidirectionality, pretraining, in-domain finetuning and context augmentation. Furthermore, our lexical analysis sheds light on how each of these components may be contributing to the ASR performance. △ Less

Submitted 18 August, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

Comments: Accepted to Interspeech 2022

arXiv:2109.08621 [pdf, ps, other]

Data-Driven Off-Policy Estimator Selection: An Application in User Marketing on An Online Content Delivery Service

Authors: Yuta Saito, Takuma Udagawa, Kei Tateno

Abstract: Off-policy evaluation (OPE) is the method that attempts to estimate the performance of decision making policies using historical data generated by different policies without conducting costly online A/B tests. Accurate OPE is essential in domains such as healthcare, marketing or recommender systems to avoid deploying poor performing policies, as such policies may hart human lives or destroy the us… ▽ More Off-policy evaluation (OPE) is the method that attempts to estimate the performance of decision making policies using historical data generated by different policies without conducting costly online A/B tests. Accurate OPE is essential in domains such as healthcare, marketing or recommender systems to avoid deploying poor performing policies, as such policies may hart human lives or destroy the user experience. Thus, many OPE methods with theoretical backgrounds have been proposed. One emerging challenge with this trend is that a suitable estimator can be different for each application setting. It is often unknown for practitioners which estimator to use for their specific applications and purposes. To find out a suitable estimator among many candidates, we use a data-driven estimator selection procedure for off-policy policy performance estimators as a practical solution. As proof of concept, we use our procedure to select the best estimator to evaluate coupon treatment policies on a real-world online content delivery service. In the experiment, we first observe that a suitable estimator might change with different definitions of the outcome variable, and thus the accurate estimator selection is critical in real-world applications of OPE. Then, we demonstrate that, by utilizing the estimator selection procedure, we can easily find out suitable estimators for each purpose. △ Less

Submitted 17 September, 2021; originally announced September 2021.

Comments: presented at REVEAL workshop, RecSys2020

arXiv:2108.13703 [pdf, other]

doi 10.1145/3460231.3474245

Evaluating the Robustness of Off-Policy Evaluation

Authors: Yuta Saito, Takuma Udagawa, Haruka Kiyohara, Kazuki Mogi, Yusuke Narita, Kei Tateno

Abstract: Off-policy Evaluation (OPE), or offline evaluation in general, evaluates the performance of hypothetical policies leveraging only offline log data. It is particularly useful in applications where the online interaction involves high stakes and expensive setting such as precision medicine and recommender systems. Since many OPE estimators have been proposed and some of them have hyperparameters to… ▽ More Off-policy Evaluation (OPE), or offline evaluation in general, evaluates the performance of hypothetical policies leveraging only offline log data. It is particularly useful in applications where the online interaction involves high stakes and expensive setting such as precision medicine and recommender systems. Since many OPE estimators have been proposed and some of them have hyperparameters to be tuned, there is an emerging challenge for practitioners to select and tune OPE estimators for their specific application. Unfortunately, identifying a reliable estimator from results reported in research papers is often difficult because the current experimental procedure evaluates and compares the estimators' performance on a narrow set of hyperparameters and evaluation policies. Therefore, it is difficult to know which estimator is safe and reliable to use. In this work, we develop Interpretable Evaluation for Offline Evaluation (IEOE), an experimental procedure to evaluate OPE estimators' robustness to changes in hyperparameters and/or evaluation policies in an interpretable manner. Then, using the IEOE procedure, we perform extensive evaluation of a wide variety of existing estimators on Open Bandit Dataset, a large-scale public real-world dataset for OPE. We demonstrate that our procedure can evaluate the estimators' robustness to the hyperparamter choice, hel** us avoid using unsafe estimators. Finally, we apply IEOE to real-world e-commerce platform data and demonstrate how to use our protocol in practice. △ Less

Submitted 31 August, 2021; originally announced August 2021.

Comments: Accepted at RecSys2021

arXiv:2105.14207 [pdf, other]

Maintaining Common Ground in Dynamic Environments

Authors: Takuma Udagawa, Akiko Aizawa

Abstract: Common grounding is the process of creating and maintaining mutual understandings, which is a critical aspect of sophisticated human communication. While various task settings have been proposed in existing literature, they mostly focus on creating common ground under static context and ignore the aspect of maintaining them overtime under dynamic context. In this work, we propose a novel task sett… ▽ More Common grounding is the process of creating and maintaining mutual understandings, which is a critical aspect of sophisticated human communication. While various task settings have been proposed in existing literature, they mostly focus on creating common ground under static context and ignore the aspect of maintaining them overtime under dynamic context. In this work, we propose a novel task setting to study the ability of both creating and maintaining common ground in dynamic environments. Based on our minimal task formulation, we collected a large-scale dataset of 5,617 dialogues to enable fine-grained evaluation and analysis of various dialogue systems. Through our dataset analyses, we highlight novel challenges introduced in our setting, such as the usage of complex spatio-temporal expressions to create and maintain common ground. Finally, we conduct extensive experiments to assess the capabilities of our baseline dialogue system and discuss future prospects of our research. △ Less

Submitted 29 May, 2021; originally announced May 2021.

Comments: Accepted at TACL; pre-MIT Press publication version

arXiv:2010.03127 [pdf, other]

A Linguistic Analysis of Visually Grounded Dialogues Based on Spatial Expressions

Authors: Takuma Udagawa, Takato Yamazaki, Akiko Aizawa

Abstract: Recent models achieve promising results in visually grounded dialogues. However, existing datasets often contain undesirable biases and lack sophisticated linguistic analyses, which make it difficult to understand how well current models recognize their precise linguistic structures. To address this problem, we make two design choices: first, we focus on OneCommon Corpus \citep{udagawa2019natural,… ▽ More Recent models achieve promising results in visually grounded dialogues. However, existing datasets often contain undesirable biases and lack sophisticated linguistic analyses, which make it difficult to understand how well current models recognize their precise linguistic structures. To address this problem, we make two design choices: first, we focus on OneCommon Corpus \citep{udagawa2019natural,udagawa2020annotated}, a simple yet challenging common grounding dataset which contains minimal bias by design. Second, we analyze their linguistic structures based on \textit{spatial expressions} and provide comprehensive and reliable annotation for 600 dialogues. We show that our annotation captures important linguistic structures including predicate-argument structure, modification and ellipsis. In our experiments, we assess the model's understanding of these structures through reference resolution. We demonstrate that our annotation can reveal both the strengths and weaknesses of baseline models in essential levels of detail. Overall, we propose a novel framework and resource for investigating fine-grained language understanding in visually grounded dialogues. △ Less

Submitted 6 October, 2020; originally announced October 2020.

Comments: 16 pages, Findings of EMNLP 2020

arXiv:1911.07588 [pdf, other]

An Annotated Corpus of Reference Resolution for Interpreting Common Grounding

Authors: Takuma Udagawa, Akiko Aizawa

Abstract: Common grounding is the process of creating, repairing and updating mutual understandings, which is a fundamental aspect of natural language conversation. However, interpreting the process of common grounding is a challenging task, especially under continuous and partially-observable context where complex ambiguity, uncertainty, partial understandings and misunderstandings are introduced. Interpre… ▽ More Common grounding is the process of creating, repairing and updating mutual understandings, which is a fundamental aspect of natural language conversation. However, interpreting the process of common grounding is a challenging task, especially under continuous and partially-observable context where complex ambiguity, uncertainty, partial understandings and misunderstandings are introduced. Interpretation becomes even more challenging when we deal with dialogue systems which still have limited capability of natural language understanding and generation. To address this problem, we consider reference resolution as the central subtask of common grounding and propose a new resource to study its intermediate process. Based on a simple and general annotation schema, we collected a total of 40,172 referring expressions in 5,191 dialogues curated from an existing corpus, along with multiple judgements of referent interpretations. We show that our annotation is highly reliable, captures the complexity of common grounding through a natural degree of reasonable disagreements, and allows for more detailed and quantitative analyses of common grounding strategies. Finally, we demonstrate the advantages of our annotation for interpreting, analyzing and improving common grounding in baseline dialogue systems. △ Less

Submitted 18 November, 2019; originally announced November 2019.

Comments: 9 pages, 7 figures, 6 tables, Accepted by AAAI 2020

arXiv:1907.03399 [pdf, other]

A Natural Language Corpus of Common Grounding under Continuous and Partially-Observable Context

Authors: Takuma Udagawa, Akiko Aizawa

Abstract: Common grounding is the process of creating, repairing and updating mutual understandings, which is a critical aspect of sophisticated human communication. However, traditional dialogue systems have limited capability of establishing common ground, and we also lack task formulations which introduce natural difficulty in terms of common grounding while enabling easy evaluation and analysis of compl… ▽ More Common grounding is the process of creating, repairing and updating mutual understandings, which is a critical aspect of sophisticated human communication. However, traditional dialogue systems have limited capability of establishing common ground, and we also lack task formulations which introduce natural difficulty in terms of common grounding while enabling easy evaluation and analysis of complex models. In this paper, we propose a minimal dialogue task which requires advanced skills of common grounding under continuous and partially-observable context. Based on this task formulation, we collected a largescale dataset of 6,760 dialogues which fulfills essential requirements of natural language corpora. Our analysis of the dataset revealed important phenomena related to common grounding that need to be considered. Finally, we evaluate and analyze baseline neural models on a simple subtask that requires recognition of the created common ground. We show that simple baseline models perform decently but leave room for further improvement. Overall, we show that our proposed task will be a fundamental testbed where we can train, evaluate, and analyze dialogue system's ability for sophisticated common grounding. △ Less

Submitted 8 July, 2019; originally announced July 2019.

Comments: AAAI 2019

arXiv:1701.00346 [pdf, ps, other]

doi 10.1088/1751-8121/aa85b5

Finite-size Gap, Magnetization, and Entanglement of Deformed Fredkin Spin Chain

Authors: Takuma Udagawa, Hosho Katsura

Abstract: We investigate ground- and excited-state properties of the deformed Fredkin spin chain proposed by Salberger, Zhang, Klich, Korepin, and the authors. This model is a one-parameter deformation of the Fredkin spin chain, whose Hamiltonian is $3$-local and translationally invariant in the bulk. The model is frustration-free and its unique ground state can be expressed as a weighted superposition of c… ▽ More We investigate ground- and excited-state properties of the deformed Fredkin spin chain proposed by Salberger, Zhang, Klich, Korepin, and the authors. This model is a one-parameter deformation of the Fredkin spin chain, whose Hamiltonian is $3$-local and translationally invariant in the bulk. The model is frustration-free and its unique ground state can be expressed as a weighted superposition of colored Dyck paths. We focus on the case where the deformation parameter $t>1$. By using a variational method, we prove that the finite-size gap decays at least exponentially with increasing the system size. We prove that the magnetization in the ground state is along the $z$-direction, namely $\langle s^x \rangle =\langle s^y \rangle=0$, and show that the $z$-component $\langle s^z \rangle$ exhibits a domain-wall structure. We then study the entanglement properties of the chain. In particular, we derive upper and lower bounds for the von Neumann and Rényi entropies, and entanglement spectrum for any bipartition of the chain. △ Less

Submitted 7 September, 2017; v1 submitted 2 January, 2017; originally announced January 2017.

Comments: 16 pages, 5 figures. v2: Sec. 5.3 has been modified, references added v3: Sec. 4.2 has been modified

Journal ref: J. Phys. A: Math. Theor. 50 (2017) 405002

arXiv:1611.04983 [pdf, other]

doi 10.1088/1742-5468/aa6b1f

Deformed Fredkin Spin Chain with Extensive Entanglement

Authors: Olof Salberger, Takuma Udagawa, Zhao Zhang, Hosho Katsura, Israel Klich, Vladimir Korepin

Abstract: We introduce a new spin chain which is a deformation of the Fredkin spin chain and has a phase transition between bounded and extensive entanglement entropy scaling. In this chain, spins have a local interaction of three nearest neighbors. The Hamiltonian is frustration-free and its ground state can be described analytically as a weighted superposition of Dyck paths. In the purely spin $1/2$ case,… ▽ More We introduce a new spin chain which is a deformation of the Fredkin spin chain and has a phase transition between bounded and extensive entanglement entropy scaling. In this chain, spins have a local interaction of three nearest neighbors. The Hamiltonian is frustration-free and its ground state can be described analytically as a weighted superposition of Dyck paths. In the purely spin $1/2$ case, the entanglement entropy obeys an area law: it is bounded from above by a constant, when the size of the block $n$ increases (and $t>1$). When a local color degree of freedom is introduced the entanglement entropy increases linearly with the size of the block (and $t>1$). The entanglement entropy of half of the chain is tightly bounded by ${ n}\log s$ where $n$ is the size of the block, and $s$ is the number of colors. Our chain fosters a new example for a significant boost to entropy and for the existence of the associated critical rainbow phase where the entanglement entropy scales with volume that has recently been discovered in Zhang et al. (arXiv:1606.07795) △ Less

Submitted 15 November, 2016; originally announced November 2016.

Journal ref: J. Stat. Mech. (2017) 063103

arXiv:0807.1464

Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Section Data for the 9Be+28Si, 144Sm, and 208Pb Systems at Near-Coulomb-Barrier Energies using Double Folding Potential

Authors: W. Y. So, T. Udagawa, K. S. Kim, S. W. Hong, B. T. Kim

Abstract: Based on the extended optical model with the double folding potential, in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed of elastic scattering and fusion cross section data for the $^{9}$Be+$^{28}$Si, $^{144}$Sm, and $^{208}$Pb systems at near-Coulomb-barrier energies. We find that the real part of the result… ▽ More Based on the extended optical model with the double folding potential, in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed of elastic scattering and fusion cross section data for the $^{9}$Be+$^{28}$Si, $^{144}$Sm, and $^{208}$Pb systems at near-Coulomb-barrier energies. We find that the real part of the resultant DR part of the polarization potential is systematically repulsive for all the targets considered, which is consistent with the results deduced from the Continuum Discretized Coupled Channel (CDCC) calculations taking into account the polarization effects due to breakup. Further, it is found that both DR and fusion parts of the extracted polarization potentials satisfy the dispersion relation. △ Less

Submitted 14 March, 2010; v1 submitted 9 July, 2008; originally announced July 2008.

Comments: 28 pages, 6 figures, submitted to Physical Review C

arXiv:0801.2200 [pdf, ps, other]

doi 10.1103/PhysRevC.77.024609

Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Section Data for the $^{12}$C+$^{208}$Pb System at Near-Coulomb-Barrier Energies by using a Folding Potential

Authors: W. Y. So, T. Udagawa, S. W. Hong, B. T. Kim

Abstract: Simultaneous $χ^{2}$ analyses are performed for elastic scattering and fusion cross section data for the $^{12}$C+$^{208}$Pb system at near-Coulomb-barrier energies by using the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts. Use is made of the double folding potential as a bare potential. It is found that the experime… ▽ More Simultaneous $χ^{2}$ analyses are performed for elastic scattering and fusion cross section data for the $^{12}$C+$^{208}$Pb system at near-Coulomb-barrier energies by using the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts. Use is made of the double folding potential as a bare potential. It is found that the experimental elastic scattering and fusion data are well reproduced without introducing any normalization factor for the double folding potential and also that both DR and fusion parts of the polarization potential determined from the $χ^{2}$ analyses satisfy separately the dispersion relation. Furthermore, it is shown that the imaginary parts of both DR and fusion potentials at the strong absorption radius change very rapidly, which results in a typical threshold anomaly in the total imaginary potential as observed with tightly bound projectiles such as $α$-particle and $^{16}$O. △ Less

Submitted 14 January, 2008; originally announced January 2008.

Comments: 26 pages, 7 figures, submitted to Physical Review C

Journal ref: Phys.Rev.C77:024609,2008

arXiv:0706.0586 [pdf, ps, other]

doi 10.1103/PhysRevC.76.024613

Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Section Data for the 7Li+208Pb System at Near-Coulomb-Barrier Energies using the Folding Potential

Authors: W. Y. So, T. Udagawa, K. S. Kim, S. W. Hong, B. T. Kim

Abstract: Simultaneous $χ^{2}$ analyses previously made for elastic scattering and fusion cross section data for the $^{6}$Li+$^{208}$Pb system is extended to the $^{7}$Li+$^{208}$Pb system at near-Coulomb-barrier energies based on the extended optical model approach, in which the polarization potential is decomposed into direct reaction (DR) and fusion parts. Use is made of the double folding potential a… ▽ More Simultaneous $χ^{2}$ analyses previously made for elastic scattering and fusion cross section data for the $^{6}$Li+$^{208}$Pb system is extended to the $^{7}$Li+$^{208}$Pb system at near-Coulomb-barrier energies based on the extended optical model approach, in which the polarization potential is decomposed into direct reaction (DR) and fusion parts. Use is made of the double folding potential as a bare potential. It is found that the experimental elastic scattering and fusion data are well reproduced without introducing any normalization factor for the double folding potential and that both the DR and fusion parts of the polarization potential determined from the $χ^{2}$ analyses satisfy separately the dispersion relation. Further, we find that the real part of the fusion portion of the polarization potential is attractive while that of the DR part is repulsive except at energies far below the Coulomb barrier energy. A comparison is made of the present results with those obtained from the Continuum Discretized Coupled Channel (CDCC) calculations and a previous study based on the conventional optical model with a double folding potential. We also compare the present results for the $^7$Li+$^{208}$Pb system with the analysis previously made for the $^{6}$Li+$^{208}$Pb system. △ Less

Submitted 5 June, 2007; originally announced June 2007.

Comments: 7 figures, submitted to PRC

Journal ref: Phys.Rev.C76:024613,2007

arXiv:nucl-th/0612057 [pdf, ps, other]

doi 10.1103/PhysRevC.75.024610

Extended Optical Model Analyses of Elastic Scattering and Fusion Cross Sections for 6Li + 208Pb System at Near-Coulomb-Barrier Energies by using Folding Potential

Authors: W. Y. So, T. Udagawa, K. S. Kim, S. W. Hong, B. T. Kim

Abstract: Based on the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed for elastic scattering and fusion cross section data for the $^{6}$Li+$^{208}$Pb system at near-Coulomb-barrier energies. A folding potential is used as the bare potential. It is found that the real part of the resu… ▽ More Based on the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed for elastic scattering and fusion cross section data for the $^{6}$Li+$^{208}$Pb system at near-Coulomb-barrier energies. A folding potential is used as the bare potential. It is found that the real part of the resultant DR part of the polarization potential is repulsive, which is consistent with the results from the Continuum Discretized Coupled Channel (CDCC) calculations and the normalization factors needed for the folding potentials. Further, it is found that both DR and fusion parts of the polarization potential satisfy separately the dispersion relation. △ Less

Submitted 13 December, 2006; originally announced December 2006.

Comments: 6 figures

Journal ref: Phys.Rev. C75 (2007) 024610

arXiv:nucl-th/0509083 [pdf, ps, other]

doi 10.1103/PhysRevC.72.064602

Extended Optical Model Analyses of Elastic Scattering, Direct Reaction, and Fusion Cross Sections for the 9Be + 208Pb System at Near-Coulomb-Barrier Energies

Authors: W. Y. So, S. W. Hong, B. T. Kim, T. Udagawa

Abstract: Based on the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed for elastic scattering, DR, and fusion cross section data for the $^{9}$Be+$^{208}$Pb system at near-Coulomb-barrier energies. Similar $χ^{2}$ analyses are also performed by only taking into account the elastic scat… ▽ More Based on the extended optical model approach in which the polarization potential is decomposed into direct reaction (DR) and fusion parts, simultaneous $χ^{2}$ analyses are performed for elastic scattering, DR, and fusion cross section data for the $^{9}$Be+$^{208}$Pb system at near-Coulomb-barrier energies. Similar $χ^{2}$ analyses are also performed by only taking into account the elastic scattering and fusion data as was previously done by the present authors, and the results are compared with those of the full analysis including the DR cross section data as well. We find that the analyses using only elastic scattering and fusion data can produce very consistent and reliable predictions of cross sections particularly when the DR cross section data are not complete. Discussions are also given on the results obtained from similar analyses made earlier for the $^{9}$Be+$^{209}$Bi system. △ Less

Submitted 27 September, 2005; originally announced September 2005.

Comments: 5 figures

Journal ref: Phys.Rev. C72 (2005) 064602

arXiv:physics/0201066 [pdf, ps, other]

doi 10.1063/1.1567254

A Novel Method for the Solution of the Schroedinger Eq. in the Presence of Exchange Terms

Authors: G. H. Rawitscher, S. Y. Kang, I. Koltracht, E. Zerrad, K. Zerrad, B. T. Kim, T. Udagawa

Abstract: In the Hartree-Fock approximation the Pauli exclusion principle leads to a Schroedinger Eq. of an integro-differential form. We describe a new spectral noniterative method (S-IEM), previously developed for solving the Lippman-Schwinger integral equation with local potentials, which has now been extended so as to include the exchange nonlocality. We apply it to the restricted case of electron-Hyd… ▽ More In the Hartree-Fock approximation the Pauli exclusion principle leads to a Schroedinger Eq. of an integro-differential form. We describe a new spectral noniterative method (S-IEM), previously developed for solving the Lippman-Schwinger integral equation with local potentials, which has now been extended so as to include the exchange nonlocality. We apply it to the restricted case of electron-Hydrogen scattering in which the bound electron remains in the ground state and the incident electron has zero angular momentum, and we compare the acuracy and economy of the new method to three other methods. One is a non-iterative solution (NIEM) of the integral equation as described by Sams and Kouri in 1969. Another is an iterative method introduced by Kim and Udagawa in 1990 for nuclear physics applications, which makes an expansion of the solution into an especially favorable basis obtained by a method of moments. The third one is based on the Singular Value Decomposition of the exchange term followed by iterations over the remainder. The S-IEM method turns out to be more accurate by many orders of magnitude than any of the other three methods described above for the same number of mesh points. △ Less

Submitted 29 January, 2002; originally announced January 2002.

Comments: 29 pages, 4 figures, submitted to Phys. Rev. A

arXiv:nucl-th/0111061 [pdf, ps, other]

doi 10.1103/PhysRevC.65.044616

Simultaneous Optical Model Analyses of Elastic Scattering, Breakup, and Fusion Cross Section Data for the $^{6}$He + $^{209}$Bi System at Near-Coulomb-Barrier Energies

Authors: B. T. Kim, W. Y. So, S. W. Hong, T. Udagawa

Abstract: Based on an approach recently proposed by us, simultaneous $χ^{2}$-analyses are performed for elastic scattering, direct reaction (DR) and fusion cross sections data for the $^{6}$He+$^{209}$Bi system at near-Coulomb-barrier energies to determine the parameters of the polarization potential consisting of DR and fusion parts. We show that the data are well reproduced by the resultant potential, w… ▽ More Based on an approach recently proposed by us, simultaneous $χ^{2}$-analyses are performed for elastic scattering, direct reaction (DR) and fusion cross sections data for the $^{6}$He+$^{209}$Bi system at near-Coulomb-barrier energies to determine the parameters of the polarization potential consisting of DR and fusion parts. We show that the data are well reproduced by the resultant potential, which also satisfies the proper dispersion relation. A discussion is given of the nature of the threshold anomaly seen in the potential. △ Less

Submitted 22 November, 2001; originally announced November 2001.

Journal ref: Phys.Rev. C65 (2002) 044616

arXiv:nucl-th/0111002 [pdf, ps, other]

doi 10.1103/PhysRevC.65.044607

Semi-classical Characters and Optical Model Description of Heavy Ion Scattering, Direct Reactions, and Fusion at Near-barrier Energies

Authors: B. T. Kim, W. Y. So, S. W. Hong, T. Udagawa

Abstract: An approach is proposed to calculate the direct reaction (DR) and fusion probabilities for heavy ion collisions at near-Coulomb-barrier energies as functions of the distance of closest approach D within the framework of the optical model that introduces two types of imaginary potentials, DR and fusion. The probabilities are calculated by using partial DR and fusion cross sections, together with… ▽ More An approach is proposed to calculate the direct reaction (DR) and fusion probabilities for heavy ion collisions at near-Coulomb-barrier energies as functions of the distance of closest approach D within the framework of the optical model that introduces two types of imaginary potentials, DR and fusion. The probabilities are calculated by using partial DR and fusion cross sections, together with the classical relations associated with the Coulomb trajectory. Such an approach makes it possible to analyze the data for angular distributions of the inclusive DR cross section, facilitating the determination of the radius parameters of the imaginary DR potential in a less ambiguous manner. Simultaneous $χ^{2}$-analyses are performed of relevant data for the $^{16}$O+$^{208}$Pb system near the Coulomb-barrier energy. △ Less

Submitted 2 November, 2001; v1 submitted 1 November, 2001; originally announced November 2001.

Journal ref: Phys.Rev. C65 (2002) 044607

arXiv:nucl-ex/9910007 [pdf, ps, other]

doi 10.1103/PhysRevC.62.024906

Can Doubly Strange Dibaryon Resonances be Discovered at RHIC?

Authors: S. D. Paganis, G. W. Hoffmann, R. L. Ray, J. -L. Tang, T. Udagawa, R. S. Longacre

Abstract: The baryon-baryon continuum invariant mass spectrum generated from relativistic nucleus + nucleus collision data may reveal the existence of doubly-strange dibaryons not stable against strong decay if they lie within a few MeV of threshold. Furthermore, since the dominant component of these states is a superposition of two color-octet clusters which can be produced intermediately in a color-deco… ▽ More The baryon-baryon continuum invariant mass spectrum generated from relativistic nucleus + nucleus collision data may reveal the existence of doubly-strange dibaryons not stable against strong decay if they lie within a few MeV of threshold. Furthermore, since the dominant component of these states is a superposition of two color-octet clusters which can be produced intermediately in a color-deconfined quark-gluon plasma (QGP), an enhanced production of dibaryon resonances could be a signal of QGP formation. A total of eight, doubly-strange dibaryon states are considered for experimental search using the STAR detector (Solenoidal Tracker at RHIC) at the new Relativistic Heavy Ion Collider (RHIC). These states may decay to Lambda-Lambda and/or proton-Cascade-minus, depending on the resonance energy. STAR's large acceptance, precision tracking and vertex reconstruction capabilities, and large data volume capacity, make it an ideal instrument to use for such a search. Detector performance and analysis sensitivity are studied as a function of resonance production rate and width for one particular dibaryon which can directly strong decay to proton-Cascade-minus but not Lambda-Lambda. Results indicate that such resonances may be discovered using STAR if the resonance production rates are comparable to coalescence model predictions for dibaryon bound states. △ Less

Submitted 12 June, 2000; v1 submitted 8 October, 1999; originally announced October 1999.

Comments: 28 pages, 5 figures, revised version

Journal ref: Phys.Rev. C62 (2000) 024906

arXiv:nucl-th/9706050 [pdf, ps, other]

doi 10.1103/PhysRevC.56.570

Can only flavor-nonsinglet H dibaryons be stable against strong decays?

Authors: Stathes D. Paganis, Takeshi Udagawa, G. W. Hoffmann, R. L. Ray

Abstract: Using the QCD sum rule approach, we show that the flavor-nonsinglet $H$ dibaryon states with J$^π = 1^+$, J$^π = 0^+$, I=1 (27plet) are nearly degenerate with the J$^π = 0^+$, I=0 singlet $H_0$ dibaryon, which has been predicted to be stable against strong decay, but has not been observed. Our calculation, which does not require an instanton correction, suggests that the $H_0$ is slightly heavie… ▽ More Using the QCD sum rule approach, we show that the flavor-nonsinglet $H$ dibaryon states with J$^π = 1^+$, J$^π = 0^+$, I=1 (27plet) are nearly degenerate with the J$^π = 0^+$, I=0 singlet $H_0$ dibaryon, which has been predicted to be stable against strong decay, but has not been observed. Our calculation, which does not require an instanton correction, suggests that the $H_0$ is slightly heavier than these flavor-nonsinglet $H$s over a wide range of the parameter space. If the singlet $H_0$ mass lies above the $ΛΛ$ threshold (2231~MeV), then the strong interaction breakup to $ΛΛ$ would produce a very broad resonance in the $ΛΛ$ invariant mass spectrum which would be very difficult to observe. On the other hand, if these flavor-nonsinglet J=0 and 1 $H$ dibaryons are also above the $ΛΛ$ threshold, but below the $Ξ^0n$ breakup threshold (2254 MeV), then because the direct, strong interaction decay to the $ΛΛ$ channel is forbidden, these flavor-nonsinglet states might be more amenable to experimental observation. The present results allow a possible reconciliation between the reported observation of $ΛΛ$ hypernuclei, which argue against a stable $H_0$, and the possible existence of $H$ dibaryons in general. △ Less

Submitted 19 June, 1997; originally announced June 1997.

Comments: 10 pages, 2 figures

Journal ref: Phys.Rev.C56:570-573,1997

arXiv:nucl-th/9612046 [pdf, ps, other]

doi 10.1103/PhysRevC.55.1819

Dam** mechanisms of the Delta resonance in nuclei

Authors: B. Koerfgen, P. Oltmanns, F. Osterfeld, T. Udagawa

Abstract: The dam** mechanisms of the Delta(1232) resonance in nuclei are studied by analyzing the quasi-free decay reactions 12C(pi+,pi+ p)11B and 12C(3He,t pi+ p)11B and the 2p emission reactions 12C(pi+,pp)10B and 12C(3He,t pp)10B. The coincidence cross sections are calculated within the framework of the isobar-hole model. It is found that the 2p emission process induced by the decay of the Delta res… ▽ More The dam** mechanisms of the Delta(1232) resonance in nuclei are studied by analyzing the quasi-free decay reactions 12C(pi+,pi+ p)11B and 12C(3He,t pi+ p)11B and the 2p emission reactions 12C(pi+,pp)10B and 12C(3He,t pp)10B. The coincidence cross sections are calculated within the framework of the isobar-hole model. It is found that the 2p emission process induced by the decay of the Delta resonance in the nucleus can be consistently described by a pi+rho+g' model for the Delta+N -> N+N decay interaction. △ Less

Submitted 17 December, 1996; originally announced December 1996.

Comments: 9 pages, 5 Postscript figures, uses RevTex, psfig.sty. Accepted by Physical Review C

Report number: IKP(Th)-96-23

Journal ref: Phys.Rev.C55:1819-1825,1997

Showing 1–28 of 28 results for author: Udagawa, T