Search | arXiv e-print repository

arXiv:2406.16833 [pdf, other]

USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$onversations

Authors: Mounika Marreddy, Subba Reddy Oota, Venkata Charan Chinni, Manish Gupta, Lucie Flek

Abstract: Identifying user's opinions and stances in long conversation threads on various topics can be extremely critical for enhanced personalization, market research, political campaigns, customer service, conflict resolution, targeted advertising, and content moderation. Hence, training language models to automate this task is critical. However, to train such models, gathering manual annotations has mul… ▽ More Identifying user's opinions and stances in long conversation threads on various topics can be extremely critical for enhanced personalization, market research, political campaigns, customer service, conflict resolution, targeted advertising, and content moderation. Hence, training language models to automate this task is critical. However, to train such models, gathering manual annotations has multiple challenges: 1) It is time-consuming and costly; 2) Conversation threads could be very long, increasing chances of noisy annotations; and 3) Interpreting instances where a user changes their opinion within a conversation is difficult because often such transitions are subtle and not expressed explicitly. Inspired by the recent success of large language models (LLMs) for complex natural language processing (NLP) tasks, we leverage Mistral Large and GPT-4 to automate the human annotation process on the following two tasks while also providing reasoning: i) User Stance classification, which involves labeling a user's stance of a post in a conversation on a five-point scale; ii) User Dogmatism classification, which deals with labeling a user's overall opinion in the conversation on a four-point scale. The majority voting on zero-shot, one-shot, and few-shot annotations from these two LLMs on 764 multi-user Reddit conversations helps us curate the USDC dataset. USDC is then used to finetune and instruction-tune multiple deployable small language models for the 5-class stance and 4-class dogmatism classification tasks. We make the code and dataset publicly available [https://anonymous.4open.science/r/USDC-0F7F]. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: 32 pages, 18 figures

arXiv:2405.20665 [pdf]

doi 10.1063/5.0219606

Voltage-insensitive stochastic magnetic tunnel junctions with double free layers

Authors: Rikuto Ota, Keito Kobayashi, Keisuke Hayakawa, Shun Kanai, Kerem Y. Çamsarı, Hideo Ohno, Shunsuke Fukami

Abstract: Stochastic magnetic tunnel junctions (s-MTJ) is a promising component of probabilistic bit (p-bit), which plays a pivotal role in probabilistic computers. For a standard cell structure of the p-bit, s-MTJ is desired to be insensitive to voltage across the junction over several hundred millivolts. In conventional s-MTJs with a reference layer having a fixed magnetization direction, however, the sto… ▽ More Stochastic magnetic tunnel junctions (s-MTJ) is a promising component of probabilistic bit (p-bit), which plays a pivotal role in probabilistic computers. For a standard cell structure of the p-bit, s-MTJ is desired to be insensitive to voltage across the junction over several hundred millivolts. In conventional s-MTJs with a reference layer having a fixed magnetization direction, however, the stochastic output significantly varies with the voltage due to spin-transfer torque (STT) acting on the stochastic free layer. In this work, we study a s-MTJ with a "double-free-layer" design theoretically proposed earlier, in which the fixed reference layer of the conventional structure is replaced by another stochastic free layer, effectively mitigating the influence of STT on the stochastic output. We show that the key device property characterized by the ratio of relaxation times between the high- and low-resistance states is one to two orders of magnitude less sensitive to bias voltage variations compared to conventional s-MTJs when the top and bottom free layers are designed to possess the same effective thickness. This work opens a pathway for reliable, nanosecond-operation, high-output, and scalable spintronics-based p-bits. △ Less

Submitted 31 May, 2024; originally announced May 2024.

Journal ref: Appl. Phys. Lett. 125, 022406 (2024)

arXiv:2312.10683 [pdf, other]

Concordance of Morse functions on manifolds

Authors: Ryosuke Ota

Abstract: In this paper, the concordance of Morse functions is defined, and a necessary and sufficient condition for given two Morse functions to be concordant is presented and is compared with the cobordism criterion. Cobordism of Morse functions on smooth closed manifolds is an equivalence relation defined by using cobordisms of manifolds and fold maps. Given two Morse functions, it is important to decide… ▽ More In this paper, the concordance of Morse functions is defined, and a necessary and sufficient condition for given two Morse functions to be concordant is presented and is compared with the cobordism criterion. Cobordism of Morse functions on smooth closed manifolds is an equivalence relation defined by using cobordisms of manifolds and fold maps. Given two Morse functions, it is important to decide whether they are cobordant or not, and this problem was first solved for surfaces and then for manifolds of general dimensions by Ikegami-Saeki, Kalmár, and Ikegami. On the other hand, for Morse functions on the same manifold, we can consider a stronger equivalence relation called concordance. △ Less

Submitted 17 December, 2023; originally announced December 2023.

Comments: 9 pages, 5 figures

MSC Class: 57R45

arXiv:2311.06642 [pdf, other]

doi 10.1103/PhysRevApplied.21.054002

Double-Free-Layer Stochastic Magnetic Tunnel Junctions with Synthetic Antiferromagnets

Authors: Kemal Selcuk, Shun Kanai, Rikuto Ota, Hideo Ohno, Shunsuke Fukami, Kerem Y. Camsari

Abstract: Stochastic magnetic tunnel junctions (sMTJ) using low-barrier nanomagnets have shown promise as fast, energy-efficient, and scalable building blocks for probabilistic computing. Despite recent experimental and theoretical progress, sMTJs exhibiting the ideal characteristics necessary for probabilistic bits (p-bit) are still lacking. Ideally, the sMTJs should have (a) voltage bias independence prev… ▽ More Stochastic magnetic tunnel junctions (sMTJ) using low-barrier nanomagnets have shown promise as fast, energy-efficient, and scalable building blocks for probabilistic computing. Despite recent experimental and theoretical progress, sMTJs exhibiting the ideal characteristics necessary for probabilistic bits (p-bit) are still lacking. Ideally, the sMTJs should have (a) voltage bias independence preventing read disturbance (b) uniform randomness in the magnetization angle between the free layers, and (c) fast fluctuations without requiring external magnetic fields while being robust to magnetic field perturbations. Here, we propose a new design satisfying all of these requirements, using double-free-layer sMTJs with synthetic antiferromagnets (SAF). We evaluate the proposed sMTJ design with experimentally benchmarked spin-circuit models accounting for transport physics, coupled with the stochastic Landau-Lifshitz-Gilbert equation for magnetization dynamics. We find that the use of low-barrier SAF layers reduces dipolar coupling, achieving uncorrelated fluctuations at zero-magnetic field surviving up to diameters exceeding ($D\approx 100$ nm) if the nanomagnets can be made thin enough ($\approx 1$-$2$ nm). The double-free-layer structure retains bias-independence and the circular nature of the nanomagnets provides near-uniform randomness with fast fluctuations. Combining our full sMTJ model with advanced transistor models, we estimate the energy to generate a random bit as $\approx$ 3.6 fJ, with fluctuation rates of $\approx$ 3.3 GHz per p-bit. Our results will guide the experimental development of superior stochastic magnetic tunnel junctions for large-scale and energy-efficient probabilistic computation for problems relevant to machine learning and artificial intelligence. △ Less

Submitted 30 March, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

Journal ref: Phys. Rev. Applied 21, 054002 (2024)

arXiv:2311.04664 [pdf, other]

Speech language models lack important brain-relevant semantics

Authors: Subba Reddy Oota, Emin Çelik, Fatma Deniz, Mariya Toneva

Abstract: Despite known differences between reading and listening in the brain, recent work has shown that text-based language models predict both text-evoked and speech-evoked brain activity to an impressive degree. This poses the question of what types of information language models truly predict in the brain. We investigate this question via a direct approach, in which we systematically remove specific l… ▽ More Despite known differences between reading and listening in the brain, recent work has shown that text-based language models predict both text-evoked and speech-evoked brain activity to an impressive degree. This poses the question of what types of information language models truly predict in the brain. We investigate this question via a direct approach, in which we systematically remove specific low-level stimulus features (textual, speech, and visual) from language model representations to assess their impact on alignment with fMRI brain recordings during reading and listening. Comparing these findings with speech-based language models reveals starkly different effects of low-level features on brain alignment. While text-based models show reduced alignment in early sensory regions post-removal, they retain significant predictive power in late language regions. In contrast, speech-based models maintain strong alignment in early auditory regions even after feature removal but lose all predictive power in late language regions. These results suggest that speech-based models provide insights into additional information processed by early auditory regions, but caution is needed when using them to model processing in late language regions. We make our code publicly available. [https://github.com/subbareddy248/speech-llm-brain] △ Less

Submitted 16 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

Comments: 26 pages, 20 figures, The 62nd Annual Meeting of the Association for Computational Linguistics, Long paper - Main

arXiv:2311.04432 [pdf]

Emphasizing Cherenkov photons from bismuth germanate by single photon deconvolution

Authors: Ryosuke Ota, Kibo Ote

Abstract: Bismuth germanate (BGO) has been receiving attention again because it is a potential scintillator for future time-of-flight positron emission tomography. Owing to its optical properties, BGO emits a relatively large number of Cherenkov photons after 511 keV gamma-ray interactions, which pushes the timing resolution of a detector. Nonetheless, efficiently detecting Cherenkov photons among scintilla… ▽ More Bismuth germanate (BGO) has been receiving attention again because it is a potential scintillator for future time-of-flight positron emission tomography. Owing to its optical properties, BGO emits a relatively large number of Cherenkov photons after 511 keV gamma-ray interactions, which pushes the timing resolution of a detector. Nonetheless, efficiently detecting Cherenkov photons among scintillation photons is similar to looking for a needle in a haystack. Thus, we propose a method that can efficiently emphasize Cherenkov photon from a detector waveform by deconvolving a single photon response of photodetector. As a proof-of-concept, we perform the deconvolution, and a probability density function (PDF) of bismuth germanate was obtained, which is compared to a conventional time correlated single photon counting method. Furthermore, we investigate if the proposed deconvolution can emphasize a faint Cherenkov photon. Consequently, the PDF obtained by the proposed deconvolution shows a good agreement with that obtained using a conventional method. A coincidence time resolution obtained using the proposed deconvolution is improved by 43% in full width at half maximum, compared to a voltage-based leading edge discriminator. It can be concluded that the proposed deconvolution method can efficiently emphasize Cherenkov photon and improve the timing performance of BGO-based detectors. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: 10 pages, 10 figures

arXiv:2307.10246 [pdf, other]

Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Authors: Subba Reddy Oota, Zijiao Chen, Manish Gupta, Raju S. Bapi, Gael Jobard, Frederic Alexandre, Xavier Hinaut

Abstract: Can we obtain insights about the brain using AI models? How is the information in deep learning models related to brain recordings? Can we improve AI models with the help of brain recordings? Such questions can be tackled by studying brain recordings like functional magnetic resonance imaging (fMRI). As a first step, the neuroscience community has contributed several large cognitive neuroscience d… ▽ More Can we obtain insights about the brain using AI models? How is the information in deep learning models related to brain recordings? Can we improve AI models with the help of brain recordings? Such questions can be tackled by studying brain recordings like functional magnetic resonance imaging (fMRI). As a first step, the neuroscience community has contributed several large cognitive neuroscience datasets related to passive reading/listening/viewing of concept words, narratives, pictures, and movies. Encoding and decoding models using these datasets have also been proposed in the past two decades. These models serve as additional tools for basic cognitive science and neuroscience research. Encoding models aim at generating fMRI brain representations given a stimulus automatically. They have several practical applications in evaluating and diagnosing neurological conditions and thus may also help design therapies for brain damage. Decoding models solve the inverse problem of reconstructing the stimuli given the fMRI. They are useful for designing brain-machine or brain-computer interfaces. Inspired by the effectiveness of deep learning models for natural language processing, computer vision, and speech, several neural encoding and decoding models have been recently proposed. In this survey, we will first discuss popular representations of language, vision and speech stimuli, and present a summary of neuroscience datasets. Further, we will review popular deep learning based encoding and decoding architectures and note their benefits and limitations. Finally, we will conclude with a summary and discussion about future trends. Given the large amount of recently published work in the computational cognitive neuroscience (CCN) community, we believe that this survey enables an entry point for DNN researchers to diversify into CCN research. △ Less

Submitted 8 July, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

Comments: 47 pages, 23 figures

arXiv:2305.14453 [pdf, other]

On Robustness of Finetuned Transformer-based NLP Models

Authors: Pavan Kalyan Reddy Neerudu, Subba Reddy Oota, Mounika Marreddy, Venkateswara Rao Kagita, Manish Gupta

Abstract: Transformer-based pretrained models like BERT, GPT-2 and T5 have been finetuned for a large number of natural language processing (NLP) tasks, and have been shown to be very effective. However, while finetuning, what changes across layers in these models with respect to pretrained checkpoints is under-studied. Further, how robust are these models to perturbations in input text? Does the robustness… ▽ More Transformer-based pretrained models like BERT, GPT-2 and T5 have been finetuned for a large number of natural language processing (NLP) tasks, and have been shown to be very effective. However, while finetuning, what changes across layers in these models with respect to pretrained checkpoints is under-studied. Further, how robust are these models to perturbations in input text? Does the robustness vary depending on the NLP task for which the models have been finetuned? While there exists some work on studying the robustness of BERT finetuned for a few NLP tasks, there is no rigorous study that compares this robustness across encoder only, decoder only and encoder-decoder models. In this paper, we characterize changes between pretrained and finetuned language model representations across layers using two metrics: CKA and STIR. Further, we study the robustness of three language models (BERT, GPT-2 and T5) with eight different text perturbations on classification tasks from the General Language Understanding Evaluation (GLUE) benchmark, and generation tasks like summarization, free-form generation and question generation. GPT-2 representations are more robust than BERT and T5 across multiple types of input perturbation. Although models exhibit good robustness broadly, drop** nouns, verbs or changing characters are the most impactful. Overall, this study provides valuable insights into perturbation-specific weaknesses of popular Transformer-based models, which should be kept in mind when passing inputs. We make the code and models publicly available [https://github.com/PavanNeerudu/Robustness-of-Transformers-models]. △ Less

Submitted 8 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: 16 pages, 8 figures, To be published in the proceedings of the Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP 2023), Singapore, Long paper

arXiv:2304.09539 [pdf, other]

doi 10.1109/TASC.2023.3254488

Fabrication of a 64-Pixel TES Microcalorimeter Array with Iron Absorbers Uniquely Designed for 14.4-keV Solar Axion Search

Authors: Yuta Yagi, Tasuku Hayashi, Keita Tanaka, Rikuta Miyagawa, Ryo Ota, Noriko Y. Yamasaki, Kazuhisa Mitsuda, Nao Yoshida, Mikiko Saito, Takayuki Homma

Abstract: If a hypothetical elementary particle called an axion exists, to solve the strong CP problem, a 57Fe nucleus in the solar core could emit a 14.4-keV monochromatic axion through the M1 transition. If such axions are once more transformed into photons by a 57Fe absorber, a transition edge sensor (TES) X-ray microcalorimeter should be able to detect them efficiently. We have designed and fabricated a… ▽ More If a hypothetical elementary particle called an axion exists, to solve the strong CP problem, a 57Fe nucleus in the solar core could emit a 14.4-keV monochromatic axion through the M1 transition. If such axions are once more transformed into photons by a 57Fe absorber, a transition edge sensor (TES) X-ray microcalorimeter should be able to detect them efficiently. We have designed and fabricated a dedicated 64-pixel TES array with iron absorbers for the solar axion search. In order to decrease the effect of iron magnetization on spectroscopic performance, the iron absorber is placed next to the TES while maintaining a certain distance. A gold thermal transfer strap connects them. We have accomplished the electroplating of gold straps with high thermal conductivity. The residual resistivity ratio (RRR) was over 23, more than eight times higher than a previous evaporated strap. In addition, we successfully electroplated pure-iron films of more than a few micrometers in thickness for absorbers and a fabricated 64-pixel TES calorimeter structure. △ Less

Submitted 19 April, 2023; originally announced April 2023.

Comments: 5 pages, 5 figures, published in IEEE Transactions on Applied Superconductivity on 8 March 2023

arXiv:2302.08589 [pdf, other]

Syntactic Structure Processing in the Brain while Listening

Authors: Subba Reddy Oota, Mounika Marreddy, Manish Gupta, Bapi Raju Surampud

Abstract: Syntactic parsing is the task of assigning a syntactic structure to a sentence. There are two popular syntactic parsing methods: constituency and dependency parsing. Recent works have used syntactic embeddings based on constituency trees, incremental top-down parsing, and other word syntactic features for brain activity prediction given the text stimuli to study how the syntax structure is represe… ▽ More Syntactic parsing is the task of assigning a syntactic structure to a sentence. There are two popular syntactic parsing methods: constituency and dependency parsing. Recent works have used syntactic embeddings based on constituency trees, incremental top-down parsing, and other word syntactic features for brain activity prediction given the text stimuli to study how the syntax structure is represented in the brain's language network. However, the effectiveness of dependency parse trees or the relative predictive power of the various syntax parsers across brain areas, especially for the listening task, is yet unexplored. In this study, we investigate the predictive power of the brain encoding models in three settings: (i) individual performance of the constituency and dependency syntactic parsing based embedding methods, (ii) efficacy of these syntactic parsing based embedding methods when controlling for basic syntactic signals, (iii) relative effectiveness of each of the syntactic embedding methods when controlling for the other. Further, we explore the relative importance of syntactic information (from these syntactic embedding methods) versus semantic information using BERT embeddings. We find that constituency parsers help explain activations in the temporal lobe and middle-frontal gyrus, while dependency parsers better encode syntactic structure in the angular gyrus and posterior cingulate cortex. Although semantic signals from BERT are more effective compared to any of the syntactic features or embedding methods, syntactic embedding methods explain additional variance for a few brain regions. △ Less

Submitted 16 February, 2023; originally announced February 2023.

Comments: 21 pages, 22 figures

arXiv:2212.12937 [pdf, other]

GAE-ISumm: Unsupervised Graph-Based Summarization of Indian Languages

Authors: Lakshmi Sireesha Vakada, Anudeep Ch, Mounika Marreddy, Subba Reddy Oota, Radhika Mamidi

Abstract: Document summarization aims to create a precise and coherent summary of a text document. Many deep learning summarization models are developed mainly for English, often requiring a large training corpus and efficient pre-trained language models and tools. However, English summarization models for low-resource Indian languages are often limited by rich morphological variation, syntax, and semantic… ▽ More Document summarization aims to create a precise and coherent summary of a text document. Many deep learning summarization models are developed mainly for English, often requiring a large training corpus and efficient pre-trained language models and tools. However, English summarization models for low-resource Indian languages are often limited by rich morphological variation, syntax, and semantic differences. In this paper, we propose GAE-ISumm, an unsupervised Indic summarization model that extracts summaries from text documents. In particular, our proposed model, GAE-ISumm uses Graph Autoencoder (GAE) to learn text representations and a document summary jointly. We also provide a manually-annotated Telugu summarization dataset TELSUM, to experiment with our model GAE-ISumm. Further, we experiment with the most publicly available Indian language summarization datasets to investigate the effectiveness of GAE-ISumm on other Indian languages. Our experiments of GAE-ISumm in seven languages make the following observations: (i) it is competitive or better than state-of-the-art results on all datasets, (ii) it reports benchmark results on TELSUM, and (iii) the inclusion of positional and cluster information in the proposed model improved the performance of summaries. △ Less

Submitted 25 December, 2022; originally announced December 2022.

Comments: 9 pages, 7 figures

arXiv:2212.08094 [pdf, other]

Joint processing of linguistic properties in brains and language models

Authors: Subba Reddy Oota, Manish Gupta, Mariya Toneva

Abstract: Language models have been shown to be very effective in predicting brain recordings of subjects experiencing complex language stimuli. For a deeper understanding of this alignment, it is important to understand the correspondence between the detailed processing of linguistic information by the human brain versus language models. We investigate this correspondence via a direct approach, in which we… ▽ More Language models have been shown to be very effective in predicting brain recordings of subjects experiencing complex language stimuli. For a deeper understanding of this alignment, it is important to understand the correspondence between the detailed processing of linguistic information by the human brain versus language models. We investigate this correspondence via a direct approach, in which we eliminate information related to specific linguistic properties in the language model representations and observe how this intervention affects the alignment with fMRI brain recordings obtained while participants listened to a story. We investigate a range of linguistic properties (surface, syntactic, and semantic) and find that the elimination of each one results in a significant decrease in brain alignment. Specifically, we find that syntactic properties (i.e. Top Constituents and Tree Depth) have the largest effect on the trend of brain alignment across model layers. These findings provide clear evidence for the role of specific linguistic information in the alignment between brain and language models, and open new avenues for map** the joint information processing in both systems. We make the code publicly available [https://github.com/subbareddy248/linguistic-properties-brain-alignment]. △ Less

Submitted 8 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

Comments: 22 pages, 12 figures, To be published in the proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, USA

arXiv:2205.01404 [pdf, other]

doi 10.18653/v1/2022.naacl-main.235

Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?

Authors: Subba Reddy Oota, Jashn Arora, Veeral Agarwal, Mounika Marreddy, Manish Gupta, Bapi Raju Surampudi

Abstract: Several popular Transformer based language models have been found to be successful for text-driven brain encoding. However, existing literature leverages only pretrained text Transformer models and has not explored the efficacy of task-specific learned Transformer representations. In this work, we explore transfer learning from representations learned for ten popular natural language processing ta… ▽ More Several popular Transformer based language models have been found to be successful for text-driven brain encoding. However, existing literature leverages only pretrained text Transformer models and has not explored the efficacy of task-specific learned Transformer representations. In this work, we explore transfer learning from representations learned for ten popular natural language processing tasks (two syntactic and eight semantic) for predicting brain responses from two diverse datasets: Pereira (subjects reading sentences from paragraphs) and Narratives (subjects listening to the spoken stories). Encoding models based on task features are used to predict activity in different regions across the whole brain. Features from coreference resolution, NER, and shallow syntax parsing explain greater variance for the reading activity. On the other hand, for the listening activity, tasks such as paraphrase generation, summarization, and natural language inference show better encoding performance. Experiments across all 10 task representations provide the following cognitive insights: (i) language left hemisphere has higher predictive brain activity versus language right hemisphere, (ii) posterior medial cortex, temporo-parieto-occipital junction, dorsal frontal lobe have higher correlation versus early auditory and auditory association cortex, (iii) syntactic and semantic tasks display a good predictive performance across brain regions for reading and listening stimuli resp. △ Less

Submitted 3 May, 2022; originally announced May 2022.

Comments: 18 pages, 18 figures

arXiv:2205.01204 [pdf, other]

Multi-Task Text Classification using Graph Convolutional Networks for Large-Scale Low Resource Language

Authors: Mounika Marreddy, Subba Reddy Oota, Lakshmi Sireesha Vakada, Venkata Charan Chinni, Radhika Mamidi

Abstract: Graph Convolutional Networks (GCN) have achieved state-of-art results on single text classification tasks like sentiment analysis, emotion detection, etc. However, the performance is achieved by testing and reporting on resource-rich languages like English. Applying GCN for multi-task text classification is an unexplored area. Moreover, training a GCN or adopting an English GCN for Indian language… ▽ More Graph Convolutional Networks (GCN) have achieved state-of-art results on single text classification tasks like sentiment analysis, emotion detection, etc. However, the performance is achieved by testing and reporting on resource-rich languages like English. Applying GCN for multi-task text classification is an unexplored area. Moreover, training a GCN or adopting an English GCN for Indian languages is often limited by data availability, rich morphological variation, syntax, and semantic differences. In this paper, we study the use of GCN for the Telugu language in single and multi-task settings for four natural language processing (NLP) tasks, viz. sentiment analysis (SA), emotion identification (EI), hate-speech (HS), and sarcasm detection (SAR). In order to evaluate the performance of GCN with one of the Indian languages, Telugu, we analyze the GCN based models with extensive experiments on four downstream tasks. In addition, we created an annotated Telugu dataset, TEL-NLP, for the four NLP tasks. Further, we propose a supervised graph reconstruction method, Multi-Task Text GCN (MT-Text GCN) on the Telugu that leverages to simultaneously (i) learn the low-dimensional word and sentence graph embeddings from word-sentence graph reconstruction using graph autoencoder (GAE) and (ii) perform multi-task text classification using these latent sentence graph embeddings. We argue that our proposed MT-Text GCN achieves significant improvements on TEL-NLP over existing Telugu pretrained word embeddings, and multilingual pretrained Transformer models: mBERT, and XLM-R. On TEL-NLP, we achieve a high F1-score for four NLP tasks: SA (0.84), EI (0.55), HS (0.83) and SAR (0.66). Finally, we show our model's quantitative and qualitative analysis on the four NLP tasks in Telugu. △ Less

Submitted 2 May, 2022; originally announced May 2022.

Comments: 9 pages, 6 figures

arXiv:2204.09564 [pdf, other]

Cross-view Brain Decoding

Authors: Subba Reddy Oota, Jashn Arora, Manish Gupta, Raju S. Bapi

Abstract: How the brain captures the meaning of linguistic stimuli across multiple views is still a critical open question in neuroscience. Consider three different views of the concept apartment: (1) picture (WP) presented with the target word label, (2) sentence (S) using the target word, and (3) word cloud (WC) containing the target word along with other semantically related words. Unlike previous effort… ▽ More How the brain captures the meaning of linguistic stimuli across multiple views is still a critical open question in neuroscience. Consider three different views of the concept apartment: (1) picture (WP) presented with the target word label, (2) sentence (S) using the target word, and (3) word cloud (WC) containing the target word along with other semantically related words. Unlike previous efforts, which focus only on single view analysis, in this paper, we study the effectiveness of brain decoding in a zero-shot cross-view learning setup. Further, we propose brain decoding in the novel context of cross-view-translation tasks like image captioning (IC), image tagging (IT), keyword extraction (KE), and sentence formation (SF). Using extensive experiments, we demonstrate that cross-view zero-shot brain decoding is practical leading to ~0.68 average pairwise accuracy across view pairs. Also, the decoded representations are sufficiently detailed to enable high accuracy for cross-view-translation tasks with following pairwise accuracy: IC (78.0), IT (83.0), KE (83.7) and SF (74.5). Analysis of the contribution of different brain networks reveals exciting cognitive insights: (1) A high percentage of visual voxels are involved in image captioning and image tagging tasks, and a high percentage of language voxels are involved in the sentence formation and keyword extraction tasks. (2) Zero-shot accuracy of the model trained on S view and tested on WC view is better than same-view accuracy of the model trained and tested on WC view. △ Less

Submitted 18 April, 2022; originally announced April 2022.

Comments: 11 pages, 10 figures

arXiv:2204.08261 [pdf, other]

Visio-Linguistic Brain Encoding

Authors: Subba Reddy Oota, Jashn Arora, Vijay Rowtula, Manish Gupta, Raju S. Bapi

Abstract: Enabling effective brain-computer interfaces requires understanding how the human brain encodes stimuli across modalities such as visual, language (or text), etc. Brain encoding aims at constructing fMRI brain activity given a stimulus. There exists a plethora of neural encoding models which study brain encoding for single mode stimuli: visual (pretrained CNNs) or text (pretrained language models)… ▽ More Enabling effective brain-computer interfaces requires understanding how the human brain encodes stimuli across modalities such as visual, language (or text), etc. Brain encoding aims at constructing fMRI brain activity given a stimulus. There exists a plethora of neural encoding models which study brain encoding for single mode stimuli: visual (pretrained CNNs) or text (pretrained language models). Few recent papers have also obtained separate visual and text representation models and performed late-fusion using simple heuristics. However, previous work has failed to explore: (a) the effectiveness of image Transformer models for encoding visual stimuli, and (b) co-attentive multi-modal modeling for visual and text reasoning. In this paper, we systematically explore the efficacy of image Transformers (ViT, DEiT, and BEiT) and multi-modal Transformers (VisualBERT, LXMERT, and CLIP) for brain encoding. Extensive experiments on two popular datasets, BOLD5000 and Pereira, provide the following insights. (1) To the best of our knowledge, we are the first to investigate the effectiveness of image and multi-modal Transformers for brain encoding. (2) We find that VisualBERT, a multi-modal Transformer, significantly outperforms previously proposed single-mode CNNs, image Transformers as well as other previously proposed multi-modal models, thereby establishing new state-of-the-art. The supremacy of visio-linguistic models raises the question of whether the responses elicited in the visual regions are affected implicitly by linguistic processing even when passively viewing images. Future fMRI tasks can verify this computational insight in an appropriate experimental setting. △ Less

Submitted 18 April, 2022; originally announced April 2022.

Comments: 18 pages, 13 figures

arXiv:2110.03196 [pdf, other]

Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization

Authors: Ryosuke Ota, Reiya Hagiwara, Naoki Hamada, Likun Liu, Takahiro Yamamoto, Daisuke Sakurai

Abstract: In multi-objective optimization, designing good benchmark problems is an important issue for improving solvers. Controlling the global location of Pareto optima in existing benchmark problems has been problematic, and it is even more difficult when the design space is high-dimensional since visualization is extremely challenging. As a benchmarking with explicit local Pareto fronts, we introduc… ▽ More In multi-objective optimization, designing good benchmark problems is an important issue for improving solvers. Controlling the global location of Pareto optima in existing benchmark problems has been problematic, and it is even more difficult when the design space is high-dimensional since visualization is extremely challenging. As a benchmarking with explicit local Pareto fronts, we introduce a benchmarking based on basin connectivity (3BC) by using basins of attraction. The 3BC allows for the specification of a multimodal landscape through a kind of topological analysis called the basin graph, effectively generating optimization problems from this graph. Various known indicators measure the performance of a solver in searching global Pareto optima, but using 3BC can make us localize them for each local Pareto front by restricting it to its basin. 3BC's mathematical formulation ensures the accurate representation of the specified optimization landscape, guaranteeing the existence of intended local and global Pareto optima. △ Less

Submitted 9 February, 2024; v1 submitted 7 October, 2021; originally announced October 2021.

arXiv:2109.00802 [pdf]

doi 10.1016/j.media.2021.102226

Anatomical-Guided Attention Enhances Unsupervised PET Image Denoising Performance

Authors: Yuya Onishi, Fumio Hashimoto, Kibo Ote, Hiroyuki Ohba, Ryosuke Ota, Etsuji Yoshikawa, Yasuomi Ouchi

Abstract: Although supervised convolutional neural networks (CNNs) often outperform conventional alternatives for denoising positron emission tomography (PET) images, they require many low- and high-quality reference PET image pairs. Herein, we propose an unsupervised 3D PET image denoising method based on an anatomical information-guided attention mechanism. The proposed magnetic resonance-guided deep deco… ▽ More Although supervised convolutional neural networks (CNNs) often outperform conventional alternatives for denoising positron emission tomography (PET) images, they require many low- and high-quality reference PET image pairs. Herein, we propose an unsupervised 3D PET image denoising method based on an anatomical information-guided attention mechanism. The proposed magnetic resonance-guided deep decoder (MR-GDD) utilizes the spatial details and semantic features of MR-guidance image more effectively by introducing encoder-decoder and deep decoder subnetworks. Moreover, the specific shapes and patterns of the guidance image do not affect the denoised PET image, because the guidance image is input to the network through an attention gate. In a Monte Carlo simulation of [$^{18}$F]fluoro-2-deoxy-D-glucose (FDG), the proposed method achieved the highest peak signal-to-noise ratio and structural similarity (27.92 $\pm$ 0.44 dB/0.886 $\pm$ 0.007), as compared with Gaussian filtering (26.68 $\pm$ 0.10 dB/0.807 $\pm$ 0.004), image guided filtering (27.40 $\pm$ 0.11 dB/0.849 $\pm$ 0.003), deep image prior (DIP) (24.22 $\pm$ 0.43 dB/0.737 $\pm$ 0.017), and MR-DIP (27.65 $\pm$ 0.42 dB/0.879 $\pm$ 0.007). Furthermore, we experimentally visualized the behavior of the optimization process, which is often unknown in unsupervised CNN-based restoration problems. For preclinical (using [$^{18}$F]FDG and [$^{11}$C]raclopride) and clinical (using [$^{18}$F]florbetapir) studies, the proposed method demonstrates state-of-the-art denoising performance while retaining spatial resolution and quantitative accuracy, despite using a common network architecture for various noisy PET images with 1/10th of the full counts. These results suggest that the proposed MR-GDD can reduce PET scan times and PET tracer doses considerably without impacting patients. △ Less

Submitted 7 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

Comments: 30 pages, 12 figures

Journal ref: Med. Image Anal. 74 (2021) 102226

arXiv:2105.05805 [pdf]

Direct positron emission imaging: ultra-fast timing enables reconstruction-free imaging

Authors: Ryosuke Ota, Sun Il Kwon, Eric Berg, Fumio Hashimoto, Kyohei Nakajima, Izumi Ogawa, Yoichi Tamagawa, Tomohide Omura, Tomoyuki Hasegawa, Simon R. Cherry

Abstract: Positron emission tomography, like many other tomographic imaging modalities, relies on an image reconstruction step to produce cross-sectional images from projection data. Detection and localization of the back-to-back annihilation photons produced by positron-electron annihilation defines the trajectories of these photons, which when combined with tomographic reconstruction algorithms, permits r… ▽ More Positron emission tomography, like many other tomographic imaging modalities, relies on an image reconstruction step to produce cross-sectional images from projection data. Detection and localization of the back-to-back annihilation photons produced by positron-electron annihilation defines the trajectories of these photons, which when combined with tomographic reconstruction algorithms, permits recovery of the distribution of positron-emitting radionuclides. Here we produce cross-sectional images directly from the detected coincident annihilation photons, without using a reconstruction algorithm. Ultra-fast radiation detectors with a resolving time averaging 32 picoseconds measured the difference in arrival time of pairs of annihilation photons, localizing the annihilation site to 4.8 mm. This is sufficient to directly generate an image without reconstruction and without the geometric and sampling constraints that normally present for tomographic imaging systems. △ Less

Submitted 12 May, 2021; originally announced May 2021.

arXiv:2010.02742 [pdf, other]

Wound and episode level readmission risk or weeks to readmit: Why do patients get readmitted? How long does it take for a patient to get readmitted?

Authors: Subba Reddy Oota, Nafisur Rahman, Shahid Saleem Mohammed, Jeffrey Galitz, Ming Liu

Abstract: The Affordable care Act of 2010 had introduced Readmission reduction program in 2012 to reduce avoidable re-admissions to control rising healthcare costs. Wound care impacts 15 of medicare beneficiaries making it one of the major contributors of medicare health care cost. Health plans have been exploring proactive health care services that can focus on preventing wound recurrences and re-admission… ▽ More The Affordable care Act of 2010 had introduced Readmission reduction program in 2012 to reduce avoidable re-admissions to control rising healthcare costs. Wound care impacts 15 of medicare beneficiaries making it one of the major contributors of medicare health care cost. Health plans have been exploring proactive health care services that can focus on preventing wound recurrences and re-admissions to control the wound care costs. With rising costs of Wound care industry, it has become of paramount importance to reduce wound recurrences & patient re-admissions. What factors are responsible for a Wound to recur which ultimately lead to hospitalization or re-admission? Is there a way to identify the patients at risk of re-admission before the occurrence using data driven analysis? Patient re-admission risk management has become critical for patients suffering from chronic wounds such as diabetic ulcers, pressure ulcers, and vascular ulcers. Understanding the risk & the factors that cause patient readmission can help care providers and patients avoid wound recurrences. Our work focuses on identifying patients who are at high risk of re-admission & determining the time period with in which a patient might get re-admitted. Frequent re-admissions add financial stress to the patient & Health plan and deteriorate the quality of life of the patient. Having this information can allow a provider to set up preventive measures that can delay, if not prevent, patients' re-admission. On a combined wound & episode-level data set of patient's wound care information, our extended autoprognosis achieves a recall of 92 and a precision of 92 for the predicting a patient's re-admission risk. For new patient class, precision and recall are as high as 91 and 98, respectively. We are also able to predict the patient's discharge event for a re-admission event to occur through our model with a MAE of 2.3 weeks. △ Less

Submitted 5 October, 2020; originally announced October 2020.

Comments: 7 pages, 7 figures

arXiv:1909.12299 [pdf, other]

Expert2Coder: Capturing Divergent Brain Regions Using Mixture of Regression Experts

Authors: Subba Reddy Oota, Naresh Manwani, Raju S. Bapi

Abstract: fMRI semantic category understanding using linguistic encoding models attempts to learn a forward map** that relates stimuli to the corresponding brain activation. State-of-the-art encoding models use a single global model (linear or non-linear) to predict brain activation given the stimulus. However, the critical assumption in these methods is that a priori different brain regions respond the s… ▽ More fMRI semantic category understanding using linguistic encoding models attempts to learn a forward map** that relates stimuli to the corresponding brain activation. State-of-the-art encoding models use a single global model (linear or non-linear) to predict brain activation given the stimulus. However, the critical assumption in these methods is that a priori different brain regions respond the same way to all the stimuli, that is, there is no modularity or specialization assumed for any region. This goes against the modularity theory, supported by many cognitive neuroscience investigations suggesting that there are functionally specialized regions in the brain. In this paper, we achieve this by clustering similar regions together and for every cluster we learn a different linear regression model using a mixture of linear experts model. The key idea here is that each linear expert captures the behaviour of similar brain regions. Given a new stimulus, the utility of the proposed model is twofold (i) predicts the brain activation as a weighted linear combination of the activations of multiple linear experts and (ii) to learn multiple experts corresponding to different brain regions. We argue that each expert captures activity patterns related to a particular region of interest (ROI) in the human brain. This study helps in understanding the brain regions that are activated together given different kinds of stimuli. Importantly, we suggest that the mixture of regression experts (MoRE) framework successfully combines the two principles of organization of function in the brain, namely that of specialization and integration. Experiments on fMRI data from paradigm 1 [1]where participants view linguistic stimuli show that the proposed MoRE model has better prediction accuracy compared to that of conventional models. △ Less

Submitted 29 May, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

Comments: 15 pages

arXiv:1904.00762 [pdf, other]

Affect in Tweets Using Experts Model

Authors: Subba Reddy Oota, Adithya Avvaru, Mounika Marreddy, Radhika Mamidi

Abstract: Estimating the intensity of emotion has gained significance as modern textual inputs in potential applications like social media, e-retail markets, psychology, advertisements etc., carry a lot of emotions, feelings, expressions along with its meaning. However, the approaches of traditional sentiment analysis primarily focuses on classifying the sentiment in general (positive or negative) or at an… ▽ More Estimating the intensity of emotion has gained significance as modern textual inputs in potential applications like social media, e-retail markets, psychology, advertisements etc., carry a lot of emotions, feelings, expressions along with its meaning. However, the approaches of traditional sentiment analysis primarily focuses on classifying the sentiment in general (positive or negative) or at an aspect level(very positive, low negative, etc.) and cannot exploit the intensity information. Moreover, automatically identifying emotions like anger, fear, joy, sadness, disgust etc., from text introduces challenging scenarios where single tweet may contain multiple emotions with different intensities and some emotions may even co-occur in some of the tweets. In this paper, we propose an architecture, Experts Model, inspired from the standard Mixture of Experts (MoE) model. The key idea here is each expert learns different sets of features from the feature vector which helps in better emotion detection from the tweet. We compared the results of our Experts Model with both baseline results and top five performers of SemEval-2018 Task-1, Affect in Tweets (AIT). The experimental results show that our proposed approach deals with the emotion detection problem and stands at top-5 results. △ Less

Submitted 20 March, 2019; originally announced April 2019.

Comments: 10 pages, 6 figures, The 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC 32)

arXiv:1811.10740 [pdf, other]

Mixture of Regression Experts in fMRI Encoding

Authors: Subba Reddy Oota, Adithya Avvaru, Naresh Manwani, Raju S. Bapi

Abstract: fMRI semantic category understanding using linguistic encoding models attempt to learn a forward map** that relates stimuli to the corresponding brain activation. Classical encoding models use linear multi-variate methods to predict the brain activation (all voxels) given the stimulus. However, these methods essentially assume multiple regions as one large uniform region or several independent r… ▽ More fMRI semantic category understanding using linguistic encoding models attempt to learn a forward map** that relates stimuli to the corresponding brain activation. Classical encoding models use linear multi-variate methods to predict the brain activation (all voxels) given the stimulus. However, these methods essentially assume multiple regions as one large uniform region or several independent regions, ignoring connections among them. In this paper, we present a mixture of experts-based model where a group of experts captures brain activity patterns related to particular regions of interest (ROI) and also show the discrimination across different experts. The model is trained word stimuli encoded as 25-dimensional feature vectors as input and the corresponding brain responses as output. Given a new word (25-dimensional feature vector), it predicts the entire brain activation as the linear combination of multiple experts brain activations. We argue that each expert learns a certain region of brain activations corresponding to its category of words, which solves the problem of identifying the regions with a simple encoding model. We showcase that proposed mixture of experts-based model indeed learns region-based experts to predict the brain activations with high spatial accuracy. △ Less

Submitted 1 December, 2018; v1 submitted 26 November, 2018; originally announced November 2018.

Comments: 8 pages, 3 figures, Workshop on Visually Grounded Interaction and Language @ 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, Canada

arXiv:1806.05177 [pdf, other]

fMRI Semantic Category Decoding using Linguistic Encoding of Word Embeddings

Authors: Subba Reddy Oota, Naresh Manwani, Bapi Raju S

Abstract: The dispute of how the human brain represents conceptual knowledge has been argued in many scientific fields. Brain imaging studies have shown that the spatial patterns of neural activation in the brain are correlated with thinking about different semantic categories of words (for example, tools, animals, and buildings) or when viewing the related pictures. In this paper, we present a computationa… ▽ More The dispute of how the human brain represents conceptual knowledge has been argued in many scientific fields. Brain imaging studies have shown that the spatial patterns of neural activation in the brain are correlated with thinking about different semantic categories of words (for example, tools, animals, and buildings) or when viewing the related pictures. In this paper, we present a computational model that learns to predict the neural activation captured in functional magnetic resonance imaging (fMRI) data of test words. Unlike the models with hand-crafted features that have been used in the literature, in this paper we propose a novel approach wherein decoding models are built with features extracted from popular linguistic encodings of Word2Vec, GloVe, Meta-Embeddings in conjunction with the empirical fMRI data associated with viewing several dozen concrete nouns. We compared these models with several other models that use word features extracted from FastText, Randomly-generated features, Mitchell's 25 features [1]. The experimental results show that the predicted fMRI images using Meta-Embeddings meet the state-of-the-art performance. Although models with features from GloVe and Word2Vec predict fMRI images similar to the state-of-the-art model, model with features from Meta-Embeddings predicts significantly better. The proposed scheme that uses popular linguistic encoding offers a simple and easy approach for semantic decoding from fMRI experiments. △ Less

Submitted 13 June, 2018; originally announced June 2018.

Comments: 12 pages, 7 Figures

arXiv:1710.02861 [pdf, ps, other]

Clickbait detection using word embeddings

Authors: Vijayasaradhi Indurthi, Subba Reddy Oota

Abstract: Clickbait is a pejorative term describing web content that is aimed at generating online advertising revenue, especially at the expense of quality or accuracy, relying on sensationalist headlines or eye-catching thumbnail pictures to attract click-throughs and to encourage forwarding of the material over online social networks. We use distributed word representations of the words in the title as f… ▽ More Clickbait is a pejorative term describing web content that is aimed at generating online advertising revenue, especially at the expense of quality or accuracy, relying on sensationalist headlines or eye-catching thumbnail pictures to attract click-throughs and to encourage forwarding of the material over online social networks. We use distributed word representations of the words in the title as features to identify clickbaits in online news media. We train a machine learning model using linear regression to predict the cickbait score of a given tweet. Our methods achieve an F1-score of 64.98\% and an MSE of 0.0791. Compared to other methods, our method is simple, fast to train, does not require extensive feature engineering and yet moderately effective. △ Less

Submitted 8 October, 2017; originally announced October 2017.

Comments: Clickbait Challenge 2017

arXiv:1703.00623 [pdf, ps, other]

doi 10.1103/PhysRevC.96.014005

Missing-mass spectroscopy with the ${}^{6}$Li($π^{-}, K^{+}$)X reaction to search for ${}^{6}_Λ$H

Authors: R. Honda, M. Agnello, J. K. Ahn, S. Ajimura, Y. Akazawa, N. Amano, K. Aoki, H. C. Bhang, N. Chiga, M. Endo, P. Evtoukhovitch, A. Feliciello, H. Fujioka, T. Fukuda, S. Hasegawa, S. H. Hayakawa, K. Hosomi, S. H. Hwang, Y. Ichikawa, Y. Igarashi, K. Imai, N. Ishibashi, R. Iwasaki, C. W. Joo, R. Kiuchi , et al. (41 additional authors not shown)

Abstract: We searched for the bound state of the neutron-rich $Λ$-hypernucleus ${}^{6}_Λ$H, using the ${}^{6}$Li($π^{-}, K^{+}$)X double charge-exchange reaction at a $π^{-}$ beam momentum of 1.2 GeV/c at J-PARC. A total of $1.4 \times 10^{12}$ $π^{-}$ was driven onto a ${}^{6}$Li target of 3.5-g/$\rm cm^2$ thickness. No event was observed below the bound threshold, i.e., the mass of ${}^{4}_Λ$H + 2n, in th… ▽ More We searched for the bound state of the neutron-rich $Λ$-hypernucleus ${}^{6}_Λ$H, using the ${}^{6}$Li($π^{-}, K^{+}$)X double charge-exchange reaction at a $π^{-}$ beam momentum of 1.2 GeV/c at J-PARC. A total of $1.4 \times 10^{12}$ $π^{-}$ was driven onto a ${}^{6}$Li target of 3.5-g/$\rm cm^2$ thickness. No event was observed below the bound threshold, i.e., the mass of ${}^{4}_Λ$H + 2n, in the missing-mass spectrum of the ${}^{6}$Li($π^{-}, K^{+}$)X reaction in the $2^{\circ}$ < $θ_{πK}$ < $20^{\circ}$ angular range. Furthermore, no event was found up to 2.8 MeV/$c^2$ above the bound threshold. We obtained the the double-differential cross section spectra of the ${}^{6}$Li($π^{-}, K^{+}$)X reaction in the angular range of $2^{\circ}$ < $θ_{πK}$ < $14^{\circ}$. An upper limit of 0.56 nb/sr (90% C.L.) was obtained for the production cross section of the ${}^{6}_Λ$H hypernucleus bound state. In addition, not only the bound state region, but also the $Λ$ continuum region and part of the $Σ^{-}$ quasi-free production region of the ${}^{6}$Li($π^{-}, K^{+}$)X reaction, were obtained with high statistics. The present missing-mass spectrum will facilitate the investigation of the $Σ^{-}$-nucleus optical potential for $Σ^{-}$-${}^{5}$He through spectrum shape analysis. △ Less

Submitted 3 March, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

Comments: 24 pages, 17 figures

Journal ref: Phys. Rev. C 96, 014005 (2017)

arXiv:1508.04563 [pdf]

doi 10.1080/14786435.2015.1095365

Phason space analysis and structure modeling of 100 A-scale dodecagonal quasicrystal in Mn-based alloy

Authors: Tsutomu Ishimasa, Shuhei Iwami, Norihito Sakaguchi, Ryo Oota, Marek Mihalkovic

Abstract: The dodecagonal quasicrystal classified into the five-dimensional space group P126/mmc, recently discovered in a Mn-Cr-Ni-Si alloy, has been analyzed using atomic-resolution spherical aberration-corrected electron microscopy, i.e. high-angle annular dark-field scanning transmission electron microscopy (HAADF-STEM) and conventional transmission electron microscopy. By observing along the 12-fold ax… ▽ More The dodecagonal quasicrystal classified into the five-dimensional space group P126/mmc, recently discovered in a Mn-Cr-Ni-Si alloy, has been analyzed using atomic-resolution spherical aberration-corrected electron microscopy, i.e. high-angle annular dark-field scanning transmission electron microscopy (HAADF-STEM) and conventional transmission electron microscopy. By observing along the 12-fold axis, non-periodic tiling consisting of an equilateral triangle and a square has been revealed, of which common edge length is a = 4.560 A. These tiles tend to form a network of dodecagons of which size is (2+Sqrt(3))a ~ 17 A in diameter. The tiling was interpreted as an aggregate of 100 A-scale oriented domains of high- and low-quality quasicrystals with small crystallites appearing at their boundaries. The quasicrystal domains exhibited a densely-filled circular acceptance region in the phason space. This is the first observation of the acceptance region in an actual dodecagonal quasicrystal. Atomic structure model consistent with the electron microscopy images is a standard Frank-Kasper decoration of the triangle and square tiles, that can be inferred from the crystal structures of Zr4Al3 and Cr3Si. Four kinds of layers located at z = 0, +-1/4 and 1/2 are stacked periodically along the 12-fold axis, and the atoms at z = 0 and 1/2 form hexagonal anti-prisms consistently with the 126-screw axis. The validity of this structure model was examined by means of powder X-ray diffraction. △ Less

Submitted 19 August, 2015; originally announced August 2015.

Comments: 21 pages, 3 tables, 16 figures

arXiv:1411.6708 [pdf, ps, other]

doi 10.1093/ptep/ptv002

Observation of the "$K^-pp$"-like structure in the $d(π^+, K^+)$ reaction at 1.69 GeV/$c$

Authors: Yudai Ichikawa, Tomofumi Nagae, Hiroyuki Fujioka, Hyoungchan Bhang, Stefania Bufalino, Hiroyuki Ekawa, Petr Evtoukhovitch, Alessandro Feliciello, Shoichi Hasegawa, Shuhei Hayakawa, Ryotaro Honda, Kenji Hosomi, Kenichi Imai, Shigeru Ishimoto, Changwoo Joo, Shunsuke Kanatsuki, Ryuta Kiuchi, Takeshi Koike, Harphool Kumawat, Yuki Matsumoto, Koji Miwa, Manabu Moritsu, Megumi Naruki, Masayuki Niiyama, Yuki Nozawa , et al. (19 additional authors not shown)

Abstract: We have observed a "$K^-pp$"-like structure in the $d(π^+,K^+)$ reaction at 1.69 GeV/$c$. In this reaction $Λ(1405)$ hyperon resonance is expected to be produced as a doorway to form the $K^-pp$ through the $Λ^*p\rightarrow K^-pp$ process. However, most of the produced $Λ(1405)$'s would escape from deuteron without secondary reactions. Therefore, coincidence of high-momentum ($>$ 250~MeV/$c$) prot… ▽ More We have observed a "$K^-pp$"-like structure in the $d(π^+,K^+)$ reaction at 1.69 GeV/$c$. In this reaction $Λ(1405)$ hyperon resonance is expected to be produced as a doorway to form the $K^-pp$ through the $Λ^*p\rightarrow K^-pp$ process. However, most of the produced $Λ(1405)$'s would escape from deuteron without secondary reactions. Therefore, coincidence of high-momentum ($>$ 250~MeV/$c$) proton(s) in large emission angles ($39^\circ<θ_{lab.}<122^\circ$) was requested to enhance the signal-to-background ratio. A broad enhancement in the proton coincidence spectra are observed around the missing-mass of 2.27 GeV/$c^2$, which corresponds to the $K^-pp$ binding energy of 95 $^{+18}_{-17}$ (stat.) $^{+30}_{-21}$ (syst.) MeV and the width of 162 $^{+87}_{-45}$ (stat.) $^{+66}_{-78}$ (syst.) MeV. △ Less

Submitted 24 November, 2014; originally announced November 2014.

Comments: 8 pages, 4 figures, submitted to PTEP

arXiv:1407.3051 [pdf, ps, other]

doi 10.1093/ptep/ptu128

Inclusive spectrum of the $d(π^+, K^+)$ reaction at 1.69 GeV/c

Authors: Yudai Ichikawa, Tomofumi Nagae, Hyoungchan Bhang, Stefania Bufalino, Hiroyuki Ekawa, Petr Evtoukhovitch, Alessandro Feliciello, Hiroyuki Fujioka, Shoichi Hasegawa, Shuhei Hayakawa, Ryotaro Honda, Kenji Hosomi, Ken'ichi Imai, Shigeru Ishimoto, Changwoo Joo, Shunsuke Kanatsuki, Ryuta Kiuchi, Takeshi Koike, Harphool Kumawat, Yuki Matsumoto, Koji Miwa, Manabu Moritsu, Megumi Naruki, Masayuki Niiyama, Yuki Nozawa , et al. (19 additional authors not shown)

Abstract: We have measured an inclusive missing-mass spectrum of the $d(π^+, K^+)$ reaction at the pion incident momentum of 1.69 GeV/$c$ at the laboratory scattering angles between 2$^\circ$ and 16$^\circ$ with the missing-mass resolution of 2.7 $\pm$ 0.1 MeV/$c^2$ (FWHM) at the missing mass of 2.27 GeV/$c^{2}$. In this Letter, we first try to understand the spectrum as a simple quasi-free picture based on… ▽ More We have measured an inclusive missing-mass spectrum of the $d(π^+, K^+)$ reaction at the pion incident momentum of 1.69 GeV/$c$ at the laboratory scattering angles between 2$^\circ$ and 16$^\circ$ with the missing-mass resolution of 2.7 $\pm$ 0.1 MeV/$c^2$ (FWHM) at the missing mass of 2.27 GeV/$c^{2}$. In this Letter, we first try to understand the spectrum as a simple quasi-free picture based on several known elementary cross sections, considering the neutron/proton Fermi motion in deuteron. While gross spectrum structures are well understood in this picture, we have observed two distinct deviations; one peculiar enhancement at 2.13 GeV/$c^2$ is due to the $ΣN$ cusp, and the other notable feature is a shift of a broad bump structure, mainly originating from hyperon resonance productions of $Λ(1405)$ and $Σ(1385)^{+/0}$, by about 22.4 $\pm$ 0.4 (stat.) $^{+2.7}_{-1.7}$ (syst.) MeV/$c^2$ toward the low-mass side, which is calculated in the kinematics of a proton at rest as the target. △ Less

Submitted 24 August, 2014; v1 submitted 11 July, 2014; originally announced July 2014.

Comments: 8 pages, 3 figures, submitted to PTEP

arXiv:1407.0669 [pdf, ps, other]

doi 10.1103/PhysRevC.90.035205

High-resolution search for the $Θ^{+}$ pentaquark via a pion-induced reaction at J-PARC

Authors: J-PARC E19 Collaboration, :, M. Moritsu, S. Adachi, M. Agnello, S. Ajimura, K. Aoki, H. C. Bhang, B. Bassalleck, E. Botta, S. Bufalino, N. Chiga, H. Ekawa, P. Evtoukhovitch, A. Feliciello, H. Fujioka, S. Hayakawa, F. Hiruma, R. Honda, K. Hosomi, Y. Ichikawa, M. Ieiri, Y. Igarashi, K. Imai, N. Ishibashi , et al. (51 additional authors not shown)

Abstract: The pentaquark $Θ^+$ has been searched for via the $π^-p \to K^-X$ reaction with beam momenta of 1.92 and 2.01 GeV/$c$ at J-PARC. A missing mass resolution of 2 MeV (FWHM) was achieved but no sharp peak structure was observed. The upper limits on the production cross section averaged over the scattering angle from 2$^{\circ}$ to 15$^{\circ}$ in the laboratory frame were found to be less than 0.28… ▽ More The pentaquark $Θ^+$ has been searched for via the $π^-p \to K^-X$ reaction with beam momenta of 1.92 and 2.01 GeV/$c$ at J-PARC. A missing mass resolution of 2 MeV (FWHM) was achieved but no sharp peak structure was observed. The upper limits on the production cross section averaged over the scattering angle from 2$^{\circ}$ to 15$^{\circ}$ in the laboratory frame were found to be less than 0.28 $μ$b/sr at the 90\% confidence level for both the 1.92- and 2.01-GeV/$c$ data. The systematic uncertainty of the upper limits was controlled within 10\%. Constraints on the $Θ^+$ decay width were also evaluated with a theoretical calculation using effective Lagrangian. The present result implies that the width should be less than 0.36 and 1.9 MeV for the spin-parity of $1/2^+$ and $1/2^-$, respectively. △ Less

Submitted 8 October, 2014; v1 submitted 2 July, 2014; originally announced July 2014.

Comments: 12 pages, 9 figures; published version

Journal ref: Phys. Rev. C 90, 035205 (2014)

arXiv:1310.6104 [pdf, ps, other]

doi 10.1016/j.physletb.2013.12.062

Search for $^6_Λ$H hypernucleus by the $^6$Li$(π^-,K^+)$ reaction at $p_{π^-}$ = 1.2 GeV/$c$

Authors: H. Sugimura, M. Agnello, J. K. Ahn, S. Ajimura, Y. Akazawa, N. Amano, K. Aoki, H. C. Bhang, N. Chiga, M. Endo, P. Evtoukhovitch, A. Feliciello, H. Fujioka, T. Fukuda, S. Hasegawa, S. Hayakawa, R. Honda, K. Hosomi, S. H. Hwang, Y. Ichikawa, Y. Igarashi, K. Imai, N. Ishibashi, R. Iwasaki, C. W. Joo , et al. (41 additional authors not shown)

Abstract: We have carried out an experiment to search for a neutron-rich hypernucleus, $^6_Λ$H, by the $^6$Li($π^-,K^+$) reaction at $p_{π^-}$ =1.2 GeV/$c$. The obtained missing mass spectrum with an estimated energy resolution of 3.2 MeV (FWHM) showed no peak structure corresponding to the $^6_Λ$H hypernucleus neither below nor above the $^4_Λ$H$+2n$ particle decay threshold. An upper limit of the producti… ▽ More We have carried out an experiment to search for a neutron-rich hypernucleus, $^6_Λ$H, by the $^6$Li($π^-,K^+$) reaction at $p_{π^-}$ =1.2 GeV/$c$. The obtained missing mass spectrum with an estimated energy resolution of 3.2 MeV (FWHM) showed no peak structure corresponding to the $^6_Λ$H hypernucleus neither below nor above the $^4_Λ$H$+2n$ particle decay threshold. An upper limit of the production cross section for the bound $^6_Λ$H hypernucleus was estimated to be 1.2 nb/sr at 90% confidence level. △ Less

Submitted 5 February, 2014; v1 submitted 22 October, 2013; originally announced October 2013.

Comments: 6 pages, 5 figures, published version

Journal ref: Phys. Lett. B. 729 (2014) 39

arXiv:0812.5005 [pdf]

Measuring Fit of Sequence Data to Phylogenetic Model: Gain of Power using Marginal Tests

Authors: Peter J. Waddell, Rissa Ota, David Penny

Abstract: Testing fit of data to model is fundamentally important to any science, but publications in the field of phylogenetics rarely do this. Such analyses discard fundamental aspects of science as prescribed by Karl Popper. Indeed, not without cause, Popper (1978) once argued that evolutionary biology was unscientific as its hypotheses were untestable. Here we trace developments in assessing fit from… ▽ More Testing fit of data to model is fundamentally important to any science, but publications in the field of phylogenetics rarely do this. Such analyses discard fundamental aspects of science as prescribed by Karl Popper. Indeed, not without cause, Popper (1978) once argued that evolutionary biology was unscientific as its hypotheses were untestable. Here we trace developments in assessing fit from Penny et al. (1982) to the present. We compare the general log-likelihood ratio (the G or G2 statistic) statistic between the evolutionary tree model and the multinomial model with that of marginalized tests applied to an alignment (using placental mammal coding sequence data). It is seen that the most general test does not reject the fit of data to model (p~0.5), but the marginalized tests do. Tests on pair-wise frequency (F) matrices, strongly (p < 0.001) reject the most general phylogenetic (GTR) models commonly in use. It is also clear (p < 0.01) that the sequences are not stationary in their nucleotide composition. Deviations from stationarity and homogeneity seem to be unevenly distributed amongst taxa; not necessarily those expected from examining other regions of the genome. By marginalizing the 4t patterns of the i.i.d. model to observed and expected parsimony counts, that is, from constant sites, to singletons, to parsimony informative characters of a minimum possible length, then the likelihood ratio test regains power, and it too rejects the evolutionary model with p << 0.001. Given such behavior over relatively recent evolutionary time, readers in general should maintain a healthy skepticism of results, as the scale of the systematic errors in published analyses may really be far larger than the analytical methods (e.g., bootstrap) report. △ Less

Submitted 30 December, 2008; originally announced December 2008.

arXiv:0806.1471 [pdf, ps, other]

doi 10.1088/1742-6596/120/4/042014

Status report of the Tokyo axion helioscope experiment

Authors: Y. Inoue, M. Minowa, Y. Akimoto, R. Ota, T. Mizumoto, A. Yamamoto

Abstract: We have searched for solar axions with a detector which consists of a 4T x 2.3m superconducting magnet, PIN-photodiode X-ray detectors, and an altazimuth mount to track the sun. The conversion region is filled with cold helium gas which modifies the axion mass at which coherent conversion occurs. In the past measurements, axion mass from 0 to 0.27eV have been scanned. Since no positive evidence… ▽ More We have searched for solar axions with a detector which consists of a 4T x 2.3m superconducting magnet, PIN-photodiode X-ray detectors, and an altazimuth mount to track the sun. The conversion region is filled with cold helium gas which modifies the axion mass at which coherent conversion occurs. In the past measurements, axion mass from 0 to 0.27eV have been scanned. Since no positive evidence was seen, an upper limit to the axion-photon coupling constant was set to be g < 6-10E-10/GeV (95%CL) depending on the axion masses. We are now actively preparing for a new stage of the experiment aiming at one to a few eV solar axions. In this mass region, our detector might be able to check parameter regions which are preferable to the axion models. △ Less

Submitted 9 June, 2008; originally announced June 2008.

Comments: 3 pages, 3 figures, submitted to be included in the proceedings of TAUP 2007

Report number: RESCEU-54/08

Journal ref: J.Phys.Conf.Ser.120:042014,2008

Showing 1–33 of 33 results for author: Ota, R