-
USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$onversations
Authors:
Mounika Marreddy,
Subba Reddy Oota,
Venkata Charan Chinni,
Manish Gupta,
Lucie Flek
Abstract:
Identifying user's opinions and stances in long conversation threads on various topics can be extremely critical for enhanced personalization, market research, political campaigns, customer service, conflict resolution, targeted advertising, and content moderation. Hence, training language models to automate this task is critical. However, to train such models, gathering manual annotations has multiple challenges: 1) It is time-consuming and costly; 2) Conversation threads could be very long, increasing chances of noisy annotations; and 3) Interpreting instances where a user changes their opinion within a conversation is difficult because often such transitions are subtle and not expressed explicitly. Inspired by the recent success of large language models (LLMs) for complex natural language processing (NLP) tasks, we leverage Mistral Large and GPT-4 to automate the human annotation process on the following two tasks while also providing reasoning: i) User Stance classification, which involves labeling a user's stance of a post in a conversation on a five-point scale; ii) User Dogmatism classification, which deals with labeling a user's overall opinion in the conversation on a four-point scale. The majority voting on zero-shot, one-shot, and few-shot annotations from these two LLMs on 764 multi-user Reddit conversations helps us curate the USDC dataset. USDC is then used to finetune and instruction-tune multiple deployable small language models for the 5-class stance and 4-class dogmatism classification tasks. We make the code and dataset publicly available [https://anonymous.4open.science/r/USDC-0F7F].
Submitted 24 June, 2024;
originally announced June 2024.
-
Voltage-insensitive stochastic magnetic tunnel junctions with double free layers
Authors:
Rikuto Ota,
Keito Kobayashi,
Keisuke Hayakawa,
Shun Kanai,
Kerem Y. Çamsarı,
Hideo Ohno,
Shunsuke Fukami
Abstract:
A stochastic magnetic tunnel junction (s-MTJ) is a promising component of the probabilistic bit (p-bit), which plays a pivotal role in probabilistic computers. For a standard cell structure of the p-bit, the s-MTJ should be insensitive to the voltage across the junction over several hundred millivolts. In conventional s-MTJs with a reference layer having a fixed magnetization direction, however, the stochastic output varies significantly with the voltage due to spin-transfer torque (STT) acting on the stochastic free layer. In this work, we study an s-MTJ with a "double-free-layer" design theoretically proposed earlier, in which the fixed reference layer of the conventional structure is replaced by another stochastic free layer, effectively mitigating the influence of STT on the stochastic output. We show that the key device property, characterized by the ratio of relaxation times between the high- and low-resistance states, is one to two orders of magnitude less sensitive to bias voltage variations than in conventional s-MTJs when the top and bottom free layers are designed to possess the same effective thickness. This work opens a pathway for reliable, nanosecond-operation, high-output, and scalable spintronics-based p-bits.
Submitted 31 May, 2024;
originally announced May 2024.
-
Concordance of Morse functions on manifolds
Authors:
Ryosuke Ota
Abstract:
In this paper, the concordance of Morse functions is defined, and a necessary and sufficient condition for given two Morse functions to be concordant is presented and is compared with the cobordism criterion. Cobordism of Morse functions on smooth closed manifolds is an equivalence relation defined by using cobordisms of manifolds and fold maps. Given two Morse functions, it is important to decide whether they are cobordant or not, and this problem was first solved for surfaces and then for manifolds of general dimensions by Ikegami-Saeki, Kalmár, and Ikegami. On the other hand, for Morse functions on the same manifold, we can consider a stronger equivalence relation called concordance.
Submitted 17 December, 2023;
originally announced December 2023.
-
Double-Free-Layer Stochastic Magnetic Tunnel Junctions with Synthetic Antiferromagnets
Authors:
Kemal Selcuk,
Shun Kanai,
Rikuto Ota,
Hideo Ohno,
Shunsuke Fukami,
Kerem Y. Camsari
Abstract:
Stochastic magnetic tunnel junctions (sMTJ) using low-barrier nanomagnets have shown promise as fast, energy-efficient, and scalable building blocks for probabilistic computing. Despite recent experimental and theoretical progress, sMTJs exhibiting the ideal characteristics necessary for probabilistic bits (p-bit) are still lacking. Ideally, the sMTJs should have (a) voltage bias independence preventing read disturbance (b) uniform randomness in the magnetization angle between the free layers, and (c) fast fluctuations without requiring external magnetic fields while being robust to magnetic field perturbations. Here, we propose a new design satisfying all of these requirements, using double-free-layer sMTJs with synthetic antiferromagnets (SAF). We evaluate the proposed sMTJ design with experimentally benchmarked spin-circuit models accounting for transport physics, coupled with the stochastic Landau-Lifshitz-Gilbert equation for magnetization dynamics. We find that the use of low-barrier SAF layers reduces dipolar coupling, achieving uncorrelated fluctuations at zero-magnetic field surviving up to diameters exceeding ($D\approx 100$ nm) if the nanomagnets can be made thin enough ($\approx 1$-$2$ nm). The double-free-layer structure retains bias-independence and the circular nature of the nanomagnets provides near-uniform randomness with fast fluctuations. Combining our full sMTJ model with advanced transistor models, we estimate the energy to generate a random bit as $\approx$ 3.6 fJ, with fluctuation rates of $\approx$ 3.3 GHz per p-bit. Our results will guide the experimental development of superior stochastic magnetic tunnel junctions for large-scale and energy-efficient probabilistic computation for problems relevant to machine learning and artificial intelligence.
Submitted 30 March, 2024; v1 submitted 11 November, 2023;
originally announced November 2023.
-
Speech language models lack important brain-relevant semantics
Authors:
Subba Reddy Oota,
Emin Çelik,
Fatma Deniz,
Mariya Toneva
Abstract:
Despite known differences between reading and listening in the brain, recent work has shown that text-based language models predict both text-evoked and speech-evoked brain activity to an impressive degree. This poses the question of what types of information language models truly predict in the brain. We investigate this question via a direct approach, in which we systematically remove specific low-level stimulus features (textual, speech, and visual) from language model representations to assess their impact on alignment with fMRI brain recordings during reading and listening. Comparing these findings with speech-based language models reveals starkly different effects of low-level features on brain alignment. While text-based models show reduced alignment in early sensory regions post-removal, they retain significant predictive power in late language regions. In contrast, speech-based models maintain strong alignment in early auditory regions even after feature removal but lose all predictive power in late language regions. These results suggest that speech-based models provide insights into additional information processed by early auditory regions, but caution is needed when using them to model processing in late language regions. We make our code publicly available. [https://github.com/subbareddy248/speech-llm-brain]
Submitted 16 June, 2024; v1 submitted 8 November, 2023;
originally announced November 2023.
-
Emphasizing Cherenkov photons from bismuth germanate by single photon deconvolution
Authors:
Ryosuke Ota,
Kibo Ote
Abstract:
Bismuth germanate (BGO) has been receiving attention again because it is a potential scintillator for future time-of-flight positron emission tomography. Owing to its optical properties, BGO emits a relatively large number of Cherenkov photons after 511 keV gamma-ray interactions, which can improve the timing resolution of a detector. Nonetheless, efficiently detecting Cherenkov photons among scintillation photons is similar to looking for a needle in a haystack. Thus, we propose a method that can efficiently emphasize Cherenkov photons in a detector waveform by deconvolving the single-photon response of the photodetector. As a proof of concept, we perform the deconvolution and obtain a probability density function (PDF) of bismuth germanate, which is compared with that from a conventional time-correlated single photon counting method. Furthermore, we investigate whether the proposed deconvolution can emphasize faint Cherenkov photons. The PDF obtained by the proposed deconvolution shows good agreement with that obtained using the conventional method. The coincidence time resolution obtained using the proposed deconvolution is improved by 43% in full width at half maximum compared to a voltage-based leading edge discriminator. It can be concluded that the proposed deconvolution method can efficiently emphasize Cherenkov photons and improve the timing performance of BGO-based detectors.
Submitted 7 November, 2023;
originally announced November 2023.
-
Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)
Authors:
Subba Reddy Oota,
Zijiao Chen,
Manish Gupta,
Raju S. Bapi,
Gael Jobard,
Frederic Alexandre,
Xavier Hinaut
Abstract:
Can we obtain insights about the brain using AI models? How is the information in deep learning models related to brain recordings? Can we improve AI models with the help of brain recordings? Such questions can be tackled by studying brain recordings like functional magnetic resonance imaging (fMRI). As a first step, the neuroscience community has contributed several large cognitive neuroscience datasets related to passive reading/listening/viewing of concept words, narratives, pictures, and movies. Encoding and decoding models using these datasets have also been proposed in the past two decades. These models serve as additional tools for basic cognitive science and neuroscience research. Encoding models aim at generating fMRI brain representations given a stimulus automatically. They have several practical applications in evaluating and diagnosing neurological conditions and thus may also help design therapies for brain damage. Decoding models solve the inverse problem of reconstructing the stimuli given the fMRI. They are useful for designing brain-machine or brain-computer interfaces. Inspired by the effectiveness of deep learning models for natural language processing, computer vision, and speech, several neural encoding and decoding models have been recently proposed. In this survey, we will first discuss popular representations of language, vision and speech stimuli, and present a summary of neuroscience datasets. Further, we will review popular deep learning based encoding and decoding architectures and note their benefits and limitations. Finally, we will conclude with a summary and discussion about future trends. Given the large amount of recently published work in the computational cognitive neuroscience (CCN) community, we believe that this survey enables an entry point for DNN researchers to diversify into CCN research.
Submitted 8 July, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
On Robustness of Finetuned Transformer-based NLP Models
Authors:
Pavan Kalyan Reddy Neerudu,
Subba Reddy Oota,
Mounika Marreddy,
Venkateswara Rao Kagita,
Manish Gupta
Abstract:
Transformer-based pretrained models like BERT, GPT-2 and T5 have been finetuned for a large number of natural language processing (NLP) tasks, and have been shown to be very effective. However, while finetuning, what changes across layers in these models with respect to pretrained checkpoints is under-studied. Further, how robust are these models to perturbations in input text? Does the robustness vary depending on the NLP task for which the models have been finetuned? While there exists some work on studying the robustness of BERT finetuned for a few NLP tasks, there is no rigorous study that compares this robustness across encoder only, decoder only and encoder-decoder models. In this paper, we characterize changes between pretrained and finetuned language model representations across layers using two metrics: CKA and STIR. Further, we study the robustness of three language models (BERT, GPT-2 and T5) with eight different text perturbations on classification tasks from the General Language Understanding Evaluation (GLUE) benchmark, and generation tasks like summarization, free-form generation and question generation. GPT-2 representations are more robust than BERT and T5 across multiple types of input perturbation. Although models exhibit good robustness broadly, dropping nouns, verbs or changing characters are the most impactful. Overall, this study provides valuable insights into perturbation-specific weaknesses of popular Transformer-based models, which should be kept in mind when passing inputs. We make the code and models publicly available [https://github.com/PavanNeerudu/Robustness-of-Transformers-models].
Submitted 8 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Fabrication of a 64-Pixel TES Microcalorimeter Array with Iron Absorbers Uniquely Designed for 14.4-keV Solar Axion Search
Authors:
Yuta Yagi,
Tasuku Hayashi,
Keita Tanaka,
Rikuta Miyagawa,
Ryo Ota,
Noriko Y. Yamasaki,
Kazuhisa Mitsuda,
Nao Yoshida,
Mikiko Saito,
Takayuki Homma
Abstract:
If a hypothetical elementary particle called an axion exists, to solve the strong CP problem, a 57Fe nucleus in the solar core could emit a 14.4-keV monochromatic axion through the M1 transition. If such axions are once more transformed into photons by a 57Fe absorber, a transition edge sensor (TES) X-ray microcalorimeter should be able to detect them efficiently. We have designed and fabricated a dedicated 64-pixel TES array with iron absorbers for the solar axion search. In order to decrease the effect of iron magnetization on spectroscopic performance, the iron absorber is placed next to the TES while maintaining a certain distance. A gold thermal transfer strap connects them. We have accomplished the electroplating of gold straps with high thermal conductivity. The residual resistivity ratio (RRR) was over 23, more than eight times higher than a previous evaporated strap. In addition, we successfully electroplated pure-iron films of more than a few micrometers in thickness for absorbers and a fabricated 64-pixel TES calorimeter structure.
Submitted 19 April, 2023;
originally announced April 2023.
-
Syntactic Structure Processing in the Brain while Listening
Authors:
Subba Reddy Oota,
Mounika Marreddy,
Manish Gupta,
Bapi Raju Surampud
Abstract:
Syntactic parsing is the task of assigning a syntactic structure to a sentence. There are two popular syntactic parsing methods: constituency and dependency parsing. Recent works have used syntactic embeddings based on constituency trees, incremental top-down parsing, and other word syntactic features for brain activity prediction given the text stimuli to study how the syntax structure is represented in the brain's language network. However, the effectiveness of dependency parse trees or the relative predictive power of the various syntax parsers across brain areas, especially for the listening task, is yet unexplored. In this study, we investigate the predictive power of the brain encoding models in three settings: (i) individual performance of the constituency and dependency syntactic parsing based embedding methods, (ii) efficacy of these syntactic parsing based embedding methods when controlling for basic syntactic signals, (iii) relative effectiveness of each of the syntactic embedding methods when controlling for the other. Further, we explore the relative importance of syntactic information (from these syntactic embedding methods) versus semantic information using BERT embeddings. We find that constituency parsers help explain activations in the temporal lobe and middle-frontal gyrus, while dependency parsers better encode syntactic structure in the angular gyrus and posterior cingulate cortex. Although semantic signals from BERT are more effective compared to any of the syntactic features or embedding methods, syntactic embedding methods explain additional variance for a few brain regions.
Submitted 16 February, 2023;
originally announced February 2023.
-
GAE-ISumm: Unsupervised Graph-Based Summarization of Indian Languages
Authors:
Lakshmi Sireesha Vakada,
Anudeep Ch,
Mounika Marreddy,
Subba Reddy Oota,
Radhika Mamidi
Abstract:
Document summarization aims to create a precise and coherent summary of a text document. Many deep learning summarization models are developed mainly for English, often requiring a large training corpus and efficient pre-trained language models and tools. However, English summarization models for low-resource Indian languages are often limited by rich morphological variation, syntax, and semantic differences. In this paper, we propose GAE-ISumm, an unsupervised Indic summarization model that extracts summaries from text documents. In particular, our proposed model, GAE-ISumm uses Graph Autoencoder (GAE) to learn text representations and a document summary jointly. We also provide a manually-annotated Telugu summarization dataset TELSUM, to experiment with our model GAE-ISumm. Further, we experiment with the most publicly available Indian language summarization datasets to investigate the effectiveness of GAE-ISumm on other Indian languages. Our experiments of GAE-ISumm in seven languages make the following observations: (i) it is competitive or better than state-of-the-art results on all datasets, (ii) it reports benchmark results on TELSUM, and (iii) the inclusion of positional and cluster information in the proposed model improved the performance of summaries.
Submitted 25 December, 2022;
originally announced December 2022.
-
Joint processing of linguistic properties in brains and language models
Authors:
Subba Reddy Oota,
Manish Gupta,
Mariya Toneva
Abstract:
Language models have been shown to be very effective in predicting brain recordings of subjects experiencing complex language stimuli. For a deeper understanding of this alignment, it is important to understand the correspondence between the detailed processing of linguistic information by the human brain versus language models. We investigate this correspondence via a direct approach, in which we eliminate information related to specific linguistic properties in the language model representations and observe how this intervention affects the alignment with fMRI brain recordings obtained while participants listened to a story. We investigate a range of linguistic properties (surface, syntactic, and semantic) and find that the elimination of each one results in a significant decrease in brain alignment. Specifically, we find that syntactic properties (i.e. Top Constituents and Tree Depth) have the largest effect on the trend of brain alignment across model layers. These findings provide clear evidence for the role of specific linguistic information in the alignment between brain and language models, and open new avenues for mapping the joint information processing in both systems. We make the code publicly available [https://github.com/subbareddy248/linguistic-properties-brain-alignment].
Submitted 8 November, 2023; v1 submitted 15 December, 2022;
originally announced December 2022.
-
Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?
Authors:
Subba Reddy Oota,
Jashn Arora,
Veeral Agarwal,
Mounika Marreddy,
Manish Gupta,
Bapi Raju Surampudi
Abstract:
Several popular Transformer based language models have been found to be successful for text-driven brain encoding. However, existing literature leverages only pretrained text Transformer models and has not explored the efficacy of task-specific learned Transformer representations. In this work, we explore transfer learning from representations learned for ten popular natural language processing tasks (two syntactic and eight semantic) for predicting brain responses from two diverse datasets: Pereira (subjects reading sentences from paragraphs) and Narratives (subjects listening to the spoken stories). Encoding models based on task features are used to predict activity in different regions across the whole brain. Features from coreference resolution, NER, and shallow syntax parsing explain greater variance for the reading activity. On the other hand, for the listening activity, tasks such as paraphrase generation, summarization, and natural language inference show better encoding performance. Experiments across all 10 task representations provide the following cognitive insights: (i) language left hemisphere has higher predictive brain activity versus language right hemisphere, (ii) posterior medial cortex, temporo-parieto-occipital junction, dorsal frontal lobe have higher correlation versus early auditory and auditory association cortex, (iii) syntactic and semantic tasks display a good predictive performance across brain regions for reading and listening stimuli resp.
Submitted 3 May, 2022;
originally announced May 2022.
-
Multi-Task Text Classification using Graph Convolutional Networks for Large-Scale Low Resource Language
Authors:
Mounika Marreddy,
Subba Reddy Oota,
Lakshmi Sireesha Vakada,
Venkata Charan Chinni,
Radhika Mamidi
Abstract:
Graph Convolutional Networks (GCN) have achieved state-of-the-art results on single text classification tasks like sentiment analysis, emotion detection, etc. However, the performance is achieved by testing and reporting on resource-rich languages like English. Applying GCN for multi-task text classification is an unexplored area. Moreover, training a GCN or adopting an English GCN for Indian languages is often limited by data availability, rich morphological variation, syntax, and semantic differences. In this paper, we study the use of GCN for the Telugu language in single and multi-task settings for four natural language processing (NLP) tasks, viz. sentiment analysis (SA), emotion identification (EI), hate-speech (HS), and sarcasm detection (SAR). In order to evaluate the performance of GCN with one of the Indian languages, Telugu, we analyze the GCN based models with extensive experiments on four downstream tasks. In addition, we created an annotated Telugu dataset, TEL-NLP, for the four NLP tasks. Further, we propose a supervised graph reconstruction method for Telugu, Multi-Task Text GCN (MT-Text GCN), that simultaneously (i) learns the low-dimensional word and sentence graph embeddings from word-sentence graph reconstruction using graph autoencoder (GAE) and (ii) performs multi-task text classification using these latent sentence graph embeddings. We argue that our proposed MT-Text GCN achieves significant improvements on TEL-NLP over existing Telugu pretrained word embeddings, and multilingual pretrained Transformer models: mBERT, and XLM-R. On TEL-NLP, we achieve a high F1-score for four NLP tasks: SA (0.84), EI (0.55), HS (0.83) and SAR (0.66). Finally, we show our model's quantitative and qualitative analysis on the four NLP tasks in Telugu.
Submitted 2 May, 2022;
originally announced May 2022.
-
Cross-view Brain Decoding
Authors:
Subba Reddy Oota,
Jashn Arora,
Manish Gupta,
Raju S. Bapi
Abstract:
How the brain captures the meaning of linguistic stimuli across multiple views is still a critical open question in neuroscience. Consider three different views of the concept apartment: (1) picture (WP) presented with the target word label, (2) sentence (S) using the target word, and (3) word cloud (WC) containing the target word along with other semantically related words. Unlike previous efforts, which focus only on single view analysis, in this paper, we study the effectiveness of brain decoding in a zero-shot cross-view learning setup. Further, we propose brain decoding in the novel context of cross-view-translation tasks like image captioning (IC), image tagging (IT), keyword extraction (KE), and sentence formation (SF). Using extensive experiments, we demonstrate that cross-view zero-shot brain decoding is practical leading to ~0.68 average pairwise accuracy across view pairs. Also, the decoded representations are sufficiently detailed to enable high accuracy for cross-view-translation tasks with following pairwise accuracy: IC (78.0), IT (83.0), KE (83.7) and SF (74.5). Analysis of the contribution of different brain networks reveals exciting cognitive insights: (1) A high percentage of visual voxels are involved in image captioning and image tagging tasks, and a high percentage of language voxels are involved in the sentence formation and keyword extraction tasks. (2) Zero-shot accuracy of the model trained on S view and tested on WC view is better than same-view accuracy of the model trained and tested on WC view.
Submitted 18 April, 2022;
originally announced April 2022.
-
Visio-Linguistic Brain Encoding
Authors:
Subba Reddy Oota,
Jashn Arora,
Vijay Rowtula,
Manish Gupta,
Raju S. Bapi
Abstract:
Enabling effective brain-computer interfaces requires understanding how the human brain encodes stimuli across modalities such as visual, language (or text), etc. Brain encoding aims at constructing fMRI brain activity given a stimulus. There exists a plethora of neural encoding models which study brain encoding for single mode stimuli: visual (pretrained CNNs) or text (pretrained language models). Few recent papers have also obtained separate visual and text representation models and performed late-fusion using simple heuristics. However, previous work has failed to explore: (a) the effectiveness of image Transformer models for encoding visual stimuli, and (b) co-attentive multi-modal modeling for visual and text reasoning. In this paper, we systematically explore the efficacy of image Transformers (ViT, DEiT, and BEiT) and multi-modal Transformers (VisualBERT, LXMERT, and CLIP) for brain encoding. Extensive experiments on two popular datasets, BOLD5000 and Pereira, provide the following insights. (1) To the best of our knowledge, we are the first to investigate the effectiveness of image and multi-modal Transformers for brain encoding. (2) We find that VisualBERT, a multi-modal Transformer, significantly outperforms previously proposed single-mode CNNs, image Transformers as well as other previously proposed multi-modal models, thereby establishing new state-of-the-art. The supremacy of visio-linguistic models raises the question of whether the responses elicited in the visual regions are affected implicitly by linguistic processing even when passively viewing images. Future fMRI tasks can verify this computational insight in an appropriate experimental setting.
△ Less
Submitted 18 April, 2022;
originally announced April 2022.
-
Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization
Authors:
Ryosuke Ota,
Reiya Hagiwara,
Naoki Hamada,
Likun Liu,
Takahiro Yamamoto,
Daisuke Sakurai
Abstract:
In multi-objective optimization, designing good benchmark problems is an important issue for improving solvers.
Controlling the global location of Pareto optima in existing benchmark problems has been problematic, and it is even more difficult when the design space is high-dimensional since visualization is extremely challenging.
As a benchmark with explicit local Pareto fronts, we introduce benchmarking based on basin connectivity (3BC), which uses basins of attraction.
The 3BC allows for the specification of a multimodal landscape through a kind of topological analysis called the basin graph, effectively generating optimization problems from this graph.
Various known indicators measure the performance of a solver in searching global Pareto optima, but 3BC allows these indicators to be localized to each local Pareto front by restricting them to its basin.
3BC's mathematical formulation ensures the accurate representation of the specified optimization landscape, guaranteeing the existence of intended local and global Pareto optima.
Submitted 9 February, 2024; v1 submitted 7 October, 2021;
originally announced October 2021.
-
Anatomical-Guided Attention Enhances Unsupervised PET Image Denoising Performance
Authors:
Yuya Onishi,
Fumio Hashimoto,
Kibo Ote,
Hiroyuki Ohba,
Ryosuke Ota,
Etsuji Yoshikawa,
Yasuomi Ouchi
Abstract:
Although supervised convolutional neural networks (CNNs) often outperform conventional alternatives for denoising positron emission tomography (PET) images, they require many low- and high-quality reference PET image pairs. Herein, we propose an unsupervised 3D PET image denoising method based on an anatomical information-guided attention mechanism. The proposed magnetic resonance-guided deep decoder (MR-GDD) utilizes the spatial details and semantic features of MR-guidance image more effectively by introducing encoder-decoder and deep decoder subnetworks. Moreover, the specific shapes and patterns of the guidance image do not affect the denoised PET image, because the guidance image is input to the network through an attention gate. In a Monte Carlo simulation of [$^{18}$F]fluoro-2-deoxy-D-glucose (FDG), the proposed method achieved the highest peak signal-to-noise ratio and structural similarity (27.92 $\pm$ 0.44 dB/0.886 $\pm$ 0.007), as compared with Gaussian filtering (26.68 $\pm$ 0.10 dB/0.807 $\pm$ 0.004), image guided filtering (27.40 $\pm$ 0.11 dB/0.849 $\pm$ 0.003), deep image prior (DIP) (24.22 $\pm$ 0.43 dB/0.737 $\pm$ 0.017), and MR-DIP (27.65 $\pm$ 0.42 dB/0.879 $\pm$ 0.007). Furthermore, we experimentally visualized the behavior of the optimization process, which is often unknown in unsupervised CNN-based restoration problems. For preclinical (using [$^{18}$F]FDG and [$^{11}$C]raclopride) and clinical (using [$^{18}$F]florbetapir) studies, the proposed method demonstrates state-of-the-art denoising performance while retaining spatial resolution and quantitative accuracy, despite using a common network architecture for various noisy PET images with 1/10th of the full counts. These results suggest that the proposed MR-GDD can reduce PET scan times and PET tracer doses considerably without impacting patients.
Submitted 7 September, 2021; v1 submitted 2 September, 2021;
originally announced September 2021.
-
Direct positron emission imaging: ultra-fast timing enables reconstruction-free imaging
Authors:
Ryosuke Ota,
Sun Il Kwon,
Eric Berg,
Fumio Hashimoto,
Kyohei Nakajima,
Izumi Ogawa,
Yoichi Tamagawa,
Tomohide Omura,
Tomoyuki Hasegawa,
Simon R. Cherry
Abstract:
Positron emission tomography, like many other tomographic imaging modalities, relies on an image reconstruction step to produce cross-sectional images from projection data. Detection and localization of the back-to-back annihilation photons produced by positron-electron annihilation define the trajectories of these photons, which, when combined with tomographic reconstruction algorithms, permit recovery of the distribution of positron-emitting radionuclides. Here we produce cross-sectional images directly from the detected coincident annihilation photons, without using a reconstruction algorithm. Ultra-fast radiation detectors with a resolving time averaging 32 picoseconds measured the difference in arrival time of pairs of annihilation photons, localizing the annihilation site to 4.8 mm. This is sufficient to directly generate an image without reconstruction and without the geometric and sampling constraints that are normally present for tomographic imaging systems.
Submitted 12 May, 2021;
originally announced May 2021.
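The two figures quoted in the abstract above (32 picoseconds and 4.8 mm) are consistent with the usual time-of-flight relation Δx = c·Δt/2. The sketch below is only an illustrative check, not the authors' analysis: the function name is hypothetical, and it assumes the quoted resolving time is the quantity that enters this relation directly.

```python
# Speed of light in vacuum, in metres per second (exact SI value).
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0


def tof_position_uncertainty_mm(timing_resolution_ps: float) -> float:
    """Convert a coincidence timing resolution (ps) to a position uncertainty (mm).

    Uses dx = c * dt / 2: a difference dt in the arrival times of the two
    back-to-back photons shifts the estimated annihilation point by c * dt / 2
    along the line joining the two detectors.
    """
    dt_seconds = timing_resolution_ps * 1e-12
    dx_metres = SPEED_OF_LIGHT_M_PER_S * dt_seconds / 2.0
    return dx_metres * 1e3  # metres -> millimetres


if __name__ == "__main__":
    # Hypothetical usage with the resolving time quoted in the abstract.
    print(f"{tof_position_uncertainty_mm(32.0):.1f} mm")
```

The factor of one half appears because moving the annihilation point by Δx toward one detector shortens one photon path and lengthens the other by the same amount.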
-
Wound and episode level readmission risk or weeks to readmit: Why do patients get readmitted? How long does it take for a patient to get readmitted?
Authors:
Subba Reddy Oota,
Nafisur Rahman,
Shahid Saleem Mohammed,
Jeffrey Galitz,
Ming Liu
Abstract:
The Affordable Care Act of 2010 introduced the Readmission Reduction Program in 2012 to reduce avoidable re-admissions and control rising healthcare costs. Wound care impacts 15% of Medicare beneficiaries, making it one of the major contributors to Medicare health care cost. Health plans have been exploring proactive health care services that can focus on preventing wound recurrences and re-admissions to control wound care costs. With the rising costs of the wound care industry, it has become of paramount importance to reduce wound recurrences and patient re-admissions. What factors are responsible for a wound recurring, ultimately leading to hospitalization or re-admission? Is there a way to identify the patients at risk of re-admission before the occurrence using data-driven analysis? Patient re-admission risk management has become critical for patients suffering from chronic wounds such as diabetic ulcers, pressure ulcers, and vascular ulcers. Understanding the risk and the factors that cause patient readmission can help care providers and patients avoid wound recurrences. Our work focuses on identifying patients who are at high risk of re-admission and determining the time period within which a patient might get re-admitted. Frequent re-admissions add financial stress to the patient and the health plan and deteriorate the quality of life of the patient. Having this information can allow a provider to set up preventive measures that can delay, if not prevent, patients' re-admission. On a combined wound and episode-level data set of patients' wound care information, our extended autoprognosis achieves a recall of 92% and a precision of 92% for predicting a patient's re-admission risk. For the new patient class, precision and recall are as high as 91% and 98%, respectively. We are also able to predict, through our model, the time from a patient's discharge event to a re-admission event with an MAE of 2.3 weeks.
Submitted 5 October, 2020;
originally announced October 2020.
-
Expert2Coder: Capturing Divergent Brain Regions Using Mixture of Regression Experts
Authors:
Subba Reddy Oota,
Naresh Manwani,
Raju S. Bapi
Abstract:
fMRI semantic category understanding using linguistic encoding models attempts to learn a forward mapping that relates stimuli to the corresponding brain activation. State-of-the-art encoding models use a single global model (linear or non-linear) to predict brain activation given the stimulus. However, the critical assumption in these methods is that a priori different brain regions respond the same way to all the stimuli, that is, there is no modularity or specialization assumed for any region. This goes against the modularity theory, supported by many cognitive neuroscience investigations suggesting that there are functionally specialized regions in the brain. In this paper, we address this by clustering similar regions together, and for every cluster we learn a different linear regression model using a mixture of linear experts model. The key idea here is that each linear expert captures the behaviour of similar brain regions. Given a new stimulus, the utility of the proposed model is twofold: (i) it predicts the brain activation as a weighted linear combination of the activations of multiple linear experts, and (ii) it learns multiple experts corresponding to different brain regions. We argue that each expert captures activity patterns related to a particular region of interest (ROI) in the human brain. This study helps in understanding the brain regions that are activated together given different kinds of stimuli. Importantly, we suggest that the mixture of regression experts (MoRE) framework successfully combines the two principles of organization of function in the brain, namely that of specialization and integration. Experiments on fMRI data from paradigm 1 [1], where participants view linguistic stimuli, show that the proposed MoRE model has better prediction accuracy compared to that of conventional models.
Submitted 29 May, 2020; v1 submitted 26 September, 2019;
originally announced September 2019.
-
Affect in Tweets Using Experts Model
Authors:
Subba Reddy Oota,
Adithya Avvaru,
Mounika Marreddy,
Radhika Mamidi
Abstract:
Estimating the intensity of emotion has gained significance as modern textual inputs in potential applications like social media, e-retail markets, psychology, advertisements etc., carry many emotions, feelings, and expressions along with their meaning. However, traditional sentiment analysis approaches primarily focus on classifying the sentiment in general (positive or negative) or at an aspect level (very positive, low negative, etc.) and cannot exploit the intensity information. Moreover, automatically identifying emotions like anger, fear, joy, sadness, disgust etc., from text introduces challenging scenarios where a single tweet may contain multiple emotions with different intensities and some emotions may even co-occur in some of the tweets. In this paper, we propose an architecture, Experts Model, inspired by the standard Mixture of Experts (MoE) model. The key idea here is that each expert learns different sets of features from the feature vector, which helps in better emotion detection from the tweet. We compared the results of our Experts Model with both baseline results and the top five performers of SemEval-2018 Task-1, Affect in Tweets (AIT). The experimental results show that our proposed approach deals with the emotion detection problem and ranks among the top-5 results.
Submitted 20 March, 2019;
originally announced April 2019.
-
Mixture of Regression Experts in fMRI Encoding
Authors:
Subba Reddy Oota,
Adithya Avvaru,
Naresh Manwani,
Raju S. Bapi
Abstract:
fMRI semantic category understanding using linguistic encoding models attempts to learn a forward mapping that relates stimuli to the corresponding brain activation. Classical encoding models use linear multi-variate methods to predict the brain activation (all voxels) given the stimulus. However, these methods essentially assume multiple regions as one large uniform region or several independent regions, ignoring connections among them. In this paper, we present a mixture of experts-based model where a group of experts captures brain activity patterns related to particular regions of interest (ROI) and also show the discrimination across different experts. The model is trained with word stimuli encoded as 25-dimensional feature vectors as input and the corresponding brain responses as output. Given a new word (25-dimensional feature vector), it predicts the entire brain activation as the linear combination of multiple experts' brain activations. We argue that each expert learns a certain region of brain activations corresponding to its category of words, which solves the problem of identifying the regions with a simple encoding model. We show that the proposed mixture of experts-based model indeed learns region-based experts to predict the brain activations with high spatial accuracy.
Submitted 1 December, 2018; v1 submitted 26 November, 2018;
originally announced November 2018.
-
fMRI Semantic Category Decoding using Linguistic Encoding of Word Embeddings
Authors:
Subba Reddy Oota,
Naresh Manwani,
Bapi Raju S
Abstract:
How the human brain represents conceptual knowledge has been debated in many scientific fields. Brain imaging studies have shown that the spatial patterns of neural activation in the brain are correlated with thinking about different semantic categories of words (for example, tools, animals, and buildings) or when viewing the related pictures. In this paper, we present a computational model that learns to predict the neural activation captured in functional magnetic resonance imaging (fMRI) data of test words. Unlike the models with hand-crafted features that have been used in the literature, in this paper we propose a novel approach wherein decoding models are built with features extracted from popular linguistic encodings of Word2Vec, GloVe, and Meta-Embeddings in conjunction with the empirical fMRI data associated with viewing several dozen concrete nouns. We compared these models with several other models that use word features extracted from FastText, randomly generated features, and Mitchell's 25 features [1]. The experimental results show that the predicted fMRI images using Meta-Embeddings meet the state-of-the-art performance. Although models with features from GloVe and Word2Vec predict fMRI images similar to the state-of-the-art model, the model with features from Meta-Embeddings predicts significantly better. The proposed scheme that uses popular linguistic encoding offers a simple and easy approach for semantic decoding from fMRI experiments.
Submitted 13 June, 2018;
originally announced June 2018.
-
Clickbait detection using word embeddings
Authors:
Vijayasaradhi Indurthi,
Subba Reddy Oota
Abstract:
Clickbait is a pejorative term describing web content that is aimed at generating online advertising revenue, especially at the expense of quality or accuracy, relying on sensationalist headlines or eye-catching thumbnail pictures to attract click-throughs and to encourage forwarding of the material over online social networks. We use distributed word representations of the words in the title as features to identify clickbaits in online news media. We train a machine learning model using linear regression to predict the clickbait score of a given tweet. Our methods achieve an F1-score of 64.98\% and an MSE of 0.0791. Compared to other methods, our method is simple, fast to train, does not require extensive feature engineering, and yet is moderately effective.
Submitted 8 October, 2017;
originally announced October 2017.
-
Missing-mass spectroscopy with the ${}^{6}$Li($π^{-}, K^{+}$)X reaction to search for ${}^{6}_Λ$H
Authors:
R. Honda,
M. Agnello,
J. K. Ahn,
S. Ajimura,
Y. Akazawa,
N. Amano,
K. Aoki,
H. C. Bhang,
N. Chiga,
M. Endo,
P. Evtoukhovitch,
A. Feliciello,
H. Fujioka,
T. Fukuda,
S. Hasegawa,
S. H. Hayakawa,
K. Hosomi,
S. H. Hwang,
Y. Ichikawa,
Y. Igarashi,
K. Imai,
N. Ishibashi,
R. Iwasaki,
C. W. Joo,
R. Kiuchi
, et al. (41 additional authors not shown)
Abstract:
We searched for the bound state of the neutron-rich $Λ$-hypernucleus ${}^{6}_Λ$H, using the ${}^{6}$Li($π^{-}, K^{+}$)X double charge-exchange reaction at a $π^{-}$ beam momentum of 1.2 GeV/c at J-PARC. A total of $1.4 \times 10^{12}$ $π^{-}$ was driven onto a ${}^{6}$Li target of 3.5-g/$\rm cm^2$ thickness. No event was observed below the bound threshold, i.e., the mass of ${}^{4}_Λ$H + 2n, in the missing-mass spectrum of the ${}^{6}$Li($π^{-}, K^{+}$)X reaction in the $2^{\circ}$ < $θ_{πK}$ < $20^{\circ}$ angular range. Furthermore, no event was found up to 2.8 MeV/$c^2$ above the bound threshold. We obtained the double-differential cross section spectra of the ${}^{6}$Li($π^{-}, K^{+}$)X reaction in the angular range of $2^{\circ}$ < $θ_{πK}$ < $14^{\circ}$. An upper limit of 0.56 nb/sr (90% C.L.) was obtained for the production cross section of the ${}^{6}_Λ$H hypernucleus bound state. In addition, not only the bound state region, but also the $Λ$ continuum region and part of the $Σ^{-}$ quasi-free production region of the ${}^{6}$Li($π^{-}, K^{+}$)X reaction, were obtained with high statistics. The present missing-mass spectrum will facilitate the investigation of the $Σ^{-}$-nucleus optical potential for $Σ^{-}$-${}^{5}$He through spectrum shape analysis.
Submitted 3 March, 2017; v1 submitted 2 March, 2017;
originally announced March 2017.
-
Phason space analysis and structure modeling of 100 A-scale dodecagonal quasicrystal in Mn-based alloy
Authors:
Tsutomu Ishimasa,
Shuhei Iwami,
Norihito Sakaguchi,
Ryo Oota,
Marek Mihalkovic
Abstract:
The dodecagonal quasicrystal classified into the five-dimensional space group P126/mmc, recently discovered in a Mn-Cr-Ni-Si alloy, has been analyzed using atomic-resolution spherical aberration-corrected electron microscopy, i.e. high-angle annular dark-field scanning transmission electron microscopy (HAADF-STEM) and conventional transmission electron microscopy. By observing along the 12-fold axis, non-periodic tiling consisting of an equilateral triangle and a square has been revealed, of which common edge length is a = 4.560 A. These tiles tend to form a network of dodecagons of which size is (2+Sqrt(3))a ~ 17 A in diameter. The tiling was interpreted as an aggregate of 100 A-scale oriented domains of high- and low-quality quasicrystals with small crystallites appearing at their boundaries. The quasicrystal domains exhibited a densely-filled circular acceptance region in the phason space. This is the first observation of the acceptance region in an actual dodecagonal quasicrystal.
Atomic structure model consistent with the electron microscopy images is a standard Frank-Kasper decoration of the triangle and square tiles, that can be inferred from the crystal structures of Zr4Al3 and Cr3Si. Four kinds of layers located at z = 0, +-1/4 and 1/2 are stacked periodically along the 12-fold axis, and the atoms at z = 0 and 1/2 form hexagonal anti-prisms consistently with the 126-screw axis. The validity of this structure model was examined by means of powder X-ray diffraction.
Submitted 19 August, 2015;
originally announced August 2015.
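The dodecagon size quoted in the abstract above, (2+Sqrt(3))a with a = 4.560 A, can be checked with a few lines of arithmetic. The sketch below is only a numerical check of that expression; the function name is hypothetical.

```python
import math


def dodecagon_diameter(edge_length: float) -> float:
    """Dodecagon diameter (2 + sqrt(3)) * a for tile edge length a.

    The result is in the same length unit as edge_length.
    """
    return (2.0 + math.sqrt(3.0)) * edge_length


if __name__ == "__main__":
    # Edge length a = 4.560, as quoted in the abstract.
    print(f"{dodecagon_diameter(4.560):.2f}")
```

The result rounds to the value of about 17 A stated in the abstract.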
-
Observation of the "$K^-pp$"-like structure in the $d(π^+, K^+)$ reaction at 1.69 GeV/$c$
Authors:
Yudai Ichikawa,
Tomofumi Nagae,
Hiroyuki Fujioka,
Hyoungchan Bhang,
Stefania Bufalino,
Hiroyuki Ekawa,
Petr Evtoukhovitch,
Alessandro Feliciello,
Shoichi Hasegawa,
Shuhei Hayakawa,
Ryotaro Honda,
Kenji Hosomi,
Kenichi Imai,
Shigeru Ishimoto,
Changwoo Joo,
Shunsuke Kanatsuki,
Ryuta Kiuchi,
Takeshi Koike,
Harphool Kumawat,
Yuki Matsumoto,
Koji Miwa,
Manabu Moritsu,
Megumi Naruki,
Masayuki Niiyama,
Yuki Nozawa
, et al. (19 additional authors not shown)
Abstract:
We have observed a "$K^-pp$"-like structure in the $d(π^+,K^+)$ reaction at 1.69 GeV/$c$. In this reaction, the $Λ(1405)$ hyperon resonance is expected to be produced as a doorway to form the $K^-pp$ through the $Λ^*p\rightarrow K^-pp$ process. However, most of the produced $Λ(1405)$'s would escape from the deuteron without secondary reactions. Therefore, coincidence of high-momentum ($>$ 250~MeV/$c$) proton(s) in large emission angles ($39^\circ<θ_{lab.}<122^\circ$) was required to enhance the signal-to-background ratio. A broad enhancement in the proton coincidence spectra is observed around the missing-mass of 2.27 GeV/$c^2$, which corresponds to the $K^-pp$ binding energy of 95 $^{+18}_{-17}$ (stat.) $^{+30}_{-21}$ (syst.) MeV and the width of 162 $^{+87}_{-45}$ (stat.) $^{+66}_{-78}$ (syst.) MeV.
Submitted 24 November, 2014;
originally announced November 2014.
-
Inclusive spectrum of the $d(π^+, K^+)$ reaction at 1.69 GeV/c
Authors:
Yudai Ichikawa,
Tomofumi Nagae,
Hyoungchan Bhang,
Stefania Bufalino,
Hiroyuki Ekawa,
Petr Evtoukhovitch,
Alessandro Feliciello,
Hiroyuki Fujioka,
Shoichi Hasegawa,
Shuhei Hayakawa,
Ryotaro Honda,
Kenji Hosomi,
Ken'ichi Imai,
Shigeru Ishimoto,
Changwoo Joo,
Shunsuke Kanatsuki,
Ryuta Kiuchi,
Takeshi Koike,
Harphool Kumawat,
Yuki Matsumoto,
Koji Miwa,
Manabu Moritsu,
Megumi Naruki,
Masayuki Niiyama,
Yuki Nozawa
, et al. (19 additional authors not shown)
Abstract:
We have measured an inclusive missing-mass spectrum of the $d(π^+, K^+)$ reaction at the pion incident momentum of 1.69 GeV/$c$ at the laboratory scattering angles between 2$^\circ$ and 16$^\circ$ with the missing-mass resolution of 2.7 $\pm$ 0.1 MeV/$c^2$ (FWHM) at the missing mass of 2.27 GeV/$c^{2}$. In this Letter, we first try to understand the spectrum as a simple quasi-free picture based on several known elementary cross sections, considering the neutron/proton Fermi motion in deuteron. While gross spectrum structures are well understood in this picture, we have observed two distinct deviations; one peculiar enhancement at 2.13 GeV/$c^2$ is due to the $ΣN$ cusp, and the other notable feature is a shift of a broad bump structure, mainly originating from hyperon resonance productions of $Λ(1405)$ and $Σ(1385)^{+/0}$, by about 22.4 $\pm$ 0.4 (stat.) $^{+2.7}_{-1.7}$ (syst.) MeV/$c^2$ toward the low-mass side, which is calculated in the kinematics of a proton at rest as the target.
Submitted 24 August, 2014; v1 submitted 11 July, 2014;
originally announced July 2014.
-
High-resolution search for the $Θ^{+}$ pentaquark via a pion-induced reaction at J-PARC
Authors:
J-PARC E19 Collaboration,
M. Moritsu,
S. Adachi,
M. Agnello,
S. Ajimura,
K. Aoki,
H. C. Bhang,
B. Bassalleck,
E. Botta,
S. Bufalino,
N. Chiga,
H. Ekawa,
P. Evtoukhovitch,
A. Feliciello,
H. Fujioka,
S. Hayakawa,
F. Hiruma,
R. Honda,
K. Hosomi,
Y. Ichikawa,
M. Ieiri,
Y. Igarashi,
K. Imai,
N. Ishibashi
, et al. (51 additional authors not shown)
Abstract:
The pentaquark $Θ^+$ has been searched for via the $π^-p \to K^-X$ reaction with beam momenta of 1.92 and 2.01 GeV/$c$ at J-PARC. A missing mass resolution of 2 MeV (FWHM) was achieved but no sharp peak structure was observed. The upper limits on the production cross section averaged over the scattering angle from 2$^{\circ}$ to 15$^{\circ}$ in the laboratory frame were found to be less than 0.28 $μ$b/sr at the 90\% confidence level for both the 1.92- and 2.01-GeV/$c$ data. The systematic uncertainty of the upper limits was controlled within 10\%. Constraints on the $Θ^+$ decay width were also evaluated with a theoretical calculation using effective Lagrangian. The present result implies that the width should be less than 0.36 and 1.9 MeV for the spin-parity of $1/2^+$ and $1/2^-$, respectively.
Submitted 8 October, 2014; v1 submitted 2 July, 2014;
originally announced July 2014.
-
Search for $^6_Λ$H hypernucleus by the $^6$Li$(π^-,K^+)$ reaction at $p_{π^-}$ = 1.2 GeV/$c$
Authors:
H. Sugimura,
M. Agnello,
J. K. Ahn,
S. Ajimura,
Y. Akazawa,
N. Amano,
K. Aoki,
H. C. Bhang,
N. Chiga,
M. Endo,
P. Evtoukhovitch,
A. Feliciello,
H. Fujioka,
T. Fukuda,
S. Hasegawa,
S. Hayakawa,
R. Honda,
K. Hosomi,
S. H. Hwang,
Y. Ichikawa,
Y. Igarashi,
K. Imai,
N. Ishibashi,
R. Iwasaki,
C. W. Joo
, et al. (41 additional authors not shown)
Abstract:
We have carried out an experiment to search for a neutron-rich hypernucleus, $^6_Λ$H, by the $^6$Li($π^-,K^+$) reaction at $p_{π^-}$ =1.2 GeV/$c$. The obtained missing mass spectrum with an estimated energy resolution of 3.2 MeV (FWHM) showed no peak structure corresponding to the $^6_Λ$H hypernucleus either below or above the $^4_Λ$H$+2n$ particle decay threshold. An upper limit of the production cross section for the bound $^6_Λ$H hypernucleus was estimated to be 1.2 nb/sr at 90% confidence level.
Submitted 5 February, 2014; v1 submitted 22 October, 2013;
originally announced October 2013.
-
Measuring Fit of Sequence Data to Phylogenetic Model: Gain of Power using Marginal Tests
Authors:
Peter J. Waddell,
Rissa Ota,
David Penny
Abstract:
Testing fit of data to model is fundamentally important to any science, but publications in the field of phylogenetics rarely do this. Such analyses discard fundamental aspects of science as prescribed by Karl Popper. Indeed, not without cause, Popper (1978) once argued that evolutionary biology was unscientific as its hypotheses were untestable. Here we trace developments in assessing fit from Penny et al. (1982) to the present. We compare the general log-likelihood ratio statistic (the G or G2 statistic) between the evolutionary tree model and the multinomial model with that of marginalized tests applied to an alignment (using placental mammal coding sequence data). It is seen that the most general test does not reject the fit of data to model (p~0.5), but the marginalized tests do. Tests on pair-wise frequency (F) matrices strongly (p < 0.001) reject the most general phylogenetic (GTR) models commonly in use. It is also clear (p < 0.01) that the sequences are not stationary in their nucleotide composition. Deviations from stationarity and homogeneity seem to be unevenly distributed amongst taxa, and not necessarily in the taxa expected from examining other regions of the genome. By marginalizing the 4t patterns of the i.i.d. model to observed and expected parsimony counts, that is, from constant sites, to singletons, to parsimony informative characters of a minimum possible length, the likelihood ratio test regains power, and it too rejects the evolutionary model with p << 0.001. Given such behavior over relatively recent evolutionary time, readers in general should maintain a healthy skepticism of results, as the scale of the systematic errors in published analyses may really be far larger than the analytical methods (e.g., bootstrap) report.
Submitted 30 December, 2008;
originally announced December 2008.
-
Status report of the Tokyo axion helioscope experiment
Authors:
Y. Inoue,
M. Minowa,
Y. Akimoto,
R. Ota,
T. Mizumoto,
A. Yamamoto
Abstract:
We have searched for solar axions with a detector which consists of a 4 T x 2.3 m superconducting magnet, PIN-photodiode X-ray detectors, and an altazimuth mount to track the sun. The conversion region is filled with cold helium gas, which modifies the axion mass at which coherent conversion occurs. In past measurements, axion masses from 0 to 0.27 eV have been scanned. Since no positive evidence was seen, an upper limit on the axion-photon coupling constant was set at g < 6-10E-10/GeV (95% CL), depending on the axion mass. We are now actively preparing for a new stage of the experiment aimed at solar axions with masses from one to a few eV. In this mass region, our detector might be able to probe parameter regions that are favored by axion models.
Submitted 9 June, 2008;
originally announced June 2008.