Search | arXiv e-print repository

arXiv:2406.19287 [pdf, other]

Isotropy of cosmic rays beyond $10^{20}$ eV favors their heavy mass composition

Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He, et al. (118 additional authors not shown)

Abstract: We report an estimation of the injected mass composition of ultra-high energy cosmic rays (UHECRs) at energies higher than 10 EeV. The composition is inferred from an energy-dependent sky distribution of UHECR events observed by the Telescope Array surface detector by comparing it to the Large Scale Structure of the local Universe. In the case of negligible extra-galactic magnetic fields (EGMF), the results are consistent with a relatively heavy injected composition at E ~ 10 EeV that becomes lighter up to E ~ 100 EeV, while the composition at E > 100 EeV is very heavy. The latter is true even in the presence of the highest experimentally allowed EGMF, while the composition at lower energies can be light if a strong EGMF is present. The effect of the uncertainty in the galactic magnetic field on these results is subdominant.

Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

Comments: 8 pages, 3 figures, accepted for publication in PRL

arXiv:2406.19286 [pdf, other]

Mass composition of ultra-high energy cosmic rays from distribution of their arrival directions with the Telescope Array

Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He, et al. (118 additional authors not shown)

Abstract: We use a new method to estimate the injected mass composition of ultrahigh-energy cosmic rays (UHECRs) at energies higher than 10 EeV. The method is based on comparison of the energy-dependent distribution of cosmic ray arrival directions as measured by the Telescope Array experiment (TA) with that calculated in a given putative model of UHECR under the assumption that sources trace the large-scale structure (LSS) of the Universe. As we report in the companion letter, the TA data show large deflections with respect to the LSS which can be explained, assuming small extra-galactic magnetic fields (EGMF), by an intermediate composition changing to a heavy one (iron) in the highest energy bin. Here we show that these results are robust to uncertainties in UHECR injection spectra, the energy scale of the experiment and galactic magnetic fields (GMF). The assumption of weak EGMF, however, strongly affects this interpretation at all but the highest energies E > 100 EeV, where the remarkable isotropy of the data implies a heavy injected composition even in the case of strong EGMF. This result also holds if UHECR sources are as rare as $2 \times 10^{-5}$ Mpc$^{-3}$, which is the conservative lower limit for the source number density.

Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

Comments: 18 pages, 11 figures, accepted for publication in PRD

arXiv:2406.16535 [pdf, other]

Token-based Decision Criteria Are Suboptimal in In-context Learning

Authors: Hakaze Cho, Yoshihiro Sakai, Mariko Kato, Kenshiro Tanaka, Akira Ishii, Naoya Inoue

Abstract: In-Context Learning (ICL) typically utilizes classification criteria from probabilities of manually selected label tokens. However, we argue that such token-based classification criteria lead to suboptimal decision boundaries, despite delicate calibrations through translation and constrained rotation. To address this problem, we propose Hidden Calibration, which renounces token probabilities and uses the nearest centroid classifier on the LM's last hidden states. In detail, we use the nearest centroid classification on the hidden states, assigning the category of the nearest centroid previously observed from a few-shot calibration set to the test sample as the predicted label. Our experiments on 3 models and 10 classification datasets indicate that Hidden Calibration consistently outperforms current token-based calibrations by about 20%. Our further analysis demonstrates that Hidden Calibration finds better classification criteria with less inter-category overlap, and LMs provide linearly separable intra-category clusters with the help of demonstrations, which supports Hidden Calibration and gives new insights into the conventional ICL.

Submitted 24 June, 2024; originally announced June 2024.

Comments: 21 pages, 14 figures, 8 tables

arXiv:2406.14240 [pdf, other]

CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information

Authors: Jungdae Lee, Taiki Miyanishi, Shuhei Kurita, Koya Sakamoto, Daichi Azuma, Yutaka Matsuo, Nakamasa Inoue

Abstract: Vision-and-language navigation (VLN) aims to guide autonomous agents through real-world environments by integrating visual and linguistic cues. While substantial progress has been made in understanding these interactive modalities in ground-level navigation, aerial navigation remains largely underexplored. This is primarily due to the scarcity of resources suitable for real-world, city-scale aerial navigation studies. To bridge this gap, we introduce CityNav, a new dataset for language-goal aerial navigation using a 3D point cloud representation from real-world cities. CityNav includes 32,637 natural language descriptions paired with human demonstration trajectories, collected from participants via a new web-based 3D simulator developed for this research. Each description specifies a navigation goal, leveraging the names and locations of landmarks within real-world cities. We also provide baseline models of navigation agents that incorporate an internal 2D spatial map representing landmarks referenced in the descriptions. We benchmark the latest aerial navigation baselines and our proposed model on the CityNav dataset. The results using this dataset reveal the following key findings: (i) Our aerial agent models trained on human demonstration trajectories outperform those trained on shortest path trajectories, highlighting the importance of human-driven navigation strategies; (ii) The integration of a 2D spatial map significantly enhances navigation efficiency at city scale. Our dataset and code are available at https://water-cookie.github.io/city-nav-proj/

Submitted 20 June, 2024; originally announced June 2024.

Comments: The first two authors contributed equally

arXiv:2406.12402 [pdf, other]

Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling

Authors: Irfan Robbani, Paul Reisert, Naoya Inoue, Surawat Pothong, Camélia Guerraoui, Wenzhi Wang, Shoichi Naito, Jungmin Choi, Kentaro Inui

Abstract: Prior research in computational argumentation has mainly focused on scoring the quality of arguments, with less attention on explicating logical errors. In this work, we introduce four sets of explainable templates for common informal logical fallacies designed to explicate a fallacy's implicit logic. Using our templates, we conduct an annotation study on top of 400 fallacious arguments taken from the LOGIC dataset and achieve a high agreement score (Krippendorff's alpha of 0.54) and reasonable coverage (0.83). Finally, we conduct an experiment for detecting the structure of fallacies and discover that state-of-the-art language models struggle with detecting fallacy templates (0.47 accuracy). To facilitate research on fallacies, we make our dataset and guidelines publicly available.

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2406.08232 [pdf, other]

OpenCOLE: Towards Reproducible Automatic Graphic Design Generation

Authors: Naoto Inoue, Kento Masui, Wataru Shimoda, Kota Yamaguchi

Abstract: Automatic generation of graphic designs has recently received considerable attention. However, the state-of-the-art approaches are complex and rely on proprietary datasets, which creates reproducibility barriers. In this paper, we propose an open framework for automatic graphic design called OpenCOLE, where we build a modified version of the pioneering COLE and train our model exclusively on publicly available datasets. Based on GPT4V evaluations, our model shows promising performance comparable to the original COLE. We release the pipeline and training results to encourage open development.

Submitted 12 June, 2024; originally announced June 2024.

Comments: To appear as an extended abstract (EA) in Workshop on Graphic Design Understanding and Generation (in CVPR2024), code: https://github.com/CyberAgentAILab/OpenCOLE

arXiv:2406.01468 [pdf, other]

Understanding Token Probability Encoding in Output Embeddings

Authors: Hakaze Cho, Yoshihiro Sakai, Kenshiro Tanaka, Mariko Kato, Naoya Inoue

Abstract: In this paper, we investigate the output token probability information in the output embedding of language models. We provide an approximate common log-linear encoding of output token probabilities within the output embedding vectors and demonstrate that it is accurate and sparse when the output space is large and output logits are concentrated. Based on such findings, we edit the encoding in output embedding to modify the output probability distribution accurately. Moreover, the sparsity we find in output probability encoding suggests that a large number of dimensions in the output embedding do not contribute to causal language modeling. Therefore, we attempt to delete the output-unrelated dimensions and find more than 30% of the dimensions can be deleted without significant movement in output distribution and degeneration on sequence generation. Additionally, in training dynamics, we use such encoding as a probe and find that the output embeddings capture token frequency information in early steps, even before an obvious convergence starts.

Submitted 3 June, 2024; originally announced June 2024.

Comments: 15 pages, 17 figures, 3 tables

arXiv:2403.18187 [pdf, other]

LayoutFlow: Flow Matching for Layout Generation

Authors: Julian Jorge Andrade Guerreiro, Naoto Inoue, Kento Masui, Mayu Otani, Hideki Nakayama

Abstract: Finding a suitable layout represents a crucial task for diverse applications in graphic design. Motivated by simpler and smoother sampling trajectories, we explore the use of Flow Matching as an alternative to current diffusion-based layout generation models. Specifically, we propose LayoutFlow, an efficient flow-based model capable of generating high-quality layouts. Instead of progressively denoising the elements of a noisy layout, our method learns to gradually move, or flow, the elements of an initial sample until it reaches its final prediction. In addition, we employ a conditioning scheme that allows us to handle various generation tasks with varying degrees of conditioning with a single model. Empirically, LayoutFlow performs on par with state-of-the-art models while being significantly faster.

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2402.05515 [pdf, other]

NoisyICL: A Little Noise in Model Parameters Calibrates In-context Learning

Authors: Yufeng Zhao, Yoshihiro Sakai, Naoya Inoue

Abstract: In-Context Learning (ICL) is suffering from unsatisfactory performance and under-calibration due to high prior bias and unfaithful confidence. Some previous works fine-tuned language models for better ICL performance with enormous datasets and computing costs. In this paper, we propose NoisyICL, simply perturbing the model parameters by random noise to strive for better performance and calibration. Our experiments on two models and 12 downstream datasets show that NoisyICL can help ICL produce more accurate predictions. Our further analysis indicates that NoisyICL enables the model to provide fairer predictions with more faithful confidence. Therefore, we believe that NoisyICL is an effective calibration of ICL. Our experimental code is uploaded to GitHub.

Submitted 15 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 20 pages, 28 figures, 7 tables (5 pages, 4 figures, 1 table in main body). ACL 2024 under review

arXiv:2401.06525 [pdf, other]

doi 10.1016/j.astropartphys.2023.102891

EUSO-SPB1 Mission and Science

Authors: JEM-EUSO Collaboration, G. Abdellaoui, S. Abe, J. H. Adams. Jr., D. Allard, G. Alonso, L. Anchordoqui, A. Anzalone, E. Arnone, K. Asano, R. Attallah, H. Attoui, M. Ave Pernas, R. Bachmann, S. Bacholle, M. Bagheri, M. Bakiri, J. Baláz, D. Barghini, S. Bartocci, M. Battisti, J. Bayer, B. Beldjilali, T. Belenguer, et al. (271 additional authors not shown)

Abstract: The Extreme Universe Space Observatory on a Super Pressure Balloon 1 (EUSO-SPB1) was launched in 2017 April from Wanaka, New Zealand. The plan of this mission of opportunity on a NASA super pressure balloon test flight was to circle the southern hemisphere. The primary scientific goal was to make the first observations of ultra-high-energy cosmic-ray extensive air showers (EASs) by looking down on the atmosphere with an ultraviolet (UV) fluorescence telescope from suborbital altitude (33 km). After 12 days and 4 hours aloft, the flight was terminated prematurely in the Pacific Ocean. Before the flight, the instrument was tested extensively in the West Desert of Utah, USA, with UV point sources and lasers. The test results indicated that the instrument had sensitivity to EASs of approximately 3 EeV. Simulations of the telescope system, telescope on time, and realized flight trajectory predicted an observation of about 1 event assuming clear sky conditions. The effects of high clouds were estimated to reduce this value by approximately a factor of 2. A manual search and a machine-learning-based search did not find any EAS signals in these data. Here we review the EUSO-SPB1 instrument and flight and the EAS search.

Submitted 12 January, 2024; originally announced January 2024.

Comments: 18 pages, 19 figures

Journal ref: Astropart Phys 154 (2024) 102891

arXiv:2312.09718 [pdf, other]

Discovering Highly Influential Shortcut Reasoning: An Automated Template-Free Approach

Authors: Daichi Haraguchi, Kiyoaki Shirai, Naoya Inoue, Natthawut Kertkeidkachorn

Abstract: Shortcut reasoning is an irrational process of inference, which degrades the robustness of an NLP model. While a number of previous works have tackled the identification of shortcut reasoning, there are still two major limitations: (i) a method for quantifying the severity of the discovered shortcut reasoning is not provided; (ii) certain types of shortcut reasoning may be missed. To address these issues, we propose a novel method for identifying shortcut reasoning. The proposed method quantifies the severity of the shortcut reasoning by leveraging out-of-distribution data and does not make any assumptions about the type of tokens triggering the shortcut reasoning. Our experiments on Natural Language Inference and Sentiment Analysis demonstrate that our framework successfully discovers known and unknown shortcut reasoning in the previous work.

Submitted 15 December, 2023; originally announced December 2023.

arXiv:2311.13602 [pdf, other]

Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation

Authors: Daichi Horita, Naoto Inoue, Kotaro Kikuchi, Kota Yamaguchi, Kiyoharu Aizawa

Abstract: Content-aware graphic layout generation aims to automatically arrange visual elements along with a given content, such as an e-commerce product image. In this paper, we argue that the current layout generation approaches suffer from the limited training data for the high-dimensional layout structure. We show that a simple retrieval augmentation can significantly improve the generation quality. Our model, which is named Retrieval-Augmented Layout Transformer (RALF), retrieves nearest neighbor layout examples based on an input image and feeds these results into an autoregressive generator. Our model can apply retrieval augmentation to various controllable generation tasks and yield high-quality layouts within a unified architecture. Our extensive experiments show that RALF successfully generates content-aware layouts in both constrained and unconstrained settings and significantly outperforms the baselines.

Submitted 15 April, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

Comments: Accepted to CVPR 2024 (Oral), Project website: https://udonda.github.io/RALF/ , GitHub: https://github.com/CyberAgentAILab/RALF

arXiv:2310.18773 [pdf, other]

CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data

Authors: Taiki Miyanishi, Fumiya Kitamori, Shuhei Kurita, Jungdae Lee, Motoaki Kawanabe, Nakamasa Inoue

Abstract: City-scale 3D point cloud is a promising way to express detailed and complicated outdoor structures. It encompasses both the appearance and geometry features of segmented city components, including cars, streets, and buildings, that can be utilized for attractive applications such as user-interactive navigation of autonomous vehicles and drones. However, compared to the extensive text annotations available for images and indoor scenes, the scarcity of text annotations for outdoor scenes poses a significant challenge for achieving these applications. To tackle this problem, we introduce the CityRefer dataset for city-level visual grounding. The dataset consists of 35k natural language descriptions of 3D objects appearing in SensatUrban city scenes and 5k landmark labels synchronized with OpenStreetMap. To ensure the quality and accuracy of the dataset, all descriptions and labels in the CityRefer dataset are manually verified. We have also developed a baseline system that can learn encoded language descriptions, 3D object instances, and geographical information about the city's landmarks to perform visual grounding on the CityRefer dataset. To the best of our knowledge, the CityRefer dataset is the largest city-level visual grounding dataset for localizing specific 3D objects.

Submitted 28 October, 2023; originally announced October 2023.

Comments: NeurIPS D&B 2023. The first two authors contributed equally

arXiv:2309.17083 [pdf, other]

SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning

Authors: Risa Shinoda, Ryo Hayamizu, Kodai Nakashima, Nakamasa Inoue, Rio Yokota, Hirokatsu Kataoka

Abstract: Pre-training is a strong strategy for enhancing visual models to efficiently train them with a limited number of labeled images. In semantic segmentation, creating annotation masks requires an intensive amount of labor and time, and therefore, a large-scale pre-training dataset with semantic labels is quite difficult to construct. Moreover, what matters in semantic segmentation pre-training has not been fully investigated. In this paper, we propose the Segmentation Radial Contour DataBase (SegRCDB), which for the first time applies formula-driven supervised learning for semantic segmentation. SegRCDB enables pre-training for semantic segmentation without real images or any manual semantic labels. SegRCDB is based on insights about what is important in pre-training for semantic segmentation and allows efficient pre-training. Pre-training with SegRCDB achieved higher mIoU than the pre-training with COCO-Stuff for fine-tuning on ADE-20k and Cityscapes with the same number of training images. SegRCDB has a high potential to contribute to semantic segmentation pre-training and investigation by enabling the creation of large datasets without manual annotation. The SegRCDB dataset will be released under a license that allows research and commercial use. Code is available at: https://github.com/dahlian00/SegRCDB

Submitted 29 September, 2023; originally announced September 2023.

Comments: ICCV2023. Code: https://github.com/dahlian00/SegRCDB, Project page: https://dahlian00.github.io/SegRCDBPage/

arXiv:2307.15341 [pdf, other]

Teach Me How to Improve My Argumentation Skills: A Survey on Feedback in Argumentation

Authors: Camélia Guerraoui, Paul Reisert, Naoya Inoue, Farjana Sultana Mim, Shoichi Naito, Jungmin Choi, Irfan Robbani, Wenzhi Wang, Kentaro Inui

Abstract: The use of argumentation in education has been shown to improve critical thinking skills for end-users such as students, and computational models for argumentation have been developed to assist in this process. Although these models are useful for evaluating the quality of an argument, they oftentimes cannot explain why a particular argument is considered poor or not, which makes it difficult to provide constructive feedback to users to strengthen their critical thinking skills. In this survey, we aim to explore the different dimensions of feedback (Richness, Visualization, Interactivity, and Personalization) provided by the current computational models for argumentation, and the possibility of enhancing the power of explanations of such models, ultimately helping learners improve their critical thinking skills.

Submitted 28 July, 2023; originally announced July 2023.

Comments: 14 pages, 4 figures

arXiv:2307.14710 [pdf, other]

Pre-training Vision Transformers with Very Limited Synthesized Images

Authors: Ryo Nakamura, Hirokatsu Kataoka, Sora Takashima, Edgar Josafat Martinez Noriega, Rio Yokota, Nakamasa Inoue

Abstract: Formula-driven supervised learning (FDSL) is a pre-training method that relies on synthetic images generated from mathematical formulae such as fractals. Prior work on FDSL has shown that pre-training vision transformers on such synthetic datasets can yield competitive accuracy on a wide range of downstream tasks. These synthetic images are categorized according to the parameters in the mathematical formula that generate them. In the present work, we hypothesize that the process for generating different instances for the same category in FDSL can be viewed as a form of data augmentation. We validate this hypothesis by replacing the instances with data augmentation, which means we only need a single image per category. Our experiments show that this one-instance fractal database (OFDB) performs better than the original dataset where instances were explicitly generated. We further scale up OFDB to 21,000 categories and show that it matches, or even surpasses, the model pre-trained on ImageNet-21k in ImageNet-1k fine-tuning. The number of images in OFDB is 21k, whereas ImageNet-21k has 14M. This opens new possibilities for pre-training vision transformers with much smaller datasets.

Submitted 30 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

Comments: Accepted to ICCV 2023

arXiv:2305.13844 [pdf, other]

Arukikata Travelogue Dataset with Geographic Entity Mention, Coreference, and Link Annotation

Authors: Shohei Higashiyama, Hiroki Ouchi, Hiroki Teranishi, Hiroyuki Otomo, Yusuke Ide, Aitaro Yamamoto, Hiroyuki Shindo, Yuki Matsuda, Shoko Wakamiya, Naoya Inoue, Ikuya Yamada, Taro Watanabe

Abstract: Geoparsing is a fundamental technique for analyzing geo-entity information in text. We focus on document-level geoparsing, which considers geographic relatedness among geo-entity mentions, and present a Japanese travelogue dataset designed for evaluating document-level geoparsing systems. Our dataset comprises 200 travelogue documents with rich geo-entity information: 12,171 mentions, 6,339 coreference clusters, and 2,551 geo-entities linked to geo-database entries.

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.11444 [pdf, other]

Arukikata Travelogue Dataset

Authors: Hiroki Ouchi, Hiroyuki Shindo, Shoko Wakamiya, Yuki Matsuda, Naoya Inoue, Shohei Higashiyama, Satoshi Nakamura, Taro Watanabe

Abstract: We have constructed the Arukikata Travelogue Dataset and released it free of charge for academic research. This dataset is a Japanese text dataset with a total of over 31 million words, comprising 4,672 Japanese domestic travelogues and 9,607 overseas travelogues. Before our dataset was provided, there was a scarcity of widely available travelogue data for research purposes, and each researcher had to prepare their own data. This hinders the replication of existing studies and fair comparative analysis of experimental results. Our dataset enables any researcher to conduct investigations on the same data and to ensure transparency and reproducibility in research. In this paper, we describe the academic significance, characteristics, and prospects of our dataset.

Submitted 19 May, 2023; originally announced May 2023.

Comments: The application website for Arukikata Travelogue Dataset: https://www.nii.ac.jp/dsc/idr/arukikata/

arXiv:2303.18248 [pdf, other]

Towards Flexible Multi-modal Document Models

Authors: Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

Abstract: Creative workflows for generating graphical documents involve complex inter-related tasks, such as aligning elements, choosing appropriate fonts, or employing aesthetically harmonious colors. In this work, we attempt at building a holistic model that can jointly solve many different design tasks. Our model, which we denote by FlexDM, treats vector graphic documents as a set of multi-modal elements, and learns to predict masked fields such as element type, position, styling attributes, image, or text, using a unified architecture. Through the use of explicit multi-task learning and in-domain pre-training, our model can better capture the multi-modal relationships among the different document fields. Experimental results corroborate that our single FlexDM is able to successfully solve a multitude of different design tasks, while achieving performance that is competitive with task-specific and costly baselines.

Submitted 31 March, 2023; originally announced March 2023.

Comments: To be published in CVPR2023 (highlight), project page: https://cyberagentailab.github.io/flex-dm

arXiv:2303.08137 [pdf, other]

LayoutDM: Discrete Diffusion Model for Controllable Layout Generation

Authors: Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi

Abstract: Controllable layout generation aims at synthesizing plausible arrangement of element bounding boxes with optional constraints, such as type or position of a specific element. In this work, we try to solve a broad range of layout generation tasks in a single model that is based on discrete state-space diffusion models. Our model, named LayoutDM, naturally handles the structured layout data in the discrete representation and learns to progressively infer a noiseless layout from the initial input, where we model the layout corruption process by modality-wise discrete diffusion. For conditional generation, we propose to inject layout constraints in the form of masking or logit adjustment during inference. We show in the experiments that our LayoutDM successfully generates high-quality layouts and outperforms both task-specific and task-agnostic baselines on several layout tasks.

Submitted 14 March, 2023; originally announced March 2023.

Comments: To be published in CVPR2023, project page: https://cyberagentailab.github.io/layout-dm/

arXiv:2303.01112 [pdf, other]

Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves

Authors: Sora Takashima, Ryo Hayamizu, Nakamasa Inoue, Hirokatsu Kataoka, Rio Yokota

Abstract: Formula-driven supervised learning (FDSL) has been shown to be an effective method for pre-training vision transformers, where ExFractalDB-21k was shown to exceed the pre-training effect of ImageNet-21k. These studies also indicate that contours mattered more than textures when pre-training vision transformers. However, the lack of a systematic investigation as to why these contour-oriented synthetic datasets can achieve the same accuracy as real datasets leaves much room for skepticism. In the present work, we develop a novel methodology based on circular harmonics for systematically investigating the design space of contour-oriented synthetic datasets. This allows us to efficiently search the optimal range of FDSL parameters and maximize the variety of synthetic images in the dataset, which we found to be a critical factor. When the resulting new dataset VisualAtom-21k is used for pre-training ViT-Base, the top-1 accuracy reached 83.7% when fine-tuning on ImageNet-1k. This is close to the top-1 accuracy (84.2%) achieved by JFT-300M pre-training, while the number of images is 1/14. Unlike JFT-300M which is a static dataset, the quality of synthetic datasets will continue to improve, and the current work is a testament to this possibility. FDSL is also free of the common issues associated with real images, e.g. privacy/copyright issues, labeling costs/errors, and ethical biases.

Submitted 2 March, 2023; originally announced March 2023.

Comments: Accepted to CVPR 2023

arXiv:2212.14178 [pdf]

Theoretical examination of QED Hamiltonian and negative-energy orbitals in relativistic molecular orbital theory

Authors: Nobuki Inoue, Yoshihiro Watanabe, Haruyuki Nakano

Abstract: The relativistic Hartree-Fock and electron correlation methods without the negative-energy orbital problem are examined on the basis of the quantum electrodynamics (QED) Hamiltonian. First, several QED Hamiltonians previously proposed are sifted by the orbital rotation invariance, the charge conjugation and time reversal invariance, and the nonrelativistic limit. A new total energy expression is then proposed, in which a counter term corresponding to the energy of the polarized vacuum is subtracted from the total energy. This expression prevents the possibility of total energy divergence due to electron correlations, stemming from the fact that the QED Hamiltonian does not conserve the number of particles. Finally, based on the Hamiltonian and energy expression, the Dirac-Hartree-Fock (DHF) and electron correlation methods are reintroduced. The resulting QED-based DHF equation has the same form as the conventional DHF equation, but also formally describes systems specific to QED, such as the virtual positrons in the hydride ion and the positron in positronium. Three electron correlation methods are derived: the QED-based configuration interactions and single- and multireference perturbation methods. Numerical calculations show that the total energy of the QED Hamiltonian indeed diverges and that the counter term is effective in avoiding the divergence. The theoretical examinations in the present article suggest that the molecular orbital (MO) methods based on the QED Hamiltonian not only solve the problem of the negative-energy solutions of the relativistic MO method, but also provide a relativistic formalism to treat systems containing positrons.

Submitted 29 December, 2022; originally announced December 2022.

Comments: 39 pages, 6 figures

arXiv:2212.11541 [pdf, other]

Generative Colorization of Structured Mobile Web Pages

Authors: Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi

Abstract: Color is a critical design factor for web pages, affecting important factors such as viewer emotions and the overall trust and satisfaction of a website. Effective coloring requires design knowledge and expertise, but if this process could be automated through data-driven modeling, efficient exploration and alternative workflows would be possible. However, this direction remains underexplored due to the lack of a formalization of the web page colorization problem, datasets, and evaluation protocols. In this work, we propose a new dataset consisting of e-commerce mobile web pages in a tractable format, which are created by simplifying the pages and extracting canonical color styles with a common web browser. The web page colorization problem is then formalized as a task of estimating plausible color styles for a given web page content with a given hierarchical structure of the elements. We present several Transformer-based methods that are adapted to this task by prepending structural message passing to capture hierarchical relationships between elements. Experimental results, including a quantitative evaluation designed for this task, demonstrate the advantages of our methods over statistical and image colorization methods. The code is available at https://github.com/CyberAgentAILab/webcolor.

Submitted 23 January, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

Comments: Accepted to WACV 2023

arXiv:2212.10352 [pdf, other]

Fixed-Weight Difference Target Propagation

Authors: Tatsukichi Shibuya, Nakamasa Inoue, Rei Kawakami, Ikuro Sato

Abstract: Target Propagation (TP) is a biologically more plausible algorithm than the error backpropagation (BP) to train deep networks, and improving practicality of TP is an open issue. TP methods require the feedforward and feedback networks to form layer-wise autoencoders for propagating the target values generated at the output layer. However, this causes certain drawbacks; e.g., careful hyperparameter tuning is required to synchronize the feedforward and feedback training, and frequent updates of the feedback path are usually required than that of the feedforward path. Learning of the feedforward and feedback networks is sufficient to make TP methods capable of training, but is having these layer-wise autoencoders a necessary condition for TP to work? We answer this question by presenting Fixed-Weight Difference Target Propagation (FW-DTP) that keeps the feedback weights constant during training. We confirmed that this simple method, which naturally resolves the abovementioned problems of TP, can still deliver informative target values to hidden layers for a given task; indeed, FW-DTP consistently achieves higher test performance than a baseline, the Difference Target Propagation (DTP), on four classification datasets. We also present a novel propagation architecture that explains the exact form of the feedback function of DTP to analyze FW-DTP.

Submitted 19 December, 2022; originally announced December 2022.

Comments: Accepted at the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23). 9 pages and 3 figures in main manuscript; 11 pages and 5 figures in supplementary material

arXiv:2212.02780 [pdf, ps, other]

Parameter Efficient Transfer Learning for Various Speech Processing Tasks

Authors: Shinta Otake, Rei Kawakami, Nakamasa Inoue

Abstract: Fine-tuning of self-supervised models is a powerful transfer learning method in a variety of fields, including speech processing, since it can utilize generic feature representations obtained from large amounts of unlabeled data. Fine-tuning, however, requires a new parameter set for each downstream task, which is parameter inefficient. Adapter architecture is proposed to partially solve this issue by inserting lightweight learnable modules into a frozen pre-trained model. However, existing adapter architectures fail to adaptively leverage low- to high-level features stored in different layers, which is necessary for solving various kinds of speech processing tasks. Thus, we propose a new adapter architecture to acquire feature representations more flexibly for various speech tasks. In experiments, we applied this adapter to WavLM on four speech tasks. It performed on par or better than naive fine-tuning, with only 11% of learnable parameters. It also outperformed an existing adapter architecture.

Submitted 6 December, 2022; originally announced December 2022.

arXiv:2207.01847 [pdf, other]

PoF: Post-Training of Feature Extractor for Improving Generalization

Authors: Ikuro Sato, Ryota Yamada, Masayuki Tanaka, Nakamasa Inoue, Rei Kawakami

Abstract: It has been intensively investigated that the local shape, especially flatness, of the loss landscape near a minimum plays an important role for generalization of deep models. We developed a training algorithm called PoF: Post-Training of Feature Extractor that updates the feature extractor part of an already-trained deep model to search a flatter minimum. The characteristics are two-fold: 1) Feature extractor is trained under parameter perturbations in the higher-layer parameter space, based on observations that suggest flattening higher-layer parameter space, and 2) the perturbation range is determined in a data-driven manner aiming to reduce a part of test loss caused by the positive loss curvature. We provide a theoretical analysis that shows the proposed algorithm implicitly reduces the target Hessian components as well as the loss. Experimental results show that PoF improved model performance against baseline methods on both CIFAR-10 and CIFAR-100 datasets for only 10-epoch post-training, and on SVHN dataset for 50-epoch post-training. Source code is available at: https://github.com/DensoITLab/PoF-v1

Submitted 5 July, 2022; originally announced July 2022.

Comments: Accepted to ICML2022. Contains a link to the code

arXiv:2206.09132 [pdf, other]

Replacing Labeled Real-image Datasets with Auto-generated Contours

Authors: Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio Yokota

Abstract: In the present work, we show that the performance of formula-driven supervised learning (FDSL) can match or even exceed that of ImageNet-21k without the use of real images, human-, and self-supervision during the pre-training of Vision Transformers (ViTs). For example, ViT-Base pre-trained on ImageNet-21k shows 81.8% top-1 accuracy when fine-tuned on ImageNet-1k and FDSL shows 82.7% top-1 accuracy when pre-trained under the same conditions (number of images, hyperparameters, and number of epochs). Images generated by formulas avoid the privacy/copyright issues, labeling cost and errors, and biases that real images suffer from, and thus have tremendous potential for pre-training general models. To understand the performance of the synthetic images, we tested two hypotheses, namely (i) object contours are what matter in FDSL datasets and (ii) increased number of parameters to create labels affects performance improvement in FDSL pre-training. To test the former hypothesis, we constructed a dataset that consisted of simple object contour combinations. We found that this dataset can match the performance of fractals. For the latter hypothesis, we found that increasing the difficulty of the pre-training task generally leads to better fine-tuning accuracy.

Submitted 18 June, 2022; originally announced June 2022.

Comments: Accepted to CVPR 2022

arXiv:2205.05115 [pdf, other]

doi 10.1029/2023GL102958

First High-speed Video Camera Observations of a Lightning Flash Associated with a Downward Terrestrial Gamma-ray Flash

Authors: R. U. Abbasi, M. M. F. Saba, J. W. Belz, P. R. Krehbiel, W. Rison, N. Kieu, D. R. da Silva, Dan Rodeheffer, M. A. Stanley, J. Remington, J. Mazich, R. LeVon, K. Smout, A. Petrizze, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii , et al. (127 additional authors not shown)

Abstract: In this paper, we present the first high-speed video observation of a cloud-to-ground lightning flash and its associated downward-directed Terrestrial Gamma-ray Flash (TGF). The optical emission of the event was observed by a high-speed video camera running at 40,000 frames per second in conjunction with the Telescope Array Surface Detector, Lightning Mapping Array, interferometer, electric-field fast antenna, and the National Lightning Detection Network. The cloud-to-ground flash associated with the observed TGF was formed by a fast downward leader followed by a very intense return stroke peak current of -154 kA. The TGF occurred while the downward leader was below cloud base, and even when it was halfway in its propagation to ground. The suite of gamma-ray and lightning instruments, timing resolution, and source proximity offer us detailed information and therefore a unique look at the TGF phenomena.

Submitted 9 August, 2023; v1 submitted 10 May, 2022; originally announced May 2022.

Journal ref: Geophysical Research Letters, 50, e2023GL102958 (2023)

arXiv:2204.01512 [pdf, other]

LPAttack: A Feasible Annotation Scheme for Capturing Logic Pattern of Attacks in Arguments

Authors: Farjana Sultana Mim, Naoya Inoue, Shoichi Naito, Keshav Singh, Kentaro Inui

Abstract: In argumentative discourse, persuasion is often achieved by refuting or attacking others arguments. Attacking is not always straightforward and often comprise complex rhetorical moves such that arguers might agree with a logic of an argument while attacking another logic. Moreover, arguer might neither deny nor agree with any logics of an argument, instead ignore them and attack the main stance of the argument by providing new logics and presupposing that the new logics have more value or importance than the logics present in the attacked argument. However, no existing studies in the computational argumentation capture such complex rhetorical moves in attacks or the presuppositions or value judgements in them. In order to address this gap, we introduce LPAttack, a novel annotation scheme that captures the common modes and complex rhetorical moves in attacks along with the implicit presuppositions and value judgements in them. Our annotation study shows moderate inter-annotator agreement, indicating that human annotation for the proposed scheme is feasible. We publicly release our annotated corpus and the annotation guidelines.

Submitted 4 April, 2022; originally announced April 2022.

Comments: 14 pages, 8 figures

arXiv:2201.12246

JEM-EUSO Collaboration contributions to the 37th International Cosmic Ray Conference

Authors: G. Abdellaoui, S. Abe, J. H. Adams Jr., D. Allard, G. Alonso, L. Anchordoqui, A. Anzalone, E. Arnone, K. Asano, R. Attallah, H. Attoui, M. Ave Pernas, M. Bagheri, J. Baláz, M. Bakiri, D. Barghini, S. Bartocci, M. Battisti, J. Bayer, B. Beldjilali, T. Belenguer, N. Belkhalfa, R. Bellotti, A. A. Belov, K. Benmessai , et al. (267 additional authors not shown)

Abstract: Compilation of papers presented by the JEM-EUSO Collaboration at the 37th International Cosmic Ray Conference (ICRC), held on July 12-23, 2021 (online) in Berlin, Germany.

Submitted 28 January, 2022; originally announced January 2022.

Comments: html page with links to the JEM-EUSO Collaboration papers presented at ICRC-2021, Berlin, Germany

arXiv:2201.07313 [pdf, other]

doi 10.3847/1538-4357/ac6def

Search for Spatial Correlations of Neutrinos with Ultra-High-Energy Cosmic Rays

Authors: The ANTARES collaboration, A. Albert, S. Alves, M. André, M. Anghinolfi, M. Ardid, S. Ardid, J. -J. Aubert, J. Aublin, B. Baret, S. Basa, B. Belhorma, M. Bendahman, V. Bertin, S. Biagi, M. Bissinger, J. Boumaaza, M. Bouta, M. C. Bouwhuis, H. Brânzaş, R. Bruijn, J. Brunner, J. Busto, B. Caiffi, D. Calvo , et al. (1025 additional authors not shown)

Abstract: For several decades, the origin of ultra-high-energy cosmic rays (UHECRs) has been an unsolved question of high-energy astrophysics. One approach for solving this puzzle is to correlate UHECRs with high-energy neutrinos, since neutrinos are a direct probe of hadronic interactions of cosmic rays and are not deflected by magnetic fields. In this paper, we present three different approaches for correlating the arrival directions of neutrinos with the arrival directions of UHECRs. The neutrino data is provided by the IceCube Neutrino Observatory and ANTARES, while the UHECR data with energies above $\sim$50 EeV is provided by the Pierre Auger Observatory and the Telescope Array. All experiments provide increased statistics and improved reconstructions with respect to our previous results reported in 2015. The first analysis uses a high-statistics neutrino sample optimized for point-source searches to search for excesses of neutrinos clustering in the vicinity of UHECR directions. The second analysis searches for an excess of UHECRs in the direction of the highest-energy neutrinos. The third analysis searches for an excess of pairs of UHECRs and highest-energy neutrinos on different angular scales. None of the analyses has found a significant excess, and previously reported over-fluctuations are reduced in significance. Based on these results, we further constrain the neutrino flux spatially correlated with UHECRs.

Submitted 23 August, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

Comments: 39 pages, 7 figures, 4 tables; updated source files including xml authorlist

Report number: FERMILAB-PUB-22-033-AD-PPD-SCD-TD

Journal ref: ApJ 934 164 (2022)

arXiv:2201.06674 [pdf, other]

TYPIC: A Corpus of Template-Based Diagnostic Comments on Argumentation

Authors: Shoichi Naito, Shintaro Sawada, Chihiro Nakagawa, Naoya Inoue, Kenshi Yamaguchi, Iori Shimizu, Farjana Sultana Mim, Keshav Singh, Kentaro Inui

Abstract: Providing feedback on the argumentation of the learner is essential for developing critical thinking skills, however, it requires a lot of time and effort. To mitigate the overload on teachers, we aim to automate a process of providing feedback, especially giving diagnostic comments which point out the weaknesses inherent in the argumentation. It is recommended to give specific diagnostic comments so that learners can recognize the diagnosis without misinterpretation. However, it is not obvious how the task of providing specific diagnostic comments should be formulated. We present a formulation of the task as template selection and slot filling to make an automatic evaluation easier and the behavior of the model more tractable. The key to the formulation is the possibility of creating a template set that is sufficient for practical use. In this paper, we define three criteria that a template set should satisfy: expressiveness, informativeness, and uniqueness, and verify the feasibility of creating a template set that satisfies these criteria as a first trial. We will show that it is feasible through an annotation study that converts diagnostic comments given in a text to a template format. The corpus used in the annotation study is publicly available.

Submitted 21 June, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

Comments: LREC2022. The dataset is available at https://github.com/cl-tohoku/TYPIC

arXiv:2111.09962 [pdf, other]

doi 10.1103/PhysRevD.105.062002

Observation of Variations in Cosmic Ray Single Count Rates During Thunderstorms and Implications for Large-Scale Electric Field Changes

Authors: R. U. Abbasi, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, R. Cady, B. G. Cheon, J. Chiba, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, R. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, M. Hayashi , et al. (140 additional authors not shown)

Abstract: We present the first observation by the Telescope Array Surface Detector (TASD) of the effect of thunderstorms on the development of cosmic ray single count rate intensity over a 700 km$^{2}$ area. Observations of variations in the secondary low-energy cosmic ray counting rate, using the TASD, allow us to study the electric field inside thunderstorms, on a large scale, as it progresses on top of the 700 km$^{2}$ detector, without dealing with the limitation of narrow exposure in time and space using balloons and aircraft detectors. In this work, variations in the cosmic ray intensity (single count rate) using the TASD, were studied and found to be on average at the $\sim(0.5-1)\%$ and up to 2\% level. These observations were found to be both in excess and in deficit. They were also found to be correlated with lightning in addition to thunderstorms. These variations lasted for tens of minutes; their footprint on the ground ranged from 6 to 24 km in diameter and moved in the same direction as the thunderstorm. With the use of simple electric field models inside the cloud and between cloud to ground, the observed variations in the cosmic ray single count rate were recreated using CORSIKA simulations. Depending on the electric field model used and the direction of the electric field in that model, the electric field magnitude that reproduces the observed low-energy cosmic ray single count rate variations was found to be approximately between 0.2-0.4 GV. This in turn allows us to get a reasonable insight on the electric field and its effect on cosmic ray air showers inside thunderstorms.

Submitted 18 November, 2021; originally announced November 2021.

arXiv:2110.14827 [pdf, other]

Indications of a Cosmic Ray Source in the Perseus-Pisces Supercluster

Authors: Telescope Array Collaboration, R. U. Abbasi, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, R. Cady, B. G. Cheon, J. Chiba, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, R. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon , et al. (135 additional authors not shown)

Abstract: The Telescope Array Collaboration has observed an excess of events with $E \ge 10^{19.4} ~{\rm eV}$ in the data which is centered at (RA, dec) = ($19^\circ$, $35^\circ$). This is near the center of the Perseus-Pisces supercluster (PPSC). The PPSC is about $70 ~{\rm Mpc}$ distant and is the closest supercluster in the Northern Hemisphere (other than the Virgo supercluster of which we are a part). A Li-Ma oversampling analysis with $20^\circ$-radius circles indicates an excess in the arrival direction of events with a local significance of about 4 standard deviations. The probability of having such excess close to the PPSC by chance is estimated to be 3.5 standard deviations. This result indicates that a cosmic ray source likely exists in that supercluster.

Submitted 27 October, 2021; originally announced October 2021.

Comments: 8 pages, 4 figures, 1 table

arXiv:2110.13692 [pdf, other]

Annotating Implicit Reasoning in Arguments with Causal Links

Authors: Keshav Singh, Naoya Inoue, Farjana Sultana Mim, Shoichi Naitoh, Kentaro Inui

Abstract: Most existing work that focuses on the identification of implicit knowledge in arguments represents implicit knowledge in the form of commonsense or factual knowledge. However, such knowledge is not sufficient to understand the implicit reasoning link between individual argumentative components (i.e., claim and premise). In this work, we focus on identifying the implicit knowledge in the form of argumentation knowledge which can help in understanding the reasoning link in arguments. Being inspired by the Argument from Consequences scheme, we propose a semi-structured template to represent such argumentation knowledge that explicates the implicit reasoning in arguments via causality. We create a novel two-phase annotation process with simplified guidelines and show how to collect and filter high-quality implicit reasonings via crowdsourcing. We find substantial inter-annotator agreement for quality evaluation between experts, but find evidence that raises questions about the feasibility of collecting high-quality semi-structured implicit reasoning through our crowdsourcing process. We release our materials (i.e., crowdsourcing guidelines and collected implicit reasonings) to facilitate further research towards the structured representation of argumentation knowledge.

Submitted 26 October, 2021; originally announced October 2021.

Comments: Accepted to ArgKG:Workshop on Argumentation Knowledge Graphs (AKBC 2021)

arXiv:2110.11934 [pdf, other]

Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts

Authors: Allen Kim, Charuta Pethe, Naoya Inoue, Steve Skiena

Abstract: Substantial amounts of work are required to clean large collections of digitized books for NLP analysis, both because of the presence of errors in the scanned text and the presence of duplicate volumes in the corpora. In this paper, we consider the issue of deduplication in the presence of optical character recognition (OCR) errors. We present methods to handle these errors, evaluated on a collection of 19,347 texts from the Project Gutenberg dataset and 96,635 texts from the HathiTrust Library. We demonstrate that improvements in language models now enable the detection and correction of OCR errors without consideration of the scanning image itself. The inconsistencies found by aligning pairs of scans of the same underlying work provide training data to build models for detecting and correcting errors. We identify the canonical version for each of 17,136 repeatedly-scanned books from 58,808 scans. Finally, we investigate methods to detect and correct errors in single-copy texts. We show that on average, our method corrects over six times as many errors as it introduces. We also provide interesting analysis on the relation between scanning quality and other factors such as location and publication year.

Submitted 22 October, 2021; originally announced October 2021.

Comments: Accepted for Findings of EMNLP 2021

arXiv:2109.06853 [pdf, other]

Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension

Authors: Naoya Inoue, Harsh Trivedi, Steven Sinha, Niranjan Balasubramanian, Kentaro Inui

Abstract: How can we generate concise explanations for multi-hop Reading Comprehension (RC)? The current strategies of identifying supporting sentences can be seen as an extractive question-focused summarization of the input text. However, these extractive explanations are not necessarily concise, i.e., not minimally sufficient for answering a question. Instead, we advocate for an abstractive approach, where we propose to generate a question-focused, abstractive summary of input paragraphs and then feed it to an RC system. Given a limited amount of human-annotated abstractive explanations, we train the abstractive explainer in a semi-supervised manner, where we start from the supervised model and then train it further through trial and error maximizing a conciseness-promoted reward function. Our experiments demonstrate that the proposed abstractive explainer can generate more compact explanations than an extractive explainer with limited supervision (only 2k instances) while maintaining sufficiency.

Submitted 14 September, 2021; originally announced September 2021.

Comments: Accepted to EMNLP2021 Long Paper (Main Track)

arXiv:2104.07924 [pdf, other]

A Comparative Study on Collecting High-Quality Implicit Reasonings at a Large-scale

Authors: Keshav Singh, Paul Reisert, Naoya Inoue, Kentaro Inui

Abstract: Explicating implicit reasoning (i.e. warrants) in arguments is a long-standing challenge for natural language understanding systems. While recent approaches have focused on explicating warrants via crowdsourcing or expert annotations, the quality of warrants has been questionable due to the extreme complexity and subjectivity of the task. In this paper, we tackle the complex task of warrant explication and devise various methodologies for collecting warrants. We conduct an extensive study with trained experts to evaluate the resulting warrants of each methodology and find that our methodologies allow for high-quality warrants to be collected. We construct a preliminary dataset of 6,000 warrants annotated over 600 arguments for 3 debatable topics. To facilitate research in related downstream tasks, we release our guidelines and preliminary dataset.

Submitted 16 April, 2021; originally announced April 2021.

Comments: 2 figures, 3 tables

arXiv:2103.13023 [pdf, other]

Can Vision Transformers Learn without Natural Images?

Authors: Kodai Nakashima, Hirokatsu Kataoka, Asato Matsumoto, Kenji Iwata, Nakamasa Inoue

Abstract: Can we complete pre-training of Vision Transformers (ViT) without natural images and human-annotated labels? Although a pre-trained ViT seems to heavily rely on a large-scale dataset and human-annotated labels, recent large-scale datasets contain several problems in terms of privacy violations, inadequate fairness protection, and labor-intensive annotation. In the present paper, we pre-train ViT without any image collections and annotation labor. We experimentally verify that our proposed framework partially outperforms sophisticated Self-Supervised Learning (SSL) methods like SimCLRv2 and MoCov2 without using any natural images in the pre-training phase. Moreover, although the ViT pre-trained without natural images produces some different visualizations from ImageNet pre-trained ViT, it can interpret natural image datasets to a large extent. For example, the performance rates on the CIFAR-10 dataset are as follows: our proposal 97.6 vs. SimCLRv2 97.4 vs. ImageNet 98.0.

Submitted 24 March, 2021; originally announced March 2021.

Comments: Project page: https://hirokatsukataoka16.github.io/Vision-Transformers-without-Natural-Images/

arXiv:2103.01086 [pdf, ps, other]

doi 10.1016/j.nima.2021.165726

Surface detectors of the TAx4 experiment

Authors: Telescope Array Collaboration, R. U. Abbasi, M. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, R. Cady, B. G. Cheon, J. Chiba, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, R. Fukushima, G. Furlich, W. Hanlon, M. Hayashi, N. Hayashida, K. Hibino, et al. (124 additional authors not shown)

Abstract: Telescope Array (TA) is the largest ultrahigh energy cosmic-ray (UHECR) observatory in the Northern Hemisphere. It explores the origin of UHECRs by measuring their energy spectrum, arrival-direction distribution, and mass composition using a surface detector (SD) array covering approximately 700 km$^2$ and fluorescence detector (FD) stations. TA has found evidence for a cluster of cosmic rays with energies greater than 57 EeV. In order to confirm this evidence with more data, it is necessary to increase the data collection rate. We have begun building an expansion of TA that we call TAx4. In this paper, we explain the motivation, design, technical features, and expected performance of the TAx4 SD. We also present TAx4's current status and examples of the data that have already been collected.

Submitted 1 March, 2021; originally announced March 2021.

Comments: 26 pages, 17 figures, submitted to Nuclear Inst. and Methods in Physics Research, A

arXiv:2102.06540 [pdf, other]

Two Training Strategies for Improving Relation Extraction over Universal Graph

Authors: Qin Dai, Naoya Inoue, Ryo Takahashi, Kentaro Inui

Abstract: This paper explores how the Distantly Supervised Relation Extraction (DS-RE) can benefit from the use of a Universal Graph (UG), the combination of a Knowledge Graph (KG) and a large-scale text collection. A straightforward extension of a current state-of-the-art neural model for DS-RE with a UG may lead to degradation in performance. We first report that this degradation is associated with the difficulty in learning a UG and then propose two training strategies: (1) Path Type Adaptive Pretraining, which sequentially trains the model with different types of UG paths so as to prevent the reliance on a single type of UG path; and (2) Complexity Ranking Guided Attention mechanism, which restricts the attention span according to the complexity of a UG path so as to force the model to extract features not only from simple UG paths but also from complex ones. Experimental results on both biomedical and NYT10 datasets prove the robustness of our methods and achieve a new state-of-the-art result on the NYT10 dataset. The code and datasets used in this paper are available at https://github.com/baodaiqin/UGDSRE.

Submitted 6 May, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

arXiv:2101.08515 [pdf, other]

Pre-training without Natural Images

Authors: Hirokatsu Kataoka, Kazushige Okayasu, Asato Matsumoto, Eisuke Yamagata, Ryosuke Yamada, Nakamasa Inoue, Akio Nakamura, Yutaka Satoh

Abstract: Is it possible to use convolutional neural networks pre-trained without any natural images to assist natural image understanding? The paper proposes a novel concept, Formula-driven Supervised Learning. We automatically generate image patterns and their category labels by assigning fractals, which are based on a natural law existing in the background knowledge of the real world. Theoretically, the use of automatically generated images instead of natural images in the pre-training phase allows us to generate an infinite scale dataset of labeled images. Although the models pre-trained with the proposed Fractal DataBase (FractalDB), a database without natural images, do not necessarily outperform models pre-trained with human annotated datasets at all settings, we are able to partially surpass the accuracy of ImageNet/Places pre-trained models. The image representation with the proposed FractalDB captures a unique feature in the visualization of convolutional layers and attentions.

Submitted 21 January, 2021; originally announced January 2021.

Comments: ACCV 2020 Best Paper Honorable Mention Award, Codes are publicly available: https://github.com/hirokatsukataoka16/FractalDB-Pretrained-ResNet-PyTorch

arXiv:2101.07406 [pdf, ps, other]

Initialization Using Perlin Noise for Training Networks with a Limited Amount of Data

Authors: Nakamasa Inoue, Eisuke Yamagata, Hirokatsu Kataoka

Abstract: We propose a novel network initialization method using Perlin noise for training image classification networks with a limited amount of data. Our main idea is to initialize the network parameters by solving an artificial noise classification problem, where the aim is to classify Perlin noise samples into their noise categories. Specifically, the proposed method consists of two steps. First, it generates Perlin noise samples with category labels defined based on noise complexity. Second, it solves a classification problem, in which network parameters are optimized to classify the generated noise samples. This method produces a reasonable set of initial weights (filters) for image classification. To the best of our knowledge, this is the first work to initialize networks by solving an artificial optimization problem without using any real-world images. Our experiments show that the proposed method outperforms conventional initialization methods on four image classification datasets.

Submitted 18 January, 2021; originally announced January 2021.

Comments: Accepted to ICPR2020

arXiv:2101.01713 [pdf, other]

doi 10.1109/TCSVT.2020.3047977

Learning from Synthetic Shadows for Shadow Detection and Removal

Authors: Naoto Inoue, Toshihiko Yamasaki

Abstract: Shadow removal is an essential task in computer vision and computer graphics. Recent shadow removal approaches all train convolutional neural networks (CNN) on real paired shadow/shadow-free or shadow/shadow-free/mask image datasets. However, obtaining a large-scale, diverse, and accurate dataset has been a big challenge, and it limits the performance of the learned models on shadow images with unseen shapes/intensities. To overcome this challenge, we present SynShadow, a novel large-scale synthetic shadow/shadow-free/matte image triplets dataset and a pipeline to synthesize it. We extend a physically-grounded shadow illumination model and synthesize a shadow image given an arbitrary combination of a shadow-free image, a matte image, and shadow attenuation parameters. Owing to the diversity, quantity, and quality of SynShadow, we demonstrate that shadow removal models trained on SynShadow perform well in removing shadows with diverse shapes and intensities on some challenging benchmarks. Furthermore, we show that merely fine-tuning from a SynShadow-pre-trained model improves existing shadow detection and removal models. Codes are publicly available at https://github.com/naoto0804/SynShadow.

Submitted 13 February, 2021; v1 submitted 5 January, 2021; originally announced January 2021.

Comments: Accepted to IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), v2: fixed typos

arXiv:2011.01785 [pdf, other]

Modeling Event Salience in Narratives via Barthes' Cardinal Functions

Authors: Takaki Otake, Sho Yokoi, Naoya Inoue, Ryo Takahashi, Tatsuki Kuribayashi, Kentaro Inui

Abstract: Events in a narrative differ in salience: some are more important to the story than others. Estimating event salience is useful for tasks such as story generation, and as a tool for text analysis in narratology and folkloristics. To compute event salience without any annotations, we adopt Barthes' definition of event salience and propose several unsupervised methods that require only a pre-trained language model. Evaluating the proposed methods on folktales with event salience annotation, we show that the proposed methods outperform baseline methods and find that fine-tuning a language model on narrative texts is a key factor in improving the proposed methods.

Submitted 3 November, 2020; originally announced November 2020.

Comments: accepted to COLING 2020

arXiv:2010.06137 [pdf, other]

Corruption Is Not All Bad: Incorporating Discourse Structure into Pre-training via Corruption for Essay Scoring

Authors: Farjana Sultana Mim, Naoya Inoue, Paul Reisert, Hiroki Ouchi, Kentaro Inui

Abstract: Existing approaches for automated essay scoring and document representation learning typically rely on discourse parsers to incorporate discourse structure into text representation. However, the performance of parsers is not always adequate, especially when they are used on noisy texts, such as student essays. In this paper, we propose an unsupervised pre-training approach to capture discourse structure of essays in terms of coherence and cohesion that does not require any discourse parser or annotation. We introduce several types of token, sentence and paragraph-level corruption techniques for our proposed pre-training approach and augment masked language modeling pre-training with our pre-training method to leverage both contextualized and discourse information. Our proposed unsupervised approach achieves a new state-of-the-art result on the essay Organization scoring task.

Submitted 12 October, 2020; originally announced October 2020.

arXiv:2009.14327 [pdf, other]

doi 10.1029/2019JD031940

Observations of the Origin of Downward Terrestrial Gamma-Ray Flashes

Authors: J. W. Belz, P. R. Krehbiel, J. Remington, M. A. Stanley, R. U. Abbasi, R. LeVon, W. Rison, D. Rodeheffer, the Telescope Array Scientific Collaboration, T. Abu-Zayyad, M. Allen, E. Barcikowski, D. R. Bergman, S. A. Blake, M. Byrne, R. Cady, B. G. Cheon, M. Chikawa, A. di Matteo, T. Fujii, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, et al. (116 additional authors not shown)

Abstract: In this paper we report the first close, high-resolution observations of downward-directed terrestrial gamma-ray flashes (TGFs) detected by the large-area Telescope Array cosmic ray observatory, obtained in conjunction with broadband VHF interferometer and fast electric field change measurements of the parent discharge. The results show that the TGFs occur during strong initial breakdown pulses (IBPs) in the first few milliseconds of negative cloud-to-ground and low-altitude intracloud flashes, and that the IBPs are produced by a newly-identified streamer-based discharge process called fast negative breakdown. The observations indicate the relativistic runaway electron avalanches (RREAs) responsible for producing the TGFs are initiated by embedded spark-like transient conducting events (TCEs) within the fast streamer system, and potentially also by individual fast streamers themselves. The TCEs are inferred to be the cause of impulsive sub-pulses that are characteristic features of classic IBP sferics. Additional development of the avalanches would be facilitated by the enhanced electric field ahead of the advancing front of the fast negative breakdown.
In addition to showing the nature of IBPs and their enigmatic sub-pulses, the observations also provide a possible explanation for the unsolved question of how the streamer to leader transition occurs during the initial negative breakdown, namely as a result of strong currents flowing in the final stage of successive IBPs, extending backward through both the IBP itself and the negative streamer breakdown preceding the IBP.

Submitted 12 October, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

Comments: Typo fixed and reference added. Manuscript is 36 pages. Supplemental Information is 42 pages. This paper is to be published in the Journal of Geophysical Research: Atmospheres. Online data repository: Open Science Framework DOI: 10.17605/OSF.IO/Z3XDA

arXiv:2007.00023 [pdf, other]

doi 10.3847/2041-8213/aba0bc

Search for Large-scale Anisotropy on Arrival Directions of Ultra-high-energy Cosmic Rays Observed with the Telescope Array Experiment

Authors: Telescope Array Collaboration, R. U. Abbasi, M. Abe, T. Abu-Zayyad, M. Allen, R. Azuma, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, R. Cady, B. G. Cheon, J. Chiba, M. Chikawa, A. di Matteo, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, W. Hanlon, M. Hayashi, N. Hayashida, K. Hibino, et al. (121 additional authors not shown)

Abstract: Motivated by the detection of a significant dipole structure in the arrival directions of ultrahigh-energy cosmic rays above 8 EeV reported by the Pierre Auger Observatory (Auger), we search for a large-scale anisotropy using data collected with the surface detector array of the Telescope Array Experiment (TA). With 11 years of TA data, a dipole structure in a projection of the right ascension is fitted with an amplitude of 3.3 ± 1.9% and a phase of 131 ± 33 degrees. The corresponding 99% confidence-level upper limit on the amplitude is 7.3%. At the current level of statistics, the fitted result is compatible with both an isotropic distribution and the dipole structure reported by Auger.

Submitted 27 July, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

Comments: 6 pages, 3 figures, 1 table, Proofed title. Added journal reference and DOI

Journal ref: The Astrophysical Journal Letters 898, L28 (2020)

arXiv:2006.05012 [pdf, other]

doi 10.1103/PhysRevD.102.062004

Measurement of the Proton-Air Cross Section with Telescope Array's Black Rock Mesa and Long Ridge Fluorescence Detectors, and Surface Array in Hybrid Mode

Authors: R. U. Abbasi, M. Abe, T. Abu-Zayyad, M. Allen, R. Azuma, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, R. Cady, B. G. Cheon, J. Chiba, M. Chikawa, A. di Matteo, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, W. Hanlon, M. Hayashi, N. Hayashida, K. Hibino, R. Higuchi, et al. (120 additional authors not shown)

Abstract: Ultra high energy cosmic rays provide the highest known energy source in the universe to measure proton cross sections. Though conditions for collecting such data are less controlled than an accelerator environment, current generation cosmic ray observatories have large enough exposures to collect significant statistics for a reliable measurement for energies above what can be attained in the lab. Cosmic ray measurements of cross section use atmospheric calorimetry to measure depth of air shower maximum ($X_{\mathrm{max}}$), which is related to the primary particle's energy and mass. The tail of the $X_{\mathrm{max}}$ distribution is assumed to be dominated by showers generated by protons, allowing measurement of the inelastic proton-air cross section. In this work the proton-air inelastic cross section measurement, $\sigma^{\mathrm{inel}}_{\mathrm{p-air}}$, using data observed by Telescope Array's Black Rock Mesa and Long Ridge fluorescence detectors and surface detector array in hybrid mode is presented. $\sigma^{\mathrm{inel}}_{\mathrm{p-air}}$ is observed to be $520.1 \pm 35.8$ [Stat.] $^{+25.0}_{-40}$ [Sys.] mb at $\sqrt{s} = 73$ TeV. The total proton-proton cross section is subsequently inferred from Glauber formalism and is found to be $\sigma^{\mathrm{tot}}_{\mathrm{pp}} = 139.4 ^{+23.4}_{-21.3}$ [Stat.] $^{+15.0}_{-24.0}$ [Sys.] mb.

Submitted 8 June, 2020; originally announced June 2020.

Journal ref: Phys. Rev. D 102, 062004 (2020)

arXiv:2006.04326 [pdf, ps, other]

Semi-Supervised Contrastive Learning with Generalized Contrastive Loss and Its Application to Speaker Recognition

Authors: Nakamasa Inoue, Keita Goto

Abstract: This paper introduces a semi-supervised contrastive learning framework and its application to text-independent speaker verification. The proposed framework employs generalized contrastive loss (GCL). GCL unifies losses from two different learning frameworks, supervised metric learning and unsupervised contrastive learning, and thus it naturally determines the loss for semi-supervised learning. In experiments, we applied the proposed framework to text-independent speaker verification on the VoxCeleb dataset. We demonstrate that GCL enables the learning of speaker embeddings in three manners (supervised learning, semi-supervised learning, and unsupervised learning) without any changes in the definition of the loss function.

Submitted 7 June, 2020; originally announced June 2020.

Showing 1–50 of 115 results for author: Inoue, N