Showing 1–30 of 30 results for author: Shindo, H

Searching in archive cs.
  1. arXiv:2406.15358  [pdf, other]

    cs.CL

    Introducing Syllable Tokenization for Low-resource Languages: A Case Study with Swahili

    Authors: Jesse Atuhurra, Hiroyuki Shindo, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Many attempts have been made in multilingual NLP to ensure that pre-trained language models, such as mBERT or GPT2 get better and become applicable to low-resource languages. To achieve multilingualism for pre-trained language models (PLMs), we need techniques to create word embeddings that capture the linguistic characteristics of any language. Tokenization is one such technique because it allows…

    Submitted 26 March, 2024; originally announced June 2024.

  2. arXiv:2406.06107  [pdf, other]

    cs.AI

    EXPIL: Explanatory Predicate Invention for Learning in Games

    Authors: Jingyuan Sha, Hikaru Shindo, Quentin Delfosse, Kristian Kersting, Devendra Singh Dhami

    Abstract: Reinforcement learning (RL) has proven to be a powerful tool for training agents that excel in various games. However, the black-box nature of neural network models often hinders our ability to understand the reasoning behind the agent's actions. Recent research has attempted to address this issue by using the guidance of pretrained neural agents to encode logic-based policies, allowing for interp…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 pages references, 8 figures, 3 tables

  3. arXiv:2403.15430  [pdf, other]

    cs.CL

    Distilling Named Entity Recognition Models for Endangered Species from Large Language Models

    Authors: Jesse Atuhurra, Seiveright Cargill Dujohn, Hidetaka Kamigaito, Hiroyuki Shindo, Taro Watanabe

    Abstract: Natural language processing (NLP) practitioners are leveraging large language models (LLM) to create structured datasets from semi-structured and unstructured data sources such as patents, papers, and theses, without having domain-specific knowledge. At the same time, ecological experts are searching for a variety of means to preserve biodiversity. To contribute to these efforts, we focused on end…

    Submitted 13 March, 2024; originally announced March 2024.

  4. arXiv:2402.14123  [pdf, other]

    cs.LG cs.AI cs.CV

    DeiSAM: Segment Anything with Deictic Prompting

    Authors: Hikaru Shindo, Manuel Brack, Gopika Sudhakaran, Devendra Singh Dhami, Patrick Schramowski, Kristian Kersting

    Abstract: Large-scale, pre-trained neural networks have demonstrated strong capabilities in various tasks, including zero-shot image segmentation. To identify concrete objects in complex scenes, humans instinctively rely on deictic descriptions in natural language, i.e., referring to something depending on the context such as "The object that is on the desk and behind the cup.". However, deep learning appro…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Preprint

  5. arXiv:2309.08395  [pdf, other]

    cs.AI cs.LG

    Learning by Self-Explaining

    Authors: Wolfgang Stammer, Felix Friedrich, David Steinmann, Manuel Brack, Hikaru Shindo, Kristian Kersting

    Abstract: Current AI research mainly treats explanations as a means for model inspection. Yet, this neglects findings from human psychology that describe the benefit of self-explanations in an agent's learning process. Motivated by this, we introduce a novel approach in the context of image classification, termed Learning by Self-Explaining (LSX). LSX utilizes aspects of self-refining AI and human-guided ex…

    Submitted 5 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  6. arXiv:2307.00928  [pdf, other]

    cs.LG cs.AI cs.CV

    Learning Differentiable Logic Programs for Abstract Visual Reasoning

    Authors: Hikaru Shindo, Viktor Pfanschilling, Devendra Singh Dhami, Kristian Kersting

    Abstract: Visual reasoning is essential for building intelligent agents that understand the world and perform problem-solving beyond perception. Differentiable forward reasoning has been developed to integrate reasoning with gradient-based machine learning paradigms. However, due to the memory intensity, most existing approaches do not bring the best of the expressivity of first-order logic, excluding a cru…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: under review

  7. arXiv:2306.07743  [pdf, other]

    cs.AI cs.CV cs.LG

    V-LoL: A Diagnostic Dataset for Visual Logical Learning

    Authors: Lukas Helff, Wolfgang Stammer, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Despite the successes of recent developments in visual AI, different shortcomings still exist; from missing exact logical reasoning, to abstract generalization abilities, to understanding complex and noisy scenes. Unfortunately, existing benchmarks, were not designed to capture more than a few of these aspects. Whereas deep learning datasets focus on visually complex data but simple visual reasoni…

    Submitted 3 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  8. arXiv:2306.01439  [pdf, other]

    cs.LG cs.AI cs.CL cs.LO cs.SC

    Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction

    Authors: Quentin Delfosse, Hikaru Shindo, Devendra Dhami, Kristian Kersting

    Abstract: The limited priors required by neural networks make them the dominating choice to encode and learn policies using reinforcement learning (RL). However, they are also black-boxes, making it hard to understand the agent's behaviour, especially when working on the image level. Therefore, neuro-symbolic RL aims at creating policies that are interpretable in the first place. Unfortunately, interpretabi…

    Submitted 25 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 9 main pages + appendix (19 in total)

  9. arXiv:2305.13844  [pdf, other]

    cs.CL

    Arukikata Travelogue Dataset with Geographic Entity Mention, Coreference, and Link Annotation

    Authors: Shohei Higashiyama, Hiroki Ouchi, Hiroki Teranishi, Hiroyuki Otomo, Yusuke Ide, Aitaro Yamamoto, Hiroyuki Shindo, Yuki Matsuda, Shoko Wakamiya, Naoya Inoue, Ikuya Yamada, Taro Watanabe

    Abstract: Geoparsing is a fundamental technique for analyzing geo-entity information in text. We focus on document-level geoparsing, which considers geographic relatedness among geo-entity mentions, and presents a Japanese travelogue dataset designed for evaluating document-level geoparsing systems. Our dataset comprises 200 travelogue documents with rich geo-entity information: 12,171 mentions, 6,339 coref…

    Submitted 23 May, 2023; originally announced May 2023.

  10. arXiv:2305.11444  [pdf, other]

    cs.CL cs.AI cs.DL

    Arukikata Travelogue Dataset

    Authors: Hiroki Ouchi, Hiroyuki Shindo, Shoko Wakamiya, Yuki Matsuda, Naoya Inoue, Shohei Higashiyama, Satoshi Nakamura, Taro Watanabe

    Abstract: We have constructed Arukikata Travelogue Dataset and released it free of charge for academic research. This dataset is a Japanese text dataset with a total of over 31 million words, comprising 4,672 Japanese domestic travelogues and 9,607 overseas travelogues. Before providing our dataset, there was a scarcity of widely available travelogue data for research purposes, and each researcher had to pr…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: The application website for Arukikata Travelogue Dataset: https://www.nii.ac.jp/dsc/idr/arukikata/

  11. arXiv:2211.11650  [pdf, other]

    cs.AI

    Neural Meta-Symbolic Reasoning and Learning

    Authors: Zihan Ye, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Deep neural learning uses an increasing amount of computation and data to solve very specific problems. By stark contrast, human minds solve a wide range of problems using a fixed amount of computation and limited experience. One ability that seems crucial to this kind of general intelligence is meta-reasoning, i.e., our ability to reason about reasoning. To make deep learning do more from less, w…

    Submitted 15 December, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  12. arXiv:2208.13518  [pdf, other]

    cs.AI cs.CL cs.CV cs.LO cs.SC

    LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems

    Authors: Björn Deiseroth, Patrick Schramowski, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Text-to-image models have recently achieved remarkable success with seemingly accurate samples in photo-realistic quality. However as state-of-the-art language models still struggle evaluating precise statements consistently, so do language model based image generation processes. In this work we showcase problems of state-of-the-art text-to-image models like DALL-E with generating accurate samples…

    Submitted 29 August, 2022; originally announced August 2022.

  13. arXiv:2110.09383  [pdf, other]

    cs.AI cs.CV cs.LG

    Neuro-Symbolic Forward Reasoning

    Authors: Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Reasoning is an essential part of human intelligence and thus has been a long-standing goal in artificial intelligence research. With the recent success of deep learning, incorporating reasoning with deep learning systems, i.e., neuro-symbolic AI has become a major field of interest. We propose the Neuro-Symbolic Forward Reasoner (NSFR), a new approach for reasoning tasks taking advantage of diffe…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Preprint

  14. arXiv:2103.01719  [pdf, other]

    cs.AI cs.LG

    Differentiable Inductive Logic Programming for Structured Examples

    Authors: Hikaru Shindo, Masaaki Nishino, Akihiro Yamamoto

    Abstract: The differentiable implementation of logic yields a seamless combination of symbolic reasoning and deep neural networks. Recent research, which has developed a differentiable framework to learn logic programs from examples, can even acquire reasonable solutions from noisy datasets. However, this framework severely limits expressions for solutions, e.g., no function symbols are allowed, and the sha…

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted by AAAI2021

  15. arXiv:2010.01057  [pdf, other]

    cs.CL cs.LG

    LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

    Authors: Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto

    Abstract: Entity representations are useful in natural language tasks involving entities. In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer. The proposed model treats words and entities in a given text as independent tokens, and outputs contextualized representations of them. Our model is trained using a new pretraining task…

    Submitted 2 October, 2020; originally announced October 2020.

    Comments: EMNLP 2020

  16. arXiv:2003.03960  [pdf, other]

    cs.LG stat.ML

    Metric Learning for Ordered Labeled Trees with pq-grams

    Authors: Hikaru Shindo, Masaaki Nishino, Yasuaki Kobayashi, Akihiro Yamamoto

    Abstract: Computing the similarity between two data points plays a vital role in many machine learning algorithms. Metric learning has the aim of learning a good metric automatically from data. Most existing studies on metric learning for tree-structured data have adopted the approach of learning the tree edit distance. However, the edit distance is not amenable for big data analysis because it incurs high…

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: Accepted at ECAI 2020 (full paper)

  17. arXiv:2001.07331  [pdf, ps, other]

    cs.CL

    Length-controllable Abstractive Summarization by Guiding with Summary Prototype

    Authors: Itsumi Saito, Kyosuke Nishida, Kosuke Nishida, Atsushi Otsuka, Hisako Asano, Junji Tomita, Hiroyuki Shindo, Yuji Matsumoto

    Abstract: We propose a new length-controllable abstractive summarization model. Recent state-of-the-art abstractive summarization models based on encoder-decoder models generate only one summary per source text. However, controllable summarization, especially of the length, is an important aspect for practical applications. Previous studies on length-controllable abstractive summarization incorporate length…

    Submitted 20 January, 2020; originally announced January 2020.

  18. arXiv:1909.01259  [pdf, other]

    cs.CL cs.LG

    Neural Attentive Bag-of-Entities Model for Text Classification

    Authors: Ikuya Yamada, Hiroyuki Shindo

    Abstract: This study proposes a Neural Attentive Bag-of-Entities model, which is a neural network model that performs text classification using entities in a knowledge base. Entities provide unambiguous and relevant semantic signals that are beneficial for capturing semantics in texts. We combine simple high-recall entity detection based on a dictionary, to detect entities in a document, with a novel neural…

    Submitted 10 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: Accepted to CoNLL 2019

  19. arXiv:1909.00426  [pdf, other]

    cs.CL cs.LG

    Global Entity Disambiguation with BERT

    Authors: Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto

    Abstract: We propose a global entity disambiguation (ED) model based on BERT. To capture global contextual information for ED, our model treats not only words but also entities as input tokens, and solves the task by sequentially resolving mentions to their referent entities and using resolved entities as inputs at each step. We train the model using a large entity-annotated corpus obtained from Wikipedia.…

    Submitted 1 May, 2022; v1 submitted 1 September, 2019; originally announced September 2019.

    Comments: NAACL 2022

  20. arXiv:1909.00259  [pdf, ps, other]

    cs.LG q-bio.BM stat.ML

    Gated Graph Recursive Neural Networks for Molecular Property Prediction

    Authors: Hiroyuki Shindo, Yuji Matsumoto

    Abstract: Molecule property prediction is a fundamental problem for computer-aided drug discovery and materials science. Quantum-chemical simulations such as density functional theory (DFT) have been widely used for calculating the molecule properties, however, because of the heavy computational cost, it is difficult to search a huge number of potential chemical compounds. Machine learning methods for molec…

    Submitted 25 November, 2019; v1 submitted 31 August, 2019; originally announced September 2019.

  21. arXiv:1908.05691  [pdf]

    cs.CL

    Improving Multi-Word Entity Recognition for Biomedical Texts

    Authors: Hamada A. Nayel, H. L. Shashirekha, Hiroyuki Shindo, Yuji Matsumoto

    Abstract: Biomedical Named Entity Recognition (BioNER) is a crucial step for analyzing Biomedical texts, which aims at extracting biomedical named entities from a given text. Different supervised machine learning algorithms have been applied for BioNER by various researchers. The main requirement of these approaches is an annotated dataset used for learning the parameters of machine learning algorithms. Seg…

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: 13 pages, 2 figures, International Conference on Cognitive Informatics and Soft Computing (ICCISC-2017)

    Journal ref: International Journal of Pure and Applied Mathematics, Volume 118 No. 16, 2018

  22. arXiv:1812.06280  [pdf, other]

    cs.CL cs.LG

    Wikipedia2Vec: An Efficient Toolkit for Learning and Visualizing the Embeddings of Words and Entities from Wikipedia

    Authors: Ikuya Yamada, Akari Asai, Jin Sakuma, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji, Yuji Matsumoto

    Abstract: The embeddings of entities in a large knowledge base (e.g., Wikipedia) are highly beneficial for solving various natural language tasks that involve real world knowledge. In this paper, we present Wikipedia2Vec, a Python-based open-source tool for learning the embeddings of words and entities from Wikipedia. The proposed tool enables users to learn the embeddings efficiently by issuing a single co…

    Submitted 26 September, 2020; v1 submitted 15 December, 2018; originally announced December 2018.

    Comments: EMNLP 2020 (system demonstration)

  23. arXiv:1811.04319  [pdf, other]

    cs.LG cs.CL stat.ML

    Playing by the Book: An Interactive Game Approach for Action Graph Extraction from Text

    Authors: Ronen Tamari, Hiroyuki Shindo, Dafna Shahaf, Yuji Matsumoto

    Abstract: Understanding procedural text requires tracking entities, actions and effects as the narrative unfolds. We focus on the challenging real-world problem of action-graph extraction from material science papers, where language is highly specialized and data annotation is expensive and scarce. We propose a novel approach, Text2Quest, where procedural text is interpreted as instructions for an interacti…

    Submitted 6 April, 2019; v1 submitted 10 November, 2018; originally announced November 2018.

    Comments: Accepted to NAACL 2019 ESSP workshop (https://scientific-knowledge.github.io/)

  24. arXiv:1810.02245  [pdf, other]

    cs.CL

    A Span Selection Model for Semantic Role Labeling

    Authors: Hiroki Ouchi, Hiroyuki Shindo, Yuji Matsumoto

    Abstract: We present a simple and accurate span-based model for semantic role labeling (SRL). Our model directly takes into account all possible argument spans and scores them for each label. At decoding time, we greedily select higher scoring labeled spans. One advantage of our model is to allow us to design and use span-level features, that are difficult to use in token-based BIO tagging approaches. Exper…

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: Accepted by EMNLP 2018

  25. arXiv:1806.02960  [pdf, other]

    cs.CL cs.NE

    Representation Learning of Entities and Documents from Knowledge Base Descriptions

    Authors: Ikuya Yamada, Hiroyuki Shindo, Yoshiyasu Takefuji

    Abstract: In this paper, we describe TextEnt, a neural network model that learns distributed representations of entities and documents directly from a knowledge base (KB). Given a document in a KB consisting of words and entity annotations, we train our model to predict the entity that the document describes and map the document and its target entity close to each other in a continuous vector space. Our mod…

    Submitted 7 June, 2018; originally announced June 2018.

    Comments: Accepted at COLING 2018

  26. arXiv:1805.02917  [pdf, other]

    cs.LG cs.CL stat.ML

    Interpretable Adversarial Perturbation in Input Embedding Space for Text

    Authors: Motoki Sato, Jun Suzuki, Hiroyuki Shindo, Yuji Matsumoto

    Abstract: Following great success in the image processing field, the idea of adversarial training has been applied to tasks in the natural language processing (NLP) field. One promising approach directly applies adversarial training developed in the image processing field to the input word embedding space instead of the discrete input space of texts. However, this approach abandons such interpretability as…

    Submitted 8 May, 2018; originally announced May 2018.

    Comments: 8 pages, 4 figures

    Journal ref: IJCAI-ECAI-2018

  27. arXiv:1803.08652  [pdf, other]

    cs.CL

    Studio Ousia's Quiz Bowl Question Answering System

    Authors: Ikuya Yamada, Ryuji Tamaki, Hiroyuki Shindo, Yoshiyasu Takefuji

    Abstract: In this chapter, we describe our question answering system, which was the winning system at the Human-Computer Question Answering (HCQA) Competition at the Thirty-first Annual Conference on Neural Information Processing Systems (NIPS). The competition requires participants to address a factoid question answering task referred to as quiz bowl. To address this task, we use two novel neural network m…

    Submitted 23 March, 2018; originally announced March 2018.

    Comments: This is a preprint of a Springer book chapter from NIPS 2017 Competitions proceedings

  28. arXiv:1705.02494  [pdf, other]

    cs.CL cs.NE

    Learning Distributed Representations of Texts and Entities from Knowledge Base

    Authors: Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji

    Abstract: We describe a neural network model that jointly learns distributed representations of texts and knowledge base (KB) entities. Given a text in the KB, we train our proposed model to predict entities that are relevant to the text. Our model is designed to be generic with the ability to address various NLP tasks with ease. We train the model using a large corpus of texts and their entity annotations…

    Submitted 7 November, 2017; v1 submitted 6 May, 2017; originally announced May 2017.

    Journal ref: Transactions of the Association for Computational Linguistics, 5 (2017), 397-411

  29. arXiv:1703.04914  [pdf, ps, other]

    cs.CL cs.IR

    Ensemble of Neural Classifiers for Scoring Knowledge Base Triples

    Authors: Ikuya Yamada, Motoki Sato, Hiroyuki Shindo

    Abstract: This paper describes our approach for the triple scoring task at the WSDM Cup 2017. The task required participants to assign a relevance score for each pair of entities and their types in a knowledge base in order to enhance the ranking results in entity retrieval tasks. We propose an approach wherein the outputs of multiple neural network classifiers are combined using a supervised machine learni…

    Submitted 5 April, 2017; v1 submitted 15 March, 2017; originally announced March 2017.

    Comments: WSDM Cup 2017

  30. arXiv:1601.01343  [pdf, ps, other]

    cs.CL

    Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation

    Authors: Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji

    Abstract: Named Entity Disambiguation (NED) refers to the task of resolving multiple named entity mentions in a document to their correct references in a knowledge base (KB) (e.g., Wikipedia). In this paper, we propose a novel embedding method specifically designed for NED. The proposed method jointly maps words and entities into the same continuous vector space. We extend the skip-gram model by using two m…

    Submitted 9 June, 2016; v1 submitted 6 January, 2016; originally announced January 2016.

    Comments: Accepted at CoNLL 2016