Skip to main content

Showing 1–3 of 3 results for author: Shibata, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.07687  [pdf, other

    cs.LG cs.CL cs.FL

    MLRegTest: A Benchmark for the Machine Learning of Regular Languages

    Authors: Sam van der Poel, Dakotah Lambert, Kalina Kostyszyn, Tiantian Gao, Rahul Verma, Derek Andersen, Joanne Chau, Emily Peterson, Cody St. Clair, Paul Fodor, Chihiro Shibata, Jeffrey Heinz

    Abstract: Evaluating machine learning (ML) systems on their ability to learn known classifiers allows fine-grained examination of the patterns they can learn, which builds confidence when they are applied to the learning of unknown classifiers. This article presents a new benchmark for ML systems on sequence classification called MLRegTest, which contains training, development, and test sets from 1,800 regu… ▽ More

    Submitted 8 November, 2023; v1 submitted 15 April, 2023; originally announced April 2023.

    Comments: 38 pages, MLRegTest benchmark available at https://doi.org/10.5061/dryad.dncjsxm4h , associated code at https://github.com/heinz-jeffrey/subregular-learning

  2. arXiv:2010.00363  [pdf, other

    cs.CL

    How LSTM Encodes Syntax: Exploring Context Vectors and Semi-Quantization on Natural Text

    Authors: Chihiro Shibata, Kei Uchiumi, Daichi Mochihashi

    Abstract: Long Short-Term Memory recurrent neural network (LSTM) is widely used and known to capture informative long-term syntactic dependencies. However, how such information are reflected in its internal vectors for natural text has not yet been sufficiently investigated. We analyze them by learning a language model where syntactic structures are implicitly given. We empirically show that the context upd… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

  3. arXiv:1705.05940  [pdf, ps, other

    cs.CL

    Subregular Complexity and Deep Learning

    Authors: Enes Avcu, Chihiro Shibata, Jeffrey Heinz

    Abstract: This paper argues that the judicial use of formal language theory and grammatical inference are invaluable tools in understanding how deep neural networks can and cannot represent and learn long-term dependencies in temporal sequences. Learning experiments were conducted with two types of Recurrent Neural Networks (RNNs) on six formal languages drawn from the Strictly Local (SL) and Strictly Piece… ▽ More

    Submitted 14 October, 2017; v1 submitted 16 May, 2017; originally announced May 2017.