Skip to main content

Showing 1–5 of 5 results for author: Shinzato, K

.
  1. arXiv:2403.15484  [pdf, other

    cs.CL cs.LG

    RakutenAI-7B: Extending Large Language Models for Japanese

    Authors: Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi, Justin Chiu, Kai Torben Ohlhus, Karan Chopra, Keiji Shinzato, Koji Murakami, Lee Xiong, Lei Chen, Maki Kubota, Maksim Tkachenko, Miroku Lee, Naoki Takahashi, Prathyusha Jwalapuram, Ryutaro Tatsushima, Saurabh Jain, Sunil Kumar Yadav , et al. (5 additional authors not shown)

    Abstract: We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

    Submitted 21 March, 2024; originally announced March 2024.

  2. arXiv:2310.07170  [pdf, other

    cs.CL

    PHALM: Building a Knowledge Graph from Scratch by Prompting Humans and a Language Model

    Authors: Tatsuya Ide, Eiki Murata, Daisuke Kawahara, Takato Yamazaki, Shengzhe Li, Kenta Shinzato, Toshinori Sato

    Abstract: Despite the remarkable progress in natural language understanding with pretrained Transformers, neural language models often do not handle commonsense knowledge well. Toward commonsense-aware models, there have been attempts to obtain knowledge, ranging from automatic acquisition to crowdsourcing. However, it is difficult to obtain a high-quality knowledge base at a low cost, especially from scrat… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  3. arXiv:2306.05605  [pdf, other

    cs.CL cs.AI

    A Unified Generative Approach to Product Attribute-Value Identification

    Authors: Keiji Shinzato, Naoki Yoshinaga, Yandi Xia, Wei-Te Chen

    Abstract: Product attribute-value identification (PAVI) has been studied to link products on e-commerce sites with their attribute values (e.g., <Material, Cotton>) using product text as clues. Technical demands from real-world e-commerce platforms require PAVI methods to handle unseen values, multi-attribute values, and canonicalized values, which are only partly addressed in existing extraction- and class… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to the Findings of ACL 2023

  4. Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

    Authors: Keiji Shinzato, Naoki Yoshinaga, Yandi Xia, Wei-Te Chen

    Abstract: A key challenge in attribute value extraction (AVE) from e-commerce sites is how to handle a large number of attributes for diverse products. Although this challenge is partially addressed by a question answering (QA) approach which finds a value in product data for a given query (attribute), it does not work effectively for rare and ambiguous queries. We thus propose simple knowledge-driven query… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: Published at ACL 2022

    Journal ref: Proceedings of ACL 2022 (Volume 2: Short Papers), 227--234

  5. arXiv:2206.05399  [pdf, other

    cs.CL

    Building a Personalized Dialogue System with Prompt-Tuning

    Authors: Tomohito Kasahara, Daisuke Kawahara, Nguyen Tung, Shengzhe Li, Kenta Shinzato, Toshinori Sato

    Abstract: Dialogue systems without consistent responses are not fascinating. In this study, we build a dialogue system that can respond based on a given character setting (persona) to bring consistency. Considering the trend of the rapidly increasing scale of language models, we propose an approach that uses prompt-tuning, which has low learning costs, on pre-trained large-scale language models. The results… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted to NAACL 2022 SRW