Showing 1–2 of 2 results for author: Maekawa, A

  1. arXiv:2404.00264

    cs.CL cs.LG

    DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation

    Authors: Aru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, Manabu Okumura

    Abstract: Dataset distillation aims to compress a training dataset by creating a small number of informative synthetic samples such that neural networks trained on them perform as well as those trained on the original training dataset. Current text dataset distillation methods create each synthetic sample as a sequence of word embeddings instead of a text to apply gradient-based optimization; however, such…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted by Findings of NAACL 2024

  2. arXiv:2403.05065

    cs.CL

    Can we obtain significant success in RST discourse parsing by using Large Language Models?

    Authors: Aru Maekawa, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura

    Abstract: Recently, decoder-only pre-trained large language models (LLMs), with several tens of billion parameters, have significantly impacted a wide range of natural language processing (NLP) tasks. While encoder-only or encoder-decoder pre-trained language models have already proved to be effective in discourse parsing, the extent to which LLMs can perform this task remains an open research question. The…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted in the main conference of EACL 2024