Skip to main content

Showing 1–3 of 3 results for author: Koike, R

.
  1. arXiv:2402.15987  [pdf, other

    cs.CL cs.AI

    Likelihood-based Mitigation of Evaluation Bias in Large Language Models

    Authors: Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki

    Abstract: Large Language Models (LLMs) are widely used to evaluate natural language generation tasks as automated metrics. However, the likelihood, a measure of LLM's plausibility for a sentence, can vary due to superficial differences in sentences, such as word order and sentence structure. It is therefore possible that there might be a likelihood bias if LLMs are used for evaluation: they might overrate s… ▽ More

    Submitted 1 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: 4 main pages

  2. arXiv:2311.08369  [pdf, other

    cs.CL

    How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection

    Authors: Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    Abstract: To combat the misuse of Large Language Models (LLMs), many recent studies have presented LLM-generated-text detectors with promising performance. When users instruct LLMs to generate texts, the instruction can include different constraints depending on the user's need. However, most recent studies do not cover such diverse instruction patterns when creating datasets for LLM detection. In this pape… ▽ More

    Submitted 12 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: under review

  3. arXiv:2307.11729  [pdf, other

    cs.CL

    OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples

    Authors: Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki

    Abstract: Large Language Models (LLMs) have achieved human-level fluency in text generation, making it difficult to distinguish between human-written and LLM-generated texts. This poses a growing risk of misuse of LLMs and demands the development of detectors to identify LLM-generated texts. However, existing detectors lack robustness against attacks: they degrade detection accuracy by simply paraphrasing L… ▽ More

    Submitted 18 February, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: AAAI 2024 camera ready. Code and dataset available at https://github.com/ryuryukke/OUTFOX