Skip to main content

Showing 1–4 of 4 results for author: Lazar, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.11625  [pdf, other

    cs.CL

    SpeCrawler: Generating OpenAPI Specifications from API Documentation Using Large Language Models

    Authors: Koren Lazar, Matan Vetzler, Guy Uziel, David Boaz, Esther Goldbraich, David Amid, Ateret Anaby-Tavor

    Abstract: In the digital era, the widespread use of APIs is evident. However, scalable utilization of APIs poses a challenge due to structure divergence observed in online API documentation. This underscores the need for automatic tools to facilitate API consumption. A viable approach involves the conversion of documentation into an API Specification format. While previous attempts have been made using rule… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Under Review for KDD 2024

  2. arXiv:2303.01593  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    QAID: Question Answering Inspired Few-shot Intent Detection

    Authors: Asaf Yehudai, Matan Vetzler, Yosi Mass, Koren Lazar, Doron Cohen, Boaz Carmeli

    Abstract: Intent detection with semantically similar fine-grained intents is a challenging task. To address it, we reformulate intent detection as a question-answering retrieval task by treating utterances and intent names as questions and answers. To that end, we utilize a question-answering retrieval architecture and adopt a two stages training schema with batch contrastive loss. In the pre-training stage… ▽ More

    Submitted 21 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: ICLR paper

  3. arXiv:2109.04513  [pdf, other

    cs.CL

    Filling the Gaps in Ancient Akkadian Texts: A Masked Language Modelling Approach

    Authors: Koren Lazar, Benny Saret, Asaf Yehudai, Wayne Horowitz, Nathan Wasserman, Gabriel Stanovsky

    Abstract: We present models which complete missing text given transliterations of ancient Mesopotamian documents, originally written on cuneiform clay tablets (2500 BCE - 100 CE). Due to the tablets' deterioration, scholars often rely on contextual cues to manually fill in missing parts in the text in a subjective and time-consuming process. We identify that this challenge can be formulated as a masked lang… ▽ More

    Submitted 24 October, 2021; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021 (Main Conference)

  4. arXiv:2109.03858  [pdf, other

    cs.CL

    Collecting a Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation

    Authors: Shahar Levy, Koren Lazar, Gabriel Stanovsky

    Abstract: Recent works have found evidence of gender bias in models of machine translation and coreference resolution using mostly synthetic diagnostic datasets. While these quantify bias in a controlled experiment, they often do so on a small scale and consist mostly of artificial, out-of-distribution sentences. In this work, we find grammatical patterns indicating stereotypical and non-stereotypical gende… ▽ More

    Submitted 10 September, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to Findings of EMNLP 2021