
Showing 1–4 of 4 results for author: Recski, G

  1. arXiv:2304.08188  [pdf, ps, other]

    cs.IR

    Statute-enhanced lexical retrieval of court cases for COLIEE 2022

    Authors: Tobias Fink, Gabor Recski, Wojciech Kusa, Allan Hanbury

    Abstract: We discuss our experiments for COLIEE Task 1, a court case retrieval competition using cases from the Federal Court of Canada. During experiments on the training data we observe that passage level retrieval with rank fusion outperforms document level retrieval. By explicitly adding extracted statute information to the queries and documents we can further improve the results. We submit two passage…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Sixteenth International Workshop on Juris-informatics (JURISIN). 2022

  2. POTATO: exPlainable infOrmation exTrAcTion framewOrk

    Authors: Ádám Kovács, Kinga Gémes, Eszter Iklódi, Gábor Recski

    Abstract: We present POTATO, a task- and language-independent framework for human-in-the-loop (HITL) learning of rule-based text classifiers using graph-based features. POTATO handles any type of directed graph and supports parsing text into Abstract Meaning Representations (AMR), Universal Dependencies (UD), and 4lang semantic graphs. A Streamlit-based user interface allows users to build rule systems from…

    Submitted 16 October, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: 4 pages

  3. arXiv:2004.12752  [pdf, other]

    cs.CL

    The Gutenberg Dialogue Dataset

    Authors: Richard Csaky, Gabor Recski

    Abstract: Large datasets are essential for neural modeling of many NLP tasks. Current publicly available open-domain dialogue datasets offer a trade-off between quality (e.g., DailyDialog) and size (e.g., Opensubtitles). We narrow this gap by building a high-quality dataset of 14.8M utterances in English, and smaller datasets in German, Dutch, Spanish, Portuguese, Italian, and Hungarian. We extract and proc…

    Submitted 22 January, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: Accepted at EACL 2021

  4. arXiv:1905.05471  [pdf, other]

    cs.CL cs.AI

    Improving Neural Conversational Models with Entropy-Based Data Filtering

    Authors: Richard Csaky, Patrik Purgai, Gabor Recski

    Abstract: Current neural network-based conversational models lack diversity and generate boring responses to open-ended utterances. Priors such as persona, emotion, or topic provide additional information to dialog models to aid response generation, but annotating a dataset with priors is expensive and such annotations are rarely available. While previous methods for improving the quality of open-domain res…

    Submitted 2 August, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: 20 pages. Same as ACL version: https://www.aclweb.org/anthology/P19-1567

    Journal ref: Proceedings of the 57th Conference of the ACL (2019) 5650-5669