Skip to main content

Showing 1–4 of 4 results for author: Galuščáková, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.07540  [pdf, other

    cs.HC cs.AI cs.CL

    PKG API: A Tool for Personal Knowledge Graph Management

    Authors: Nolwenn Bernard, Ivica Kostric, Weronika Łajewska, Krisztian Balog, Petra Galuščáková, Vinay Setty, Martin G. Skjæveland

    Abstract: Personal knowledge graphs (PKGs) offer individuals a way to store and consolidate their fragmented personal data in a central place, improving service personalization while maintaining full user control. Despite their potential, practical PKG implementations with user-friendly interfaces remain scarce. This work addresses this gap by proposing a complete solution to represent, manage, and interfac… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  2. arXiv:2111.05988  [pdf, ps, other

    cs.IR cs.AI cs.CL

    Cross-language Information Retrieval

    Authors: Petra Galuščáková, Douglas W. Oard, Suraj Nair

    Abstract: Two key assumptions shape the usual view of ranked retrieval: (1) that the searcher can choose words for their query that might appear in the documents that they wish to see, and (2) that ranking retrieved documents will suffice because the searcher will be able to recognize those which they wished to find. When the documents to be searched are in a language not known by the searcher, neither assu… ▽ More

    Submitted 8 June, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: 49 pages, 0 figures

  3. arXiv:2106.02293  [pdf, other

    cs.CL cs.IR

    Cross-language Sentence Selection via Data Augmentation and Rationale Training

    Authors: Yanda Chen, Chris Kedzie, Suraj Nair, Petra Galuščáková, Rui Zhang, Douglas W. Oard, Kathleen McKeown

    Abstract: This paper proposes an approach to cross-language sentence selection in a low-resource setting. It uses data augmentation and negative sampling techniques on noisy parallel sentence data to directly learn a cross-lingual embedding-based query relevance model. Results show that this approach performs as well as or better than multiple state-of-the-art machine translation + monolingual retrieval sys… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: ACL 2021 main conference

  4. arXiv:2104.07868  [pdf, other

    cs.CL

    Segmenting Subtitles for Correcting ASR Segmentation Errors

    Authors: David Wan, Chris Kedzie, Faisal Ladhak, Elsbeth Turcan, Petra Galuščáková, Elena Zotkina, Zheng** Jiang, Peter Bell, Kathleen McKeown

    Abstract: Typical ASR systems segment the input audio into utterances using purely acoustic information, which may not resemble the sentence-like units that are expected by conventional machine translation (MT) systems for Spoken Language Translation. In this work, we propose a model for correcting the acoustic segmentation of ASR models for low-resource languages to improve performance on downstream tasks.… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.