Showing 1–1 of 1 results for author: Lasoňová, L

  1. arXiv:2405.20994

    cs.IR cs.CL

    CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking

    Authors: Josef Vonášek, Milan Straka, Rostislav Krč, Lenka Lasoňová, Ekaterina Egorova, Jana Straková, Jakub Náplava

    Abstract: We present CWRCzech, Click Web Ranking dataset for Czech, a 100M query-document Czech click dataset for relevance ranking with user behavior data collected from search engine logs of Seznam.cz. To the best of our knowledge, CWRCzech is the largest click dataset with raw text published so far. It provides document positions in the search results as well as information about user behavior: 27.6M cli…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted to SIGIR 2024