Skip to main content

Showing 1–3 of 3 results for author: Petrov, A V

Searching in archive cs. Search in all archives.
.
  1. Shallow Cross-Encoders for Low-Latency Retrieval

    Authors: Aleksandr V. Petrov, Sean MacAvaney, Craig Macdonald

    Abstract: Transformer-based Cross-Encoders achieve state-of-the-art effectiveness in text retrieval. However, Cross-Encoders based on large transformer models (such as BERT or T5) are computationally expensive and allow for scoring only a small number of documents within a reasonably small latency window. However, kee** search latencies low is important for user satisfaction and energy usage. In this pape… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted by ECIR2024

  2. RecJPQ: Training Large-Catalogue Sequential Recommenders

    Authors: Aleksandr V. Petrov, Craig Macdonald

    Abstract: Sequential Recommendation is a popular recommendation task that uses the order of user-item interaction to model evolving users' interests and sequential patterns in their behaviour. Current state-of-the-art Transformer-based models for sequential recommendation, such as BERT4Rec and SASRec, generate sequence embeddings and compute scores for catalogue items, but the increasing catalogue size make… ▽ More

    Submitted 18 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted by ACM WSDM 2024

  3. arXiv:2306.11114  [pdf, other

    cs.IR

    Generative Sequential Recommendation with GPTRec

    Authors: Aleksandr V. Petrov, Craig Macdonald

    Abstract: Sequential recommendation is an important recommendation task that aims to predict the next item in a sequence. Recently, adaptations of language models, particularly Transformer-based models such as SASRec and BERT4Rec, have achieved state-of-the-art results in sequential recommendation. In these models, item ids replace tokens in the original language models. However, this approach has limitatio… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted at Gen-IR@SIGIR2023 workshop