Skip to main content

Showing 1–5 of 5 results for author: Trotman, A

.
  1. arXiv:2110.11540  [pdf, other

    cs.IR

    Wacky Weights in Learned Sparse Representations and the Revenge of Score-at-a-Time Query Evaluation

    Authors: Joel Mackenzie, Andrew Trotman, Jimmy Lin

    Abstract: Recent advances in retrieval models based on learned sparse representations generated by transformers have led us to, once again, consider score-at-a-time query evaluation techniques for the top-k retrieval problem. Previous studies comparing document-at-a-time and score-at-a-time approaches have consistently found that the former approach yields lower mean query latency, although the latter appro… ▽ More

    Submitted 27 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

  2. arXiv:2003.08276  [pdf, other

    cs.IR

    Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format

    Authors: Jimmy Lin, Joel Mackenzie, Chris Kamphuis, Craig Macdonald, Antonio Mallia, MichaƂ Siedlaczek, Andrew Trotman, Arjen de Vries

    Abstract: There exists a natural tension between encouraging a diverse ecosystem of open-source search engines and supporting fair, replicable comparisons across those systems. To balance these two goals, we examine two approaches to providing interoperability between the inverted indexes of several systems. The first takes advantage of internal abstractions around index structures and building wrappers tha… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  3. arXiv:1912.12282  [pdf, ps, other

    cs.IR

    Report on the SIGIR 2019 Workshop on eCommerce (ECOM19)

    Authors: Jon Degenhardt, Surya Kallumadi, Utkarsh Porwal, Andrew Trotman

    Abstract: The SIGIR 2019 Workshop on eCommerce (ECOM19), was a full day workshop that took place on Thursday, July 25, 2019 in Paris, France. The purpose of the workshop was to serve as a platform for publication and discussion of Information Retrieval and NLP research and their applications in the domain of eCommerce. The workshop program was designed to bring together practitioners and researchers from ac… ▽ More

    Submitted 27 December, 2019; originally announced December 2019.

  4. arXiv:1307.1179  [pdf

    cs.IR

    Future Web Growth and its Consequences for Web Search Architectures

    Authors: Andrew Trotman, **glan Zhang

    Abstract: Introduction: Before embarking on the design of any computer system it is first necessary to assess the magnitude of the problem. In the case of a web search engine this assessment amounts to determining the current size of the web, the growth rate of the web, and the quantity of computing resource necessary to search it, and projecting the historical growth of this into the future. Method: The ov… ▽ More

    Submitted 3 July, 2013; originally announced July 2013.

  5. arXiv:1208.5654  [pdf, ps, other

    cs.IR cs.AI

    Document Clustering Evaluation: Divergence from a Random Baseline

    Authors: Christopher M. De Vries, Shlomo Geva, Andrew Trotman

    Abstract: Divergence from a random baseline is a technique for the evaluation of document clustering. It ensures cluster quality measures are performing work that prevents ineffective clusterings from giving high scores to clusterings that provide no useful result. These concepts are defined and analysed using intrinsic and extrinsic approaches to the evaluation of document cluster quality. This includes th… ▽ More

    Submitted 29 August, 2012; v1 submitted 28 August, 2012; originally announced August 2012.

    Comments: 8 pages, 11 figures, WIR2012