Skip to main content

Showing 1–2 of 2 results for author: Buunk, E

.
  1. arXiv:2402.19411  [pdf, ps, other

    cs.IR cs.CL cs.LG

    PaECTER: Patent-level Representation Learning using Citation-informed Transformers

    Authors: Mainak Ghosh, Sebastian Erhardt, Michael E. Rose, Erik Buunk, Dietmar Harhoff

    Abstract: PaECTER is a publicly available, open-source document-level encoder specific for patents. We fine-tune BERT for Patents with examiner-added citation information to generate numerical representations for patent documents. PaECTER performs better in similarity tasks than current state-of-the-art models used in the patent domain. More specifically, our model outperforms the next-best patent specific… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 7 pages, 3 figures

  2. arXiv:2301.00200  [pdf, other

    cs.CL

    Logic Mill -- A Knowledge Navigation System

    Authors: Sebastian Erhardt, Mainak Ghosh, Erik Buunk, Michael E. Rose, Dietmar Harhoff

    Abstract: Logic Mill is a scalable and openly accessible software system that identifies semantically similar documents within either one domain-specific corpus or multi-domain corpora. It uses advanced Natural Language Processing (NLP) techniques to generate numerical representations of documents. Currently it leverages a large pre-trained language model to generate these document representations. The syst… ▽ More

    Submitted 20 October, 2023; v1 submitted 31 December, 2022; originally announced January 2023.

    Comments: 10 pages, 3 figures, 1 table