Showing 1–6 of 6 results for author: Broscheit, S

Searching in archive cs.
  1. arXiv:2207.06220

    cs.IR cs.AI

    Improving Wikipedia Verifiability with AI

    Authors: Fabio Petroni, Samuel Broscheit, Aleksandra Piktus, Patrick Lewis, Gautier Izacard, Lucas Hosseini, Jane Dwivedi-Yu, Maria Lomeli, Timo Schick, Pierre-Emmanuel Mazaré, Armand Joulin, Edouard Grave, Sebastian Riedel

    Abstract: Verifiability is a core content policy of Wikipedia: claims that are likely to be challenged need to be backed by citations. There are millions of articles available online and thousands of new articles are released each month. For this reason, finding relevant sources is a difficult task: many claims do not have any references that support them. Furthermore, even existing citations might not supp…

    Submitted 8 July, 2022; originally announced July 2022.

  2. arXiv:2112.09924

    cs.CL cs.AI cs.IR cs.LG

    The Web Is Your Oyster - Knowledge-Intensive NLP against a Very Large Web Corpus

    Authors: Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Dmytro Okhonko, Samuel Broscheit, Gautier Izacard, Patrick Lewis, Barlas Oğuz, Edouard Grave, Wen-tau Yih, Sebastian Riedel

    Abstract: In order to address increasing demands of real-world applications, the research for knowledge-intensive NLP (KI-NLP) should advance by capturing the challenges of a truly open-domain environment: web-scale knowledge, lack of structure, inconsistent quality and noise. To this end, we propose a new setup for evaluating existing knowledge intensive tasks in which we generalize the background corpus t…

    Submitted 24 May, 2022; v1 submitted 18 December, 2021; originally announced December 2021.

  3. Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking

    Authors: Samuel Broscheit

    Abstract: A typical architecture for end-to-end entity linking systems consists of three steps: mention detection, candidate generation and entity disambiguation. In this study we investigate the following questions: (a) Can all those steps be learned jointly with a model for contextualized text-representations, i.e. BERT (Devlin et al., 2019)? (b) How much entity knowledge is already contained in pretraine…

    Submitted 11 March, 2020; originally announced March 2020.

    Comments: Published at CoNLL 2019

    Journal ref: Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), 2019, 677-685

  4. arXiv:1904.12324

    cs.CL

    OPIEC: An Open Information Extraction Corpus

    Authors: Kiril Gashteovski, Sebastian Wanner, Sven Hertling, Samuel Broscheit, Rainer Gemulla

    Abstract: Open information extraction (OIE) systems extract relations and their arguments from natural language text in an unsupervised manner. The resulting extractions are a valuable resource for downstream tasks such as knowledge base construction, open question answering, or event schema induction. In this paper, we release, describe, and analyze an OIE corpus called OPIEC, which was extracted from the…

    Submitted 28 April, 2019; originally announced April 2019.

    Comments: In Proceedings of the Conference of Automatic Knowledge Base Construction (AKBC) 2019

    Journal ref: In Proceedings of the Conference of Automatic Knowledge Base Construction (AKBC) 2019

  5. arXiv:1902.00898

    cs.LG stat.ML

    A Relational Tucker Decomposition for Multi-Relational Link Prediction

    Authors: Yanjie Wang, Samuel Broscheit, Rainer Gemulla

    Abstract: We propose the Relational Tucker3 (RT) decomposition for multi-relational link prediction in knowledge graphs. We show that many existing knowledge graph embedding models are special cases of the RT decomposition with certain predefined sparsity patterns in its components. In contrast to these prior models, RT decouples the sizes of entity and relation embeddings, allows parameter sharing across r…

    Submitted 3 February, 2019; originally announced February 2019.

  6. arXiv:1810.07180

    cs.AI cs.LG stat.ML

    On Evaluating Embedding Models for Knowledge Base Completion

    Authors: Yanjie Wang, Daniel Ruffinelli, Rainer Gemulla, Samuel Broscheit, Christian Meilicke

    Abstract: Knowledge bases contribute to many web search and mining tasks, yet they are often incomplete. To add missing facts to a given knowledge base, various embedding models have been proposed in the recent literature. Perhaps surprisingly, relatively simple models with limited expressiveness often performed remarkably well under today's most commonly used evaluation protocols. In this paper, we explore…

    Submitted 31 January, 2019; v1 submitted 17 October, 2018; originally announced October 2018.