Skip to main content

Showing 1–6 of 6 results for author: Diefenbach, D

.
  1. QAnswer: Towards Question Answering Search over Websites

    Authors: Kunpeng Guo, Clement Defretiere, Dennis Diefenbach, Christophe Gravier, Antoine Gourru

    Abstract: Question Answering (QA) is increasingly used by search engines to provide results to their end-users, yet very few websites currently use QA technologies for their search functionality. To illustrate the potential of QA technologies for the website search practitioner, we demonstrate web searches that combine QA over knowledge graphs and QA over free text -- each being usually tackled separately.… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  2. Fine-tuning Strategies for Domain Specific Question Answering under Low Annotation Budget Constraints

    Authors: Kunpeng Guo, Dennis Diefenbach, Antoine Gourru, Christophe Gravier

    Abstract: The progress introduced by pre-trained language models and their fine-tuning has resulted in significant improvements in most downstream NLP tasks. The unsupervised training of a language model combined with further target task fine-tuning has become the standard QA fine-tuning procedure. In this work, we demonstrate that this strategy is sub-optimal for fine-tuning QA models, especially under a l… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  3. Wikidata as a seed for Web Extraction

    Authors: Kunpeng Guo, Dennis Diefenbach, Antoine Gourru, Christophe Gravier

    Abstract: Wikidata has grown to a knowledge graph with an impressive size. To date, it contains more than 17 billion triples collecting information about people, places, films, stars, publications, proteins, and many more. On the other side, most of the information on the Web is not published in highly structured data repositories like Wikidata, but rather as unstructured and semi-structured content, more c… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  4. arXiv:2202.00120  [pdf, other

    cs.CL cs.IR

    QALD-9-plus: A Multilingual Dataset for Question Answering over DBpedia and Wikidata Translated by Native Speakers

    Authors: Aleksandr Perevalov, Dennis Diefenbach, Ricardo Usbeck, Andreas Both

    Abstract: The ability to have the same experience for different user groups (i.e., accessibility) is one of the most important characteristics of Web-based systems. The same is true for Knowledge Graph Question Answering (KGQA) systems that provide the access to Semantic Web data via natural language interface. While following our research agenda on the multilingual aspect of accessibility of KGQA systems,… ▽ More

    Submitted 7 February, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

  5. arXiv:1809.06859  [pdf, other

    cs.DB

    HDTCat: let's make HDT scale

    Authors: Dennis Diefenbach, Josée M. Giménez-García

    Abstract: HDT (Header, Dictionary, Triples) is a serialization for RDF. HDT has become very popular in the last years because it allows to store RDF data with a small disk footprint, while remaining at the same time queriable. For this reason HDT is often used when scalability becomes an issue. Once RDF data is serialized into HDT, the disk footprint to store it and the memory footprint to query it are very… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

  6. arXiv:1803.00832  [pdf, other

    cs.AI cs.CL

    Towards a Question Answering System over the Semantic Web

    Authors: Dennis Diefenbach, Andreas Both, Kamal Singh, Pierre Maret

    Abstract: Thanks to the development of the Semantic Web, a lot of new structured data has become available on the Web in the form of knowledge bases (KBs). Making this valuable data accessible and usable for end-users is one of the main goals of Question Answering (QA) over KBs. Most current QA systems query one KB, in one language (namely English). The existing approaches are not designed to be easily adap… ▽ More

    Submitted 2 March, 2018; originally announced March 2018.

    Comments: There is a Patent Pending for the presented approach. It was submitted the 18 January 2018 at the EPO and has the number EP18305035.0