Skip to main content

Showing 1–9 of 9 results for author: Meij, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.02740  [pdf, other

    cs.IR cs.CL

    Dense Retrieval Adaptation using Target Domain Description

    Authors: Helia Hashemi, Yong Zhuang, Sachith Sri Ram Kothur, Srivas Prasad, Edgar Meij, W. Bruce Croft

    Abstract: In information retrieval (IR), domain adaptation is the process of adapting a retrieval model to a new domain whose data distribution is different from the source domain. Existing methods in this area focus on unsupervised domain adaptation where they have access to the target document collection or supervised (often few-shot) domain adaptation where they additionally have access to (limited) labe… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  2. News Article Retrieval in Context for Event-centric Narrative Creation

    Authors: Nikos Voskarides, Edgar Meij, Sabrina Sauer, Maarten de Rijke

    Abstract: Writers such as journalists often use automatic tools to find relevant content to include in their narratives. In this paper, we focus on supporting writers in the news domain to develop event-centric narratives. Given an incomplete narrative that specifies a main event and a context, we aim to retrieve news articles that discuss relevant events that would enable the continuation of the narrative.… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: ICTIR 2021

  3. arXiv:2010.04706  [pdf, other

    cs.CL

    Uncertainty over Uncertainty: Investigating the Assumptions, Annotations, and Text Measurements of Economic Policy Uncertainty

    Authors: Katherine A. Keith, Christoph Teichmann, Brendan O'Connor, Edgar Meij

    Abstract: Methods and applications are inextricably linked in science, and in particular in the domain of text-as-data. In this paper, we examine one such text-as-data application, an established economic index that measures economic policy uncertainty from keyword occurrences in news. This index, which is shown to correlate with firm investment, employment, and excess market returns, has had substantive im… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: Accepted to the 2020 Natural Language Processing + Computational Social Science Workshop (NLP+CSS) at EMNLP

    Journal ref: 2020 Natural Language Processing + Computational Social Science Workshop (NLP+CSS) at EMNLP

  4. arXiv:2007.11659   

    cs.IR cs.DB

    Proceedings of the KG-BIAS Workshop 2020 at AKBC 2020

    Authors: Edgar Meij, Tara Safavi, Chenyan Xiong, Gianluca Demartini, Miriam Redi, Fatma Özcan

    Abstract: The KG-BIAS 2020 workshop touches on biases and how they surface in knowledge graphs (KGs), biases in the source data that is used to create KGs, methods for measuring or remediating bias in KGs, but also identifying other biases such as how and which languages are represented in automatically constructed KGs or how personal KGs might incur inherent biases. The goal of this workshop is to uncover… ▽ More

    Submitted 18 June, 2020; originally announced July 2020.

  5. arXiv:2004.01168  [pdf, other

    cs.AI cs.CL cs.LG

    Evaluating the Calibration of Knowledge Graph Embeddings for Trustworthy Link Prediction

    Authors: Tara Safavi, Danai Koutra, Edgar Meij

    Abstract: Little is known about the trustworthiness of predictions made by knowledge graph embedding (KGE) models. In this paper we take initial steps toward this direction by investigating the calibration of KGE models, or the extent to which they output confidence scores that reflect the expected correctness of predicted knowledge graph triples. We first conduct an evaluation under the standard closed-wor… ▽ More

    Submitted 6 October, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: EMNLP 2020

  6. arXiv:2003.07461  [pdf, other

    cs.IR

    Identifying Notable News Stories

    Authors: Antonia Saravanou, Giorgio Stefanoni, Edgar Meij

    Abstract: The volume of news content has increased significantly in recent years and systems to process and deliver this information in an automated fashion at scale are becoming increasingly prevalent. One critical component that is required in such systems is a method to automatically determine how notable a certain news story is, in order to prioritize these stories during delivery. One way to do so is t… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: Proceedings of The 42nd European Conference on Information Retrieval 2020 (ECIR '20), 2020

  7. Novel Entity Discovery from Web Tables

    Authors: Shuo Zhang, Edgar Meij, Krisztian Balog, Ridho Reinanda

    Abstract: When working with any sort of knowledge base (KB) one has to make sure it is as complete and also as up-to-date as possible. Both tasks are non-trivial as they require recall-oriented efforts to determine which entities and relationships are missing from the KB. As such they require a significant amount of labor. Tables on the Web, on the other hand, are abundant and have the distinct potential to… ▽ More

    Submitted 1 February, 2020; originally announced February 2020.

    Comments: Proceedings of The Web Conference 2020 (WWW '20), 2020

  8. Weakly-supervised Contextualization of Knowledge Graph Facts

    Authors: Nikos Voskarides, Edgar Meij, Ridho Reinanda, Abhinav Khaitan, Miles Osborne, Giorgio Stefanoni, Prabhanjan Kambadur, Maarten de Rijke

    Abstract: Knowledge graphs (KGs) model facts about the world, they consist of nodes (entities such as companies and people) that are connected by edges (relations such as founderOf). Facts encoded in KGs are frequently used by search applications to augment result pages. When presenting a KG fact to the user, providing other facts that are pertinent to that main fact can enrich the user experience and suppo… ▽ More

    Submitted 8 July, 2018; v1 submitted 7 May, 2018; originally announced May 2018.

    Comments: SIGIR 2018: 41st international ACM SIGIR conference on Research and Development in Information Retrieval. July version: corrected typos

  9. Document Filtering for Long-tail Entities

    Authors: Ridho Reinanda, Edgar Meij, Maarten de Rijke

    Abstract: Filtering relevant documents with respect to entities is an essential task in the context of knowledge base construction and maintenance. It entails processing a time-ordered stream of documents that might be relevant to an entity in order to select only those that contain vital information. State-of-the-art approaches to document filtering for popular entities are entity-dependent: they rely on a… ▽ More

    Submitted 14 September, 2016; originally announced September 2016.

    Comments: CIKM2016, Proceedings of the 25th ACM International Conference on Information and Knowledge Management. 2016

    ACM Class: H.3.3