Skip to main content

Showing 1–3 of 3 results for author: Alberts, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.13163  [pdf, other

    cs.CL cs.AI

    Endowing Language Models with Multimodal Knowledge Graph Representations

    Authors: Ningyuan Huang, Yash R. Deshpande, Yibo Liu, Houda Alberts, Kyunghyun Cho, Clara Vania, Iacer Calixto

    Abstract: We propose a method to make natural language understanding models more parameter efficient by storing knowledge in an external knowledge graph (KG) and retrieving from this KG using a dense index. Given (possibly multilingual) downstream task data, e.g., sentences in German, we retrieve entities from the KG and use their multimodal representations to improve downstream task performance. We use the… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: 14 pages with appendix, 2 figures, 15 tables

    MSC Class: 68T50 ACM Class: I.2.7; I.2.10; I.2.4

  2. arXiv:2008.09152  [pdf, other

    cs.CV cs.CL

    ImagiFilter: A resource to enable the semi-automatic mining of images at scale

    Authors: Houda Alberts, Iacer Calixto

    Abstract: Datasets (semi-)automatically collected from the web can easily scale to millions of entries, but a dataset's usefulness is directly related to how clean and high-quality its examples are. In this paper, we describe and publicly release an image dataset along with pretrained models designed to (semi-)automatically filter out undesirable images from very large image collections, possibly obtained f… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: 10 pages, 6 figures, 2 tables

    ACM Class: E.0

  3. arXiv:2008.09150  [pdf, other

    cs.CL cs.AI cs.CV

    VisualSem: A High-quality Knowledge Graph for Vision and Language

    Authors: Houda Alberts, Teresa Huang, Yash Deshpande, Yibo Liu, Kyunghyun Cho, Clara Vania, Iacer Calixto

    Abstract: An exciting frontier in natural language understanding (NLU) and generation (NLG) calls for (vision-and-) language models that can efficiently access external structured knowledge repositories. However, many existing knowledge bases only cover limited domains, or suffer from noisy data, and most of all are typically hard to integrate into neural language pipelines. To fill this gap, we release Vis… ▽ More

    Submitted 20 October, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: Accepted for publication at the 1st Multilingual Representation Learning workshop (MRL 2021) co-located with EMNLP 2021. 15 pages, 8 figures, 6 tables

    ACM Class: E.0; E.2