Showing 1–6 of 6 results for author: Hämmerl, K

  1. arXiv:2404.06228  [pdf, other]

    cs.CL

    Understanding Cross-Lingual Alignment -- A Survey

    Authors: Katharina Hämmerl, Jindřich Libovický, Alexander Fraser

    Abstract: Cross-lingual alignment, the meaningful similarity of representations across languages in multilingual language models, has been an active field of research in recent years. We survey the literature of techniques to improve cross-lingual alignment, providing a taxonomy of methods and summarising insights from throughout the field. We present different understandings of cross-lingual alignment and…

    Submitted 11 June, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: Camera-ready version, ACL Findings 2024

  2. arXiv:2401.16092  [pdf, other]

    cs.CL cs.CY cs.LG

    Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You

    Authors: Felix Friedrich, Katharina Hämmerl, Patrick Schramowski, Manuel Brack, Jindrich Libovicky, Kristian Kersting, Alexander Fraser

    Abstract: Text-to-image generation models have recently achieved astonishing results in image quality, flexibility, and text alignment, and are consequently employed in a fast-growing number of applications. Through improvements in multilingual abilities, a larger community now has access to this technology. However, our results show that multilingual models suffer from significant gender biases just as mon…

    Submitted 15 May, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  3. arXiv:2306.00458  [pdf, other]

    cs.CL

    Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity

    Authors: Katharina Hämmerl, Alina Fastowski, Jindřich Libovický, Alexander Fraser

    Abstract: Previous work has shown that the representations output by contextual language models are more anisotropic than static type embeddings, and typically display outlier dimensions. This seems to be true for both monolingual and multilingual models, although much less work has been done on the multilingual context. Why these outliers occur and how they affect the representations is still an active are…

    Submitted 7 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: To appear in ACL Findings 2023. Fixed a citation in this version

  4. arXiv:2211.07733  [pdf, other]

    cs.CL

    Speaking Multiple Languages Affects the Moral Bias of Language Models

    Authors: Katharina Hämmerl, Björn Deiseroth, Patrick Schramowski, Jindřich Libovický, Constantin A. Rothkopf, Alexander Fraser, Kristian Kersting

    Abstract: Pre-trained multilingual language models (PMLMs) are commonly used when dealing with data from multiple languages and cross-lingual transfer. However, PMLMs are trained on varying amounts of data for each language. In practice this means their performance is often much better on English than many other languages. We explore to what extent this also applies to moral norms. Do the models capture mor…

    Submitted 1 June, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: To appear in ACL Findings 2023

  5. arXiv:2203.09904  [pdf, ps, other]

    cs.CL

    Do Multilingual Language Models Capture Differing Moral Norms?

    Authors: Katharina Hämmerl, Björn Deiseroth, Patrick Schramowski, Jindřich Libovický, Alexander Fraser, Kristian Kersting

    Abstract: Massively multilingual sentence representations are trained on large corpora of uncurated data, with a very imbalanced proportion of languages included in the training. This may cause the models to grasp cultural values including moral judgments from the high-resource languages and impose them on the low-resource languages. The lack of data in certain languages can also lead to developing random a…

    Submitted 18 March, 2022; originally announced March 2022.

  6. arXiv:2203.09326  [pdf, other]

    cs.CL

    Combining Static and Contextualised Multilingual Embeddings

    Authors: Katharina Hämmerl, Jindřich Libovický, Alexander Fraser

    Abstract: Static and contextual multilingual embeddings have complementary strengths. Static embeddings, while less expressive than contextual language models, can be more straightforwardly aligned across multiple languages. We combine the strengths of static and contextual models to improve multilingual representations. We extract static embeddings for 40 languages from XLM-R, validate those embeddings wit…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of ACL 2022