Skip to main content

Showing 1–9 of 9 results for author: Karidi, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14863  [pdf, other

    cs.CL cs.AI cs.LG

    A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns

    Authors: Asaf Yehudai, Taelin Karidi, Gabriel Stanovsky, Ariel Goldstein, Omri Abend

    Abstract: Cross-domain alignment refers to the task of map** a concept from one domain to another. For example, ``If a \textit{doctor} were a \textit{color}, what color would it be?''. This seemingly peculiar task is designed to investigate how people represent concrete and abstract concepts through their map**s between categories and their reasoning processes over those map**s. In this paper, we adap… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: CogSci

  2. arXiv:2404.06833  [pdf, other

    cs.CL

    Does Mapo Tofu Contain Coffee? Probing LLMs for Food-related Cultural Knowledge

    Authors: Li Zhou, Taelin Karidi, Nicolas Garneau, Yong Cao, Wanlong Liu, Wenyu Chen, Daniel Hershcovich

    Abstract: Recent studies have highlighted the presence of cultural biases in Large Language Models (LLMs), yet often lack a robust methodology to dissect these phenomena comprehensively. Our work aims to bridge this gap by delving into the Food domain, a universally relevant yet culturally diverse aspect of human life. We introduce FmLAMA, a multilingual dataset centered on food-related cultural facts and v… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 20 pages,8 figures

  3. arXiv:2310.13583  [pdf, other

    cs.CL cs.LG

    Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: Despite the impressive growth of the abilities of multilingual language models, such as XLM-R and mT5, it has been shown that they still face difficulties when tackling typologically-distant languages, particularly in the low-resource setting. One obstacle for effective cross-lingual transfer is variability in word-order patterns. It can be potentially mitigated via source- or target-side word reo… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP Findings 2023

  4. arXiv:2305.14991  [pdf, other

    cs.CL cs.AI

    MuLER: Detailed and Scalable Reference-based Evaluation

    Authors: Taelin Karidi, Leshem Choshen, Gal Patel, Omri Abend

    Abstract: We propose a novel methodology (namely, MuLER) that transforms any reference-based evaluation metric for text generation, such as machine translation (MT) into a fine-grained analysis tool. Given a system and a metric, MuLER quantifies how much the chosen metric penalizes specific error types (e.g., errors in translating names of locations). MuLER thus enables a detailed error analysis which can l… ▽ More

    Submitted 29 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  5. arXiv:2303.09435  [pdf, other

    cs.CL

    Jump to Conclusions: Short-Cutting Transformers With Linear Transformations

    Authors: Alexander Yom Din, Taelin Karidi, Leshem Choshen, Mor Geva

    Abstract: Transformer-based language models create hidden representations of their inputs at every layer, but only use final-layer representations for prediction. This obscures the internal decision-making process of the model and the utility of its intermediate representations. One way to elucidate this is to cast the hidden representations as final representations, bypassing the transformer computation in… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

    Journal ref: LREC-COLING 2024

  6. arXiv:2110.04644  [pdf, other

    cs.CL cs.LG

    On the Relation between Syntactic Divergence and Zero-Shot Performance

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: We explore the link between the extent to which syntactic relations are preserved in translation and the ease of correctly constructing a parse tree in a zero-shot setting. While previous work suggests such a relation, it tends to focus on the macro level and not on the level of individual edges-a gap we aim to address. As a test case, we take the transfer of Universal Dependencies (UD) parsing fr… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted to EMNLP 2021

  7. arXiv:2109.11491  [pdf, other

    cs.CL

    Putting Words in BERT's Mouth: Navigating Contextualized Vector Spaces with Pseudowords

    Authors: Taelin Karidi, Yichu Zhou, Nathan Schneider, Omri Abend, Vivek Srikumar

    Abstract: We present a method for exploring regions around individual points in a contextualized vector space (particularly, BERT space), as a way to investigate how these regions correspond to word senses. By inducing a contextualized "pseudoword" as a stand-in for a static embedding in the input layer, and then performing masked prediction of a word in the sentence, we are able to investigate the geometry… ▽ More

    Submitted 4 October, 2021; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 camera-ready version

  8. arXiv:2005.03436  [pdf, other

    cs.CL

    Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences

    Authors: Dmitry Nikolaev, Ofir Arviv, Taelin Karidi, Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend

    Abstract: The patterns in which the syntax of different languages converges and diverges are often used to inform work on cross-lingual transfer. Nevertheless, little empirical work has been done on quantifying the prevalence of different syntactic divergences across language pairs. We propose a framework for extracting divergence patterns for any language pair from a parallel corpus, building on Universal… ▽ More

    Submitted 13 July, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

  9. arXiv:1903.05181  [pdf, other

    cs.CL math.AT

    Topological Analysis of Syntactic Structures

    Authors: Alexander Port, Taelin Karidi, Matilde Marcolli

    Abstract: We use the persistent homology method of topological data analysis and dimensional analysis techniques to study data of syntactic structures of world languages. We analyze relations between syntactic parameters in terms of dimensionality, of hierarchical clustering structures, and of non-trivial loops. We show there are relations that hold across language families and additional relations that are… ▽ More

    Submitted 12 March, 2019; originally announced March 2019.

    Comments: 83 pages, LaTeX, 44 figures

    MSC Class: 91F20; 55U10; 55N35; 62-07