Skip to main content

Showing 1–9 of 9 results for author: Altszyler, E

.
  1. arXiv:2406.19951  [pdf, other

    cs.CL

    Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation

    Authors: Damián Ariel Furman, Juan Junqueras, Z. Burçe Gümüslü, Edgar Altszyler, Joaquin Navajas, Ophelia Deroy, Justin Sulik

    Abstract: We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show how it is possible to mine these reasons in non-structured text, under different task definitions, despite the high level of subjectivity involved and… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 8 pages + references and appendix

  2. arXiv:2301.00792  [pdf, other

    cs.CL cs.AI

    The Undesirable Dependence on Frequency of Gender Bias Metrics Based on Word Embeddings

    Authors: Francisco Valentini, Germán Rosati, Diego Fernandez Slezak, Edgar Altszyler

    Abstract: Numerous works use word embedding-based metrics to quantify societal biases and stereotypes in texts. Recent studies have found that word embeddings can capture semantic similarity but may be affected by word frequency. In this work we study the effect of frequency when measuring female vs. male gender bias with word embedding-based bias quantification methods. We find that Skip-gram with negative… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

    Comments: Camera Ready for EMNLP 2022 (Findings)

  3. arXiv:2211.08203  [pdf, other

    cs.CL

    Investigating the Frequency Distortion of Word Embeddings and Its Impact on Bias Metrics

    Authors: Francisco Valentini, Juan Cruz Sosa, Diego Fernandez Slezak, Edgar Altszyler

    Abstract: Recent research has shown that static word embeddings can encode word frequency information. However, little has been studied about this phenomenon and its effects on downstream tasks. In the present work, we systematically study the association between frequency and semantic similarity in several static word embeddings. We find that Skip-gram, GloVe and FastText embeddings tend to produce higher… ▽ More

    Submitted 19 October, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Camera Ready for EMNLP 2023 (Findings)

  4. arXiv:2104.06474  [pdf, other

    cs.CL

    On the Interpretability and Significance of Bias Metrics in Texts: a PMI-based Approach

    Authors: Francisco Valentini, Germán Rosati, Damián Blasi, Diego Fernandez Slezak, Edgar Altszyler

    Abstract: In recent years, word embeddings have been widely used to measure biases in texts. Even if they have proven to be effective in detecting a wide variety of biases, metrics based on word embeddings lack transparency and interpretability. We analyze an alternative PMI-based metric to quantify biases in texts. It can be expressed as a function of conditional probabilities, which provides a simple inte… ▽ More

    Submitted 18 July, 2023; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: Camera Ready for ACL 2023 (main conference)

  5. Gender bias in magazines oriented to men and women: a computational approach

    Authors: Diego Kozlowski, Gabriela Lozano, Carla M. Felcher, Fernando Gonzalez, Edgar Altszyler

    Abstract: Cultural products are a source to acquire individual values and behaviours. Therefore, the differences in the content of the magazines aimed specifically at women or men are a means to create and reproduce gender stereotypes. In this study, we compare the content of a women-oriented magazine with that of a men-oriented one, both produced by the same editorial group, over a decade (2008-2018). With… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Journal ref: Feminist Media Studies (2022)

  6. arXiv:2009.13275  [pdf, other

    cs.CL cs.AI

    Zero-shot Multi-Domain Dialog State Tracking Using Descriptive Rules

    Authors: Edgar Altszyler, Pablo Brusco, Nikoletta Basiou, John Byrnes, Dimitra Vergyri

    Abstract: In this work, we present a framework for incorporating descriptive logical rules in state-of-the-art neural networks, enabling them to learn how to handle unseen labels without the introduction of any new training data. The rules are integrated into existing networks without modifying their architecture, through an additional term in the network's loss function that penalizes states of the network… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

  7. arXiv:1712.10054  [pdf, ps, other

    cs.CL cs.AI

    Corpus specificity in LSA and Word2vec: the role of out-of-domain documents

    Authors: Edgar Altszyler, Mariano Sigman, Diego Fernandez Slezak

    Abstract: Latent Semantic Analysis (LSA) and Word2vec are some of the most widely used word embeddings. Despite the popularity of these techniques, the precise mechanisms by which they acquire new semantic relations between words remain unclear. In the present article we investigate whether LSA and Word2vec capacity to identify relevant semantic dimensions increases with size of corpus. One intuitive hypoth… ▽ More

    Submitted 28 December, 2017; originally announced December 2017.

    Journal ref: Proceedings of the 3rd Workshop on Representation Learning for NLP, pages 1-10, 2018, ACL

  8. Comparative study of LSA vs Word2vec embeddings in small corpora: a case study in dreams database

    Authors: Edgar Altszyler, Mariano Sigman, Sidarta Ribeiro, Diego Fernández Slezak

    Abstract: Word embeddings have been extensively studied in large text datasets. However, only a few studies analyze semantic representations of small corpora, particularly relevant in single-person text production studies. In the present paper, we compare Skip-gram and LSA capabilities in this scenario, and we test both techniques to extract relevant semantic patterns in single-series dreams reports. LSA sh… ▽ More

    Submitted 11 April, 2017; v1 submitted 5 October, 2016; originally announced October 2016.

    Journal ref: Conscious Cogn. 2017 Nov;56:178-187

  9. Ultrasensitivity on signaling cascades revisited: Linking local and global ultrasensitivity estimations

    Authors: Edgar Altszyler, Alejandra Ventura, Alejandro Colman-Lerner, Ariel Chernomoretz

    Abstract: Ultrasensitive response motifs, which are capable of converting graded stimulus in binary responses, are very well-conserved in signal transduction networks. Although it has been shown that a cascade arrangement of multiple ultrasensitive modules can produce an enhancement of the system's ultrasensitivity, how the combination of layers affects the cascade's ultrasensitivity remains an open questio… ▽ More

    Submitted 3 April, 2017; v1 submitted 29 August, 2016; originally announced August 2016.

    Journal ref: PLoS ONE 12(6), 2017