Skip to main content

Showing 1–10 of 10 results for author: Furman, D

.
  1. arXiv:2406.19951  [pdf, other

    cs.CL

    Mining Reasons For And Against Vaccination From Unstructured Data Using Nichesourcing and AI Data Augmentation

    Authors: Damián Ariel Furman, Juan Junqueras, Z. Burçe Gümüslü, Edgar Altszyler, Joaquin Navajas, Ophelia Deroy, Justin Sulik

    Abstract: We present Reasons For and Against Vaccination (RFAV), a dataset for predicting reasons for and against vaccination, and scientific authorities used to justify them, annotated through nichesourcing and augmented using GPT4 and GPT3.5-Turbo. We show how it is possible to mine these reasons in non-structured text, under different task definitions, despite the high level of subjectivity involved and… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 8 pages + references and appendix

  2. arXiv:2306.02978  [pdf, other

    cs.CL cs.AI

    Which Argumentative Aspects of Hate Speech in Social Media can be reliably identified?

    Authors: Damián Furman, Pablo Torres, José A. Rodríguez, Diego Letzen, Vanina Martínez, Laura Alonso Alemany

    Abstract: With the increasing diversity of use cases of large language models, a more informative treatment of texts seems necessary. An argumentative analysis could foster a more reasoned usage of chatbots, text completion mechanisms or other applications. However, it is unclear which aspects of argumentation can be reliably identified and integrated in language models. In this paper, we present an empiric… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 9 Pages plus reference and appendix

  3. arXiv:2305.13675  [pdf, other

    cs.CL

    Polyglot or Not? Measuring Multilingual Encyclopedic Knowledge in Foundation Models

    Authors: Tim Schott, Daniel Furman, Shreshta Bhat

    Abstract: In this work, we assess the ability of foundation models to recall encyclopedic knowledge across a wide range of linguistic contexts. To support this, we: 1) produce a 20-language dataset that contains 303k factual associations paired with counterfactuals, 2) evaluate 5 models in a multilingual test, and 3) benchmark a diverse set of 24 models in an English-only test. Meta's LLaMA achieves the hig… ▽ More

    Submitted 5 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 (Main)

  4. arXiv:2209.03788  [pdf, other

    quant-ph stat.ML

    Quantum Sparse Coding

    Authors: Yaniv Romano, Harel Primack, Talya Vaknin, Idan Meirzada, Ilan Karpas, Dov Furman, Chene Tradonsky, Ruti Ben Shlomi

    Abstract: The ultimate goal of any sparse coding method is to accurately recover from a few noisy linear measurements, an unknown sparse vector. Unfortunately, this estimation problem is NP-hard in general, and it is therefore always approached with an approximation method, such as lasso or orthogonal matching pursuit, thus trading off accuracy for less computational complexity. In this paper, we develop a… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  5. arXiv:2208.13947  [pdf, other

    cs.CL

    A Spanish dataset for Targeted Sentiment Analysis of political headlines

    Authors: Tomás Alves Salgueiro, Emilio Recart Zapata, Damián Furman, Juan Manuel Pérez, Pablo Nicolás Fernández Larrosa

    Abstract: Subjective texts have been studied by several works as they can induce certain behaviours in their users. Most work focuses on user-generated texts in social networks, but some other texts also comprise opinions on certain topics and could influence judgement criteria during political decisions. In this work, we address the task of Targeted Sentiment Analysis for the domain of news headlines, publ… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  6. arXiv:2208.01099  [pdf, other

    cs.CL

    Parsimonious Argument Annotations for Hate Speech Counter-narratives

    Authors: Damian A. Furman, Pablo Torres, Jose A. Rodriguez, Lautaro Martinez, Laura Alonso Alemany, Diego Letzen, Maria Vanina Martinez

    Abstract: We present an enrichment of the Hateval corpus of hate speech tweets (Basile et. al 2019) aimed to facilitate automated counter-narrative generation. Comparably to previous work (Chung et. al. 2019), manually written counter-narratives are associated to tweets. However, this information alone seems insufficient to obtain satisfactory language models for counter-narrative generation. That is why we… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  7. arXiv:2207.09517  [pdf, other

    quant-ph cs.ET

    LightSolver -- A New Quantum-inspired Solver Cracks the 3-Regular 3-XORSAT Challenge

    Authors: Idan Meirzada, Assaf Kalinski, Dov Furman, Tsafrir Armon, Talya Vaknin, Harel Primack, Chene Tradonsky, Ruti Ben-Shlomi

    Abstract: The increasing complexity of required computational tasks alongside the inherent limitations in conventional computing calls for disruptive innovation. LightSolver devised a new quantum-inspired computing paradigm, which utilizes an all-optical platform for solving hard optimization problems. In this work, LightSolver introduces its digital simulator and joins the 3-Regular 3-XORSAT (3R3X) challen… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

  8. arXiv:2111.09453  [pdf, other

    cs.CL cs.AI

    RoBERTuito: a pre-trained language model for social media text in Spanish

    Authors: Juan Manuel Pérez, Damián A. Furman, Laura Alonso Alemany, Franco Luque

    Abstract: Since BERT appeared, Transformer language models and transfer learning have become state-of-the-art for Natural Language Understanding tasks. Recently, some works geared towards pre-training specially-crafted models for particular domains, such as scientific papers, medical documents, user-generated texts, among others. These domain-specific models have been shown to improve performance significan… ▽ More

    Submitted 4 May, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: LREC 2022

  9. arXiv:2106.09462  [pdf, other

    cs.CL

    pysentimiento: A Python Toolkit for Opinion Mining and Social NLP tasks

    Authors: Juan Manuel Pérez, Mariela Rajngewerc, Juan Carlos Giudici, Damián A. Furman, Franco Luque, Laura Alonso Alemany, María Vanina Martínez

    Abstract: In recent years, the extraction of opinions and information from user-generated text has attracted a lot of interest, largely due to the unprecedented volume of content in Social Media. However, social researchers face some issues in adopting cutting-edge tools for these tasks, as they are usually behind commercial APIs, unavailable for other languages than English, or very complex to use for non-… ▽ More

    Submitted 25 October, 2023; v1 submitted 17 June, 2021; originally announced June 2021.

  10. Magnetic ordering and magnetodielectric phenomena in CoSeO$_4$

    Authors: Brent C. Melot, Lucy E. Darago, Ram Seshadri, Abby Goldman, Joshua D. Furman, Efrain E. Rodriguez

    Abstract: CoSeO$_4$ has a structure consisting of edge-sharing chains of Co$^{2+}$ octahedra which are held together by SeO$_4^{2-}$ tetrahedra via shared oxygen atoms at the edges of the octahedra. DC magnetization measurements indicate a transition to an ordered state below 30 K. Powder neutron diffraction refinements suggest an ordered state with two unique antiferrromagnetic chains within the unit cell… ▽ More

    Submitted 18 March, 2010; originally announced March 2010.

    Comments: 7 pages, 10 figures

    Report number: J. Phys.: Condens. Matter 22 506003