Skip to main content

Showing 1–7 of 7 results for author: Orbach, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.14019  [pdf, other

    cs.CL cs.AI

    Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

    Authors: Elron Bandel, Yotam Perlitz, Elad Venezian, Roni Friedman-Melamed, Ofir Arviv, Matan Orbach, Shachar Don-Yehyia, Dafna Sheinwald, Ariel Gera, Leshem Choshen, Michal Shmueli-Scheuer, Yoav Katz

    Abstract: In the dynamic landscape of generative NLP, traditional text processing pipelines limit research flexibility and reproducibility, as they are tailored to specific dataset, task, and model combinations. The escalating complexity, involving system prompts, model-specific formats, instructions, and more, calls for a shift to a structured, modular, and customizable solution. Addressing this need, we p… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Submitted to NAACL demo track

  2. arXiv:2205.03804  [pdf, other

    cs.CL

    Multi-Domain Targeted Sentiment Analysis

    Authors: Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim

    Abstract: Targeted Sentiment Analysis (TSA) is a central task for generating insights from consumer reviews. Such content is extremely diverse, with sites like Amazon or Yelp containing reviews on products and businesses from many different domains. A real-world TSA system should gracefully handle that diversity. This can be achieved by a multi-domain model -- one that is robust to the domain of the analyze… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022 (long paper)

  3. arXiv:2012.14541  [pdf, other

    cs.CL cs.IR cs.LG

    YASO: A Targeted Sentiment Analysis Evaluation Dataset for Open-Domain Reviews

    Authors: Matan Orbach, Orith Toledo-Ronen, Artem Spector, Ranit Aharonov, Yoav Katz, Noam Slonim

    Abstract: Current TSA evaluation in a cross-domain setup is restricted to the small set of review domains available in existing datasets. Such an evaluation is limited, and may not reflect true performance on sites like Amazon or Yelp that host diverse reviews from many domains. To address this gap, we present YASO - a new TSA evaluation dataset of open-domain user reviews. YASO contains 2,215 English sente… ▽ More

    Submitted 13 September, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: Accepted to EMNLP 2021 (long paper). To download YASO, see https://github.com/IBM/yaso-tsa

  4. arXiv:2010.06432  [pdf, other

    cs.CL cs.AI cs.LG

    Multilingual Argument Mining: Datasets and Analysis

    Authors: Orith Toledo-Ronen, Matan Orbach, Yonatan Bilu, Artem Spector, Noam Slonim

    Abstract: The growing interest in argument mining and computational argumentation brings with it a plethora of Natural Language Understanding (NLU) tasks and corresponding datasets. However, as with many other NLU tasks, the dominant language is English, with resources in other languages being few and far between. In this work, we explore the potential of transfer learning using the multilingual BERT model… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Accepted to Findings of EMNLP 2020 (Long Paper). For the associated multilingual arguments and evidence corpus, see https://www.research.ibm.com/haifa/dept/vst/debating_data.shtml#Multilingual%20Argument%20Mining

  5. arXiv:2005.01157  [pdf, other

    cs.CL cs.AI cs.LG

    Out of the Echo Chamber: Detecting Countering Debate Speeches

    Authors: Matan Orbach, Yonatan Bilu, Assaf Toledo, Dan Lahav, Michal Jacovi, Ranit Aharonov, Noam Slonim

    Abstract: An educated and informed consumption of media content has become a challenge in modern times. With the shift from traditional news outlets to social media and similar venues, a major concern is that readers are becoming encapsulated in "echo chambers" and may fall prey to fake news and disinformation, lacking easy access to dissenting views. We suggest a novel task aiming to alleviate some of thes… ▽ More

    Submitted 3 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2020 as Long Paper. For the associated debate speeches corpus, see https://www.research.ibm.com/haifa/dept/vst/debating_data.shtml#Debate%20Speech%20Analysis

  6. arXiv:1909.00393  [pdf, other

    cs.CL cs.AI cs.LG

    A Dataset of General-Purpose Rebuttal

    Authors: Matan Orbach, Yonatan Bilu, Ariel Gera, Yoav Kantor, Lena Dankin, Tamar Lavee, Lili Kotlerman, Shachar Mirkin, Michal Jacovi, Ranit Aharonov, Noam Slonim

    Abstract: In Natural Language Understanding, the task of response generation is usually focused on responses to short texts, such as tweets or a turn in a dialog. Here we present a novel task of producing a critical response to a long argumentative text, and suggest a method based on general rebuttal arguments to address it. We do this in the context of the recently-suggested task of listening comprehension… ▽ More

    Submitted 1 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  7. arXiv:1907.11889  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Effective Rebuttal: Listening Comprehension using Corpus-Wide Claim Mining

    Authors: Tamar Lavee, Matan Orbach, Lili Kotlerman, Yoav Kantor, Shai Gretz, Lena Dankin, Shachar Mirkin, Michal Jacovi, Yonatan Bilu, Ranit Aharonov, Noam Slonim

    Abstract: Engaging in a live debate requires, among other things, the ability to effectively rebut arguments claimed by your opponent. In particular, this requires identifying these arguments. Here, we suggest doing so by automatically mining claims from a corpus of news articles containing billions of sentences, and searching for them in a given speech. This raises the question of whether such claims indee… ▽ More

    Submitted 27 July, 2019; originally announced July 2019.

    Comments: 6th Argument Mining Workshop @ ACL 2019