Skip to main content

Showing 1–16 of 16 results for author: Katz, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.14019  [pdf, other

    cs.CL cs.AI

    Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

    Authors: Elron Bandel, Yotam Perlitz, Elad Venezian, Roni Friedman-Melamed, Ofir Arviv, Matan Orbach, Shachar Don-Yehyia, Dafna Sheinwald, Ariel Gera, Leshem Choshen, Michal Shmueli-Scheuer, Yoav Katz

    Abstract: In the dynamic landscape of generative NLP, traditional text processing pipelines limit research flexibility and reproducibility, as they are tailored to specific dataset, task, and model combinations. The escalating complexity, involving system prompts, model-specific formats, instructions, and more, calls for a shift to a structured, modular, and customizable solution. Addressing this need, we p… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Submitted to NAACL demo track

  2. arXiv:2302.04863  [pdf, other

    cs.LG cs.AI cs.CL

    Knowledge is a Region in Weight Space for Fine-tuned Language Models

    Authors: Almog Gueta, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen

    Abstract: Research on neural networks has focused on understanding a single model trained on a single dataset. However, relatively little is known about the relationships between different models, particularly those trained or tested on different datasets. We address this by studying how the weight space and the underlying loss landscape of different models are interconnected. Specifically, we demonstrate… ▽ More

    Submitted 12 October, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  3. arXiv:2212.10498  [pdf, other

    cs.CL

    SimpleStyle: An Adaptable Style Transfer Approach

    Authors: Elron Bandel, Yoav Katz, Noam Slonim, Liat Ein-Dor

    Abstract: Attribute-controlled text rewriting, also known as text style-transfer, has a crucial role in regulating attributes and biases of textual training data and a machine generated text. In this work we present SimpleStyle, a minimalist yet effective approach for style-transfer composed of two simple ingredients: controlled denoising and output filtering. Despite the simplicity of our approach, which c… ▽ More

    Submitted 22 December, 2022; v1 submitted 20 December, 2022; originally announced December 2022.

  4. arXiv:2212.01378  [pdf, other

    cs.LG cs.CL cs.DC

    ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning

    Authors: Shachar Don-Yehiya, Elad Venezian, Colin Raffel, Noam Slonim, Yoav Katz, Leshem Choshen

    Abstract: We propose a new paradigm to continually evolve pretrained models, denoted ColD Fusion. It provides the benefits of multitask learning but leverages distributed computation with limited communication and eliminates the need for shared data. Consequentially, ColD Fusion can give rise to a synergistic loop, where finetuned models can be recycled to continually improve the pretrained model they are b… ▽ More

    Submitted 13 September, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: ACL 23

  5. arXiv:2211.00107  [pdf, other

    cs.CL cs.AI cs.LG

    Where to start? Analyzing the potential value of intermediate models

    Authors: Leshem Choshen, Elad Venezian, Shachar Don-Yehia, Noam Slonim, Yoav Katz

    Abstract: Previous studies observed that finetuned models may be better base models than the vanilla pretrained model. Such a model, finetuned on some source dataset, may provide a better starting point for a new finetuning process on a desired target dataset. Here, we perform a systematic analysis of this intertraining scheme, over a wide range of English classification tasks. Surprisingly, our analysis su… ▽ More

    Submitted 10 November, 2022; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: https://ibm.github.io/model-recycling/

  6. arXiv:2208.01483  [pdf, other

    cs.CL cs.HC

    Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours

    Authors: Eyal Shnarch, Alon Halfon, Ariel Gera, Marina Danilevsky, Yannis Katsis, Leshem Choshen, Martin Santillan Cooper, Dina Epelboim, Zheng Zhang, Dakuo Wang, Lucy Yip, Liat Ein-Dor, Lena Dankin, Ilya Shnayderman, Ranit Aharonov, Yunyao Li, Naftali Liberman, Philip Levin Slesarev, Gwilym Newton, Shila Ofek-Koifman, Noam Slonim, Yoav Katz

    Abstract: Text classification can be useful in many real-world scenarios, saving a lot of time for end users. However, building a custom classifier typically requires coding skills and ML knowledge, which poses a significant barrier for many potential users. To lift this barrier, we introduce Label Sleuth, a free open source system for labeling and creating text classifiers. This system is unique for (a) be… ▽ More

    Submitted 31 October, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: 7 pages, 2 figures To be published at EMNLP 2022

  7. arXiv:2205.12240  [pdf, other

    cs.CL

    VIRATrustData: A Trust-Annotated Corpus of Human-Chatbot Conversations About COVID-19 Vaccines

    Authors: Roni Friedman, João Sedoc, Shai Gretz, Assaf Toledo, Rose Weeks, Naor Bar-Zeev, Yoav Katz, Noam Slonim

    Abstract: Public trust in medical information is crucial for successful application of public health policies such as vaccine uptake. This is especially true when the information is offered remotely, by chatbots, which have become increasingly popular in recent years. Here, we explore the challenging task of human-bot turn-level trust classification. We rely on a recently released data of observationally-co… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  8. arXiv:2205.11966  [pdf, other

    cs.CL

    Benchmark Data and Evaluation Framework for Intent Discovery Around COVID-19 Vaccine Hesitancy

    Authors: Shai Gretz, Assaf Toledo, Roni Friedman, Dan Lahav, Rose Weeks, Naor Bar-Zeev, João Sedoc, Pooja Sangha, Yoav Katz, Noam Slonim

    Abstract: The COVID-19 pandemic has made a huge global impact and cost millions of lives. As COVID-19 vaccines were rolled out, they were quickly met with widespread hesitancy. To address the concerns of hesitant people, we launched VIRA, a public dialogue system aimed at addressing questions and concerns surrounding the COVID-19 vaccines. Here, we release VIRADialogs, a dataset of over 8k dialogues conduct… ▽ More

    Submitted 11 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  9. arXiv:2205.03804  [pdf, other

    cs.CL

    Multi-Domain Targeted Sentiment Analysis

    Authors: Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim

    Abstract: Targeted Sentiment Analysis (TSA) is a central task for generating insights from consumer reviews. Such content is extremely diverse, with sites like Amazon or Yelp containing reviews on products and businesses from many different domains. A real-world TSA system should gracefully handle that diversity. This can be achieved by a multi-domain model -- one that is robust to the domain of the analyze… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022 (long paper)

  10. arXiv:2204.03044  [pdf, other

    cs.CL cs.CV cs.LG

    Fusing finetuned models for better pretraining

    Authors: Leshem Choshen, Elad Venezian, Noam Slonim, Yoav Katz

    Abstract: Pretrained models are the standard starting point for training. This approach consistently outperforms the use of a random initialization. However, pretraining is a costly endeavour that few can undertake. In this paper, we create better base models at hardly any cost, by fusing multiple existing fine tuned models into one. Specifically, we fuse by averaging the weights of these models. We show… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  11. arXiv:2110.10577  [pdf, other

    cs.CL

    Overview of the 2021 Key Point Analysis Shared Task

    Authors: Roni Friedman, Lena Dankin, Yufang Hou, Ranit Aharonov, Yoav Katz, Noam Slonim

    Abstract: We describe the 2021 Key Point Analysis (KPA-2021) shared task on key point analysis that we organized as a part of the 8th Workshop on Argument Mining (ArgMining 2021) at EMNLP 2021. We outline various approaches and discuss the results of the shared task. We expect the task and the findings reported in this paper to be relevant for researchers working on text summarization and argument mining.

    Submitted 20 October, 2021; originally announced October 2021.

  12. arXiv:2110.01029  [pdf, other

    cs.CL

    Project Debater APIs: Decomposing the AI Grand Challenge

    Authors: Roy Bar-Haim, Yoav Kantor, Elad Venezian, Yoav Katz, Noam Slonim

    Abstract: Project Debater was revealed in 2019 as the first AI system that can debate human experts on complex topics. Engaging in a live debate requires a diverse set of skills, and Project Debater has been developed accordingly as a collection of components, each designed to perform a specific subtask. Project Debater APIs provide access to many of these capabilities, as well as to more recently developed… ▽ More

    Submitted 3 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 (Demonstrations)

  13. arXiv:2012.14541  [pdf, other

    cs.CL cs.IR cs.LG

    YASO: A Targeted Sentiment Analysis Evaluation Dataset for Open-Domain Reviews

    Authors: Matan Orbach, Orith Toledo-Ronen, Artem Spector, Ranit Aharonov, Yoav Katz, Noam Slonim

    Abstract: Current TSA evaluation in a cross-domain setup is restricted to the small set of review domains available in existing datasets. Such an evaluation is limited, and may not reflect true performance on sites like Amazon or Yelp that host diverse reviews from many domains. To address this gap, we present YASO - a new TSA evaluation dataset of open-domain user reviews. YASO contains 2,215 English sente… ▽ More

    Submitted 13 September, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: Accepted to EMNLP 2021 (long paper). To download YASO, see https://github.com/IBM/yaso-tsa

  14. arXiv:1911.10783  [pdf, other

    cs.CL

    Financial Event Extraction Using Wikipedia-Based Weak Supervision

    Authors: Liat Ein-Dor, Ariel Gera, Orith Toledo-Ronen, Alon Halfon, Benjamin Sznajder, Lena Dankin, Yonatan Bilu, Yoav Katz, Noam Slonim

    Abstract: Extraction of financial and economic events from text has previously been done mostly using rule-based methods, with more recent works employing machine learning techniques. This work is in line with this latter approach, leveraging relevant Wikipedia sections to extract weak labels for sentences describing economic events. Whereas previous weakly supervised approaches required a knowledge-base of… ▽ More

    Submitted 28 November, 2022; v1 submitted 25 November, 2019; originally announced November 2019.

  15. arXiv:1908.06785  [pdf, other

    cs.CL

    Fast End-to-End Wikification

    Authors: Ilya Shnayderman, Liat Ein-Dor, Yosi Mass, Alon Halfon, Benjamin Sznajder, Artem Spector, Yoav Katz, Dafna Sheinwald, Ranit Aharonov, Noam Slonim

    Abstract: Wikification of large corpora is beneficial for various NLP applications. Existing methods focus on quality performance rather than run-time, and are therefore non-feasible for large data. Here, we introduce RedW, a run-time oriented Wikification solution, based on Wikipedia redirects, that can Wikify massive corpora with competitive performance. We further propose an efficient method for estimati… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  16. arXiv:1906.03897  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Learning to combine Grammatical Error Corrections

    Authors: Yoav Kantor, Yoav Katz, Leshem Choshen, Edo Cohen-Karlik, Naftali Liberman, Assaf Toledo, Amir Menczel, Noam Slonim

    Abstract: The field of Grammatical Error Correction (GEC) has produced various systems to deal with focused phenomena or general text editing. We propose an automatic way to combine black-box systems. Our method automatically detects the strength of a system or the combination of several systems per error type, improving precision and recall while optimizing $F$ score directly. We show consistent improvemen… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: BEA 2019