Skip to main content

Showing 1–13 of 13 results for author: Sznajder, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.17347  [pdf, other

    cs.SE cs.HC

    InspectorRAGet: An Introspection Platform for RAG Evaluation

    Authors: Kshitij Fadnis, Siva Sankalp Patel, Odellia Boni, Yannis Katsis, Sara Rosenthal, Benjamin Sznajder, Marina Danilevsky

    Abstract: Large Language Models (LLM) have become a popular approach for implementing Retrieval Augmented Generation (RAG) systems, and a significant amount of effort has been spent on building good models and metrics. In spite of increased recognition of the need for rigorous evaluation of RAG systems, few tools exist that go beyond the creation of model output and automatic calculation. We present Inspect… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  2. arXiv:2402.07891  [pdf, other

    cs.CL cs.LG

    Label-Efficient Model Selection for Text Generation

    Authors: Shir Ashury-Tahan, Ariel Gera, Benjamin Sznajder, Leshem Choshen, Liat Ein-Dor, Eyal Shnarch

    Abstract: Model selection for a given target task can be costly, as it may entail extensive annotation of the quality of outputs of different models. We introduce DiffUse, an efficient method to make an informed decision between candidate text generation models based on preference annotations. DiffUse reduces the required amount of annotations, thus saving valuable time and resources in performing evaluatio… ▽ More

    Submitted 6 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL (main conference)

  3. arXiv:2305.01628  [pdf, other

    cs.CL cs.LG

    The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers

    Authors: Ariel Gera, Roni Friedman, Ofir Arviv, Chulaka Gunasekara, Benjamin Sznajder, Noam Slonim, Eyal Shnarch

    Abstract: Applying language models to natural language processing tasks typically relies on the representations in the final model layer, as intermediate hidden layer representations are presumed to be less informative. In this work, we argue that due to the gradual improvement across model layers, additional information can be gleaned from the contrast between higher and lower layers during inference. Spec… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 9 pages, 8 figures; To be published in ACL 2023

  4. arXiv:2203.15590  [pdf, other

    cs.CL

    Heuristic-based Inter-training to Improve Few-shot Multi-perspective Dialog Summarization

    Authors: Benjamin Sznajder, Chulaka Gunasekara, Guy Lev, Sachin Joshi, Eyal Shnarch, Noam Slonim

    Abstract: Many organizations require their customer-care agents to manually summarize their conversations with customers. These summaries are vital for decision making purposes of the organizations. The perspective of the summary that is required to be created depends on the application of the summaries. With this work, we study the multi-perspective summarization of customer-care conversations between supp… ▽ More

    Submitted 30 March, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    ACM Class: I.2.7

  5. arXiv:2111.11894  [pdf, other

    cs.CL

    TWEETSUMM -- A Dialog Summarization Dataset for Customer Service

    Authors: Guy Feigenblat, Chulaka Gunasekara, Benjamin Sznajder, Sachindra Joshi, David Konopnicki, Ranit Aharonov

    Abstract: In a typical customer service chat scenario, customers contact a support center to ask for help or raise complaints, and human agents try to solve the issues. In most cases, at the end of the conversation, agents are asked to write a short summary emphasizing the problem and the proposed solution, usually for the benefit of other agents that may have to deal with the same customer or issue. The go… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Journal ref: Findings of the Association for Computational Linguistics: EMNLP (2021) 245--260

  6. arXiv:2110.03179  [pdf, other

    cs.CL

    HowSumm: A Multi-Document Summarization Dataset Derived from WikiHow Articles

    Authors: Odellia Boni, Guy Feigenblat, Guy Lev, Michal Shmueli-Scheuer, Benjamin Sznajder, David Konopnicki

    Abstract: We present HowSumm, a novel large-scale dataset for the task of query-focused multi-document summarization (qMDS), which targets the use-case of generating actionable instructions from a set of sources. This use-case is different from the use-cases covered in existing multi-document summarization (MDS) datasets and is applicable to educational and industrial scenarios. We employed automatic method… ▽ More

    Submitted 8 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 8 pages, 4 figures, 5 tables. HowSumm dataset is publicly available at \url{https://ibm.biz/BdfhzH}

  7. arXiv:2106.03337  [pdf, other

    cs.CL

    Summary Grounded Conversation Generation

    Authors: Chulaka Gunasekara, Guy Feigenblat, Benjamin Sznajder, Sachindra Joshi, David Konopnicki

    Abstract: Many conversation datasets have been constructed in the recent years using crowdsourcing. However, the data collection process can be time consuming and presents many challenges to ensure data quality. Since language generation has improved immensely in recent years with the advancement of pre-trained language models, we investigate how such models can be utilized to generate entire conversations,… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Findings of ACL - 2021, 9 pages

  8. arXiv:1911.10783  [pdf, other

    cs.CL

    Financial Event Extraction Using Wikipedia-Based Weak Supervision

    Authors: Liat Ein-Dor, Ariel Gera, Orith Toledo-Ronen, Alon Halfon, Benjamin Sznajder, Lena Dankin, Yonatan Bilu, Yoav Katz, Noam Slonim

    Abstract: Extraction of financial and economic events from text has previously been done mostly using rule-based methods, with more recent works employing machine learning techniques. This work is in line with this latter approach, leveraging relevant Wikipedia sections to extract weak labels for sentences describing economic events. Whereas previous weakly supervised approaches required a knowledge-base of… ▽ More

    Submitted 28 November, 2022; v1 submitted 25 November, 2019; originally announced November 2019.

  9. arXiv:1911.10763  [pdf, other

    cs.CL cs.AI cs.IR

    Corpus Wide Argument Mining -- a Working Solution

    Authors: Liat Ein-Dor, Eyal Shnarch, Lena Dankin, Alon Halfon, Benjamin Sznajder, Ariel Gera, Carlos Alzate, Martin Gleize, Leshem Choshen, Yufang Hou, Yonatan Bilu, Ranit Aharonov, Noam Slonim

    Abstract: One of the main tasks in argument mining is the retrieval of argumentative content pertaining to a given topic. Most previous work addressed this task by retrieving a relatively small number of relevant documents as the initial source for such content. This line of research yielded moderate success, which is of limited use in a real-world system. Furthermore, for such a system to yield a comprehen… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Journal ref: AAAI 2020

  10. arXiv:1908.08336  [pdf, other

    cs.CL

    Argument Invention from First Principles

    Authors: Yonatan Bilu, Ariel Gera, Daniel Hershcovich, Benjamin Sznajder, Dan Lahav, Guy Moshkowich, Anael Malet, Assaf Gavron, Noam Slonim

    Abstract: Competitive debaters often find themselves facing a challenging task -- how to debate a topic they know very little about, with only minutes to prepare, and without access to books or the Internet? What they often do is rely on "first principles", commonplace arguments which are relevant to many topics, and which they have refined in past debates. In this work we aim to explicitly define a taxon… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: Presented at ACL 2019

  11. arXiv:1908.07491  [pdf, ps, other

    cs.CL

    Controversy in Context

    Authors: Benjamin Sznajder, Ariel Gera, Yonatan Bilu, Dafna Sheinwald, Ella Rabinovich, Ranit Aharonov, David Konopnicki, Noam Slonim

    Abstract: With the growing interest in social applications of Natural Language Processing and Computational Argumentation, a natural question is how controversial a given concept is. Prior works relied on Wikipedia's metadata and on content analysis of the articles pertaining to a concept in question. Here we show that the immediate textual context of a concept is strongly indicative of this property, and,… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 5 pages

  12. arXiv:1908.06785  [pdf, other

    cs.CL

    Fast End-to-End Wikification

    Authors: Ilya Shnayderman, Liat Ein-Dor, Yosi Mass, Alon Halfon, Benjamin Sznajder, Artem Spector, Yoav Katz, Dafna Sheinwald, Ranit Aharonov, Noam Slonim

    Abstract: Wikification of large corpora is beneficial for various NLP applications. Existing methods focus on quality performance rather than run-time, and are therefore non-feasible for large data. Here, we introduce RedW, a run-time oriented Wikification solution, based on Wikipedia redirects, that can Wikify massive corpora with competitive performance. We further propose an efficient method for estimati… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  13. arXiv:1809.01285  [pdf, ps, other

    cs.CL

    Learning Concept Abstractness Using Weak Supervision

    Authors: Ella Rabinovich, Benjamin Sznajder, Artem Spector, Ilya Shnayderman, Ranit Aharonov, David Konopnicki, Noam Slonim

    Abstract: We introduce a weakly supervised approach for inferring the property of abstractness of words and expressions in the complete absence of labeled data. Exploiting only minimal linguistic clues and the contextual usage of a concept as manifested in textual data, we train sufficiently powerful classifiers, obtaining high correlation with human labels. The results imply the applicability of this appro… ▽ More

    Submitted 4 September, 2018; originally announced September 2018.

    Comments: 6 pages, EMNLP 2018