Skip to main content

Showing 1–12 of 12 results for author: Carmeli, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.14705  [pdf, other

    cs.AI cs.CL

    Concept-Best-Matching: Evaluating Compositionality in Emergent Communication

    Authors: Boaz Carmeli, Yonatan Belinkov, Ron Meir

    Abstract: Artificial agents that learn to communicate in order to accomplish a given task acquire communication protocols that are typically opaque to a human. A large body of work has attempted to evaluate the emergent communication via various evaluation measures, with \emph{compositionality} featuring as a prominent desired trait. However, current evaluation procedures do not directly expose the composit… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  2. arXiv:2401.14367  [pdf, other

    cs.CL cs.AI cs.LG

    Genie: Achieving Human Parity in Content-Grounded Datasets Generation

    Authors: Asaf Yehudai, Boaz Carmeli, Yosi Mass, Ofir Arviv, Nathaniel Mills, Assaf Toledo, Eyal Shnarch, Leshem Choshen

    Abstract: The lack of high-quality data for content-grounded generation tasks has been identified as a major obstacle to advancing these tasks. To address this gap, we propose Genie, a novel method for automatically generating high-quality content-grounded data. It consists of three stages: (a) Content Preparation, (b) Generation: creating task-specific examples from the content (e.g., question-answer pairs… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR24

  3. arXiv:2303.01593  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    QAID: Question Answering Inspired Few-shot Intent Detection

    Authors: Asaf Yehudai, Matan Vetzler, Yosi Mass, Koren Lazar, Doron Cohen, Boaz Carmeli

    Abstract: Intent detection with semantically similar fine-grained intents is a challenging task. To address it, we reformulate intent detection as a question-answering retrieval task by treating utterances and intent names as questions and answers. To that end, we utilize a question-answering retrieval architecture and adopt a two stages training schema with batch contrastive loss. In the pre-training stage… ▽ More

    Submitted 21 March, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: ICLR paper

  4. arXiv:2211.16259  [pdf, other

    cs.CL

    Measuring the Measuring Tools: An Automatic Evaluation of Semantic Metrics for Text Corpora

    Authors: George Kour, Samuel Ackerman, Orna Raz, Eitan Farchi, Boaz Carmeli, Ateret Anaby-Tavor

    Abstract: The ability to compare the semantic similarity between text corpora is important in a variety of natural language processing applications. However, standard methods for evaluating these metrics have yet to be established. We propose a set of automatic and interpretable measures for assessing the characteristics of corpus-level semantic similarity metrics, allowing sensible comparison of their beha… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Published at GEM (https://gem-benchmark.com/workshop) workshop at the Empirical Methods in Natural Language Processing (EMNLP) conference in 2022

  5. arXiv:2211.02412  [pdf, other

    cs.AI cs.MA

    Emergent Quantized Communication

    Authors: Boaz Carmeli, Ron Meir, Yonatan Belinkov

    Abstract: The field of emergent communication aims to understand the characteristics of communication as it emerges from artificial agents solving tasks that require information exchange. Communication with discrete messages is considered a desired characteristic, for both scientific and applied reasons. However, training a multi-agent system with discrete communication is not straightforward, requiring eit… ▽ More

    Submitted 19 January, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    MSC Class: 68T07 ACM Class: I.2.6

  6. arXiv:2210.11905  [pdf, other

    cs.CL

    Exploration of the Usage of Color Terms by Color-blind Participants in Online Discussion Platforms

    Authors: Ella Rabinovich, Boaz Carmeli

    Abstract: Prominent questions about the role of sensory vs. linguistic input in the way we acquire and use language have been extensively studied in the psycholinguistic literature. However, the relative effect of various factors in a person's overall experience on their linguistic system remains unclear. We study this question by making a step forward towards a better understanding of the conceptual percep… ▽ More

    Submitted 30 October, 2022; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022 (main conference), 13 pages

  7. arXiv:2206.11219  [pdf, other

    cs.CL

    Understanding the Properties of Generated Corpora

    Authors: Naama Zwerdling, Segev Shlomov, Esther Goldbraich, George Kour, Boaz Carmeli, Naama Tepper, Inbal Ronen, Vitaly Zabershinsky, Ateret Anaby-Tavor

    Abstract: Models for text generation have become focal for many research tasks and especially for the generation of sentence corpora. However, understanding the properties of an automatically generated text corpus remains challenging. We propose a set of tools that examine the properties of generated text corpora. Applying these tools on various generated corpora allowed us to gain new insights into the pro… ▽ More

    Submitted 27 October, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  8. arXiv:2202.10137  [pdf, other

    cs.CL eess.AS

    A new data augmentation method for intent classification enhancement and its application on spoken conversation datasets

    Authors: Zvi Kons, Aharon Satt, Hong-Kwang Kuo, Samuel Thomas, Boaz Carmeli, Ron Hoory, Brian Kingsbury

    Abstract: Intent classifiers are vital to the successful operation of virtual agent systems. This is especially so in voice activated systems where the data can be noisy with many ambiguous directions for user intents. Before operation begins, these classifiers are generally lacking in real-world training data. Active learning is a common approach used to help label large amounts of collected user input. Ho… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: \c{opyright} 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  9. arXiv:2110.12412  [pdf, other

    cs.CL cs.AI cs.LG

    Improved Goal Oriented Dialogue via Utterance Generation and Look Ahead

    Authors: Eyal Ben-David, Boaz Carmeli, Ateret Anaby-Tavor

    Abstract: Goal oriented dialogue systems have become a prominent customer-care interaction channel for most businesses. However, not all interactions are smooth, and customer intent misunderstanding is a major cause of dialogue failure. We show that intent prediction can be improved by training a deep text-to-text neural model to generate successive user utterances from unlabeled dialogue data. For that, we… ▽ More

    Submitted 24 October, 2021; originally announced October 2021.

  10. arXiv:1911.03118  [pdf, other

    cs.CL cs.LG

    Not Enough Data? Deep Learning to the Rescue!

    Authors: Ateret Anaby-Tavor, Boaz Carmeli, Esther Goldbraich, Amir Kantor, George Kour, Segev Shlomov, Naama Tepper, Naama Zwerdling

    Abstract: Based on recent advances in natural language modeling and those in text generation capabilities, we propose a novel data augmentation method for text classification tasks. We use a powerful pre-trained neural network model to artificially synthesize new labeled data for supervised learning. We mainly focus on cases with scarce labeled data. Our method, referred to as language-model-based data augm… ▽ More

    Submitted 27 November, 2019; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: 20 pages

  11. arXiv:1901.03995  [pdf, other

    cs.LG stat.ML

    Neural network gradient-based learning of black-box function interfaces

    Authors: Alon Jacovi, Guy Hadash, Einat Kermany, Boaz Carmeli, Ofer Lavi, George Kour, Jonathan Berant

    Abstract: Deep neural networks work well at approximating complicated functions when provided with data and trained by gradient descent methods. At the same time, there is a vast amount of existing functions that programmatically solve different tasks in a precise manner eliminating the need for training. In many cases, it is possible to decompose a task to a series of functions, of which for some we may pr… ▽ More

    Submitted 13 January, 2019; originally announced January 2019.

    Comments: Published as a conference paper at ICLR 2019

  12. arXiv:1804.09028  [pdf, other

    cs.LG cs.CL stat.ML

    Estimate and Replace: A Novel Approach to Integrating Deep Neural Networks with Existing Applications

    Authors: Guy Hadash, Einat Kermany, Boaz Carmeli, Ofer Lavi, George Kour, Alon Jacovi

    Abstract: Existing applications include a huge amount of knowledge that is out of reach for deep neural networks. This paper presents a novel approach for integrating calls to existing applications into deep learning architectures. Using this approach, we estimate each application's functionality with an estimator, which is implemented as a deep neural network (DNN). The estimator is then embedded into a ba… ▽ More

    Submitted 24 April, 2018; originally announced April 2018.