Showing 1–11 of 11 results for author: Dankin, L

Searching in archive cs.
  1. arXiv:2208.01483  [pdf, other]

    cs.CL cs.HC

    Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours

    Authors: Eyal Shnarch, Alon Halfon, Ariel Gera, Marina Danilevsky, Yannis Katsis, Leshem Choshen, Martin Santillan Cooper, Dina Epelboim, Zheng Zhang, Dakuo Wang, Lucy Yip, Liat Ein-Dor, Lena Dankin, Ilya Shnayderman, Ranit Aharonov, Yunyao Li, Naftali Liberman, Philip Levin Slesarev, Gwilym Newton, Shila Ofek-Koifman, Noam Slonim, Yoav Katz

    Abstract: Text classification can be useful in many real-world scenarios, saving a lot of time for end users. However, building a custom classifier typically requires coding skills and ML knowledge, which poses a significant barrier for many potential users. To lift this barrier, we introduce Label Sleuth, a free open source system for labeling and creating text classifiers. This system is unique for (a) be…

    Submitted 31 October, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: 7 pages, 2 figures; To be published at EMNLP 2022

  2. arXiv:2203.10581  [pdf, other]

    cs.CL cs.LG

    Cluster & Tune: Boost Cold Start Performance in Text Classification

    Authors: Eyal Shnarch, Ariel Gera, Alon Halfon, Lena Dankin, Leshem Choshen, Ranit Aharonov, Noam Slonim

    Abstract: In real-world scenarios, a text classification task often begins with a cold start, when labeled data is scarce. In such cases, the common practice of fine-tuning pre-trained models, such as BERT, for a target classification task, is prone to produce poor performance. We suggest a method to boost the performance of such models by adding an intermediate unsupervised classification task, between the…

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: 9 pages, 6 figures; To be published in ACL 2022

  3. arXiv:2201.02026  [pdf, other]

    cs.CL

    Fortunately, Discourse Markers Can Enhance Language Models for Sentiment Analysis

    Authors: Liat Ein-Dor, Ilya Shnayderman, Artem Spector, Lena Dankin, Ranit Aharonov, Noam Slonim

    Abstract: In recent years, pretrained language models have revolutionized the NLP world, while achieving state of the art performance in various downstream tasks. However, in many cases, these models do not perform well when labeled data is scarce and the model is expected to perform in the zero or few shot setting. Recently, several works have shown that continual pretraining or performing a second phase o…

    Submitted 5 April, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

    Comments: Published in AAAI 2022

  4. arXiv:2110.10577  [pdf, other]

    cs.CL

    Overview of the 2021 Key Point Analysis Shared Task

    Authors: Roni Friedman, Lena Dankin, Yufang Hou, Ranit Aharonov, Yoav Katz, Noam Slonim

    Abstract: We describe the 2021 Key Point Analysis (KPA-2021) shared task on key point analysis that we organized as a part of the 8th Workshop on Argument Mining (ArgMining 2021) at EMNLP 2021. We outline various approaches and discuss the results of the shared task. We expect the task and the findings reported in this paper to be relevant for researchers working on text summarization and argument mining.

    Submitted 20 October, 2021; originally announced October 2021.

  5. arXiv:2010.02665  [pdf, other]

    cs.CL

    Metaphor Interpretation Using Word Embeddings

    Authors: Kfir Bar, Nachum Dershowitz, Lena Dankin

    Abstract: We suggest a model for metaphor interpretation using word embeddings trained over a relatively large corpus. Our system handles nominal metaphors, like "time is money". It generates a ranked list of potential interpretations of given metaphors. Candidate meanings are drawn from collocations of the topic ("time") and vehicle ("money") components, automatically extracted from a dependency-parsed cor…

    Submitted 6 December, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Presented at 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing), 2018

  6. arXiv:1911.10783  [pdf, other]

    cs.CL

    Financial Event Extraction Using Wikipedia-Based Weak Supervision

    Authors: Liat Ein-Dor, Ariel Gera, Orith Toledo-Ronen, Alon Halfon, Benjamin Sznajder, Lena Dankin, Yonatan Bilu, Yoav Katz, Noam Slonim

    Abstract: Extraction of financial and economic events from text has previously been done mostly using rule-based methods, with more recent works employing machine learning techniques. This work is in line with this latter approach, leveraging relevant Wikipedia sections to extract weak labels for sentences describing economic events. Whereas previous weakly supervised approaches required a knowledge-base of…

    Submitted 28 November, 2022; v1 submitted 25 November, 2019; originally announced November 2019.

  7. arXiv:1911.10763  [pdf, other]

    cs.CL cs.AI cs.IR

    Corpus Wide Argument Mining -- a Working Solution

    Authors: Liat Ein-Dor, Eyal Shnarch, Lena Dankin, Alon Halfon, Benjamin Sznajder, Ariel Gera, Carlos Alzate, Martin Gleize, Leshem Choshen, Yufang Hou, Yonatan Bilu, Ranit Aharonov, Noam Slonim

    Abstract: One of the main tasks in argument mining is the retrieval of argumentative content pertaining to a given topic. Most previous work addressed this task by retrieving a relatively small number of relevant documents as the initial source for such content. This line of research yielded moderate success, which is of limited use in a real-world system. Furthermore, for such a system to yield a comprehen…

    Submitted 25 November, 2019; originally announced November 2019.

    Journal ref: AAAI 2020

  8. arXiv:1909.00393  [pdf, other]

    cs.CL cs.AI cs.LG

    A Dataset of General-Purpose Rebuttal

    Authors: Matan Orbach, Yonatan Bilu, Ariel Gera, Yoav Kantor, Lena Dankin, Tamar Lavee, Lili Kotlerman, Shachar Mirkin, Michal Jacovi, Ranit Aharonov, Noam Slonim

    Abstract: In Natural Language Understanding, the task of response generation is usually focused on responses to short texts, such as tweets or a turn in a dialog. Here we present a novel task of producing a critical response to a long argumentative text, and suggest a method based on general rebuttal arguments to address it. We do this in the context of the recently-suggested task of listening comprehension…

    Submitted 1 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  9. arXiv:1907.11889  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Effective Rebuttal: Listening Comprehension using Corpus-Wide Claim Mining

    Authors: Tamar Lavee, Matan Orbach, Lili Kotlerman, Yoav Kantor, Shai Gretz, Lena Dankin, Shachar Mirkin, Michal Jacovi, Yonatan Bilu, Ranit Aharonov, Noam Slonim

    Abstract: Engaging in a live debate requires, among other things, the ability to effectively rebut arguments claimed by your opponent. In particular, this requires identifying these arguments. Here, we suggest doing so by automatically mining claims from a corpus of news articles containing billions of sentences, and searching for them in a given speech. This raises the question of whether such claims indee…

    Submitted 27 July, 2019; originally announced July 2019.

    Comments: 6th Argument Mining Workshop @ ACL 2019

  10. arXiv:1907.08971  [pdf, other]

    cs.LG cs.CL stat.ML

    Are You Convinced? Choosing the More Convincing Evidence with a Siamese Network

    Authors: Martin Gleize, Eyal Shnarch, Leshem Choshen, Lena Dankin, Guy Moshkowich, Ranit Aharonov, Noam Slonim

    Abstract: With the advancement in argument detection, we suggest to pay more attention to the challenging task of identifying the more convincing arguments. Machines capable of responding and interacting with humans in helpful ways have become ubiquitous. We now expect them to discuss with us the more delicate questions in our world, and they should do so armed with effective arguments. But what makes an ar…

    Submitted 23 July, 2019; v1 submitted 21 July, 2019; originally announced July 2019.

    Comments: accepted to ACL 2019 - long paper

  11. arXiv:1609.08389  [pdf]

    cs.CL cs.CY

    A Hackathon for Classical Tibetan

    Authors: Orna Almogi, Lena Dankin, Nachum Dershowitz, Lior Wolf

    Abstract: We describe the course of a hackathon dedicated to the development of linguistic tools for Tibetan Buddhist studies. Over a period of five days, a group of seventeen scholars, scientists, and students developed and compared algorithms for intertextual alignment and text classification, along with some basic language tools, including a stemmer and word segmenter.

    Submitted 31 December, 2018; v1 submitted 27 September, 2016; originally announced September 2016.