Skip to main content

Showing 1–7 of 7 results for author: Haviv, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17691  [pdf, other

    cs.CV cs.CL

    Not All Similarities Are Created Equal: Leveraging Data-Driven Biases to Inform GenAI Copyright Disputes

    Authors: Uri Hacohen, Adi Haviv, Shahar Sarfaty, Bruria Friedman, Niva Elkin-Koren, Roi Livni, Amit H Bermano

    Abstract: The advent of Generative Artificial Intelligence (GenAI) models, including GitHub Copilot, OpenAI GPT, and Stable Diffusion, has revolutionized content creation, enabling non-professionals to produce high-quality content across various domains. This transformative technology has led to a surge of synthetic content and sparked legal disputes over copyright infringement. To address these challenges,… ▽ More

    Submitted 7 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Presented at ACM CSLAW 2024

  2. arXiv:2310.03707  [pdf, other

    cs.LG cs.CV

    OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks

    Authors: Ofir Bar Tal, Adi Haviv, Amit H. Bermano

    Abstract: Evasion Attacks (EA) are used to test the robustness of trained neural networks by distorting input data to misguide the model into incorrect classifications. Creating these attacks is a challenging task, especially with the ever-increasing complexity of models and datasets. In this work, we introduce a self-supervised, computationally economical method for generating adversarial examples, designe… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: ICCV 2023, AROW Workshop

  3. arXiv:2210.03588  [pdf, other

    cs.CL

    Understanding Transformer Memorization Recall Through Idioms

    Authors: Adi Haviv, Ido Cohen, Jacob Gidron, Roei Schuster, Yoav Goldberg, Mor Geva

    Abstract: To produce accurate predictions, language models (LMs) must balance between generalization and memorization. Yet, little is known about the mechanism by which transformer LMs employ their memorization capacity. When does a model decide to output a memorized phrase, and how is this phrase then retrieved from memory? In this work, we offer the first methodological framework for probing and character… ▽ More

    Submitted 13 February, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

  4. arXiv:2203.16634  [pdf, other

    cs.CL cs.AI cs.LG

    Transformer Language Models without Positional Encodings Still Learn Positional Information

    Authors: Adi Haviv, Ori Ram, Ofir Press, Peter Izsak, Omer Levy

    Abstract: Causal transformer language models (LMs), such as GPT-3, typically require some form of positional encoding, such as positional embeddings. However, we show that LMs without any explicit positional encoding are still competitive with standard models, and that this phenomenon is robust across different datasets, model sizes, and sequence lengths. Probing experiments reveal that such models acquire… ▽ More

    Submitted 5 December, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Findings of EMNLP 2022

  5. arXiv:2201.03533  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    SCROLLS: Standardized CompaRison Over Long Language Sequences

    Authors: Uri Shaham, Elad Segal, Maor Ivgi, Avia Efrat, Ori Yoran, Adi Haviv, Ankit Gupta, Wenhan Xiong, Mor Geva, Jonathan Berant, Omer Levy

    Abstract: NLP benchmarks have largely focused on short texts, such as sentences and paragraphs, even though long texts comprise a considerable amount of natural language in the wild. We introduce SCROLLS, a suite of tasks that require reasoning over long texts. We examine existing long-text datasets, and handpick ones where the text is naturally long, while prioritizing tasks that involve synthesizing infor… ▽ More

    Submitted 11 October, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022

  6. arXiv:2104.09554  [pdf, other

    cs.CL cs.AI cs.LG

    Can Latent Alignments Improve Autoregressive Machine Translation?

    Authors: Adi Haviv, Lior Vassertail, Omer Levy

    Abstract: Latent alignment objectives such as CTC and AXE significantly improve non-autoregressive machine translation models. Can they improve autoregressive models as well? We explore the possibility of training autoregressive machine translation models with latent alignment objectives, and observe that, in practice, this approach results in degenerate models. We provide a theoretical explanation for thes… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted to NAACL 2021

  7. arXiv:2103.05327  [pdf, other

    cs.CL cs.LG

    BERTese: Learning to Speak to BERT

    Authors: Adi Haviv, Jonathan Berant, Amir Globerson

    Abstract: Large pre-trained language models have been shown to encode large amounts of world and commonsense knowledge in their parameters, leading to substantial interest in methods for extracting that knowledge. In past work, knowledge was extracted by taking manually-authored queries and gathering paraphrases for them using a separate pipeline. In this work, we propose a method for automatically rewritin… ▽ More

    Submitted 11 March, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: Accepted to EACL 2021