Skip to main content

Showing 1–1 of 1 results for author: Nizar, N J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2010.16336  [pdf, other

    cs.LG cs.AI cs.CL

    Leveraging Extracted Model Adversaries for Improved Black Box Attacks

    Authors: Naveen Jafer Nizar, Ari Kobren

    Abstract: We present a method for adversarial input generation against black box models for reading comprehension based question answering. Our approach is composed of two steps. First, we approximate a victim black box model via model extraction (Krishna et al., 2020). Second, we use our own white box method to generate input perturbations that cause the approximate model to fail. These perturbed inputs ar… ▽ More

    Submitted 2 November, 2020; v1 submitted 30 October, 2020; originally announced October 2020.

    Journal ref: Analyzing and interpreting neural networks for NLP, 2020