Skip to main content

Showing 1–5 of 5 results for author: Kariyappa, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.02625  [pdf, other

    cs.LG cs.AI stat.ML

    Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions

    Authors: Sanjay Kariyappa, Freddy Lécué, Saumitra Mishra, Christopher Pond, Daniele Magazzeni, Manuela Veloso

    Abstract: This paper proposes Progressive Inference - a framework to compute input attributions to explain the predictions of decoder-only sequence classification models. Our work is based on the insight that the classification head of a decoder-only Transformer model can be used to make intermediate predictions by evaluating them at different points in the input sequence. Due to the causal attention mechan… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2104.02261  [pdf, other

    cs.CR cs.LG stat.ML

    Enabling Inference Privacy with Adaptive Noise Injection

    Authors: Sanjay Kariyappa, Ousmane Dia, Moinuddin K Qureshi

    Abstract: User-facing software services are becoming increasingly reliant on remote servers to host Deep Neural Network (DNN) models, which perform inference tasks for the clients. Such services require the client to send input data to the service provider, who processes it using a DNN and returns the output predictions to the client. Due to the rich nature of the inputs such as images and speech, the input… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

  3. arXiv:2005.03161  [pdf, other

    stat.ML cs.LG

    MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation

    Authors: Sanjay Kariyappa, Atul Prakash, Moinuddin Qureshi

    Abstract: Model Stealing (MS) attacks allow an adversary with black-box access to a Machine Learning model to replicate its functionality, compromising the confidentiality of the model. Such attacks train a clone model by using the predictions of the target model for different inputs. The effectiveness of such attacks relies heavily on the availability of data necessary to query the target model. Existing a… ▽ More

    Submitted 28 October, 2022; v1 submitted 6 May, 2020; originally announced May 2020.

  4. arXiv:1911.07100  [pdf, other

    stat.ML cs.CR cs.LG

    Defending Against Model Stealing Attacks with Adaptive Misinformation

    Authors: Sanjay Kariyappa, Moinuddin K Qureshi

    Abstract: Deep Neural Networks (DNNs) are susceptible to model stealing attacks, which allows a data-limited adversary with no knowledge of the training dataset to clone the functionality of a target model, just by using black-box query access. Such attacks are typically carried out by querying the target model using inputs that are synthetically generated or sampled from a surrogate dataset to construct a… ▽ More

    Submitted 16 November, 2019; originally announced November 2019.

  5. arXiv:1901.09981  [pdf, other

    stat.ML cs.LG

    Improving Adversarial Robustness of Ensembles with Diversity Training

    Authors: Sanjay Kariyappa, Moinuddin K. Qureshi

    Abstract: Deep Neural Networks are vulnerable to adversarial attacks even in settings where the attacker has no direct access to the model being attacked. Such attacks usually rely on the principle of transferability, whereby an attack crafted on a surrogate model tends to transfer to the target model. We show that an ensemble of models with misaligned loss gradients can provide an effective defense against… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.