Skip to main content

Showing 1–3 of 3 results for author: Tonni, S M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19642  [pdf, other

    cs.CL cs.CR cs.LG

    IDT: Dual-Task Adversarial Attacks for Privacy Protection

    Authors: Pedro Faustini, Shakila Mahjabin Tonni, Annabelle McIver, Qiongkai Xu, Mark Dras

    Abstract: Natural language processing (NLP) models may leak private information in different ways, including membership inference, reconstruction or attribute inference attacks. Sensitive information may not be explicit in the text, but hidden in underlying writing characteristics. Methods to protect privacy can involve using representations inside models that are demonstrated not to detect sensitive attrib… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 28 pages, 1 figure

  2. arXiv:2309.10916  [pdf, other

    cs.LG cs.CL

    What Learned Representations and Influence Functions Can Tell Us About Adversarial Examples

    Authors: Shakila Mahjabin Tonni, Mark Dras

    Abstract: Adversarial examples, deliberately crafted using small perturbations to fool deep neural networks, were first studied in image processing and more recently in NLP. While approaches to detecting adversarial examples in NLP have largely relied on search over input perturbations, image processing has seen a range of techniques that aim to characterise adversarial subspaces over the learned representa… ▽ More

    Submitted 10 October, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: 20 pages, Accepted in IJCNLP_AACL 2023

  3. arXiv:2002.06856  [pdf, other

    cs.LG stat.ML

    Data and Model Dependencies of Membership Inference Attack

    Authors: Shakila Mahjabin Tonni, Dinusha Vatsalan, Farhad Farokhi, Dali Kaafar, Zhigang Lu, Gioacchino Tangari

    Abstract: Machine learning (ML) models have been shown to be vulnerable to Membership Inference Attacks (MIA), which infer the membership of a given data point in the target dataset by observing the prediction output of the ML model. While the key factors for the success of MIA have not yet been fully understood, existing defense mechanisms such as using L2 regularization \cite{10shokri2017membership} and d… ▽ More

    Submitted 25 July, 2020; v1 submitted 17 February, 2020; originally announced February 2020.