Skip to main content

Showing 1–2 of 2 results for author: Makelov, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2307.10163  [pdf, other

    cs.CR cs.LG stat.ML

    Rethinking Backdoor Attacks

    Authors: Alaa Khaddaj, Guillaume Leclerc, Aleksandar Makelov, Kristian Georgiev, Hadi Salman, Andrew Ilyas, Aleksander Madry

    Abstract: In a backdoor attack, an adversary inserts maliciously constructed backdoor examples into a training set to make the resulting model vulnerable to manipulation. Defending against such attacks typically involves viewing these inserted examples as outliers in the training set and using techniques from robust statistics to detect and remove them. In this work, we present a different approach to the… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: ICML 2023

  2. arXiv:1706.06083  [pdf, other

    stat.ML cs.LG cs.NE

    Towards Deep Learning Models Resistant to Adversarial Attacks

    Authors: Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu

    Abstract: Recent work has demonstrated that deep neural networks are vulnerable to adversarial examples---inputs that are almost indistinguishable from natural data and yet classified incorrectly by the network. In fact, some of the latest findings suggest that the existence of adversarial attacks may be an inherent weakness of deep learning models. To address this problem, we study the adversarial robustne… ▽ More

    Submitted 4 September, 2019; v1 submitted 19 June, 2017; originally announced June 2017.

    Comments: ICLR'18