-
A Closer Look at the Adversarial Robustness of Information Bottleneck Models
Authors:
Iryna Korshunova,
David Stutz,
Alexander A. Alemi,
Olivia Wiles,
Sven Gowal
Abstract:
We study the adversarial robustness of information bottleneck models for classification. Previous works showed that the robustness of models trained with information bottlenecks can improve upon adversarial training. Our evaluation under a diverse range of white-box $l_{\infty}$ attacks suggests that information bottlenecks alone are not a strong defense strategy, and that previous results were likely influenced by gradient obfuscation.
Submitted 12 July, 2021;
originally announced July 2021.
-
Discriminative Topic Modeling with Logistic LDA
Authors:
Iryna Korshunova,
Hanchen Xiong,
Mateusz Fedoryszak,
Lucas Theis
Abstract:
Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections of non-categorical items is still challenging. Yet many problems with much richer data share a similar structure and could benefit from the vast literature on LDA. We propose logistic LDA, a novel discriminative variant of latent Dirichlet allocation which is easy to apply to arbitrary inputs. In particular, our model can easily be applied to groups of images and arbitrary text embeddings, and it integrates well with deep neural networks. Although it is a discriminative model, we show that logistic LDA can learn from unlabeled data in an unsupervised manner by exploiting the group structure present in the data. In contrast to other recent topic models designed to handle arbitrary inputs, our model does not sacrifice the interpretability and principled motivation of LDA.
Submitted 7 January, 2020; v1 submitted 3 September, 2019;
originally announced September 2019.
-
Faster gaze prediction with dense networks and Fisher pruning
Authors:
Lucas Theis,
Iryna Korshunova,
Alykhan Tejani,
Ferenc Huszár
Abstract:
Predicting human fixations from images has recently seen large improvements by leveraging deep representations which were pretrained for object recognition. However, as we show in this paper, these networks are highly overparameterized for the task of fixation prediction. We first present a simple yet principled greedy pruning method which we call Fisher pruning. Through a combination of knowledge distillation and Fisher pruning, we obtain much more runtime-efficient architectures for saliency prediction, achieving a 10x speedup for the same AUC performance as a state-of-the-art network on the CAT2000 dataset. Speeding up single-image gaze prediction is important for many real-world applications, but it is also a crucial step in the development of video saliency models, where the amount of data to be processed is substantially larger.
Submitted 9 July, 2018; v1 submitted 17 January, 2018;
originally announced January 2018.
-
Fast Face-swap Using Convolutional Neural Networks
Authors:
Iryna Korshunova,
Wenzhe Shi,
Joni Dambre,
Lucas Theis
Abstract:
We consider the problem of face swapping in images, where an input identity is transformed into a target identity while preserving pose, facial expression, and lighting. To perform this mapping, we use convolutional neural networks trained to capture the appearance of the target identity from an unstructured collection of his/her photographs. This approach is enabled by framing the face swapping problem in terms of style transfer, where the goal is to render an image in the style of another one. Building on recent advances in this area, we devise a new loss function that enables the network to produce highly photorealistic results. By combining neural networks with simple pre- and post-processing steps, we aim at making face swap work in real-time with no input from the user.
Submitted 27 July, 2017; v1 submitted 29 November, 2016;
originally announced November 2016.
-
Music transcription modelling and composition using deep learning
Authors:
Bob L. Sturm,
João Felipe Santos,
Oded Ben-Tal,
Iryna Korshunova
Abstract:
We apply deep learning methods, specifically long short-term memory (LSTM) networks, to music transcription modelling and composition. We build and train LSTM networks using approximately 23,000 music transcriptions expressed with a high-level vocabulary (ABC notation), and use them to generate new transcriptions. Our practical aim is to create music transcription models useful in particular contexts of music composition. We present results from three perspectives: 1) at the population level, comparing descriptive statistics of the set of training transcriptions and generated transcriptions; 2) at the individual level, examining how a generated transcription reflects the conventions of a music practice in the training transcriptions (Celtic folk); 3) at the application level, using the system for idea generation in music composition. We make our datasets, software and sound examples open and available: https://github.com/IraKorshunova/folk-rnn.
Submitted 29 April, 2016;
originally announced April 2016.