Search | arXiv e-print repository

A Closer Look at the Adversarial Robustness of Information Bottleneck Models

Authors: Iryna Korshunova, David Stutz, Alexander A. Alemi, Olivia Wiles, Sven Gowal

Abstract: We study the adversarial robustness of information bottleneck models for classification. Previous works showed that the robustness of models trained with information bottlenecks can improve upon adversarial training. Our evaluation under a diverse range of white-box $l_{\infty}$ attacks suggests that information bottlenecks alone are not a strong defense strategy, and that previous results were li… ▽ More We study the adversarial robustness of information bottleneck models for classification. Previous works showed that the robustness of models trained with information bottlenecks can improve upon adversarial training. Our evaluation under a diverse range of white-box $l_{\infty}$ attacks suggests that information bottlenecks alone are not a strong defense strategy, and that previous results were likely influenced by gradient obfuscation. △ Less

Submitted 12 July, 2021; originally announced July 2021.

arXiv:1909.01436 [pdf, other]

Discriminative Topic Modeling with Logistic LDA

Authors: Iryna Korshunova, Hanchen Xiong, Mateusz Fedoryszak, Lucas Theis

Abstract: Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections of non-categorical items is still challenging. Yet many problems with much richer data share a similar structure and could benefit from the vast literature on LDA. We propose logistic LDA, a novel discriminative variant of latent Dirichlet allocation which is easy to apply to arbitrary inputs. In par… ▽ More Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections of non-categorical items is still challenging. Yet many problems with much richer data share a similar structure and could benefit from the vast literature on LDA. We propose logistic LDA, a novel discriminative variant of latent Dirichlet allocation which is easy to apply to arbitrary inputs. In particular, our model can easily be applied to groups of images, arbitrary text embeddings, and integrates well with deep neural networks. Although it is a discriminative model, we show that logistic LDA can learn from unlabeled data in an unsupervised manner by exploiting the group structure present in the data. In contrast to other recent topic models designed to handle arbitrary inputs, our model does not sacrifice the interpretability and principled motivation of LDA. △ Less

Submitted 7 January, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

Journal ref: Advances in Neural Information Processing Systems 32, 2019

arXiv:1801.05787 [pdf, other]

Faster gaze prediction with dense networks and Fisher pruning

Authors: Lucas Theis, Iryna Korshunova, Alykhan Tejani, Ferenc Huszár

Abstract: Predicting human fixations from images has recently seen large improvements by leveraging deep representations which were pretrained for object recognition. However, as we show in this paper, these networks are highly overparameterized for the task of fixation prediction. We first present a simple yet principled greedy pruning method which we call Fisher pruning. Through a combination of knowledge… ▽ More Predicting human fixations from images has recently seen large improvements by leveraging deep representations which were pretrained for object recognition. However, as we show in this paper, these networks are highly overparameterized for the task of fixation prediction. We first present a simple yet principled greedy pruning method which we call Fisher pruning. Through a combination of knowledge distillation and Fisher pruning, we obtain much more runtime-efficient architectures for saliency prediction, achieving a 10x speedup for the same AUC performance as a state of the art network on the CAT2000 dataset. Speeding up single-image gaze prediction is important for many real-world applications, but it is also a crucial step in the development of video saliency models, where the amount of data to be processed is substantially larger. △ Less

Submitted 9 July, 2018; v1 submitted 17 January, 2018; originally announced January 2018.

arXiv:1611.09577 [pdf, other]

Fast Face-swap Using Convolutional Neural Networks

Authors: Iryna Korshunova, Wenzhe Shi, Joni Dambre, Lucas Theis

Abstract: We consider the problem of face swap** in images, where an input identity is transformed into a target identity while preserving pose, facial expression, and lighting. To perform this map**, we use convolutional neural networks trained to capture the appearance of the target identity from an unstructured collection of his/her photographs.This approach is enabled by framing the face swap** pr… ▽ More We consider the problem of face swap** in images, where an input identity is transformed into a target identity while preserving pose, facial expression, and lighting. To perform this map**, we use convolutional neural networks trained to capture the appearance of the target identity from an unstructured collection of his/her photographs.This approach is enabled by framing the face swap** problem in terms of style transfer, where the goal is to render an image in the style of another one. Building on recent advances in this area, we devise a new loss function that enables the network to produce highly photorealistic results. By combining neural networks with simple pre- and post-processing steps, we aim at making face swap work in real-time with no input from the user. △ Less

Submitted 27 July, 2017; v1 submitted 29 November, 2016; originally announced November 2016.

arXiv:1604.08723 [pdf, other]

Music transcription modelling and composition using deep learning

Authors: Bob L. Sturm, João Felipe Santos, Oded Ben-Tal, Iryna Korshunova

Abstract: We apply deep learning methods, specifically long short-term memory (LSTM) networks, to music transcription modelling and composition. We build and train LSTM networks using approximately 23,000 music transcriptions expressed with a high-level vocabulary (ABC notation), and use them to generate new transcriptions. Our practical aim is to create music transcription models useful in particular conte… ▽ More We apply deep learning methods, specifically long short-term memory (LSTM) networks, to music transcription modelling and composition. We build and train LSTM networks using approximately 23,000 music transcriptions expressed with a high-level vocabulary (ABC notation), and use them to generate new transcriptions. Our practical aim is to create music transcription models useful in particular contexts of music composition. We present results from three perspectives: 1) at the population level, comparing descriptive statistics of the set of training transcriptions and generated transcriptions; 2) at the individual level, examining how a generated transcription reflects the conventions of a music practice in the training transcriptions (Celtic folk); 3) at the application level, using the system for idea generation in music composition. We make our datasets, software and sound examples open and available: \url{https://github.com/IraKorshunova/folk-rnn}. △ Less

Submitted 29 April, 2016; originally announced April 2016.

Comments: 16 pages, 4 figures, contribution to 1st Conference on Computer Simulation of Musical Creativity

Showing 1–5 of 5 results for author: Korshunova, I