Skip to main content

Showing 1–16 of 16 results for author: Vollgraf, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2107.03742  [pdf, other

    cs.CV cs.LG

    Grid Partitioned Attention: Efficient TransformerApproximation with Inductive Bias for High Resolution Detail Generation

    Authors: Nikolay Jetchev, Gökhan Yildirim, Christian Bracher, Roland Vollgraf

    Abstract: Attention is a general reasoning mechanism than can flexibly deal with image information, but its memory requirements had made it so far impractical for high resolution image generation. We present Grid Partitioned Attention (GPA), a new approximate attention algorithm that leverages a sparse inductive bias for higher computational and memory efficiency in image domains: queries attend only to few… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: code available at https://github.com/zalandoresearch/gpa

  2. arXiv:2101.12072  [pdf, other

    cs.LG cs.AI

    Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting

    Authors: Kashif Rasul, Calvin Seward, Ingmar Schuster, Roland Vollgraf

    Abstract: In this work, we propose \texttt{TimeGrad}, an autoregressive model for multivariate probabilistic time series forecasting which samples from the data distribution at each time step by estimating its gradient. To this end, we use diffusion probabilistic models, a class of latent variable models closely connected to score matching and energy-based methods. Our model learns gradients by optimizing a… ▽ More

    Submitted 2 February, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:8857-8868, 2021

  3. arXiv:2006.04942  [pdf, other

    cs.SI cs.LG stat.ML

    CRISP: A Probabilistic Model for Individual-Level COVID-19 Infection Risk Estimation Based on Contact Data

    Authors: Ralf Herbrich, Rajeev Rastogi, Roland Vollgraf

    Abstract: We present CRISP (COVID-19 Risk Score Prediction), a probabilistic graphical model for COVID-19 infection spread through a population based on the SEIR model where we assume access to (1) mutual contacts between pairs of individuals across time across various channels (e.g., Bluetooth contact traces), as well as (2) test outcomes at given times for infection, exposure and immunity tests. Our micro… ▽ More

    Submitted 30 June, 2022; v1 submitted 9 June, 2020; originally announced June 2020.

  4. arXiv:2002.06103  [pdf, other

    cs.LG stat.ML

    Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

    Authors: Kashif Rasul, Abdul-Saboor Sheikh, Ingmar Schuster, Urs Bergmann, Roland Vollgraf

    Abstract: Time series forecasting is often fundamental to scientific and engineering problems and enables decision making. With ever increasing data set sizes, a trivial solution to scale up predictions is to assume independence between interacting time series. However, modeling statistical dependencies can improve accuracy and enable analysis of interaction effects. Deep learning methods are well suited fo… ▽ More

    Submitted 14 January, 2021; v1 submitted 14 February, 2020; originally announced February 2020.

  5. arXiv:1909.02775  [pdf, other

    cs.LG stat.ML

    Set Flow: A Permutation Invariant Normalizing Flow

    Authors: Kashif Rasul, Ingmar Schuster, Roland Vollgraf, Urs Bergmann

    Abstract: We present a generative model that is defined on finite sets of exchangeable, potentially high dimensional, data. As the architecture is an extension of RealNVPs, it inherits all its favorable properties, such as being invertible and allowing for exact log-likelihood evaluation. We show that this architecture is able to learn finite non-i.i.d. set data distributions, learn statistical dependencies… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

  6. arXiv:1908.08847  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Generating High-Resolution Fashion Model Images Wearing Custom Outfits

    Authors: Gökhan Yildirim, Nikolay Jetchev, Roland Vollgraf, Urs Bergmann

    Abstract: Visualizing an outfit is an essential part of shop** for clothes. Due to the combinatorial aspect of combining fashion articles, the available images are limited to a pre-determined set of outfits. In this paper, we broaden these visualizations by generating high-resolution images of fashion models wearing a custom outfit under an input body pose. We show that our approach can not only transfer… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Accepted to the International Conference on Computer Vision, ICCV 2019, Workshop on Computer Vision for Fashion, Art and Design

  7. A Deep Learning System for Predicting Size and Fit in Fashion E-Commerce

    Authors: Abdul-Saboor Sheikh, Romain Guigoures, Evgenii Koriagin, Yuen King Ho, Reza Shirvany, Roland Vollgraf, Urs Bergmann

    Abstract: Personalized size and fit recommendations bear crucial significance for any fashion e-commerce platform. Predicting the correct fit drives customer satisfaction and benefits the business by reducing costs incurred due to size-related returns. Traditional collaborative filtering algorithms seek to model customer preferences based on their previous orders. A typical challenge for such methods stems… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: Published at the Thirteenth ACM Conference on Recommender Systems (RecSys '19), September 16--20, 2019, Copenhagen, Denmark

  8. arXiv:1906.09400  [pdf, other

    cs.LG stat.ML

    Learning Set-equivariant Functions with SWARM Map**s

    Authors: Roland Vollgraf

    Abstract: In this work we propose a new neural network architecture that efficiently implements and learns general purpose set-equivariant functions. Such a function f maps a set of entities x = {x1, . . . , xn} from one domain to a set of same cardinality y = f (x) = {y1, . . . , yn} in another domain regardless of the ordering of the entities. The architecture is based on a gated recurrent network which i… ▽ More

    Submitted 20 September, 2019; v1 submitted 22 June, 2019; originally announced June 2019.

  9. arXiv:1902.03657  [pdf, other

    cs.LG stat.ML

    A Bandit Framework for Optimal Selection of Reinforcement Learning Agents

    Authors: Andreas Merentitis, Kashif Rasul, Roland Vollgraf, Abdul-Saboor Sheikh, Urs Bergmann

    Abstract: Deep Reinforcement Learning has been shown to be very successful in complex games, e.g. Atari or Go. These games have clearly defined rules, and hence allow simulation. In many practical applications, however, interactions with the environment are costly and a good simulator of the environment is not available. Further, as environments differ by application, the optimal inductive bias (architectur… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

    Comments: Published at the 32nd Conference on Neural Information Processing Systems (NIPS 2018), Montreal, Canada. Deep Reinforcement Learning Workshop

  10. Studio2Shop: from studio photo shoots to fashion articles

    Authors: Julia Lasserre, Katharina Rasch, Roland Vollgraf

    Abstract: Fashion is an increasingly important topic in computer vision, in particular the so-called street-to-shop task of matching street images with shop images containing similar fashion items. Solving this problem promises new means of making fashion searchable and hel** shoppers find the articles they are looking for. This paper focuses on finding pieces of clothing worn by a person in full-body or… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: 12 pages, 9 figures (Figure 1 has 5 subfigures, Figure 2 has 3 subfigures), 7 tables

    Journal ref: Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods (January 16-18, 2018, in Funchal, Madeira, Portugal), Vol. 1 (ISBN 978-989-758-276-9), P. 37-48

  11. arXiv:1803.03665  [pdf, other

    cs.CL cs.LG

    Syntax-Aware Language Modeling with Recurrent Neural Networks

    Authors: Duncan Blythe, Alan Akbik, Roland Vollgraf

    Abstract: Neural language models (LMs) are typically trained using only lexical features, such as surface forms of words. In this paper, we argue this deprives the LM of crucial syntactic signals that can be detected at high confidence using existing parsers. We present a simple but highly effective approach for training neural LMs using both lexical and syntactic information, and a novel approach for apply… ▽ More

    Submitted 2 March, 2018; originally announced March 2018.

  12. arXiv:1708.07747  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

    Authors: Han Xiao, Kashif Rasul, Roland Vollgraf

    Abstract: We present Fashion-MNIST, a new dataset comprising of 28x28 grayscale images of 70,000 fashion products from 10 categories, with 7,000 images per category. The training set has 60,000 images and the test set has 10,000 images. Fashion-MNIST is intended to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms, as it shares the same image s… ▽ More

    Submitted 15 September, 2017; v1 submitted 25 August, 2017; originally announced August 2017.

    Comments: Dataset is freely available at https://github.com/zalandoresearch/fashion-mnist Benchmark is available at http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/

  13. arXiv:1708.07347  [pdf, other

    cs.IR cs.LG

    An LSTM-Based Dynamic Customer Model for Fashion Recommendation

    Authors: Sebastian Heinz, Christian Bracher, Roland Vollgraf

    Abstract: Online fashion sales present a challenging use case for personalized recommendation: Stores offer a huge variety of items in multiple sizes. Small stocks, high return rates, seasonality, and changing trends cause continuous turnover of articles for sale on all time scales. Customers tend to shop rarely, but often buy multiple items at once. We report on backtest experiments with sales data of 100k… ▽ More

    Submitted 24 August, 2017; originally announced August 2017.

  14. arXiv:1705.06566  [pdf, other

    cs.CV stat.ML

    Learning Texture Manifolds with the Periodic Spatial GAN

    Authors: Urs Bergmann, Nikolay Jetchev, Roland Vollgraf

    Abstract: This paper introduces a novel approach to texture synthesis based on generative adversarial networks (GAN) (Goodfellow et al., 2014). We extend the structure of the input noise distribution by constructing tensors with different types of dimensions. We call this technique Periodic Spatial GAN (PSGAN). The PSGAN has several novel abilities which surpass the current state of the art in texture synth… ▽ More

    Submitted 8 September, 2017; v1 submitted 18 May, 2017; originally announced May 2017.

    Comments: Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 2017. JMLR: W&CP. Copyright 2017 by the author(s)

  15. arXiv:1611.08207  [pdf, other

    cs.CV stat.ML

    Texture Synthesis with Spatial Generative Adversarial Networks

    Authors: Nikolay Jetchev, Urs Bergmann, Roland Vollgraf

    Abstract: Generative adversarial networks (GANs) are a recent approach to train generative models of data, which have been shown to work particularly well on image data. In the current paper we introduce a new model for texture synthesis based on GAN learning. By extending the input noise distribution space from a single vector to a whole spatial tensor, we create an architecture with properties well suited… ▽ More

    Submitted 8 September, 2017; v1 submitted 24 November, 2016; originally announced November 2016.

    Comments: presented at the NIPS 2016 adversarial learning workshop, Barcelona, Spain

  16. arXiv:1609.02489  [pdf, other

    cs.IR cs.LG

    Fashion DNA: Merging Content and Sales Data for Recommendation and Article Map**

    Authors: Christian Bracher, Sebastian Heinz, Roland Vollgraf

    Abstract: We present a method to determine Fashion DNA, coordinate vectors locating fashion items in an abstract space. Our approach is based on a deep neural network architecture that ingests curated article information such as tags and images, and is trained to predict sales for a large set of frequent customers. In the process, a dual space of customer style preferences naturally arises. Interpretation o… ▽ More

    Submitted 8 September, 2016; originally announced September 2016.

    Comments: 10 pages, 13 figures. Paper presented at the workshop "Machine Learning Meets Fashion," KDD 2016 Conference, San Francisco, USA, March 14, 2016