Skip to main content

Showing 1–41 of 41 results for author: Cohen, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.20838  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    einspace: Searching for Neural Architectures from Fundamental Operations

    Authors: Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

    Abstract: Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they did not e.g. create a shift from convolutional structures to transformers. This is not least because the search spaces in NAS often aren't diverse enough to include such transformations a priori. Instead, for NAS to provide greater potential for fundamental design shift… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Project page at https://linusericsson.github.io/einspace/

  2. arXiv:2402.07025  [pdf, other

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen… ▽ More

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  3. arXiv:2306.11623  [pdf, ps, other

    stat.ML cs.LG math.ST

    Mean-field Analysis of Generalization Errors

    Authors: Gholamali Aminian, Samuel N. Cohen, Łukasz Szpruch

    Abstract: We propose a novel framework for exploring weak and $L_2$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$, is… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 49 pages

    MSC Class: 62B10; 60F99; 49N80; 46N30

  4. arXiv:2305.10256  [pdf, other

    econ.EM math.PR stat.ME

    Nowcasting with signature methods

    Authors: Samuel N. Cohen, Silvia Lui, Will Malpass, Giulia Mantoan, Lars Nesheim, Áureo de Paula, Andrew Reeves, Craig Scott, Emma Small, Lingyi Yang

    Abstract: Key economic variables are often published with a significant delay of over a month. The nowcasting literature has arisen to provide fast, reliable estimates of delayed economic indicators and is closely related to filtering methods in signal processing. The path signature is a mathematical object which captures geometric properties of sequential data; it naturally handles missing data from mixed… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Our implementation of this algorithm in Python to reproduce the results is available at https://github.com/alan-turing-institute/Nowcasting_with_signatures To apply this method for your application, see also SigNow at https://github.com/datasciencecampus/SigNow_ONS_Turing

    MSC Class: 60L10; 60L90; 60G35; 62M10; 62M20

  5. arXiv:2207.04711  [pdf, other

    stat.ML cs.LG

    Matching Normalizing Flows and Probability Paths on Manifolds

    Authors: Heli Ben-Hamu, Samuel Cohen, Joey Bose, Brandon Amos, Aditya Grover, Maximilian Nickel, Ricky T. Q. Chen, Yaron Lipman

    Abstract: Continuous Normalizing Flows (CNFs) are a class of generative models that transform a prior distribution to a model distribution by solving an ordinary differential equation (ODE). We propose to train CNFs on manifolds by minimizing probability path divergence (PPD), a novel family of divergences between the probability density path generated by the CNF and a target probability density path. PPD i… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ICML 2022

  6. arXiv:2206.05262  [pdf, other

    cs.LG cs.AI stat.ML

    Meta Optimal Transport

    Authors: Brandon Amos, Samuel Cohen, Giulia Luise, Ievgen Redko

    Abstract: We study the use of amortized optimization to predict optimal transport (OT) maps from the input measures, which we call Meta OT. This helps repeatedly solve similar OT problems between different measures by leveraging the knowledge and information present from past problems to rapidly predict and solve new problems. Otherwise, standard methods ignore the knowledge of the past solutions and subopt… ▽ More

    Submitted 2 June, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: ICML 2023

  7. arXiv:2205.15991  [pdf, other

    q-fin.CP math.PR q-fin.RM q-fin.ST stat.ML

    Hedging option books using neural-SDE market models

    Authors: Samuel N. Cohen, Christoph Reisinger, Sheng Wang

    Abstract: We study the capability of arbitrage-free neural-SDE market models to yield effective strategies for hedging options. In particular, we derive sensitivity-based and minimum-variance-based hedging strategies using these models and examine their performance when applied to various option portfolios using real-world data. Through backtesting analysis over typical and stressed market periods, we show… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    MSC Class: 91B28; 91B70; 62M45; 62P05

  8. arXiv:2203.17128  [pdf, other

    math.NA cs.LG math.AP math.PR stat.ML

    Neural Q-learning for solving PDEs

    Authors: Samuel N. Cohen, Deqing Jiang, Justin Sirignano

    Abstract: Solving high-dimensional partial differential equations (PDEs) is a major challenge in scientific computing. We develop a new numerical method for solving elliptic-type PDEs by adapting the Q-learning algorithm in reinforcement learning. Our "Q-PDE" algorithm is mesh-free and therefore has the potential to overcome the curse of dimensionality. Using a neural tangent kernel (NTK) approach, we prove… ▽ More

    Submitted 24 June, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

    MSC Class: 65N12; 62M45; 37N30; 60H30

  9. arXiv:2202.07148  [pdf, other

    q-fin.CP math.PR q-fin.RM q-fin.ST stat.ML

    Estimating risks of option books using neural-SDE market models

    Authors: Samuel N. Cohen, Christoph Reisinger, Sheng Wang

    Abstract: In this paper, we examine the capacity of an arbitrage-free neural-SDE market model to produce realistic scenarios for the joint dynamics of multiple European options on a single underlying. We subsequently demonstrate its use as a risk simulation engine for option portfolios. Through backtesting analysis, we show that our models are more computationally efficient and accurate for evaluating the V… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    MSC Class: 91B28; 91B70; 62M45; 62P05

  10. arXiv:2111.10637  [pdf, other

    stat.ME

    Gradient-based estimation of linear Hawkes processes with general kernels

    Authors: Álvaro Cartea, Samuel N. Cohen, Saad Labyad

    Abstract: Linear multivariate Hawkes processes (MHP) are a fundamental class of point processes with self-excitation. When estimating parameters for these processes, a difficulty is that the two main error functionals, the log-likelihood and the least squares error (LSE), as well as the evaluation of their gradients, have a quadratic complexity in the number of observed events. In practice, this prohibits t… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: 51 pages, 17 figures, 1 table

    MSC Class: 60G55; 62M09; 90C52; 93E10

  11. arXiv:2110.03684  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Cross-Domain Imitation Learning via Optimal Transport

    Authors: Arnaud Fickinger, Samuel Cohen, Stuart Russell, Brandon Amos

    Abstract: Cross-domain imitation learning studies how to leverage expert demonstrations of one agent to train an imitation agent with a different embodiment or morphology. Comparing trajectories and stationary distributions between the expert and imitation agents is challenging because they live on different systems that may not even have the same dimensionality. We propose Gromov-Wasserstein Imitation Lear… ▽ More

    Submitted 25 April, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  12. arXiv:2106.10272  [pdf, other

    cs.LG stat.ML

    Riemannian Convex Potential Maps

    Authors: Samuel Cohen, Brandon Amos, Yaron Lipman

    Abstract: Modeling distributions on Riemannian manifolds is a crucial component in understanding non-Euclidean data that arises, e.g., in physics and geology. The budding approaches in this space are limited by representational and computational tradeoffs. We propose and study a class of flows that uses convex potentials from Riemannian optimal transport. These are universal and can model distributions on a… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  13. arXiv:2105.11053  [pdf, other

    q-fin.CP math.PR q-fin.RM q-fin.ST stat.ML

    Arbitrage-free neural-SDE market models

    Authors: Samuel N. Cohen, Christoph Reisinger, Sheng Wang

    Abstract: Modelling joint dynamics of liquid vanilla options is crucial for arbitrage-free pricing of illiquid derivatives and managing risks of option trade books. This paper develops a nonparametric model for the European options book respecting underlying financial constraints and while being practically implementable. We derive a state space for prices which are free from static (or model-independent) a… ▽ More

    Submitted 23 August, 2021; v1 submitted 23 May, 2021; originally announced May 2021.

    MSC Class: 91B28; 91B70; 62M45; 62P05

  14. arXiv:2102.07115  [pdf, other

    stat.ML cs.LG

    Sliced Multi-Marginal Optimal Transport

    Authors: Samuel Cohen, Alexander Terenin, Yannik Pitcan, Brandon Amos, Marc Peter Deisenroth, K S Sesh Kumar

    Abstract: Multi-marginal optimal transport enables one to compare multiple probability measures, which increasingly finds application in multi-task learning problems. One practical limitation of multi-marginal transport is computational scalability in the number of measures, samples and dimensionality. In this work, we propose a multi-marginal optimal transport paradigm based on random one-dimensional proje… ▽ More

    Submitted 23 November, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Journal ref: NeurIPS Workshop on Optimal Transport and Machine Learning, 2021

  15. arXiv:2102.07106  [pdf, other

    stat.ML cs.LG

    Healing Products of Gaussian Processes

    Authors: Samuel Cohen, Rendani Mbuvha, Tshilidzi Marwala, Marc Peter Deisenroth

    Abstract: Gaussian processes (GPs) are nonparametric Bayesian models that have been applied to regression and classification problems. One of the approaches to alleviate their cubic training cost is the use of local GP experts trained on subsets of the data. In particular, product-of-expert models combine the predictive distributions of local experts through a tractable product operation. While these expert… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: ICML 2020

  16. arXiv:2102.04263  [pdf, other

    math.OC cs.CE cs.LG econ.GN stat.ML

    Generalised correlated batched bandits via the ARC algorithm with application to dynamic pricing

    Authors: Samuel Cohen, Tanut Treetanthiploet

    Abstract: The Asymptotic Randomised Control (ARC) algorithm provides a rigorous approximation to the optimal strategy for a wide class of Bayesian bandits, while retaining low computational complexity. In particular, the ARC approach provides nearly optimal choices even when the payoffs are correlated or more than the reward is observed. The algorithm is guaranteed to asymptotically optimise the expected di… ▽ More

    Submitted 12 October, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

    MSC Class: 62J12; 90B50; 91B38; 93C41

  17. arXiv:2010.07252  [pdf, other

    math.OC stat.ML

    Asymptotic Randomised Control with applications to bandits

    Authors: Samuel N. Cohen, Tanut Treetanthiploet

    Abstract: We consider a general multi-armed bandit problem with correlated (and simple contextual and restless) elements, as a relaxed control problem. By introducing an entropy regularisation, we obtain a smooth asymptotic approximation to the value function. This yields a novel semi-index approximation of the optimal decision process. This semi-index can be interpreted as explicitly balancing an explorati… ▽ More

    Submitted 3 September, 2022; v1 submitted 14 October, 2020; originally announced October 2020.

    MSC Class: 60J20; 93E35; 90C40; 41A58

  18. arXiv:2008.07648  [pdf, other

    cs.LG stat.ML

    Nonparametric Learning of Two-Layer ReLU Residual Units

    Authors: Zhunxuan Wang, Linyun He, Chunchuan Lyu, Shay B. Cohen

    Abstract: We describe an algorithm that learns two-layer residual units using rectified linear unit (ReLU) activation: suppose the input $\mathbf{x}$ is from a distribution with support space $\mathbb{R}^d$ and the ground-truth generative model is a residual unit of this type, given by $\mathbf{y} = \boldsymbol{B}^\ast\left[\left(\boldsymbol{A}^\ast\mathbf{x}\right)^+ + \mathbf{x}\right]$, where ground-trut… ▽ More

    Submitted 10 December, 2022; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: Published in Transactions on Machine Learning Research (11/2022), slightly typographically revised

  19. arXiv:2007.07105  [pdf, other

    stat.ML cs.LG

    Estimating Barycenters of Measures in High Dimensions

    Authors: Samuel Cohen, Michael Arbel, Marc Peter Deisenroth

    Abstract: Barycentric averaging is a principled way of summarizing populations of measures. Existing algorithms for estimating barycenters typically parametrize them as weighted sums of Diracs and optimize their weights and/or locations. However, these approaches do not scale to high-dimensional settings due to the curse of dimensionality. In this paper, we propose a scalable and general algorithm for estim… ▽ More

    Submitted 14 February, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: In submission

  20. arXiv:2006.12648  [pdf, other

    cs.LG stat.ML

    Aligning Time Series on Incomparable Spaces

    Authors: Samuel Cohen, Giulia Luise, Alexander Terenin, Brandon Amos, Marc Peter Deisenroth

    Abstract: Dynamic time war** (DTW) is a useful method for aligning, comparing and combining time series, but it requires them to live in comparable spaces. In this work, we consider a setting in which time series live on different spaces without a sensible ground metric, causing DTW to become ill-defined. To alleviate this, we propose Gromov dynamic time war** (GDTW), a distance between time series on p… ▽ More

    Submitted 22 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Journal ref: Artificial Intelligence and Statistics, 2021

  21. arXiv:2005.04064  [pdf, other

    cs.LG stat.ML

    Lossy Compression with Distortion Constrained Optimization

    Authors: Ties van Rozendaal, Guillaume Sautière, Taco S. Cohen

    Abstract: When training end-to-end learned models for lossy compression, one has to balance the rate and distortion losses. This is typically done by manually setting a tradeoff parameter $β$, an approach called $β$-VAE. Using this approach it is difficult to target a specific rate or distortion value, because the result can be very sensitive to $β$, and the appropriate value for $β$ depends on the model an… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Accepted as a CVPR 2020 workshop paper: Workshop and Challenge on Learned Image Compression (CLIC)

  22. arXiv:2004.09691  [pdf, other

    cs.LG eess.IV stat.ML

    A Data and Compute Efficient Design for Limited-Resources Deep Learning

    Authors: Mirgahney Mohamed, Gabriele Cesa, Taco S. Cohen, Max Welling

    Abstract: Thanks to their improved data efficiency, equivariant neural networks have gained increased interest in the deep learning community. They have been successfully applied in the medical domain where symmetries in the data can be effectively exploited to build more accurate and robust models. To be able to reach a much larger body of patients, mobile, on-device implementations of deep learning soluti… ▽ More

    Submitted 8 July, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted for poster presentation at the Practical Machine Learning for Develo** Countries (PML4DC) workshop, ICLR 2020

  23. arXiv:2004.04342  [pdf, other

    cs.LG cs.CV stat.ML

    Feedback Recurrent Autoencoder for Video Compression

    Authors: Adam Golinski, Reza Pourreza, Yang Yang, Guillaume Sautiere, Taco S Cohen

    Abstract: Recent advances in deep generative modeling have enabled efficient modeling of high dimensional data distributions and opened up a new horizon for solving data compression problems. Specifically, autoencoder based learned image or video compression solutions are emerging as strong competitors to traditional approaches. In this work, We propose a new network architecture, based on common and well s… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

  24. arXiv:2001.11235  [pdf, other

    cs.LG stat.ML

    Learning Discrete Distributions by Dequantization

    Authors: Emiel Hoogeboom, Taco S. Cohen, Jakub M. Tomczak

    Abstract: Media is generally stored digitally and is therefore discrete. Many successful deep distribution models in deep learning learn a density, i.e., the distribution of a continuous random variable. Naïve optimization on discrete data leads to arbitrarily high likelihoods, and instead, it has become standard practice to add noise to datapoints. In this paper, we present a general framework for dequanti… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

  25. arXiv:1911.04018  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Feedback Recurrent AutoEncoder

    Authors: Yang Yang, Guillaume Sautière, J. Jon Ryu, Taco S Cohen

    Abstract: In this work, we propose a new recurrent autoencoder architecture, termed Feedback Recurrent AutoEncoder (FRAE), for online compression of sequential data with temporal dependency. The recurrent structure of FRAE is designed to efficiently extract the redundancy along the time dimension and allows a compact discrete representation of the data to be learned. We demonstrate its effectiveness in spee… ▽ More

    Submitted 17 February, 2020; v1 submitted 10 November, 2019; originally announced November 2019.

    Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  26. arXiv:1908.05717  [pdf, other

    eess.IV cs.LG stat.ML

    Video Compression With Rate-Distortion Autoencoders

    Authors: Amirhossein Habibian, Ties van Rozendaal, Jakub M. Tomczak, Taco S. Cohen

    Abstract: In this paper we present a a deep generative model for lossy video compression. We employ a model that consists of a 3D autoencoder with a discrete latent space and an autoregressive prior used for entropy coding. Both autoencoder and prior are trained jointly to minimize a rate-distortion loss, which is closely related to the ELBO used in variational autoencoders. Despite its simplicity, we find… ▽ More

    Submitted 13 November, 2019; v1 submitted 14 August, 2019; originally announced August 2019.

    Comments: Accepted to ICCV 2019

  27. arXiv:1906.02481  [pdf, ps, other

    cs.LG hep-th stat.ML

    Covariance in Physics and Convolutional Neural Networks

    Authors: Miranda C. N. Cheng, Vassilis Anagiannis, Maurice Weiler, Pim de Haan, Taco S. Cohen, Max Welling

    Abstract: In this proceeding we give an overview of the idea of covariance (or equivariance) featured in the recent development of convolutional neural networks (CNNs). We study the similarities and differences between the use of covariance in theoretical physics and in the CNN context. Additionally, we demonstrate that the simple assumption of covariance, together with the required properties of locality,… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  28. arXiv:1905.10144  [pdf, other

    cs.LG cs.AI stat.ML

    Adaptive Symmetric Reward Noising for Reinforcement Learning

    Authors: Refael Vivanti, Talya D. Sohlberg-Baris, Shlomo Cohen, Orna Cohen

    Abstract: Recent reinforcement learning algorithms, though achieving impressive results in various fields, suffer from brittle training effects such as regression in results and high sensitivity to initialization and parameters. We claim that some of the brittleness stems from variance differences, i.e. when different environment areas - states and/or actions - have different rewards variance. This causes t… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: 9 pages, 7 figures, conference

  29. arXiv:1904.09585  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Obfuscation for Privacy-preserving Syntactic Parsing

    Authors: Zhifeng Hu, Serhii Havrylov, Ivan Titov, Shay B. Cohen

    Abstract: The goal of homomorphic encryption is to encrypt data such that another party can operate on it without being explicitly exposed to the content of the original data. We introduce an idea for a privacy-preserving transformation on natural language data, inspired by homomorphic encryption. Our primary tool is {\em obfuscation}, relying on the properties of natural language. Specifically, a given Eng… ▽ More

    Submitted 27 May, 2020; v1 submitted 21 April, 2019; originally announced April 2019.

    Comments: Accepted to IWPT 2020

  30. arXiv:1902.04615  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Gauge Equivariant Convolutional Networks and the Icosahedral CNN

    Authors: Taco S. Cohen, Maurice Weiler, Berkay Kicanaoglu, Max Welling

    Abstract: The principle of equivariance to symmetry transformations enables a theoretically grounded approach to neural network architecture design. Equivariant networks have shown excellent performance and data efficiency on vision and medical imaging problems that exhibit symmetries. Here we show how this principle can be extended beyond global symmetries to local gauge transformations. This enables the d… ▽ More

    Submitted 13 May, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: Proceedings of the International Conference on Machine Learning (ICML), 2019

  31. arXiv:1807.04689  [pdf, other

    stat.ML cs.LG

    Explorations in Homeomorphic Variational Auto-Encoding

    Authors: Luca Falorsi, Pim de Haan, Tim R. Davidson, Nicola De Cao, Maurice Weiler, Patrick Forré, Taco S. Cohen

    Abstract: The manifold hypothesis states that many kinds of high-dimensional data are concentrated near a low-dimensional manifold. If the topology of this data manifold is non-trivial, a continuous encoder network cannot embed it in a one-to-one manner without creating holes of low density in the latent space. This is at odds with the Gaussian prior assumption typically made in Variational Auto-Encoders (V… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: 16 pages, 8 figures, ICML workshop on Theoretical Foundations and Applications of Deep Generative Models

  32. arXiv:1807.00583  [pdf, other

    cs.CV cs.LG stat.ML

    Sample Efficient Semantic Segmentation using Rotation Equivariant Convolutional Networks

    Authors: Jasper Linmans, Jim Winkens, Bastiaan S. Veeling, Taco S. Cohen, Max Welling

    Abstract: We propose a semantic segmentation model that exploits rotation and reflection symmetries. We demonstrate significant gains in sample efficiency due to increased weight sharing, as well as improvements in robustness to symmetry transformations. The group equivariant CNN framework is extended for segmentation by introducing a new equivariant (G->Z2)-convolution that transforms feature maps on a gro… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: Presented at the ICML workshop: Towards learning with limited labels: Equivariance, Invariance, and Beyond, 2018

  33. arXiv:1804.04656  [pdf, other

    cs.LG stat.ML

    3D G-CNNs for Pulmonary Nodule Detection

    Authors: Marysia Winkels, Taco S. Cohen

    Abstract: Convolutional Neural Networks (CNNs) require a large amount of annotated data to learn from, which is often difficult to obtain in the medical domain. In this paper we show that the sample complexity of CNNs can be significantly improved by using 3D roto-translation group convolutions (G-Convs) instead of the more conventional translational convolutions. These 3D G-CNNs were applied to the problem… ▽ More

    Submitted 12 April, 2018; originally announced April 2018.

    Journal ref: International conference on Medical Imaging with Deep Learning, 2018

  34. arXiv:1803.10743  [pdf, other

    cs.LG cs.CV stat.ML

    Intertwiners between Induced Representations (with Applications to the Theory of Equivariant Neural Networks)

    Authors: Taco S. Cohen, Mario Geiger, Maurice Weiler

    Abstract: Group equivariant and steerable convolutional neural networks (regular and steerable G-CNNs) have recently emerged as a very effective model class for learning from signal data such as 2D and 3D images, video, and other data where symmetries are present. In geometrical terms, regular G-CNNs represent data in terms of scalar fields ("feature channels"), whereas the steerable G-CNN can also use vect… ▽ More

    Submitted 30 March, 2018; v1 submitted 28 March, 2018; originally announced March 2018.

  35. arXiv:1803.02108  [pdf, other

    cs.LG stat.ML

    HexaConv

    Authors: Emiel Hoogeboom, Jorn W. T. Peters, Taco S. Cohen, Max Welling

    Abstract: The effectiveness of Convolutional Neural Networks stems in large part from their ability to exploit the translation invariance that is inherent in many learning problems. Recently, it was shown that CNNs can exploit other invariances, such as rotation invariance, by using group convolutions instead of planar convolutions. However, for reasons of performance and ease of implementation, it has been… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

  36. arXiv:1801.10130  [pdf, other

    cs.LG stat.ML

    Spherical CNNs

    Authors: Taco S. Cohen, Mario Geiger, Jonas Koehler, Max Welling

    Abstract: Convolutional Neural Networks (CNNs) have become the method of choice for learning problems involving 2D planar images. However, a number of problems of recent interest have created a demand for models that can analyze spherical images. Examples include omnidirectional vision for drones, robots, and autonomous cars, molecular regression problems, and global weather and climate modelling. A naive a… ▽ More

    Submitted 25 February, 2018; v1 submitted 30 January, 2018; originally announced January 2018.

    Comments: Proceedings of the 6th International Conference on Learning Representations (ICLR), 2018

    Journal ref: Proceedings of the International Conference on Learning Representations, 2018

  37. arXiv:1612.08498  [pdf, other

    cs.LG stat.ML

    Steerable CNNs

    Authors: Taco S. Cohen, Max Welling

    Abstract: It has long been recognized that the invariance and equivariance properties of a representation are critically important for success in many vision tasks. In this paper we present Steerable Convolutional Neural Networks, an efficient and flexible class of equivariant convolutional networks. We show that steerable CNNs achieve state of the art results on the CIFAR image classification benchmark. Th… ▽ More

    Submitted 26 December, 2016; originally announced December 2016.

    Journal ref: Proceedings of the International Conference on Learning Representations, 2017

  38. arXiv:1606.00229  [pdf, ps, other

    stat.ME math.OC math.PR

    Uncertainty and filtering of hidden Markov models in discrete time

    Authors: Samuel N. Cohen

    Abstract: We consider the problem of filtering an unseen Markov chain from noisy observations, in the presence of uncertainty regarding the parameters of the processes involved. Using the theory of nonlinear expectations, we describe the uncertainty in terms of a penalty function, which can be propagated forward in time in the place of the filter.

    Submitted 12 May, 2018; v1 submitted 1 June, 2016; originally announced June 2016.

    MSC Class: 62M20; 60G35; 93E11

  39. arXiv:1602.07576  [pdf, ps, other

    cs.LG stat.ML

    Group Equivariant Convolutional Networks

    Authors: Taco S. Cohen, Max Welling

    Abstract: We introduce Group equivariant Convolutional Neural Networks (G-CNNs), a natural generalization of convolutional neural networks that reduces sample complexity by exploiting symmetries. G-CNNs use G-convolutions, a new type of layer that enjoys a substantially higher degree of weight sharing than regular convolution layers. G-convolutions increase the expressive capacity of the network without inc… ▽ More

    Submitted 3 June, 2016; v1 submitted 24 February, 2016; originally announced February 2016.

    Journal ref: Proceedings of the International Conference on Machine Learning (ICML), 2016

  40. arXiv:1505.04413  [pdf, other

    stat.ML

    Harmonic Exponential Families on Manifolds

    Authors: Taco S. Cohen, Max Welling

    Abstract: In a range of fields including the geosciences, molecular biology, robotics and computer vision, one encounters problems that involve random variables on manifolds. Currently, there is a lack of flexible probabilistic models on manifolds that are fast and easy to train. We define an extremely flexible class of exponential family distributions on manifolds such as the torus, sphere, and rotation gr… ▽ More

    Submitted 20 May, 2015; v1 submitted 17 May, 2015; originally announced May 2015.

    Comments: fixed typo

    Journal ref: Proceedings of the International Conference on Machine Learning, 2015

  41. arXiv:1311.6257  [pdf, other

    q-fin.CP math.PR q-fin.TR stat.OT

    Filters and smoothers for self-exciting Markov modulated counting processes

    Authors: Samuel N. Cohen, Robert J. Elliott

    Abstract: We consider a self-exciting counting process, the parameters of which depend on a hidden finite-state Markov chain. We derive the optimal filter and smoother for the hidden chain based on observation of the jump process. This filter is in closed form and is finite dimensional. We demonstrate the performance of this filter both with simulated data, and by analysing the `flash crash' of 6th May 2010… ▽ More

    Submitted 25 November, 2013; originally announced November 2013.

    MSC Class: 62M05; 60G55; 60J28; 91G70