Skip to main content

Showing 1–33 of 33 results for author: Chorowski, J

.
  1. arXiv:2307.13116  [pdf, other

    cs.LG cs.AI cs.DC

    Pathway: a fast and flexible unified stream data processing framework for analytical and Machine Learning applications

    Authors: Michal Bartoszkiewicz, Jan Chorowski, Adrian Kosowski, Jakub Kowalski, Sergey Kulik, Mateusz Lewandowski, Krzysztof Nowicki, Kamil Piechowiak, Olivier Ruas, Zuzanna Stamirowska, Przemyslaw Uznanski

    Abstract: We present Pathway, a new unified data processing framework that can run workloads on both bounded and unbounded data streams. The framework was created with the original motivation of resolving challenges faced when analyzing and processing data from the physical economy, including streams of data generated by IoT and enterprise systems. These required rapid reaction while calling for the applica… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  2. Efficient Transformers with Dynamic Token Pooling

    Authors: Piotr Nawrot, Jan Chorowski, Adrian Łańcucki, Edoardo M. Ponti

    Abstract: Transformers achieve unrivalled performance in modelling language, but remain inefficient in terms of memory and time complexity. A possible remedy is to reduce the sequence length in the intermediate layers by pooling fixed-length segments of tokens. Nevertheless, natural units of meaning, such as words or phrases, display varying sizes. To address this mismatch, we equip language models with a d… ▽ More

    Submitted 24 May, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Journal ref: Proceedings of the 61st (Toronto 2023) Annual Meeting of the Association for Computational Linguistics (Volume 1 Long Papers) Pages 6403 to 6417

  3. arXiv:2206.02211  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.NE eess.AS

    Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

    Authors: Santiago Cuervo, Adrian Łańcucki, Ricard Marxer, Paweł Rychlikowski, Jan Chorowski

    Abstract: The success of deep learning comes from its ability to capture the hierarchical structure of data by learning high-level representations defined in terms of low-level ones. In this paper we explore self-supervised learning of hierarchical representations of speech by applying multiple levels of Contrastive Predictive Coding (CPC). We observe that simply stacking two CPC models does not yield signi… ▽ More

    Submitted 4 December, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: Accepted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

    Journal ref: Advances in Neural Information Processing Systems, 2022

  4. arXiv:2110.15909  [pdf, other

    cs.LG cs.SD eess.AS

    Contrastive prediction strategies for unsupervised segmentation and categorization of phonemes and words

    Authors: Santiago Cuervo, Maciej Grabias, Jan Chorowski, Grzegorz Ciesielski, Adrian Łańcucki, Paweł Rychlikowski, Ricard Marxer

    Abstract: We investigate the performance on phoneme categorization and phoneme and word segmentation of several self-supervised learning (SSL) methods based on Contrastive Predictive Coding (CPC). Our experiments show that with the existing algorithms there is a trade off between categorization and segmentation performance. We investigate the source of this conflict and conclude that the use of context buil… ▽ More

    Submitted 25 February, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

  5. arXiv:2106.11603  [pdf, ps, other

    cs.LG cs.SD eess.AS

    Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw

    Authors: Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski

    Abstract: We present a number of low-resource approaches to the tasks of the Zero Resource Speech Challenge 2021. We build on the unsupervised representations of speech proposed by the organizers as a baseline, derived from CPC and clustered with the k-means algorithm. We demonstrate that simple methods of refining those representations can narrow the gap, or even improve upon the solutions which use a high… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: Published in Interspeech 2021

  6. arXiv:2104.11946  [pdf, other

    cs.LG cs.SD eess.AS

    Aligned Contrastive Predictive Coding

    Authors: Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski

    Abstract: We investigate the possibility of forcing a self-supervised model trained using a contrastive predictive loss to extract slowly varying latent representations. Rather than producing individual predictions for each of the future representations, the model emits a sequence of predictions shorter than that of the upcoming representations to which they will be aligned. In this way, the prediction netw… ▽ More

    Submitted 22 June, 2021; v1 submitted 24 April, 2021; originally announced April 2021.

    Comments: Published in Interspeech 2021

  7. arXiv:2010.11087  [pdf, other

    cs.CV cs.LG

    Representing Point Clouds with Generative Conditional Invertible Flow Networks

    Authors: Michał Stypułkowski, Kacper Kania, Maciej Zamorski, Maciej Zięba, Tomasz Trzciński, Jan Chorowski

    Abstract: In this paper, we propose a simple yet effective method to represent point clouds as sets of samples drawn from a cloud-specific probability distribution. This interpretation matches intrinsic characteristics of point clouds: the number of points and their ordering within a cloud is not important as all points are drawn from the proximity of the object boundary. We postulate to represent each clou… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  8. arXiv:2006.02547  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning

    Authors: Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James Glass

    Abstract: Probabilistic Latent Variable Models (LVMs) provide an alternative to self-supervised learning approaches for linguistic representation learning from speech. LVMs admit an intuitive probabilistic interpretation where the latent structure shapes the information extracted from the signal. Even though LVMs have recently seen a renewed interest due to the introduction of Variational Autoencoders (VAEs… ▽ More

    Submitted 8 September, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Proceedings of Interspeech, 2020

  9. arXiv:2005.08520  [pdf, other

    cs.LG cs.CL stat.ML

    Robust Training of Vector Quantized Bottleneck Models

    Authors: Adrian Łańcucki, Jan Chorowski, Guillaume Sanchez, Ricard Marxer, Nanxin Chen, Hans J. G. A. Dolfing, Sameer Khurana, Tanel Alumäe, Antoine Laurent

    Abstract: In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs). Discrete latent variable models have been shown to learn nontrivial representations of speech, applicable to unsupervised voice conversion and reaching state-of-the-art performance on unit discovery tasks. For unsupervised representat… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: Published at IJCNN 2020

  10. arXiv:1910.07344  [pdf, other

    cs.LG cs.CV stat.ML

    Conditional Invertible Flow for Point Cloud Generation

    Authors: Michał Stypułkowski, Maciej Zamorski, Maciej Zięba, Jan Chorowski

    Abstract: This paper focuses on a novel generative approach for 3D point clouds that makes use of invertible flow-based models. The main idea of the method is to treat a point cloud as a probability density in 3D space that is modeled using a cloud-specific neural network. To capture the similarity between point clouds we rely on parameter sharing among networks, with each cloud having only a small embeddin… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: Published in Sets & Partitions Workshop at NeurIPS 2019 (https://www.sets.parts/)

  11. arXiv:1902.08295  [pdf, other

    cs.LG stat.ML

    Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

    Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob , et al. (66 additional authors not shown)

    Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

  12. arXiv:1901.08810  [pdf, other

    cs.LG eess.AS stat.ML

    Unsupervised speech representation learning using WaveNet autoencoders

    Authors: Jan Chorowski, Ron J. Weiss, Samy Bengio, Aäron van den Oord

    Abstract: We consider the task of unsupervised extraction of meaningful latent representations of speech by applying autoencoding neural networks to speech waveforms. The goal is to learn a representation able to capture high level semantic content from the signal, e.g.\ phoneme identities, while being invariant to confounding low level details in the signal such as the underlying pitch contour or backgroun… ▽ More

    Submitted 11 September, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Accepted to IEEE TASLP, final version available at http://dx.doi.org/10.1109/TASLP.2019.2938863

  13. arXiv:1901.04379  [pdf, other

    cs.CL

    Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees

    Authors: Jan Chorowski, Adrian Lancucki, Bartosz Kostka, Michal Zapotoczny

    Abstract: Deep neural acoustic models benefit from context-dependent (CD) modeling of output symbols. We consider direct training of CTC networks with CD outputs, and identify two issues. The first one is frame-level normalization of probabilities in CTC, which induces strong language modeling behavior that leads to overfitting and interference with external language models. The second one is poor generaliz… ▽ More

    Submitted 23 April, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

  14. arXiv:1808.01160  [pdf, other

    cs.CL

    Efficient Purely Convolutional Text Encoding

    Authors: Szymon Malik, Adrian Lancucki, Jan Chorowski

    Abstract: In this work, we focus on a lightweight convolutional architecture that creates fixed-size vector embeddings of sentences. Such representations are useful for building NLP systems, including conversational agents. Our work derives from a recently proposed recursive convolutional architecture for auto-encoding text paragraphs at byte level. We propose alternations that significantly reduce training… ▽ More

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: As accepted to: LaCATODA Workshop, ICML 2018

  15. arXiv:1805.08032  [pdf, ps, other

    cs.CL

    A Talker Ensemble: the University of Wrocław's Entry to the NIPS 2017 Conversational Intelligence Challenge

    Authors: Jan Chorowski, Adrian Łańcucki, Szymon Malik, Maciej Pawlikowski, Paweł Rychlikowski, Paweł Zykowski

    Abstract: We present Poetwannabe, a chatbot submitted by the University of Wrocław to the NIPS 2017 Conversational Intelligence Challenge, in which it ranked first ex-aequo. It is able to conduct a conversation with a user in a natural language. The primary functionality of our dialogue system is context-aware question answering (QA), while its secondary function is maintaining user engagement. The chatbot… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: To appear in NIPS 2017 Competition track Springer Proceedings

  16. arXiv:1712.08363  [pdf, other

    cs.SD eess.AS stat.ML

    On Using Backpropagation for Speech Texture Generation and Voice Conversion

    Authors: Jan Chorowski, Ron J. Weiss, Rif A. Saurous, Samy Bengio

    Abstract: Inspired by recent work on neural network image generation which rely on backpropagation towards the network inputs, we present a proof-of-concept system for speech texture synthesis and voice conversion based on two mechanisms: approximate inversion of the representation learned by a speech recognition neural network, and on matching statistics of neuron activations between different source and t… ▽ More

    Submitted 8 March, 2018; v1 submitted 22 December, 2017; originally announced December 2017.

    Comments: Accepted to ICASSP 2018

  17. arXiv:1712.01769  [pdf, other

    cs.CL cs.SD eess.AS stat.ML

    State-of-the-art Speech Recognition With Sequence-to-Sequence Models

    Authors: Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani

    Abstract: Attention-based encoder-decoder architectures such as Listen, Attend, and Spell (LAS), subsume the acoustic, pronunciation and language model components of a traditional automatic speech recognition (ASR) system into a single neural network. In previous work, we have shown that such architectures are comparable to state-of-theart ASR systems on dictation tasks, but it was not clear if such archite… ▽ More

    Submitted 23 February, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

    Comments: ICASSP camera-ready version

  18. arXiv:1705.10209  [pdf, other

    cs.CL cs.LG cs.NE

    On Multilingual Training of Neural Dependency Parsers

    Authors: Michał Zapotoczny, Paweł Rychlikowski, Jan Chorowski

    Abstract: We show that a recently proposed neural dependency parser can be improved by joint training on multiple languages from the same family. The parser is implemented as a deep neural network whose only input is orthographic representations of words. In order to successfully parse, the network has to discover how linguistically relevant concepts can be inferred from word spellings. We analyze the repre… ▽ More

    Submitted 29 May, 2017; originally announced May 2017.

    Comments: preprint accepted into the TSD2017

  19. arXiv:1703.08581  [pdf, other

    cs.CL cs.LG stat.ML

    Sequence-to-Sequence Models Can Directly Translate Foreign Speech

    Authors: Ron J. Weiss, Jan Chorowski, Navdeep Jaitly, Yonghui Wu, Zhifeng Chen

    Abstract: We present a recurrent encoder-decoder deep neural network architecture that directly translates speech in one language into text in another. The model does not explicitly transcribe the speech into text in the source language, nor does it require supervision from the ground truth source language transcription during training. We apply a slightly modified sequence-to-sequence with attention archit… ▽ More

    Submitted 12 June, 2017; v1 submitted 24 March, 2017; originally announced March 2017.

    Comments: 5 pages, 1 figure. Interspeech 2017

  20. arXiv:1701.06548  [pdf, other

    cs.NE cs.LG

    Regularizing Neural Networks by Penalizing Confident Output Distributions

    Authors: Gabriel Pereyra, George Tucker, Jan Chorowski, Łukasz Kaiser, Geoffrey Hinton

    Abstract: We systematically explore regularizing neural networks by penalizing low entropy output distributions. We show that penalizing low entropy output distributions, which has been shown to improve exploration in reinforcement learning, acts as a strong regularizer in supervised learning. Furthermore, we connect a maximum entropy based confidence penalty to label smoothing through the direction of the… ▽ More

    Submitted 23 January, 2017; originally announced January 2017.

    Comments: Submitted to ICLR 2017

  21. arXiv:1612.02695  [pdf, other

    cs.NE cs.CL cs.LG stat.ML

    Towards better decoding and language model integration in sequence to sequence models

    Authors: Jan Chorowski, Navdeep Jaitly

    Abstract: The recently proposed Sequence-to-Sequence (seq2seq) framework advocates replacing complex data processing pipelines, such as an entire automatic speech recognition system, with a single neural network trained in an end-to-end fashion. In this contribution, we analyse an attention-based seq2seq speech recognition system that directly transcribes recordings into characters. We observe two shortcomi… ▽ More

    Submitted 8 December, 2016; originally announced December 2016.

  22. arXiv:1611.09434  [pdf, other

    cs.AI cs.CL cs.LG cs.NE

    Input Switched Affine Networks: An RNN Architecture Designed for Interpretability

    Authors: Jakob N. Foerster, Justin Gilmer, Jan Chorowski, Jascha Sohl-Dickstein, David Sussillo

    Abstract: There exist many problem domains where the interpretability of neural network models is essential for deployment. Here we introduce a recurrent architecture composed of input-switched affine transformations - in other words an RNN without any explicit nonlinearities, but with input-dependent recurrent weights. This simple form allows the RNN to be analyzed via straightforward linear methods: we ca… ▽ More

    Submitted 12 June, 2017; v1 submitted 28 November, 2016; originally announced November 2016.

    Comments: ICLR 2107 submission: https://openreview.net/forum?id=H1MjAnqxg

  23. arXiv:1610.05225  [pdf, ps, other

    math.PR math.ST

    Estimation error for occupation time functionals of stationary Markov processes

    Authors: Randolf Altmeyer, Jakub Chorowski

    Abstract: The approximation of integral functionals with respect to a stationary Markov process by a Riemann-sum estimator is studied. Stationarity and the functional calculus of the infinitesimal generator of the process are used to get a better understanding of the estimation error and to prove a general error bound. The presented approach admits general integrands and gives a unifying explanation for dif… ▽ More

    Submitted 17 October, 2016; originally announced October 2016.

    MSC Class: Primary 62M05; Secondary 60J55; 60J35

  24. arXiv:1609.03441  [pdf, other

    cs.CL

    Read, Tag, and Parse All at Once, or Fully-neural Dependency Parsing

    Authors: Jan Chorowski, Michał Zapotoczny, Paweł Rychlikowski

    Abstract: We present a dependency parser implemented as a single deep neural network that reads orthographic representations of words and directly generates dependencies and their labels. Unlike typical approaches to parsing, the model doesn't require part-of-speech (POS) tagging of the sentences. With proper regularization and additional supervision achieved with multitask learning we reach state-of-the-ar… ▽ More

    Submitted 5 June, 2017; v1 submitted 12 September, 2016; originally announced September 2016.

  25. arXiv:1605.02688  [pdf, other

    cs.SC cs.LG cs.MS

    Theano: A Python framework for fast computation of mathematical expressions

    Authors: The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano , et al. (88 additional authors not shown)

    Abstract: Theano is a Python library that allows to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements. Theano is being actively and continuously developed since 2008, mu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 19 pages, 5 figures

  26. arXiv:1511.06456  [pdf, other

    cs.LG

    Task Loss Estimation for Sequence Prediction

    Authors: Dzmitry Bahdanau, Dmitriy Serdyuk, Philémon Brakel, Nan Rosemary Ke, Jan Chorowski, Aaron Courville, Yoshua Bengio

    Abstract: Often, the performance on a supervised machine learning task is evaluated with a emph{task loss} function that cannot be optimized directly. Examples of such loss functions include the classification error, the edit distance and the BLEU score. A common workaround for this problem is to instead optimize a emph{surrogate loss} function, such as for instance cross-entropy or hinge loss. In order for… ▽ More

    Submitted 19 January, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: Submitted to ICLR 2016

  27. arXiv:1508.04395  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    End-to-End Attention-based Large Vocabulary Speech Recognition

    Authors: Dzmitry Bahdanau, Jan Chorowski, Dmitriy Serdyuk, Philemon Brakel, Yoshua Bengio

    Abstract: Many of the current state-of-the-art Large Vocabulary Continuous Speech Recognition Systems (LVCSR) are hybrids of neural networks and Hidden Markov Models (HMMs). Most of these systems contain separate components that deal with the acoustic modelling, language modelling and sequence decoding. We investigate a more direct approach in which the HMM is replaced with a Recurrent Neural Network (RNN)… ▽ More

    Submitted 14 March, 2016; v1 submitted 18 August, 2015; originally announced August 2015.

  28. arXiv:1507.07139  [pdf, ps, other

    stat.AP math.PR

    Nonparametric volatility estimation in scalar diffusions: Optimality across observation frequencies

    Authors: Jakub Chorowski

    Abstract: The nonparametric volatility estimation problem of a scalar diffusion process observed at equidistant time points is addressed. Using the spectral representation of the volatility in terms of the invariant density and an eigenpair of the infinitesimal generator the first known estimator that attains the minimax optimal convergence rates for both high and low-frequency observations is constructed.… ▽ More

    Submitted 31 March, 2016; v1 submitted 25 July, 2015; originally announced July 2015.

    Comments: 41 pages, 1 figure

    MSC Class: Primary 62M05; secondary 62G99; 62M15; 60J60

  29. arXiv:1506.07503  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Attention-Based Models for Speech Recognition

    Authors: Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, Yoshua Bengio

    Abstract: Recurrent sequence generators conditioned on input data through an attention mechanism have recently shown very good performance on a range of tasks in- cluding machine translation, handwriting synthesis and image caption gen- eration. We extend the attention-mechanism with features needed for speech recognition. We show that while an adaptation of the model used for machine translation in reaches… ▽ More

    Submitted 24 June, 2015; originally announced June 2015.

  30. arXiv:1506.00619  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Blocks and Fuel: Frameworks for deep learning

    Authors: Bart van Merriënboer, Dzmitry Bahdanau, Vincent Dumoulin, Dmitriy Serdyuk, David Warde-Farley, Jan Chorowski, Yoshua Bengio

    Abstract: We introduce two Python frameworks to train neural networks on large datasets: Blocks and Fuel. Blocks is based on Theano, a linear algebra compiler with CUDA-support. It facilitates the training of complex neural network models by providing parametrized Theano operations, attaching metadata to Theano's symbolic computational graph, and providing an extensive set of utilities to assist training th… ▽ More

    Submitted 1 June, 2015; originally announced June 2015.

  31. arXiv:1503.00466  [pdf, ps, other

    math.ST math.PR stat.ME

    Spectral estimation for diffusions with random sampling times

    Authors: Jakub Chorowski, Mathias Trabs

    Abstract: The nonparametric estimation of the volatility and the drift coefficient of a scalar diffusion is studied when the process is observed at random time points. The constructed estimator generalizes the spectral method by Gobet, Hoffmann and Reiß [Ann. Statist. 32 (2006), 2223-2253]. The estimation procedure is optimal in the minimax sense and adaptive with respect to the sampling time distribution a… ▽ More

    Submitted 17 December, 2015; v1 submitted 2 March, 2015; originally announced March 2015.

    Comments: 30 pages, 2 figures

    MSC Class: Primary 62M05; Secondary 60J60; 62G99; 62M15

    Journal ref: Stochastic Processes and their Applications, 126 (10), 2976-3008, 2016

  32. arXiv:1412.1602  [pdf, other

    cs.NE cs.LG stat.ML

    End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

    Authors: Jan Chorowski, Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio

    Abstract: We replace the Hidden Markov Model (HMM) which is traditionally used in in continuous speech recognition with a bi-directional recurrent neural network encoder coupled to a recurrent neural network decoder that directly emits a stream of phonemes. The alignment between the input and output sequences is established using an attention mechanism: the decoder emits each symbol based on a context creat… ▽ More

    Submitted 4 December, 2014; originally announced December 2014.

    Comments: As accepted to: Deep Learning and Representation Learning Workshop, NIPS 2014

  33. arXiv:1109.0182  [pdf, ps, other

    math.PR

    Hitting half-spaces or spheres by the Ornstein-Uhlenbeck type diffusions

    Authors: Tomasz Byczkowski, Jakub Chorowski, Piotr Graczyk, Jacek Malecki

    Abstract: The purpose of the paper is to provide a general method for computing hitting distributions of some regular subsets D for Ornstein-Uhlenbeck type operators of the form 1/2Δ+ F\cdot\nabla, with F bounded and orthogonal to the boundary of D. As an important application we obtain integral representations of the Poisson kernel for a half-space and balls for hyperbolic Brownian motion and for the class… ▽ More

    Submitted 3 November, 2011; v1 submitted 1 September, 2011; originally announced September 2011.

    Comments: 22 pages

    MSC Class: 60J45; 60G15; 60G40