Showing 1–37 of 37 results for author: Graves, A

  1. arXiv:2308.07037  [pdf, other]

    cs.LG cs.AI

    Bayesian Flow Networks

    Authors: Alex Graves, Rupesh Kumar Srivastava, Timothy Atkinson, Faustino Gomez

    Abstract: This paper introduces Bayesian Flow Networks (BFNs), a new class of generative model in which the parameters of a set of independent distributions are modified with Bayesian inference in the light of noisy data samples, then passed as input to a neural network that outputs a second, interdependent distribution. Starting from a simple prior and iteratively updating the two distributions yields a ge…

    Submitted 3 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

  2. Jammed solids with pins: Thresholds, Force networks and Elasticity

    Authors: Andy L. Zhang, Sean A. Ridout, Celia Parts, Aarushi Sachdeva, Cacey S. Bester, Katharina Vollmayr-Lee, Brian C. Utter, Ted Brzinski, Amy L. Graves

    Abstract: The role of fixed degrees of freedom in soft/granular matter systems has broad applicability and theoretical interest. Here we address questions of the geometrical role that a scaffolding of fixed particles plays in tuning the threshold volume fraction and force network in the vicinity of jamming. Our 2d simulated system consists of soft particles and fixed "pins", both of which harmonically repel…

    Submitted 25 August, 2022; v1 submitted 29 May, 2022; originally announced May 2022.

    Comments: 13 pages, 15 figures

  3. arXiv:2006.07232  [pdf, other]

    cs.LG cs.NE stat.ML

    A Practical Sparse Approximation for Real Time Recurrent Learning

    Authors: Jacob Menick, Erich Elsen, Utku Evci, Simon Osindero, Karen Simonyan, Alex Graves

    Abstract: Current methods for training recurrent neural networks are based on backpropagation through time, which requires storing a complete history of network states, and prohibits updating the weights 'online' (after every timestep). Real Time Recurrent Learning (RTRL) eliminates the need for history storage and allows for online weight updates, but does so at the expense of computational costs that are…

    Submitted 12 June, 2020; originally announced June 2020.

  4. arXiv:2004.04792  [pdf, other]

    cond-mat.soft cond-mat.dis-nn cond-mat.mtrl-sci

    Structured randomness: Jamming of soft discs and pins

    Authors: Prairie Wentworth-Nice, Sean A. Ridout, Brian Jenike, Ari Liloia, Amy L. Graves

    Abstract: Simulations are used to find the zero temperature jamming threshold, $φ_j$, for soft, bidisperse disks in the presence of small fixed particles, or "pins", arranged in a lattice. The presence of pins leads, as one expects, to a decrease in $φ_j$. Structural properties of the system near the jamming threshold are calculated as a function of the pin density. While the correlation length exponent rem…

    Submitted 29 April, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: 9 pages, 11 figures, 1 table; This is v2 of an article, revised thanks to peer review

  5. arXiv:2004.04237  [pdf]

    physics.ed-ph physics.comp-ph

    Hitting the Ground Running: Computational physics education to prepare students for computational physics research

    Authors: Amy Lisa Graves, Adam D. Light

    Abstract: Momentum exists in the physics community for integrating computation into the undergraduate curriculum. One of many benefits would be preparation for computational research. Our investigation poses the question of which computational skills might be best learned in the curriculum (prior to research) versus during research. Based on a survey of computational physicists, we present evidence that man…

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: 15 pages, 3 figures, to appear in CiSE (2020)

  6. arXiv:1910.11135  [pdf, other]

    nucl-ex

    Proton-induced reactions on Fe, Cu, & Ti from threshold to 55 MeV

    Authors: Andrew S. Voyles, Amanda M. Lewis, Jonathan T. Morrell, M. Shamsuzzoha Basunia, Lee A. Bernstein, Jonathan W. Engle, Stephen A. Graves, Eric F. Matthews

    Abstract: Theoretical models often differ significantly from measured data in their predictions of the magnitude of nuclear reactions that produce radionuclides for medical, research, and national security applications. In this paper, we compare a priori predictions from several state-of-the-art reaction modeling packages (CoH, EMPIRE, TALYS, and ALICE) to cross sections measured using the stacked-target ac…

    Submitted 22 October, 2019; originally announced October 2019.

    Comments: Submitted to Phys Rev C, 25 pages, 41 figures. arXiv admin note: text overlap with arXiv:1804.06548

  7. Excitation functions for (p,x) reactions of niobium in the energy range of E$_{\text{p}}$ = 40-90 MeV

    Authors: Andrew S. Voyles, Lee A. Bernstein, Eva R. Birnbaum, Jonathan W. Engle, Stephen A. Graves, Toshihiko Kawano, Amanda M. Lewis, Francois M. Nortier

    Abstract: A stack of thin Nb foils was irradiated with the 100 MeV proton beam at Los Alamos National Laboratory's Isotope Production Facility, to investigate the $^{93}$Nb(p,4n)$^{90}$Mo nuclear reaction as a monitor for intermediate energy proton experiments and to benchmark state-of-the-art reaction model codes. A set of 38 measured cross sections for $^{\text{nat}}$Nb(p,x) and $^{\text{nat}}$Cu(p,x) rea…

    Submitted 21 June, 2018; v1 submitted 18 April, 2018; originally announced April 2018.

    Comments: 34 pages, submitted to NIM-B

    Report number: LA-UR-18-22980

    Journal ref: Nuclear Instruments and Methods in Physics Research B, 429 (2018) 53-74

  8. arXiv:1804.02476  [pdf, other]

    cs.NE cs.LG stat.ML

    Associative Compression Networks for Representation Learning

    Authors: Alex Graves, Jacob Menick, Aaron van den Oord

    Abstract: This paper introduces Associative Compression Networks (ACNs), a new framework for variational autoencoding with neural networks. The system differs from existing variational autoencoders (VAEs) in that the prior distribution used to model each code is conditioned on a similar code from the dataset. In compression terms this equates to sequentially transmitting the dataset using an ordering determ…

    Submitted 26 April, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

    Comments: Revised to clarify difference between ACN and IID loss

  9. arXiv:1804.01756  [pdf, other]

    stat.ML cs.AI cs.LG cs.NE

    The Kanerva Machine: A Generative Distributed Memory

    Authors: Yan Wu, Greg Wayne, Alex Graves, Timothy Lillicrap

    Abstract: We present an end-to-end trained memory system that quickly adapts to new data and generates samples like them. Inspired by Kanerva's sparse distributed memory, it has a robust distributed reading and writing mechanism. The memory is analytically tractable, which enables optimal on-line compression via a Bayesian update-rule. We formulate it as a hierarchical conditional generative model, where me…

    Submitted 18 June, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: Published as a conference paper at ICLR 2018 (corrected typos in revision)

  10. arXiv:1711.10433  [pdf, other]

    cs.LG

    Parallel WaveNet: Fast High-Fidelity Speech Synthesis

    Authors: Aaron van den Oord, Yazhe Li, Igor Babuschkin, Karen Simonyan, Oriol Vinyals, Koray Kavukcuoglu, George van den Driessche, Edward Lockhart, Luis C. Cobo, Florian Stimberg, Norman Casagrande, Dominik Grewe, Seb Noury, Sander Dieleman, Erich Elsen, Nal Kalchbrenner, Heiga Zen, Alex Graves, Helen King, Tom Walters, Dan Belov, Demis Hassabis

    Abstract: The recently-developed WaveNet architecture is the current state of the art in realistic speech synthesis, consistently rated as more natural sounding for many different languages than any previous system. However, because WaveNet relies on sequential generation of one audio sample at a time, it is poorly suited to today's massively parallel computers, and therefore hard to deploy in a real-time p…

    Submitted 28 November, 2017; originally announced November 2017.

  11. arXiv:1706.10295  [pdf, other]

    cs.LG stat.ML

    Noisy Networks for Exploration

    Authors: Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian Osband, Alex Graves, Vlad Mnih, Remi Munos, Demis Hassabis, Olivier Pietquin, Charles Blundell, Shane Legg

    Abstract: We introduce NoisyNet, a deep reinforcement learning agent with parametric noise added to its weights, and show that the induced stochasticity of the agent's policy can be used to aid efficient exploration. The parameters of the noise are learned with gradient descent along with the remaining network weights. NoisyNet is straightforward to implement and adds little computational overhead. We find…

    Submitted 9 July, 2019; v1 submitted 30 June, 2017; originally announced June 2017.

    Comments: ICLR 2018

  12. arXiv:1705.09636  [pdf]

    physics.ed-ph physics.soc-ph

    Swimming against the tide: Gender bias in the physics classroom

    Authors: Amy L. Graves, Etsuko Hoshino-Browne, Kristine P. H. Lui

    Abstract: This study examines physics students' evaluations of identical, video-recorded lectures performed by female and male actors playing the role of professors. The results indicate that evaluations by male students show statistically significant overall biases with male professors rated more positively than female professors. Female students tended to be egalitarian, except in two areas. Female studen…

    Submitted 1 June, 2017; v1 submitted 26 May, 2017; originally announced May 2017.

    Comments: 4 figures, 4 tables, one Appendix with table. Appears in "Journal of Women and Minorities in Science and Engineering" (2017)

  13. arXiv:1704.03003  [pdf, other]

    cs.NE

    Automated Curriculum Learning for Neural Networks

    Authors: Alex Graves, Marc G. Bellemare, Jacob Menick, Remi Munos, Koray Kavukcuoglu

    Abstract: We introduce a method for automatically selecting the path, or syllabus, that a neural network follows through a curriculum so as to maximise learning efficiency. A measure of the amount that the network learns from each data sample is provided as a reward signal to a nonstationary multi-armed bandit algorithm, which then determines a stochastic syllabus. We consider a range of signals derived fro…

    Submitted 10 April, 2017; originally announced April 2017.

  14. arXiv:1610.10099  [pdf, other]

    cs.CL cs.LG

    Neural Machine Translation in Linear Time

    Authors: Nal Kalchbrenner, Lasse Espeholt, Karen Simonyan, Aaron van den Oord, Alex Graves, Koray Kavukcuoglu

    Abstract: We present a novel neural network for processing sequences. The ByteNet is a one-dimensional convolutional neural network that is composed of two parts, one to encode the source sequence and the other to decode the target sequence. The two network parts are connected by stacking the decoder on top of the encoder and preserving the temporal resolution of the sequences. To address the differing leng…

    Submitted 15 March, 2017; v1 submitted 31 October, 2016; originally announced October 2016.

    Comments: 9 pages

  15. arXiv:1610.09027  [pdf, other]

    cs.LG

    Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

    Authors: Jack W Rae, Jonathan J Hunt, Tim Harley, Ivo Danihelka, Andrew Senior, Greg Wayne, Alex Graves, Timothy P Lillicrap

    Abstract: Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows, limiting their applicability to real-world domains. Here, we present an end-to-end differentiable memory…

    Submitted 27 October, 2016; originally announced October 2016.

    Comments: in 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain

  16. arXiv:1610.00527  [pdf, other]

    cs.CV cs.LG

    Video Pixel Networks

    Authors: Nal Kalchbrenner, Aaron van den Oord, Karen Simonyan, Ivo Danihelka, Oriol Vinyals, Alex Graves, Koray Kavukcuoglu

    Abstract: We propose a probabilistic video model, the Video Pixel Network (VPN), that estimates the discrete joint distribution of the raw pixel values in a video. The model and the neural architecture reflect the time, space and color structure of video tensors and encode it as a four-dimensional dependency chain. The VPN approaches the best possible performance on the Moving MNIST benchmark, a leap over t…

    Submitted 3 October, 2016; originally announced October 2016.

    Comments: 16 pages

  17. arXiv:1609.03499  [pdf, other]

    cs.SD cs.LG

    WaveNet: A Generative Model for Raw Audio

    Authors: Aaron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, Koray Kavukcuoglu

    Abstract: This paper introduces WaveNet, a deep neural network for generating raw audio waveforms. The model is fully probabilistic and autoregressive, with the predictive distribution for each audio sample conditioned on all previous ones; nonetheless we show that it can be efficiently trained on data with tens of thousands of samples per second of audio. When applied to text-to-speech, it yields state-of-…

    Submitted 19 September, 2016; v1 submitted 12 September, 2016; originally announced September 2016.

  18. arXiv:1608.05343  [pdf, other]

    cs.LG

    Decoupled Neural Interfaces using Synthetic Gradients

    Authors: Max Jaderberg, Wojciech Marian Czarnecki, Simon Osindero, Oriol Vinyals, Alex Graves, David Silver, Koray Kavukcuoglu

    Abstract: Training directed neural networks typically requires forward-propagating data through a computation graph, followed by backpropagating error signal, to produce weight updates. All layers, or more generally, modules, of the network are therefore locked, in the sense that they must wait for the remainder of the network to execute forwards and propagate error backwards before they can be updated. In…

    Submitted 3 July, 2017; v1 submitted 18 August, 2016; originally announced August 2016.

  19. arXiv:1607.05690  [pdf, ps, other]

    cs.NE

    Stochastic Backpropagation through Mixture Density Distributions

    Authors: Alex Graves

    Abstract: The ability to backpropagate stochastic gradients through continuous latent distributions has been crucial to the emergence of variational autoencoders and stochastic gradient variational Bayes. The key ingredient is an unbiased and low-variance way of estimating gradients with respect to distribution parameters from gradients evaluated at distribution samples. The "reparameterization trick" provi…

    Submitted 19 July, 2016; originally announced July 2016.

  20. arXiv:1606.05328  [pdf, other]

    cs.CV cs.LG

    Conditional Image Generation with PixelCNN Decoders

    Authors: Aaron van den Oord, Nal Kalchbrenner, Oriol Vinyals, Lasse Espeholt, Alex Graves, Koray Kavukcuoglu

    Abstract: This work explores conditional image generation with a new image density model based on the PixelCNN architecture. The model can be conditioned on any vector, including descriptive labels or tags, or latent embeddings created by other networks. When conditioned on class labels from the ImageNet database, the model is able to generate diverse, realistic scenes representing distinct animals, objects…

    Submitted 18 June, 2016; v1 submitted 16 June, 2016; originally announced June 2016.

  21. arXiv:1606.04695  [pdf, other]

    cs.AI cs.LG

    Strategic Attentive Writer for Learning Macro-Actions

    Authors: Alexander Vezhnevets, Volodymyr Mnih, John Agapiou, Simon Osindero, Alex Graves, Oriol Vinyals, Koray Kavukcuoglu

    Abstract: We present a novel deep recurrent neural network architecture that learns to build implicit plans in an end-to-end manner by purely interacting with an environment in reinforcement learning setting. The network builds an internal plan, which is continuously updated upon observation of the next input from the environment. It can also partition this internal representation into contiguous sub-seque…

    Submitted 15 June, 2016; originally announced June 2016.

  22. arXiv:1606.03401  [pdf, other]

    cs.NE cs.LG

    Memory-Efficient Backpropagation Through Time

    Authors: Audrūnas Gruslys, Remi Munos, Ivo Danihelka, Marc Lanctot, Alex Graves

    Abstract: We propose a novel approach to reduce memory consumption of the backpropagation through time (BPTT) algorithm when training recurrent neural networks (RNNs). Our approach uses dynamic programming to balance a trade-off between caching of intermediate results and recomputation. The algorithm is capable of tightly fitting within almost any user-set memory budget while finding an optimal execution po…

    Submitted 10 June, 2016; originally announced June 2016.

  23. arXiv:1603.08983  [pdf, other]

    cs.NE

    Adaptive Computation Time for Recurrent Neural Networks

    Authors: Alex Graves

    Abstract: This paper introduces Adaptive Computation Time (ACT), an algorithm that allows recurrent neural networks to learn how many computational steps to take between receiving an input and emitting an output. ACT requires minimal changes to the network architecture, is deterministic and differentiable, and does not add any noise to the parameter gradients. Experimental results are provided for four synt…

    Submitted 21 February, 2017; v1 submitted 29 March, 2016; originally announced March 2016.

  24. arXiv:1602.03032  [pdf, other]

    cs.NE

    Associative Long Short-Term Memory

    Authors: Ivo Danihelka, Greg Wayne, Benigno Uria, Nal Kalchbrenner, Alex Graves

    Abstract: We investigate a new method to augment recurrent neural networks with extra memory without increasing the number of network parameters. The system has an associative memory based on complex-valued vectors and is closely related to Holographic Reduced Representations and Long Short-Term Memory networks. Holographic Reduced Representations have limited capacity: as they store more information, each…

    Submitted 19 May, 2016; v1 submitted 9 February, 2016; originally announced February 2016.

    Comments: ICML-2016

  25. arXiv:1602.01783  [pdf, other]

    cs.LG

    Asynchronous Methods for Deep Reinforcement Learning

    Authors: Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu

    Abstract: We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers. We present asynchronous variants of four standard reinforcement learning algorithms and show that parallel actor-learners have a stabilizing effect on training allowing all four methods to successfully train neural n…

    Submitted 16 June, 2016; v1 submitted 4 February, 2016; originally announced February 2016.

    Journal ref: ICML 2016

  26. arXiv:1509.00806  [pdf, other]

    cond-mat.soft cond-mat.stat-mech

    Pinning Susceptibility: The effect of dilute, quenched disorder on jamming

    Authors: Amy L. Graves, Samer Nashed, Elliot Padgett, Carl P. Goodrich, Andrea J. Liu, James P. Sethna

    Abstract: We study the effect of dilute pinning on the jamming transition. Pinning reduces the average contact number needed to jam unpinned particles and shifts the jamming threshold to lower densities, leading to a pinning susceptibility, $χ_p$. Our main results are that this susceptibility obeys scaling form and diverges in the thermodynamic limit as $χ_p \propto |φ - φ_c^\infty|^{-γ_p}$ where…

    Submitted 18 May, 2016; v1 submitted 2 September, 2015; originally announced September 2015.

    Comments: 5 pages, 3 figures (1a, 1b, 2, 3a, 3b, 3c)

    Journal ref: Phys. Rev. Lett. 116, 235501 (2016)

  27. arXiv:1507.01526  [pdf, other]

    cs.NE cs.CL cs.LG

    Grid Long Short-Term Memory

    Authors: Nal Kalchbrenner, Ivo Danihelka, Alex Graves

    Abstract: This paper introduces Grid Long Short-Term Memory, a network of LSTM cells arranged in a multidimensional grid that can be applied to vectors, sequences or higher dimensional data such as images. The network differs from existing deep LSTM architectures in that the cells are connected between network layers as well as along the spatiotemporal dimensions of the data. The network provides a unified…

    Submitted 7 January, 2016; v1 submitted 6 July, 2015; originally announced July 2015.

    Comments: 15 pages

  28. arXiv:1502.04623  [pdf, other]

    cs.CV cs.LG cs.NE

    DRAW: A Recurrent Neural Network For Image Generation

    Authors: Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, Daan Wierstra

    Abstract: This paper introduces the Deep Recurrent Attentive Writer (DRAW) neural network architecture for image generation. DRAW networks combine a novel spatial attention mechanism that mimics the foveation of the human eye, with a sequential variational auto-encoding framework that allows for the iterative construction of complex images. The system substantially improves on the state of the art for gener…

    Submitted 20 May, 2015; v1 submitted 16 February, 2015; originally announced February 2015.

  29. arXiv:1410.5401  [pdf, other]

    cs.NE

    Neural Turing Machines

    Authors: Alex Graves, Greg Wayne, Ivo Danihelka

    Abstract: We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The combined system is analogous to a Turing Machine or Von Neumann architecture but is differentiable end-to-end, allowing it to be efficiently trained with gradient descent. Preliminary results demonstrate that Neural Turing Machines can infer simple…

    Submitted 10 December, 2014; v1 submitted 20 October, 2014; originally announced October 2014.

  30. arXiv:1406.6247  [pdf, other]

    cs.LG cs.CV stat.ML

    Recurrent Models of Visual Attention

    Authors: Volodymyr Mnih, Nicolas Heess, Alex Graves, Koray Kavukcuoglu

    Abstract: Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of extracting information from an image or video by adaptively selecting a sequence of regions or locations and only processing the selected regions at high resolution…

    Submitted 24 June, 2014; originally announced June 2014.

  31. arXiv:1312.5602  [pdf, other]

    cs.LG

    Playing Atari with Deep Reinforcement Learning

    Authors: Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller

    Abstract: We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning E…

    Submitted 19 December, 2013; originally announced December 2013.

    Comments: NIPS Deep Learning Workshop 2013

  32. arXiv:1308.0850  [pdf, other]

    cs.NE cs.CL

    Generating Sequences With Recurrent Neural Networks

    Authors: Alex Graves

    Abstract: This paper shows how Long Short-term Memory recurrent neural networks can be used to generate complex sequences with long-range structure, simply by predicting one data point at a time. The approach is demonstrated for text (where the data are discrete) and online handwriting (where the data are real-valued). It is then extended to handwriting synthesis by allowing the network to condition its pre…

    Submitted 5 June, 2014; v1 submitted 4 August, 2013; originally announced August 2013.

    Comments: Thanks to Peng Liu and Sergey Zyrianov for various corrections

  33. arXiv:1303.5778  [pdf, other]

    cs.NE cs.CL

    Speech Recognition with Deep Recurrent Neural Networks

    Authors: Alex Graves, Abdel-rahman Mohamed, Geoffrey Hinton

    Abstract: Recurrent neural networks (RNNs) are a powerful model for sequential data. End-to-end training methods such as Connectionist Temporal Classification make it possible to train RNNs for sequence labelling problems where the input-output alignment is unknown. The combination of these methods with the Long Short-term Memory RNN architecture has proved particularly fruitful, delivering state-of-the-art…

    Submitted 22 March, 2013; originally announced March 2013.

    Comments: To appear in ICASSP 2013

  34. arXiv:1211.3711  [pdf, other]

    cs.NE cs.LG stat.ML

    Sequence Transduction with Recurrent Neural Networks

    Authors: Alex Graves

    Abstract: Many machine learning tasks can be expressed as the transformation, or transduction, of input sequences into output sequences: speech recognition, machine translation, protein secondary structure prediction and text-to-speech to name but a few. One of the key challenges in sequence transduction is learning to represent both the input and output sequences in a way that is invariant to sequ…

    Submitted 14 November, 2012; originally announced November 2012.

    Comments: First published in the International Conference of Machine Learning (ICML) 2012 Workshop on Representation Learning

  35. arXiv:1208.3101  [pdf, ps, other]

    cs.DL cs.SI physics.soc-ph

    Statistical Common Author Networks (SCAN)

    Authors: F. G. Serpa, Adam M. Graves, Artjay Javier

    Abstract: A new method for visualizing the relatedness of scientific areas is developed that is based on measuring the overlap of researchers between areas. It is found that closely related areas have a high propensity to share a larger number of common authors. A methodology for comparing areas of vastly different sizes and to handle name homonymy is constructed, allowing for the robust deployment of this…

    Submitted 8 March, 2013; v1 submitted 15 August, 2012; originally announced August 2012.

    Comments: Accepted to JASIST (February 2013). Copyright 2013 American Society of Information Science and Technology

  36. arXiv:0804.3269  [pdf, ps, other]

    cs.CL cs.NE

    Phoneme recognition in TIMIT with BLSTM-CTC

    Authors: Santiago Fernández, Alex Graves, Juergen Schmidhuber

    Abstract: We compare the performance of a recurrent neural network with the best results published so far on phoneme recognition in the TIMIT database. These published results have been obtained with a combination of classifiers. However, in this paper we apply a single recurrent neural network to the same task. Our recurrent neural network attains an error rate of 24.6%. This result is not significantly…

    Submitted 21 April, 2008; originally announced April 2008.

    Comments: 8 pages

    Report number: IDSIA-04-08 ACM Class: I.2.7; I.5.4

  37. arXiv:0705.2011  [pdf, other]

    cs.AI cs.CV

    Multi-Dimensional Recurrent Neural Networks

    Authors: Alex Graves, Santiago Fernandez, Juergen Schmidhuber

    Abstract: Recurrent neural networks (RNNs) have proved effective at one dimensional sequence learning tasks, such as speech and online handwriting recognition. Some of the properties that make RNNs suitable for such tasks, for example robustness to input warping, and the ability to access contextual information, are also desirable in multidimensional domains. However, there has so far been no direct way o…

    Submitted 14 May, 2007; originally announced May 2007.

    Comments: 10 pages, 10 figures

    Report number: 04-07