Showing 1–9 of 9 results for author: Turek, J S

Searching in archive cs.
  1. arXiv:2105.05944

    cs.LG

    Slower is Better: Revisiting the Forgetting Mechanism in LSTM for Slower Information Decay

    Authors: Hsiang-Yun Sherry Chien, Javier S. Turek, Nicole Beckage, Vy A. Vo, Christopher J. Honey, Ted L. Willke

    Abstract: Sequential information contains short- to long-range dependencies; however, learning long-timescale information has been a challenge for recurrent neural networks. Despite improvements in long short-term memory networks (LSTMs), the forgetting mechanism results in the exponential decay of information, limiting their capacity to capture long-timescale information. Here, we propose a power law forge…

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 16 pages, 10 figures

  2. arXiv:2009.12727

    cs.CL cs.LG

    Multi-timescale Representation Learning in LSTM Language Models

    Authors: Shivangi Mahto, Vy A. Vo, Javier S. Turek, Alexander G. Huth

    Abstract: Language models must capture statistical dependencies between words at timescales ranging from very short to very long. Earlier work has demonstrated that dependencies in natural language tend to decay with distance between words according to a power law. However, it is unclear how this knowledge can be used for analyzing or designing neural network language models. In this work, we derived a theo…

    Submitted 17 March, 2021; v1 submitted 26 September, 2020; originally announced September 2020.

    MSC Class: 91F20

    ACM Class: I.2.7; I.2.6

    Journal ref: International Conference on Learning Representations 2021

  3. arXiv:1909.00021

    cs.LG cs.CL cs.NE stat.ML

    Approximating Stacked and Bidirectional Recurrent Architectures with the Delayed Recurrent Neural Network

    Authors: Javier S. Turek, Shailee Jain, Vy Vo, Mihai Capota, Alexander G. Huth, Theodore L. Willke

    Abstract: Recent work has shown that topological enhancements to recurrent neural networks (RNNs) can increase their expressiveness and representational capacity. Two popular enhancements are stacked RNNs, which increase the capacity for learning non-linear functions, and bidirectional processing, which exploits acausal information in a sequence. In this work, we explore the delayed-RNN, which is a single-…

    Submitted 18 June, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: to be published in Proceedings of International Conference on Machine Learning 2020 (ICML)

    MSC Class: 62M45

    ACM Class: I.2.6; I.5.1

  4. arXiv:1908.08783

    cs.NE cs.LG stat.ML

    Learning Fitness Functions for Machine Programming

    Authors: Shantanu Mandal, Todd A. Anderson, Javier S. Turek, Justin Gottschlich, Shengtian Zhou, Abdullah Muzahid

    Abstract: The problem of automatic software generation is known as Machine Programming. In this work, we propose a framework based on genetic algorithms to solve this problem. Although genetic algorithms have been used successfully for many problems, one criticism is that hand-crafting its fitness function, the test that aims to effectively guide its evolution, can be notably challenging. Our framework pres…

    Submitted 23 January, 2021; v1 submitted 22 August, 2019; originally announced August 2019.

    Journal ref: Proceedings of Machine Learning and Systems (MLSys), 3 (2021), 139-155

  5. arXiv:1809.04195

    physics.med-ph cs.CV cs.DC

    Clinically Deployed Distributed Magnetic Resonance Imaging Reconstruction: Application to Pediatric Knee Imaging

    Authors: Michael J. Anderson, Jonathan I. Tamir, Javier S. Turek, Marcus T. Alley, Theodore L. Willke, Shreyas S. Vasanawala, Michael Lustig

    Abstract: Magnetic resonance imaging is capable of producing volumetric images without ionizing radiation. Nonetheless, long acquisitions lead to prohibitively long exams. Compressed sensing (CS) can enable faster scanning via sub-sampling with reduced artifacts. However, CS requires significantly higher reconstruction computation, limiting current clinical applications to 2D/3D or limited-resolution dynami…

    Submitted 11 September, 2018; originally announced September 2018.

    MSC Class: 68W15; 68U10

  6. arXiv:1705.10887

    stat.ML cs.CV cs.LG math.NA

    Efficient, sparse representation of manifold distance matrices for classical scaling

    Authors: Javier S. Turek, Alexander Huth

    Abstract: Geodesic distance matrices can reveal shape properties that are largely invariant to non-rigid deformations, and thus are often used to analyze and represent 3-D shapes. However, these matrices grow quadratically with the number of points. Thus for large point sets it is common to use a low-rank approximation to the distance matrix, which fits in memory and can be efficiently analyzed using method…

    Submitted 29 March, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: Conference CVPR 2018

    MSC Class: 65D05; 68T99; 65F50; 68T45

    ACM Class: I.2.10; G.1; I.4

  7. arXiv:1609.09432

    stat.ML cs.CV q-bio.NC

    A Searchlight Factor Model Approach for Locating Shared Information in Multi-Subject fMRI Analysis

    Authors: Hejia Zhang, Po-Hsuan Chen, Janice Chen, Xia Zhu, Javier S. Turek, Theodore L. Willke, Uri Hasson, Peter J. Ramadge

    Abstract: There is a growing interest in joint multi-subject fMRI analysis. The challenge of such analysis comes from inherent anatomical and functional variability across subjects. One approach to resolving this is a shared response factor model. This assumes a shared and time synchronized stimulus across subjects. Such a model can often identify shared information, but it may not be able to pinpoint with…

    Submitted 29 September, 2016; originally announced September 2016.

  8. arXiv:1608.04846

    stat.ML cs.AI cs.CV cs.LG

    A Convolutional Autoencoder for Multi-Subject fMRI Data Aggregation

    Authors: Po-Hsuan Chen, Xia Zhu, Hejia Zhang, Javier S. Turek, Janice Chen, Theodore L. Willke, Uri Hasson, Peter J. Ramadge

    Abstract: Finding the most effective way to aggregate multi-subject fMRI data is a long-standing and challenging problem. It is of increasing interest in contemporary fMRI studies of human cognition due to the scarcity of data per subject and the variability of brain anatomy and functional response across subjects. Recent work on latent factor models shows promising results in this task but this approach do…

    Submitted 16 August, 2016; originally announced August 2016.

  9. Enabling Factor Analysis on Thousand-Subject Neuroimaging Datasets

    Authors: Michael J. Anderson, Mihai Capotă, Javier S. Turek, Xia Zhu, Theodore L. Willke, Yida Wang, Po-Hsuan Chen, Jeremy R. Manning, Peter J. Ramadge, Kenneth A. Norman

    Abstract: The scale of functional magnetic resonance image data is rapidly increasing as large multi-subject datasets are becoming widely available and high-resolution scanners are adopted. The inherent low-dimensionality of the information in this data has led neuroscientists to consider factor analysis methods to extract and analyze the underlying brain activity. In this work, we consider two recent multi…

    Submitted 17 August, 2016; v1 submitted 16 August, 2016; originally announced August 2016.

    MSC Class: 68W15

    ACM Class: I.2