Skip to main content

Showing 1–6 of 6 results for author: Blitzer, J

.
  1. arXiv:1909.13128  [pdf, other

    cs.CL

    Integrated Triaging for Fast Reading Comprehension

    Authors: Felix Wu, Boyi Li, Lequn Wang, Ni Lao, John Blitzer, Kilian Q. Weinberger

    Abstract: Although according to several benchmarks automatic machine reading comprehension (MRC) systems have recently reached super-human performance, less attention has been paid to their computational efficiency. However, efficiency is of crucial importance for training and deployment in real world applications. This paper introduces Integrated Triaging, a framework that prunes almost all context in earl… ▽ More

    Submitted 28 September, 2019; originally announced September 2019.

    Comments: Technical report

  2. arXiv:1902.11291  [pdf, other

    cs.CL

    FastFusionNet: New State-of-the-Art for DAWNBench SQuAD

    Authors: Felix Wu, Boyi Li, Lequn Wang, Ni Lao, John Blitzer, Kilian Q. Weinberger

    Abstract: In this technical report, we introduce FastFusionNet, an efficient variant of FusionNet [12]. FusionNet is a high performing reading comprehension architecture, which was designed primarily for maximum retrieval accuracy with less regard towards computational requirements. For FastFusionNets we remove the expensive CoVe layers [21] and substitute the BiLSTMs with far more efficient SRU layers [19]… ▽ More

    Submitted 2 March, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: A Technical Report

  3. arXiv:1711.04352  [pdf, other

    cs.CL

    Fast Reading Comprehension with ConvNets

    Authors: Felix Wu, Ni Lao, John Blitzer, Guandao Yang, Kilian Weinberger

    Abstract: State-of-the-art deep reading comprehension models are dominated by recurrent neural nets. Their sequential nature is a natural fit for language, but it also precludes parallelization within an instances and often becomes the bottleneck for deploying such models to latency critical scenarios. This is particularly problematic for longer texts. Here we present a convolutional architecture as an alte… ▽ More

    Submitted 12 November, 2017; originally announced November 2017.

    Comments: 15 pages, 10 figures, submitted to ICLR 2018

  4. arXiv:1609.09405  [pdf, other

    cs.CL cs.AI

    Evaluating Induced CCG Parsers on Grounded Semantic Parsing

    Authors: Yonatan Bisk, Siva Reddy, John Blitzer, Julia Hockenmaier, Mark Steedman

    Abstract: We compare the effectiveness of four different syntactic CCG parsers for a semantic slot-filling task to explore how much syntactic supervision is required for downstream semantic analysis. This extrinsic, task-based evaluation provides a unique window to explore the strengths and weaknesses of semantics captured by unsupervised grammar induction systems. We release a new Freebase semantic parsing… ▽ More

    Submitted 31 January, 2017; v1 submitted 29 September, 2016; originally announced September 2016.

    Comments: EMNLP 2016, Table 2 erratum, Code and Freebase Semantic Parsing data URL

  5. arXiv:1210.4914  [pdf

    cs.LG cs.IR stat.ML

    Latent Structured Ranking

    Authors: Jason Weston, John Blitzer

    Abstract: Many latent (factorized) models have been proposed for recommendation tasks like collaborative filtering and for ranking tasks like document or image retrieval and annotation. Common to all those methods is that during inference the items are scored independently by their similarity to the query in the latent embedding space. The structure of the ranked list (i.e. considering the set of items retu… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-903-913

  6. arXiv:1206.3256  [pdf

    cs.LG stat.ML

    Multi-View Learning over Structured and Non-Identical Outputs

    Authors: Kuzman Ganchev, Joao Graca, John Blitzer, Ben Taskar

    Abstract: In many machine learning problems, labeled training data is limited but unlabeled data is ample. Some of these problems have instances that can be factored into multiple views, each of which is nearly sufficent in determining the correct labels. In this paper we present a new algorithm for probabilistic multi-view learning which uses the idea of stochastic agreement between views as regularization… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-204-211