Showing 1–11 of 11 results for author: Strope, B

  1. arXiv:1907.04307  [pdf, other]

    cs.CL

    Multilingual Universal Sentence Encoder for Semantic Retrieval

    Authors: Yinfei Yang, Daniel Cer, Amin Ahmad, Mandy Guo, Jax Law, Noah Constant, Gustavo Hernandez Abrego, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We introduce two pre-trained retrieval-focused multilingual sentence encoding models, respectively based on the Transformer and CNN model architectures. The models embed text from 16 languages into a single semantic space using a multi-task trained dual-encoder that learns tied representations using translation-based bridge tasks (Chidambaram et al., 2018). The models provide performance that is comp…

    Submitted 9 July, 2019; originally announced July 2019.

    Comments: 6 pages, 6 tables, 2 listings, and 1 figure

  2. arXiv:1906.08401  [pdf, other]

    cs.CL

    Hierarchical Document Encoder for Parallel Corpus Mining

    Authors: Mandy Guo, Yinfei Yang, Keith Stevens, Daniel Cer, Heming Ge, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We explore using multilingual document embeddings for nearest neighbor mining of parallel data. Three document-level representations are investigated: (i) document embeddings generated by simply averaging multilingual sentence embeddings; (ii) a neural bag-of-words (BoW) document encoding model; (iii) a hierarchical multilingual document encoder (HiDE) that builds on our sentence-level model. The…

    Submitted 30 June, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: accepted by WMT2019

  3. arXiv:1902.08564  [pdf, other]

    cs.CL

    Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax

    Authors: Yinfei Yang, Gustavo Hernandez Abrego, Steve Yuan, Mandy Guo, Qinlan Shen, Daniel Cer, Yun-hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: In this paper, we present an approach to learn multilingual sentence embeddings using a bi-directional dual-encoder with additive margin softmax. The embeddings are able to achieve state-of-the-art results on the United Nations (UN) parallel corpus retrieval task. In all the languages tested, the system achieves P@1 of 86% or higher. We use pairs retrieved by our approach to train NMT models that…

    Submitted 14 June, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Accepted by IJCAI'19 (International Joint Conference on Artificial Intelligence)

  4. arXiv:1810.12836  [pdf, other]

    cs.CL

    Learning Cross-Lingual Sentence Representations via a Multi-task Dual-Encoder Model

    Authors: Muthuraman Chidambaram, Yinfei Yang, Daniel Cer, Steve Yuan, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: A significant roadblock in multilingual neural language modeling is the lack of labeled non-English data. One potential method for overcoming this issue is learning cross-lingual text representations that can be used to transfer the performance from training on English tasks to non-English tasks, despite little to no task-specific non-English data. In this paper, we explore a natural setup for lea…

    Submitted 1 August, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: Accepted at the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)

    Journal ref: In Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)

  5. arXiv:1807.11906  [pdf, other]

    cs.CL

    Effective Parallel Corpus Mining using Bilingual Sentence Embeddings

    Authors: Mandy Guo, Qinlan Shen, Yinfei Yang, Heming Ge, Daniel Cer, Gustavo Hernandez Abrego, Keith Stevens, Noah Constant, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: This paper presents an effective approach for parallel corpus mining using bilingual sentence embeddings. Our embedding models are trained to produce similar representations exclusively for bilingual sentence pairs that are translations of each other. This is achieved using a novel training method that introduces hard negatives consisting of sentences that are not translations but that have some d…

    Submitted 2 August, 2018; v1 submitted 31 July, 2018; originally announced July 2018.

  6. arXiv:1804.07754  [pdf, other]

    cs.CL

    Learning Semantic Textual Similarity from Conversations

    Authors: Yinfei Yang, Steve Yuan, Daniel Cer, Sheng-yi Kong, Noah Constant, Petr Pilar, Heming Ge, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We present a novel approach to learn representations for sentence-level semantic similarity using conversational data. Our method trains an unsupervised model to predict conversational input-response pairs. The resulting sentence embeddings perform well on the semantic textual similarity (STS) benchmark and SemEval 2017's Community Question Answering (CQA) question similarity subtask. Performance…

    Submitted 20 April, 2018; originally announced April 2018.

    Comments: 10 pages, 8 figures, 6 tables

  7. arXiv:1803.11175  [pdf, other]

    cs.CL

    Universal Sentence Encoder

    Authors: Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, r…

    Submitted 12 April, 2018; v1 submitted 29 March, 2018; originally announced March 2018.

    Comments: 7 pages; fixed module URL in Listing 1

  8. arXiv:1705.00652  [pdf, other]

    cs.CL

    Efficient Natural Language Response Suggestion for Smart Reply

    Authors: Matthew Henderson, Rami Al-Rfou, Brian Strope, Yun-hsuan Sung, Laszlo Lukacs, Ruiqi Guo, Sanjiv Kumar, Balint Miklos, Ray Kurzweil

    Abstract: This paper presents a computationally efficient machine-learned method for natural language response suggestion. Feed-forward neural networks using n-gram embedding features encode messages into vectors which are optimized to give message-response pairs a high dot-product value. An optimized search finds response suggestions. The method is evaluated in a large-scale commercial e-mail application,…

    Submitted 1 May, 2017; originally announced May 2017.

  9. arXiv:1701.03185  [pdf, other]

    cs.CL

    Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models

    Authors: Louis Shao, Stephan Gouws, Denny Britz, Anna Goldie, Brian Strope, Ray Kurzweil

    Abstract: Sequence-to-sequence models have been applied to the conversation response generation problem where the source sequence is the conversation history and the target sequence is the response. Unlike translation, conversation responding is inherently creative. The generation of long, informative, coherent, and diverse responses remains a hard task. In this work, we focus on the single turn setting. We…

    Submitted 31 July, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

    Comments: To appear in EMNLP 2017

  10. arXiv:1606.00372  [pdf, other]

    cs.CL cs.LG

    Conversational Contextual Cues: The Case of Personalization and History for Response Ranking

    Authors: Rami Al-Rfou, Marc Pickett, Javier Snaider, Yun-hsuan Sung, Brian Strope, Ray Kurzweil

    Abstract: We investigate the task of modeling open-domain, multi-turn, unstructured, multi-participant, conversational dialogue. We specifically study the effect of incorporating different elements of the conversation. Unlike previous efforts, which focused on modeling messages and responses, we extend the modeling to long context and participant's history. Our system does not rely on handwritten rules or e…

    Submitted 1 June, 2016; originally announced June 2016.

    Comments: 10 pages, 6 figures

  11. arXiv:1602.06291  [pdf, other]

    cs.CL

    Contextual LSTM (CLSTM) models for Large scale NLP tasks

    Authors: Shalini Ghosh, Oriol Vinyals, Brian Strope, Scott Roy, Tom Dean, Larry Heck

    Abstract: Documents exhibit sequential structure at multiple levels of abstraction (e.g., sentences, paragraphs, sections). These abstractions constitute a natural hierarchy for representing the context in which to infer the meaning of words and larger fragments of text. In this paper, we present CLSTM (Contextual LSTM), an extension of the recurrent neural network LSTM (Long-Short Term Memory) model, where…

    Submitted 31 May, 2016; v1 submitted 19 February, 2016; originally announced February 2016.