Skip to main content

Showing 1–3 of 3 results for author: Bakalov, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.11755  [pdf, other

    cs.CL cs.IR

    Promptagator: Few-shot Dense Retrieval From 8 Examples

    Authors: Zhuyun Dai, Vincent Y. Zhao, Ji Ma, Yi Luan, Jianmo Ni, **g Lu, Anton Bakalov, Kelvin Guu, Keith B. Hall, Ming-Wei Chang

    Abstract: Much recent research on information retrieval has focused on how to transfer from one task (typically with abundant supervised data) to various other tasks where supervision is limited, with the implicit assumption that it is possible to generalize from one task to all the rest. However, this overlooks the fact that there are many diverse and unique retrieval tasks, each targeting different search… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

  2. arXiv:1810.04142  [pdf, other

    cs.CL

    A Fast, Compact, Accurate Model for Language Identification of Codemixed Text

    Authors: Yuan Zhang, Jason Riesa, Daniel Gillick, Anton Bakalov, Jason Baldridge, David Weiss

    Abstract: We address fine-grained multilingual language identification: providing a language code for every token in a sentence, including codemixed text containing multiple languages. Such text is prevalent online, in documents, social media, and message boards. We show that a feed-forward network with a simple globally constrained decoder can accurately and rapidly label both codemixed and monolingual tex… ▽ More

    Submitted 9 October, 2018; originally announced October 2018.

    Comments: EMNLP 2018

  3. arXiv:1708.00214  [pdf, other

    cs.CL cs.NE

    Natural Language Processing with Small Feed-Forward Networks

    Authors: Jan A. Botha, Emily Pitler, Ji Ma, Anton Bakalov, Alex Salcianu, David Weiss, Ryan McDonald, Slav Petrov

    Abstract: We show that small and shallow feed-forward neural networks can achieve near state-of-the-art results on a range of unstructured and structured language processing tasks while being considerably cheaper in memory and computational requirements than deep recurrent models. Motivated by resource-constrained environments like mobile phones, we showcase simple techniques for obtaining such small neural… ▽ More

    Submitted 1 August, 2017; originally announced August 2017.

    Comments: EMNLP 2017 short paper

    MSC Class: 68T50 ACM Class: I.2.7