Skip to main content

Showing 1–4 of 4 results for author: Baines, M

.
  1. arXiv:2112.10684  [pdf, other

    cs.CL cs.AI cs.LG

    Efficient Large Scale Language Modeling with Mixtures of Experts

    Authors: Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, **gfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona Diab, Zornitsa Kozareva, Ves Stoyanov

    Abstract: Mixture of Experts layers (MoEs) enable efficient scaling of language models through conditional computation. This paper presents a detailed empirical study of how autoregressive MoE language models scale in comparison with dense models in a wide range of settings: in- and out-of-domain language modeling, zero- and few-shot priming, and full-shot fine-tuning. With the exception of fine-tuning, we… ▽ More

    Submitted 26 October, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  2. arXiv:2010.11125  [pdf, other

    cs.CL cs.LG

    Beyond English-Centric Multilingual Machine Translation

    Authors: Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin

    Abstract: Existing work in translation demonstrated the potential of massively multilingual machine translation by training a single model able to translate between any pair of languages. However, much of this work is English-Centric by training only on data which was translated from or to English. While this is supported by large sources of training data, it does not reflect translation needs worldwide. In… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  3. arXiv:1812.04710  [pdf, other

    cond-mat.soft physics.flu-dyn

    Capillary transport in low saturated sands: superfast non-linear diffusion model versus direct experimental observations

    Authors: Alex V. Lukyanov, Vladimir Mitkin, Theo G. Theofanous, Mike Baines

    Abstract: We have established in a pilot study, that the spreading of liquids in sandy porous materials at low levels of saturation, typically less than ten percent of the available void space, has very distinctive features in comparison to that at higher saturation levels. In particular, it has been shown, on theoretical grounds, that the spreading is controlled by a special type of diffusional process, th… ▽ More

    Submitted 11 December, 2018; originally announced December 2018.

    Comments: 25 pages, 22 figures

  4. arXiv:1310.8262  [pdf, other

    physics.flu-dyn cond-mat.soft

    Superfast non-linear diffusion: Capillary transport in particulate porous media

    Authors: A. V. Lukyanov, M. M. Sushchikh, M. J. Baines, T. G. Theofanous

    Abstract: The migration of liquids in porous media, such as sand, has been commonly considered at high saturation levels with liquid pathways at pore dimensions. In this letter we reveal a low saturation regime observed in our experiments with droplets of extremely low volatility liquids deposited on sand. In this regime the liquid is mostly found within the grain surface roughness and in the capillary brid… ▽ More

    Submitted 30 October, 2013; originally announced October 2013.

    Comments: 4 pages 4 figures

    Journal ref: Phys. Rev. Lett. (2012) Volume: 109 Article Number: 214501