Skip to main content

Showing 1–3 of 3 results for author: Moshkov, I

.
  1. arXiv:2406.11704  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek , et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2402.10176  [pdf, other

    cs.CL cs.AI cs.LG

    OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

    Authors: Shubham Toshniwal, Ivan Moshkov, Sean Narenthiran, Daria Gitman, Fei Jia, Igor Gitman

    Abstract: Recent work has shown the immense potential of synthetically generated datasets for training large language models (LLMs), especially for acquiring targeted skills. Current large-scale math instruction tuning datasets such as MetaMathQA (Yu et al., 2024) and MAmmoTH (Yue et al., 2024) are constructed using outputs from closed-source LLMs with commercially restrictive licenses. A key reason limitin… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: Data and models are available at https://huggingface.co/collections/nvidia/openmath-65c5619de2ba059be0775014

  3. arXiv:1701.09124  [pdf, other

    cond-mat.supr-con

    Point-contact spectroscopy of the high-temperature superconductor BiSrCaCuO

    Authors: L. F. Rybal'chenko, V. V. Fisun, N. L. Bobrov, M. B. Kosmyna, A. I. Moshkov, V. P. Seminozhenko, I. K. Yanson

    Abstract: The maximum value of the energy gap $Δ\simeq 8\ meV$ and the ratio $2δ/kT_c\simeq~2.5$ are determined for a high-temperature superconductor $\rm Bi_2Sr_2CaCu_2O_{8+y}$ by using point contacts. It was found that the high-temperature superconductor is transformed under the effect of current injection of quasiparticles to a new modified state with a reduced gap, which is stable in a wide range of inj… ▽ More

    Submitted 1 February, 2017; v1 submitted 31 January, 2017; originally announced January 2017.

    Comments: 4 pages, 3 figures

    Journal ref: Fiz. Nizk. Temp., 15, 95 (1989), Sov. J. Low Temp. Phys., 15, 54 (1989)