Skip to main content

Showing 1–5 of 5 results for author: Kulshreshtha, A

.
  1. arXiv:2201.08239  [pdf, other

    cs.CL cs.AI

    LaMDA: Language Models for Dialog Applications

    Authors: Romal Thoppilan, Daniel De Freitas, Jamie Hall, Noam Shazeer, Apoorv Kulshreshtha, Heng-Tze Cheng, Alicia **, Taylor Bos, Leslie Baker, Yu Du, YaGuang Li, Hongrae Lee, Huaixiu Steven Zheng, Amin Ghafouri, Marcelo Menegali, Yan** Huang, Maxim Krikun, Dmitry Lepikhin, James Qin, Dehao Chen, Yuanzhong Xu, Zhifeng Chen, Adam Roberts, Maarten Bosma, Vincent Zhao , et al. (35 additional authors not shown)

    Abstract: We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of Transformer-based neural language models specialized for dialog, which have up to 137B parameters and are pre-trained on 1.56T words of public dialog data and web text. While model scaling alone can improve quality, it shows less improvements on safety and factual grounding. We demonstrate that fine-tuning with annotat… ▽ More

    Submitted 10 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

  2. arXiv:2001.09977  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Towards a Human-like Open-Domain Chatbot

    Authors: Daniel Adiwardana, Minh-Thang Luong, David R. So, Jamie Hall, Noah Fiedel, Romal Thoppilan, Zi Yang, Apoorv Kulshreshtha, Gaurav Nemade, Yifeng Lu, Quoc V. Le

    Abstract: We present Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. We also propose a human evaluation metric called Sensibleness and Specificity Average (SSA), which captures key elements of a human-like multi-turn conversation.… ▽ More

    Submitted 27 February, 2020; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: 38 pages, 12 figures

  3. ExplainIt! -- A declarative root-cause analysis engine for time series data (extended version)

    Authors: Vimalkumar Jeyakumar, Omid Madani, Ali Parandeh, Ashutosh Kulshreshtha, Weifei Zeng, Navindra Yadav

    Abstract: We present ExplainIt!, a declarative, unsupervised root-cause analysis engine that uses time series monitoring data from large complex systems such as data centres. ExplainIt! empowers operators to succinctly specify a large number of causal hypotheses to search for causes of interesting events. ExplainIt! then ranks these hypotheses, reducing the number of causal dependencies from hundreds of tho… ▽ More

    Submitted 22 March, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

    Comments: SIGMOD Industry Track 2019

  4. arXiv:1811.00442  [pdf, other

    cond-mat.dis-nn cond-mat.stat-mech quant-ph

    Approximating observables on eigenstates of large many-body localized systems

    Authors: Abishek K. Kulshreshtha, Arijeet Pal, Thorsten B. Wahl, Steven H. Simon

    Abstract: Eigenstates of fully many-body localized (FMBL) systems can be organized into spin algebras based on quasilocal operators called l-bits. These spin algebras define quasilocal l-bit measurement ($τ^z_i$) and l-bit flip ($τ^x_i$) operators. For a disordered Heisenberg spin chain in the MBL regime we approximate l-bit flip operators by finding them exactly on small windows of systems and extending th… ▽ More

    Submitted 10 February, 2020; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: 10 pages, 7 figures, added references

    Journal ref: Phys. Rev. B 99, 104201 (2019)

  5. arXiv:1707.05362  [pdf, other

    cond-mat.dis-nn cond-mat.mes-hall cond-mat.stat-mech quant-ph

    Behavior of l-bits near the many-body localization transition

    Authors: Abishek K. Kulshreshtha, Arijeet Pal, Thorsten B. Wahl, Steven H. Simon

    Abstract: Eigenstates of fully many-body localized (FMBL) systems are described by quasilocal operators $τ_i^z$ (l-bits), which are conserved exactly under Hamiltonian time evolution. The algebra of the operators $τ_i^z$ and $τ_i^x$ associated with l-bits ($\boldsymbolτ_i$) completely defines the eigenstates and the matrix elements of local operators between eigenstates at all energies. We develop a non-per… ▽ More

    Submitted 10 February, 2020; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: 5+3 pages, 6 Figures, added results on finite size scaling and thermal-quantum critical crossover, additional references

    Journal ref: Phys. Rev. B 98, 184201 (2018)