Skip to main content

Showing 1–2 of 2 results for author: Valmeekam, C S K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.00226  [pdf, other

    eess.SP cs.LG

    Transformers are Provably Optimal In-context Estimators for Wireless Communications

    Authors: Vishnu Teja Kunde, Vicram Rajagopalan, Chandra Shekhara Kaushik Valmeekam, Krishna Narayanan, Srinivas Shakkottai, Dileep Kalathil, Jean-Francois Chamberland

    Abstract: Pre-trained transformers exhibit the capability of adapting to new tasks through in-context learning (ICL), where they efficiently utilize a limited set of prompts without explicit model optimization. The canonical communication problem of estimating transmitted symbols from received observations can be modelled as an in-context learning problem: Received observations are essentially a noisy fun… ▽ More

    Submitted 14 June, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 13 pages, 2 figures, 2 tables, preprint; abstract, references, theory updated

  2. arXiv:2306.04050  [pdf, ps, other

    cs.IT cs.CL cs.LG

    LLMZip: Lossless Text Compression using Large Language Models

    Authors: Chandra Shekhara Kaushik Valmeekam, Krishna Narayanan, Dileep Kalathil, Jean-Francois Chamberland, Srinivas Shakkottai

    Abstract: We provide new estimates of an asymptotic upper bound on the entropy of English using the large language model LLaMA-7B as a predictor for the next token given a window of past tokens. This estimate is significantly smaller than currently available estimates in \cite{cover1978convergent}, \cite{lutati2023focus}. A natural byproduct is an algorithm for lossless compression of English text which com… ▽ More

    Submitted 26 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 7 pages, 4 figures, 4 tables, preprint, added results on using LLMs with arithmetic coding