
Showing 1–2 of 2 results for author: Milliken, L

  1. arXiv:2406.14836  [pdf, other]

    cs.SE

    Identifying Inaccurate Descriptions in LLM-generated Code Comments via Test Execution

    Authors: Sungmin Kang, Louis Milliken, Shin Yoo

    Abstract: Software comments are critical for human understanding of software, and as such many comment generation techniques have been proposed. However, we find that a systematic evaluation of the factual accuracy of generated comments is rare; only subjective accuracy labels have been given. Evaluating comments generated by three Large Language Models (LLMs), we find that even for the best-performing LLM,…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: The supplementary material is provided at: https://smkang96.github.io/assets/pdf/doctest_supplementary_arxiv.pdf

  2. arXiv:2307.11224  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models

    Authors: Michael Günther, Louis Milliken, Jonathan Geuter, Georgios Mastrapas, Bo Wang, Han Xiao

    Abstract: Jina Embeddings constitutes a set of high-performance sentence embedding models adept at translating textual inputs into numerical representations, capturing the semantics of the text. These models excel in applications like dense retrieval and semantic textual similarity. This paper details the development of Jina Embeddings, starting with the creation of high-quality pairwise and triplet dataset…

    Submitted 20 October, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 9 pages, 2 page appendix

    MSC Class: 68T50 ACM Class: H.3.1; H.3.3; I.2.7; I.5.4