Skip to main content

Showing 1–2 of 2 results for author: Karydas, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.13160  [pdf, other

    cs.LG cs.CL

    SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection

    Authors: Ke Ye, Heinrich Jiang, Afshin Rostamizadeh, Ayan Chakrabarti, Giulia DeSalvo, Jean-François Kagy, Lazaros Karydas, Gui Citovsky, Sanjiv Kumar

    Abstract: Pre-training large language models is known to be extremely resource intensive and often times inefficient, under-utilizing the information encapsulated in the training text sequences. In this paper, we present SpacTor, a new training procedure consisting of (1) a hybrid objective combining span corruption (SC) and token replacement detection (RTD), and (2) a two-stage curriculum that optimizes th… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 9+13 pages, 5 figures

  2. arXiv:2107.14263  [pdf, other

    cs.LG cs.AI

    Batch Active Learning at Scale

    Authors: Gui Citovsky, Giulia DeSalvo, Claudio Gentile, Lazaros Karydas, Anand Rajagopalan, Afshin Rostamizadeh, Sanjiv Kumar

    Abstract: The ability to train complex and highly effective models often requires an abundance of training data, which can easily become a bottleneck in cost, time, and computational resources. Batch active learning, which adaptively issues batched queries to a labeling oracle, is a common approach for addressing this problem. The practical benefits of batch sampling come with the downside of less adaptivit… ▽ More

    Submitted 29 July, 2021; originally announced July 2021.