Skip to main content

Showing 1–4 of 4 results for author: Goddard, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14971  [pdf, other

    cs.CL cs.AI cs.LG

    Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation

    Authors: Shamane Siriwardhana, Mark McQuade, Thomas Gauthier, Lucas Atkins, Fernando Fernandes Neto, Luke Meyers, Anneketh Vij, Tyler Odenthal, Charles Goddard, Mary MacCarthy, Jacob Solawetz

    Abstract: We conducted extensive experiments on domain adaptation of the Meta-Llama-3-70B-Instruct model on SEC data, exploring its performance on both general and domain-specific benchmarks. Our focus included continual pre-training (CPT) and model merging, aiming to enhance the model's domain-specific capabilities while mitigating catastrophic forgetting. Through this study, we evaluated the impact of int… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures

  2. arXiv:2403.13257  [pdf, other

    cs.CL cs.AI cs.LG

    Arcee's MergeKit: A Toolkit for Merging Large Language Models

    Authors: Charles Goddard, Shamane Siriwardhana, Malikeh Ehghaghi, Luke Meyers, Vlad Karpukhin, Brian Benedict, Mark McQuade, Jacob Solawetz

    Abstract: The rapid expansion of the open-source language model landscape presents an opportunity to merge the competencies of these model checkpoints by combining their parameters. Advances in transfer learning, the process of fine-tuning pretrained models for specific tasks, has resulted in the development of vast amounts of task-specific models, typically specialized in individual tasks and unable to uti… ▽ More

    Submitted 20 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 11 pages, 4 figures

  3. Policies for constraining the behaviour of coalitions of agents in the context of algebraic information theory

    Authors: Christopher Goddard

    Abstract: This article takes an oblique sidestep from two previous papers, wherein an approach to reformulation of game theory in terms of information theory, topology, as well as a few other notions was indicated. In this document a description is provided as to how one might determine an approach for an agent to choose a policy concerning which actions to take in a game that constrains behaviour of subsid… ▽ More

    Submitted 28 November, 2019; originally announced December 2019.

    Comments: 27 pages

  4. arXiv:1608.06697  [pdf

    cs.CL

    Semantic descriptions of 24 evaluational adjectives, for application in sentiment analysis

    Authors: Cliff Goddard, Maite Taboada, Radoslava Trnavac

    Abstract: We apply the Natural Semantic Metalanguage (NSM) approach (Goddard and Wierzbicka 2014) to the lexical-semantic analysis of English evaluational adjectives and compare the results with the picture developed in the Appraisal Framework (Martin and White 2005). The analysis is corpus-assisted, with examples mainly drawn from film and book reviews, and supported by collocational and statistical inform… ▽ More

    Submitted 23 August, 2016; originally announced August 2016.

    Report number: SFU-CMPT TR 2016-42-1