Showing 1–6 of 6 results for author: Barrie, C

  1. arXiv:2407.02039  [pdf, other]

    cs.CL

    Prompt Stability Scoring for Text Annotation with Large Language Models

    Authors: Christopher Barrie, Elli Palaiologou, Petter Törnberg

    Abstract: Researchers are increasingly using language models (LMs) for text annotation. These approaches rely only on a prompt telling the model to return a given output according to a set of instructions. The reproducibility of LM outputs may nonetheless be vulnerable to small changes in the prompt design. This calls into question the replicability of classification routines. To tackle this problem, resear…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 33 pages, 4 figures

  2. arXiv:2406.12069  [pdf, other]

    cs.CL

    Satyrn: A Platform for Analytics Augmented Generation

    Authors: Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J. Hammond

    Abstract: Large language models (LLMs) are capable of producing documents, and retrieval augmented generation (RAG) has shown itself to be a powerful method for improving accuracy without sacrificing fluency. However, not all information can be retrieved from text. We propose an approach that uses the analysis of structured data to generate fact sets that are used to guide generation in much the same way th…

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2311.12848  [pdf, other]

    cs.DB cs.AI

    Lightweight Knowledge Representations for Automating Data Analysis

    Authors: Marko Sterbentz, Cameron Barrie, Donna Hooshmand, Shubham Shahi, Abhratanu Dutta, Harper Pack, Andong Li Zhao, Andrew Paley, Alexander Einarsson, Kristian Hammond

    Abstract: The principal goal of data science is to derive meaningful information from data. To do this, data scientists develop a space of analytic possibilities and from it reach their information goals by using their knowledge of the domain, the available data, the operations that can be performed on those data, the algorithms/models that are fed the data, and how all of these facets interweave. In this w…

    Submitted 15 October, 2023; originally announced November 2023.

  4. arXiv:2212.10646  [pdf, other]

    cs.SI cs.CY

    Did the Musk Takeover Boost Contentious Actors on Twitter?

    Authors: Christopher Barrie

    Abstract: Twitter has been accused of a liberal bias in its account verification and content moderation policies. Elon Musk pledged, after his acquisition of the company, to promote free speech on the platform by overhauling verification and moderation policies. These events sparked fears of a rise in influence of contentious actors -- notably from the political right. In this article, I use a publicly rele…

    Submitted 20 December, 2022; originally announced December 2022.

  5. arXiv:2112.03119  [pdf, ps, other]

    cs.AI cs.HC

    Requirements for Open Political Information: Transparency Beyond Open Data

    Authors: Andong Luis Li Zhao, Andrew Paley, Rachel Adler, Harper Pack, Sergio Servantez, Alexander Einarsson, Cameron Barrie, Marko Sterbentz, Kristian Hammond

    Abstract: A politically informed citizenry is imperative for a well-developed democracy. While the US government has pursued policies for open data, these efforts have been insufficient in achieving an open government because only people with technical and domain knowledge can access information in the data. In this work, we conduct user interviews to identify wants and needs among stakeholders. We further u…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Presented at AAAI FSS-21: Artificial Intelligence in Government and Public Sector, Washington, DC, USA

  6. arXiv:2106.01814  [pdf, other]

    stat.ME stat.AP

    Explaining Recruitment to Extremism: A Bayesian Case-Control Approach

    Authors: Roberto Cerina, Christopher Barrie, Neil Ketchley, Aaron Zelin

    Abstract: Who joins extremist movements? Answering this question poses considerable methodological challenges. Survey techniques are infeasible and selective samples provide no counterfactual. Recruits can be assigned to contextual units, but this is vulnerable to problems of ecological inference. In this article, we take inspiration from epidemiology and elaborate a technique that combines survey and ecolo…

    Submitted 17 May, 2022; v1 submitted 3 June, 2021; originally announced June 2021.