-
Who could be behind QAnon? Authorship attribution with supervised machine-learning
Authors:
Florian Cafiero,
Jean-Baptiste Camps
Abstract:
A series of social media posts signed under the pseudonym "Q", started a movement known as QAnon, which led some of its most radical supporters to violent and illegal actions. To identify the person(s) behind Q, we evaluate the coincidence between the linguistic properties of the texts written by Q and to those written by a list of suspects provided by journalistic investigation. To identify the a…
▽ More
A series of social media posts signed under the pseudonym "Q", started a movement known as QAnon, which led some of its most radical supporters to violent and illegal actions. To identify the person(s) behind Q, we evaluate the coincidence between the linguistic properties of the texts written by Q and to those written by a list of suspects provided by journalistic investigation. To identify the authors of these posts, serious challenges have to be addressed. The "Q drops" are very short texts, written in a way that constitute a sort of literary genre in itself, with very peculiar features of style. These texts might have been written by different authors, whose other writings are often hard to find. After an online ethnology of the movement, necessary to collect enough material written by these thirteen potential authors, we use supervised machine learning to build stylistic profiles for each of them. We then performed a rolling analysis on Q's writings, to see if any of those linguistic profiles match the so-called 'QDrops' in part or entirety. We conclude that two different individuals, Paul F. and Ron W., are the closest match to Q's linguistic signature, and they could have successively written Q's texts. These potential authors are not high-ranked personality from the U.S. administration, but rather social media activists.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Reinforcement of vaccine mandates and public attitudes towards vaccines: What can we learn from google search activity ?
Authors:
Florian Cafiero,
Jeremy Ward
Abstract:
International public health policies increasingly favor mandatory immunization. If its short-term effects on vaccine coverage are well documented, there has been little consideration to its effects on public attitudes towards vaccines. In this paper, we examine Google searches related to vaccines in five countries (Australia, France, Germany, Italy, Serbia) and two American states (California) whi…
▽ More
International public health policies increasingly favor mandatory immunization. If its short-term effects on vaccine coverage are well documented, there has been little consideration to its effects on public attitudes towards vaccines. In this paper, we examine Google searches related to vaccines in five countries (Australia, France, Germany, Italy, Serbia) and two American states (California) which experienced at least one vaccine mandate extension in the past decade. We found that the effects of a new mandate implementation heavily depends on the context in each specific country or state. We also observed that there is little indication that the passing of new or extended mandates attenuated public doubt towards vaccines.
△ Less
Submitted 11 January, 2022;
originally announced January 2022.
-
No comments: Addressing commentary sections in websites' analyses
Authors:
Florian Cafiero,
Paul Guille-Escuret,
Jeremy Ward
Abstract:
Removing or extracting the commentary sections from a series of websites is a tedious task, as no standard way to code them is widely adopted. This operation is thus very rarely performed. In this paper, we show that these commentary sections can induce significant biases in the analyses, especially in the case of controversial Highlights $\bullet$ Commentary sections can induce biases in the anal…
▽ More
Removing or extracting the commentary sections from a series of websites is a tedious task, as no standard way to code them is widely adopted. This operation is thus very rarely performed. In this paper, we show that these commentary sections can induce significant biases in the analyses, especially in the case of controversial Highlights $\bullet$ Commentary sections can induce biases in the analysis of websites' contents $\bullet$ Analyzing these sections can be interesting per se. $\bullet$ We illustrate these points using a corpus of anti-vaccine websites. $\bullet$ We provide guidelines to remove or extract these sections.
△ Less
Submitted 19 April, 2021;
originally announced April 2021.
-
Corpus and Models for Lemmatisation and POS-tagging of Classical French Theatre
Authors:
Jean-Baptiste Camps,
Simon Gabay,
Paul Fièvre,
Thibault Clérice,
Florian Cafiero
Abstract:
This paper describes the process of building an annotated corpus and training models for classical French literature, with a focus on theatre, and particularly comedies in verse. It was originally developed as a preliminary step to the stylometric analyses presented in Cafiero and Camps [2019]. The use of a recent lemmatiser based on neural networks and a CRF tagger allows to achieve accuracies be…
▽ More
This paper describes the process of building an annotated corpus and training models for classical French literature, with a focus on theatre, and particularly comedies in verse. It was originally developed as a preliminary step to the stylometric analyses presented in Cafiero and Camps [2019]. The use of a recent lemmatiser based on neural networks and a CRF tagger allows to achieve accuracies beyond the current state-of-the art on the in-domain test, and proves to be robust during out-of-domain tests, i.e.up to 20th c.novels.
△ Less
Submitted 5 February, 2021; v1 submitted 15 May, 2020;
originally announced May 2020.
-
Why Molière most likely did write his plays
Authors:
Florian Cafiero,
Jean-Baptiste Camps
Abstract:
As for Shakespeare, a hard-fought debate has emerged about Molière, a supposedly uneducated actor who, according to some, could not have written the masterpieces attributed to him. In the past decades, the century-old thesis according to which Pierre Corneille would be their actual author has become popular, mostly because of new works in computational linguistics. These results are reassessed her…
▽ More
As for Shakespeare, a hard-fought debate has emerged about Molière, a supposedly uneducated actor who, according to some, could not have written the masterpieces attributed to him. In the past decades, the century-old thesis according to which Pierre Corneille would be their actual author has become popular, mostly because of new works in computational linguistics. These results are reassessed here through state-of-the-art attribution methods. We study a corpus of comedies in verse by major authors of Molière and Corneille's time. Analysis of lexicon, rhymes, word forms, affixes, morphosyntactic sequences, and function words do not give any clue that another author among the major playwrights of the time would have written the plays signed under the name Molière.
△ Less
Submitted 2 January, 2020;
originally announced January 2020.
-
Asymmetric participation of defenders and critics of vaccines to debates on French-speaking Twitter
Authors:
Floriana Gargiulo,
Florian Cafiero,
Paul Guille-Escuret,
Valerie Seror,
Jeremy Ward
Abstract:
For more than a decade, doubt about vaccines has become an increasingly important global issue. Polarization of opinions on this matter, especially through social media, has been repeatedly observed, but details about the balance of forces are left unclear. In this paper, we analyse the flow of information on vaccines on the French-speaking realm of Twitter between 2016 and 2017. Two major asymmet…
▽ More
For more than a decade, doubt about vaccines has become an increasingly important global issue. Polarization of opinions on this matter, especially through social media, has been repeatedly observed, but details about the balance of forces are left unclear. In this paper, we analyse the flow of information on vaccines on the French-speaking realm of Twitter between 2016 and 2017. Two major asymmetries appear. Rather than opposing themselves on each vaccine-related controversy, pro and anti-vaccine accounts focus on different vaccines and vaccine-related topics. Pro-vaccine accounts focus on hopes for new groundbreaking vaccines and on ongoing outbreaks of vaccine-preventable illnesses. Vaccine critics concentrate their posts on a limited number of controversial vaccines and adjuvants. Furthermore, vaccine-critical accounts display greater craft and energy, using a wider variety of sources, and a more coordinated set of hashtags. This double asymmetry can have serious consequences. Despite the presence of a large number of pro-vaccine accounts, some arguments raised by efficiently organized and very active vaccine-critical activists are left unanswered.
△ Less
Submitted 4 May, 2020; v1 submitted 18 September, 2019;
originally announced September 2019.