-
Sampled Datasets Risk Substantial Bias in the Identification of Political Polarization on Social Media
Authors:
Gabriele Di Bona,
Emma Fraxanet,
Björn Komander,
Andrea Lo Sasso,
Virginia Morini,
Antoine Vendeville,
Max Falkenberg,
Alessandro Galeazzi
Abstract:
Following recent policy changes by X (Twitter) and other social media platforms, user interaction data has become increasingly difficult to access. These restrictions are impeding robust research pertaining to social and political phenomena online, which is critical due to the profound impact social media platforms may have on our societies. Here, we investigate the reliability of polarization measures obtained from different samples of social media data by studying the structural polarization of the Polish political debate on Twitter over a 24-hour period. First, we show that the political discussion on Twitter is only a small subset of the wider Twitter discussion. Second, we find that large samples can be representative of the whole political discussion on a platform, but small samples consistently fail to accurately reflect the true structure of polarization online. Finally, we demonstrate that keyword-based samples can be representative if keywords are selected with great care, but that poorly selected keywords can result in substantial political bias in the sampled data. Our findings demonstrate that it is not possible to measure polarization in a reliable way with small, sampled datasets, highlighting why the current lack of research data is so problematic, and providing insight into the practical implementation of the European Union's Digital Services Act, which aims to improve researchers' access to social media data.
Submitted 28 June, 2024;
originally announced June 2024.
-
How Language, Culture, and Geography shape Online Dialogue: Insights from Koo
Authors:
Amin Mekacher,
Max Falkenberg,
Andrea Baronchelli
Abstract:
Koo is a microblogging platform based in India launched in 2020 with the explicit aim of catering to non-Western communities in their vernacular languages. With a near-complete dataset totalling over 71M posts and 399M user interactions, we show how Koo has attracted users from several countries including India, Nigeria and Brazil, but with variable levels of sustained user engagement. We highlight how Koo's interaction network has been shaped by multiple country-specific migrations and displays strong divides between linguistic and cultural communities, for instance, with English-speaking communities from India and Nigeria largely isolated from one another. Finally, we analyse the content shared by each linguistic community and identify cultural patterns that promote similar discourses across language groups. Our study raises the prospect that a multilingual and politically diverse platform like Koo may be able to cultivate vernacular communities that have, historically, not been prioritised by US-based social media platforms.
Submitted 12 March, 2024;
originally announced March 2024.
-
The Koo Dataset: An Indian Microblogging Platform With Global Ambitions
Authors:
Amin Mekacher,
Max Falkenberg,
Andrea Baronchelli
Abstract:
Increasingly, alternative platforms are playing a key role in the social media ecosystem. Koo, a microblogging platform based in India, has emerged as a major new social network hosting high profile politicians from several countries (India, Brazil, Nigeria) and many internationally renowned celebrities. This paper presents the largest publicly available Koo dataset, spanning from the platform's founding in early 2020 to September 2023, providing detailed metadata for 72M posts, 75M comments, 40M shares, 284M likes and 1.4M user profiles. Along with the release of the dataset, we provide an overview of the platform including a discussion of the news ecosystem on the platform, hashtag usage and user engagement. Our results highlight the pivotal role that new platforms play in shaping online communities in emerging economies and the Global South, connecting local politicians and public figures with their followers. With Koo's ambition to become the town hall for diverse non-English speaking communities, our dataset offers new opportunities for studying social media beyond a Western context.
Submitted 15 January, 2024;
originally announced January 2024.
-
Affective and interactional polarization align across countries
Authors:
Max Falkenberg,
Fabiana Zollo,
Walter Quattrociocchi,
Jürgen Pfeffer,
Andrea Baronchelli
Abstract:
Political polarization plays a pivotal and potentially harmful role in a democracy. However, existing studies are often limited to a single country and one form of polarization, hindering a comprehensive understanding of the phenomena. Here we investigate how affective and interactional polarization are related across nine countries (Canada, France, Germany, Italy, Poland, Spain, Turkey, UK, USA). First, we show that political interaction networks are polarized on Twitter. Second, we reveal that out-group interactions, defined by the network, are more toxic than in-group interactions, meaning that affective and interactional polarization are aligned. Third, we show that out-group interactions receive lower engagement than in-group interactions. Finally, we show that the political right reference lower reliability media than the political left, and that interactions between politically engaged accounts are limited and rarely reciprocated. These results hold across countries and represent a first step towards a more unified understanding of polarization.
Submitted 30 November, 2023;
originally announced November 2023.
-
Hurricanes Increase Climate Change Conversations on Twitter
Authors:
Maddalena Torricelli,
Max Falkenberg,
Alessandro Galeazzi,
Fabiana Zollo,
Walter Quattrociocchi,
Andrea Baronchelli
Abstract:
The public understanding of climate change plays a critical role in translating climate science into climate action. In the public discourse, climate impacts are often discussed in the context of extreme weather events. Here, we analyse 65 million Twitter posts and 240 thousand news media articles related to 18 major hurricanes from 2010 to 2022 to clarify how hurricanes impact the public discussion around climate change. First, we analyse news content and show that climate change is the most prominent non-hurricane specific topic discussed by the news media in relation to hurricanes. Second, we perform a comparative analysis between reliable and questionable news media outlets, finding that the language around climate change varies between news media providers. Finally, using geolocated data, we show that accounts in regions affected by hurricanes discuss climate change at a significantly higher rate than accounts in unaffected areas, with references to climate change increasing by, on average, 80% after impact, and up to 200% for the largest hurricanes. Our findings demonstrate how hurricanes have a key impact on the public awareness of climate change.
Submitted 12 May, 2023;
originally announced May 2023.
-
Child Face Recognition at Scale: Synthetic Data Generation and Performance Benchmark
Authors:
Magnus Falkenberg,
Anders Bensen Ottsen,
Mathias Ibsen,
Christian Rathgeb
Abstract:
We address the need for a large-scale database of children's faces by using generative adversarial networks (GANs) and face age progression (FAP) models to synthesize a realistic dataset referred to as HDA-SynChildFaces. To this end, we propose a processing pipeline that initially utilizes StyleGAN3 to sample adult subjects, which are subsequently progressed to children of varying ages using InterFaceGAN. Intra-subject variations, such as facial expression and pose, are created by further manipulating the subjects in their latent space. Additionally, the presented pipeline allows the races of subjects to be evenly distributed, making it possible to generate a balanced and fair dataset with respect to race distribution. The created HDA-SynChildFaces consists of 1,652 subjects and a total of 188,832 images, each subject being present at various ages and with many different intra-subject variations. Subsequently, we evaluate the performance of various facial recognition systems on the generated database and compare the results of adults and children at different ages. The study reveals that children consistently perform worse than adults, on all tested systems, and the degradation in performance is proportional to age. Additionally, our study uncovers some biases in the recognition systems, with Asian and Black subjects and females performing worse than White and Latino Hispanic subjects and males.
Submitted 23 April, 2023;
originally announced April 2023.
-
The Systemic Impact of Deplatforming on Social Media
Authors:
Amin Mekacher,
Max Falkenberg,
Andrea Baronchelli
Abstract:
Deplatforming, or banning malicious accounts from social media, is a key tool for moderating online harms. However, the consequences of deplatforming for the wider social media ecosystem have been largely overlooked so far, due to the difficulty of tracking banned users. Here, we address this gap by studying the ban-induced platform migration from Twitter to Gettr. With a matched dataset of 15M Gettr posts and 12M Twitter tweets, we show that users active on both platforms post similar content to users active on Gettr but banned from Twitter, but the latter have higher retention and are 5 times more active. Then, we reveal that matched users are more toxic on Twitter, where they can engage in abusive cross-ideological interactions, than on Gettr. Our analysis shows that the matched cohort are ideologically aligned with the far-right, and that the ability to interact with political opponents may be part of the appeal of Twitter to these users. Finally, we identify structural changes in the Gettr network preceding the 2023 Brasilia insurrections, highlighting how deplatforming from mainstream social media can fuel poorly-regulated alternatives that may pose a risk to democratic life.
Submitted 20 March, 2023;
originally announced March 2023.
-
Growing polarisation around climate change on social media
Authors:
Max Falkenberg,
Alessandro Galeazzi,
Maddalena Torricelli,
Niccolo Di Marco,
Francesca Larosa,
Madalina Sas,
Amin Mekacher,
Warren Pearce,
Fabiana Zollo,
Walter Quattrociocchi,
Andrea Baronchelli
Abstract:
Climate change and political polarisation are two of the 21st century's critical socio-political issues. Here, we investigate their intersection by studying the discussion around the UN Conference of The Parties on Climate Change (COP) using Twitter data from 2014 to 2021. First, we reveal a large increase in ideological polarisation during COP26, following low polarisation between COP20 and COP25. Second, we show that this increase is driven by growing right-wing activity, a 4-fold increase since COP21 relative to pro-climate groups. Finally, we identify a broad range of "climate contrarian" views during COP26, emphasising the theme of "political hypocrisy" as a topic of cross-ideological appeal; contrarian views and accusations of hypocrisy have become key themes in the Twitter climate discussion since 2019. With future climate action reliant on negotiations at COP27 and beyond, our results highlight the importance of monitoring polarisation, and its impacts, in the public climate discourse.
Submitted 14 November, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
Heterogeneous node copying from hidden network structure
Authors:
Max Falkenberg
Abstract:
Node copying is an important mechanism for network formation, yet most models assume uniform copying rules. Motivated by observations of heterogeneous triadic closure in real networks, we introduce the concept of a hidden network model - a generative two-layer model in which an observed network evolves according to the structure of an underlying hidden layer - and apply the framework to a model of heterogeneous copying. Framed in a social context, these two layers represent a node's inner social circle, and wider social circle, such that the model can bias copying probabilities towards, or against, a node's inner circle of friends. Comparing the case of extreme inner circle bias to an equivalent model with uniform copying, we find that heterogeneous copying suppresses the power-law degree distributions commonly seen in copying models, and results in networks with much higher clustering than even the optimal scenario for uniform copying. Similarly large clustering values are found in real collaboration networks, lending empirical support to the mechanism.
Submitted 24 July, 2021; v1 submitted 18 February, 2021;
originally announced February 2021.
-
Identifying time dependence in network growth
Authors:
Max Falkenberg,
Jong-Hyeok Lee,
Shun-ichi Amano,
Ken-ichiro Ogawa,
Kazuo Yano,
Yoshihiro Miyake,
Tim S. Evans,
Kim Christensen
Abstract:
Identifying power-law scaling in real networks - indicative of preferential attachment - has proved controversial. Critics argue that measuring the temporal evolution of a network directly is better than measuring the degree distribution when looking for preferential attachment. However, many of the established methods do not account for any potential time-dependence in the attachment kernels of growing networks, or methods assume that node degree is the key observable determining network evolution. In this paper, we argue that these assumptions may lead to misleading conclusions about the evolution of growing networks. We illustrate this by introducing a simple adaptation of the Barabási-Albert model, the "k2 model", where new nodes attach to nodes in the existing network in proportion to the number of nodes one or two steps from the target node. The k2 model results in time dependent degree distributions and attachment kernels, despite initially appearing to grow as linear preferential attachment, and without the need to include explicit time dependence in key network parameters (such as the average out-degree). We show that similar effects are seen in several real world networks where constant network growth rules do not describe their evolution. This implies that measurements of specific degree distributions in real networks are also likely to change over time.
Submitted 14 May, 2020; v1 submitted 24 January, 2020;
originally announced January 2020.
-
Understanding the transition from paroxysmal to persistent atrial fibrillation from micro-anatomical re-entry in a simple model
Authors:
Alberto Ciacci,
Max Falkenberg,
Kishan A. Manani,
Tim S. Evans,
Nicholas S. Peters,
Kim Christensen
Abstract:
Atrial fibrillation (AF) is the most common cardiac arrhythmia, characterised by the chaotic motion of electrical wavefronts in the atria. In clinical practice, AF is classified under two primary categories: paroxysmal AF, short intermittent episodes separated by periods of normal electrical activity, and persistent AF, longer uninterrupted episodes of chaotic electrical activity. However, the precise reasons why AF in a given patient is paroxysmal or persistent are poorly understood. Recently, we have introduced the percolation based Christensen-Manani-Peters (CMP) model of AF which naturally exhibits both paroxysmal and persistent AF, but precisely how these differences emerge in the model is unclear. In this paper, we dissect the CMP model to identify the cause of these different AF classifications. Starting from a mean-field model where we describe AF as a simple birth-death process, we add layers of complexity to the model and show that persistent AF arises from re-entrant circuits which exhibit an asymmetry in their probability of activation relative to deactivation. As a result, different simulations generated at identical model parameters can exhibit fibrillatory episodes spanning several orders of magnitude from a few seconds to months. These findings demonstrate that diverse, complex fibrillatory dynamics can emerge from very simple dynamics in models of AF.
Submitted 13 May, 2020; v1 submitted 5 August, 2019;
originally announced August 2019.
-
Unified Mechanism of Atrial Fibrillation in a Simple Model
Authors:
Max Falkenberg,
Andrew J. Ford,
Anthony C. Li,
Alberto Ciacci,
Nicholas S. Peters,
Kim Christensen
Abstract:
The mechanism of atrial fibrillation (AF) is poorly understood, resulting in disappointing success rates of ablative treatment. Different mechanisms defined largely by different atrial activation patterns have been proposed and, arguably, this dispute has slowed the progress of AF research. Recent clinical evidence suggests a unifying mechanism based on sustained re-entrant circuits in the complex atrial architecture. Here, we present a simple computational model showing spontaneous emergence of AF that strongly supports, and gives a theoretical explanation for, the clinically observed diversity of activation. We show that the difference in surface activation patterns is a direct consequence of the thickness of the discrete network of heart muscle cells through which electrical signals percolate to reach the imaged surface. The model naturally follows the clinical spectrum of AF spanning sinus rhythm, paroxysmal and persistent AF as the decoupling of myocardial cells results in the lattice approaching the percolation threshold. This allows the model to make additional predictions beyond the current clinical understanding, showing that for paroxysmal AF re-entrant circuits emerge near the endocardium, but in persistent AF they emerge deeper in the bulk of the atrial wall where endocardial ablation is less effective. If clinically confirmed, this may explain the lower success rate of ablation in long-lasting persistent AF.
Submitted 29 October, 2018;
originally announced October 2018.
-
The role of E+A and post-starburst galaxies - II. Spectral energy distributions and comparison with observations
Authors:
M. A. Falkenberg,
R. Kotulla,
U. Fritze
Abstract:
In a previous paper (Falkenberg, Kotulla & Fritze 2009, arXiv:0901.1665) we have shown that the classical definition of E+A galaxies excludes a significant number of post-starburst galaxies. We suggested that analysing broad-band spectral energy distributions (SEDs) is a more comprehensive method to select and distinguish post-starburst galaxies than the classical definition of measuring equivalent widths of Hdelta and [OII] lines.
In this paper we will carefully investigate this new method and evaluate it by comparing our model grid of post-starburst galaxies to observed E+A galaxies from the MORPHS catalog.
We find that the post-starburst models can be distinguished from undisturbed spiral, S0 and E galaxies and galaxies in their starburst phase on the basis of their SEDs. It is even possible to distinguish most of the different post-starburst models by their SEDs. From the comparison with observations we find that all observed E+A galaxies from the MORPHS catalog can be matched by our models. However, only models with short decline timescales for the star formation rate are possible scenarios for the observed E+A galaxies, in agreement with our results from the first paper (see Falkenberg, Kotulla & Fritze 2009a).
(abridged)
Submitted 6 May, 2009;
originally announced May 2009.
-
The role of E+A and post-starburst galaxies - I. Models and model results
Authors:
M. A. Falkenberg,
R. Kotulla,
U. Fritze
Abstract:
The different composition of galaxy types in the field in comparison to galaxy clusters, as described by the morphology-density relation in the local universe, is interpreted as a result of transformation processes from late- to early-type galaxies. This interpretation is supported by the Butcher-Oemler effect.
We investigate E+A galaxies as an intermediate state between late-type galaxies in low density environments and early-type galaxies in high density environment to constrain the possible transformation processes. For this purpose we model a grid of post-starburst galaxies by inducing a burst and/or a halting of star formation on the normal evolution of spiral galaxies with our galaxy evolution code GALEV.
From our models we find that the common E+A criteria exclude a significant number of post-starburst galaxies and propose that comparing their spectral energy distributions is a more adequate method to investigate post-starburst galaxies. We predict that a higher number of E+A galaxies in the early universe cannot be ascribed solely to a higher number of starbursts, but is a result of a lower metallicity and a higher burst strength due to more gas content of the galaxies in the early universe. We find that even galaxies with a normal evolution without a starburst have an Hdelta-strong phase at early galaxy ages.
Submitted 2 March, 2009; v1 submitted 12 January, 2009;
originally announced January 2009.