Search | arXiv e-print repository

arXiv:2406.19867 [pdf, other]

Sampled Datasets Risk Substantial Bias in the Identification of Political Polarization on Social Media

Authors: Gabriele Di Bona, Emma Fraxanet, Björn Komander, Andrea Lo Sasso, Virginia Morini, Antoine Vendeville, Max Falkenberg, Alessandro Galeazzi

Abstract: Following recent policy changes by X (Twitter) and other social media platforms, user interaction data has become increasingly difficult to access. These restrictions are impeding robust research pertaining to social and political phenomena online, which is critical due to the profound impact social media platforms may have on our societies. Here, we investigate the reliability of polarization mea… ▽ More Following recent policy changes by X (Twitter) and other social media platforms, user interaction data has become increasingly difficult to access. These restrictions are impeding robust research pertaining to social and political phenomena online, which is critical due to the profound impact social media platforms may have on our societies. Here, we investigate the reliability of polarization measures obtained from different samples of social media data by studying the structural polarization of the Polish political debate on Twitter over a 24-hour period. First, we show that the political discussion on Twitter is only a small subset of the wider Twitter discussion. Second, we find that large samples can be representative of the whole political discussion on a platform, but small samples consistently fail to accurately reflect the true structure of polarization online. Finally, we demonstrate that keyword-based samples can be representative if keywords are selected with great care, but that poorly selected keywords can result in substantial political bias in the sampled data. Our findings demonstrate that it is not possible to measure polarization in a reliable way with small, sampled datasets, highlighting why the current lack of research data is so problematic, and providing insight into the practical implementation of the European Union's Digital Service Act which aims to improve researchers' access to social media data. △ Less

Submitted 28 June, 2024; originally announced June 2024.

arXiv:2403.07531 [pdf, other]

How Language, Culture, and Geography shape Online Dialogue: Insights from Koo

Authors: Amin Mekacher, Max Falkenberg, Andrea Baronchelli

Abstract: Koo is a microblogging platform based in India launched in 2020 with the explicit aim of catering to non-Western communities in their vernacular languages. With a near-complete dataset totalling over 71M posts and 399M user interactions, we show how Koo has attracted users from several countries including India, Nigeria and Brazil, but with variable levels of sustained user engagement. We highligh… ▽ More Koo is a microblogging platform based in India launched in 2020 with the explicit aim of catering to non-Western communities in their vernacular languages. With a near-complete dataset totalling over 71M posts and 399M user interactions, we show how Koo has attracted users from several countries including India, Nigeria and Brazil, but with variable levels of sustained user engagement. We highlight how Koo's interaction network has been shaped by multiple country-specific migrations and displays strong divides between linguistic and cultural communities, for instance, with English-speaking communities from India and Nigeria largely isolated from one another. Finally, we analyse the content shared by each linguistic community and identify cultural patterns that promote similar discourses across language groups. Our study raises the prospect that a multilingual and politically diverse platform like Koo may be able to cultivate vernacular communities that have, historically, not been prioritised by US-based social media platforms. △ Less

Submitted 12 March, 2024; originally announced March 2024.

Comments: 15 pages, 6 figures

arXiv:2401.07599 [pdf, other]

The Koo Dataset: An Indian Microblogging Platform With Global Ambitions

Authors: Amin Mekacher, Max Falkenberg, Andrea Baronchelli

Abstract: Increasingly, alternative platforms are playing a key role in the social media ecosystem. Koo, a microblogging platform based in India, has emerged as a major new social network hosting high profile politicians from several countries (India, Brazil, Nigeria) and many internationally renowned celebrities. This paper presents the largest publicly available Koo dataset, spanning from the platform's f… ▽ More Increasingly, alternative platforms are playing a key role in the social media ecosystem. Koo, a microblogging platform based in India, has emerged as a major new social network hosting high profile politicians from several countries (India, Brazil, Nigeria) and many internationally renowned celebrities. This paper presents the largest publicly available Koo dataset, spanning from the platform's founding in early 2020 to September 2023, providing detailed metadata for 72M posts, 75M comments, 40M shares, 284M likes and 1.4M user profiles. Along with the release of the dataset, we provide an overview of the platform including a discussion of the news ecosystem on the platform, hashtag usage and user engagement. Our results highlight the pivotal role that new platforms play in sha** online communities in emerging economies and the Global South, connecting local politicians and public figures with their followers. With Koo's ambition to become the town hall for diverse non-English speaking communities, our dataset offers new opportunities for studying social media beyond a Western context. △ Less

Submitted 15 January, 2024; originally announced January 2024.

Comments: 10 pages, 5 figures, 4 tables

arXiv:2311.18535 [pdf, other]

Affective and interactional polarization align across countries

Authors: Max Falkenberg, Fabiana Zollo, Walter Quattrociocchi, Jürgen Pfeffer, Andrea Baronchelli

Abstract: Political polarization plays a pivotal and potentially harmful role in a democracy. However, existing studies are often limited to a single country and one form of polarization, hindering a comprehensive understanding of the phenomena. Here we investigate how affective and interactional polarization are related across nine countries (Canada, France, Germany, Italy, Poland, Spain, Turkey, UK, USA).… ▽ More Political polarization plays a pivotal and potentially harmful role in a democracy. However, existing studies are often limited to a single country and one form of polarization, hindering a comprehensive understanding of the phenomena. Here we investigate how affective and interactional polarization are related across nine countries (Canada, France, Germany, Italy, Poland, Spain, Turkey, UK, USA). First, we show that political interaction networks are polarized on Twitter. Second, we reveal that out-group interactions, defined by the network, are more toxic than in-group interactions, meaning that affective and interactional polarization are aligned. Third, we show that out-group interactions receive lower engagement than in-group interactions. Finally, we show that the political right reference lower reliability media than the political left, and that interactions between politically engaged accounts are limited and rarely reciprocated. These results hold across countries and represent a first step towards a more unified understanding of polarization. △ Less

Submitted 30 November, 2023; originally announced November 2023.

Comments: 13 pages, 6 figures, 68 references

arXiv:2305.07529 [pdf, ps, other]

doi 10.1371/journal.pclm.0000277

Hurricanes Increase Climate Change Conversations on Twitter

Authors: Maddalena Torricelli, Max Falkenberg, Alessandro Galeazzi, Fabiana Zollo, Walter Quattrociocchi, Andrea Baronchelli

Abstract: The public understanding of climate change plays a critical role in translating climate science into climate action. In the public discourse, climate impacts are often discussed in the context of extreme weather events. Here, we analyse 65 million Twitter posts and 240 thousand news media articles related to 18 major hurricanes from 2010 to 2022 to clarify how hurricanes impact the public discussi… ▽ More The public understanding of climate change plays a critical role in translating climate science into climate action. In the public discourse, climate impacts are often discussed in the context of extreme weather events. Here, we analyse 65 million Twitter posts and 240 thousand news media articles related to 18 major hurricanes from 2010 to 2022 to clarify how hurricanes impact the public discussion around climate change. First, we analyse news content and show that climate change is the most prominent non-hurricane specific topic discussed by the news media in relation to hurricanes. Second, we perform a comparative analysis between reliable and questionable news media outlets, finding that the language around climate change varies between news media providers. Finally, using geolocated data, we show that accounts in regions affected by hurricanes discuss climate change at a significantly higher rate than accounts in unaffected areas, with references to climate change increasing by, on average, 80% after impact, and up to 200% for the largest hurricanes. Our findings demonstrate how hurricanes have a key impact on the public awareness of climate change. △ Less

Submitted 12 May, 2023; originally announced May 2023.

Journal ref: PLOS Climate 2(11): e0000277, 2023

arXiv:2304.11685 [pdf, other]

Child Face Recognition at Scale: Synthetic Data Generation and Performance Benchmark

Authors: Magnus Falkenberg, Anders Bensen Ottsen, Mathias Ibsen, Christian Rathgeb

Abstract: We address the need for a large-scale database of children's faces by using generative adversarial networks (GANs) and face age progression (FAP) models to synthesize a realistic dataset referred to as HDA-SynChildFaces. To this end, we proposed a processing pipeline that initially utilizes StyleGAN3 to sample adult subjects, which are subsequently progressed to children of varying ages using Inte… ▽ More We address the need for a large-scale database of children's faces by using generative adversarial networks (GANs) and face age progression (FAP) models to synthesize a realistic dataset referred to as HDA-SynChildFaces. To this end, we proposed a processing pipeline that initially utilizes StyleGAN3 to sample adult subjects, which are subsequently progressed to children of varying ages using InterFaceGAN. Intra-subject variations, such as facial expression and pose, are created by further manipulating the subjects in their latent space. Additionally, the presented pipeline allows to evenly distribute the races of subjects, allowing to generate a balanced and fair dataset with respect to race distribution. The created HDA-SynChildFaces consists of 1,652 subjects and a total of 188,832 images, each subject being present at various ages and with many different intra-subject variations. Subsequently, we evaluates the performance of various facial recognition systems on the generated database and compare the results of adults and children at different ages. The study reveals that children consistently perform worse than adults, on all tested systems, and the degradation in performance is proportional to age. Additionally, our study uncovers some biases in the recognition systems, with Asian and Black subjects and females performing worse than White and Latino Hispanic subjects and males. △ Less

Submitted 23 April, 2023; originally announced April 2023.

arXiv:2303.11147 [pdf, other]

doi 10.1093/pnasnexus/pgad346

The Systemic Impact of Deplatforming on Social Media

Authors: Amin Mekacher, Max Falkenberg, Andrea Baronchelli

Abstract: Deplatforming, or banning malicious accounts from social media, is a key tool for moderating online harms. However, the consequences of deplatforming for the wider social media ecosystem have been largely overlooked so far, due to the difficulty of tracking banned users. Here, we address this gap by studying the ban-induced platform migration from Twitter to Gettr. With a matched dataset of 15M Ge… ▽ More Deplatforming, or banning malicious accounts from social media, is a key tool for moderating online harms. However, the consequences of deplatforming for the wider social media ecosystem have been largely overlooked so far, due to the difficulty of tracking banned users. Here, we address this gap by studying the ban-induced platform migration from Twitter to Gettr. With a matched dataset of 15M Gettr posts and 12M Twitter tweets, we show that users active on both platforms post similar content as users active on Gettr but banned from Twitter, but the latter have higher retention and are 5 times more active. Then, we reveal that matched users are more toxic on Twitter, where they can engage in abusive cross-ideological interactions, than Gettr. Our analysis shows that the matched cohort are ideologically aligned with the far-right, and that the ability to interact with political opponents may be part of the appeal of Twitter to these users. Finally, we identify structural changes in the Gettr network preceding the 2023 Brasilia insurrections, highlighting how deplatforming from mainstream social media can fuel poorly-regulated alternatives that may pose a risk to democratic life. △ Less

Submitted 20 March, 2023; originally announced March 2023.

Comments: 12 pages, 6 figures

ACM Class: J.4

Journal ref: PNAS Nexus, Volume 2, Issue 11, November 2023, pgad346

arXiv:2112.12137 [pdf, other]

Growing polarisation around climate change on social media

Authors: Max Falkenberg, Alessandro Galeazzi, Maddalena Torricelli, Niccolo Di Marco, Francesca Larosa, Madalina Sas, Amin Mekacher, Warren Pearce, Fabiana Zollo, Walter Quattrociocchi, Andrea Baronchelli

Abstract: Climate change and political polarisation are two of the 21st century's critical socio-political issues. Here, we investigate their intersection by studying the discussion around the UN Conference of The Parties on Climate Change (COP) using Twitter data from 2014 to 2021. First, we reveal a large increase in ideological polarisation during COP26, following low polarisation between COP20 and COP25… ▽ More Climate change and political polarisation are two of the 21st century's critical socio-political issues. Here, we investigate their intersection by studying the discussion around the UN Conference of The Parties on Climate Change (COP) using Twitter data from 2014 to 2021. First, we reveal a large increase in ideological polarisation during COP26, following low polarisation between COP20 and COP25. Second, we show that this increase is driven by growing right-wing activity, a 4-fold increase since COP21 relative to pro-climate groups. Finally, we identify a broad range of ''climate contrarian'' views during COP26, emphasising the theme of ''political hypocrisy'' as a topic of cross-ideological appeal; contrarian views and accusations of hypocrisy have become key themes in the Twitter climate discussion since 2019. With future climate action reliant on negotiations at COP27 and beyond, our results highlight the importance of monitoring polarisation, and its impacts, in the public climate discourse. △ Less

Submitted 14 November, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

Comments: 13 pages main, 4 pages extended dta

arXiv:2102.09489 [pdf, other]

Heterogeneous node copying from hidden network structure

Authors: Max Falkenberg

Abstract: Node copying is an important mechanism for network formation, yet most models assume uniform copying rules. Motivated by observations of heterogeneous triadic closure in real networks, we introduce the concept of a hidden network model - a generative two-layer model in which an observed network evolves according to the structure of an underlying hidden layer - and apply the framework to a model of… ▽ More Node copying is an important mechanism for network formation, yet most models assume uniform copying rules. Motivated by observations of heterogeneous triadic closure in real networks, we introduce the concept of a hidden network model - a generative two-layer model in which an observed network evolves according to the structure of an underlying hidden layer - and apply the framework to a model of heterogeneous copying. Framed in a social context, these two layers represent a node's inner social circle, and wider social circle, such that the model can bias copying probabilities towards, or against, a node's inner circle of friends. Comparing the case of extreme inner circle bias to an equivalent model with uniform copying, we find that heterogeneous copying suppresses the power-law degree distributions commonly seen in copying models, and results in networks with much higher clustering than even the most optimum scenario for uniform copying. Similarly large clustering values are found in real collaboration networks, lending empirical support to the mechanism. △ Less

Submitted 24 July, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

Comments: 12 pages, 5 figures. See supplement attached

ACM Class: G.2.2

arXiv:2001.09118 [pdf, other]

doi 10.1103/PhysRevResearch.2.023352

Identifying time dependence in network growth

Authors: Max Falkenberg, Jong-Hyeok Lee, Shun-ichi Amano, Ken-ichiro Ogawa, Kazuo Yano, Yoshihiro Miyake, Tim S. Evans, Kim Christensen

Abstract: Identifying power-law scaling in real networks - indicative of preferential attachment - has proved controversial. Critics argue that measuring the temporal evolution of a network directly is better than measuring the degree distribution when looking for preferential attachment. However, many of the established methods do not account for any potential time-dependence in the attachment kernels of g… ▽ More Identifying power-law scaling in real networks - indicative of preferential attachment - has proved controversial. Critics argue that measuring the temporal evolution of a network directly is better than measuring the degree distribution when looking for preferential attachment. However, many of the established methods do not account for any potential time-dependence in the attachment kernels of growing networks, or methods assume that node degree is the key observable determining network evolution. In this paper, we argue that these assumptions may lead to misleading conclusions about the evolution of growing networks. We illustrate this by introducing a simple adaptation of the Barab{á}si-Albert model, the "k2 model", where new nodes attach to nodes in the existing network in proportion to the number of nodes one or two steps from the target node. The k2 model results in time dependent degree distributions and attachment kernels, despite initially appearing to grow as linear preferential attachment, and without the need to include explicit time dependence in key network parameters (such as the average out-degree). We show that similar effects are seen in several real world networks where constant network growth rules do not describe their evolution. This implies that measurements of specific degree distributions in real networks are also likely to change over time. △ Less

Submitted 14 May, 2020; v1 submitted 24 January, 2020; originally announced January 2020.

Comments: 19 pages, 11 figures

Journal ref: Phys. Rev. Research 2, 023352 (2020)

arXiv:1908.01646 [pdf, other]

doi 10.1103/PhysRevResearch.2.023311

Understanding the transition from paroxysmal to persistent atrial fibrillation from micro-anatomical re-entry in a simple model

Authors: Alberto Ciacci, Max Falkenberg, Kishan A. Manani, Tim S. Evans, Nicholas S. Peters, Kim Christensen

Abstract: Atrial fibrillation (AF) is the most common cardiac arrhytmia, characterised by the chaotic motion of electrical wavefronts in the atria. In clinical practice, AF is classified under two primary categories: paroxysmal AF, short intermittent episodes separated by periods of normal electrical activity, and persistent AF, longer uninterrupted episodes of chaotic electrical activity. However, the prec… ▽ More Atrial fibrillation (AF) is the most common cardiac arrhytmia, characterised by the chaotic motion of electrical wavefronts in the atria. In clinical practice, AF is classified under two primary categories: paroxysmal AF, short intermittent episodes separated by periods of normal electrical activity, and persistent AF, longer uninterrupted episodes of chaotic electrical activity. However, the precise reasons why AF in a given patient is paroxysmal or persistent is poorly understood. Recently, we have introduced the percolation based Christensen-Manani-Peters (CMP) model of AF which naturally exhibits both paroxysmal and persistent AF, but precisely how these differences emerge in the model is unclear. In this paper, we dissect the CMP model to identify the cause of these different AF classifications. Starting from a mean-field model where we describe AF as a simple birth-death process, we add layers of complexity to the model and show that persistent AF arises from re-entrant circuits which exhibit an asymmetry in their probability of activation relative to deactivation. As a result, different simulations generated at identical model parameters can exhibit fibrillatory episodes spanning several orders of magnitude from a few seconds to months. These findings demonstrate that diverse, complex fibrillatory dynamics can emerge from very simple dynamics in models of AF. △ Less

Submitted 13 May, 2020; v1 submitted 5 August, 2019; originally announced August 2019.

Report number: Imperial/TP/19/TSE/2

Journal ref: Phys. Rev. Research 2, 023311 (2020)

arXiv:1810.12062 [pdf, other]

doi 10.1103/PhysRevE.100.062406

Unified Mechanism of Atrial Fibrillation in a Simple Model

Authors: Max Falkenberg, Andrew J. Ford, Anthony C. Li, Alberto Ciacci, Nicholas S. Peters, Kim Christensen

Abstract: The mechanism of atrial fibrillation (AF) is poorly understood, resulting in disappointing success rates of ablative treatment. Different mechanisms defined largely by different atrial activation patterns have been proposed and, arguably, this dispute has slowed the progress of AF research. Recent clinical evidence suggests a unifying mechanism based on sustained re-entrant circuits in the complex… ▽ More The mechanism of atrial fibrillation (AF) is poorly understood, resulting in disappointing success rates of ablative treatment. Different mechanisms defined largely by different atrial activation patterns have been proposed and, arguably, this dispute has slowed the progress of AF research. Recent clinical evidence suggests a unifying mechanism based on sustained re-entrant circuits in the complex atrial architecture. Here, we present a simple computational model showing spontaneous emergence of AF that strongly supports, and gives a theoretical explanation for, the clinically observed diversity of activation. We show that the difference in surface activation patterns is a direct consequence of the thickness of the discrete network of heart muscle cells through which electrical signals percolate to reach the imaged surface. The model naturally follows the clinical spectrum of AF spanning sinus rhythm, paroxysmal and persistent AF as the decoupling of myocardial cells results in the lattice approaching the percolation threshold. This allows the model to make additional predictions beyond the current clinical understanding, showing that for paroxysmal AF re-entrant circuits emerge near the endocardium, but in persistent AF they emerge deeper in the bulk of the atrial wall where endocardial ablation is less effective. If clinically confirmed, this may explain the lower success rate of ablation in long-lasting persistent AF. △ Less

Submitted 29 October, 2018; originally announced October 2018.

Journal ref: Phys. Rev. E 100, 062406 (2019)

arXiv:0905.0909 [pdf, ps, other]

doi 10.1111/j.1365-2966.2009.15036.x

The role of E+A and post-starburst galaxies - II. Spectral energy distributions and comparison with observations

Authors: M. A. Falkenberg, R. Kotulla, U. Fritze

Abstract: In a previous paper (Falkenberg, Kotulla & Fritze 2009, arXiv:0901.1665) we have shown that the classical definition of E+A galaxies excludes a significant number of post-starburst galaxies. We suggested that analysing broad-band spectral energy distributions (SEDs) is a more comprehensive method to select and distinguish poststarburst galaxies than the classical definition of measuring equivale… ▽ More In a previous paper (Falkenberg, Kotulla & Fritze 2009, arXiv:0901.1665) we have shown that the classical definition of E+A galaxies excludes a significant number of post-starburst galaxies. We suggested that analysing broad-band spectral energy distributions (SEDs) is a more comprehensive method to select and distinguish poststarburst galaxies than the classical definition of measuring equivalent widths of (Hdelta) and [OII] lines. In this paper we will carefully investigate this new method and evaluate it by comparing our model grid of post-starburst galaxies to observed E+A galaxies from the MORPHS catalog. We find that the post-starburst models can be distinguished from undisturbed spiral, S0 and E galaxies and galaxies in their starburst phase on the basis of their SEDs. It is even possible to distinguish most of the different post-starburst by their SEDs. From the comparison with observations we find that all observed E+A galaxies from the MORPHS catalog can be matched by our models. However only models with short decline timescales for the star formation rate are possible scenarios for the observed E+A galaxies in agreement with our results from the first paper (see Falkenberg, Kotulla & Fritze 2009a). (abridged) △ Less

Submitted 6 May, 2009; originally announced May 2009.

Comments: accepted for publication in MNRAS; 14 pages, 15 figures; Companion paper to "The role of E+A and post-starburst galaxies - I. Models and model results" by M. A. Falkenberg, R. Kotulla & U. Fritze, arXiv:0901.1665

arXiv:0901.1665 [pdf, ps, other]

doi 10.1111/j.1365-2966.2009.14416.x

The role of E+A and post-starburst galaxies - I. Models and model results

Authors: M. A. Falkenberg, R. Kotulla, U. Fritze

Abstract: Different compositions of galaxy types in the field in comparison to galaxy clusters as described by the morphology-density relation in the local universe is interpreted as a result of transformation processes from late- to early-type galaxies. This interpretation is supported by the Butcher-Oemler effect. We investigate E+A galaxies as an intermediate state between late-type galaxies in low d… ▽ More Different compositions of galaxy types in the field in comparison to galaxy clusters as described by the morphology-density relation in the local universe is interpreted as a result of transformation processes from late- to early-type galaxies. This interpretation is supported by the Butcher-Oemler effect. We investigate E+A galaxies as an intermediate state between late-type galaxies in low density environments and early-type galaxies in high density environment to constrain the possible transformation processes. For this purpose we model a grid of post-starburst galaxies by inducing a burst and/or a halting of star formation on the normal evolution of spiral galaxies with our galaxy evolution code GALEV. From our models we find that the common E+A criteria exclude a significant number of post-starburst galaxies and propose that comparing their spectral energy distributions leads to a more sufficient method to investigate post-starbust galaxies. We predict that a higher number of E+A galaxies in the early universe can not be ascribed solely to a higher number of starburst, but is a result of a lower metallicity and a higher burst strength due to more gas content of the galaxies in the early universe. We find that even galaxies with a normal evolution without a starburst have a Hdelta-strong phase at early galaxy ages. △ Less

Submitted 2 March, 2009; v1 submitted 12 January, 2009; originally announced January 2009.

Comments: accepted for publication in MNRAS; 14 pages, 21 figures

Showing 1–14 of 14 results for author: Falkenberg, M