-
The rise and fall of WallStreetBets: social roles and opinion leaders across the GameStop saga
Authors:
Anna Mancini,
Antonio Desiderio,
Giovanni Palermo,
Riccardo Di Clemente,
Giulio Cimini
Abstract:
Nowadays human interactions largely take place on social networks, with online users' behavior often falling into a few general typologies or "social roles". Among these, opinion leaders are of crucial importance as they have the ability to spread an idea or opinion on a large scale across the network, with possible tangible consequences in the real world. In this work we extract and characterize the different social roles of users within the Reddit WallStreetBets community, around the time of the GameStop short squeeze of January 2021 -- when a handful of committed users led the whole community to engage in a large and risky financial operation. We identify the profiles of both average users and of relevant outliers, including opinion leaders, using an iterative, semi-supervised classification algorithm, which allows us to discern the characteristics needed to play a particular social role. The key features of opinion leaders are large risky investments and constant updates on a single stock, which allowed them to attract a large following and, in the case of GameStop, ignite the interest of the community. Finally, we observe a substantial change in the behavior and attitude of users after the short squeeze event: no new opinion leaders are found and the community becomes less focused on investments. Overall, this work sheds light on the users' roles and dynamics that led to the GameStop short squeeze, while also suggesting why WallStreetBets no longer wielded such large influence on financial markets, in the aftermath of this event.
Submitted 9 March, 2024;
originally announced March 2024.
-
Understanding vehicular routing behavior with location-based service data
Authors:
Yanyan Xu,
Riccardo Di Clemente,
Marta C. Gonzalez
Abstract:
Properly extracting patterns of individual mobility from high-resolution data sources, such as those collected by smartphone applications, offers important opportunities. Opportunities not offered by call detail records (CDRs), whose spatial resolution is limited to triangulation from antennas, include the study of route choices, travel mode detection and close encounters. At present, there is no standard, large-scale data set collected over long periods that allows us to characterize these. In this work we thoroughly examine the use of data from smartphone applications, also referred to as location-based services (LBS) data, to extract and understand vehicular route choice behavior. Taking the Dallas-Fort Worth metroplex as an example, we first extract the vehicular trips with simple rules and reconstruct the origin-destination matrix by coupling the extracted vehicular trips of the active LBS users with United States census data. We then present a method to derive the routes commonly used by individuals from LBS traces with varying sample rate intervals. We further inspect the relation between the number of routes and the trip characteristics, including the departure time, trip length and travel time. Specifically, we consider the travel time index and buffer index for LBS users taking different numbers of routes. Empirical results demonstrate that during peak hours, travelers tend to reduce the impact of traffic congestion by taking alternative routes. Overall, the proposed data analysis framework is a cost-effective way to treat sparse data generated from the use of smartphones to inform routing behavior. Its practical potential is to inform demand management strategies, by targeting individual users while generating large-scale estimates of congestion mitigation.
Submitted 20 December, 2023;
originally announced December 2023.
-
Spontaneous Opinion Swings in the Voter Model with Latency
Authors:
Giovanni Palermo,
Anna Mancini,
Antonio Desiderio,
Riccardo Di Clemente,
Giulio Cimini
Abstract:
The cognitive process of opinion formation is often characterized by stubbornness or resistance of agents to changes of opinion. To capture such a feature we introduce a constant latency time in the standard voter model of opinion dynamics: after switching opinion, an agent must keep it for a while. This seemingly simple modification drastically changes the stochastic diffusive behavior of the original model, leading to deterministic dynamical oscillations in the average opinion of the agents. We explain the origin of the oscillations and develop a mathematical formulation of the dynamics that is confirmed by extensive numerical simulations. We further characterize the rich phase space of the model and its asymptotic behavior. Our work offers insights into understanding and modeling opinion swings in diverse social contexts.
Submitted 16 November, 2023;
originally announced November 2023.
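The latency rule described in the abstract above (after switching opinion, an agent must keep it for a while) can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a fully connected population, asynchronous updates, time measured in sweeps of `n_agents` updates, and latent agents that can still be copied by others. The function name `voter_with_latency` and all parameter values are hypothetical.

```python
import random


def voter_with_latency(n_agents=200, latency=5.0, sweeps=50, seed=0):
    """Simulate a voter model in which an agent that has just switched
    opinion cannot switch again until `latency` time units have passed.

    Assumptions (not from the source): complete graph, opinions in {-1, +1},
    one time unit = n_agents elementary updates.
    Returns the average opinion recorded at the end of every sweep.
    """
    rng = random.Random(seed)
    opinions = [rng.choice((-1, 1)) for _ in range(n_agents)]
    # Time of each agent's last switch; -inf means "never switched yet".
    last_switch = [float("-inf")] * n_agents
    history = []

    for step in range(sweeps * n_agents):
        t = step / n_agents
        i = rng.randrange(n_agents)
        # A latent agent keeps its current opinion and skips the update.
        if t - last_switch[i] >= latency:
            # Pick a different agent j uniformly at random and copy it.
            j = rng.randrange(n_agents - 1)
            if j >= i:
                j += 1
            if opinions[j] != opinions[i]:
                opinions[i] = opinions[j]
                last_switch[i] = t
        if (step + 1) % n_agents == 0:
            history.append(sum(opinions) / n_agents)
    return history


if __name__ == "__main__":
    magnetization = voter_with_latency()
    print(magnetization[:10])
```

Setting `latency=0.0` recovers the standard voter update in this sketch, which gives a baseline against which the effect of latency on the average opinion can be compared.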
-
Time-space dynamics of income segregation: a case study of Milan's neighbourhoods
Authors:
Lavinia Rossi Mori,
Vittorio Loreto,
Riccardo Di Clemente
Abstract:
Traditional approaches to urban income segregation focus on static residential patterns, often failing to capture the dynamic nature of social mixing at the neighborhood level. Leveraging high-resolution location-based data from mobile phones, we capture the interplay of three different income groups (high, medium, low) based on their daily routines. We propose a three-dimensional space to analyze social mixing, which is embedded in the temporal dynamics of urban activities. This framework offers a more detailed perspective on social interactions, closely linked to the geographical features of each neighborhood. While residential areas fail to encourage social mixing in the nighttime, the working hours foster inclusion, with the city center showing a heightened level of interaction. As evening sets in, leisure areas emerge as potential facilitators for social interactions, depending on urban features such as public transport and a variety of Points Of Interest. These characteristics significantly modulate the magnitude and type of social stratification involved in social mixing, also underscoring the significance of urban design in either bridging or widening socio-economic divides.
Submitted 28 February, 2024; v1 submitted 29 September, 2023;
originally announced September 2023.
-
Recurring patterns in online social media interactions during highly engaging events
Authors:
Antonio Desiderio,
Anna Mancini,
Giulio Cimini,
Riccardo Di Clemente
Abstract:
People nowadays express their opinions in online spaces, using different forms of interaction such as posting, sharing and discussing with one another. These digital traces make it possible to capture how people dynamically react to the myriad of events occurring in the world. By unfolding the structure of Reddit conversations, we describe how highly engaging events happening in society affect user interactions and behaviour with respect to unperturbed discussion patterns. Conversations, defined as a post and the comments underneath, are analysed along their temporal and semantic dimensions. We find that changes in the pace and language used in conversations exhibit notable similarities across diverse events. Conversations tend to become repetitive with a more limited vocabulary, display different semantic structures and feature heightened emotions. As the event approaches, the shifts occurring in conversations are reflected in the users' dynamics. Users become more active and they exchange information with a growing audience, despite using a less rich vocabulary and repetitive messages. The peers of each user fill up more semantic space, shifting the dialogue and widening the exchange of information. The recurring patterns we discovered persist across several contexts and thus represent a fingerprint of human behavior, which could inform the modeling of online social network interactions.
Submitted 26 June, 2023;
originally announced June 2023.
-
Spatiotemporal gender differences in urban vibrancy
Authors:
Thomas Collins,
Riccardo Di Clemente,
Mario Gutiérrez-Roig,
Federico Botta
Abstract:
Urban vibrancy is the dynamic activity of humans in urban locations. It can vary with urban features and the opportunities for human interactions, but it might also differ according to the underlying social conditions of city inhabitants across and within social surroundings. Such heterogeneity in how different demographic groups may experience cities has the potential to cause gender segregation because of differences in the preferences of inhabitants, their accessibility and opportunities, and large-scale mobility behaviours. However, traditional studies have failed to capture fully a high-frequency understanding of how urban vibrancy is linked to urban features, how this might differ for different genders, and how this might affect segregation in cities. Our results show that (1) there are differences between males and females in terms of urban vibrancy, (2) the differences relate to "Points of Interest" as well as transportation networks, and (3) there are both positive and negative "spatial spillovers" existing across each city. To do this, we use a quantitative approach using Call Detail Record data, taking advantage of the near-ubiquitous use of mobile phones, to gain high-frequency observations of spatial behaviours across the seven most prominent cities of Italy. We use a spatial model comparison approach of the direct and "spillover" effects from urban features on male-female differences. Our results increase our understanding of inequality in cities and how we can make future cities fairer.
Submitted 11 October, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Understanding European Integration with Bipartite Networks of Comparative Advantage
Authors:
Riccardo Di Clemente,
Balázs Lengyel,
Lars F. Andersson,
Rikard Eriksson
Abstract:
Core objectives of European common market integration are convergence and economic growth, but these are hampered by redundancy and value chain asymmetries. The challenge is how to harmonize the division of labor to reach global competitiveness while bridging productivity differences across the EU. We develop a bipartite network approach to trace pairwise co-specialization, by applying the Revealed Comparative Advantage method, within and between EU15 and Central and Eastern European (CEE) countries. This approach assesses redundancies and division of labor in the EU at the level of industries and countries. We find significant co-specialization among CEE countries but a diverging specialization between EU15 and CEE. Productivity increases in those CEE industries that have co-specialized with other CEE countries after EU accession, while co-specialization across CEE and EU15 countries is less related to productivity growth. These results show that a division of sectoral specialization can lead to productivity convergence between EU15 and CEE countries.
Submitted 28 October, 2022; v1 submitted 2 February, 2022;
originally announced February 2022.
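As a rough illustration of how a Revealed Comparative Advantage matrix can be turned into pairwise co-specialization counts, the sketch below uses the standard Balassa index and a binary bipartite country-industry matrix. The threshold of 1, the toy activity matrix, and the function names `rca_matrix` and `co_specialization` are assumptions made for this example; the paper's exact construction may differ.

```python
def rca_matrix(activity):
    """Balassa index for a countries x industries matrix of activity
    (e.g. exports or employment): the share of industry p in country c,
    divided by the share of industry p in the total."""
    row_tot = [sum(row) for row in activity]
    col_tot = [sum(col) for col in zip(*activity)]
    total = sum(row_tot)
    rca = []
    for c, row in enumerate(activity):
        rca_row = []
        for p, x in enumerate(row):
            if row_tot[c] == 0 or col_tot[p] == 0:
                rca_row.append(0.0)  # no activity: no advantage
            else:
                rca_row.append((x / row_tot[c]) / (col_tot[p] / total))
        rca.append(rca_row)
    return rca


def co_specialization(activity, threshold=1.0):
    """Number of industries in which both countries of a pair have
    RCA >= threshold (assumed cut-off), i.e. the country-country
    projection of the binary bipartite network."""
    rca = rca_matrix(activity)
    m = [[1 if v >= threshold else 0 for v in row] for row in rca]
    n = len(m)
    return [[sum(a * b for a, b in zip(m[i], m[j])) for j in range(n)]
            for i in range(n)]


if __name__ == "__main__":
    # Hypothetical toy data: 3 countries (rows) x 3 industries (columns).
    toy = [[10, 0, 5],
           [8, 2, 0],
           [0, 6, 6]]
    print(co_specialization(toy))  # [[2, 1, 1], [1, 1, 0], [1, 0, 2]]
```

The diagonal of the result counts the industries in which each country is specialized, while off-diagonal entries count shared specializations between two countries.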
-
COVID-19 is linked to changes in the time-space dimension of human mobility
Authors:
Clodomir Santana,
Federico Botta,
Hugo Barbosa,
Filippo Privitera,
Ronaldo Menezes,
Riccardo Di Clemente
Abstract:
Socio-economic constructs and urban topology are crucial drivers of human mobility patterns. During the coronavirus disease 2019 pandemic, these patterns were reshaped in their components: the spatial dimension represented by the daily travelled distance, and the temporal dimension expressed as the synchronization time of commuting routines. Here, leveraging location-based data from de-identified mobile phone users, we observed that, during lockdown restrictions, the decrease of spatial mobility is interwoven with the emergence of asynchronous mobility dynamics. The lifting of restrictions on urban mobility allowed a faster recovery of the spatial dimension compared with the temporal one. Moreover, the recovery in mobility differed depending on urbanization levels and economic stratification. In rural and low-income areas, the spatial mobility dimension suffered a more considerable disruption when compared with urbanized and high-income areas. In contrast, the temporal dimension was more affected in urbanized and high-income areas than in rural and low-income areas.
Submitted 27 July, 2023; v1 submitted 17 January, 2022;
originally announced January 2022.
-
Self-induced consensus of Reddit users to characterise the GameStop short squeeze
Authors:
Anna Mancini,
Antonio Desiderio,
Riccardo Di Clemente,
Giulio Cimini
Abstract:
The short squeeze of GameStop (GME) shares in mid-January 2021 was primarily orchestrated by retail investors of the Reddit r/wallstreetbets community. As such, it represents a paramount example of collective coordinated action on social media, resulting in large-scale consensus formation and significant market impact. In this work we characterise the structure and time evolution of Reddit conversation data, showing that the occurrence and sentiment of GME-related comments (representing how much users are engaged with GME) increased significantly well before the short squeeze actually took place. Taking inspiration from these early warnings as well as evidence from previous literature, we introduce a model of opinion dynamics where user engagement can trigger a self-reinforcing mechanism leading to the emergence of consensus, which in this particular case is associated with the success of the short squeeze operation. Analytical solutions and model simulations on interaction networks of Reddit users feature a phase transition from heterogeneous to homogeneous opinions as engagement grows, which we qualitatively compare to the sudden hike of the GME stock price. Although the model cannot be validated with available data, it offers a possible and minimal interpretation for the increasingly important phenomenon of self-organized collective actions taking place on social networks.
Submitted 8 August, 2022; v1 submitted 13 December, 2021;
originally announced December 2021.
-
Mobilkit: A Python Toolkit for Urban Resilience and Disaster Risk Management Analytics using High Frequency Human Mobility Data
Authors:
Enrico Ubaldi,
Takahiro Yabe,
Nicholas K. W. Jones,
Maham Faisal Khan,
Satish V. Ukkusuri,
Riccardo Di Clemente,
Emanuele Strano
Abstract:
Increasingly available high-frequency location datasets derived from smartphones provide unprecedented insight into trajectories of human mobility. These datasets can play a significant and growing role in informing preparedness and response to natural disasters. However, few tools exist to enable rapid analytics using mobility data, and those that do tend not to be tailored specifically for disaster risk management. We present an open-source, Python-based toolkit designed to conduct replicable and scalable post-disaster analytics using GPS location data. Privacy, system capabilities, and potential expansions of Mobilkit are discussed.
Submitted 16 September, 2021; v1 submitted 29 July, 2021;
originally announced July 2021.
-
Urbanization and Economic Complexity
Authors:
Riccardo Di Clemente,
Emanuele Strano,
Michael Batty
Abstract:
Urbanization plays a crucial role in the economic development of every country. The mutual relationship between the urbanization of any country and its economic productive structure is far from being understood. We analyzed the historical evolution of product exports for all countries using the World Trade Web (WTW) with respect to patterns of urbanization from 1995 to 2010. Using the evolving framework of economic complexity, we reveal that a country's economic development, in terms of its production and export of goods, is interwoven with the urbanization process during the early stages of its economic development and growth. Meanwhile, in urbanized countries, the reciprocal relation between economic growth and urbanization fades away in the later stages, becoming negligible for countries highly dependent on the export of resources, where urbanization is not linked to any structural economic transformation.
Submitted 21 January, 2021; v1 submitted 13 September, 2020;
originally announced September 2020.
-
Mining urban lifestyles: urban computing, human behavior and recommender systems
Authors:
Sharon Xu,
Riccardo Di Clemente,
Marta C. González
Abstract:
In the last decade, the digital age has sharply redefined the way we study human behavior. With the advancement of data storage and sensing technologies, electronic records now encompass a diverse spectrum of human activity, ranging from location data, phone and email communication to Twitter activity and open-source contributions on Wikipedia and OpenStreetMap. In particular, the study of the shopping and mobility patterns of individual consumers has the potential to give deeper insight into the lifestyles and infrastructure of the region. Credit card records (CCRs) provide detailed insight into purchase behavior and have been found to have inherent regularity in consumer shopping patterns; call detail records (CDRs) present new opportunities to understand human mobility, analyze wealth, and model social network dynamics. In this chapter, we jointly model the lifestyles of individuals, a more challenging problem with higher variability when compared to the aggregated behavior of city regions. Using collective matrix factorization, we propose a unified dual view of lifestyles. Understanding these lifestyles will not only inform commercial opportunities, but also help policymakers and nonprofit organizations understand the characteristics and needs of the entire region, as well as of the individuals within that region. The applications of this range from targeted advertisements and promotions to the diffusion of digital financial services among low-income groups.
Submitted 4 November, 2019;
originally announced November 2019.
-
Inequality is rising where social network segregation interacts with urban topology
Authors:
Gergő Tóth,
Johannes Wachs,
Riccardo Di Clemente,
Ákos Jakobi,
Bence Ságvári,
János Kertész,
Balázs Lengyel
Abstract:
Social networks amplify inequalities due to fundamental mechanisms of social tie formation such as homophily and triadic closure. These forces sharpen social segregation reflected in network fragmentation. Yet, little is known about what structural factors facilitate fragmentation. In this paper we use big data from a widely-used online social network to demonstrate that there is a significant relationship between social network fragmentation and income inequality in cities and towns. We find that the organization of the physical urban space has a stronger relationship with fragmentation than unequal access to education, political segregation, or the presence of ethnic and religious minorities. Fragmentation of social networks is significantly higher in towns in which residential neighborhoods are divided by physical barriers such as rivers and railroads and are relatively distant from the center of town. Towns in which amenities are spatially concentrated are also typically more socially segregated. These relationships suggest how urban planning may be a useful point of intervention to mitigate inequalities in the long run.
Submitted 25 September, 2019;
originally announced September 2019.
-
Reconstructing mesoscale network structures
Authors:
Jeroen van Lidth de Jeude,
Riccardo Di Clemente,
Guido Caldarelli,
Fabio Saracco,
Tiziano Squartini
Abstract:
When facing complex mesoscale network structures, it is generally believed that (null) models encoding the modular organization of nodes must be employed. The present paper focuses on two block structures that characterize the mesoscale organization of many real-world networks, i.e. the bow-tie and the core-periphery ones. Our analysis shows that constraining the network degree sequence is often enough to reproduce such structures, as confirmed by model selection criteria such as AIC or BIC. As a byproduct, our paper enriches the toolbox for the analysis of bipartite networks, which is still far from complete. The aforementioned structures, in fact, partition the networks into asymmetric blocks characterized by binary, directed connections, thus calling for the extension of a recently proposed method to randomize undirected, bipartite networks to the directed case.
Submitted 23 December, 2018; v1 submitted 15 May, 2018;
originally announced May 2018.
-
The role of geography in the complex diffusion of innovations
Authors:
Balázs Lengyel,
Eszter Bokányi,
Riccardo Di Clemente,
János Kertész,
Marta C. González
Abstract:
The urban-rural divide is increasing in modern societies, calling for geographical extensions of social influence modelling. Improved understanding of innovation diffusion across locations and through social connections can provide us with new insights into the spread of information, technological progress and economic development. In this work, we analyze the spatial adoption dynamics of iWiW, an Online Social Network (OSN) in Hungary, and uncover empirical features of spatial adoption in social networks. During its entire life cycle from 2002 to 2012, iWiW reached up to 300 million friendship ties of 3 million users. We find that the number of adopters as a function of town population follows a scaling law that reveals a strongly concentrated early adoption in large towns and a less concentrated late adoption. We also discover a strengthening distance decay of spread over the life cycle, indicating a high fraction of distant diffusion in early stages but the dominance of local diffusion in late stages. The spreading process is modelled within the Bass diffusion framework, which enables us to compare the differential equation version with an agent-based version of the model run on the empirical network. Although both models can capture the macro trend of adoption, they have limited capacity to describe the observed trends of urban scaling and distance decay. We find, however, that incorporating adoption thresholds, defined by the fraction of social connections that adopt a technology before the individual adopts, improves the network model fit to the urban scaling of early adopters. Controlling for the threshold distribution enables us to eliminate the bias induced by local network structure on predicting local adoption peaks. Finally, we show that geographical features such as distance from the innovation origin and town size influence prediction of adoption peak at local scales.
Submitted 27 August, 2020; v1 submitted 4 April, 2018;
originally announced April 2018.
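For readers unfamiliar with the differential equation version of the Bass diffusion framework mentioned in the abstract above, the following minimal sketch integrates the textbook Bass equation dF/dt = (p + qF)(1 - F) and compares it with its closed-form solution. It is not the paper's calibrated or agent-based model: the function names and the parameter values p = 0.03 (innovation) and q = 0.38 (imitation) are illustrative assumptions.

```python
import math


def bass_cumulative(t, p, q):
    """Closed-form cumulative adoption fraction F(t) of the standard
    Bass model, with F(0) = 0."""
    e = math.exp(-(p + q) * t)
    return (1.0 - e) / (1.0 + (q / p) * e)


def bass_euler(p, q, t_max, dt=0.01):
    """Forward-Euler integration of dF/dt = (p + q*F) * (1 - F).
    Returns a list of (time, F) pairs starting at (0, 0)."""
    f = 0.0
    trajectory = [(0.0, 0.0)]
    steps = int(round(t_max / dt))
    for k in range(steps):
        f += dt * (p + q * f) * (1.0 - f)
        trajectory.append(((k + 1) * dt, f))
    return trajectory


def bass_peak_time(p, q):
    """Time at which the adoption rate dF/dt is largest (requires q > p)."""
    return math.log(q / p) / (p + q)


if __name__ == "__main__":
    p, q = 0.03, 0.38  # illustrative values, not fitted to any data
    t_end, f_end = bass_euler(p, q, t_max=20.0, dt=0.001)[-1]
    print(f_end, bass_cumulative(20.0, p, q), bass_peak_time(p, q))
```

In an agent-based variant, the same innovation and imitation terms would instead be applied per individual, with the imitation term driven by the fraction of adopting neighbours in the network.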
-
Big Data Fusion to Estimate Fuel Consumption: A Case Study of Riyadh
Authors:
Adham Kalila,
Zeyad Awwad,
Riccardo Di Clemente,
Marta C. González
Abstract:
Falling oil revenues and rapid urbanization are putting a strain on the budgets of oil producing nations which often subsidize domestic fuel consumption. A direct way to decrease the impact of subsidies is to reduce fuel consumption by reducing congestion and car trips. While fuel consumption models have started to incorporate data sources from ubiquitous sensing devices, the opportunity is to develop comprehensive models at urban scale leveraging sources such as Global Positioning System (GPS) data and Call Detail Records. We combine these big data sets in a novel method to model fuel consumption within a city and estimate how it may change due to different scenarios. To do so we calibrate a fuel consumption model for use on any car fleet fuel economy distribution and apply it in Riyadh, Saudi Arabia. The model proposed, based on speed profiles, is then used to test the effects on fuel consumption of reducing flow, both randomly and by targeting the most fuel inefficient trips in the city. The estimates considerably improve baseline methods based on average speeds, showing the benefits of the information added by the GPS data fusion. The presented method can be adapted to also measure emissions. The results constitute a clear application of data analysis tools to help decision makers compare policies aimed at achieving economic and environmental goals.
Submitted 20 November, 2017;
originally announced November 2017.
-
Complex delay dynamics on railway networks: from universal laws to realistic modelling
Authors:
Bernardo Monechi,
Pietro Gravino,
Riccardo di Clemente,
Vito D. P. Servedio
Abstract:
Railways are a key infrastructure for any modern country. The reliability and resilience of this peculiar transportation system may be challenged by different shocks such as disruptions, strikes and adverse weather conditions. These events compromise the correct functioning of the system and trigger the spreading of delays into the railway network on a daily basis. Despite their importance, a general theoretical understanding of the underlying causes of these disruptions is still lacking. In this work, we analyse the Italian and German railway networks by leveraging the train schedules and actual delay data retrieved during the year 2015. We use these data to infer simple statistical laws ruling the emergence of localized delays in different areas of the network and we model the spreading of these delays throughout the network by exploiting a framework inspired by epidemic spreading models. Our model offers a fast and easy tool for the preliminary assessment of the effectiveness of traffic handling policies, and of the railway network criticalities.
Submitted 18 June, 2018; v1 submitted 26 July, 2017;
originally announced July 2017.
-
Sequences of purchases in credit card data reveal life styles in urban populations
Authors:
Riccardo Di Clemente,
Miguel Luengo-Oroz,
Matias Travizano,
Sharon Xu,
Bapu Vaitla,
Marta C. González
Abstract:
Zipf-like distributions characterize a wide set of phenomena in physics, biology, economics and social sciences. In human activities, Zipf-laws describe for example the frequency of words appearance in a text or the purchases types in shopping patterns. In the latter, the uneven distribution of transaction types is bound with the temporal sequences of purchases of individual choices. In this work, we define a framework using a text compression technique on the sequences of credit card purchases to detect ubiquitous patterns of collective behavior. Clustering the consumers by their similarity in purchases sequences, we detect five consumer groups. Remarkably, post checking, individuals in each group are also similar in their age, total expenditure, gender, and the diversity of their social and mobility networks extracted by their mobile phone records. By properly deconstructing transaction data with Zipf-like distributions, this method uncovers sets of significant sequences that reveal insights on collective human behavior.
Submitted 6 August, 2018; v1 submitted 1 March, 2017;
originally announced March 2017.
-
Epidemics of Liquidity Shortages in Interbank Markets
Authors:
Giuseppe Brandi,
Riccardo Di Clemente,
Giulio Cimini
Abstract:
Financial contagion from liquidity shocks has been recently ascribed as a prominent driver of systemic risk in interbank lending markets. Building on standard compartment models used in epidemics, in this work we develop an EDB (Exposed-Distressed-Bankrupted) model for the dynamics of liquidity shocks reverberation between banks, and validate it on electronic market for interbank deposits data. We show that the interbank network was highly susceptible to liquidity contagion at the beginning of the 2007/2008 global financial crisis, and that the subsequent micro-prudential and liquidity hoarding policies adopted by banks increased the network resilience to systemic risk, yet with the undesired side effect of drying out liquidity from the market. We finally show that the individual riskiness of a bank is better captured by its network centrality than by its participation in the market, along with the currently debated concept of "too interconnected to fail".
Submitted 16 May, 2018; v1 submitted 11 October, 2016;
originally announced October 2016.
-
The Build-Up of Diversity in Complex Ecosystems
Authors:
Andrea Tacchella,
Riccardo Di Clemente,
Andrea Gabrielli,
Luciano Pietronero
Abstract:
Diversity is a fundamental feature of ecosystems, even when the concept of ecosystem is extended to sociology or economics. Diversity can be intended as the count of different items, animals, or, more generally, interactions. There are two classes of stylized facts that emerge when diversity is taken into account. The first are Diversity explosions: evolutionary radiations in biology, or the process of escaping 'Poverty Traps' in economics are two well known examples. The second is nestedness: entities with a very diverse set of interactions are the only ones that interact with more specialized ones. In a single sentence: specialists interact with generalists. Nestedness is observed in a variety of bipartite networks of interactions: Biogeographic, macroeconomic and mutualistic to name a few. This indicates that entities diversify following a pattern. Since they appear in such very different systems, these two stylized facts point out that the build up of diversity is driven by a fundamental probabilistic mechanism, and here we sketch its minimal features. We show how the contraction of a random tripartite network, which is maximally entropic in all its degree distributions but one, can reproduce stylized facts of real data with great accuracy which is qualitatively lost when that degree distribution is changed. We base our reasoning on the combinatoric picture that the nodes on one layer of these bipartite networks can be described as combinations of a number of fundamental building blocks. The stylized facts of diversity that we observe in real systems can be explained with an extreme heterogeneity (a scale-free distribution) in the number of meaningful combinations in which each building block is involved. We show that if the usefulness of the building blocks has a scale-free distribution, then maximally entropic baskets of building blocks will give rise to very rich behaviors.
Submitted 12 September, 2016;
originally announced September 2016.
-
Inferring monopartite projections of bipartite networks: an entropy-based approach
Authors:
Fabio Saracco,
Mika J. Straka,
Riccardo Di Clemente,
Andrea Gabrielli,
Guido Caldarelli,
Tiziano Squartini
Abstract:
Bipartite networks are currently regarded as providing a major insight into the organization of many real-world systems, unveiling the mechanisms driving the interactions occurring between distinct groups of nodes. One of the most important issues encountered when modeling bipartite networks is devising a way to obtain a (monopartite) projection on the layer of interest, which preserves as much as possible the information encoded into the original bipartite structure. In the present paper we propose an algorithm to obtain statistically-validated projections of bipartite networks, according to which any two nodes sharing a statistically-significant number of neighbors are linked. Since assessing the statistical significance of nodes similarity requires a proper statistical benchmark, here we consider a set of four null models, defined within the exponential random graph framework. Our algorithm outputs a matrix of link-specific p-values, from which a validated projection is straightforwardly obtainable, upon running a multiple hypothesis testing procedure. Finally, we test our method on an economic network (i.e. the countries-products World Trade Web representation) and a social network (i.e. MovieLens, collecting the users' ratings of a list of movies). In both cases non-trivial communities are detected: while projecting the World Trade Web on the countries layer reveals modules of similarly-industrialized nations, projecting it on the products layer allows communities characterized by an increasing level of complexity to be detected; in the second case, projecting MovieLens on the films layer allows clusters of movies whose affinity cannot be fully accounted for by genre similarity to be individuated.
Submitted 17 May, 2017; v1 submitted 8 July, 2016;
originally announced July 2016.
-
Statistically validated network of portfolio overlaps and systemic risk
Authors:
Stanislao Gualdi,
Giulio Cimini,
Kevin Primicerio,
Riccardo Di Clemente,
Damien Challet
Abstract:
Common asset holding by financial institutions, namely portfolio overlap, is nowadays regarded as an important channel for financial contagion with the potential to trigger fire sales and thus severe losses at the systemic level. In this paper we propose a method to assess the statistical significance of the overlap between pairs of heterogeneously diversified portfolios, which then allows us to build a validated network of financial institutions where links indicate potential contagion channels due to realized portfolio overlaps. The method is implemented on a historical database of institutional holdings ranging from 1999 to the end of 2013, but can be in general applied to any bipartite network where the presence of similar sets of neighbors is of interest. We find that the proportion of validated network links (i.e., of statistically significant overlaps) increased steadily before the 2007-2008 global financial crisis and reached a maximum when the crisis occurred. We argue that the nature of this measure implies that systemic risk from fire sales liquidation was maximal at that time. After a sharp drop in 2008, systemic risk resumed its growth in 2009, with a notable acceleration in 2013, reaching levels not seen since 2007. We finally show that market trends tend to be amplified in the portfolios identified by the algorithm, such that it is possible to have an informative signal about financial institutions that are about to suffer (enjoy) the most significant losses (gains).
Submitted 27 September, 2016; v1 submitted 18 March, 2016;
originally announced March 2016.
-
From innovation to diversification: a simple competitive model
Authors:
Fabio Saracco,
Riccardo Di Clemente,
Andrea Gabrielli,
Luciano Pietronero
Abstract:
Few attempts have been proposed in order to describe the statistical features and historical evolution of the export bipartite matrix countries/products. An important standpoint is the introduction of a products network, namely a hierarchical forest of products that models the formation and the evolution of commodities. In the present article, we propose a simple dynamical model where countries compete with each other to acquire the ability to produce and export new products. Countries will have two possibilities to expand their export: innovating, i.e. introducing new goods, namely new nodes in the product networks, or copying the productive process of others, i.e. occupying a node already present in the same network. In this way, the topology of the products network and the country-product matrix evolve simultaneously, driven by the countries' push toward innovation.
Submitted 6 November, 2015; v1 submitted 14 August, 2015;
originally announced August 2015.
-
Detecting early signs of the 2007-2008 crisis in the world trade
Authors:
Fabio Saracco,
Riccardo Di Clemente,
Andrea Gabrielli,
Tiziano Squartini
Abstract:
Since 2007, several contributions have tried to identify early-warning signals of the financial crisis. However, the vast majority of analyses has focused on financial systems and little theoretical work has been done on the economic counterpart. In the present paper we fill this gap and employ the theoretical tools of network theory to shed light on the response of world trade to the financial crisis of 2007 and the economic recession of 2008-2009. We have explored the evolution of the bipartite World Trade Web (WTW) across the years 1995-2010, monitoring the behavior of the system both before and after 2007. Our analysis shows early structural changes in the WTW topology: since 2003, the WTW becomes increasingly compatible with the picture of a network where correlations between countries and products are progressively lost. Moreover, the WTW structural modification can be considered as concluded in 2010, after a seemingly stationary phase of three years. We have also refined our analysis by considering specific subsets of countries and products: the most statistically significant early-warning signals are provided by the most volatile macrosectors, especially when measured on developing countries, suggesting the emerging economies as being the most sensitive ones to the global economic cycles.
Submitted 8 July, 2016; v1 submitted 31 July, 2015;
originally announced August 2015.
-
Randomizing bipartite networks: the case of the World Trade Web
Authors:
Fabio Saracco,
Riccardo Di Clemente,
Andrea Gabrielli,
Tiziano Squartini
Abstract:
Within the last fifteen years, network theory has been successfully applied both to natural sciences and to socioeconomic disciplines. In particular, bipartite networks have been recognized to provide a particularly insightful representation of many systems, ranging from mutualistic networks in ecology to trade networks in economy, whence the need of a pattern detection-oriented analysis in order to identify statistically-significant structural properties. Such an analysis rests upon the definition of suitable null models, i.e. upon the choice of the portion of network structure to be preserved while randomizing everything else. However, quite surprisingly, little work has been done so far to define null models for real bipartite networks. The aim of the present work is to fill this gap, extending a recently-proposed method to randomize monopartite networks to bipartite networks. While the proposed formalism is perfectly general, we apply our method to the binary, undirected, bipartite representation of the World Trade Web, comparing the observed values of a number of structural quantities of interest with the expected ones, calculated via our randomization procedure. Interestingly, the behavior of the World Trade Web in this new representation is strongly different from the monopartite analogue, showing highly non-trivial patterns of self-organization.
Submitted 6 June, 2015; v1 submitted 17 March, 2015;
originally announced March 2015.
-
The Italian primary school-size distribution and the city-size: a complex nexus
Authors:
Alessandro Belmonte,
Riccardo Di Clemente,
Sergey V. Buldyrev
Abstract:
We characterize the statistical law according to which Italian primary school-size distributes. We find that the school-size can be approximated by a log-normal distribution, with a fat lower tail that collects a large number of very small schools. The upper tail of the school-size distribution decreases exponentially and the growth rates are distributed with a Laplace PDF. These distributions are similar to those observed for firms and are consistent with a Bose-Einstein preferential attachment process. The body of the distribution features a bimodal shape suggesting some source of heterogeneity in the school organization that we uncover by an in-depth analysis of the relation between school-size and city-size. We propose a novel cluster methodology and a new spatial interaction approach among schools which outline the variety of policies implemented in Italy. Different regional policies are also discussed, shedding light on the relation between policy and geographical features.
Submitted 17 July, 2014;
originally announced July 2014.
-
Statistical Agent Based Modelization of the Phenomenon of Drug Abuse
Authors:
Riccardo Di Clemente,
Luciano Pietronero
Abstract:
We introduce a statistical agent based model to describe the phenomenon of drug abuse and its dynamical evolution at the individual and global level. The agents are heterogeneous with respect to their intrinsic inclination to drugs, to their budget attitude and social environment. The various levels of drug use were inspired by the professional description of the phenomenon and this permits a direct comparison with all available data. We show that certain elements have a great importance to start the use of drugs, for example the rare events in the personal experiences which make it possible to occasionally overcome the barrier of drug use. The analysis of how the system reacts to perturbations is very important to understand its key elements and it provides strategies for effective policy making. The present model represents the first step of a realistic description of this phenomenon and can be easily generalized in various directions.
Submitted 29 July, 2012;
originally announced July 2012.