-
Homophilic organization of egocentric communities in ICT services
Authors:
Chandreyee Roy,
Hang-Hyun Jo,
János Kertész,
Kimmo Kaski,
János Török
Abstract:
Members of a society can be characterized by a large number of features, such as gender, age, ethnicity, religion, social status, and shared activities. One of the main tie-forming factors between individuals in human societies is homophily, the tendency of being attracted to similar others. Homophily has been mainly studied with focus on one of the features and little is known about the roles of…
▽ More
Members of a society can be characterized by a large number of features, such as gender, age, ethnicity, religion, social status, and shared activities. One of the main tie-forming factors between individuals in human societies is homophily, the tendency of being attracted to similar others. Homophily has been mainly studied with focus on one of the features and little is known about the roles of similarities of different origins in the formation of communities. To close this gap, we analyze three datasets from Information and Communications Technology (ICT) services, namely, two online social networks and a network deduced from mobile phone calls, in all of which metadata about individual features are available. We identify communities within egocentric networks and surprisingly find that the larger the community is, the more overlap is found between features of its members and the ego. We interpret this finding in terms of the effort needed to manage the communities; the larger diversity requires more effort such that to maintain a large diverse group may exceed the capacity of the members. As the ego reaches out to her alters on an ICT service, we observe that the first alter in each community tends to have a higher feature overlap with the ego than the rest. Moreover the feature overlap of the ego with all her alters displays a non-monotonic behaviors as a function of the ego's degree. We propose a simple mechanism of how people add links in their egocentric networks of alters that reproduces all the empirical observations and shows the reason behind non-monotonic tendency of the egocentric feature overlap as a function of the ego's degree.
△ Less
Submitted 5 May, 2024;
originally announced May 2024.
-
Differences of communication activity and mobility patterns between urban and rural people
Authors:
Fumiko Ogushi,
Chandreyee Roy,
Kimmo Kaski
Abstract:
Human mobility and other social activity patterns influence various aspects of society such as urban planning, traffic predictions, crisis resilience, and epidemic prevention. The behaviour of individuals, like their communication frequencies and movements, are shaped by societal and socio-economic factors. In addition, the differences in the geolocation of people as well as their gender and age c…
▽ More
Human mobility and other social activity patterns influence various aspects of society such as urban planning, traffic predictions, crisis resilience, and epidemic prevention. The behaviour of individuals, like their communication frequencies and movements, are shaped by societal and socio-economic factors. In addition, the differences in the geolocation of people as well as their gender and age cast effects on their activity patterns. In this study we focus on investigating these patterns by using mobile phone data, specifically the call detail records (CDRs), to analyze the social communication and mobility patterns of people. This dataset can provide us insight into the individual and population-level behaviours in rural and urban environments on a daily, weekly and seasonal basis. The results of our analyses show that in the urban areas people have high calling activity but low mobility, while in the rural areas they show the opposite behaviour, i.e. low calling activity combined with high mobility. Overall, there is a decreasing trend in people's mobility through the year even though their calling activity remained consistent except for the holidays during which time the communication frequency drops markedly. We have also observed that there are significant differences in the mobility between the work days and free days. Finally, the age and gender of individuals have also been observed to play a role in the seasonal patterns differently in urban and rural areas.
△ Less
Submitted 22 November, 2023;
originally announced November 2023.
-
Residential clustering and mobility of ethnic groups
Authors:
Kunal Bhattacharya,
Chandreyee Roy,
Tuomas Takko,
Anna Rotkirch,
Kimmo Kaski
Abstract:
We studied residential clustering and mobility of ethnic minorities using a theoretical framework based on null models of spatial distributions and movements of populations. Using microdata from population registers we compared the patterns of clustering amongst various socioethnic groups living in and around the capital region of Finland. Using the models we were able to connect the factors influ…
▽ More
We studied residential clustering and mobility of ethnic minorities using a theoretical framework based on null models of spatial distributions and movements of populations. Using microdata from population registers we compared the patterns of clustering amongst various socioethnic groups living in and around the capital region of Finland. Using the models we were able to connect the factors influencing intraurban migration to the spatial patterns that have been developed over time. We could also demonstrate the interrelationship of the movement and clustering with fertility. The observed clustering seems to be a combined effect of fertility and the tendency to migrate locally. The models also highlight the importance of factors like proximity to the city-centre, average neighbourhood income, and similarity of socioeconomic profiles.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Reproducibility analysis of automated deep learning based localisation of mandibular canals on a temporal CBCT dataset
Authors:
Jorma Järnstedt,
Jaakko Sahlsten,
Joel Jaskari,
Kimmo Kaski,
Helena Mehtonen,
Ari Hietanen,
Osku Sundqvist,
Vesa Varjonen,
Vesa Mattila,
Sangsom Prapayasotok,
Sakarat Nalampang
Abstract:
Preoperative radiological identification of mandibular canals is essential for maxillofacial surgery. This study demonstrates the reproducibility of a deep learning system (DLS) by evaluating its localisation performance on 165 heterogeneous cone beam computed tomography (CBCT) scans from 72 patients in comparison to an experienced radiologist's annotations. We evaluated the performance of the DLS…
▽ More
Preoperative radiological identification of mandibular canals is essential for maxillofacial surgery. This study demonstrates the reproducibility of a deep learning system (DLS) by evaluating its localisation performance on 165 heterogeneous cone beam computed tomography (CBCT) scans from 72 patients in comparison to an experienced radiologist's annotations. We evaluated the performance of the DLS using the symmetric mean curve distance (SMCD), the average symmetric surface distance (ASSD), and the Dice similarity coefficient (DSC). The reproducibility of the SMCD was assessed using the within-subject coefficient of repeatability (RC). Three other experts rated the diagnostic validity twice using a 0-4 Likert scale. The reproducibility of the Likert scoring was assessed using the repeatability measure (RM). The RC of SMCD was 0.969 mm, the median (interquartile range) SMCD and ASSD were 0.643 (0.186) mm and 0.351 (0.135) mm, respectively, and the mean (standard deviation) DSC was 0.548 (0.138). The DLS performance was most affected by postoperative changes. The RM of the Likert scoring was 0.923 for the radiologist and 0.877 for the DLS. The mean (standard deviation) Likert score was 3.94 (0.27) for the radiologist and 3.84 (0.65) for the DLS. The DLS demonstrated proficient qualitative and quantitative reproducibility, temporal generalisability, and clinical validity.
△ Less
Submitted 28 April, 2023;
originally announced May 2023.
-
A simple model of edit activity in Wikipedia
Authors:
Takashi Shimada,
Fumiko Ogushi,
Janos Torok,
Janos Kertesz,
Kimmo Kaski
Abstract:
A simple dynamical model of collective edit activity of Wikipedia articles and their content evolution is introduced. Based on the recent empirical findings, each editor in the model is characterized by an ability to make content edit, i.e., improving the article by adding content and a tendency to make maintenance edit, i.e., dealing with formal aspects and maintaining the edit flow. In addition,…
▽ More
A simple dynamical model of collective edit activity of Wikipedia articles and their content evolution is introduced. Based on the recent empirical findings, each editor in the model is characterized by an ability to make content edit, i.e., improving the article by adding content and a tendency to make maintenance edit, i.e., dealing with formal aspects and maintaining the edit flow. In addition, each article is characterized by a level of maturity as compared to a potential quality needed to comprehensively cover its topic. This model is found to reproduce the basic structure of the bipartite network between editors and articles of Wikipedia. Furthermore, the relation between the model parameters of editors and articles and the metrics of those calculated from the emergent network turns out to be robust, i.e. depending only on the rate of the introduction of new articles to the editing activity. This results provides us a way to relate observations in the real data to the hidden characteristics of editors and articles. For the nestedness of the networks, systems with weighted parameter distribution gives better match to the empirical one. This suggests the importance of high-dimensional nature of the ability of editors and quality of articles in the real system.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Modelling exposure between populations using networks of mobility during Covid-19
Authors:
Tuomas Takko,
Kunal Bhattacharya,
Kimmo Kaski
Abstract:
The use of mobile phone call detail records and device location data for the calling patterns, movements, and social contacts of individuals, has proven to be valuable for devising models and understanding of their mobility and behaviour patterns. In this study we investigate weighted exposure-networks of human daily activities in the capital region of Finland as a proxy for contacts between posta…
▽ More
The use of mobile phone call detail records and device location data for the calling patterns, movements, and social contacts of individuals, has proven to be valuable for devising models and understanding of their mobility and behaviour patterns. In this study we investigate weighted exposure-networks of human daily activities in the capital region of Finland as a proxy for contacts between postal code areas during the pre-pandemic year 2019 and pandemic years 2020, 2021 and early 2022. We investigate the suitability of gravity and radiation type models for reconstructing the exposure-networks based on geo-spatial and population mobility information. For this we use a mobile phone dataset of aggregated daily visits from a postal code area to cellphone grid locations, and treat it as a bipartite network to create weighted one mode projections using a weighted co-occurrence function. We fit a gravitation model and a radiation model to the averaged weekly and yearly projection networks with geo-spatial and socioeconomic variables of the postal code areas and their populations. We also consider an extended gravity type model comprising of additional postal area information such as distance via public transportation and population density. The results show that the co-occurrence of human activities, or exposure, between postal code areas follows both the gravity and radiation type interactions, once fitted to the empirical network. The effects of the pandemic beginning in 2020 can be observed as a decrease of the overall activity as well as of the exposure of the projected networks. In general, the results show that the postal code level networks changed to be more proximity weighted after the pandemic began, following the government imposed non-pharmaceutical interventions, with differences based on the geo-spatial and socioeconomic structure of the areas.
△ Less
Submitted 17 April, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Exploration of the effects of epidemics on the regional socio-economics: a modelling approach
Authors:
Jan E. Snellman,
Rafael A. Barrio,
Kimmo K. Kaski,
Maarit J. Korpi--Lagg
Abstract:
Pandemics, in addition to affecting the health of populations, can have huge impacts on their social and economic behavior. These factors, on the other hand, have the potential to feed back to and influence the disease spreading. It is important to systematically study these interrelations, to determine which ones have significant effects, and whether the effects are adverse or beneficial. Our rec…
▽ More
Pandemics, in addition to affecting the health of populations, can have huge impacts on their social and economic behavior. These factors, on the other hand, have the potential to feed back to and influence the disease spreading. It is important to systematically study these interrelations, to determine which ones have significant effects, and whether the effects are adverse or beneficial. Our recently developed epidemic model with agent-based and geographical elements is used in this study for such a purpose. We perform an extensive parameter space exploration of the socio-economic part of the model, including factors like the attitudes (called values) of the agents towards the disease spreading, health, economic situation, and regulations by government agents. We search for prominent patterns from the resulting simulated data using basic classification tools, namely self-organizing maps and principal component analysis. We seek to isolate the most important value parameters of the population and government agents influencing the disease spreading speed and patterns, and monitor different quantities of the model output, such as infection rates, the propagation speed of the epidemic, economic activity, government regulations, and the compliance of population. Out of these, the ones describing the epidemic spreading were resulting in the most distinctive clustering of the data, and they were selected as the basis of the remaining analysis. We relate the found clusters to three distinct types of disease spreading: wave-like, chaotic, and transitional spreading patterns. The most important value parameter contributing to phase changes between these phases was found to be the compliance of the population agents towards the government regulations.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
Turnover in close friendships: age and gender differences
Authors:
Chandreyee Roy,
Kunal Bhattacharya,
Robin I. M. Dunbar,
Kimmo Kaski
Abstract:
Humans are social animals and the interpersonal bonds formed between them are crucial for their development and well being in a society. These relationships are usually structured into several layers (Dunbar's layers of friendship) depending on their significance in an individual's life with closest friends and family being the most important ones taking major part of their time and communication…
▽ More
Humans are social animals and the interpersonal bonds formed between them are crucial for their development and well being in a society. These relationships are usually structured into several layers (Dunbar's layers of friendship) depending on their significance in an individual's life with closest friends and family being the most important ones taking major part of their time and communication effort. However, we have little idea how the initiation and termination of these relationships occurs across the lifespan. To explore this, we analyse a national cellphone database to determine how and when changes in close relationships occur in the two genders. In general, membership of this inner circle of intimate relationships is extremely stable, at least over a three-year period. However, around 1-4% of alters change every year, with the rate of change being higher among 17-21 year olds than older adults. Young adult females terminate more of their opposite-gender relationships, while older males are more persistent in trying to maintain relationships in decline. These results emphasise the variability in relationship dynamics across age and gender, and remind us that individual differences play an important role in the structure of social networks. Overall, our study provides a holistic understanding of the dynamic nature of relationships during the life-course of humans.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
Deep learning based parameter search for an agent based social network model
Authors:
Yohsuke Murase,
Hang-Hyun Jo,
János Török,
János Kertész,
Kimmo Kaski
Abstract:
Interactions between humans give rise to complex social networks that are characterized by heterogeneous degree distribution, weight-topology relation, overlap** community structure, and dynamics of links. Understanding such networks is a primary goal of science due to serving as the scaffold for many emergent social phenomena from disease spreading to political movements. An appropriate tool fo…
▽ More
Interactions between humans give rise to complex social networks that are characterized by heterogeneous degree distribution, weight-topology relation, overlap** community structure, and dynamics of links. Understanding such networks is a primary goal of science due to serving as the scaffold for many emergent social phenomena from disease spreading to political movements. An appropriate tool for studying them is agent-based modeling, in which nodes, representing persons, make decisions about creating and deleting links, thus yielding various macroscopic behavioral patterns. Here we focus on studying a generalization of the weighted social network model, being one of the most fundamental agent-based models for describing the formation of social ties and social networks. This Generalized Weighted Social Network (GWSN) model incorporates triadic closure, homophilic interactions, and various link termination mechanisms, which have been studied separately in the previous works. Accordingly, the GWSN model has an increased number of input parameters and the model behavior gets excessively complex, making it challenging to clarify the model behavior. We have executed massive simulations with a supercomputer and using the results as the training data for deep neural networks to conduct regression analysis for predicting the properties of the generated networks from the input parameters. The obtained regression model was also used for global sensitivity analysis to identify which parameters are influential or insignificant. We believe that this methodology is applicable for a large class of complex network models, thus opening the way for more realistic quantitative agent-based modeling.
△ Less
Submitted 14 July, 2021;
originally announced July 2021.
-
Ecology in the digital world of Wikipedia
Authors:
Fumiko Ogushi,
János Kertész,
Kimmo Kaski,
Takashi Shimada
Abstract:
Wikipedia, a paradigmatic example of online knowledge space is organized in a collaborative, bottom-up way with voluntary contributions, yet it maintains a level of reliability comparable to that of traditional encyclopedias. The lack of selected professional writers and editors makes the judgement about quality and trustworthiness of the articles a real challenge. Here we show that a self-consist…
▽ More
Wikipedia, a paradigmatic example of online knowledge space is organized in a collaborative, bottom-up way with voluntary contributions, yet it maintains a level of reliability comparable to that of traditional encyclopedias. The lack of selected professional writers and editors makes the judgement about quality and trustworthiness of the articles a real challenge. Here we show that a self-consistent metrics for the network defined by the edit records captures well the character of editors' activity and the articles' level of complexity. Using our metrics, one can better identify the human-labeled high-quality articles, e.g., "featured" ones, and differentiate them from the popular and controversial articles. Furthermore, the dynamics of the editor-article system is also well captured by the metrics, revealing the evolutionary pathways of articles and diverse roles of editors. We demonstrate that the collective effort of the editors indeed drives to the direction of article improvement.
△ Less
Submitted 21 May, 2021;
originally announced May 2021.
-
Human-agent coordination in a group formation game
Authors:
Tuomas Takko,
Kunal Bhattacharya,
Daniel Monsivais,
Kimmo Kaski
Abstract:
Coordination and cooperation between humans and autonomous agents in cooperative games raises interesting questions of human decision making and behaviour changes. Here we report our findings from a group formation game in a small-world network of different mixes of human and agent players, aiming to achieve connected clusters of the same colour by swap** places with neighbouring players using n…
▽ More
Coordination and cooperation between humans and autonomous agents in cooperative games raises interesting questions of human decision making and behaviour changes. Here we report our findings from a group formation game in a small-world network of different mixes of human and agent players, aiming to achieve connected clusters of the same colour by swap** places with neighbouring players using non-overlap** information. In the experiments the human players are incentivized by rewarding to prioritize their own cluster while the model of agents' decision making is derived from our previous experiment of purely cooperative game between human players. The experiments were performed by grou** the players in three different setups to investigate the overall effect of having cooperative autonomous agents within teams. We observe that the change in the behavior of human subjects adjusts to playing with autonomous agents by being less risk averse, while kee** the overall performance efficient by splitting the behaviour into selfish and cooperative in the two actions performed during the rounds of the game. Moreover, results from two hybrid human-agent setups suggest that the group composition affects the evolution of clusters. Our findings indicate that in purely or lesser cooperative settings, providing more control to humans could help in maximizing the overall performance of hybrid systems.
△ Less
Submitted 20 May, 2021;
originally announced May 2021.
-
Modeling the interplay between epidemics and regional socio-economics
Authors:
Jan E. Snellman,
Rafael A. Barrio,
Kimmo K. Kaski,
Maarit J. Käpylä
Abstract:
In this study we present a dynamical agent-based model to investigate the interplay between the socio-economy of and SEIRS-type epidemic spreading over a geographical area, divided to smaller area districts and further to smallest area cells. The model treats the populations of cells and authorities of districts as agents, such that the former can reduce their economic activity and the latter can…
▽ More
In this study we present a dynamical agent-based model to investigate the interplay between the socio-economy of and SEIRS-type epidemic spreading over a geographical area, divided to smaller area districts and further to smallest area cells. The model treats the populations of cells and authorities of districts as agents, such that the former can reduce their economic activity and the latter can recommend economic activity reduction both with the overall goal to slow down the epidemic spreading. The agents make decisions with the aim of attaining as high socio-economic standings as possible relative to other agents of the same type by evaluating their standings based on the local and regional infection rates, compliance to the authorities' regulations, regional drops in economic activity, and efforts to mitigate the spread of epidemic. We find that the willingness of population to comply with authorities' recommendations has the most drastic effect on the epidemic spreading: periodic waves spread almost unimpeded in non-compliant populations, while in compliant ones the spread is minimal with chaotic spreading pattern and significantly lower infection rates. Health and economic concerns of agents turn out to have lesser roles, the former increasing their efforts and the latter decreasing them.
△ Less
Submitted 12 October, 2021; v1 submitted 14 May, 2021;
originally announced May 2021.
-
Balanced-imbalanced transitions in indirect reciprocity dynamics on networks
Authors:
Koji Oishi,
Shuhei Miyano,
Kimmo Kaski,
Takashi Shimada
Abstract:
Here we investigate the dynamics of indirect reciprocity on networks, a type of social dynamics in which the attitude of individuals, either cooperative or antagonistic, toward other individuals changes over time by their actions and mutual monitoring. We observe an absorbing state phase transition as we change the network's link or edge density. When the edge density is either small or large enou…
▽ More
Here we investigate the dynamics of indirect reciprocity on networks, a type of social dynamics in which the attitude of individuals, either cooperative or antagonistic, toward other individuals changes over time by their actions and mutual monitoring. We observe an absorbing state phase transition as we change the network's link or edge density. When the edge density is either small or large enough, opinions quickly reach an absorbing state, from which opinions never change anymore once reached. In contrast, if the edge density is in the middle range the absorbing state is not reached and the state keeps changing thus being active. The result shows a novel effect of social networks on spontaneous group formation.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
Internal migration and mobile communication patterns among pairs with strong ties
Authors:
Mikaela Irene D. Fudolig,
Daniel Monsivais,
Kunal Bhattacharya,
Hang-Hyun Jo,
Kimmo Kaski
Abstract:
Using large-scale call detail records of anonymised mobile phone service subscribers with demographic and location information, we investigate how a long-distance residential move within the country affects the mobile communication patterns between an ego who moved and a frequently called alter who did not move. By using clustering methods in analysing the call frequency time series, we find that…
▽ More
Using large-scale call detail records of anonymised mobile phone service subscribers with demographic and location information, we investigate how a long-distance residential move within the country affects the mobile communication patterns between an ego who moved and a frequently called alter who did not move. By using clustering methods in analysing the call frequency time series, we find that such ego-alter pairs are grouped into two clusters, those with the call frequency increasing and those with the call frequency decreasing after the move of the ego. This indicates that such residential moves are correlated with a change in the communication pattern soon after moving. We find that the pre-move calling behaviour is a relevant predictor for the post-move calling behaviour. While demographic and location information can help in predicting whether the call frequency will rise or decay, they are not relevant in predicting the actual call frequency volume. We also note that at four months after the move, most of these close pairs maintain contact, even if the call frequency is decreased.
△ Less
Submitted 5 April, 2021; v1 submitted 1 September, 2020;
originally announced September 2020.
-
Modelling Covid-19 epidemic in Mexico, Finland and Iceland
Authors:
Rafael A. Barrio,
Kimmo K. Kaski,
Gudmundur G. Haraldsson,
Thor Aspelund,
Tzipe Govezensky
Abstract:
Over the past two decades there has been a number of global outbreaks of viral diseases. This has accelerated the efforts to model and forecast the disease spreading, in order to find ways to confine the spreading regionally and between regions. Towards this we have devised a model of geographical spreading of viral infections due to human spatial mobility and adapted it to the latest COVID-19 pan…
▽ More
Over the past two decades there has been a number of global outbreaks of viral diseases. This has accelerated the efforts to model and forecast the disease spreading, in order to find ways to confine the spreading regionally and between regions. Towards this we have devised a model of geographical spreading of viral infections due to human spatial mobility and adapted it to the latest COVID-19 pandemic. In this the region to be modelled is overlaid with a two-dimensional grid weighted with the population density defined cells, in each of which a compartmental SEIRS system of delay difference equations simulate the local dynamics (microdynamics) of the disease. The infections between cells are stochastic and allow for the geographical spreading of the virus over the two-dimensional space (macrodynamics). This approach allows to separate the parameters related to the biological aspects of the disease from the ones that represent the spatial contagious behaviour through different kinds of mobility of people acting as virus carriers. These provide sufficient information to trace the evolution of the pandemic in different situations. In particular we have applied this approach to three in many ways different countries, Mexico, Finland and Iceland and found that the model is capable of reproducing and predicting the stochastic global path of the pandemic. This study sheds light on how the diverse cultural and socioeconomic aspects of a country influence the evolution of the epidemics and also the efficacy of social distancing and other confinement measures.
△ Less
Submitted 18 July, 2020;
originally announced July 2020.
-
Dynamical properties of hierarchical networks of Van Der Pol oscillators
Authors:
Daniel Monsivais,
Kunal Bhattacharya,
Rafael A. Barrio,
Philip K. Maini,
Kimmo K. Kaski
Abstract:
Oscillator networks found in social and biological systems are characterized by the presence of wide ranges of coupling strengths and complex organization. Yet robustness and synchronization of oscillations are found to emerge on macro-scales that eventually become key to the functioning of these systems. In order to model this kind of dynamics observed, for example, in systems of circadian oscill…
▽ More
Oscillator networks found in social and biological systems are characterized by the presence of wide ranges of coupling strengths and complex organization. Yet robustness and synchronization of oscillations are found to emerge on macro-scales that eventually become key to the functioning of these systems. In order to model this kind of dynamics observed, for example, in systems of circadian oscillators, we study networks of Van der Pol oscillators that are connected with hierarchical couplings. For each isolated oscillator we assume the same fundamental frequency. Using numerical simulations, we show that the coupled system goes to a phase-locked state, with both phase and frequency being the same for every oscillator at each level of the hierarchy. The observed frequency at each level of the hierarchy changes, reaching an asymptotic lowest value at the uppermost level. Notably, the asymptotic frequency can be tuned to any value below the fundamental frequency of an uncoupled Van der Pol oscillator. We compare the numerical results with those of an approximate analytic solution and find them to be in qualitative agreement.
△ Less
Submitted 16 December, 2019; v1 submitted 10 September, 2019;
originally announced September 2019.
-
Going beneath the shoulders of giants: tracking the cumulative knowledge spreading in a comprehensive citation network
Authors:
Pietro della Briotta Parolo,
Rainer Kujala,
Kimmo Kaski,
Mikko Kivelä
Abstract:
In all of science, the authors of publications depend on the knowledge presented by the previous publications. Thus they "stand on the shoulders of giants" and there is a flow of knowledge from previous publications to more recent ones. The dominating paradigm for tracking this flow of knowledge is to count the number of direct citations, but this neglects the fact that beneath the first layer of…
▽ More
In all of science, the authors of publications depend on the knowledge presented by the previous publications. Thus they "stand on the shoulders of giants" and there is a flow of knowledge from previous publications to more recent ones. The dominating paradigm for tracking this flow of knowledge is to count the number of direct citations, but this neglects the fact that beneath the first layer of citations there is a full body of literature. In this study, we go underneath the "shoulders" by investigating the cumulative knowledge creation process in a citation network of around 35 million publications. In particular, we study stylized models of persistent influence and diffusion that take into account all the possible chains of citations. When we study the persistent influence values of publications and their citation counts, we find that the publications related to Nobel Prizes i.e. Nobel papers have higher ranks in terms of persistent influence than that due to citations, and that the most outperforming publications are typically early works leading to hot research topics of their time. The diffusion model reveals a significant variation in the rates at which different fields of research share knowledge. We find that these rates have been increasing systematically for several decades, which can be explained by the increase in the publication volumes. Overall, our results suggest that analyzing cumulative knowledge creation on a global scale can be useful in estimating the type and scale of scientific influence of individual publications and entire research areas as well as yielding insights which could not be discovered by using only the direct citation counts.
△ Less
Submitted 29 August, 2019;
originally announced August 2019.
-
Link-centric analysis of variation by demographics in mobile phone communication patterns
Authors:
Mikaela Irene D. Fudolig,
Kunal Bhattacharya,
Daniel Monsivais,
Hang-Hyun Jo,
Kimmo Kaski
Abstract:
We present a link-centric approach to study variation in the mobile phone communication patterns of individuals. Unlike most previous research on call detail records that focused on the variation of phone usage across individual users, we examine how the calling and texting patterns obtained from call detail records vary among pairs of users and how these patterns are affected by the nature of rel…
▽ More
We present a link-centric approach to study variation in the mobile phone communication patterns of individuals. Unlike most previous research on call detail records that focused on the variation of phone usage across individual users, we examine how the calling and texting patterns obtained from call detail records vary among pairs of users and how these patterns are affected by the nature of relationships between users. To demonstrate this link-centric perspective, we extract factors that contribute to the variation in the mobile phone communication patterns and predict demographics-related quantities for pairs of users. The time of day and the channel of communication (calls or texts) are found to explain most of the variance among pairs that frequently call each other. Furthermore, we find that this variation can be used to predict the relationship between the pairs of users, as inferred from their age and gender, as well as the age of the younger user in a pair. From the classifier performance across different age and gender groups as well as the inherent class overlap suggested by the estimate of the bounds of the Bayes error, we gain insights into the similarity and differences of communication patterns across different relationships.
△ Less
Submitted 16 December, 2019; v1 submitted 31 July, 2019;
originally announced July 2019.
-
Social structure formation in a network of agents playing a hybrid of ultimatum and dictator games
Authors:
Jan E. Snellman,
Rafael Barrio,
Kimmo Kaski
Abstract:
Here we present an agent-based model where agents interact with other agents by playing a hybrid of dictator and ultimatum games in a co-evolving social network. The basic assumption about the behaviour of the agents in both games is that they try to attain superior socioeconomic positions relative to other agents. As the model parameters we have chosen the relative proportions of the dictator and…
▽ More
Here we present an agent-based model where agents interact with other agents by playing a hybrid of dictator and ultimatum games in a co-evolving social network. The basic assumption about the behaviour of the agents in both games is that they try to attain superior socioeconomic positions relative to other agents. As the model parameters we have chosen the relative proportions of the dictator and ultimatum game strategies being played between a pair of agents in a single social transaction and a parameter depicting the living costs of the agents. The motivation of the study is to examine how different types of social interactions affect the formation of social structures and networks, when the agents have a tendency to maximize their socioeconomic standing. We find that such social networks of agents invariably undergo a community formation process from simple chain-like structure to more complex networks as the living cost parameter is increased. The point where this occurs, depends also on the relative proportion of the dictator and ultimatum games being played. We find that it is harder for complex social structures to form when the dictator game strategy in social transactions of agents becomes more dominant over that of the ultimatum game.
△ Less
Submitted 19 August, 2020; v1 submitted 11 April, 2019;
originally announced April 2019.
-
Sampling networks by nodal attributes
Authors:
Yohsuke Murase,
Hang-Hyun Jo,
János Török,
János Kertész,
Kimmo Kaski
Abstract:
In a social network individuals or nodes connect to other nodes by choosing one of the channels of communication at a time to re-establish the existing social links. Since available data sets are usually restricted to a limited number of channels or layers, these autonomous decision making processes by the nodes constitute the sampling of a multiplex network leading to just one (though very import…
▽ More
In a social network individuals or nodes connect to other nodes by choosing one of the channels of communication at a time to re-establish the existing social links. Since available data sets are usually restricted to a limited number of channels or layers, these autonomous decision making processes by the nodes constitute the sampling of a multiplex network leading to just one (though very important) example of sampling bias caused by the behavior of the nodes. We develop a general setting to get insight and understand the class of network sampling models, where the probability of sampling a link in the original network depends on the attributes $h$ of its adjacent nodes. Assuming that the nodal attributes are independently drawn from an arbitrary distribution $ρ(h)$ and that the sampling probability $r(h_i , h_j)$ for a link $ij$ of nodal attributes $h_i$ and $h_j$ is also arbitrary, we derive exact analytic expressions of the sampled network for such network characteristics as the degree distribution, degree correlation, and clustering spectrum. The properties of the sampled network turn out to be sums of quantities for the original network topology weighted by the factors stemming from the sampling. Based on our analysis, we find that the sampled network may have sampling-induced network properties that are absent in the original network, which implies the potential risk of a naive generalization of the results of the sample to the entire original network. We also consider the case, when neighboring nodes have correlated attributes to show how to generalize our formalism for such sampling bias and we get good agreement between the analytic results and the numerical simulations.
△ Less
Submitted 22 May, 2019; v1 submitted 12 February, 2019;
originally announced February 2019.
-
Cumulative effects of triadic closure and homophily in social networks
Authors:
Aili Asikainen,
Gerardo Iñiguez,
Kimmo Kaski,
Mikko Kivelä
Abstract:
Much of the structure in social networks has been explained by two seemingly independent network evolution mechanisms: triadic closure and homophily. While it is common to consider these mechanisms separately or in the frame of a static model, empirical studies suggest that their dynamic interplay is the very process responsible for the homophilous patterns of association seen in off- and online s…
▽ More
Much of the structure in social networks has been explained by two seemingly independent network evolution mechanisms: triadic closure and homophily. While it is common to consider these mechanisms separately or in the frame of a static model, empirical studies suggest that their dynamic interplay is the very process responsible for the homophilous patterns of association seen in off- and online social networks. By combining these two mechanisms in a minimal solvable dynamic model, we confirm theoretically the long-held and empirically established hypothesis that homophily can be amplified by the triadic closure mechanism. This research approach allows us to estimate how much of the observed homophily in various friendship and communication networks is due to amplification for a given amount of triadic closure. We find that the cumulative advantage-like process leading to homophily amplification can, under certain circumstances, also lead to the widely documented core-periphery structure of social networks, as well as to the emergence of memory of previous homophilic constraints (equivalent to hysteresis phenomena in physics). The theoretical understanding provided by our results highlights the importance of early intervention in managing at the societal level the most adverse effects of homophilic decision-making, such as inequality, segregation and online echo chambers.
△ Less
Submitted 17 September, 2018;
originally announced September 2018.
-
Different patterns of social closeness observed in mobile phone communication
Authors:
Mikaela Irene D. Fudolig,
Daniel Monsivais,
Kunal Bhattacharya,
Hang-Hyun Jo,
Kimmo Kaski
Abstract:
We analyze a large-scale mobile phone call dataset containing information on the age, gender, and billing locality of users to get insight into social closeness in pairs of individuals of similar age. We show that in addition to using the demographic information, the ranking of contacts by their call frequency in egocentric networks is crucial to characterize the different communication patterns.…
▽ More
We analyze a large-scale mobile phone call dataset containing information on the age, gender, and billing locality of users to get insight into social closeness in pairs of individuals of similar age. We show that in addition to using the demographic information, the ranking of contacts by their call frequency in egocentric networks is crucial to characterize the different communication patterns. We find that mutually top-ranked opposite-gender pairs show the highest levels of call frequency and daily regularity, which is consistent with the behavior of real-life romantic partners. At somewhat lower level of call frequency and daily regularity come the mutually top-ranked same-gender pairs, while the lowest call frequency and daily regularity are observed for mutually non-top-ranked pairs. We have also observed that older pairs tend to call less frequently and less regularly than younger pairs, while the average call durations exhibit a more complex dependence on age. We expect that a more detailed analysis can help us better characterize the nature of relationships between pairs of individuals and distinguish between various types of relations, such as siblings, friends, and romantic partners.
△ Less
Submitted 7 August, 2019; v1 submitted 30 August, 2018;
originally announced August 2018.
-
Structural transition in social networks: The role of homophily
Authors:
Yohsuke Murase,
Hang-Hyun Jo,
János Török,
János Kertész,
Kimmo Kaski
Abstract:
We introduce a model for the formation of social networks, which takes into account the homophily or the tendency of individuals to associate and bond with similar others, and the mechanisms of global and local attachment as well as tie reinforcement due to social interactions between people. We generalize the weighted social network model such that the nodes or individuals have $F$ features and e…
▽ More
We introduce a model for the formation of social networks, which takes into account the homophily or the tendency of individuals to associate and bond with similar others, and the mechanisms of global and local attachment as well as tie reinforcement due to social interactions between people. We generalize the weighted social network model such that the nodes or individuals have $F$ features and each feature can have $q$ different values. Here the tendency for the tie formation between two individuals due to the overlap in their features represents homophily. We find a phase transition as a function of $F$ or $q$, resulting in a phase diagram. For fixed $q$ and as a function of $F$ the system shows two phases separated at $F_c$. For $F{<}F_c$ large, homogeneous, and well separated communities can be identified within which the features match almost perfectly (segregated phase). When $F$ becomes larger than $F_c$, the nodes start to belong to several communities and within a community the features match only partially (overlap** phase). Several quantities reflect this transition, including the average degree, clustering coefficient, feature overlap, and the number of communities per node. We also make an attempt to interpret these results in terms of observations on social behavior of humans.
△ Less
Submitted 26 March, 2019; v1 submitted 15 August, 2018;
originally announced August 2018.
-
Social Physics: Uncovering Human Behaviour from Communication
Authors:
Kunal Bhattacharya,
Kimmo Kaski
Abstract:
In the post year 2000 era the technologies that facilitate human communication have rapidly multiplied. While the adoption of these technologies has hugely impacted the behaviour and sociality of people, specifically in urban but also in rural environments, their "digital footprints" on different data bases have become an active area of research. The existence and accessibility of such large popul…
▽ More
In the post year 2000 era the technologies that facilitate human communication have rapidly multiplied. While the adoption of these technologies has hugely impacted the behaviour and sociality of people, specifically in urban but also in rural environments, their "digital footprints" on different data bases have become an active area of research. The existence and accessibility of such large population-level datasets, has allowed scientists to study and model innate human tendencies and social patterns in an unprecedented way that complements traditional research approaches like questionnaire studies. In this review we focus on data analytics and modelling research - we call Social Physics - as it has been carried out using the mobile phone data sets to get insight into the various aspects of human sociality, burstiness in communication, mobility patterns, and daily rhythms.
△ Less
Submitted 13 April, 2018;
originally announced April 2018.
-
Bursty Human Dynamics
Authors:
Márton Karsai,
Hang-Hyun Jo,
Kimmo Kaski
Abstract:
Bursty dynamics is a common temporal property of various complex systems in Nature but it also characterises the dynamics of human actions and interactions. At the phenomenological level it is a feature of all systems that evolve heterogeneously over time by alternating between periods of low and high event frequencies. In such systems, bursts are identified as periods in which the events occur wi…
▽ More
Bursty dynamics is a common temporal property of various complex systems in Nature but it also characterises the dynamics of human actions and interactions. At the phenomenological level it is a feature of all systems that evolve heterogeneously over time by alternating between periods of low and high event frequencies. In such systems, bursts are identified as periods in which the events occur with a rapid pace within a short time-interval while these periods are separated by long periods of time with low frequency of events. As such dynamical patterns occur in a wide range of natural phenomena, their observation, characterisation, and modelling have been a long standing challenge in several fields of research. However, due to some recent developments in communication and data collection techniques it has become possible to follow digital traces of actions and interactions of humans from the individual up to the societal level. This led to several new observations of bursty phenomena in the new but largely unexplored area of human dynamics, which called for the renaissance to study these systems using research concepts and methodologies, including data analytics and modelling. As a result, a large amount of new insight and knowledge as well as innovations have been accumulated in the field, which provided us a timely opportunity to write this brief monograph to make an up-to-date review and summary of the observations, appropriate measures, modelling, and applications of heterogeneous bursty patterns occurring in the dynamics of human behaviour.
△ Less
Submitted 7 March, 2018;
originally announced March 2018.
-
Group formation on a small-world: experiment and modelling
Authors:
Kunal Bhattacharya,
Tuomas Takko,
Daniel Monsivais,
Kimmo Kaski
Abstract:
As a step towards studying human-agent collectives we conduct an online game with human participants cooperating on a network. The game is presented in the context of achieving group formation through local coordination. The players set initially to a small world network with limited information on the location of other players, coordinate their movements to arrange themselves into groups. To unde…
▽ More
As a step towards studying human-agent collectives we conduct an online game with human participants cooperating on a network. The game is presented in the context of achieving group formation through local coordination. The players set initially to a small world network with limited information on the location of other players, coordinate their movements to arrange themselves into groups. To understand the decision making process we construct a data-driven model of agents based on probability matching. The model allows us to gather insight into the nature and degree of rationality employed by the human players. By varying the parameters in agent based simulations we are able to benchmark the human behaviour. We observe that while the players utilize the neighbourhood information in limited capacity, the perception of risk is optimal. We also find that for certain parameter ranges the agents are able to act more efficiently when compared to the human players. This approach would allow us to simulate the collective dynamics in games with agents having varying strategies playing alongside human proxies.
△ Less
Submitted 10 July, 2019; v1 submitted 2 March, 2018;
originally announced March 2018.
-
Peer relations with mobile phone data: Best friends and family formation
Authors:
Tamas David-Barrett,
Anna Rotkirch,
Asim Ghosh,
Kunal Bhattacharya,
Daniel Monsivais,
Isabel Behncke,
Janos Kertesz,
Kimmo Kaski
Abstract:
Earlier attempts to investigate the changes of the role of friendship in different life stages have failed due to lack of data. We close this gap by using a large data set of mobile phone calls from a European country in 2007, to study how the people's call patterns to their close social contacts are associated with age and gender of the callers. We hypothesize that (i) communication with peers, d…
▽ More
Earlier attempts to investigate the changes of the role of friendship in different life stages have failed due to lack of data. We close this gap by using a large data set of mobile phone calls from a European country in 2007, to study how the people's call patterns to their close social contacts are associated with age and gender of the callers. We hypothesize that (i) communication with peers, defined as callers of similar age, will be most important during the period of family formation and that (ii) the importance of best friends defined as same-sex callers of exactly the same age, will be stronger for women than for men. Results show that the frequency of phone calls with the same-sex peers in this population turns out to be relatively stable through life for both men and women. In line with the first hypothesis, there was a significant increase in the length of the phone calls for callers between ages 30 to 40 years. Partly in line with the second hypothesis, the increase in phone calls turned out to be particularly pronounced among females, although there were only minor gender differences in call frequencies. Furthermore, women tended to have long phone conversations with their same-age female friend, and also with somewhat older peers. In sum, we provide evidence from big data for the adult life stages at which peers are most important, and suggest that best friends appear to have a niche of their own in human sociality.
△ Less
Submitted 25 August, 2017;
originally announced August 2017.
-
Network of families in a contemporary population: regional and cultural assortativity
Authors:
Kunal Bhattacharya,
Venla Berg,
Asim Ghosh,
Daniel Monsivais,
Janos Kertesz,
Anna Rotkirch,
Kimmo Kaski
Abstract:
Using a large dataset with individual-level demographic information of 60,000 families in contemporary Finland, we analyse the variation and cultural assortativity in a network of families. Families are considered as vertices and unions between males and females who have a common child and belong to different families are considered as edges in such a network of families. The sampled network is a…
▽ More
Using a large dataset with individual-level demographic information of 60,000 families in contemporary Finland, we analyse the variation and cultural assortativity in a network of families. Families are considered as vertices and unions between males and females who have a common child and belong to different families are considered as edges in such a network of families. The sampled network is a collection of many disjoint components with the largest connected component being dominated by families rooted in one specific region. We characterize the network in terms of the basic structural properties and then explore the network transitivity and assortativity with regards to regions of origin and linguistic identity. Transitivity is seen to result from linguistic homophily in the network. Overall, our results demonstrate that geographic proximity and language strongly influence the structuring of network.
△ Less
Submitted 22 August, 2017; v1 submitted 21 August, 2017;
originally announced August 2017.
-
Migration patterns across the life course of families: Gender differences and proximity with parents and siblings in Finland
Authors:
Asim Ghosh,
Venla Berg,
Kunal Bhattacharya,
Daniel Monsivais,
Janos Kertesz,
Kimmo Kaski,
Anna Rotkirch
Abstract:
Family members' life course tendencies to remain geographically close to each other or to migrate due to education or job opportunities have been studied relatively little. Here we investigate migration patterns of parents and their children between 19 administrative regions of Finland from 1970 to 2012. Using the FinnFamily register dataset of 60 000 index individuals and their family members, we…
▽ More
Family members' life course tendencies to remain geographically close to each other or to migrate due to education or job opportunities have been studied relatively little. Here we investigate migration patterns of parents and their children between 19 administrative regions of Finland from 1970 to 2012. Using the FinnFamily register dataset of 60 000 index individuals and their family members, we investigate the patterns of regional migration and regional co-residence of parents and their children. Specifically, we analyse how likely it is for children to reside in the same region as their parents at any specific age, whether parents and children who live in different regions are likely to reunite, and whether siblings function as regional attractors to each other. Results show an intense regional migration of people to the capital area. The migration propensity of individuals is high in early childhood and peaks in early adulthood. About two thirds of Finnish children live in the same region as their parents throughout their adult lives. Females show higher propensity to migrate than males, since daughters move away from their parents earlier and with a higher rate than sons do. The propensity for two full sibling brothers to be in the same region is higher than that for other types of sibling dyads. We conclude that family members serve as important geographical attractors to each other through the life course and that family attraction is stronger for sons and brothers than for daughters and sisters in contemporary Finland.
△ Less
Submitted 8 August, 2017;
originally announced August 2017.
-
Service adoption spreading in online social networks
Authors:
Gerardo Iñiguez,
Zhongyuan Ruan,
Kimmo Kaski,
János Kertész,
Márton Karsai
Abstract:
The collective behaviour of people adopting an innovation, product or online service is commonly interpreted as a spreading phenomenon throughout the fabric of society. This process is arguably driven by social influence, social learning and by external effects like media. Observations of such processes date back to the seminal studies by Rogers and Bass, and their mathematical modelling has taken…
▽ More
The collective behaviour of people adopting an innovation, product or online service is commonly interpreted as a spreading phenomenon throughout the fabric of society. This process is arguably driven by social influence, social learning and by external effects like media. Observations of such processes date back to the seminal studies by Rogers and Bass, and their mathematical modelling has taken two directions: One paradigm, called simple contagion, identifies adoption spreading with an epidemic process. The other one, named complex contagion, is concerned with behavioural thresholds and successfully explains the emergence of large cascades of adoption resulting in a rapid spreading often seen in empirical data. The observation of real world adoption processes has become easier lately due to the availability of large digital social network and behavioural datasets. This has allowed simultaneous study of network structures and dynamics of online service adoption, shedding light on the mechanisms and external effects that influence the temporal evolution of behavioural or innovation adoption. These advancements have induced the development of more realistic models of social spreading phenomena, which in turn have provided remarkably good predictions of various empirical adoption processes. In this chapter we review recent data-driven studies addressing real-world service adoption processes. Our studies provide the first detailed empirical evidence of a heterogeneous threshold distribution in adoption. We also describe the modelling of such phenomena with formal methods and data-driven simulations. Our objective is to understand the effects of identified social mechanisms on service adoption spreading, and to provide potential new directions and open questions for future research.
△ Less
Submitted 29 June, 2017;
originally announced June 2017.
-
Stochastic Block Model Reveals the Map of Citation Patterns and Their Evolution in Time
Authors:
Darko Hric,
Kimmo Kaski,
Mikko Kivelä
Abstract:
In this study we map out the large-scale structure of citation networks of science journals and follow their evolution in time by using stochastic block models (SBMs). The SBM fitting procedures are principled methods that can be used to find hierarchical grou** of journals into blocks that show similar incoming and outgoing citations patterns. These methods work directly on the citation network…
▽ More
In this study we map out the large-scale structure of citation networks of science journals and follow their evolution in time by using stochastic block models (SBMs). The SBM fitting procedures are principled methods that can be used to find hierarchical grou** of journals into blocks that show similar incoming and outgoing citations patterns. These methods work directly on the citation network without the need to construct auxiliary networks based on similarity of nodes. We fit the SBMs to the networks of journals we have constructed from the data set of around 630 million citations and find a variety of different types of blocks, such as clusters, bridges, sources, and sinks. In addition we use a recent generalization of SBMs to determine how much a manually curated classification of journals into subfields of science is related to the block structure of the journal network and how this relationship changes in time. The SBM method tries to find a network of blocks that is the best high-level representation of the network of journals, and we illustrate how these block networks (at various levels of resolution) can be used as maps of science.
△ Less
Submitted 28 April, 2017;
originally announced May 2017.
-
Tracking Urban Human Activity from Mobile Phone Calling Patterns
Authors:
Daniel Monsivais,
Asim Ghosh,
Kunal Bhattacharya,
Robin I. M Dunbar,
Kimmo Kaski
Abstract:
Timings of human activities are marked by circadian clocks which in turn are entrained to different environmental signals. In an urban environment the presence of artificial lighting and various social cues tend to disrupt the natural entrainment with the sunlight. However, it is not completely understood to what extent this is the case. Here we exploit the large-scale data analysis techniques to…
▽ More
Timings of human activities are marked by circadian clocks which in turn are entrained to different environmental signals. In an urban environment the presence of artificial lighting and various social cues tend to disrupt the natural entrainment with the sunlight. However, it is not completely understood to what extent this is the case. Here we exploit the large-scale data analysis techniques to study the mobile phone calling activity of people in large cities to infer the dynamics of urban daily rhythms. From the calling patterns of about 1,000,000 users spread over different cities but lying inside the same time-zone, we show that the onset and termination of the calling activity synchronizes with the east-west progression of the sun. We also find that the onset and termination of the calling activity of users follows a yearly dynamics, varying across seasons, and that its timings are entrained to solar midnight. Furthermore, we show that the average mid-sleep time of people living in urban areas depends on the age and gender of each cohort as a result of biological and social factors.
△ Less
Submitted 17 October, 2017; v1 submitted 20 April, 2017;
originally announced April 2017.
-
Modelling community formation driven by the status of individual in a society
Authors:
Jan E. Snellman,
Gerardo Iñiguez,
Tzipe Govezensky,
Rafael A. Barrio,
Kimmo K. Kaski
Abstract:
In human societies, people's willingness to compete and strive for better social status as well as being envious of those perceived in some way superior lead to social structures that are intrinsically hierarchical. Here we propose an agent-based, network model to mimic the ranking behaviour of individuals and its possible repercussions in human society. The main ingredient of the model is the ass…
▽ More
In human societies, people's willingness to compete and strive for better social status as well as being envious of those perceived in some way superior lead to social structures that are intrinsically hierarchical. Here we propose an agent-based, network model to mimic the ranking behaviour of individuals and its possible repercussions in human society. The main ingredient of the model is the assumption that the relevant feature of social interactions is each individual's keenness to maximise his or her status relative to others. The social networks produced by the model are homophilous and assortative, as frequently observed in human communities and most of the network properties seem quite independent of its size. However, it is seen that for small number of agents the resulting network consists of disjoint weakly connected communities while being highly assortative and homophilic. On the other hand larger networks turn out to be more cohesive with larger communities but less homophilic. We find that the reason for these changes is that larger network size allows agents to use new strategies for maximizing their social status allowing for more diverse links between them.
△ Less
Submitted 8 February, 2017;
originally announced February 2017.
-
Quantifying gender preferences across humans lifespan
Authors:
Asim Ghosh,
Daniel Monsivais,
Kunal Bhattacharya,
Robin I. M. Dunbar,
Kimmo Kaski
Abstract:
In human relations individuals' gender and age play a key role in the structures and dynamics of their social arrangements. In order to analyze the gender preferences of individuals in interaction with others at different stages of their lives we study a large mobile phone dataset. To do this we consider four fundamental gender-related caller and callee combinations of human interactions, namely m…
▽ More
In human relations individuals' gender and age play a key role in the structures and dynamics of their social arrangements. In order to analyze the gender preferences of individuals in interaction with others at different stages of their lives we study a large mobile phone dataset. To do this we consider four fundamental gender-related caller and callee combinations of human interactions, namely male to male, male to female, female to male, and female to female, which together with age, kinship, and different levels of friendship give rise to a wide scope of human sociality. Here we analyse the relative strength of these four types of interaction using a large dataset of mobile phone communication records. Our analysis suggests strong age dependence for an ego of one gender choosing to call an individual of either gender. We observe a strong opposite sex bonding across most of their reproductive age. However, older women show a strong tendency to connect to another female that is one generation younger in a way that is suggestive of the \emph{grandmothering effect}. We also find that the relative strength among the four possible interactions depends on phone call duration. For calls of medium and long duration, opposite gender interactions are significantly more probable than same gender interactions during the reproductive years, suggesting potential emotional exchange between spouses. By measuring the fraction of calls to other generations we find that mothers tend to make calls more to their daughters than to their sons, whereas fathers make calls more to their sons than to their daughters. For younger people, most of their calls go to same generation alters, while older people call the younger people more frequently, which supports the suggestion that \emph{affection flows downward}.
△ Less
Submitted 18 November, 2016;
originally announced November 2016.
-
Stylized facts in social networks: Community-based static modeling
Authors:
Hang-Hyun Jo,
Yohsuke Murase,
János Török,
János Kertész,
Kimmo Kaski
Abstract:
The past analyses of datasets of social networks have enabled us to make empirical findings of a number of aspects of human society, which are commonly featured as stylized facts of social networks, such as broad distributions of network quantities, existence of communities, assortative mixing, and intensity-topology correlations. Since the understanding of the structure of these complex social ne…
▽ More
The past analyses of datasets of social networks have enabled us to make empirical findings of a number of aspects of human society, which are commonly featured as stylized facts of social networks, such as broad distributions of network quantities, existence of communities, assortative mixing, and intensity-topology correlations. Since the understanding of the structure of these complex social networks is far from complete, for deeper insight into human society more comprehensive datasets and modeling of the stylized facts are needed. Although the existing dynamical and static models can generate some stylized facts, here we take an alternative approach by devising a community-based static model with heterogeneous community sizes and larger communities having smaller link density and weight. With these few assumptions we are able to generate realistic social networks that show most stylized facts for a wide range of parameters, as demonstrated numerically and analytically. Since our community-based static model is simple to implement and easily scalable, it can be used as a reference system, benchmark, or testbed for further applications.
△ Less
Submitted 8 August, 2017; v1 submitted 11 November, 2016;
originally announced November 2016.
-
Multiplex Modeling of the Society
Authors:
Janos Kertesz,
Janos Torok,
Yohsuke Murase,
Hang-Hyun Jo,
Kimmo Kaski
Abstract:
The society has a multi-layered structure, where the layers represent the different contexts. To model this structure we begin with a single-layer weighted social network (WSN) model showing the Granovetterian structure. We find that when merging such WSN models, a sufficient amount of inter-layer correlation is needed to maintain the relationship between topology and link weights, while these cor…
▽ More
The society has a multi-layered structure, where the layers represent the different contexts. To model this structure we begin with a single-layer weighted social network (WSN) model showing the Granovetterian structure. We find that when merging such WSN models, a sufficient amount of inter-layer correlation is needed to maintain the relationship between topology and link weights, while these correlations destroy the enhancement in the community overlap due to multiple layers. To resolve this, we devise a geographic multi-layer WSN model, where the indirect inter-layer correlations due to the geographic constraints of individuals enhance the overlaps between the communities and, at the same time, the Granovetterian structure is preserved. Furthermore, the network of social interactions can be considered as a multiplex from another point of view too: each layer corresponds to one communication channel and the aggregate of all them constitutes the entire social network. However, usually one has information only about one of the channels, which should be considered as a sample of the whole. Here we show by simulations and analytical methods that this sampling may lead to bias. For example, while it is expected that the degree distribution of the whole social network has a maximum at a value larger than one, we get with reasonable assumptions about the sampling process a monotonously decreasing distribution as observed in empirical studies of single channel data. We analyse the far-reaching consequences of our findings.
△ Less
Submitted 27 September, 2016;
originally announced September 2016.
-
Absence makes the heart grow fonder: social compensation when failure to interact risks weakening a relationship
Authors:
Kunal Bhattacharya,
Asim Ghosh,
Daniel Monsivais,
Robin Dunbar,
Kimmo Kaski
Abstract:
Social networks require active relationship maintenance if they are to be kept at a constant level of emotional closeness. For primates, including humans, failure to interact leads inexorably to a decline in relationship quality, and a consequent loss of the benefits that derive from individual relationships. As a result, many social species compensate for weakened relationships by investing more…
▽ More
Social networks require active relationship maintenance if they are to be kept at a constant level of emotional closeness. For primates, including humans, failure to interact leads inexorably to a decline in relationship quality, and a consequent loss of the benefits that derive from individual relationships. As a result, many social species compensate for weakened relationships by investing more heavily in them. Here we study how humans behave in similar situations, using data from mobile call detail records from a European country. For the less frequent contacts between pairs of communicating individuals we observe a logarithmic dependence of the duration of the succeeding call on the time gap with the previous call. We find that such behaviour is likely when the individuals in these dyadic pairs have the same gender and are in the same age bracket as well as being geographically distant. Our results indicate that these pairs deliberately invest more time in communication so as to reinforce their social bonding and prevent their relationships decaying when these are threatened by lack of interaction.
△ Less
Submitted 5 August, 2016;
originally announced August 2016.
-
Seasonal and geographical impact on human resting periods
Authors:
Daniel Monsivais,
Kunal Bhattacharya,
Asim Ghosh,
Robin I. M. Dunbar,
Kimmo Kaski
Abstract:
We study the influence of seasonally and geographically related daily dynamics of daylight and ambient temperature on human resting or slee** patterns using mobile phone data of a large number of individuals. We observe two daily inactivity periods in the people's aggregated mobile phone calling patterns and infer these to represent the resting times of the population. We find that the nocturnal…
▽ More
We study the influence of seasonally and geographically related daily dynamics of daylight and ambient temperature on human resting or slee** patterns using mobile phone data of a large number of individuals. We observe two daily inactivity periods in the people's aggregated mobile phone calling patterns and infer these to represent the resting times of the population. We find that the nocturnal resting period is strongly influenced by the length of daylight, and that its seasonal variation depends on the latitude, such that for people living in two different cities separated by eight latitudinal degrees, the difference in the resting period of people between the summer and winter in southern cities is almost twice that in the northern cities. We also observe that the duration of the afternoon resting period is influenced by the temperature, and that there is a threshold from which this influence sets in. Finally, we observe that the yearly dynamics of the afternoon and nocturnal resting periods appear to be counterbalancing each other. This also lends support to the notion that the total daily resting time of people is more or less conserved across the year.
△ Less
Submitted 20 April, 2017; v1 submitted 21 July, 2016;
originally announced July 2016.
-
Modelling Trading Networks and the Role of Trust
Authors:
Rafael A. Barrio,
Tzipe Govezensky,
Élfego Ruiz-Gutiérrez,
Kimmo K. Kaski
Abstract:
We present a simple dynamical model for describing trading interactions between agents in a social network by considering only two dynamical variables, namely money and goods or services, that are assumed conserved over the whole time span of the agents' trading transactions. A key feature of the model is that agent-to-agent transactions are governed by the price in units of money per goods, which…
▽ More
We present a simple dynamical model for describing trading interactions between agents in a social network by considering only two dynamical variables, namely money and goods or services, that are assumed conserved over the whole time span of the agents' trading transactions. A key feature of the model is that agent-to-agent transactions are governed by the price in units of money per goods, which is dynamically changing, and by a trust variable, which is related to the trading history of each agent. All agents are able to sell or buy, and the decision to do either has to do with the level of trust the buyer has in the seller, the price of the goods and the amount of money and goods at the disposal of the buyer. Here we show the results of extensive numerical calculations under various initial conditions in a random network of agents and compare the results with the available related data. In most cases the agreement between the model results and real data turns out to be fairly good, which allow us to draw some general conclusions as how different trading strategies could affect the distribution of wealth in different kinds of societies.
△ Less
Submitted 28 May, 2016;
originally announced May 2016.
-
Calling Dunbar's Numbers
Authors:
Pádraig MacCarron,
Kimmo Kaski,
Robin Dunbar
Abstract:
The social brain hypothesis predicts that humans have an average of about 150 relationships at any given time. Within this 150, there are layers of friends of an ego, where the number of friends in a layer increases as the emotional closeness decreases. Here we analyse a mobile phone dataset, firstly, to ascertain whether layers of friends can be identified based on call frequency. We then apply d…
▽ More
The social brain hypothesis predicts that humans have an average of about 150 relationships at any given time. Within this 150, there are layers of friends of an ego, where the number of friends in a layer increases as the emotional closeness decreases. Here we analyse a mobile phone dataset, firstly, to ascertain whether layers of friends can be identified based on call frequency. We then apply different clustering algorithms to break the call frequency of egos into clusters and compare the number of alters in each cluster with the layer size predicted by the social brain hypothesis. In this dataset we find strong evidence for the existence of a layered structure. The clustering yields results that match well with previous studies for the innermost and outermost layers, but for layers in between we observe large variability.
△ Less
Submitted 4 August, 2016; v1 submitted 8 April, 2016;
originally announced April 2016.
-
Local cascades induced global contagion: How heterogeneous thresholds, exogenous effects, and unconcerned behaviour govern online adoption spreading
Authors:
Márton Karsai,
Gerardo Iñiguez,
Riivo Kikas,
Kimmo Kaski,
János Kertész
Abstract:
Adoption of innovations, products or online services is commonly interpreted as a spreading process driven to large extent by social influence and conditioned by the needs and capacities of individuals. To model this process one usually introduces behavioural threshold mechanisms, which can give rise to the evolution of global cascades if the system satisfies a set of conditions. However, these mo…
▽ More
Adoption of innovations, products or online services is commonly interpreted as a spreading process driven to large extent by social influence and conditioned by the needs and capacities of individuals. To model this process one usually introduces behavioural threshold mechanisms, which can give rise to the evolution of global cascades if the system satisfies a set of conditions. However, these models do not address temporal aspects of the emerging cascades, which in real systems may evolve through various pathways ranging from slow to rapid patterns. Here we fill this gap through the analysis and modelling of product adoption in the world's largest voice over internet service, the social network of Skype. We provide empirical evidence about the heterogeneous distribution of fractional behavioural thresholds, which appears to be independent of the degree of adopting egos. We show that the structure of real-world adoption clusters is radically different from previous theoretical expectations, since vulnerable adoptions --induced by a single adopting neighbour-- appear to be important only locally, while spontaneous adopters arriving at a constant rate and the involvement of unconcerned individuals govern the global emergence of social spreading.
△ Less
Submitted 29 January, 2016;
originally announced January 2016.
-
Communication with family and friends across the life course
Authors:
Tamas David-Barrett,
Janos Kertesz,
Anna Rotkirch,
Asim Ghosh,
Kunal Bhattacharya,
Daniel Monsivais,
Kimmo Kaski
Abstract:
Each stage of the human life course is characterized by a distinctive pattern of social relations. We study how the intensity and importance of the closest social contacts vary across the life course, using a large database of mobile communication from a European country. We first determine the most likely social relationship type from these mobile phone records by relating the age and gender of t…
▽ More
Each stage of the human life course is characterized by a distinctive pattern of social relations. We study how the intensity and importance of the closest social contacts vary across the life course, using a large database of mobile communication from a European country. We first determine the most likely social relationship type from these mobile phone records by relating the age and gender of the caller and recipient to the frequency, length, and direction of calls. We then show how communication patterns between parents and children, romantic partner, and friends vary across the six main stages of the adult family life course. Young adulthood is dominated by a gradual shift of call activity from parents to close friends, and then to a romantic partner, culminating in the period of early family formation during which the focus is on the romantic partner. During middle adulthood call patterns suggest a high dependence on the parents of the ego, who, presumably often provide alloparental care, while at this stage female same-gender friendship also peaks. During post-reproductive adulthood, individuals and especially women balance close social contacts among three generations. The age of grandparenthood brings the children entering adulthood and family formation into the focus, and is associated with a realignment of close social contacts especially among women, while the old age is dominated by dependence on their children.
△ Less
Submitted 30 December, 2015;
originally announced December 2015.
-
What does Big Data tell? Sampling the social network by communication channels
Authors:
János Török,
Yohsuke Murase,
Hang-Hyun Jo,
János Kertész,
Kimmo Kaski
Abstract:
Big Data has become the primary source of understanding the structure and dynamics of the society at large scale. The network of social interactions can be considered as a multiplex, where each layer corresponds to one communication channel and the aggregate of all of them constitutes the entire social network. However, usually one has information only about one of the channels or even a part of i…
▽ More
Big Data has become the primary source of understanding the structure and dynamics of the society at large scale. The network of social interactions can be considered as a multiplex, where each layer corresponds to one communication channel and the aggregate of all of them constitutes the entire social network. However, usually one has information only about one of the channels or even a part of it, which should be considered as a subset or sample of the whole. Here we introduce a model based on a natural bilateral communication channel selection mechanism, which for one channel leads to consistent changes in the network properties. For example, while it is expected that the degree distribution of the whole social network has a maximum at a value larger than one, we get a monotonously decreasing distribution as observed in empirical studies of single channel data. We also find that assortativity may occur or get strengthened due to the sampling method. We analyze the far-reaching consequences of our findings.
△ Less
Submitted 28 October, 2016; v1 submitted 27 November, 2015;
originally announced November 2015.
-
Dynamics of deceptive interactions in social networks
Authors:
Rafael A. Barrio,
Tzipe Govezensky,
Robin Dunbar,
Gerardo Iñiguez,
Kimmo Kaski
Abstract:
In this paper we examine the role of lies in human social relations by implementing some salient characteristics of deceptive interactions into an opinion formation model, so as to describe the dynamical behaviour of a social network more realistically. In this model we take into account such basic properties of social networks as the dynamics of the intensity of interactions, the influence of pub…
▽ More
In this paper we examine the role of lies in human social relations by implementing some salient characteristics of deceptive interactions into an opinion formation model, so as to describe the dynamical behaviour of a social network more realistically. In this model we take into account such basic properties of social networks as the dynamics of the intensity of interactions, the influence of public opinion, and the fact that in every human interaction it might be convenient to deceive or withhold information depending on the instantaneous situation of each individual in the network. We find that lies shape the topology of social networks, especially the formation of tightly linked, small communities with loose connections between them. We also find that agents with a larger proportion of deceptive interactions are the ones that connect communities of different opinion, and in this sense they have substantial centrality in the network. We then discuss the consequences of these results for the social behaviour of humans and predict the changes that could arise due to a varying tolerance for lies in society.
△ Less
Submitted 13 September, 2015;
originally announced September 2015.
-
Sex differences in social focus across the lifecycle in humans
Authors:
Kunal Bhattacharya,
Asim Ghosh,
Daniel Monsivais,
Robin I. M. Dunbar,
Kimmo Kaski
Abstract:
Age and gender are two important factors that play crucial roles in the way organisms allocate their social effort. In this study, we analyse a large mobile phone dataset to explore the way lifehistory influences human sociality and the way social networks are structured. Our results indicate that these aspects of human behaviour are strongly related to the age and gender such that younger individ…
▽ More
Age and gender are two important factors that play crucial roles in the way organisms allocate their social effort. In this study, we analyse a large mobile phone dataset to explore the way lifehistory influences human sociality and the way social networks are structured. Our results indicate that these aspects of human behaviour are strongly related to the age and gender such that younger individuals have more contacts and, among them, males more than females. However, the rate of decrease in the number of contacts with age differs between males and females, such that there is a reversal in the number of contacts around the late 30s. We suggest that this pattern can be attributed to the difference in reproductive investments that are made by the two sexes. We analyse the inequality in social investment patterns and suggest that the age and gender-related differences that we find reflect the constraints imposed by reproduction in a context where time (a form of social capital) is limited.
△ Less
Submitted 27 August, 2015;
originally announced August 2015.
-
Correlated bursts and the role of memory range
Authors:
Hang-Hyun Jo,
Juan I. Perotti,
Kimmo Kaski,
Janos Kertesz
Abstract:
Inhomogeneous temporal processes in natural and social phenomena have been described by bursts that are rapidly occurring events within short time periods alternating with long periods of low activity. In addition to the analysis of heavy-tailed inter-event time distributions, higher-order correlations between inter-event times, called correlated bursts, have been studied only recently. As the pos…
▽ More
Inhomogeneous temporal processes in natural and social phenomena have been described by bursts that are rapidly occurring events within short time periods alternating with long periods of low activity. In addition to the analysis of heavy-tailed inter-event time distributions, higher-order correlations between inter-event times, called correlated bursts, have been studied only recently. As the possible mechanisms underlying such correlated bursts are far from being fully understood, we devise a simple model for correlated bursts by using a self-exciting point process with variable memory range. Here the probability that a new event occurs is determined by a memory function that is the sum of decaying memories of the past events. In order to incorporate the noise and/or limited memory capacity of systems, we apply two memory loss mechanisms, namely either fixed number or variable number of memories. By using theoretical analysis and numerical simulations we find that excessive amount of memory effect may lead to a Poissonian process, which implies that for memory effect there exists an intermediate range that will generate correlated bursts of magnitude comparable to empirical findings. Hence our results provide deeper understanding of how long-range memory affects correlated bursts.
△ Less
Submitted 13 July, 2015; v1 submitted 8 May, 2015;
originally announced May 2015.
-
Modeling the role of relationship fading and breakup in social network formation
Authors:
Yohsuke Murase,
Hang-Hyun Jo,
János Török,
János Kertész,
Kimmo Kaski
Abstract:
In social networks of human individuals, social relationships do not necessarily last forever as they can either fade gradually with time, resulting in link aging, or terminate abruptly, causing link deletion, as even old friendships may cease. In this paper, we study a social network formation model where we introduce several ways by which a link termination takes place. If we adopt the link agin…
▽ More
In social networks of human individuals, social relationships do not necessarily last forever as they can either fade gradually with time, resulting in link aging, or terminate abruptly, causing link deletion, as even old friendships may cease. In this paper, we study a social network formation model where we introduce several ways by which a link termination takes place. If we adopt the link aging, we get a more modular structure with more homogeneously distributed link weights within communities than when link deletion is used. By investigating distributions and relations of various network characteristics, we find that the empirical findings are better reproduced with the link deletion model. This indicates that link deletion plays a more prominent role in organizing social networks than link aging.
△ Less
Submitted 22 June, 2015; v1 submitted 4 May, 2015;
originally announced May 2015.
-
Attention decay in science
Authors:
Pietro Della Briotta Parolo,
Raj Kumar Pan,
Rumi Ghosh,
Bernardo A. Huberman,
Kimmo Kaski,
Santo Fortunato
Abstract:
The exponential growth in the number of scientific papers makes it increasingly difficult for researchers to keep track of all the publications relevant to their work. Consequently, the attention that can be devoted to individual papers, measured by their citation counts, is bound to decay rapidly. In this work we make a thorough study of the life-cycle of papers in different disciplines. Typicall…
▽ More
The exponential growth in the number of scientific papers makes it increasingly difficult for researchers to keep track of all the publications relevant to their work. Consequently, the attention that can be devoted to individual papers, measured by their citation counts, is bound to decay rapidly. In this work we make a thorough study of the life-cycle of papers in different disciplines. Typically, the citation rate of a paper increases up to a few years after its publication, reaches a peak and then decreases rapidly. This decay can be described by an exponential or a power law behavior, as in ultradiffusive processes, with exponential fitting better than power law for the majority of cases. The decay is also becoming faster over the years, signaling that nowadays papers are forgotten more quickly. However, when time is counted in terms of the number of published papers, the rate of decay of citations is fairly independent of the period considered. This indicates that the attention of scholars depends on the number of published items, and not on real time.
△ Less
Submitted 23 November, 2015; v1 submitted 6 March, 2015;
originally announced March 2015.
-
Multilayer weighted social network model
Authors:
Yohsuke Murase,
János Török,
Hang-Hyun Jo,
Kimmo Kaski,
János Kertész
Abstract:
Recent empirical studies using large-scale data sets have validated the Granovetter hypothesis on the structure of the society in that there are strongly wired communities connected by weak ties. However, as interaction between individuals takes place in diverse contexts, these communities turn out to be overlap**. This implies that the society has a multilayered structure, where the layers repr…
▽ More
Recent empirical studies using large-scale data sets have validated the Granovetter hypothesis on the structure of the society in that there are strongly wired communities connected by weak ties. However, as interaction between individuals takes place in diverse contexts, these communities turn out to be overlap**. This implies that the society has a multilayered structure, where the layers represent the different contexts. To model this structure we begin with a single-layer weighted social network (WSN) model showing the Granovetterian structure. We find that when merging such WSN models, a sufficient amount of interlayer correlation is needed to maintain the relationship between topology and link weights, while these correlations destroy the enhancement in the community overlap due to multiple layers. To resolve this, we devise a geographic multilayer WSN model, where the indirect interlayer correlations due to the geographic constraints of individuals enhance the overlaps between the communities and, at the same time, the Granovetterian structure is preserved.
△ Less
Submitted 10 November, 2014; v1 submitted 6 August, 2014;
originally announced August 2014.
-
Spatial patterns of close relationships across the lifespan
Authors:
Hang-Hyun Jo,
Jari Saramäki,
Robin I. M. Dunbar,
Kimmo Kaski
Abstract:
The dynamics of close relationships is important for understanding the migration patterns of individual life-courses. The bottom-up approach to this subject by social scientists has been limited by sample size, while the more recent top-down approach using large-scale datasets suffers from a lack of detail about the human individuals. We incorporate the geographic and demographic information of mi…
▽ More
The dynamics of close relationships is important for understanding the migration patterns of individual life-courses. The bottom-up approach to this subject by social scientists has been limited by sample size, while the more recent top-down approach using large-scale datasets suffers from a lack of detail about the human individuals. We incorporate the geographic and demographic information of millions of mobile phone users with their communication patterns to study the dynamics of close relationships and its effect in their life-course migration. We demonstrate how the close age- and sex-biased dyadic relationships are correlated with the geographic proximity of the pair of individuals, e.g., young couples tend to live further from each other than old couples. In addition, we find that emotionally closer pairs are living geographically closer to each other. These findings imply that the life-course framework is crucial for understanding the complex dynamics of close relationships and their effect on the migration patterns of human individuals.
△ Less
Submitted 17 September, 2014; v1 submitted 18 July, 2014;
originally announced July 2014.