-
Higher-order modeling of face-to-face interactions
Authors:
Luca Gallo,
Chiara Zappalà,
Fariba Karimi,
Federico Battiston
Abstract:
The most fundamental social interactions among humans occur face to face. Their features have been extensively studied in recent years, owing to the availability of high-resolution data on individuals' proximity. Mathematical models based on mobile agents have been crucial to understand the spatio-temporal organization of face-to-face interactions. However, these models focus on dyadic relationshi…
▽ More
The most fundamental social interactions among humans occur face to face. Their features have been extensively studied in recent years, owing to the availability of high-resolution data on individuals' proximity. Mathematical models based on mobile agents have been crucial to understand the spatio-temporal organization of face-to-face interactions. However, these models focus on dyadic relationships only, failing to characterize interactions in larger groups of individuals. Here, we propose a model in which agents interact with each other by forming groups of different sizes. Each group has a degree of social attractiveness, based on which neighboring agents decide whether to join. Our framework reproduces different properties of groups in face-to-face interactions, including their distribution, the correlation in their number, and their persistence in time, which cannot be replicated by dyadic models. Furthermore, it captures homophilic patterns at the level of higher-order interactions, going beyond standard pairwise approaches. Our work sheds light on the higher-order mechanisms at the heart of human face-to-face interactions, paving the way for further investigation of how group dynamics at a microscopic scale affects social phenomena at a macroscopic scale.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
The diaspora model for human migration
Authors:
Rafael Prieto-Curiel,
Ola Ali,
Elma Dervic,
Fariba Karimi,
Elisa Omodei,
Rainer Stütz,
Georg Heiler,
Yurij Holovatch
Abstract:
Migration's impact spans various social dimensions, including demography, sustainability, politics, economy and gender disparities. Yet, the decision-making process behind migrants choosing their destination remains elusive. Existing models primarily rely on population size and travel distance to explain flow fluctuations, overlooking significant population heterogeneities. Paradoxically, migrants…
▽ More
Migration's impact spans various social dimensions, including demography, sustainability, politics, economy and gender disparities. Yet, the decision-making process behind migrants choosing their destination remains elusive. Existing models primarily rely on population size and travel distance to explain flow fluctuations, overlooking significant population heterogeneities. Paradoxically, migrants often travel long distances and to smaller destinations if their diaspora is present in those locations. To address this gap, we propose the diaspora model of migration, incorporating intensity (the number of people moving to a country) and assortativity (the destination within the country). Our model considers only the existing diaspora sizes in the destination country, influencing the probability of migrants selecting a specific residence. Despite its simplicity, our model accurately reproduces the observed stable flow and distribution of migration in Austria (postal code level) and US metropolitan areas, yielding precise estimates of migrant inflow at various geographic scales. Given the increase in international migrations due to recent natural and societal crises, this study enlightens our understanding of migration flow heterogeneities, hel** design more inclusive, integrated cities.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Social network modeling and applications, a tutorial
Authors:
Lisette Espín-Noboa,
Tiago Peixoto,
Fariba Karimi
Abstract:
Social networks have been widely studied over the last century from multiple disciplines to understand societal issues such as inequality in employment rates, managerial performance, and epidemic spread. Today, these and many more issues can be studied at global scale thanks to the digital footprints that we generate when browsing the Web or using social media platforms. Unfortunately, scientists…
▽ More
Social networks have been widely studied over the last century from multiple disciplines to understand societal issues such as inequality in employment rates, managerial performance, and epidemic spread. Today, these and many more issues can be studied at global scale thanks to the digital footprints that we generate when browsing the Web or using social media platforms. Unfortunately, scientists often struggle to access to such data primarily because it is proprietary, and even when it is shared with privacy guarantees, such data is either no representative or too big. In this tutorial, we will discuss recent advances and future directions in network modeling. In particular, we focus on how to exploit synthetic networks to study real-world problems such as data privacy, spreading dynamics, algorithmic bias, and ranking inequalities. We start by reviewing different types of generative models for social networks including node-attributed and scale-free networks. Then, we showcase how to perform a network selection analysis to characterize the mechanisms of edge formation of any given real-world network.
△ Less
Submitted 19 June, 2023;
originally announced June 2023.
-
On the inadequacy of nominal assortativity for assessing homophily in networks
Authors:
Fariba Karimi,
Marcos Oliveira
Abstract:
Nominal assortativity (or discrete assortativity) is widely used to characterize group mixing patterns and homophily in networks, enabling researchers to analyze how groups interact with one another. Here we demonstrate that the measure presents severe shortcomings when applied to networks with unequal group sizes and asymmetric mixing. We characterize these shortcomings analytically and use synth…
▽ More
Nominal assortativity (or discrete assortativity) is widely used to characterize group mixing patterns and homophily in networks, enabling researchers to analyze how groups interact with one another. Here we demonstrate that the measure presents severe shortcomings when applied to networks with unequal group sizes and asymmetric mixing. We characterize these shortcomings analytically and use synthetic and empirical networks to show that nominal assortativity fails to account for group imbalance and asymmetric group interactions, thereby producing an inaccurate characterization of mixing patterns. We propose adjusted nominal assortativity and show that this adjustment recovers the expected assortativity in networks with various level of mixing. Furthermore, we propose an analytical method to assess asymmetric mixing by estimating the tendency of inter- and intra-group connectivities. Finally, we discuss how this approach enables uncovering hidden mixing patterns in real-world networks.
△ Less
Submitted 5 September, 2023; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Longitudinal Analysis of Heart Rate and Physical Activity Collected from Smartwatches
Authors:
Fatemeh Karimi,
Zohre Amoozgar,
Reza Reiazi,
Mehdi Hosseinzadeh,
Reza Rawassizadeh
Abstract:
Smartwatches (SWs) can continuously and autonomously monitor vital signs, including heart rates and physical activities involving wrist movement. The monitoring capability of SWs has several key health benefits arising from their role in preventive and diagnostic medicine. Current research, however, has not explored many of these opportunities, including longitudinal studies. In our work, we gathe…
▽ More
Smartwatches (SWs) can continuously and autonomously monitor vital signs, including heart rates and physical activities involving wrist movement. The monitoring capability of SWs has several key health benefits arising from their role in preventive and diagnostic medicine. Current research, however, has not explored many of these opportunities, including longitudinal studies. In our work, we gathered longitudinal data points, e.g., heart rate and physical activity, from various brands of SWs worn by 1,014 users. Our analysis shows three common heart rate patterns during sleep but two common patterns during the day. We find that heart rate and physical activities are higher in summer and the first month of the new year compared to other months. Moreover, physical activities are reduced on weekends compared with weekdays. Interestingly, the highest peak of physical activity is during the evening.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
LightDepth: A Resource Efficient Depth Estimation Approach for Dealing with Ground Truth Sparsity via Curriculum Learning
Authors:
Fatemeh Karimi,
Amir Mehrpanah,
Reza Rawassizadeh
Abstract:
Advances in neural networks enable tackling complex computer vision tasks such as depth estimation of outdoor scenes at unprecedented accuracy. Promising research has been done on depth estimation. However, current efforts are computationally resource-intensive and do not consider the resource constraints of autonomous devices, such as robots and drones. In this work, we present a fast and battery…
▽ More
Advances in neural networks enable tackling complex computer vision tasks such as depth estimation of outdoor scenes at unprecedented accuracy. Promising research has been done on depth estimation. However, current efforts are computationally resource-intensive and do not consider the resource constraints of autonomous devices, such as robots and drones. In this work, we present a fast and battery-efficient approach for depth estimation. Our approach devises model-agnostic curriculum-based learning for depth estimation. Our experiments show that the accuracy of our model performs on par with the state-of-the-art models, while its response time outperforms other models by 71%. All codes are available online at https://github.com/fatemehkarimii/LightDepth.
△ Less
Submitted 19 November, 2022; v1 submitted 15 November, 2022;
originally announced November 2022.
-
Improving the visibility of minorities through network growth interventions
Authors:
Leonie Neuhäuser,
Fariba Karimi,
Jan Bachmann,
Markus Strohmaier,
Michael T. Schaub
Abstract:
Improving the position of minorities in networks via interventions is a challenge of high theoretical and societal importance. In this work, we examine how different network growth interventions impact the position of minority nodes in degree rankings over time. We distinguish between two kinds of interventions: (i) group size interventions, such as introducing quotas, that regulate the ratio of i…
▽ More
Improving the position of minorities in networks via interventions is a challenge of high theoretical and societal importance. In this work, we examine how different network growth interventions impact the position of minority nodes in degree rankings over time. We distinguish between two kinds of interventions: (i) group size interventions, such as introducing quotas, that regulate the ratio of incoming minority and majority nodes; and (ii) behavioural interventions, such as homophily, i.e. varying how groups interact and connect to each other. We find that even extreme group size interventions do not have a strong effect on the position of minorities in rankings if certain behavioural changes do not manifest at the same time. For example, minority representation in rankings is not increased by high quotas if the actors in the network do not adopt homophilic behaviour. As a result, a key finding of our research is that in order for the visibility of minorities to improve, group size and behavioural interventions need to be coordinated. Moreover, their potential benefit is highly dependent on pre-intervention conditions in social networks. In a real-world case study, we explore the effectiveness of interventions to reach gender parity in academia. Our work lays a theoretical and computational foundation for further studies aiming to explore the effectiveness of interventions in growing networks.
△ Less
Submitted 5 August, 2022;
originally announced August 2022.
-
Minorities in networks and algorithms
Authors:
Fariba Karimi,
Marcos Oliveira,
Markus Strohmaier
Abstract:
In this chapter, we provide an overview of recent advances in data-driven and theory-informed complex models of social networks and their potential in understanding societal inequalities and marginalization. We focus on inequalities arising from networks and network-based algorithms and how they affect minorities. In particular, we examine how homophily and mixing biases shape large and small soci…
▽ More
In this chapter, we provide an overview of recent advances in data-driven and theory-informed complex models of social networks and their potential in understanding societal inequalities and marginalization. We focus on inequalities arising from networks and network-based algorithms and how they affect minorities. In particular, we examine how homophily and mixing biases shape large and small social networks, influence perception of minorities, and affect collaboration patterns. We also discuss dynamical processes on and of networks and the formation of norms and health inequalities. Additionally, we argue that network modeling is paramount for unveiling the effect of ranking and social recommendation algorithms on the visibility of minorities. Finally, we highlight the key challenges and future opportunities in this emerging research topic.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.
-
Link recommendations: Their impact on network structure and minorities
Authors:
Antonio Ferrara,
Lisette Espín-Noboa,
Fariba Karimi,
Claudia Wagner
Abstract:
Network-based people recommendation algorithms are widely employed on the Web to suggest new connections in social media or professional platforms. While such recommendations bring people together, the feedback loop between the algorithms and the changes in network structure may exacerbate social biases. These biases include rich-get-richer effects, filter bubbles, and polarization. However, socia…
▽ More
Network-based people recommendation algorithms are widely employed on the Web to suggest new connections in social media or professional platforms. While such recommendations bring people together, the feedback loop between the algorithms and the changes in network structure may exacerbate social biases. These biases include rich-get-richer effects, filter bubbles, and polarization. However, social networks are diverse complex systems and recommendations may affect them differently, depending on their structural properties. In this work, we explore five people recommendation algorithms by systematically applying them over time to different synthetic networks. In particular, we measure to what extent these recommendations change the structure of bi-populated networks and show how these changes affect the minority group. Our systematic experimentation helps to better understand when link recommendation algorithms are beneficial or harmful to minority groups in social networks. In particular, our findings suggest that, while all algorithms tend to close triangles and increase cohesion, all algorithms except Node2Vec are prone to favor and suggest nodes with high in-degree. Furthermore, we found that, especially when both classes are heterophilic, recommendation algorithms can reduce the visibility of minorities.
△ Less
Submitted 12 May, 2022;
originally announced May 2022.
-
The many facets of academic mobility and its impact on scholars' career
Authors:
Fakhri Momeni,
Fariba Karimi,
Philipp Mayr,
Isabella Peters,
Stefan Dietze
Abstract:
International mobility in academia can enhance the human and social capital of researchers and consequently their scientific outcome. However, there is still a very limited understanding of the different mobility patterns among scholars with various socio-demographic characteristics. The aim of this study is twofold. First, we investigate to what extent individual factors associate with the mobili…
▽ More
International mobility in academia can enhance the human and social capital of researchers and consequently their scientific outcome. However, there is still a very limited understanding of the different mobility patterns among scholars with various socio-demographic characteristics. The aim of this study is twofold. First, we investigate to what extent individual factors associate with the mobility of researchers. Second, we explore the relationship between mobility and scientific activity and impact. For this purpose, we used a bibliometric approach to track the mobility of authors. To compare the scientific outcomes of researchers, we considered the number of publications and received citations as indicators, as well as the number of unique co-authors in all their publications. We also analysed the co-authorship network of researchers and compared centrality measures of mobile and non-mobile researchers. Results show that researchers from North America and Sub-Saharan Africa, particularly female ones, have the lowest, respectively, highest tendency towards international mobility. Having international co-authors increases the probability of international movement. Our findings uncover gender inequality in international mobility across scientific fields and countries. Across genders, researchers in the Physical sciences have the most and in the Social sciences the least rate of mobility. We observed more mobility for Social scientists at the advanced career stage, while researchers in other fields prefer to move at earlier career stages. Also, we found a positive correlation between mobility and scientific outcomes, but no apparent difference between females and males. Comparing the centrality of mobile and non-mobile researchers in the co-authorship networks reveals a higher social capital advantage for mobile researchers.
△ Less
Submitted 29 March, 2022; v1 submitted 14 March, 2022;
originally announced March 2022.
-
Inequality and Inequity in Network-based Ranking and Recommendation Algorithms
Authors:
Lisette Espín-Noboa,
Claudia Wagner,
Markus Strohmaier,
Fariba Karimi
Abstract:
Though algorithms promise many benefits including efficiency, objectivity and accuracy, they may also introduce or amplify biases. Here we study two well-known algorithms, namely PageRank and Who-to-Follow (WTF), and show to what extent their ranks produce inequality and inequity when applied to directed social networks. To this end, we propose a directed network model with preferential attachment…
▽ More
Though algorithms promise many benefits including efficiency, objectivity and accuracy, they may also introduce or amplify biases. Here we study two well-known algorithms, namely PageRank and Who-to-Follow (WTF), and show to what extent their ranks produce inequality and inequity when applied to directed social networks. To this end, we propose a directed network model with preferential attachment and homophily (DPAH) and demonstrate the influence of network structure on the rank distributions of these algorithms. Our main findings suggest that (i) inequality is positively correlated with inequity, (ii) inequality is driven by the interplay between preferential attachment, homophily, node activity and edge density, and (iii) inequity is driven by the interplay between homophily and minority size. In particular, these two algorithms reduce, replicate and amplify the representation of minorities in top ranks when majorities are homophilic, neutral and heterophilic, respectively. Moreover, when this representation is reduced, minorities may improve their visibility in the rank by connecting strategically in the network. For instance, by increasing their out-degree or homophily when majorities are also homophilic. These findings shed light on the social and algorithmic mechanisms that hinder equality and equity in network-based ranking and recommendation algorithms.
△ Less
Submitted 22 July, 2022; v1 submitted 30 September, 2021;
originally announced October 2021.
-
Ultra-Fast, High-Performance 8x8 Approximate Multipliers by a New Multicolumn 3,3:2 Inexact Compressor and its Derivatives
Authors:
Fereshteh Karimi,
Reza Faghih Mirzaee,
Ali Fakeri-Tabrizi,
Arman Roohi
Abstract:
A multiplier, as a key component in many different applications, is a time-consuming, energy-intensive computation block. Approximate computing is a practical design paradigm that attempts to improve hardware efficacy while kee** computation quality satisfactory. A novel multicolumn 3,3:2 inexact compressor is presented in this paper. It takes three partial products from two adjacent columns eac…
▽ More
A multiplier, as a key component in many different applications, is a time-consuming, energy-intensive computation block. Approximate computing is a practical design paradigm that attempts to improve hardware efficacy while kee** computation quality satisfactory. A novel multicolumn 3,3:2 inexact compressor is presented in this paper. It takes three partial products from two adjacent columns each for rapid partial product reduction. The proposed inexact compressor and its derivates enable us to design a high-speed approximate multiplier. Then, another ultra-fast, high-efficient approximate multiplier is achieved utilizing a systematic truncation strategy. The proposed multipliers accumulate partial products in only two stages, one fewer stage than other approximate multipliers in the literature. Implementation results by Synopsys Design Compiler and 45 nm technology node demonstrates nearly 11.11% higher speed for the second proposed design over the fastest existing approximate multiplier. Furthermore, the new approximate multipliers are applied to the image processing application of image sharpening, and their performance in this application is highly satisfactory. It is shown in this paper that the error pattern of an approximate multiplier, in addition to the mean error distance and error rate, has a direct effect on the outcomes of the image processing application.
△ Less
Submitted 15 August, 2023; v1 submitted 25 July, 2021;
originally announced July 2021.
-
Group mixing drives inequality in face-to-face gatherings
Authors:
Marcos Oliveira,
Fariba Karimi,
Maria Zens,
Johann Schaible,
Mathieu Génois,
Markus Strohmaier
Abstract:
Uncovering how inequality emerges from human interaction is imperative for just societies. Here we show that the way social groups interact in face-to-face situations can enable the emergence of disparities in the visibility of social groups. These disparities translate into members of specific social groups having fewer social ties than the average (i.e., degree inequality). We characterize group…
▽ More
Uncovering how inequality emerges from human interaction is imperative for just societies. Here we show that the way social groups interact in face-to-face situations can enable the emergence of disparities in the visibility of social groups. These disparities translate into members of specific social groups having fewer social ties than the average (i.e., degree inequality). We characterize group degree inequality in sensor-based data sets and present a mechanism that explains these disparities as the result of group mixing and group-size imbalance. We investigate how group sizes affect this inequality, thereby uncovering the critical size and mixing conditions in that a critical minority group emerges. If a minority group is larger than this critical size, it can be a well-connected, cohesive group; if it is smaller, minority cohesion widens degree inequality. Finally, we expose the under-representation of individuals in degree rankings due to mixing dynamics and propose a way to reduce such biases.
△ Less
Submitted 16 March, 2022; v1 submitted 22 June, 2021;
originally announced June 2021.
-
Bias in Data-driven AI Systems -- An Introductory Survey
Authors:
Eirini Ntoutsi,
Pavlos Fafalios,
Ujwal Gadiraju,
Vasileios Iosifidis,
Wolfgang Nejdl,
Maria-Esther Vidal,
Salvatore Ruggieri,
Franco Turini,
Symeon Papadopoulos,
Emmanouil Krasanakis,
Ioannis Kompatsiaris,
Katharina Kinder-Kurlanda,
Claudia Wagner,
Fariba Karimi,
Miriam Fernandez,
Harith Alani,
Bettina Berendt,
Tina Kruegel,
Christian Heinze,
Klaus Broelemann,
Gjergji Kasneci,
Thanassis Tiropanis,
Steffen Staab
Abstract:
AI-based systems are widely employed nowadays to make decisions that have far-reaching impacts on individuals and society. Their decisions might affect everyone, everywhere and anytime, entailing concerns about potential human rights issues. Therefore, it is necessary to move beyond traditional AI algorithms optimized for predictive performance and embed ethical and legal principles in their desig…
▽ More
AI-based systems are widely employed nowadays to make decisions that have far-reaching impacts on individuals and society. Their decisions might affect everyone, everywhere and anytime, entailing concerns about potential human rights issues. Therefore, it is necessary to move beyond traditional AI algorithms optimized for predictive performance and embed ethical and legal principles in their design, training and deployment to ensure social good while still benefiting from the huge potential of the AI technology. The goal of this survey is to provide a broad multi-disciplinary overview of the area of bias in AI systems, focusing on technical challenges and solutions as well as to suggest new research directions towards approaches well-grounded in a legal frame. In this survey, we focus on data-driven AI, as a large part of AI is powered nowadays by (big) data and powerful Machine Learning (ML) algorithms. If otherwise not specified, we use the general term bias to describe problems related to the gathering or processing of data that might result in prejudiced decisions on the bases of demographic features like race, sex, etc.
△ Less
Submitted 14 January, 2020;
originally announced January 2020.
-
The Role of Network Structure and Initial Group Norm Distributions in Norm Conflict
Authors:
Julian Kohne,
Natalie Gallagher,
Zeynep Melis Kirgil,
Rocco Paolillo,
Lars Padmos,
Fariba Karimi
Abstract:
Social norms can facilitate societal coexistence in groups by providing an implicitly shared set of expectations and behavioral guidelines. However, different social groups can hold different norms, and lacking an overarching normative consensus can lead to conflict within and between groups. In this paper, we present an agent-based model that simulates the adoption of norms in two interacting gro…
▽ More
Social norms can facilitate societal coexistence in groups by providing an implicitly shared set of expectations and behavioral guidelines. However, different social groups can hold different norms, and lacking an overarching normative consensus can lead to conflict within and between groups. In this paper, we present an agent-based model that simulates the adoption of norms in two interacting groups. We explore this phenomenon while varying relative group sizes and homophily/heterophily (two features of network structure), and initial group norm distributions. Agents update their norm according to an adapted version of Granovetter's threshold model, using a uniform distribution of thresholds. We study the impact of network structure and initial norm distributions on the process of achieving normative consensus and the resulting potential for intragroup and intergroup conflict. Our results show that norm change is most likely when norms are strongly tied to group membership. Groups end up with the most similar norm distributions when networks are heterophilic, with small to middling minority groups. High homophilic networks show high potential intergroup conflict and low potential intragroup conflict, while the opposite pattern emerges for high heterophilic networks.
△ Less
Submitted 1 July, 2019;
originally announced July 2019.
-
Collective Attention towards Scientists and Research Topics
Authors:
Claudia Wagner,
Olga Zagovora,
Tatiana Sennikova,
Fariba Karimi
Abstract:
Emergent patterns of collective attention towards scientists and their research may function as a proxy for scientific impact which traditionally is assessed via committees that award prizes to scientists. Therefore it is crucial to understand the relationships between scientific impact and online demand and supply for information about scientists and their work. In this paper, we compare the temp…
▽ More
Emergent patterns of collective attention towards scientists and their research may function as a proxy for scientific impact which traditionally is assessed via committees that award prizes to scientists. Therefore it is crucial to understand the relationships between scientific impact and online demand and supply for information about scientists and their work. In this paper, we compare the temporal pattern of information supply (article creations) and information demand (article views) on Wikipedia for two groups of scientists: scientists who received one of the most prestigious awards in their field and influential scientists from the same field who did not receive an award. Our research highlights that awards function as external shocks which increase supply and demand for information about scientists, but hardly affect information supply and demand for their research topics. Further, we find interesting differences in the temporal ordering of information supply between the two groups: (i) award-winners have a higher probability that interest in them precedes interest in their work; (ii) for award winners interest in articles about them and their work is temporally more clustered than for non-awarded scientists.
△ Less
Submitted 17 April, 2018;
originally announced April 2018.
-
Decay of Relevance in Exponentially Growing Networks
Authors:
Jun Sun,
Steffen Staab,
Fariba Karimi
Abstract:
We propose a new preferential attachment-based network growth model in order to explain two properties of growing networks: (1) the power-law growth of node degrees and (2) the decay of node relevance. In preferential attachment models, the ability of a node to acquire links is affected by its degree, its fitness, as well as its relevance which typically decays over time. After a review of existin…
▽ More
We propose a new preferential attachment-based network growth model in order to explain two properties of growing networks: (1) the power-law growth of node degrees and (2) the decay of node relevance. In preferential attachment models, the ability of a node to acquire links is affected by its degree, its fitness, as well as its relevance which typically decays over time. After a review of existing models, we argue that they cannot explain the above-mentioned two properties (1) and (2) at the same time. We have found that apart from being empirically observed in many systems, the exponential growth of the network size over time is the key to sustain the power-law growth of node degrees when node relevance decays. We therefore make a clear distinction between the event time and the physical time in our model, and show that under the assumption that the relevance of a node decays with its age $τ$, there exists an analytical solution of the decay function $f_R$ with the form $f_R(τ) = τ^{-1}$. Other properties of real networks such as power-law alike degree distributions can still be preserved, as supported by our experiments. This makes our model useful in explaining and analysing many real systems such as citation networks.
△ Less
Submitted 9 April, 2018;
originally announced April 2018.
-
Analyzing the network structure and gender differences among the members of the Networked Knowledge Organization Systems (NKOS) community
Authors:
Fariba Karimi,
Philipp Mayr,
Fakhri Momeni
Abstract:
In this paper, we analyze a major part of the research output of the Networked Knowledge Organization Systems (NKOS) community in the period 2000 to 2016 from a network analytical perspective. We focus on the papers presented at the European and U.S. NKOS workshops and in addition four special issues on NKOS in the last 16 years. For this purpose, we have generated an open dataset, the "NKOS bibli…
▽ More
In this paper, we analyze a major part of the research output of the Networked Knowledge Organization Systems (NKOS) community in the period 2000 to 2016 from a network analytical perspective. We focus on the papers presented at the European and U.S. NKOS workshops and in addition four special issues on NKOS in the last 16 years. For this purpose, we have generated an open dataset, the "NKOS bibliography" which covers the bibliographic information of the research output. We analyze the co-authorship network of this community which results in 123 papers with a sum of 256 distinct authors. We use standard network analytic measures such as degree, betweenness and closeness centrality to describe the co-authorship network of the NKOS dataset. First, we investigate global properties of the network over time. Second, we analyze the centrality of the authors in the NKOS network. Lastly, we investigate gender differences in collaboration behavior in this community. Our results show that apart from differences in centrality measures of the scholars, they have higher tendency to collaborate with those in the same institution or the same geographic proximity. We also find that homophily is higher among women in this community. Apart from small differences in closeness and clustering among men and women, we do not find any significant dissimilarities with respect to other centralities.
△ Less
Submitted 12 March, 2018;
originally announced March 2018.
-
Towards Quantifying Sampling Bias in Network Inference
Authors:
Lisette Espín-Noboa,
Claudia Wagner,
Fariba Karimi,
Kristina Lerman
Abstract:
Relational inference leverages relationships between entities and links in a network to infer information about the network from a small sample. This method is often used when global information about the network is not available or difficult to obtain. However, how reliable is inference from a small labelled sample? How should the network be sampled, and what effect does it have on inference erro…
▽ More
Relational inference leverages relationships between entities and links in a network to infer information about the network from a small sample. This method is often used when global information about the network is not available or difficult to obtain. However, how reliable is inference from a small labelled sample? How should the network be sampled, and what effect does it have on inference error? How does the structure of the network impact the sampling strategy? We address these questions by systematically examining how network sampling strategy and sample size affect accuracy of relational inference in networks. To this end, we generate a family of synthetic networks where nodes have a binary attribute and a tunable level of homophily. As expected, we find that in heterophilic networks, we can obtain good accuracy when only small samples of the network are initially labelled, regardless of the sampling strategy. Surprisingly, this is not the case for homophilic networks, and sampling strategies that work well in heterophilic networks lead to large inference errors. These findings suggest that the impact of network structure on relational classification is more complex than previously thought.
△ Less
Submitted 6 March, 2018;
originally announced March 2018.
-
Homophily and minority size explain perception biases in social networks
Authors:
Eun Lee,
Fariba Karimi,
Claudia Wagner,
Hang-Hyun Jo,
Markus Strohmaier,
Mirta Galesic
Abstract:
People's perceptions about the size of minority groups in social networks can be biased, often showing systematic over- or underestimation. These social perception biases are often attributed to biased cognitive or motivational processes. Here we show that both over- and underestimation of the size of a minority group can emerge solely from structural properties of social networks. Using a generat…
▽ More
People's perceptions about the size of minority groups in social networks can be biased, often showing systematic over- or underestimation. These social perception biases are often attributed to biased cognitive or motivational processes. Here we show that both over- and underestimation of the size of a minority group can emerge solely from structural properties of social networks. Using a generative network model, we show analytically that these biases depend on the level of homophily and its asymmetric nature, as well as on the size of the minority group. Our model predictions correspond well with empirical data from a cross-cultural survey and with numerical calculations on six real-world networks. We also show under what circumstances individuals can reduce their biases by relying on perceptions of their neighbors. This work advances our understanding of the impact of network structure on social perception biases and offers a quantitative approach for addressing related issues in society.
△ Less
Submitted 22 July, 2019; v1 submitted 24 October, 2017;
originally announced October 2017.
-
Gender Disparities in Science? Dropout, Productivity, Collaborations and Success of Male and Female Computer Scientists
Authors:
Mohsen Jadidi,
Fariba Karimi,
Haiko Lietz,
Claudia Wagner
Abstract:
Scientific collaborations shape ideas as well as innovations and are both the substrate for, and the outcome of, academic careers. Recent studies show that gender inequality is still present in many scientific practices ranging from hiring to peer-review processes and grant applications. In this work, we investigate gender-specific differences in collaboration patterns of more than one million com…
▽ More
Scientific collaborations shape ideas as well as innovations and are both the substrate for, and the outcome of, academic careers. Recent studies show that gender inequality is still present in many scientific practices ranging from hiring to peer-review processes and grant applications. In this work, we investigate gender-specific differences in collaboration patterns of more than one million computer scientists over the course of 47 years. We explore how these patterns change over years and career ages and how they impact scientific success. Our results highlight that successful male and female scientists reveal the same collaboration patterns: compared to scientists in the same career age, they tend to collaborate with more colleagues than other scientists, seek innovations as brokers and establish longer-lasting and more repetitive collaborations. However, women are on average less likely to adapt the collaboration patterns that are related with success, more likely to embed into ego networks devoid of structural holes, and they exhibit stronger gender homophily as well as a consistently higher dropout rate than men in all career ages.
△ Less
Submitted 9 August, 2017; v1 submitted 19 April, 2017;
originally announced April 2017.
-
Sampling from Social Networks with Attributes
Authors:
Claudia Wagner,
Philipp Singer,
Fariba Karimi,
Jürgen Pfeffer,
Markus Strohmaier
Abstract:
Sampling from large networks represents a fundamental challenge for social network research. In this paper, we explore the sensitivity of different sampling techniques (node sampling, edge sampling, random walk sampling, and snowball sampling) on social networks with attributes. We consider the special case of networks (i) where we have one attribute with two values (e.g., male and female in the c…
▽ More
Sampling from large networks represents a fundamental challenge for social network research. In this paper, we explore the sensitivity of different sampling techniques (node sampling, edge sampling, random walk sampling, and snowball sampling) on social networks with attributes. We consider the special case of networks (i) where we have one attribute with two values (e.g., male and female in the case of gender), (ii) where the size of the two groups is unequal (e.g., a male majority and a female minority), and (iii) where nodes with the same or different attribute value attract or repel each other (i.e., homophilic or heterophilic behavior). We evaluate the different sampling techniques with respect to conserving the position of nodes and the visibility of groups in such networks. Experiments are conducted both on synthetic and empirical social networks. Our results provide evidence that different network sampling techniques are highly sensitive with regard to capturing the expected centrality of nodes, and that their accuracy depends on relative group size differences and on the level of homophily that can be observed in the network. We conclude that uninformed sampling from social networks with attributes thus can significantly impair the ability of researchers to draw valid conclusions about the centrality of nodes and the visibility or invisibility of groups in social networks.
△ Less
Submitted 17 February, 2017;
originally announced February 2017.
-
Visibility of minorities in social networks
Authors:
Fariba Karimi,
Mathieu Génois,
Claudia Wagner,
Philipp Singer,
Markus Strohmaier
Abstract:
Homophily can put minority groups at a disadvantage by restricting their ability to establish links with people from a majority group. This can limit the overall visibility of minorities in the network. Building on a Barabási-Albert model variation with groups and homophily, we show how the visibility of minority groups in social networks is a function of (i) their relative group size and (ii) the…
▽ More
Homophily can put minority groups at a disadvantage by restricting their ability to establish links with people from a majority group. This can limit the overall visibility of minorities in the network. Building on a Barabási-Albert model variation with groups and homophily, we show how the visibility of minority groups in social networks is a function of (i) their relative group size and (ii) the presence or absence of homophilic behavior. We provide an analytical solution for this problem and demonstrate the existence of asymmetric behavior. Finally, we study the visibility of minority groups in examples of real-world social networks: sexual contacts, scientific collaboration, and scientific citation. Our work presents a foundation for assessing the visibility of minority groups in social networks in which homophilic or heterophilic behaviour is present.
△ Less
Submitted 1 February, 2017;
originally announced February 2017.
-
Inferring Gender from Names on the Web: A Comparative Evaluation of Gender Detection Methods
Authors:
Fariba Karimi,
Claudia Wagner,
Florian Lemmerich,
Mohsen Jadidi,
Markus Strohmaier
Abstract:
Computational social scientists often harness the Web as a "societal observatory" where data about human social behavior is collected. This data enables novel investigations of psychological, anthropological and sociological research questions. However, in the absence of demographic information, such as gender, many relevant research questions cannot be addressed. To tackle this problem, researche…
▽ More
Computational social scientists often harness the Web as a "societal observatory" where data about human social behavior is collected. This data enables novel investigations of psychological, anthropological and sociological research questions. However, in the absence of demographic information, such as gender, many relevant research questions cannot be addressed. To tackle this problem, researchers often rely on automated methods to infer gender from name information provided on the web. However, little is known about the accuracy of existing gender-detection methods and how biased they are against certain sub-populations. In this paper, we address this question by systematically comparing several gender detection methods on a random sample of scientists for whom we know their full name, their gender and the country of their workplace. We further suggest a novel method that employs web-based image retrieval and gender recognition in facial images in order to augment name-based approaches. Our findings show that the performance of name-based gender detection approaches can be biased towards countries of origin and such biases can be reduced by combining name-based an image-based gender detection methods.
△ Less
Submitted 14 March, 2016;
originally announced March 2016.
-
Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity
Authors:
Anna Samoilenko,
Fariba karimi,
Daniel Edler,
Jérôme Kunegis,
Markus Strohmaier
Abstract:
In this paper, we study the network of global interconnections between language communities, based on shared co-editing interests of Wikipedia editors, and show that although English is discussed as a potential lingua franca of the digital space, its domination disappears in the network of co-editing similarities, and instead local connections come to the forefront. Out of the hypotheses we explor…
▽ More
In this paper, we study the network of global interconnections between language communities, based on shared co-editing interests of Wikipedia editors, and show that although English is discussed as a potential lingua franca of the digital space, its domination disappears in the network of co-editing similarities, and instead local connections come to the forefront. Out of the hypotheses we explored, bilingualism, linguistic similarity of languages, and shared religion provide the best explanations for the similarity of interests between cultural communities. Population attraction and geographical proximity are also significant, but much weaker factors bringing communities together. In addition, we present an approach that allows for extracting significant cultural borders from editing activity of Wikipedia users, and comparing a set of hypotheses about the social mechanisms generating these borders. Our study sheds light on how culture is reflected in the collective process of archiving knowledge on Wikipedia, and demonstrates that cross-lingual interconnections on Wikipedia are not dominated by one powerful language. Our findings also raise some important policy questions for the Wikimedia Foundation.
△ Less
Submitted 14 March, 2016;
originally announced March 2016.
-
Map** bilateral information interests using the activity of Wikipedia editors
Authors:
Fariba Karimi,
Ludvig Bohlin,
Anna Samoilenko,
Martin Rosvall,
Andrea Lancichinetti
Abstract:
We live in a global village where electronic communication has eliminated the geographical barriers of information exchange. The road is now open to worldwide convergence of information interests, shared values, and understanding. Nevertheless, interests still vary between countries around the world. This raises important questions about what today's world map of in- formation interests actually l…
▽ More
We live in a global village where electronic communication has eliminated the geographical barriers of information exchange. The road is now open to worldwide convergence of information interests, shared values, and understanding. Nevertheless, interests still vary between countries around the world. This raises important questions about what today's world map of in- formation interests actually looks like and what factors cause the barriers of information exchange between countries. To quantitatively construct a world map of information interests, we devise a scalable statistical model that identifies countries with similar information interests and measures the countries' bilateral similarities. From the similarities we connect countries in a global network and find that countries can be mapped into 18 clusters with similar information interests. Through regression we find that language and religion best explain the strength of the bilateral ties and formation of clusters. Our findings provide a quantitative basis for further studies to better understand the complex interplay between shared interests and conflict on a global scale. The methodology can also be extended to track changes over time and capture important trends in global information exchange.
△ Less
Submitted 25 January, 2016; v1 submitted 18 March, 2015;
originally announced March 2015.
-
The Problem of Action at a Distance in Networks and the Emergence of Preferential Attachment from Triadic Closure
Authors:
Jérôme Kunegis,
Fariba Karimi,
Jun Sun
Abstract:
In this paper, we characterise the notion of preferential attachment in networks as action at a distance, and argue that it can only be an emergent phenomenon -- the actual mechanism by which networks grow always being the closing of triangles. After a review of the concepts of triangle closing and preferential attachment, we present our argument, as well as a simplified model in which preferentia…
▽ More
In this paper, we characterise the notion of preferential attachment in networks as action at a distance, and argue that it can only be an emergent phenomenon -- the actual mechanism by which networks grow always being the closing of triangles. After a review of the concepts of triangle closing and preferential attachment, we present our argument, as well as a simplified model in which preferential attachment can be derived mathematically from triangle closing. Additionally, we perform experiments on synthetic graphs to demonstrate the emergence of preferential attachment in graph growth models based only on triangle closing.
△ Less
Submitted 24 April, 2017; v1 submitted 1 August, 2014;
originally announced August 2014.
-
Structural differences between open and direct communication in an online community
Authors:
Fariba Karimi,
Verónica C. Ramenzoni,
Petter Holme
Abstract:
Most research of online communication focuses on modes of communication that are either open (like forums, bulletin boards, Twitter, etc.) or direct (like e-mails). In this work, we study a dataset that has both types of communication channels. We relate our findings to theories of social organization and human dynamics. The data comprises 36,492 users of a movie discussion community. Our results…
▽ More
Most research of online communication focuses on modes of communication that are either open (like forums, bulletin boards, Twitter, etc.) or direct (like e-mails). In this work, we study a dataset that has both types of communication channels. We relate our findings to theories of social organization and human dynamics. The data comprises 36,492 users of a movie discussion community. Our results show that there are differences in the way users communicate in the two channels that are reflected in the shape of degree- and interevent time distributions. The open communication that is designed to facilitate conversations with any member, shows a broader degree distribution and more of the triangles in the network are primarily formed in this mode of communication. The direct channel is presumably preferred by closer communication and the response time in dialogues is shorter. On a more coarse-grained level, there are common patterns in the two networks. The differences and overlaps between communication networks, thus, provide a unique window into how social and structural aspects of communication establish and evolve.
△ Less
Submitted 30 July, 2014;
originally announced July 2014.
-
Threshold model of cascades in temporal networks
Authors:
Fariba Karimi,
Petter Holme
Abstract:
Threshold models try to explain the consequences of social influence like the spread of fads and opinions. Along with models of epidemics, they constitute a major theoretical framework of social spreading processes. In threshold models on static networks, an individual changes her state if a certain fraction of her neighbors has done the same. When there are strong correlations in the temporal asp…
▽ More
Threshold models try to explain the consequences of social influence like the spread of fads and opinions. Along with models of epidemics, they constitute a major theoretical framework of social spreading processes. In threshold models on static networks, an individual changes her state if a certain fraction of her neighbors has done the same. When there are strong correlations in the temporal aspects of contact patterns, it is useful to represent the system as a temporal network. In such a system, not only contacts but also the time of the contacts are represented explicitly. There is a consensus that bursty temporal patterns slow down disease spreading. However, as we will see, this is not a universal truth for threshold models. In this work, we propose an extension of Watts' classic threshold model to temporal networks. We do this by assuming that an agent is influenced by contacts which lie a certain time into the past. I.e., the individuals are affected by contacts within a time window. In addition to thresholds as the fraction of contacts, we also investigate the number of contacts within the time window as a basis for influence. To elucidate the model's behavior, we run the model on real and randomized empirical contact datasets.
△ Less
Submitted 7 September, 2012; v1 submitted 5 July, 2012;
originally announced July 2012.