-
Increasing, not Diminishing: Investigating the Returns of Highly Maintainable Code
Authors:
Markus Borg,
Ilyana Pruvost,
Enys Mones,
Adam Tornhill
Abstract:
Understanding and effectively managing Technical Debt (TD) remains a vital challenge in software engineering. While many studies on code-level TD have been published, few illustrate the business impact of low-quality source code. In this study, we combine two publicly available datasets to study the association between code quality on the one hand, and defect count and implementation time on the other hand. We introduce a value-creation model, derived from regression analyses, to explore relative changes from a baseline. Our results show that the associations vary across different intervals of code quality. Furthermore, the value model suggests strong non-linearities at the extremes of the code quality spectrum. Most importantly, the model suggests amplified returns on investment in the upper end. We discuss the findings within the context of the "broken windows" theory and recommend organizations to diligently prevent the introduction of code smells in files with high churn. Finally, we argue that the value-creation model can be used to initiate discussions regarding the return on investment in refactoring efforts.
Submitted 24 January, 2024;
originally announced January 2024.
-
U Owns the Code That Changes and How Marginal Owners Resolve Issues Slower in Low-Quality Source Code
Authors:
Markus Borg,
Adam Tornhill,
Enys Mones
Abstract:
[Context] Accurate time estimation is a critical aspect of predictable software engineering. Previous work shows that low source code quality increases the uncertainty in issue resolution times. [Objective] Our goal is to evaluate how developers' project experience and file ownership are related to issue resolution times. [Method] We mine 40 proprietary software repositories and conduct an observational study. Using CodeScene, we measure source code quality and active development time connected to Jira issues. [Results] Most source code changes are made by either a marginal or dominant code owner. Also, most changes to low-quality source code are made by developers with low levels of ownership. In low-quality source code, marginal owners need 45% more time for small changes, and 93% more time for large changes. [Conclusions] Collective code ownership is a popular target, but industry practice results in many dominant and marginal owners. Marginal owners are particularly hampered when working with low-quality source code, which leads to productivity losses. In codebases plagued by technical debt, newly onboarded developers will require more time to complete tasks.
Submitted 23 April, 2023;
originally announced April 2023.
-
The effectiveness of backward contact tracing in networks
Authors:
Sadamori Kojaku,
Laurent Hébert-Dufresne,
Enys Mones,
Sune Lehmann,
Yong-Yeol Ahn
Abstract:
Discovering and isolating infected individuals is a cornerstone of epidemic control. Because many infectious diseases spread through close contacts, contact tracing is a key tool for case discovery and control. However, although contact tracing has been performed widely, the mathematical understanding of contact tracing has not been fully established and it has not been clearly understood what determines the efficacy of contact tracing. Here, we reveal that, compared with "forward" tracing---tracing to whom disease spreads, "backward" tracing---tracing from whom disease spreads---is profoundly more effective. The effectiveness of backward tracing is due to simple but overlooked biases arising from the heterogeneity in contacts. Using simulations on both synthetic and high-resolution empirical contact datasets, we show that even at a small probability of detecting infected individuals, strategically executed contact tracing can prevent a significant fraction of further transmissions. We also show that---in terms of the number of prevented transmissions per isolation---case isolation combined with a small amount of contact tracing is more efficient than case isolation alone. By demonstrating that backward contact tracing is highly effective at discovering super-spreading events, we argue that the potential effectiveness of contact tracing has been underestimated. Therefore, there is a critical need for revisiting current contact tracing strategies so that they leverage all forms of biases. Our results also have important consequences for digital contact tracing because it will be crucial to incorporate the capability for backward and deep tracing while adhering to the privacy-preserving requirements of these new platforms.
Submitted 14 September, 2020; v1 submitted 5 May, 2020;
originally announced May 2020.
-
Emergence of leader-follower hierarchy among players in an on-line experiment
Authors:
Bálint J. Tóth,
Gergely Palla,
Enys Mones,
Gergő Havadi,
Nóra Páll,
Péter Pollner,
Tamás Vicsek
Abstract:
Hierarchical networks are prevalent in nature and society, corresponding to groups of actors - animals, humans or even robots - organised according to a pyramidal structure with decision makers at the top and followers at the bottom. While this phenomenon is seemingly universal, the underlying governing principles are poorly understood. Here we study the emergence of hierarchies in groups of people playing a simple dot guessing game in controlled experiments, lasting for about 40 rounds, conducted over the Internet. During the games, the players had the possibility to look at the answer of a limited number of other players of their choice. This act of asking for advice defines a directed connection between the involved players, and according to our analysis, the initial random configuration of the emerging networks became more structured over time, showing signs of hierarchy towards the end of the game. In addition, the achieved score of the players appeared to be correlated with their position in the hierarchy. These results indicate that under certain conditions imitation and limited knowledge about the performance of other actors is sufficient for the emergence of hierarchy in a social group.
Submitted 24 January, 2019;
originally announced January 2019.
-
Temporal Limits of Privacy in Human Behavior
Authors:
Vedran Sekara,
Enys Mones,
Håkan Jonsson
Abstract:
Large-scale collection of human behavioral data by companies raises serious privacy concerns. We show that behavior captured in the form of application usage data collected from smartphones is highly unique even in very large datasets encompassing millions of individuals. This makes behavior-based re-identification of users across datasets possible. We study 12 months of data from 3.5 million users and show that four apps are enough to uniquely re-identify 91.2% of users using a simple strategy based on public information. Furthermore, we show that there is seasonal variability in uniqueness and that application usage fingerprints drift over time at an average constant rate.
Submitted 10 June, 2018;
originally announced June 2018.
-
Academic Performance and Behavioral Patterns
Authors:
Valentin Kassarnig,
Enys Mones,
Andreas Bjerre-Nielsen,
Piotr Sapiezynski,
David Dreyer Lassen,
Sune Lehmann
Abstract:
Identifying the factors that influence academic performance is an essential part of educational research. Previous studies have documented the importance of personality traits, class attendance, and social network structure. Because most of these analyses were based on a single behavioral aspect and/or small sample sizes, there is currently no quantification of the interplay of these factors. Here, we study the academic performance among a cohort of 538 undergraduate students forming a single, densely connected social network. Our work is based on data collected using smartphones, which the students used as their primary phones for two years. The availability of multi-channel data from a single population allows us to directly compare the explanatory power of individual and social characteristics. We find that the most informative indicators of performance are based on social ties and that network indicators result in better model performance than individual characteristics (including both personality and class attendance). We confirm earlier findings that class attendance is the most important predictor among individual characteristics. Finally, our results suggest the presence of strong homophily and/or peer effects among university students.
Submitted 9 April, 2018; v1 submitted 21 June, 2017;
originally announced June 2017.
-
The Role of Gender in Social Network Organization
Authors:
Ioanna Psylla,
Piotr Sapiezynski,
Enys Mones,
Sune Lehmann
Abstract:
The digital traces we leave behind when engaging with the modern world offer an interesting lens through which we study behavioral patterns as expression of gender. Although gender differentiation has been observed in a number of settings, the majority of studies focus on a single data stream in isolation. Here we use a dataset of high resolution data collected using mobile phones, as well as detailed questionnaires, to study gender differences in a large cohort.
We consider mobility behavior and individual personality traits among a group of more than 800 university students. We also investigate interactions among them expressed via person-to-person contacts, interactions on online social networks, and telecommunication. Thus, we are able to study the differences between male and female behavior captured through a multitude of channels for a single cohort. We find that while the two genders are similar in a number of aspects, there are robust deviations that include multiple facets of social interactions, suggesting the existence of inherent behavioral differences. Finally, we quantify how aspects of an individual's characteristics and social behavior reveal their gender by posing it as a classification problem. We ask: How well can we distinguish between male and female study participants based on behavior alone? Which behavioral features are most predictive?
Submitted 15 June, 2017;
originally announced June 2017.
-
Class attendance, peer similarity, and academic performance in a large field study
Authors:
Valentin Kassarnig,
Andreas Bjerre-Nielsen,
Enys Mones,
Sune Lehmann,
David Dreyer Lassen
Abstract:
Identifying the factors that determine academic performance is an essential part of educational research. Existing research indicates that class attendance is a useful predictor of subsequent course achievements. The majority of the literature is, however, based on surveys and self-reports, methods which have well-known systematic biases that lead to limitations on conclusions and generalizability as well as being costly to implement. Here we propose a novel method for measuring class attendance that overcomes these limitations by using location and bluetooth data collected from smartphone sensors. Based on measured attendance data of nearly 1,000 undergraduate students, we demonstrate that early and consistent class attendance strongly correlates with academic performance. In addition, our novel dataset allows us to determine that attendance among social peers was substantially correlated (>0.5), suggesting either an important peer effect or homophily with respect to attendance.
Submitted 9 April, 2018; v1 submitted 4 February, 2017;
originally announced February 2017.
-
Phenomenological theory of collective decision-making
Authors:
Anna Zafeiris,
Zsombor Koman,
Enys Mones,
Tamás Vicsek
Abstract:
An essential task of groups is to provide efficient solutions for the complex problems they face. Indeed, considerable efforts have been devoted to the question of collective decision-making related to problems involving a single dominant feature. Here we introduce a quantitative formalism for finding the optimal distribution of the group members' competences in the more typical case when the underlying problem is complex, i.e., multidimensional. Thus, we consider teams that are aiming at obtaining the best possible answer to a problem having a number of independent sub-problems. Our approach is based on a generic scheme for the process of evaluating the proposed solutions (i.e., negotiation). We demonstrate that the best performing groups have at least one specialist for each sub-problem -- but a far less intuitive result is that finding the optimal solution by the interacting group members requires that the specialists also have some insight into the sub-problems beyond their unique field(s). We present empirical results obtained by using a large-scale database of citations being in good agreement with the above theory. The framework we have developed can easily be adapted to a variety of realistic situations since taking into account the weights of the sub-problems, the opinions or the relations of the group is straightforward. Consequently, our method can be used in several contexts, especially when the optimal composition of a group of decision-makers is designed.
Submitted 21 December, 2016; v1 submitted 30 November, 2016;
originally announced December 2016.
-
Contact activity and dynamics of the online elite
Authors:
Enys Mones,
Arkadiusz Stopczynski,
Sune Lehmann
Abstract:
Humans interact through numerous channels to build and maintain social connections: they meet face-to-face, initiate phone calls or send text messages, and interact via social media. Although it is known that the network of physical contacts, for example, is distinct from the network arising from communication events via phone calls and instant messages, the extent to which these networks differ is not clear. In fact, the network structure of these channels shows large structural variations. Each network of interactions, however, contains both central and peripheral individuals: central members are characterized by higher connectivity and can reach a high fraction of the network within a low number of connections, contrary to the nodes on the periphery. Here we show that the various channels account for diverse relationships between pairs of individuals and the corresponding interaction patterns across channels differ to an extent that hinders the simple reduction of social ties to a single layer. Furthermore, the origin and purpose of each network also determine the role of their respective central members: highly connected individuals in the person-to-person networks interact with their environment in a regular manner, while members central in the social communication networks display irregular behavior with respect to their physical contacts and are more active through rare, social events. These results suggest that due to the inherently different functions of communication channels, each one favors different social behaviors and different strategies for interacting with the environment. Our findings can facilitate the understanding of the varying roles and impact individuals have on the population, which can further shed light on the prediction and prevention of epidemic outbreaks, or information propagation.
Submitted 12 November, 2016;
originally announced November 2016.
-
Vaccination and Complex Social Dynamics
Authors:
Enys Mones,
Arkadiusz Stopczynski,
Alex Pentland,
Nathaniel Hupert,
Sune Lehmann
Abstract:
Vaccination and outbreak monitoring are essential tools for preventing and minimizing outbreaks of infectious diseases. Targeted strategies, where the individuals most important for monitoring or preventing outbreaks are selected for intervention, offer a possibility to significantly improve these measures. Although targeted strategies carry a strong potential, identifying optimal target groups remains a challenge. Here we consider the problem of identifying target groups based on digital communication networks (telecommunication, online social media) in order to predict and contain an infectious disease spreading on a real-world person-to-person network of more than 500 individuals. We show that target groups for efficient outbreak monitoring can be determined based on both telecommunication and online social network information. In case of vaccination the information regarding the digital communication networks improves the efficacy for short-range disease transmissions but, surprisingly, performance is severely reduced in the case of long-range transmission. These results are robust with respect to the strategy used to identify targeted individuals and time-gap between identification of targets and the intervention. Thus, we demonstrate that data available from telecommunication and online social networks can greatly improve epidemic control measures, but it is important to consider the details of the pathogen spreading mechanism when such policies are applied.
Submitted 2 March, 2016;
originally announced March 2016.
-
Hierarchical networks of scientific journals
Authors:
Gergely Palla,
Gergely Tibély,
Enys Mones,
Péter Pollner,
Tamás Vicsek
Abstract:
Scientific journals are the repositories of the gradually accumulating knowledge of mankind about the world surrounding us. Just as our knowledge is organised into classes ranging from major disciplines, subjects and fields to increasingly specific topics, journals can also be categorised into groups using various metrics. In addition to the set of topics characteristic for a journal, they can also be ranked regarding their relevance from the point of overall influence. One widespread measure is impact factor, but in the present paper we intend to reconstruct a much more detailed description by studying the hierarchical relations between the journals based on citation data. We use a measure related to the notion of m-reaching centrality and find a network which shows the level of influence of a journal from the point of the direction and efficiency with which information spreads through the network. We can also obtain an alternative network using a suitably modified nested hierarchy extraction method applied to the same data. The results are weakly methodology-dependent and reveal non-trivial relations among journals. The two alternative hierarchies show large similarity with some striking differences, providing together a complex picture of the intricate relations between scientific journals.
Submitted 12 August, 2015; v1 submitted 18 June, 2015;
originally announced June 2015.
-
Shock waves on complex networks
Authors:
Enys Mones,
Nuno A. M. Araújo,
Tamás Vicsek,
Hans J. Herrmann
Abstract:
Power grids, road maps, and river streams are examples of infrastructural networks which are highly vulnerable to external perturbations. An abrupt local change of load (voltage, traffic density, or water level) might propagate in a cascading way and affect a significant fraction of the network. Almost discontinuous perturbations can be modeled by shock waves which can eventually interfere constructively and endanger the normal functionality of the infrastructure. We study their dynamics by solving the Burgers equation under random perturbations on several real and artificial directed graphs. Even for graphs with a narrow distribution of node properties (e.g., degree or betweenness), a steady state is reached exhibiting a heterogeneous load distribution, having a difference of one order of magnitude between the highest and average loads. Unexpectedly we find for the European power grid and for finite Watts-Strogatz networks a broad pronounced bimodal distribution for the loads. To identify the most vulnerable nodes, we introduce the concept of node-basin size, a purely topological property which we show to be strongly correlated to the average load of a node.
Submitted 18 February, 2014;
originally announced February 2014.
-
Universal hierarchical behavior of citation networks
Authors:
Enys Mones,
Péter Pollner,
Tamás Vicsek
Abstract:
Many of the essential features of the evolution of scientific research are imprinted in the structure of citation networks. Connections in these networks imply information about the transfer of knowledge among papers, or in other words, edges describe the impact of papers on other publications. This inherent meaning of the edges implies that citation networks can exhibit hierarchical features that are typical of networks based on decision-making. In this paper, we investigate the hierarchical structure of citation networks consisting of papers in the same field. We find that the majority of the networks follow a universal trend towards a highly hierarchical state, and i) the various fields display differences only concerning their phase in life (distance from the "birth" of a field) or ii) the characteristic time according to which they are approaching the stationary state. We also show by a simple argument that the alterations in the behavior are related to and can be understood by the degree of specialization corresponding to the fields. Our results suggest that during the accumulation of knowledge in a given field, some papers are gradually becoming relatively more influential than most of the other papers.
Submitted 19 January, 2014;
originally announced January 2014.
-
Anomalous segregation dynamics of self-propelled particles
Authors:
Enys Mones,
András Czirók,
Tamás Vicsek
Abstract:
A number of novel experimental and theoretical results have recently been obtained on active soft matter, demonstrating the various interesting universal and anomalous features of this kind of driven system. Here we consider a fundamental but still unexplored aspect of the patterns arising in the system of actively moving units, i.e., their segregation taking place when two kinds of them with different adhesive properties are present. The process of segregation is studied by a model made of self-propelled particles such that the particles have a tendency to adhere only to those which are of the same kind. The calculations corresponding to the related differential equations can be made in parallel, thus a powerful GPU card allows large scale simulations. We find that the segregation kinetics is very different from the non-driven counterparts and is described by the new scaling exponents $z\simeq 1$ and $z\simeq 0.8$ for the 1:1 and the non-equal ratio of the two constituents, respectively. Our results are in agreement with a recent observation of segregating tissue cells in vitro.
Submitted 17 June, 2014; v1 submitted 5 January, 2014;
originally announced January 2014.
-
Strong random correlations in networks of heterogeneous agents
Authors:
Imre Kondor,
István Csabai,
Gábor Papp,
Enys Mones,
Gábor Czimbalmos,
Máté Csaba Sándor
Abstract:
Correlations and other collective phenomena in a schematic model of heterogeneous binary agents (individual spin-glass samples) are considered on the complete graph and also on 2d and 3d regular lattices. The system's stochastic dynamics is studied by numerical simulations. The dynamics is so slow that one can meaningfully speak of quasi-equilibrium states. Performing measurements of correlations in such a quasi-equilibrium state we find that they are random both as to their sign and absolute value, but on average they fall off very slowly with distance in all instances that we have studied. This means that the system is essentially non-local, small changes at one end may have a strong impact at the other. Correlations and other local quantities are extremely sensitive to the boundary conditions all across the system, although this sensitivity disappears upon averaging over the samples or partially averaging over the agents. The strong, random correlations tend to organize a large fraction of the agents into strongly correlated clusters that act together. If we think about this model as a distant metaphor of economic agents or bank networks, the systemic risk implications of this tendency are clear: any impact on even a single strongly correlated agent will spread, in an unforeseeable manner, to the whole system via the strong random correlations.
Submitted 24 February, 2014; v1 submitted 11 October, 2012;
originally announced October 2012.
-
Hierarchy in directed random networks
Authors:
Enys Mones
Abstract:
In recent years, the theory and application of complex networks have been quickly developing in a remarkable way due to the increasing amount of data from real systems and to the fruitful application of powerful methods used in statistical physics. Many important characteristics of social or biological systems can be described by the study of their underlying structure of interactions. Hierarchy is one of these features that can be formulated in the language of networks. In this paper we present some (qualitative) analytic results on the hierarchical properties of random network models with zero correlations and also investigate, mainly numerically, the effects of different types of correlations. The behavior of hierarchy is different in the absence and the presence of the giant components. We show that the hierarchical structure can be drastically different if there are one-point correlations in the network. We also show numerical results suggesting that hierarchy does not change monotonically with the correlations and there is an optimal level of non-zero correlations maximizing the level of hierarchy.
Submitted 4 February, 2013; v1 submitted 30 August, 2012;
originally announced August 2012.
-
Hierarchy measure for complex networks
Authors:
Enys Mones,
Lilla Vicsek,
Tamás Vicsek
Abstract:
Nature, technology and society are full of complexity arising from the intricate web of the interactions among the units of the related systems (e.g., proteins, computers, people). Consequently, one of the most successful recent approaches to capturing the fundamental features of the structure and dynamics of complex systems has been the investigation of the networks associated with the above units (nodes) together with their relations (edges). Most complex systems have an inherently hierarchical organization and, correspondingly, the networks behind them also exhibit hierarchical features. Indeed, several papers have been devoted to describing this essential aspect of networks, however, without resulting in a widely accepted, converging concept concerning the quantitative characterization of the level of their hierarchy. Here we develop an approach and propose a quantity (measure) which is simple enough to be widely applicable, reveals a number of universal features of the organization of real-world networks and, as we demonstrate, is capable of capturing the essential features of the structure and the degree of hierarchy in a complex network. The measure we introduce is based on a generalization of the m-reach centrality, which we first extend to directed/partially directed graphs. Then, we define the global reaching centrality (GRC), which is the difference between the maximum and the average value of the generalized reach centralities over the network. We investigate the behavior of the GRC considering both a synthetic model with an adjustable level of hierarchy and real networks. Results for real networks show that our hierarchy measure is related to the controllability of the given system. We also propose a visualization procedure for large complex networks that can be used to obtain an overall qualitative picture about the nature of their hierarchical structure.
Submitted 2 February, 2012; v1 submitted 1 February, 2012;
originally announced February 2012.
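Illustration:
The GRC described in the abstract above can be sketched in a few lines of Python. This is a minimal sketch, not the authors' implementation: it assumes an unweighted directed graph, takes a node's reach centrality to be the fraction of other nodes reachable from it along directed paths, and follows the abstract's wording literally (maximum minus average). The paper's exact normalization may differ. The function names and the adjacency-dictionary format are hypothetical.

```python
from collections import deque


def local_reaching_centrality(adj, node):
    """Fraction of the other nodes reachable from `node` along directed edges.

    `adj` maps every node to an iterable of its out-neighbours
    (assumed format; every node must appear as a key).
    """
    seen = {node}
    queue = deque([node])
    while queue:  # plain breadth-first search
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return (len(seen) - 1) / (len(adj) - 1)


def global_reaching_centrality(adj):
    """Maximum minus average local reach, as worded in the abstract."""
    values = [local_reaching_centrality(adj, u) for u in adj]
    return max(values) - sum(values) / len(values)


# A directed star (one node pointing at three leaves) is strongly hierarchical;
# a directed cycle, where every node reaches every other, is not.
star = {0: [1, 2, 3], 1: [], 2: [], 3: []}
cycle = {0: [1], 1: [2], 2: [0]}
```

Under these assumptions, the star yields a high value because only one node reaches the others, while the cycle yields zero because all nodes have identical reach.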