-
Two-step estimators of high dimensional correlation matrices
Authors:
Andrés García-Medina,
Salvatore Miccichè,
Rosario N. Mantegna
Abstract:
We investigate block diagonal and hierarchical nested stochastic multivariate Gaussian models by studying their sample cross-correlation matrix on high dimensions. By performing numerical simulations, we compare a filtered sample cross-correlation with the population cross-correlation matrices by using several rotationally invariant estimators (RIE) and hierarchical clustering estimators (HCE) under several loss functions. We show that at large but finite sample size, sample cross-correlation filtered by RIE estimators are often outperformed by HCE estimators for several of the loss functions. We also show that for block models and for hierarchically nested block models the best determination of the filtered sample cross-correlation is achieved by introducing two-step estimators combining state-of-the-art non-linear shrinkage models with hierarchical clustering estimators.
Submitted 10 October, 2023; v1 submitted 30 December, 2022;
originally announced December 2022.
-
Identifying maximal sets of significantly interacting nodes in higher-order networks
Authors:
Federico Musciotto,
Federico Battiston,
Rosario N. Mantegna
Abstract:
We introduce a method for the detection of Statistically Validated Simplices in higher-order networks. Statistically validated simplices represent the maximal sets of nodes of any size that consistently interact collectively and do not include co-interacting nodes that appear only occasionally. Using properly designed higher-order benchmarks, we show that our approach is highly effective in systems where the maximal sets are likely to be diluted into interactions of larger sizes that include occasional participants. By applying our method to two real world datasets, we also show how it allows the detection of simplices whose nodes are characterized by significant levels of similarity, providing new insights on the generative processes of real world higher-order networks.
Submitted 26 September, 2022;
originally announced September 2022.
-
Quantifying the relationship between specialisation and reputation in an online platform
Authors:
Giacomo Livan,
Giuseppe Pappalardo,
Rosario N. Mantegna
Abstract:
Online platforms experience a tension between decentralisation and incentives to steer user behaviour, which are usually implemented through digital reputation systems. We provide a statistical characterisation of the user behaviour emerging from the interplay of such competing forces in Stack Overflow, a long-standing knowledge sharing platform. Over the 11 years covered by our analysis, we find that the platform's user base consistently self-organise into specialists and generalists, i.e., users who focus their activity on narrow and broad sets of topics, respectively. We relate the emergence of these behaviours to the platform's reputation system with a series of data-driven models, and find specialisation to be statistically associated with a higher ability to post the best answers to a question. Our findings are in stark contrast with observations made in top-down environments - such as firms and corporations - where generalist skills are consistently found to be more successful.
Submitted 13 November, 2021;
originally announced November 2021.
-
Detecting informative higher-order interactions in statistically validated hypergraphs
Authors:
Federico Musciotto,
Federico Battiston,
Rosario N. Mantegna
Abstract:
Recent empirical evidence has shown that in many real-world systems, successfully represented as networks, interactions are not limited to dyads, but often involve three or more agents at a time. These data are better described by hypergraphs, where hyperlinks encode higher-order interactions among a group of nodes. In spite of the large number of works on networks, highlighting informative hyperlinks in hypergraphs obtained from real world data is still an open problem. Here we propose an analytic approach to filter hypergraphs by identifying those hyperlinks that are over-expressed with respect to a random null hypothesis, and represent the most relevant higher-order connections. We apply our method to a class of synthetic benchmarks and to several datasets. For all cases, the method highlights hyperlinks that are more informative than those extracted with pairwise approaches. Our method provides a first way to obtain statistically validated hypergraphs, separating informative connections from redundant and noisy ones.
Submitted 31 March, 2021; v1 submitted 30 March, 2021;
originally announced March 2021.
-
Dynamics of fintech terms in news and blogs and specialization of companies of the fintech industry
Authors:
Fabio Ciulla,
Rosario N. Mantegna
Abstract:
We perform a large scale analysis of a list of fintech terms in (i) news and blogs in English language and (ii) professional descriptions of companies operating in many countries. The occurrence and co-occurrence of fintech terms and locutions shows a progressive evolution of the list of fintech terms in a compact and coherent set of terms used worldwide to describe fintech business activities. By using methods of complex networks that are specifically designed to deal with heterogeneous systems, our analysis of a large set of professional descriptions of companies shows that companies having fintech terms in their description present over-expressions of specific attributes of country, municipality, and economic sector. By using the approach of statistically validated networks, we detect geographical and economic over-expressions of a set of companies related to the multi-industry, geographically and economically distributed fintech movement.
Submitted 14 July, 2020;
originally announced July 2020.
-
A primer on statistically validated networks
Authors:
Salvatore Miccichè,
Rosario Nunzio Mantegna
Abstract:
In this contribution we discuss some approaches of network analysis providing information about single links or single nodes with respect to a null hypothesis taking into account the heterogeneity of the system empirically observed. With this approach, a selection of nodes and links is feasible when the null hypothesis is statistically rejected. We focus our discussion on approaches using (i) the so-called disparity filter and (ii) statistically validated network in bipartite networks. For both methods we discuss the importance of using multiple hypothesis test correction. Specific applications of statistically validated networks are discussed. We also discuss how statistically validated networks can be used to (i) pre-process large sets of data and (ii) detect cores of communities that are forming the most close-knit and stable subsets of clusters of nodes present in a complex system.
Submitted 19 February, 2019;
originally announced February 2019.
-
Core of communities in bipartite networks
Authors:
Christian Bongiorno,
András London,
Salvatore Miccichè,
Rosario N. Mantegna
Abstract:
We use the information present in a bipartite network to detect cores of communities of each set of the bipartite system. Cores of communities are found by investigating statistically validated projected networks obtained using information present in the bipartite network. Cores of communities are highly informative and robust with respect to the presence of errors or missing entries in the bipartite network. We assess the statistical robustness of cores by investigating an artificial benchmark network, the co-authorship network, and the actor-movie network. The accuracy and precision of the partition obtained with respect to the reference partition are measured in terms of the adjusted Rand index and of the adjusted Wallace index respectively. The detection of cores is highly precise although the accuracy of the methodology can be limited in some cases.
Submitted 6 March, 2017;
originally announced April 2017.
-
An empirically grounded agent based model for modeling directs, conflict detection and resolution operations in Air Traffic Management
Authors:
C. Bongiorno,
S. Miccichè,
Rosario N. Mantegna
Abstract:
We present an agent based model of the Air Traffic Management socio-technical complex system that aims at modeling the interactions between aircraft and air traffic controllers at a tactical level. The core of the model is given by the conflict detection and resolution module and by the directs module. Directs are flight shortcuts that are given by air controllers to speed up the passage of an aircraft within a certain airspace and therefore to facilitate airline operations. Conflicts between flight trajectories can arise during the en-route phase of each flight due to either insufficiently detailed flight trajectory planning or unforeseen events that perturb the planned flight plan. Our model performs a local conflict detection and resolution procedure. Once a flight trajectory has been made conflict-free, the model searches for possible improvements of the system efficiency by issuing directs. We give an example of model calibration based on real data. We then provide an illustration of the capability of our model in generating scenario simulations able to give insights about the air traffic management system. We show that the calibrated model is able to reproduce the existence of a geographical localization of air traffic controllers' operations. Finally, we use the model to investigate the relationship between directs and conflict resolutions (i) in the presence of perfect forecast ability of controllers, and (ii) in the presence of some degree of uncertainty in flight trajectory forecast.
Submitted 26 September, 2016;
originally announced September 2016.
-
Statistical characterization of deviations from planned flight trajectories in air traffic management
Authors:
C. Bongiorno,
G. Gurtner,
F. Lillo,
R. N. Mantegna,
S. Miccichè
Abstract:
Understanding the relation between planned and realized flight trajectories and the determinants of flight deviations is of great importance in air traffic management. In this paper we perform an in-depth investigation of the statistical properties of planned and realized air traffic in the German airspace during a 28-day period, corresponding to an AIRAC cycle. We find that realized trajectories are on average shorter than planned ones and this effect is stronger during night-time than daytime. Flights are more frequently deviated close to the departure airport and at a relatively large angle to destination. Moreover, the probability of a deviation is higher in low traffic phases. All this evidence indicates that deviations are mostly used by controllers to give directs to flights when traffic conditions allow it. Finally we introduce a new metric, termed difork, which is able to characterize navigation points according to the likelihood that a deviation occurs there. Difork makes it possible to identify in a statistically rigorous way navigation point pairs where deviations are more (less) frequent than expected under a null hypothesis of randomness that takes into account the heterogeneity of the navigation points. Such pairs can therefore be seen as sources of flexibility (stability) of controllers' traffic management while conjugating safety and efficiency.
Submitted 9 March, 2016;
originally announced March 2016.
-
Backbone of credit relationships in the Japanese credit market
Authors:
Luca Marotta,
Salvatore Miccichè,
Yoshi Fujiwara,
Hiroshi Iyetomi,
Hideaki Aoyama,
Mauro Gallegati,
Rosario N. Mantegna
Abstract:
We detect the backbone of the weighted bipartite network of the Japanese credit market relationships. The backbone is detected by adapting a general method used in the investigation of weighted networks. With this approach we detect a backbone that is statistically validated against a null hypothesis of uniform diversification of loans for banks and firms. Our investigation is done year by year and it covers more than thirty years during the period from 1980 to 2011. We relate some of our findings with economic events that have characterized the Japanese credit market during the last years. The study of the time evolution of the backbone allows us to detect changes occurred in network size, fraction of credit explained, and attributes characterizing the banks and the firms present in the backbone.
Submitted 21 November, 2015;
originally announced November 2015.
-
Hybrid recommendation methods in complex networks
Authors:
A. Fiasconaro,
M. Tumminello,
V. Nicosia,
V. Latora,
R. N. Mantegna
Abstract:
We propose here two new recommendation methods, based on the appropriate normalization of already existing similarity measures, and on the convex combination of the recommendation scores derived from similarity between users and between objects. We validate the proposed measures on three relevant data sets, and we compare their performance with several recommendation systems recently proposed in the literature. We show that the proposed similarity measures allow an improvement in performance of up to 20% with respect to existing non-parametric methods, and that the accuracy of a recommendation can vary widely from one specific bipartite network to another, which suggests that a careful choice of the most suitable method is highly relevant for an effective recommendation on a given system. Finally, we studied how an increasing presence of random links in the network affects the recommendation scores, and we found that one of the two recommendation algorithms introduced here can systematically outperform the others in noisy data sets.
Submitted 10 December, 2014;
originally announced December 2014.
-
Sicily and the development of Econophysics: the pioneering work of Ettore Majorana and the Econophysics Workshop in Palermo
Authors:
Rosario N. Mantegna
Abstract:
Sicily has played an important role in the development of the new research area named "Econophysics". In fact some key ideas supporting this new hybrid discipline were originally formulated in a pioneering work of the Sicilian born physicist Ettore Majorana. The article he wrote was entitled "The value of statistical laws in physics and social sciences". I will discuss its origin and history that has been recently discovered in the study of Stefano Roncoroni. This recent study documents the true reasons and motivations that triggered the pioneering work of Majorana. It also shows that the description of this work provided by Edoardo Amaldi was shallow and misleading. In the second part of the talk I will recollect the first years of development of econophysics and in particular the role of the "International Workshop on Econophysics and Statistical Finance" held in Palermo on 28-30 September 1998 and the setting in 1999 of the "Observatory of Complex Systems" the research group on Econophysics of Palermo University and Istituto Nazionale di Fisica della Materia.
Submitted 1 September, 2014;
originally announced September 2014.
-
Bank-firm credit network in Japan. An analysis of a bipartite network
Authors:
Luca Marotta,
Salvatore Miccichè,
Yoshi Fujiwara,
Hiroshi Iyetomi,
Hideaki Aoyama,
Mauro Gallegati,
Rosario N. Mantegna
Abstract:
We present an analysis of the credit market of Japan. The analysis is performed by investigating the bipartite network of banks and firms which is obtained by setting a link between a bank and a firm when a credit relationship is present in a given time window. In our investigation we focus on a community detection algorithm which is identifying communities composed by both banks and firms. We show that the clusters obtained by directly working on the bipartite network carry information about the networked nature of the Japanese credit market. Our analysis is performed for each calendar year during the time period from 1980 to 2011. Specifically, we obtain communities of banks and networks for each of the 32 investigated years, and we introduce a method to track the time evolution of these communities on a statistical basis. We then characterize communities by detecting the simultaneous over-expression of attributes of firms and banks. Specifically, we consider as attributes the economic sector and the geographical location of firms and the type of banks. In our 32 year long analysis we detect a persistence of the over-expression of attributes of clusters of banks and firms together with a slow dynamics of changes from some specific attributes to new ones. Our empirical observations show that the credit market in Japan is a networked market where the type of banks, geographical location of firms and banks and economic sector of the firm play a role in shaping the credit relationships between banks and firms.
Submitted 21 July, 2014;
originally announced July 2014.
-
Statistically validated mobile communication networks: Evolution of motifs in European and Chinese data
Authors:
Ming-Xia Li,
Vasyl Palchykov,
Zhi-Qiang Jiang,
Kimmo Kaski,
Janos Kertész,
Salvatore Miccichè,
Michele Tumminello,
Wei-Xing Zhou,
Rosario N. Mantegna
Abstract:
Big data open up unprecedented opportunities to investigate complex systems including the society. In particular, communication data serve as major sources for computational social sciences but they have to be cleaned and filtered as they may contain spurious information due to recording errors as well as interactions, like commercial and marketing activities, not directly related to the social network. The network constructed from communication data can only be considered as a proxy for the network of social relationships. Here we apply a systematic method, based on multiple hypothesis testing, to statistically validate the links and then construct the corresponding Bonferroni network, generalized to the directed case. We study two large datasets of mobile phone records, one from Europe and the other from China. For both datasets we compare the raw data networks with the corresponding Bonferroni networks and point out significant differences in the structures and in the basic network measures. We show evidence that the Bonferroni network provides a better proxy for the network of social interactions than the original one. By using the filtered networks we investigated the statistics and temporal evolution of small directed 3-motifs and conclude that closed communication triads have a formation time-scale, which is quite fast and typically intraday. We also find that open communication triads preferentially evolve to other open triads with a higher fraction of reciprocated calls. These stylized facts were observed for both datasets.
Submitted 15 March, 2014;
originally announced March 2014.
-
A comparative analysis of the statistical properties of large mobile phone calling networks
Authors:
Ming-Xia Li,
Zhi-Qiang Jiang,
Wen-Jie Xie,
Salvatore Miccichè,
Michele Tumminello,
Wei-Xing Zhou,
Rosario N. Mantegna
Abstract:
Mobile phone calling is one of the most widely used communication methods in modern society. The records of calls among mobile phone users provide us a valuable proxy for the understanding of human communication patterns embedded in social networks. Mobile phone users call each other forming a directed calling network. If only reciprocal calls are considered, we obtain an undirected mutual calling network. The preferential communication behavior between two connected users can be statistically tested and it results in two Bonferroni networks with statistically validated edges. We perform a comparative analysis of the statistical properties of these four networks, which are constructed from the calling records of more than nine million individuals in Shanghai over a period of 110 days. We find that these networks share many common structural properties and also exhibit idiosyncratic features when compared with previously studied large mobile calling networks. The empirical findings provide us an intriguing picture of a representative large social network that might shed new lights on the modelling of large social networks.
Submitted 6 May, 2014; v1 submitted 25 February, 2014;
originally announced February 2014.
-
Evolution of correlation structure of industrial indices of US equity markets
Authors:
Giuseppe Buccheri,
Stefano Marmi,
Rosario N. Mantegna
Abstract:
We investigate the dynamics of correlations present between pairs of industry indices of US stocks traded in US markets by studying correlation based networks and spectral properties of the correlation matrix. The study is performed by using 49 industry index time series computed by K. French and E. Fama during the time period from July 1969 to December 2011 that is spanning more than 40 years. We show that the correlation between industry indices presents both a fast and a slow dynamics. The slow dynamics has a time scale longer than five years showing that a different degree of diversification of the investment is possible in different periods of time. On top to this slow dynamics, we also detect a fast dynamics associated with exogenous or endogenous events. The fast time scale we use is a monthly time scale and the evaluation time period is a 3 month time period. By investigating the correlation dynamics monthly, we are able to detect two examples of fast variations in the first and second eigenvalue of the correlation matrix. The first occurs during the dot-com bubble (from March 1999 to April 2001) and the second occurs during the period of highest impact of the subprime crisis (from August 2008 to August 2009).
Submitted 20 June, 2013;
originally announced June 2013.
-
Multi-scale analysis of the European airspace using network community detection
Authors:
Gérald Gurtner,
Stefania Vitali,
Marco Cipolla,
Fabrizio Lillo,
Rosario Nunzio Mantegna,
Salvatore Miccichè,
Simone Pozzi
Abstract:
We show that the European airspace can be represented as a multi-scale traffic network whose nodes are airports, sectors, or navigation points and links are defined and weighted according to the traffic of flights between the nodes. By using a unique database of the air traffic in the European airspace, we investigate the architecture of these networks with a special emphasis on their community structure. We propose that unsupervised network community detection algorithms can be used to monitor the current use of the airspaces and improve it by guiding the design of new ones. Specifically, we compare the performance of three community detection algorithms, also by using a null model which takes into account the spatial distance between nodes, and we discuss their ability to find communities that could be used to define new control units of the airspace.
Submitted 17 June, 2013;
originally announced June 2013.
-
Identification of clusters of investors from their real trading activity in a financial market
Authors:
Michele Tumminello,
Fabrizio Lillo,
Jyrki Piilo,
Rosario N. Mantegna
Abstract:
We use statistically validated networks, a recently introduced method to validate links in a bipartite system, to identify clusters of investors trading in a financial market. Specifically, we investigate a special database allowing to track the trading activity of individual investors of the stock Nokia. We find that many statistically detected clusters of investors show a very high degree of synchronization in the time when they decide to trade and in the trading action taken. We investigate the composition of these clusters and we find that several of them show an over-expression of specific categories of investors.
Submitted 20 July, 2011;
originally announced July 2011.
-
Evolution of worldwide stock markets, correlation structure and correlation based graphs
Authors:
Dong-Ming Song,
Michele Tumminello,
Wei-Xing Zhou,
Rosario N. Mantegna
Abstract:
We investigate the daily correlation present among market indices of stock exchanges located all over the world in the time period Jan 1996 - Jul 2009. We discover that the correlation among market indices presents both a fast and a slow dynamics. The slow dynamics reflects the development and consolidation of globalization. The fast dynamics is associated with critical events that originate in a specific country or region of the world and rapidly affect the global system. We provide evidence that the short term timescale of correlation among market indices is less than 3 trading months (about 60 trading days). The average values of the non diagonal elements of the correlation matrix, correlation based graphs and the spectral properties of the largest eigenvalues and eigenvectors of the correlation matrix are carrying information about the fast and slow dynamics of correlation of market indices. We introduce a measure of mutual information based on link co-occurrence in networks, in order to detect the fast dynamics of successive changes of correlation based graphs in a quantitative way.
Submitted 29 March, 2011;
originally announced March 2011.
-
Do firms share the same functional form of their growth rate distribution? A new statistical test
Authors:
Josè T. Lunardi,
Salvatore Miccichè,
Fabrizio Lillo,
Rosario N. Mantegna,
Mauro Gallegati
Abstract:
We introduce a new statistical test of the hypothesis that a balanced panel of firms have the same growth rate distribution or, more generally, that they share the same functional form of growth rate distribution. We applied the test to data on publicly quoted manufacturing firms in the European Union and the US, considering functional forms belonging to the Subbotin family of distributions. While our hypotheses are rejected for the vast majority of sets at the sector level, we cannot reject them at the subsector level, indicating that homogeneous panels of firms could be described by a common functional form of growth rate distribution.
Submitted 11 March, 2011;
originally announced March 2011.
-
Community characterization of heterogeneous complex systems
Authors:
Michele Tumminello,
Salvatore Miccichè,
Fabrizio Lillo,
Jan Varho,
Jyrki Piilo,
Rosario N. Mantegna
Abstract:
We introduce an analytical statistical method to characterize the communities detected in heterogeneous complex systems. By posing a suitable null hypothesis, our method makes use of the hypergeometric distribution to assess the probability that a given property is over-expressed in the elements of a community with respect to all the elements of the investigated set. We apply our method to two specific complex networks, namely a network of world movies and a network of physics preprints. The characterization of the elements and of the communities is done in terms of languages and countries for the movie network and of journals and subject categories for papers. We find that our method clearly characterizes the identified communities. Moreover, our method works well both for large and for small communities.
Submitted 18 November, 2010;
originally announced November 2010.
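The hypergeometric over-expression test mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the function name, the parameter names and the one-sided tail convention P(X >= observed) are assumptions made for illustration, and the abstract does not specify how multiple tests are handled.

```python
from math import comb


def overexpression_pvalue(n_total, n_with_property, community_size, observed):
    """One-sided hypergeometric p-value (illustrative helper, hypothetical name).

    Returns P(X >= observed), where X is the number of elements carrying a
    given property in a community of `community_size` elements drawn without
    replacement from `n_total` elements, `n_with_property` of which carry it.
    """
    upper = min(n_with_property, community_size)
    total = comb(n_total, community_size)
    # Sum the hypergeometric probabilities over the upper tail.
    tail = sum(
        comb(n_with_property, k)
        * comb(n_total - n_with_property, community_size - k)
        for k in range(observed, upper + 1)
    )
    return tail / total


# Toy numbers (not from the paper): 10 elements, 5 with the property,
# a community of 5 elements that all carry the property.
p = overexpression_pvalue(10, 5, 5, 5)
print(p)
```

A small p-value suggests that the property appears in the community more often than expected under random assignment; the threshold to use is left open here.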
-
Statistically validated networks in bipartite complex systems
Authors:
Michele Tumminello,
Salvatore Miccichè,
Fabrizio Lillo,
Jyrki Piilo,
Rosario N. Mantegna
Abstract:
Many complex systems present an intrinsic bipartite nature and are often described and modeled in terms of networks [1-5]. Examples include movies and actors [1, 2, 4], authors and scientific papers [6-9], email accounts and emails [10], plants and animals that pollinate them [11, 12]. Bipartite networks are often very heterogeneous in the number of relationships that the elements of one set establish with the elements of the other set. When one constructs a projected network with nodes from only one set, the system heterogeneity makes it very difficult to identify preferential links between the elements. Here we introduce an unsupervised method to statistically validate each link of the projected network against a null hypothesis taking into account the heterogeneity of the system. We apply our method to three different systems, namely the set of clusters of orthologous genes (COG) in completely sequenced genomes [13, 14], a set of daily returns of 500 US financial stocks, and the set of world movies of the IMDb database [15]. In all these systems, which differ both in size and in level of heterogeneity, we find that our method is able to detect network structures which are informative about the system and are not simply an expression of its heterogeneity. Specifically, our method (i) identifies the preferential relationships between the elements, (ii) naturally highlights the clustered structure of investigated systems, and (iii) allows one to classify links according to the type of statistically validated relationships between the connected nodes.
Submitted 8 August, 2010;
originally announced August 2010.
-
When do improved covariance matrix estimators enhance portfolio optimization? An empirical comparative study of nine estimators
Authors:
Ester Pantaleo,
Michele Tumminello,
Fabrizio Lillo,
Rosario N. Mantegna
Abstract:
The use of improved covariance matrix estimators as an alternative to the sample estimator is considered an important approach for enhancing portfolio optimization. Here we empirically compare the performance of 9 improved covariance estimation procedures by using daily returns of 90 highly capitalized US stocks for the period 1997-2007. We find that the usefulness of covariance matrix estimators strongly depends on the ratio between estimation period T and number of stocks N, on the presence or absence of short selling, and on the performance metric considered. When short selling is allowed, several estimation methods achieve a realized risk that is significantly smaller than the one obtained with the sample covariance method. This is particularly true when T/N is close to one. Moreover, many estimators reduce the fraction of negative portfolio weights, while little improvement is achieved in the degree of diversification. In contrast, when short selling is not allowed and T>N, the considered methods are unable to outperform the sample covariance in terms of realized risk but can give much more diversified portfolios than the one obtained with the sample covariance. When T<N, the use of the sample covariance matrix and of the pseudoinverse gives portfolios with very poor performance.
Submitted 24 April, 2010;
originally announced April 2010.
-
Correlation, hierarchies, and networks in financial markets
Authors:
M. Tumminello,
F. Lillo,
R. N. Mantegna
Abstract:
We discuss some methods to quantitatively investigate the properties of correlation matrices. Correlation matrices play an important role in portfolio optimization and in several other quantitative descriptions of asset price dynamics in financial markets. Specifically, we discuss how to define and obtain hierarchical trees, correlation based trees and networks from a correlation matrix. The hierarchical clustering and other procedures performed on the correlation matrix to detect statistically reliable aspects of the correlation matrix are seen as filtering procedures of the correlation matrix. We also discuss a method to associate a hierarchically nested factor model to a hierarchical tree obtained from a correlation matrix. The information retained in filtering procedures and its stability with respect to statistical fluctuations is quantified by using the Kullback-Leibler distance.
Submitted 26 September, 2008;
originally announced September 2008.
-
Generation of hierarchically correlated multivariate symbolic sequences
Authors:
Mi. Tumminello,
F. Lillo,
R. N. Mantegna
Abstract:
We introduce an algorithm to generate multivariate series of symbols from a finite alphabet with a given hierarchical structure of similarities. The target hierarchical structure of similarities is arbitrary, for instance the one obtained by some hierarchical clustering procedure as applied to an empirical matrix of Hamming distances. The algorithm can be interpreted as the finite alphabet equivalent of the recently introduced hierarchically nested factor model (M. Tumminello et al. EPL 78 (3) 30006 (2007)). The algorithm is based on a generating mechanism that is different from the one used in the mutation rate approach. We apply the proposed methodology for investigating the relationship between the bootstrap value associated with a node of a phylogeny and the probability of finding that node in the true phylogeny.
Submitted 12 February, 2008;
originally announced February 2008.
-
Shrinkage and spectral filtering of correlation matrices: a comparison via the Kullback-Leibler distance
Authors:
M. Tumminello,
F. Lillo,
R. N. Mantegna
Abstract:
The problem of filtering information from large correlation matrices is of great importance in many applications. We have recently proposed the use of the Kullback-Leibler distance to measure the performance of filtering algorithms in recovering the underlying correlation matrix when the variables are described by a multivariate Gaussian distribution. Here we use the Kullback-Leibler distance to investigate the performance of filtering methods based on Random Matrix Theory and on the shrinkage technique. We also present some results on the application of the Kullback-Leibler distance to multivariate data which are not Gaussian distributed.
Submitted 2 October, 2007;
originally announced October 2007.
-
Specialization of strategies and herding behavior of trading firms in a financial market
Authors:
Fabrizio Lillo,
Esteban Moro,
Gabriella Vaglica,
Rosario N. Mantegna
Abstract:
The understanding of complex social or economic systems is an important scientific challenge. Here we present a comprehensive study of the Spanish Stock Exchange showing that most financial firms trading in that market are characterized by a resulting strategy and can be classified in groups of firms with different specialization. A few large firms act overall as trending firms, whereas many heterogeneous firms act as reversing firms. The herding properties of these two groups are markedly different and consistently observed over a four-year period of trading.
Submitted 3 July, 2007;
originally announced July 2007.
-
Kullback-Leibler distance as a measure of the information filtered from multivariate data
Authors:
Michele Tumminello,
Fabrizio Lillo,
Rosario Nunzio Mantegna
Abstract:
We show that the Kullback-Leibler distance is a good measure of the statistical uncertainty of correlation matrices estimated by using a finite set of data. For correlation matrices of multivariate Gaussian variables we analytically determine the expected values of the Kullback-Leibler distance of a sample correlation matrix from a reference model and we show that the expected values are known also when the specific model is unknown. We propose to make use of the Kullback-Leibler distance to estimate the information extracted from a correlation matrix by correlation filtering procedures. We also show how to use this distance to measure the stability of filtering procedures with respect to statistical uncertainty. We explain the effectiveness of our method by comparing four filtering procedures, two of them being based on spectral analysis and the other two on hierarchical clustering. We compare these techniques as applied both to simulations of factor models and to empirical data. We investigate the ability of these filtering procedures to recover the correlation matrix of models from simulations. We discuss this ability in terms of both the heterogeneity of model parameters and the length of data series. We also show that the two spectral techniques are typically more informative about the sample correlation matrix than techniques based on hierarchical clustering, whereas the latter are more stable with respect to statistical uncertainty.
Submitted 1 June, 2007;
originally announced June 2007.
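For reference, the Kullback-Leibler distance between two zero-mean multivariate Gaussian distributions has a standard closed form. It is reproduced here as a textbook identity, not quoted from the paper; the symbols $\Sigma_1$, $\Sigma_2$ (the two correlation matrices) and $n$ (the number of variables) are notation assumed for this note.

```latex
K(\Sigma_1, \Sigma_2)
  = \frac{1}{2}\left[
      \ln\frac{\det \Sigma_2}{\det \Sigma_1}
      + \operatorname{tr}\!\left(\Sigma_2^{-1}\Sigma_1\right)
      - n
    \right]
```

The expression vanishes when $\Sigma_1 = \Sigma_2$ and is not symmetric in its two arguments, so the order in which a sample matrix and a reference model are compared matters.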
-
Scaling laws of strategic behaviour and size heterogeneity in agent dynamics
Authors:
Gabriella Vaglica,
Fabrizio Lillo,
Esteban Moro,
Rosario N. Mantegna
Abstract:
The dynamics of many socioeconomic systems is determined by the decision making process of agents. The decision process depends on agents' characteristics, such as preferences, risk aversion, behavioral biases, etc. In addition, in some systems the size of agents can be highly heterogeneous, leading to very different impacts of agents on the system dynamics. The large size of some agents poses challenging problems to agents who want to control their impact, either by forcing the system in a given direction or by hiding their intentionality. Here we consider the financial market as a model system, and we study empirically how agents strategically adjust the properties of large orders in order to meet their preference and minimize their impact. We quantify this strategic behavior by detecting scaling relations of allometric nature between the variables characterizing the trading activity of different institutions. We observe power law distributions in the investment time horizon, in the number of transactions needed to execute a large order and in the traded value exchanged by large institutions, and we show that heterogeneity of agents is a key ingredient for the emergence of some aggregate properties characterizing this complex system.
Submitted 16 April, 2007;
originally announced April 2007.
-
Diffusive behavior and the modeling of characteristic times in limit order executions
Authors:
Zoltan Eisler,
Janos Kertesz,
Fabrizio Lillo,
Rosario N. Mantegna
Abstract:
We present an empirical study of the first passage time (FPT) of order book prices needed to observe a prescribed price change Delta, the time to fill (TTF) for executed limit orders and the time to cancel (TTC) for canceled ones in a double auction market. We find that the distribution of all three quantities decays asymptotically as a power law, but that of FPT has significantly fatter tails than that of TTF. Thus a simple first passage time model cannot account for the observed TTF of limit orders. We propose that the origin of this difference is the presence of cancellations. We outline a simple model, which assumes that prices are characterized by the empirically observed distribution of the first passage time and orders are canceled randomly with lifetimes that are asymptotically power law distributed with an exponent lambda_LT. In spite of the simplifying assumptions of the model, the inclusion of cancellations is enough to account for the above observations and enables one to estimate characteristics of the cancellation strategies from empirical data.
Submitted 21 December, 2008; v1 submitted 30 January, 2007;
originally announced January 2007.
-
Economic sector identification in a set of stocks traded at the New York Stock Exchange: a comparative analysis
Authors:
C. Coronnello,
M. Tumminello,
F. Lillo,
S. Miccichè,
R. N. Mantegna
Abstract:
We review some methods recently used in the literature to detect the existence of a certain degree of common behavior of stock returns belonging to the same economic sector. Specifically, we discuss methods based on random matrix theory and hierarchical clustering techniques. We apply these methods to a set of stocks traded at the New York Stock Exchange. The investigated time series are recorded at a daily time horizon.
All the considered methods are able to detect economic information and the presence of clusters characterized by the economic sector of stocks. However, different methodologies provide different information about the considered set. Our comparative analysis suggests that the application of just a single method may not be able to extract all the economic information present in the correlation coefficient matrix of a set of stocks.
Submitted 5 September, 2006;
originally announced September 2006.
-
The Tenth Article of Ettore Majorana
Authors:
Rosario Nunzio Mantegna
Abstract:
This year is the centenary of the birth of Ettore Majorana, one of the major Italian physicists of all time. In this note we briefly sketch a few biographical details about Ettore Majorana and introduce and discuss the main points of Majorana's 10th article. In his article Majorana explicitly considers quantum mechanics as an irreducible statistical theory because the theory is not able to describe the time evolution of a single particle or atom in a precise environment at a deterministic level. This lack of determinism at the level of an elementary physical system motivated him to suggest a formal analogy between statistical laws observed in physics and in the social sciences. We hope the occasion of the centenary of the birth of Ettore Majorana will be useful to remember and to reconsider not only his exceptional achievements in theoretical physics but also his fresh and original views on the role of statistical laws in physics and in other disciplines such as the social sciences.
Submitted 29 August, 2006;
originally announced August 2006.
-
Market reaction to temporary liquidity crises and the permanent market impact
Authors:
Adam Ponzi,
Fabrizio Lillo,
Rosario N. Mantegna
Abstract:
We study the relaxation dynamics of the bid-ask spread and of the midprice after a sudden, large variation of the spread, corresponding to a temporary crisis of liquidity in a double auction financial market. We find that the spread decays very slowly to its normal value as a consequence of the strategic limit order placement of liquidity providers. We consider several quantities, such as order placement rates and distribution, that affect the decay of the spread. We measure the permanent impact both of a generic event altering the spread and of a single transaction and we find an approximately linear relation between immediate and permanent impact in both cases.
Submitted 3 August, 2006;
originally announced August 2006.
-
Correlation based networks of equity returns sampled at different time horizons
Authors:
M. Tumminello,
T. Di Matteo,
T. Aste,
R. N. Mantegna
Abstract:
We investigate the planar maximally filtered graphs of the portfolio of the 300 most capitalized stocks traded at the New York Stock Exchange during the time period 2001-2003. Topological properties such as the average length of shortest paths, the betweenness and the degree are computed on different planar maximally filtered graphs generated by sampling the returns at different time horizons ranging from 5 min up to one trading day. This analysis confirms that the selected stocks compose a hierarchical system progressively structuring as the sampling time horizon increases. Finally, a cluster formation, associated with economic sectors, is quantitatively investigated.
Submitted 3 April, 2007; v1 submitted 30 May, 2006;
originally announced May 2006.
-
Spanning Trees and bootstrap reliability estimation in correlation based networks
Authors:
M. Tumminello,
C. Coronnello,
F. Lillo,
S. Miccichè,
R. N. Mantegna
Abstract:
We introduce a new technique to associate a spanning tree to the average linkage cluster analysis. We term this tree the Average Linkage Minimum Spanning Tree. We also introduce a technique to associate a value of reliability to links of correlation based graphs by using bootstrap replicas of data. Both techniques are applied to the portfolio of the 300 most capitalized stocks traded at the New York Stock Exchange during the time period 2001-2003. We show that the Average Linkage Minimum Spanning Tree recognizes economic sectors and sub-sectors as communities in the network slightly better than the Minimum Spanning Tree does. We also show that the average reliability of links in the Minimum Spanning Tree is slightly greater than the average reliability of links in the Average Linkage Minimum Spanning Tree.
Submitted 15 May, 2006;
originally announced May 2006.
-
Correlation filtering in financial time series
Authors:
T. Aste,
T. Di Matteo,
M. Tumminello,
R. N. Mantegna
Abstract:
We apply a method to filter relevant information from the correlation coefficient matrix by extracting a network of relevant interactions. This method succeeds in generating networks with the same hierarchical structure as the Minimum Spanning Tree but containing a larger number of links, resulting in a richer network topology that allows loops and cliques. In Tumminello et al. [TumminielloPNAS05], we have shown that this method, applied to a financial portfolio of 100 stocks in the USA equity markets, is quite efficient in filtering relevant information about the clustering of the system and its hierarchical structure, both on the whole system and within each cluster. In particular, we have found that triangular loops and 4 element cliques have important and significant relations with the market structure and properties. Here we apply this filtering procedure to the analysis of correlation in two different kinds of interest rate time series (16 Eurodollars and 34 US interest rates).
Submitted 17 August, 2005;
originally announced August 2005.
-
Sector identification in a set of stock return time series traded at the London Stock Exchange
Authors:
C. Coronnello,
M. Tumminello,
F. Lillo,
S. Miccichè,
R. N. Mantegna
Abstract:
We compare some methods recently used in the literature to detect the existence of a certain degree of common behavior of stock returns belonging to the same economic sector. Specifically, we discuss methods based on random matrix theory and hierarchical clustering techniques. We apply these methods to a portfolio of stocks traded at the London Stock Exchange. The investigated time series are recorded both at a daily time horizon and at a 5-minute time horizon. The correlation coefficient matrix is very different at different time horizons, confirming that more structured correlation coefficient matrices are observed for long time horizons. All the considered methods are able to detect economic information and the presence of clusters characterized by the economic sector of stocks. However, different methods present a different degree of sensitivity with respect to different sectors. Our comparative analysis suggests that the application of just a single method may not be able to extract all the economic information present in the correlation coefficient matrix of a stock portfolio.
Submitted 4 August, 2005;
originally announced August 2005.
-
Scaling and data collapse for the mean exit time of asset prices
Authors:
Miquel Montero,
Josep Perello,
Jaume Masoliver,
Fabrizio Lillo,
Salvatore Miccichè,
Rosario N. Mantegna
Abstract:
We study theoretical and empirical aspects of the mean exit time (MET) of financial time series. The theoretical modeling is done within the framework of continuous time random walk. We empirically verify that the mean exit time follows a quadratic scaling law and has an associated pre-factor which is specific to the analyzed stock. We perform a series of statistical tests to determine which kinds of correlation are responsible for this specificity. The main contribution is associated with the autocorrelation property of stock returns. We introduce and solve analytically both a two-state and a three-state Markov chain model. The analytical results obtained with the two-state Markov chain model allow us to obtain a data collapse of the 20 measured MET profiles in a single master curve.
Submitted 6 July, 2005;
originally announced July 2005.
-
Cluster analysis for portfolio optimization
Authors:
Vincenzo Tola,
Fabrizio Lillo,
Mauro Gallegati,
Rosario N. Mantegna
Abstract:
We consider the problem of the statistical uncertainty of the correlation matrix in the optimization of a financial portfolio. We show that the use of clustering algorithms can improve the reliability of the portfolio in terms of the ratio between predicted and realized risk. Bootstrap analysis indicates that this improvement is obtained in a wide range of the parameters N (number of assets) and T (investment horizon). The predicted and realized risk level and the relative portfolio composition of the selected portfolio for a given value of the portfolio return are also investigated for each considered filtering method.
Submitted 1 July, 2005;
originally announced July 2005.
-
A tool for filtering information in complex systems
Authors:
M. Tumminello,
T. Aste,
T. Di Matteo,
R. N. Mantegna
Abstract:
We introduce a technique to filter complex data sets by extracting a subgraph of representative links. This filtering can be tuned to any desired level by controlling the genus of the resulting graph. We show that this technique is especially suitable for correlation-based graphs, giving filtered graphs which preserve the hierarchical organization of the minimum spanning tree but contain a larger amount of information in their internal structure. In particular, in the case of planar filtered graphs (genus equal to 0), triangular loops and 4-element cliques are formed. The application of this filtering procedure to 100 stocks in the USA equity markets shows that such loops and cliques have important and significant relations with the market structure and properties.
Submitted 3 August, 2005; v1 submitted 14 January, 2005;
originally announced January 2005.
-
Degree stability of a minimum spanning tree of price return and volatility
Authors:
Salvatore Miccichè,
Giovanni Bonanno,
Fabrizio Lillo,
Rosario N. Mantegna
Abstract:
We investigate the time series of the degree of minimum spanning trees obtained by using a correlation-based clustering procedure starting from (i) asset return and (ii) volatility time series. The minimum spanning tree is obtained at different times by computing the correlation among time series over a time window of fixed length $T$. We find that the minimum spanning tree of asset returns is characterized by stock degree values which are more stable in time than the ones obtained by analyzing a minimum spanning tree computed from volatility time series. Our analysis also shows that the degree of stocks has very slow dynamics, with a time scale of several years in both cases.
Submitted 14 December, 2002;
originally announced December 2002.