-
Dimensionality reduction techniques to support insider trading detection
Authors:
Adele Ravagnani,
Fabrizio Lillo,
Paola Deriu,
Piero Mazzarisi,
Francesca Medda,
Antonio Russo
Abstract:
Identification of market abuse is an extremely complicated activity that requires the analysis of large and complex datasets. We propose an unsupervised machine learning method for contextual anomaly detection, which allows to support market surveillance aimed at identifying potential insider trading activities. This method lies in the reconstruction-based paradigm and employs principal component…
▽ More
Identification of market abuse is an extremely complicated activity that requires the analysis of large and complex datasets. We propose an unsupervised machine learning method for contextual anomaly detection, which allows to support market surveillance aimed at identifying potential insider trading activities. This method lies in the reconstruction-based paradigm and employs principal component analysis and autoencoders as dimensionality reduction techniques. The only input of this method is the trading position of each investor active on the asset for which we have a price sensitive event (PSE). After determining reconstruction errors related to the trading profiles, several conditions are imposed in order to identify investors whose behavior could be suspicious of insider trading related to the PSE. As a case study, we apply our method to investor resolved data of Italian stocks around takeover bids.
△ Less
Submitted 8 May, 2024; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Interbank network reconstruction enforcing density and reciprocity
Authors:
Valentina Macchiati,
Piero Mazzarisi,
Diego Garlaschelli
Abstract:
Networks of financial exposures are the key propagators of risk and distress among banks, but their empirical structure is not publicly available because of confidentiality. This limitation has triggered the development of methods of network reconstruction from partial, aggregate information. Unfortunately, even the best methods available fail in replicating the number of directed cycles, which on…
▽ More
Networks of financial exposures are the key propagators of risk and distress among banks, but their empirical structure is not publicly available because of confidentiality. This limitation has triggered the development of methods of network reconstruction from partial, aggregate information. Unfortunately, even the best methods available fail in replicating the number of directed cycles, which on the other hand play a crucial role in determining graph spectra and hence the degree of network stability and systemic risk. Here we address this challenge by exploiting the hypothesis that the statistics of higher-order cycles is strongly constrained by that of the shortest ones, i.e. by the amount of dyads with reciprocated links. First, we provide a detailed analysis of link reciprocity on the e-MID dataset of Italian banks, finding that correlations between reciprocal links systematically increase with the temporal resolution, typically changing from negative to positive around a timescale of up to 50 days. Then, we propose a new network reconstruction method capable of enforcing, only from the knowledge of aggregate interbank assets and liabilities, both a desired sparsity and a desired link reciprocity. We confirm that the addition of reciprocity dramatically improves the prediction of several structural and spectral network properties, including the largest real eigenvalue and the eccentricity of the elliptical distribution of the other eigenvalues in the complex plane. These results illustrate the importance of correctly addressing the temporal resolution and the resulting level of reciprocity in the reconstruction of financial networks.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Online Learning of Order Flow and Market Impact with Bayesian Change-Point Detection Methods
Authors:
Ioanna-Yvonni Tsaknaki,
Fabrizio Lillo,
Piero Mazzarisi
Abstract:
Financial order flow exhibits a remarkable level of persistence, wherein buy (sell) trades are often followed by subsequent buy (sell) trades over extended periods. This persistence can be attributed to the division and gradual execution of large orders. Consequently, distinct order flow regimes might emerge, which can be identified through suitable time series models applied to market data. In th…
▽ More
Financial order flow exhibits a remarkable level of persistence, wherein buy (sell) trades are often followed by subsequent buy (sell) trades over extended periods. This persistence can be attributed to the division and gradual execution of large orders. Consequently, distinct order flow regimes might emerge, which can be identified through suitable time series models applied to market data. In this paper, we propose the use of Bayesian online change-point detection (BOCPD) methods to identify regime shifts in real-time and enable online predictions of order flow and market impact. To enhance the effectiveness of our approach, we have developed a novel BOCPD method using a score-driven approach. This method accommodates temporal correlations and time-varying parameters within each regime. Through empirical application to NASDAQ data, we have found that: (i) Our newly proposed model demonstrates superior out-of-sample predictive performance compared to existing models that assume i.i.d. behavior within each regime; (ii) When examining the residuals, our model demonstrates good specification in terms of both distributional assumptions and temporal correlations; (iii) Within a given regime, the price dynamics exhibit a concave relationship with respect to time and volume, mirroring the characteristics of actual large orders; (iv) By incorporating regime information, our model produces more accurate online predictions of order flow and market impact compared to models that do not consider regimes.
△ Less
Submitted 2 May, 2024; v1 submitted 5 July, 2023;
originally announced July 2023.
-
A machine learning approach to support decision in insider trading detection
Authors:
Piero Mazzarisi,
Adele Ravagnani,
Paola Deriu,
Fabrizio Lillo,
Francesca Medda,
Antonio Russo
Abstract:
Identifying market abuse activity from data on investors' trading activity is very challenging both for the data volume and for the low signal to noise ratio. Here we propose two complementary unsupervised machine learning methods to support market surveillance aimed at identifying potential insider trading activities. The first one uses clustering to identify, in the vicinity of a price sensitive…
▽ More
Identifying market abuse activity from data on investors' trading activity is very challenging both for the data volume and for the low signal to noise ratio. Here we propose two complementary unsupervised machine learning methods to support market surveillance aimed at identifying potential insider trading activities. The first one uses clustering to identify, in the vicinity of a price sensitive event such as a takeover bid, discontinuities in the trading activity of an investor with respect to his/her own past trading history and on the present trading activity of his/her peers. The second unsupervised approach aims at identifying (small) groups of investors that act coherently around price sensitive events, pointing to potential insider rings, i.e. a group of synchronised traders displaying strong directional trading in rewarding position in a period before the price sensitive event. As a case study, we apply our methods to investor resolved data of Italian stocks around takeover bids.
△ Less
Submitted 6 December, 2022;
originally announced December 2022.
-
Variance of entropy for testing time-varying regimes with an application to meme stocks
Authors:
Andrey Shternshis,
Piero Mazzarisi
Abstract:
Shannon entropy is the most common metric to measure the degree of randomness of time series in many fields, ranging from physics and finance to medicine and biology. Real-world systems may be in general non stationary, with an entropy value that is not constant in time. The goal of this paper is to propose a hypothesis testing procedure to test the null hypothesis of constant Shannon entropy for…
▽ More
Shannon entropy is the most common metric to measure the degree of randomness of time series in many fields, ranging from physics and finance to medicine and biology. Real-world systems may be in general non stationary, with an entropy value that is not constant in time. The goal of this paper is to propose a hypothesis testing procedure to test the null hypothesis of constant Shannon entropy for time series, against the alternative of a significant variation of the entropy between two subsequent periods. To this end, we find an unbiased approximation of the variance of the Shannon entropy's estimator, up to the order O(n^(-4)) with n the sample size. In order to characterize the variance of the estimator, we first obtain the explicit formulas of the central moments for both the binomial and the multinomial distributions, which describe the distribution of the Shannon entropy. Second, we find the optimal length of the rolling window used for estimating the time-varying Shannon entropy by optimizing a novel self-consistent criterion based on the counting of significant variations of entropy within a time window. We corroborate our findings by using the novel methodology to test for time-varying regimes of entropy for stock price dynamics, in particular considering the case of meme stocks in 2020 and 2021. We empirically show the existence of periods of market inefficiency for meme stocks. In particular, sharp increases of prices and trading volumes correspond to statistically significant drops of Shannon entropy.
△ Less
Submitted 7 June, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Network-wide assessment of ATM mechanisms using an agent-based model
Authors:
Luis Delgado,
Gérald Gurtner,
Piero Mazzarisi,
Silvia Zaoli,
Damir Valput,
Andrew Cook,
Fabrizio Lillo
Abstract:
This paper presents results from the SESAR ER3 Domino project. Three mechanisms are assessed at the ECAC-wide level: 4D trajectory adjustments (a combination of actively waiting for connecting passengers and dynamic cost indexing), flight prioritisation (enabling ATFM slot swap** at arrival regulations), and flight arrival coordination (where flights are sequenced in extended arrival managers ba…
▽ More
This paper presents results from the SESAR ER3 Domino project. Three mechanisms are assessed at the ECAC-wide level: 4D trajectory adjustments (a combination of actively waiting for connecting passengers and dynamic cost indexing), flight prioritisation (enabling ATFM slot swap** at arrival regulations), and flight arrival coordination (where flights are sequenced in extended arrival managers based on an advanced cost-driven optimisation). Classical and new metrics, designed to capture network effects, are used to analyse the results of a micro-level agent-based model. A scenario with congestion at three hubs is used to assess the 4D trajectory adjustment and the flight prioritisation mechanisms. Two different scopes for the extended arrival manager are modelled to analyse the impact of the flight arrival coordination mechanism. Results show that the 4D trajectory adjustments mechanism succeeds in reducing costs and delays for connecting passengers. A trade-off between the interests of the airlines in reducing costs and those of non-connecting passengers emerges, although passengers benefit overall from the mechanism. Flight prioritisation is found to have no significant effects at the network level, as it is applied to a small number of flights. Advanced flight arrival coordination, as implemented, increases delays and costs in the system. The arrival manager optimises the arrival sequence of all flights within its scope but does not consider flight uncertainties, thus leading to sub-optimal actions.
△ Less
Submitted 20 September, 2022;
originally announced September 2022.
-
How Covid mobility restrictions modified the population of investors in Italian stock markets
Authors:
Paola Deriu,
Fabrizio Lillo,
Piero Mazzarisi,
Francesca Medda,
Adele Ravagnani,
Antonio Russo
Abstract:
This paper investigates how Covid mobility restrictions impacted the population of investors of the Italian stock market. The analysis tracks the trading activity of individual investors in Italian stocks in the period January 2019-September 2021, investigating how their composition and the trading activity changed around the Covid-19 lockdown period (March 9 - May 19, 2020) and more generally in…
▽ More
This paper investigates how Covid mobility restrictions impacted the population of investors of the Italian stock market. The analysis tracks the trading activity of individual investors in Italian stocks in the period January 2019-September 2021, investigating how their composition and the trading activity changed around the Covid-19 lockdown period (March 9 - May 19, 2020) and more generally in the period of the pandemic. The results pinpoint that the lockdown restriction was accompanied by a surge in interest toward stock market, as testified by the trading volume by households. Given the generically falling prices during the lockdown, the households, which are typically contrarian, were net buyers, even if less than expected from their trading activity in 2019. This can be explained by the arrival, during the lockdown, of a group of about 185k new investors (i.e. which had never traded since January 2019) which were on average ten year younger and with a larger fraction of males than the pre-lockdown investors. By looking at the gross P&L, there is clear evidence that these new investors were more skilled in trading. There are thus indications that the lockdown, and more generally the Covid pandemic, created a sort of regime change in the population of financial investors.
△ Less
Submitted 30 July, 2022;
originally announced August 2022.
-
Efficiency of the Moscow Stock Exchange before 2022
Authors:
Andrey Shternshis,
Piero Mazzarisi,
Stefano Marmi
Abstract:
This paper investigates the degree of efficiency for the Moscow Stock Exchange. A market is called efficient if prices of its assets fully reflect all available information. We show that the degree of market efficiency is significantly low for most of the months from 2012 to 2021. We calculate the degree of market efficiency by (i) filtering out regularities in financial data and (ii) computing th…
▽ More
This paper investigates the degree of efficiency for the Moscow Stock Exchange. A market is called efficient if prices of its assets fully reflect all available information. We show that the degree of market efficiency is significantly low for most of the months from 2012 to 2021. We calculate the degree of market efficiency by (i) filtering out regularities in financial data and (ii) computing the Shannon entropy of the filtered return time series. We have developed a simple method for estimating volatility and price staleness in empirical data, in order to filter out such regularity patterns from return time series. The resulting financial time series of stocks' returns are then clustered into different groups according to some entropy measures. In particular, we use the Kullback-Leibler distance and a novel entropy metric capturing the co-movements between pairs of stocks. By using Monte Carlo simulations, we are then able to identify the time periods of market inefficiency for a group of 18 stocks. The inefficiency of the Moscow Stock Exchange that we have detected is a signal of the possibility of devising profitable strategies, net of transaction costs. The deviation from the efficient behavior for a stock strongly depends on the industrial sector it belongs.
△ Less
Submitted 25 July, 2022; v1 submitted 21 July, 2022;
originally announced July 2022.
-
On the equivalence between the Kinetic Ising Model and discrete autoregressive processes
Authors:
Carlo Campajola,
Fabrizio Lillo,
Piero Mazzarisi,
Daniele Tantari
Abstract:
Binary random variables are the building blocks used to describe a large variety of systems, from magnetic spins to financial time series and neuron activity. In Statistical Physics the Kinetic Ising Model has been introduced to describe the dynamics of the magnetic moments of a spin lattice, while in time series analysis discrete autoregressive processes have been designed to capture the multivar…
▽ More
Binary random variables are the building blocks used to describe a large variety of systems, from magnetic spins to financial time series and neuron activity. In Statistical Physics the Kinetic Ising Model has been introduced to describe the dynamics of the magnetic moments of a spin lattice, while in time series analysis discrete autoregressive processes have been designed to capture the multivariate dependence structure across binary time series. In this article we provide a rigorous proof of the equivalence between the two models in the range of a unique and invertible map unambiguously linking one model parameters set to the other. Our result finds further justification acknowledging that both models provide maximum entropy distributions of binary time series with given means, auto-correlations, and lagged cross-correlations of order one. We further show that the equivalence between the two models permits to exploit the inference methods originally developed for one model in the inference of the other.
△ Less
Submitted 8 February, 2021; v1 submitted 24 August, 2020;
originally announced August 2020.
-
Tail Granger causalities and where to find them: extreme risk spillovers vs. spurious linkages
Authors:
Piero Mazzarisi,
Silvia Zaoli,
Carlo Campajola,
Fabrizio Lillo
Abstract:
Identifying risk spillovers in financial markets is of great importance for assessing systemic risk and portfolio management. Granger causality in tail (or in risk) tests whether past extreme events of a time series help predicting future extreme events of another time series. The topology and connectedness of networks built with Granger causality in tail can be used to measure systemic risk and t…
▽ More
Identifying risk spillovers in financial markets is of great importance for assessing systemic risk and portfolio management. Granger causality in tail (or in risk) tests whether past extreme events of a time series help predicting future extreme events of another time series. The topology and connectedness of networks built with Granger causality in tail can be used to measure systemic risk and to identify risk transmitters. Here we introduce a novel test of Granger causality in tail which adopts the likelihood ratio statistic and is based on the multivariate generalization of a discrete autoregressive process for binary time series describing the sequence of extreme events of the underlying price dynamics. The proposed test has very good size and power in finite samples, especially for large sample size, allows inferring the correct time scale at which the causal interaction takes place, and it is flexible enough for multivariate extension when more than two time series are considered in order to decrease false detections as spurious effect of neglected variables. An extensive simulation study shows the performances of the proposed method with a large variety of data generating processes and it introduces also the comparison with the test of Granger causality in tail by [Hong et al., 2009]. We report both advantages and drawbacks of the different approaches, pointing out some crucial aspects related to the false detections of Granger causality for tail events. An empirical application to high frequency data of a portfolio of US stocks highlights the merits of our novel approach.
△ Less
Submitted 6 May, 2021; v1 submitted 3 May, 2020;
originally announced May 2020.
-
Betweenness centrality for temporal multiplexes
Authors:
Silvia Zaoli,
Piero Mazzarisi,
Fabrizio Lillo
Abstract:
Betweenness centrality quantifies the importance of a vertex for the information flow in a network. We propose a flexible definition of betweenness for temporal multiplexes, where geodesics are determined accounting for the topological and temporal structure and the duration of paths. We propose an algorithm to compute the new metric via a map** to a static graph. We show the importance of consi…
▽ More
Betweenness centrality quantifies the importance of a vertex for the information flow in a network. We propose a flexible definition of betweenness for temporal multiplexes, where geodesics are determined accounting for the topological and temporal structure and the duration of paths. We propose an algorithm to compute the new metric via a map** to a static graph. We show the importance of considering the temporal multiplex structure and an appropriate distance metric comparing the results with those obtained with static or single-layer metrics on a dataset of $\sim 20$k European flights.
△ Less
Submitted 3 February, 2020;
originally announced February 2020.
-
New centrality and causality metrics assessing air traffic network interactions
Authors:
Piero Mazzarisi,
Silvia Zaoli,
Fabrizio Lillo,
Luis Delgado,
Gérald Gurtner
Abstract:
In ATM systems, the massive number of interacting entities makes it difficult to identify critical elements and paths of disturbance propagation, as well as to predict the system-wide effects that innovations might have. To this end, suitable metrics are required to assess the role of the interconnections between the elements and complex network science provides several network metrics to evaluate…
▽ More
In ATM systems, the massive number of interacting entities makes it difficult to identify critical elements and paths of disturbance propagation, as well as to predict the system-wide effects that innovations might have. To this end, suitable metrics are required to assess the role of the interconnections between the elements and complex network science provides several network metrics to evaluate the network functioning. Here we focus on centrality and causality metrics measuring, respectively, the importance of a node and the propagation of disturbances along links. By investigating a dataset of US flights, we show that existing centrality and causality metrics are not suited to characterise the effect of delays in the system. We then propose generalisations of such metrics that we prove suited to ATM applications. Specifically, the new centrality is able to account for the temporal and multi-layer structure of ATM network, while the new causality metric focuses on the propagation of extreme events along the system.
△ Less
Submitted 6 May, 2021; v1 submitted 6 November, 2019;
originally announced November 2019.
-
Non-Markovian temporal networks with auto- and cross-correlated link dynamics
Authors:
Oliver E. Williams,
Piero Mazzarisi,
Fabrizio Lillo,
Vito Latora
Abstract:
Many of the biological, social and man-made networks around us are inherently dynamic, with their links switching on and off over time. The evolution of these networks is often non-Markovian, and the dynamics of their links correlated. Hence, to accurately model these networks, predict their evolution, and understand how information and other quantities propagate over them, the inclusion of both m…
▽ More
Many of the biological, social and man-made networks around us are inherently dynamic, with their links switching on and off over time. The evolution of these networks is often non-Markovian, and the dynamics of their links correlated. Hence, to accurately model these networks, predict their evolution, and understand how information and other quantities propagate over them, the inclusion of both memory and dynamical dependencies between links is key. We here introduce a general class of models of temporal networks based on discrete autoregressive processes. As a case study we concentrate on a specific model within this class, generating temporal networks with a specified underlying backbone, and with precise control over the dynamical dependencies between links and the strength and length of their memories. In this network model the presence of each link is influenced by its own past activity and the past activities of other links, as specified by a coupling matrix, which directly controls the causal relations and correlations among links. We propose a method for estimating the models parameters and how to deal with heterogeneity and time-varying patterns, showing how the model allows for a more realistic description of real world temporal networks and also to predict their evolution. We then investigate the role that memory and correlations in link dynamics have on processes occurring over a temporal network by studying the speed of a spreading process, as measured by the time it takes for diffusion to reach equilibrium. Through both numerical simulations and analytical results, we are able to separate the roles of autocorrelations and neighbourhood correlations in link dynamics, showing that the speed of diffusion is non-monotonically dependent on the memory length, and that correlations among neighbouring links can speed up the spreading process, while autocorrelations slow it down.
△ Less
Submitted 22 July, 2021; v1 submitted 17 September, 2019;
originally announced September 2019.
-
Trip Centrality: walking on a temporal multiplex with non-instantaneous link travel time
Authors:
Silvia Zaoli,
Piero Mazzarisi,
Fabrizio Lillo
Abstract:
In complex networks, centrality metrics quantify the connectivity of nodes and identify the most important ones in the transmission of signals. In many real world networks, especially in transportation systems, links are dynamic, i.e. their presence depends on time, and travelling between two nodes requires a non-vanishing time. Additionally, many networks are structured on several layers, represe…
▽ More
In complex networks, centrality metrics quantify the connectivity of nodes and identify the most important ones in the transmission of signals. In many real world networks, especially in transportation systems, links are dynamic, i.e. their presence depends on time, and travelling between two nodes requires a non-vanishing time. Additionally, many networks are structured on several layers, representing, e.g., different transportation modes or service providers. Temporal generalisations of centrality metrics based on walk-counting, like Katz centrality, exist, however they do not account for non-zero link travel times and for the multiplex structure. We propose a generalisation of Katz centrality, termed Trip Centrality, counting only the paths that can be travelled according to the network temporal structure, i.e. "trips", while also differentiating the contributions of inter- and intra-layer walks to centrality. We show an application to the US air transport system, specifically computing airports' centrality losses due to delays in the flight network.
△ Less
Submitted 7 March, 2019;
originally announced March 2019.
-
Detectability of Macroscopic Structures in Directed Asymmetric Stochastic Block Model
Authors:
Mateusz Wilinski,
Piero Mazzarisi,
Daniele Tantari,
Fabrizio Lillo
Abstract:
We study the problem of identifying macroscopic structures in networks, characterizing the impact of introducing link directions on the detectability phase transition. To this end, building on the stochastic block model, we construct a class of hardly detectable directed networks. We find closed form solutions by using belief propagation method showing how the transition line depends on the assort…
▽ More
We study the problem of identifying macroscopic structures in networks, characterizing the impact of introducing link directions on the detectability phase transition. To this end, building on the stochastic block model, we construct a class of hardly detectable directed networks. We find closed form solutions by using belief propagation method showing how the transition line depends on the assortativity and the asymmetry of the network. Finally, we numerically identify the existence of a hard phase for detection close to the transition point.
△ Less
Submitted 24 February, 2019; v1 submitted 10 November, 2018;
originally announced November 2018.
-
When panic makes you blind: a chaotic route to systemic risk
Authors:
Piero Mazzarisi,
Fabrizio Lillo,
Stefano Marmi
Abstract:
We present an analytical model to study the role of expectation feedbacks and overlap** portfolios on systemic stability of financial systems. Building on [Corsi et al., 2016], we model a set of financial institutions having Value at Risk capital requirements and investing in a portfolio of risky assets, whose prices evolve stochastically in time and are endogenously driven by the trading decisi…
▽ More
We present an analytical model to study the role of expectation feedbacks and overlap** portfolios on systemic stability of financial systems. Building on [Corsi et al., 2016], we model a set of financial institutions having Value at Risk capital requirements and investing in a portfolio of risky assets, whose prices evolve stochastically in time and are endogenously driven by the trading decisions of financial institutions. Assuming that they use adaptive expectations of risk, we show that the evolution of the system is described by a slow-fast random dynamical system, which can be studied analytically in some regimes. The model shows how the risk expectations play a central role in determining the systemic stability of the financial system and how wrong risk expectations may create panic-induced reduction or over-optimistic expansion of balance sheets. Specifically, when investors are myopic in estimating the risk, the fixed point equilibrium of the system breaks into leverage cycles and financial variables display a bifurcation cascade eventually leading to chaos. We discuss the role of financial policy and the effects of some market frictions, as the cost of diversification and financial transaction taxes, in determining the stability of the system in the presence of adaptive expectations of risk.
△ Less
Submitted 2 May, 2018;
originally announced May 2018.
-
A dynamic network model with persistent links and node-specific latent variables, with an application to the interbank market
Authors:
Piero Mazzarisi,
Paolo Barucca,
Fabrizio Lillo,
Daniele Tantari
Abstract:
We propose a dynamic network model where two mechanisms control the probability of a link between two nodes: (i) the existence or absence of this link in the past, and (ii) node-specific latent variables (dynamic fitnesses) describing the propensity of each node to create links. Assuming a Markov dynamics for both mechanisms, we propose an Expectation-Maximization algorithm for model estimation an…
▽ More
We propose a dynamic network model where two mechanisms control the probability of a link between two nodes: (i) the existence or absence of this link in the past, and (ii) node-specific latent variables (dynamic fitnesses) describing the propensity of each node to create links. Assuming a Markov dynamics for both mechanisms, we propose an Expectation-Maximization algorithm for model estimation and inference of the latent variables. The estimated parameters and fitnesses can be used to forecast the presence of a link in the future. We apply our methodology to the e-MID interbank network for which the two linkage mechanisms are associated with two different trading behaviors in the process of network formation, namely preferential trading and trading driven by node-specific characteristics. The empirical results allow to recognise preferential lending in the interbank market and indicate how a method that does not account for time-varying network topologies tends to overestimate preferential linkage.
△ Less
Submitted 30 December, 2017;
originally announced January 2018.
-
Disentangling group and link persistence in Dynamic Stochastic Block models
Authors:
Paolo Barucca,
Fabrizio Lillo,
Piero Mazzarisi,
Daniele Tantari
Abstract:
We study the inference of a model of dynamic networks in which both communities and links keep memory of previous network states. By considering maximum likelihood inference from single snapshot observations of the network, we show that link persistence makes the inference of communities harder, decreasing the detectability threshold, while community persistence tends to make it easier. We analyti…
▽ More
We study the inference of a model of dynamic networks in which both communities and links keep memory of previous network states. By considering maximum likelihood inference from single snapshot observations of the network, we show that link persistence makes the inference of communities harder, decreasing the detectability threshold, while community persistence tends to make it easier. We analytically show that communities inferred from single network snapshot can share a maximum overlap with the underlying communities of a specific previous instant in time. This leads to time-lagged inference: the identification of past communities rather than present ones. Finally we compute the time lag and propose a corrected algorithm, the Lagged Snapshot Dynamic (LSD) algorithm, for community detection in dynamic networks. We analytically and numerically characterize the detectability transitions of such algorithm as a function of the memory parameters of the model and we make a comparison with a full dynamic inference.
△ Less
Submitted 19 December, 2018; v1 submitted 20 January, 2017;
originally announced January 2017.