Search | arXiv e-print repository

Longitudinal market structure detection using a dynamic modularity-spectral algorithm

Authors: Philipp Wirth, Francesca Medda, Thomas Schröder

Abstract: In this paper, we introduce the Dynamic Modularity-Spectral Algorithm (DynMSA), a novel approach to identify clusters of stocks with high intra-cluster correlations and low inter-cluster correlations by combining Random Matrix Theory with modularity optimisation and spectral clustering. The primary objective is to uncover hidden market structures and find diversifiers based on return correlations,… ▽ More In this paper, we introduce the Dynamic Modularity-Spectral Algorithm (DynMSA), a novel approach to identify clusters of stocks with high intra-cluster correlations and low inter-cluster correlations by combining Random Matrix Theory with modularity optimisation and spectral clustering. The primary objective is to uncover hidden market structures and find diversifiers based on return correlations, thereby achieving a more effective risk-reducing portfolio allocation. We applied DynMSA to constituents of the S&P 500 and compared the results to sector- and market-based benchmarks. Besides the conception of this algorithm, our contributions further include implementing a sector-based calibration for modularity optimisation and a correlation-based distance function for spectral clustering. Testing revealed that DynMSA outperforms baseline models in intra- and inter-cluster correlation differences, particularly over medium-term correlation look-backs. It also identifies stable clusters and detects regime changes due to exogenous shocks, such as the COVID-19 pandemic. Portfolios constructed using our clusters showed higher Sortino and Sharpe ratios, lower downside volatility, reduced maximum drawdown and higher annualised returns compared to an equally weighted market benchmark. △ Less

Submitted 5 July, 2024; originally announced July 2024.

arXiv:2403.00707 [pdf, other]

Dimensionality reduction techniques to support insider trading detection

Authors: Adele Ravagnani, Fabrizio Lillo, Paola Deriu, Piero Mazzarisi, Francesca Medda, Antonio Russo

Abstract: Identification of market abuse is an extremely complicated activity that requires the analysis of large and complex datasets. We propose an unsupervised machine learning method for contextual anomaly detection, which allows to support market surveillance aimed at identifying potential insider trading activities. This method lies in the reconstruction-based paradigm and employs principal component… ▽ More Identification of market abuse is an extremely complicated activity that requires the analysis of large and complex datasets. We propose an unsupervised machine learning method for contextual anomaly detection, which allows to support market surveillance aimed at identifying potential insider trading activities. This method lies in the reconstruction-based paradigm and employs principal component analysis and autoencoders as dimensionality reduction techniques. The only input of this method is the trading position of each investor active on the asset for which we have a price sensitive event (PSE). After determining reconstruction errors related to the trading profiles, several conditions are imposed in order to identify investors whose behavior could be suspicious of insider trading related to the PSE. As a case study, we apply our method to investor resolved data of Italian stocks around takeover bids. △ Less

Submitted 8 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2212.05912 [pdf, other]

A machine learning approach to support decision in insider trading detection

Authors: Piero Mazzarisi, Adele Ravagnani, Paola Deriu, Fabrizio Lillo, Francesca Medda, Antonio Russo

Abstract: Identifying market abuse activity from data on investors' trading activity is very challenging both for the data volume and for the low signal to noise ratio. Here we propose two complementary unsupervised machine learning methods to support market surveillance aimed at identifying potential insider trading activities. The first one uses clustering to identify, in the vicinity of a price sensitive… ▽ More Identifying market abuse activity from data on investors' trading activity is very challenging both for the data volume and for the low signal to noise ratio. Here we propose two complementary unsupervised machine learning methods to support market surveillance aimed at identifying potential insider trading activities. The first one uses clustering to identify, in the vicinity of a price sensitive event such as a takeover bid, discontinuities in the trading activity of an investor with respect to his/her own past trading history and on the present trading activity of his/her peers. The second unsupervised approach aims at identifying (small) groups of investors that act coherently around price sensitive events, pointing to potential insider rings, i.e. a group of synchronised traders displaying strong directional trading in rewarding position in a period before the price sensitive event. As a case study, we apply our methods to investor resolved data of Italian stocks around takeover bids. △ Less

Submitted 6 December, 2022; originally announced December 2022.

Comments: 42 pages, 16 Figures

MSC Class: 62H30

arXiv:2208.00181 [pdf, other]

How Covid mobility restrictions modified the population of investors in Italian stock markets

Authors: Paola Deriu, Fabrizio Lillo, Piero Mazzarisi, Francesca Medda, Adele Ravagnani, Antonio Russo

Abstract: This paper investigates how Covid mobility restrictions impacted the population of investors of the Italian stock market. The analysis tracks the trading activity of individual investors in Italian stocks in the period January 2019-September 2021, investigating how their composition and the trading activity changed around the Covid-19 lockdown period (March 9 - May 19, 2020) and more generally in… ▽ More This paper investigates how Covid mobility restrictions impacted the population of investors of the Italian stock market. The analysis tracks the trading activity of individual investors in Italian stocks in the period January 2019-September 2021, investigating how their composition and the trading activity changed around the Covid-19 lockdown period (March 9 - May 19, 2020) and more generally in the period of the pandemic. The results pinpoint that the lockdown restriction was accompanied by a surge in interest toward stock market, as testified by the trading volume by households. Given the generically falling prices during the lockdown, the households, which are typically contrarian, were net buyers, even if less than expected from their trading activity in 2019. This can be explained by the arrival, during the lockdown, of a group of about 185k new investors (i.e. which had never traded since January 2019) which were on average ten year younger and with a larger fraction of males than the pre-lockdown investors. By looking at the gross P&L, there is clear evidence that these new investors were more skilled in trading. There are thus indications that the lockdown, and more generally the Covid pandemic, created a sort of regime change in the population of financial investors. △ Less

Submitted 30 July, 2022; originally announced August 2022.

Comments: 25 pages, 16 figures

arXiv:2202.01178 [pdf, other]

Information Extraction through AI techniques: The KIDs use case at CONSOB

Authors: Domenico Lembo, Alessandra Limosani, Francesca Medda, Alessandra Monaco, Federico Maria Scafoglieri

Abstract: In this paper we report on the initial activities carried out within a collaboration between Consob and Sapienza University. We focus on Information Extraction from documents describing financial instruments. We discuss how we automate this task, via both rule-based and machine learning-based methods and provide our first results. In this paper we report on the initial activities carried out within a collaboration between Consob and Sapienza University. We focus on Information Extraction from documents describing financial instruments. We discuss how we automate this task, via both rule-based and machine learning-based methods and provide our first results. △ Less

Submitted 29 January, 2022; originally announced February 2022.

arXiv:2006.04896 [pdf, other]

doi 10.14428/esann/2021.ES2021-18

A Baseline for Shapley Values in MLPs: from Missingness to Neutrality

Authors: Cosimo Izzo, Aldo Lipani, Ramin Okhrati, Francesca Medda

Abstract: Deep neural networks have gained momentum based on their accuracy, but their interpretability is often criticised. As a result, they are labelled as black boxes. In response, several methods have been proposed in the literature to explain their predictions. Among the explanatory methods, Shapley values is a feature attribution method favoured for its robust theoretical foundation. However, the ana… ▽ More Deep neural networks have gained momentum based on their accuracy, but their interpretability is often criticised. As a result, they are labelled as black boxes. In response, several methods have been proposed in the literature to explain their predictions. Among the explanatory methods, Shapley values is a feature attribution method favoured for its robust theoretical foundation. However, the analysis of feature attributions using Shapley values requires choosing a baseline that represents the concept of missingness. An arbitrary choice of baseline could negatively impact the explanatory power of the method and possibly lead to incorrect interpretations. In this paper, we present a method for choosing a baseline according to a neutrality value: as a parameter selected by decision-makers, the point at which their choices are determined by the model predictions being either above or below it. Hence, the proposed baseline is set based on a parameter that depends on the actual use of the model. This procedure stands in contrast to how other baselines are set, i.e. without accounting for how the model is used. We empirically validate our choice of baseline in the context of binary classification tasks, using two datasets: a synthetic dataset and a dataset derived from the financial domain. △ Less

Submitted 9 August, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

Journal ref: ESANN 2021 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning

arXiv:1704.06791 [pdf, other]

doi 10.1142/S0219525916500168,

The effect of heterogeneity on financial contagion due to overlap** portfolios

Authors: Opeoluwa Banwo, Fabio Caccioli, Paul Harrald, Francesca Medda

Abstract: We consider a model of financial contagion in a bipartite network of assets and banks recently introduced in the literature, and we study the effect of power law distributions of degree and balance-sheet size on the stability of the system. Relative to the benchmark case of banks with homogeneous degrees and balance-sheet sizes, we find that if banks have a power-law degree distribution the system… ▽ More We consider a model of financial contagion in a bipartite network of assets and banks recently introduced in the literature, and we study the effect of power law distributions of degree and balance-sheet size on the stability of the system. Relative to the benchmark case of banks with homogeneous degrees and balance-sheet sizes, we find that if banks have a power-law degree distribution the system becomes less robust with respect to the initial failure of a random bank, and that targeted shocks to the most specialised banks (i.e. banks with low degrees) or biggest banks increases the probability of observing a cascade of defaults. In contrast, we find that a power-law degree distribution for assets increases stability with respect to random shocks, but not with respect to targeted shocks. We also study how allocations of capital buffers between banks affects the system's stability, and we find that assigning capital to banks in relation to their level of diversification reduces the probability of observing cascades of defaults relative to size based allocations. Finally, we propose a non-capital based policy that improves the resilience of the system by introducing disassortative mixing between banks and assets. △ Less

Submitted 22 April, 2017; originally announced April 2017.

Comments: 22 pages, 19 figures

MSC Class: 65-05

Journal ref: Advances in Complex Systems 19, 1650016 (2016)

Showing 1–7 of 7 results for author: Medda, F